JP2017175343A

JP2017175343A - Fault analysis device, fault analysis system, fault analysis method, and program for fault analysis

Info

Publication number: JP2017175343A
Application number: JP2016058428A
Authority: JP
Inventors: 明紀日下部; Akinori Kusakabe
Original assignee: NEC Platforms Ltd
Current assignee: NEC Platforms Ltd
Priority date: 2016-03-23
Filing date: 2016-03-23
Publication date: 2017-09-28
Anticipated expiration: 2036-03-23
Also published as: JP6168628B1

Abstract

PROBLEM TO BE SOLVED: To provide a fault analysis device which easily detects the occurrence of a network fault that may hinder voice communication, so as to recover promptly from the fault.

SOLUTION: The fault analysis device includes: RTP packet generation means which generates an RTP packet; DTMF signal addition means which adds a DTMF signal to the RTP packet to be used as a fault detection signal; transmission means which transmits the fault detection signal to another fault analysis device; reception means which receives the fault detection signal returned from the other fault analysis device; problem detection means which detects a problematic module in the fault analysis device on the basis of the DTMF signal included in the fault detection signal received by the reception means; and reboot means which reboots the problematic module detected by the problem detection means.

SELECTED DRAWING: Figure 4

Description

The present invention relates to a failure analysis device, a failure analysis system, a failure analysis method, and a failure analysis program, and more particularly to those used in a communication network.

By the nature of their function, network devices are deployed in a wide range of network environments and communicate with many different kinds of equipment. Consequently, once a failure occurs, isolating its cause takes an enormous amount of time and incurs a very high cost.

Moreover, even when one wants to know what communication was taking place when a failure occurred, a network protocol analyzer is usually not installed. In most cases the only option is to run reproduction tests based on information from the field, reproduce the problem, and then investigate the cause. Even then, the tests often have to be chosen on the basis of uncertain information.

As a result, testing often becomes hit-or-miss, and merely reproducing the phenomenon requires enormous effort. Furthermore, network devices are often used in mission-critical operations such as financial and transportation systems, where rapid recovery from a failure is required. In many cases, however, the system cannot even detect that a failure has occurred, so it keeps operating in the failed state, which severely affects operations and causes serious damage both financially and in terms of reputation.

FIG. 1 shows an example. The gateway devices 10-1 and 10-2 in FIG. 1 accommodate telephone terminals and convert between IP and the telephone terminal protocol under the control of the host device 20.

For example, when the terminal 30-1 goes off-hook and the telephone number of the destination terminal 30-2 is dialed, this information is sent to the host device 20 as an IP control packet. The host device 20 then sends an IP control packet instructing the terminal 30-2 to ring to the gateway device 10-2, which accommodates the telephone corresponding to that number. The gateway device 10-2 receives the control signal and actually rings the telephone. When the terminal 30-2 goes off-hook, that signal is sent to the host device 20, which in response instructs the gateways to set up a call between them. Following this instruction, the gateway device 10-1 negotiates the call with the other gateway and, once negotiation is complete, the gateways exchange RTP voice packets to carry the call.

Because these control packets and voice packets pass through various network devices, they are prone to delay and jitter, and a gateway device often malfunctions when it processes them at a timing it was not designed for.

In this regard, Patent Document 1 discloses an invention that determines the correspondence between an RTP packet transmitted from a center side to a terminal side and a DTMF packet transmitted from the terminal side to the center side in response to that RTP packet.

Patent Document 2 discloses an invention for a system in which a server, a network A, a router, a network B, and a terminal are connected in that order. When an RTP packet is lost in network A, the router inserts an RTP packet that carries the sequence number of the lost packet and describes information about the loss, which makes it possible to determine whether the failure occurred in network A or in network B.

Patent Document 1: JP 2004-166049 A
Patent Document 2: JP 2009-044300 A

However, as noted above, the invention of Patent Document 1 merely determines the correspondence between RTP packets and DTMF packets and does not detect failures. In the invention of Patent Document 2, if a failure in the router itself prevents it from inserting the RTP packet describing the loss, it is not possible to tell whether the failure lies in the router or in network B.

Therefore, in an IP telephone system that carries voice calls over an IP network using VoIP technology, when a network failure disrupts voice calls, failure analysis, root-cause investigation, and failure removal still require enormous effort even if these inventions are used.

Accordingly, an object of the present invention is to provide a failure analysis device that easily detects the occurrence of a network failure that may disrupt voice calls and recovers from the failure quickly.

According to a first aspect of the present invention, there is provided a failure analysis device comprising: RTP packet generation means for generating an RTP packet; DTMF signal addition means for adding a DTMF signal to the RTP packet to form a failure detection signal; transmission means for transmitting the failure detection signal to another failure analysis device; reception means for receiving the failure detection signal returned from the other failure analysis device; problem detection means for detecting a problem module in the failure analysis device on the basis of the DTMF signal included in the failure detection signal received by the reception means; and reboot means for rebooting the problem module detected by the problem detection means.

According to a second aspect of the present invention, there is provided a failure analysis method comprising: an RTP packet generation step of generating an RTP packet; a DTMF signal addition step of adding a DTMF signal to the RTP packet to form a failure detection signal; a transmission step of transmitting the failure detection signal to another failure analysis device; a reception step of receiving the failure detection signal returned from the other failure analysis device; a problem detection step of detecting a problem module in the own failure analysis device on the basis of the DTMF signal included in the received failure detection signal; and a reboot step of rebooting the problem module detected in the problem detection step.

According to a third aspect of the present invention, there is provided a failure analysis program for causing a computer to function as a failure analysis device, the program causing the computer to function as: RTP packet generation means for generating an RTP packet; DTMF signal addition means for adding a DTMF signal to the RTP packet to form a failure detection signal; transmission means for transmitting the failure detection signal to another failure analysis device; reception means for receiving the failure detection signal returned from the other failure analysis device; problem detection means for detecting a problem module in the failure analysis device on the basis of the DTMF signal included in the failure detection signal received by the reception means; and reboot means for rebooting the problem module detected by the problem detection means.

According to the present invention, easy detection of failures and autonomous recovery from them make it possible to minimize the impact of a failure in the operating environment.

FIG. 1 is a diagram showing a conventional example of a system composed of gateway devices and a host device connected to them.
FIG. 2 is a conceptual diagram showing the failure detection mechanism according to an embodiment of the present invention.
FIG. 3 is a flow diagram showing the failure detection method according to the embodiment of the present invention.
FIG. 4 is a block diagram showing the configuration of the failure analysis device according to the embodiment of the present invention.
FIG. 5 is a flowchart showing the operation of the failure analysis device according to the embodiment of the present invention.
FIG. 6 is a conceptual diagram of the external signal recording performed by the failure analysis device according to the embodiment of the present invention.
FIG. 7 is a conceptual diagram of the internal state recording performed by the failure analysis device according to the embodiment of the present invention.
FIG. 8 is a block diagram showing the configuration of the failure analysis system according to the embodiment of the present invention.
FIG. 9 is a diagram showing a reproduction test method that uses data from the failure analysis system according to the embodiment of the present invention.
FIG. 10 is a diagram showing an operation example of the failure analysis system according to the embodiment of the present invention.

(Failure detection and recovery by the failure analysis device)
FIG. 2 is a conceptual diagram of the failure detection mechanism.

As shown in FIG. 2, a packet carrying control information exchanged between a gateway device and the host device, or between gateway devices, begins with header information consisting of a sequence number (No. xx), a checksum (CheckSum), and a transmission time (Time), followed by Data, the actual control information.

The gateway device diagnoses the state of the network from this information. For example, if a sequence number is missing, it determines that packet loss has occurred; if the CheckSum is wrong, it determines that an abnormality such as noise has occurred in the network.
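The header-based diagnosis described above can be sketched as follows. This is only an illustration under stated assumptions: the patent does not specify the checksum algorithm or field sizes, so CRC-32 from Python's `zlib` stands in for CheckSum, and the names `ControlPacket`, `make_packet`, and `diagnose` are hypothetical.

```python
import zlib
from dataclasses import dataclass


@dataclass
class ControlPacket:
    seq: int         # sequence number (No. xx)
    checksum: int    # CheckSum over the data (CRC-32 is an assumption)
    sent_at: float   # transmission time (Time)
    data: bytes      # actual control information (Data)


def make_packet(seq, sent_at, data):
    """Build a packet whose checksum matches its data."""
    return ControlPacket(seq, zlib.crc32(data), sent_at, data)


def diagnose(packets):
    """Return (kind, seq) findings for a stream of received packets."""
    findings = []
    expected = None
    for p in packets:
        # A jump in the sequence number means the packets in between were lost.
        if expected is not None and p.seq > expected:
            findings.extend(("loss", s) for s in range(expected, p.seq))
        # A checksum mismatch suggests noise or corruption on the network.
        if zlib.crc32(p.data) != p.checksum:
            findings.append(("corrupt", p.seq))
        expected = p.seq + 1
    return findings


stream = [
    make_packet(1, 0.0, b"ring"),
    make_packet(2, 0.1, b"answer"),
    # Packet 3 never arrives; packet 4 arrives with a damaged checksum.
    ControlPacket(4, zlib.crc32(b"hangup") ^ 1, 0.3, b"hangup"),
]
print(diagnose(stream))
```

The sketch ignores reordered packets; a real gateway would also need a policy for late arrivals.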

Next, consider RTP packets, which carry the voice signal. Because they carry voice, freedom from jitter and packet loss is not guaranteed, and it is difficult to detect errors by the method described above. The DTMF signal is therefore used as a quality check by adding it to RTP packets as audio. Specifically, when RTP signals are exchanged with another device, an additional path for exchanging RTP signals is opened on a separate channel, and RTP packets containing DTMF tones are sent over it as a means of error detection.

FIG. 3 illustrates this method.

First, a call path for checking voice quality is opened between the gateway device 100-1 and the gateway device 100-2. As shown in FIG. 3, the gateway device 100-1 places DTMF tones on the RTP signal, transmits the digits 1 through 9 in succession, and repeats the sequence.

The receiving gateway device 100-2 recognizes the signal as DTMF and returns the DTMF signal to the gateway device 100-1 exactly as recognized. If voice quality is degraded on the path that carries RTP from the gateway device 100-1 to the gateway device 100-2, the DTMF signal is distorted or lost and the gateway device 100-2 fails to recognize it correctly. Because the gateway device 100-2 then sees a gap in digits that should arrive consecutively, it can determine that voice quality has been affected.

The gateway device 100-1 can likewise recognize that a problem occurred, because the DTMF signal returned by the gateway device 100-2 also has digits missing.
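The gap check on the repeating digit sequence can be illustrated with a short sketch. The function name `missing_digits` is hypothetical, and the sketch assumes that digits are never duplicated and that fewer than nine consecutive digits are lost (a loss of a whole cycle is invisible to this check).

```python
def missing_digits(received, first=1, last=9):
    """Return the digits skipped in a cyclic first..last sequence."""
    span = last - first + 1
    missing = []
    expected = None
    for d in received:
        if expected is not None:
            # Number of digits that should have arrived before d.
            gap = (d - expected) % span
            for k in range(gap):
                missing.append(first + (expected - first + k) % span)
        # The digit that should follow d, wrapping from last back to first.
        expected = first + (d - first + 1) % span
    return missing


# Digit 4 was lost on the path between the gateways.
print(missing_digits([1, 2, 3, 5, 6, 7, 8, 9, 1, 2]))
```

An empty result means the sequence arrived intact; any non-empty result is treated as a sign that voice quality was affected.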

As soon as a problem is detected, the problem module (for example, the VoIP control package that sends RTP, when the problem is detected in a voice call) is initialized, so that the device recovers from the error state quickly.

This keeps the duration of the failure to a minimum.

FIG. 4 shows the configuration of a failure analysis device (gateway device) 100 according to a first embodiment, which uses the method described above. FIG. 5 is a flowchart explaining the order in which the components shown in FIG. 4 operate.

As shown in FIG. 4, the failure analysis device 100 includes an RTP packet generation unit 110, a DTMF signal addition unit 120, a transmission unit 130, a reception unit 140, a problem detection unit 150, a reboot unit 160, an external signal recording unit 170, an internal state recording unit 180, and a control unit 190 that controls them. The external signal recording unit 170 and the internal state recording unit 180 may be separate from the main part, which consists of the RTP packet generation unit 110, the DTMF signal addition unit 120, the transmission unit 130, the reception unit 140, the problem detection unit 150, the reboot unit 160, and the control unit 190. Alternatively, they may be separate from the failure analysis device 100 itself, which contains that main part. The function of each component is described below in the explanation of the failure detection and recovery method given with reference to FIG. 5.

FIG. 5 is a flowchart of the failure detection and recovery method carried out by the failure analysis device 100.

First, the RTP packet generation unit 110 generates an RTP packet (STEP 1001).

Next, the DTMF signal addition unit 120 adds a DTMF signal to the RTP packet generated in STEP 1001 to form a failure detection signal (STEP 1002). As described above, DTMF tones for the digits 1 through 9 are added in succession, and the sequence is repeated.
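STEP 1001 and STEP 1002 might look roughly like the sketch below. The 12-byte fixed header follows the RTP layout in RFC 3550, and payload type 0 is PCMU in RFC 3551; everything else is an assumption made for illustration. In particular, the function names are hypothetical, and the one-byte ASCII payload is only a placeholder for the encoded DTMF audio that a real device would place in the packet.

```python
import itertools
import struct


def rtp_packet(seq, timestamp, ssrc, payload, payload_type=0):
    """Prepend a 12-byte RTP fixed header (version 2, no padding/extension/CSRC)."""
    header = struct.pack(
        "!BBHII",
        0x80,                      # V=2, P=0, X=0, CC=0
        payload_type & 0x7F,       # M=0, PT (0 = PCMU)
        seq & 0xFFFF,
        timestamp & 0xFFFFFFFF,
        ssrc & 0xFFFFFFFF,
    )
    return header + payload


def probe_stream(ssrc, samples_per_packet=160):
    """Yield (digit, packet) pairs for the digits 1..9, repeated indefinitely."""
    for seq, digit in enumerate(itertools.cycle("123456789")):
        # Placeholder payload: a real device would encode the DTMF tone as audio.
        payload = digit.encode("ascii")
        yield digit, rtp_packet(seq, seq * samples_per_packet, ssrc, payload)


first_ten = list(itertools.islice(probe_stream(ssrc=0x1234), 10))
print("".join(digit for digit, _ in first_ten))
```

The value 160 samples per packet corresponds to 20 ms of audio at 8 kHz, a common choice for telephony but not one stated in the source.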

Next, the transmission unit 130 transmits the failure detection signal generated in STEP 1002 to the other failure analysis device (STEP 1003).

Next, the reception unit 140 receives the failure detection signal returned from the other failure analysis device (STEP 1004).

Next, the problem detection unit 150 detects a problem on the basis of the DTMF signal included in the received failure detection signal (STEP 1005). Because the DTMF signal is detected as a waveform, fluctuations and interruptions in the waveform can be detected directly. The check therefore covers the quality of the DTMF signal, not merely whether a DTMF signal was received.
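The source does not say how the waveform is analysed. One common way to detect DTMF tones is the Goertzel algorithm, used here purely as a stand-in. The sketch measures how much of a block's energy sits in the strongest row and column tones and rejects the block when that share is too low. The 8 kHz sample rate, the 400-sample block, the 0.8 threshold, and all function names are assumptions.

```python
import math

LOW = (697, 770, 852, 941)        # standard DTMF row frequencies in Hz
HIGH = (1209, 1336, 1477, 1633)   # standard DTMF column frequencies in Hz
KEYS = ("123A", "456B", "789C", "*0#D")


def goertzel_power(samples, freq, rate):
    """Squared magnitude of the signal's spectrum at freq (Goertzel recurrence)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / rate)
    s1 = s2 = 0.0
    for x in samples:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


def dtmf_samples(key, rate=8000, n=400):
    """Synthesize a clean two-tone DTMF block for the given key."""
    row = next(i for i, r in enumerate(KEYS) if key in r)
    col = KEYS[row].index(key)
    return [0.5 * math.sin(2 * math.pi * LOW[row] * t / rate)
            + 0.5 * math.sin(2 * math.pi * HIGH[col] * t / rate)
            for t in range(n)]


def detect(samples, rate=8000, min_quality=0.8):
    """Return the detected key, or None if the tone is absent or degraded."""
    energy = sum(x * x for x in samples)
    if energy == 0.0:
        return None
    low = [goertzel_power(samples, f, rate) for f in LOW]
    high = [goertzel_power(samples, f, rate) for f in HIGH]
    # Close to 1.0 for a clean, uninterrupted tone pair; lower with dropouts or noise.
    quality = 2.0 * (max(low) + max(high)) / (len(samples) * energy)
    if quality < min_quality:
        return None
    return KEYS[low.index(max(low))][high.index(max(high))]


clean = dtmf_samples("5")
dropout = clean[:200] + [0.0] * 200   # second half of the tone lost in transit
print(detect(clean), detect(dropout))
```

Raising `min_quality` makes the detector stricter, so a degraded tone is treated as missing rather than silently accepted.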

Next, the reboot unit 160 initializes and restarts the module in which the problem was detected in STEP 1005 (STEP 1006).

In the description above, the problem is detected on the basis of the DTMF signal included in the failure detection signal received by the reception unit 140, but it may instead be detected on the basis of a control signal received by the reception unit 140. If the problem module cannot be identified in that case, the response depends on where the recording units are placed. If the external signal recording unit 170 and the internal state recording unit 180 are separate from the failure analysis device 100, the failure analysis device 100 itself may be initialized and restarted. If they are included in the failure analysis device 100 but are separate from its main part (the portion enclosed by the dotted line in FIG. 4, namely the RTP packet generation unit 110, the DTMF signal addition unit 120, the transmission unit 130, the reception unit 140, the problem detection unit 150, the reboot unit 160, and the control unit 190), only the main part may be rebooted.
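The choice of what to reboot can be summarised in a small decision function. This is a sketch of the cases described above, not the device's actual logic; the enum, the function name, and the string labels are hypothetical.

```python
from enum import Enum


class RecorderPlacement(Enum):
    SEPARATE_FROM_DEVICE = "separate from the failure analysis device 100"
    INSIDE_DEVICE_OUTSIDE_MAIN = "inside the device, outside the main part"


def reboot_target(problem_module, placement):
    """Pick the smallest scope that can be rebooted without losing recorded data."""
    if problem_module is not None:
        return problem_module          # normal case: reboot only the problem module
    if placement is RecorderPlacement.SEPARATE_FROM_DEVICE:
        return "device"                # recorders are external, so the whole device may restart
    return "main_part"                 # recorders are on board, so restart the main part only


print(reboot_target(None, RecorderPlacement.INSIDE_DEVICE_OUTSIDE_MAIN))
```

In every branch the recording units stay outside the rebooted scope, so the saved failure data survives the restart.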

When the quality of the DTMF signal is checked in STEP 1005, the DSP inside the receiving side may be tuned so that a DTMF signal with large fluctuation is not recognized as DTMF.

The functions of the external signal recording unit 170, the internal state recording unit 180, and related parts of the failure analysis device 100 are described later.

(Recording of failure information by the failure analysis device)
Next, the recording of failure information is described.

The gateway device operates on the packet signals it receives from outside, and its behavior is basically not changed by any other signal. Packets are therefore the most important information for failure analysis, but storing all packet data is impractical because the volume is enormous.

For this reason, the external signal recording unit 170 records only the packets received around the time at which a failure is detected by the means described above.

FIG. 6 shows a conceptual diagram of the external signal recording unit 170. Using the external signal recording unit 170, the gateway device continuously stores incoming packets in a ring buffer, overwriting the oldest entries. When a failure is detected, only the few packets before and after it are recorded. In this way only the packet data at and around the time of the failure is kept, which does not strain storage space. This reveals the external signals present when the failure occurred and makes it possible to identify the signal that triggered it.
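A ring buffer that keeps a few packets before and after a detected failure can be sketched with `collections.deque`. The class name, the method names, and the window sizes are illustrative assumptions; the source only says that a few packets before and after the failure are kept.

```python
from collections import deque


class ExternalSignalRecorder:
    """Keep the last `before` packets, plus `after` packets following a fault."""

    def __init__(self, before=3, after=2):
        self._ring = deque(maxlen=before)  # oldest entries are overwritten automatically
        self._after = after
        self._pending = None               # snapshot still waiting for post-fault packets
        self._remaining = 0
        self.snapshots = []                # completed recordings, one per fault

    def on_packet(self, packet):
        if self._pending is not None:
            self._pending.append(packet)
            self._remaining -= 1
            if self._remaining == 0:
                self.snapshots.append(self._pending)
                self._pending = None
        self._ring.append(packet)

    def on_fault(self):
        if self._pending is not None:
            return                         # already recording around an earlier fault
        snapshot = list(self._ring)        # packets received just before the fault
        if self._after == 0:
            self.snapshots.append(snapshot)
        else:
            self._pending = snapshot
            self._remaining = self._after


rec = ExternalSignalRecorder(before=3, after=2)
for n in range(1, 6):
    rec.on_packet(n)
rec.on_fault()                             # fault detected after packet 5
for n in range(6, 9):
    rec.on_packet(n)
print(rec.snapshots)
```

Memory use is bounded by `before + after` packets per fault, regardless of how long the device has been running.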

However, the occurrence of a failure is not necessarily determined by external signals alone. Often a failure arises because the gateway device receives a particular external signal only while it is in a particular state (status), which leads to behavior the device was not designed for.

It is therefore also important to record the internal state of the device at the time the failure occurred.

This is explained with reference to FIG. 7.

The architecture of today's typical digital computer is the von Neumann architecture: programs and data are stored in main memory, and the CPU executes the program sequentially while the program counter holds the address of the instruction being executed. The main memory together with the program counter on the CPU (including the various registers) therefore represents the entire state of the digital computer. If this state is saved and loaded onto a digital computer with the same architecture, the state at the time of saving, including the execution state of the program, can be reproduced in full.

Because the operation of the gateway device, a digital computer, is uniquely determined by its internal state and the external signals, saving the memory (including the program counter) and the external signals at the moment a failure is detected, using the external signal recording unit 170 and the internal state recording unit 180, captures the entire state in which the problem occurs.

Replaying this data makes it possible to reproduce the problem on another gateway device as many times as needed.
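The idea that a saved internal state plus the recorded external signals reproduces the failure deterministically can be shown with a toy state machine. The states, the signals, and the bug (an "answer" signal arriving in an unexpected state) are invented for illustration and are not taken from the source.

```python
def step(state, signal):
    """Pure transition: the next state depends only on (state, signal)."""
    state = dict(state)
    if signal == "offhook":
        state["call"] = "dialing"
    elif signal == "ring":
        state["call"] = "ringing"
    elif signal == "answer":
        # Hypothetical defect: "answer" is only handled while ringing.
        if state["call"] != "ringing":
            raise RuntimeError("answer received in state " + state["call"])
        state["call"] = "talking"
    elif signal == "onhook":
        state["call"] = "idle"
    return state


def replay(snapshot, signals):
    """Load a saved internal state and feed it the recorded external signals."""
    state = dict(snapshot)
    for signal in signals:
        state = step(state, signal)
    return state


def reproduces(snapshot, signals):
    """True if replaying the recording triggers the failure."""
    try:
        replay(snapshot, signals)
    except RuntimeError:
        return True
    return False


saved_state = {"call": "dialing"}     # internal state captured at the fault
recorded = ["answer"]                 # external signal captured at the fault
print([reproduces(saved_state, recorded) for _ in range(3)])
```

The same signal is harmless from a different starting state (`{"call": "ringing"}`), which is why the internal state must be recorded along with the packets.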

It is no exaggeration to say that working out the conditions for reproducing a problem generally accounts for most of the time spent on problem analysis. If the mechanism described above allows the problem to be reproduced 100% of the time, the speed of problem analysis improves dramatically.

As described above, once the failure analysis device according to the present invention has identified the problem module, it first reboots that module; the failure analysis system described below then performs the detailed failure analysis using the recorded external signal and internal state data. This makes it possible to reboot the problem module before the user even perceives the failure as a failure.

(Overview of the failure analysis system)
FIG. 8 shows an overview of a failure analysis system 500 that includes the failure analysis devices 100-1 and 100-2 described above, which also serve as the gateway devices.

Like the system shown in FIG. 2, the failure analysis system 500 includes gateway devices (failure analysis devices) 100-1 and 100-2 and a host device 200 that are connected to one another. The failure analysis system 500 further includes a PC terminal 150 connected to the gateway devices (failure analysis devices) 100-1 and 100-2 and to the host device 200.

The gateway devices (failure analysis devices) 100-1 and 100-2 are assumed to be gateway devices that accommodate telephone terminals.

As described with reference to FIG. 2, the gateway devices 100-1 and 100-2 communicate with the host device 200, and with each other, using signals that allow errors to be detected. In addition, the mechanism shown in FIG. 3 allows a problem to be detected immediately when a call abnormality, an operation delay, or the like occurs.

When a problem is detected, the memory state of the gateway device is immediately saved and the external signals are stored, following the procedure described above. As soon as the information has been saved, the problem module is partially restarted and the device recovers from the failure autonomously.

障害通知については、図８に示す経路で行われる。 The failure notification is performed through the route shown in FIG.

障害が発生したという情報は、ゲートウェイ装置１００−１から上位装置２００に通知され、上位装置２００は音声ネットワークを保守する保守担当者のＰＣ端末４００に対して、メールなどで通知を行う。メールで障害発生を検知した保守担当者は障害発生を即座に認識し、外部からゲートウェイ装置１００−１にアクセスする。これにより、ゲートウェイ装置１００−１に保存してある外部信号とメモリ状態（内部状態）を即座に入手できる。 Information that a failure has occurred is notified from the gateway device 100-1 to the host device 200, and the host device 200 notifies the PC terminal 400 of the person in charge of maintenance of the voice network by e-mail or the like. The maintenance staff who has detected the failure by mail immediately recognizes the failure and accesses the gateway device 100-1 from the outside. Thereby, the external signal and memory state (internal state) stored in the gateway device 100-1 can be obtained immediately.

取得したデータは、保守担当者から技術担当者に渡され、技術担当者はラボなどで、当該ラボに存在するゲートウェイ装置に対して、取得したデータのロードを行う。この時、ゲートウェイ装置の状態は、図６で示した外部信号と図７で示した内部情報とにより一意に決定されるのでネットワーク環境なども特に現地に合わせる必要はなく、上位装置、端末等も必要ない。つまり、図９に示すようにゲートウェイ装置１０００とＰＣ端末１５００さえあれば、ゲートウェイ装置１０００の内部状態として、障害が発生した現地と全く同じ状態をつくることができ、再現試験を極めて簡単に行うことができる。さらに、この状態でゲートウェイ装置１０００にＩＣＥなどの解析機器（デバッガツール）１１００を接続することで、ゲートウェイ装置１０００内部の解析が容易になり、直ちに問題の特定に至ることも困難ではない。 The acquired data is transferred from the maintenance staff to the technical staff, and the technical staff loads the acquired data to the gateway device existing in the lab at a lab or the like. At this time, since the state of the gateway device is uniquely determined by the external signal shown in FIG. 6 and the internal information shown in FIG. 7, it is not necessary to match the network environment with the local site. unnecessary. In other words, as shown in FIG. 9, if there is only the gateway device 1000 and the PC terminal 1500, the internal state of the gateway device 1000 can be made exactly the same as the site where the failure occurred, and the reproduction test can be performed very easily. Can do. Further, by connecting an analysis device (debugger tool) 1100 such as ICE to the gateway apparatus 1000 in this state, the analysis inside the gateway apparatus 1000 becomes easy, and it is not difficult to immediately identify the problem.

図８における構成例では、上位装置２００から外部ネットワークを経由して保守担当者のＰＣ端末４００に直接メールを送信し、リアルタイムな障害通知を可能にしている。また、保守担当者は、メール経由で障害発生したゲートウェイ装置１００−１のＩＰアドレス情報も取得できるため、同じく外部からゲートウェイ装置１００−１に直接アクセスでき、必要な情報を吸い上げることが出来る。そして、保守担当者が必要だと判断したときには、技術担当者にメールで連絡し、吸い上げた情報を技術担当者に送信することができる。 In the configuration example in FIG. 8, a mail is directly transmitted from the host device 200 to the PC terminal 400 of the person in charge of maintenance via an external network to enable real-time failure notification. Further, since the maintenance staff can also obtain the IP address information of the gateway device 100-1 in which a failure has occurred via e-mail, the maintenance staff can also directly access the gateway device 100-1 from the outside and suck up necessary information. When it is determined that the maintenance staff is necessary, the technical staff can be contacted by e-mail, and the collected information can be transmitted to the technical staff.

Access to the gateway device 100-1 and collection of information can also be performed automatically, triggered by the notification from the host device 200. The necessary information is then saved to the PC terminal 150 automatically even when the maintenance staff are not watching, and the occurrence of errors can be checked during periodic maintenance.

(Overview of how to use the failure analysis system)
With respect to the embodiments shown in FIGS. 8 and 9, the actual flow of failure analysis is described below.

When it detects a failure, the gateway device 100-1 saves all external signals and its internal memory state (internal state), and notifies the host device 200 that a failure has occurred.

After saving the necessary information, including the external signal information and the internal state information, to the external signal recording unit 170 and the internal state recording unit 180 (for example, external storage devices), respectively, the gateway device 100-1 promptly recovers from the abnormal program state or the like by partially rebooting the problem location or by rebooting the device itself. At this point all information needed for failure analysis has been saved to the external storage device, so rebooting the device does not erase it.
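The save-then-reboot sequence described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the class, method, and action names are hypothetical, the two lists merely stand in for the recording units 170 and 180, and the relative order of host notification and reboot is an assumption.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class GatewaySketch:
    """Toy model of the gateway device; every name here is hypothetical."""
    external_log: List[str] = field(default_factory=list)    # stands in for recording unit 170
    internal_log: List[bytes] = field(default_factory=list)  # stands in for recording unit 180
    actions: List[str] = field(default_factory=list)

    def handle_failure(self, signals, states, problem_module: Optional[str]):
        # 1. Persist the analysis data first, so a reboot cannot erase it.
        self.external_log.extend(signals)
        self.internal_log.extend(states)
        self.actions.append("saved")
        # 2. Tell the host device that a failure occurred (ordering assumed).
        self.actions.append("notified_host")
        # 3. Reboot only the problem module if it was identified,
        #    otherwise fall back to rebooting the whole device.
        if problem_module is not None:
            self.actions.append("reboot_module:" + problem_module)
        else:
            self.actions.append("reboot_device")
        return self.actions
```

For example, `GatewaySketch().handle_failure(["1. call instruction"], [b"\x00"], None)` ends with a whole-device reboot because no problem module was identified.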

On receiving the failure notification from the gateway device 100-1, the host device 200 uses a preset maintainer mail address or the like to notify the PC terminal 400 used by the maintainer that a failure has occurred and of the IP address of the gateway device 100-1. Knowing the IP address of the failed gateway device 100-1, the maintainer can access that gateway device 100-1 and obtain the necessary data.
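As an illustration only, the notification could be composed as an ordinary mail message that carries the two pieces of information named above (the fact of the failure and the gateway's IP address). The function name, sender address, and message wording below are assumptions; the patent does not specify a message format, and actual sending is omitted.

```python
from email.message import EmailMessage


def build_failure_notice(maintainer_addr, gateway_ip,
                         sender="host-device@example.invalid"):
    """Compose (but do not send) a failure notice; format is hypothetical."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = maintainer_addr
    msg["Subject"] = "Failure detected on gateway " + gateway_ip
    # The body repeats the IP address so the maintainer can access the device.
    msg.set_content(
        "A failure occurred on a gateway device.\n"
        "Gateway IP address: " + gateway_ip + "\n"
    )
    return msg
```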

If the gateway device 100-1 itself may be defective, the maintenance staff pass the acquired data to the development staff and request analysis. The development staff load the acquired data into the gateway device 1000 in the lab, which makes it possible to reproduce the state of the gateway device 100-1 at the time the defect occurred.

(Operation example of the failure analysis system)
Actual failures vary widely; as an example, the case where the host device 200 gives the gateway device 100-1 an unexpected instruction is described.

If, because of a design oversight or the like, the gateway device 100-1 receives from the host device 200 an instruction containing an unexpected parameter or the like, the gateway device 100-1 enters a parameter error and cannot perform call operation normally.

FIG. 10 shows an example.

When the host device 200 issues a call instruction to the gateway device 100-1, the gateway device 100-1 performs call negotiation with the other gateway device 100-2 and then exchanges RTP to carry the call. Suppose that, because of a program error, it becomes unable to send RTP packets.

At this time, the call failure is detected through the exchange of DTMF signals described earlier.

With the mechanism shown in FIG. 6, all external signals at and around the time of the failure are recorded. Thus, although the failure was detected at "3. RTP packet", the "1. Call instruction" that triggered the start of the call is recorded as well.
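One way to keep the signals at and around the failure is a bounded buffer of recent signals that is frozen when the failure is flagged and then extended by a fixed number of later signals. The sketch below assumes this design; the class name, the `before` and `after` parameters, and the choice to ignore signals once the window is full are all illustrative and not taken from FIG. 6.

```python
from collections import deque


class ExternalSignalRecorder:
    """Keeps `before` signals preceding a failure, the failing signal itself,
    and `after` signals following it (hypothetical helper)."""

    def __init__(self, before, after):
        self._recent = deque(maxlen=before + 1)  # failing signal + predecessors
        self._after_left = after
        self.snapshot = None  # filled in once a failure is flagged

    def record(self, signal, failed=False):
        if self.snapshot is not None:
            # Failure already captured: append only the trailing window.
            if self._after_left > 0:
                self.snapshot.append(signal)
                self._after_left -= 1
            return
        self._recent.append(signal)
        if failed:
            self.snapshot = list(self._recent)
```

With `before=2`, a failure flagged on "3. RTP packet" still leaves "1. call instruction" in the snapshot, which mirrors the situation described in the text.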

The internal state of the gateway device 100-1 is also recorded so that it is paired with each external signal; the internal state immediately before each signal is processed is saved in memory. Consequently, loading memory in this state into the gateway device 1000 reproduces the state of the gateway device 100-1 immediately before it processed that signal, and applying the external signal and letting the program run makes it possible to trace the program as it processes the signal. If the combination of signal and internal state that triggered the failure at the site can be identified, the problematic location in the program (the bug) can be reached.
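The pairing of "internal state just before processing" with "external signal" is what makes replay deterministic. The toy sketch below illustrates the idea under heavy assumptions: the state is a small dictionary rather than a memory image, and `handle_signal`, the codec parameter, and the `rtp_enabled` flag are invented stand-ins for the gateway program and its bug.

```python
import copy


def handle_signal(state, signal):
    """Toy stand-in for the gateway program (hypothetical logic)."""
    new_state = copy.deepcopy(state)
    if signal["type"] == "call_instruction":
        # Invented design oversight: an unexpected codec parameter
        # silently leaves RTP sending disabled.
        new_state["rtp_enabled"] = signal.get("codec") in new_state["supported_codecs"]
    return new_state


def replay(recorded_pairs, index):
    """Load the recorded pre-signal state and re-apply the recorded signal."""
    state_before, signal = recorded_pairs[index]
    return handle_signal(state_before, signal)
```

Because the handler depends only on the recorded state and the recorded signal, replaying the same pair in the lab yields the same faulty outcome each time, which is the property the text relies on.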

The problem in this example is a malfunction of the gateway device 100-1 caused by an unexpected instruction from the host device. Therefore, when the external signal "1. Call instruction (host device)" is input to the gateway device 1000 under the internal state "1. At call instruction processing", execution flows into processing the program did not anticipate, and the state in which RTP cannot be sent normally is reproduced.

At this point, in an environment equipped with a debugger tool such as an ICE, the program can be single-stepped, so the path by which execution reaches the abnormal processing can be identified easily.

With the mechanism described above, reproducing the problem and identifying the phenomenon are very easy.

According to the above embodiment, when a DTMF signal cannot be received correctly, it is determined that an IP network failure leading to delay, jitter, dropouts, or the like in call audio has occurred, and acquisition of the data needed for analysis can be started. Because DTMF signal transmitting and receiving circuits are already built into conventional telephone equipment, the invention can be implemented at low cost.

Moreover, conventional failure detection using keep-alive signals or PING packets can detect a network failure in which IP packets do not get through, or a peer device that has become inoperable, but cannot detect network failures in which packets get through yet voice calls are impaired by delay, jitter in the audio signal, dropouts, and the like. The present embodiment, by contrast, can detect that a network failure has occurred even though packets are getting through.
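The difference from a plain reachability check can be sketched by comparing the DTMF digits that were sent with the digits that came back, in order. The function name and the three result labels below are illustrative assumptions; the patent only states that detection is based on whether the returned DTMF signals appear in the order in which they were added.

```python
def check_echoed_dtmf(sent, received):
    """Classify the echo of a DTMF digit sequence (labels are hypothetical).

    'unreachable': nothing came back, which a PING-style check would also catch.
    'degraded':    something came back, but digits are missing or out of order.
    'ok':          the digits came back exactly in the order they were sent.
    """
    if not received:
        return "unreachable"
    if list(received) == list(sent):
        return "ok"
    return "degraded"
```

A dropped digit (`"13#"` for `"123#"`) or a reordered one (`"132#"`) is reported as degraded even though packets clearly arrived, which is the case a keep-alive or PING check cannot distinguish.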

Furthermore, maintenance man-hours, which account for a very large share of software development, can be reduced substantially.

This in turn enables reliable and rapid reproduction of the phenomenon behind a failure, rapid identification of its cause and application of a fix, and prompt recovery from the failure. Specifically, building failure recording and analysis functions into the network device makes failure analysis easier, and providing an autonomous recovery function minimizes the impact in the operating environment.

(Other examples)
As a practical constraint, recording the entire internal state shown in FIG. 7 may be difficult because of limitations of the external recording device or the like. In that case, the required storage capacity is reduced by recording only the minimum necessary memory. Knowing even just the triggering external signal and the minimum necessary memory is a very significant clue for developers and is expected to contribute greatly to finding the cause.
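A simple way to picture this reduction is to keep only a configured set of memory regions from the full internal state. The sketch below assumes the internal state is available as named regions; the region names and the `essential` list are hypothetical, and how the essential regions are chosen is left open in the text.

```python
from typing import Dict, List


def trim_snapshot(snapshot: Dict[str, bytes], essential: List[str]) -> Dict[str, bytes]:
    """Keep only the regions listed in `essential` (names are hypothetical)."""
    return {name: snapshot[name] for name in essential if name in snapshot}


def snapshot_size(snapshot: Dict[str, bytes]) -> int:
    """Total number of bytes that would have to be stored."""
    return sum(len(data) for data in snapshot.values())
```

For instance, dropping a large general-purpose heap region while keeping a call table and an RTP context shrinks the stored snapshot to the size of those two regions.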

The present invention has been described above with reference to the embodiments, but it is not limited to them. Various changes that those skilled in the art can understand may be made to the configuration and details of the present invention within its scope.

Each part of the failure analysis device and the failure analysis system described above can be realized by hardware, software, or a combination of the two. The failure analysis method performed by the failure analysis device or failure analysis system can likewise be realized by hardware, software, or a combination of the two. Here, being realized by software means being realized by a computer reading and executing a program, or by hardware operating according to microcode corresponding to the program.

The program can be stored on various types of non-transitory computer readable media and supplied to a computer. Non-transitory computer readable media include various types of tangible storage media. Examples of non-transitory computer readable media include magnetic recording media (for example, flexible disks, magnetic tapes, hard disk drives), magneto-optical recording media (for example, magneto-optical disks), CD-ROM (Read Only Memory), CD-R, CD-R/W, and semiconductor memory (for example, mask ROM, PROM (Programmable ROM), EPROM (Erasable PROM), flash ROM, RAM (random access memory)). The program may also be supplied to a computer by various types of transitory computer readable media. Examples of transitory computer readable media include electrical signals, optical signals, and electromagnetic waves. A transitory computer readable medium can supply the program to a computer via a wired communication path such as an electric wire or optical fiber, or via a wireless communication path.

Part or all of the above embodiments can also be described as in the following supplementary notes, but are not limited to them.
(Appendix 1)
A failure analysis device,
RTP packet generation means for generating an RTP packet;
A DTMF signal adding means for adding a DTMF signal to the RTP packet and making it a failure detection signal;
Transmitting means for transmitting the failure detection signal to another failure analysis device;
Receiving means for receiving the failure detection signal returned from the other failure analysis device;
Problem detection means for detecting a problem module in the failure analysis device based on the DTMF signal included in the failure detection signal received by the reception means;
A failure analysis apparatus comprising: rebooting means for rebooting the problem module detected by the problem detection means.
(Appendix 2)
The failure analysis apparatus according to appendix 1,
The DTMF signal adding means adds a plurality of types of DTMF signals to the RTP packet in order, and the problem detection means detects the problem module based on whether the DTMF signals included in the failure detection signal received by the receiving means are included in the failure detection signal in the order in which they were added.
(Appendix 3)
The failure analysis device according to appendix 1 or 2,
The receiving means further receives a control signal from another failure analysis device or a host device of the failure analysis device,
The failure analysis apparatus, wherein the problem detection unit detects the problem module based on the control signal received by the reception unit.
(Appendix 4)
The failure analysis apparatus according to any one of appendices 1 to 3,
The failure analysis device, wherein when the problem detection unit cannot identify the problem module, the reboot unit reboots the failure analysis device itself.
(Appendix 5)
The failure analysis device according to appendix 3 or 4, wherein
the receiving means further receives an audio signal, and
the failure analysis device further comprises external signal recording means for recording, among external signals including the audio signal and the control signal received by the receiving means, the external signals corresponding to the time at which a problem occurred in the problem module and to a predetermined number of packets before and after it.
(Appendix 6)
The failure analysis apparatus according to appendix 5,
The failure analysis device further comprising an internal state recording unit that records an internal state of the failure analysis device at the time when the reception unit receives the external signal.
(Appendix 7)
A failure analysis system,
A plurality of failure analysis apparatuses according to any one of appendices 1 to 6 connected to each other;
A failure analysis system comprising: a host device that controls the plurality of failure analysis devices.
(Appendix 8)
The failure analysis system according to appendix 7,
An external terminal connected to the host device and the plurality of failure analysis devices;
When a problem occurs in the problem module in any one of the plurality of failure analysis devices, the host terminal notifies the external terminal of the occurrence of the problem and of the IP address of the failure analysis device in which the problem occurred.
(Appendix 9)
A plurality of failure analysis devices according to appendix 6 connected to each other;
A failure analysis system comprising a host device for controlling the plurality of failure analysis devices,
The fault analysis system, wherein the external terminal receives the external signal and the internal state stored in the fault analysis apparatus in which the problem has occurred from the fault analysis apparatus in which the problem has occurred.
(Appendix 10)
The failure analysis system according to attachment 9, wherein
The external terminal automatically receives the external signal and the internal state stored by the failure analysis device in which the problem occurred, triggered by the notification, from the host terminal, of the occurrence of the problem and of the IP address.
(Appendix 11)
A failure analysis method,
An RTP packet generation step of generating an RTP packet;
A DTMF signal adding step of adding a DTMF signal to the RTP packet to be a failure detection signal;
A transmission step of transmitting the failure detection signal to another failure analysis device;
A reception step of receiving the failure detection signal returned from the other failure analysis device;
A problem detection step of detecting a problem module in the own failure analysis device based on the DTMF signal included in the received failure detection signal;
And a rebooting step of rebooting the problem module detected in the problem detection step.
(Appendix 12)
A failure analysis program for causing a computer to function as a failure analysis device, the program causing the computer to function as:
RTP packet generation means for generating an RTP packet;
DTMF signal adding means for adding a DTMF signal to the RTP signal to form a failure detection signal;
transmitting means for transmitting the failure detection signal to another failure analysis device;
receiving means for receiving the failure detection signal returned from the other failure analysis device;
problem detection means for detecting a problem module in the failure analysis device based on the DTMF signal included in the failure detection signal received by the receiving unit; and
reboot means for rebooting the problem module detected by the problem detection means.

The present invention can be used in the field of communication networks. It is particularly suited to communication networks that accommodate call terminals.

10-1, 10-2 Gateway device
20 Host device
30-1, 30-2 Terminal
100, 100-1, 100-2 Failure analysis device (gateway device)
110 RTP packet generation unit
120 DTMF signal addition unit
130 Transmission unit
140 Reception unit
150 Problem detection unit
160 Reboot unit
170 External signal recording unit
180 Internal state recording unit
190 Control unit
200 Host device
300-1, 300-2 Terminal
400 PC terminal
500 Failure analysis system
1000 Gateway device
1100 Debugger tool
1400 PC terminal

According to a first aspect of the present invention, there is provided a failure analysis device comprising: RTP packet generation means for generating an RTP packet; DTMF signal adding means for adding a DTMF signal to the RTP packet to form a failure detection signal; transmitting means for transmitting the failure detection signal to another failure analysis device; receiving means for receiving the failure detection signal returned from the other failure analysis device; problem detection means for detecting a problem module in the failure analysis device based on the DTMF signal included in the failure detection signal received by the receiving means; and reboot means for rebooting the problem module detected by the problem detection means, wherein the receiving means further receives a control signal from another failure analysis device or from a host device of the failure analysis device, and the problem detection means detects the problem module based on the control signal received by the receiving means.

According to a second aspect of the present invention, there is provided a failure analysis method comprising: an RTP packet generation step of generating an RTP packet; a DTMF signal adding step of adding a DTMF signal to the RTP packet to form a failure detection signal; a transmission step of transmitting the failure detection signal to another failure analysis device; a reception step of receiving the failure detection signal returned from the other failure analysis device; a problem detection step of detecting a problem module in the own failure analysis device based on the DTMF signal included in the received failure detection signal; and a reboot step of rebooting the problem module detected in the problem detection step, wherein the reception step further receives a control signal from another failure analysis device or from a host device of the failure analysis device, and the problem detection step detects the problem module based on the control signal received in the reception step.

According to a third aspect of the present invention, there is provided a failure analysis program for causing a computer to function as a failure analysis device, the program causing the computer to function as: RTP packet generation means for generating an RTP packet; DTMF signal adding means for adding a DTMF signal to the RTP packet to form a failure detection signal; transmitting means for transmitting the failure detection signal to another failure analysis device; receiving means for receiving the failure detection signal returned from the other failure analysis device; problem detection means for detecting a problem module in the failure analysis device based on the DTMF signal included in the failure detection signal received by the receiving means; and reboot means for rebooting the problem module detected by the problem detection means, wherein the receiving means further receives a control signal from another failure analysis device or from a host device of the failure analysis device, and the problem detection means detects the problem module based on the control signal received by the receiving means.

Claims

A failure analysis device,
RTP packet generation means for generating an RTP packet;
A DTMF signal adding means for adding a DTMF signal to the RTP packet and making it a failure detection signal;
Transmitting means for transmitting the failure detection signal to another failure analysis device;
Receiving means for receiving the failure detection signal returned from the other failure analysis device;
Problem detection means for detecting a problem module in the failure analysis device based on the DTMF signal included in the failure detection signal received by the reception means;
A failure analysis apparatus comprising: rebooting means for rebooting the problem module detected by the problem detection means.

The failure analysis apparatus according to claim 1,
The DTMF signal adding means adds a plurality of types of DTMF signals to the RTP signal in order, and the problem detection means detects the problem module based on whether the DTMF signals included in the failure detection signal received by the receiving means are included in the failure detection signal in the order in which they were added.

The failure analysis apparatus according to claim 1 or 2,
The receiving means further receives a control signal from another failure analysis device or a host device of the failure analysis device,
The failure analysis apparatus, wherein the problem detection unit detects the problem module based on the control signal received by the reception unit.

The failure analysis apparatus according to any one of claims 1 to 3,
The failure analysis device, wherein when the problem detection unit cannot identify the problem module, the reboot unit reboots the failure analysis device itself.

The failure analysis device according to claim 3 or 4, wherein
the receiving means further receives an audio signal, and
the failure analysis device further comprises external signal recording means for recording, among external signals including the audio signal and the control signal received by the receiving means, the external signals corresponding to the time at which a problem occurred in the problem module and to a predetermined number of packets before and after it.

The failure analysis apparatus according to claim 5,
The failure analysis device further comprising an internal state recording unit that records an internal state of the failure analysis device at the time when the reception unit receives the external signal.

A failure analysis system,
A plurality of failure analysis apparatuses according to any one of claims 1 to 6, which are connected to each other;
A failure analysis system comprising: a host device that controls the plurality of failure analysis devices.

The failure analysis system according to claim 7,
An external terminal connected to the host device and the plurality of failure analysis devices;
When a problem occurs in the problem module in any one of the plurality of failure analysis devices, the host terminal notifies the external terminal of the occurrence of the problem and of the IP address of the failure analysis device in which the problem occurred.

A failure analysis method,
An RTP packet generation step of generating an RTP packet;
A DTMF signal adding step of adding a DTMF signal to the RTP signal and making it a failure detection signal;
A transmission step of transmitting the failure detection signal to another failure analysis device;
A reception step of receiving the failure detection signal returned from the other failure analysis device;
A problem detection step of detecting a problem module in the own failure analysis device based on the DTMF signal included in the received failure detection signal;
And a rebooting step of rebooting the problem module detected in the problem detection step.

A failure analysis program for causing a computer to function as a failure analysis device, the program causing the computer to function as:
RTP packet generation means for generating an RTP packet;
DTMF signal adding means for adding a DTMF signal to the RTP signal to form a failure detection signal;
transmitting means for transmitting the failure detection signal to another failure analysis device;
receiving means for receiving the failure detection signal returned from the other failure analysis device;
problem detection means for detecting a problem module in the failure analysis device based on the DTMF signal included in the failure detection signal received by the receiving unit; and
reboot means for rebooting the problem module detected by the problem detection means.