JPH02159636A - Network fault diagnostic system - Google Patents

Network fault diagnostic system

Info

Publication number
JPH02159636A
Authority
JP
Japan
Prior art keywords
network
cause
equipment
inspection
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP63316036A
Other languages
Japanese (ja)
Inventor
Takuya Yamahira
山平 拓也
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp
Priority to JP63316036A
Publication of JPH02159636A

Abstract

PURPOSE: To diagnose faults occurring in a network effectively by repeatedly hypothesizing the cause of a fault, verifying the hypothesis, forming a more detailed hypothesis, and instructing further inspection of information. CONSTITUTION: The diagnostic system has an inference unit 1 that, in order to find the cause of a network fault and recover from it, hypothesizes the cause of the fault, instructs a state inspection unit 3 to inspect network information to verify the hypothesis, forms a more detailed hypothesis based on the inspection result obtained by the state inspection unit 3, repeats this process of instructing inspection, determines the cause as a result, and presents a recovery method corresponding to the cause. The system also has the state inspection unit 3, which collects the network information as instructed, decides whether the collected information matches the predetermined usage conditions of each device, and sends the decision to the inference unit 1. In this way, network faults can be diagnosed effectively.

Description

DETAILED DESCRIPTION OF THE INVENTION

[Field of Industrial Application]

The present invention relates to a network fault diagnosis system, and in particular to one that performs early detection and diagnosis of network faults, investigation of their causes, and rapid recovery from them.

[Conventional technology]

Knowledge processing is known to be an effective way of reflecting human reasoning in computer processing. In network fault diagnosis, too, the methods that maintenance personnel have conventionally used are highly specialized and follow rational procedures. Implementing such expert procedures in a device therefore naturally calls for knowledge processing. Several devices that apply knowledge processing to diagnosis exist among medical diagnostic systems, but none has yet been seen that applies it to network fault diagnosis, the subject of the present invention.

[Problem to be solved by the invention]

Recent trends in building networks of computers and communication equipment include wide-area distribution and growth in scale. As a result, the loss caused by a fault inevitably tends to increase. At the same time, network operation and maintenance become more complex, and their cost rises. A method of diagnosing network faults effectively has therefore become necessary.

An object of the present invention is to provide a network fault diagnosis system that, in a large-scale and increasingly complex network system environment, uses knowledge processing to diagnose network faults effectively in a manner close to an expert's diagnosis, so that users can use the network with confidence.

[Means to solve the problem]

The network fault diagnosis system of the present invention applies to a network that consists of computers, terminal equipment, exchanges, and communication equipment and is intended to transmit information electrically. It addresses network faults arising from physical failures in the devices or transmission paths that make up the network, from incorrect operation by users of the network devices, from malfunction of network devices caused by defects in computer programs written to use them, or from malfunction of devices caused by incorrectly set parameter values for operating them. The system is characterized by comprising an inference unit and a state inspection unit. To investigate the cause of such a fault and recover from it, the inference unit forms a hypothesis about the cause, instructs the state inspection unit to inspect network information in order to verify the hypothesis, forms a more detailed hypothesis based on the inspection result obtained by the state inspection unit, and repeats this process of instructing inspection; as a result it determines the cause and presents a recovery method corresponding to the cause. The state inspection unit collects the network information as instructed by the inference unit, determines whether the collected information matches the predetermined usage conditions of each device, and sends the determination result to the inference unit.

[Embodiment]

An embodiment of the present invention will be described in detail with reference to the drawings.

FIG. 1 is a diagram showing an embodiment of the network fault diagnosis system of the present invention. In FIG. 1, the inference unit 1 uses the fault diagnosis knowledge in the inference knowledge base 2 to plan a strategy for investigating the cause. The inference knowledge base 2 records knowledge about network faults: the state into which a device has fallen, or the result of inspecting network information in the state inspection unit 3, is associated with a fault state of each device. The fault states of each device are related hierarchically according to the type and level of detail of the confirmed data and inspection results. When a fault state is confirmed, the knowledge base presents several more detailed candidate fault states that bring the investigation closer to the cause.

To confirm a candidate fault state presented by the inference knowledge base 2, the inference unit 1 refers to the inference knowledge base 2 for the inspection method for that fault state and instructs the state inspection unit 3 to carry it out. In other words, the inference knowledge base 2 can be thought of as a collection of data cards, each defining a fault state according to the content of the collected information. The cards form a hierarchy from a rough level of collected information to a detailed level, and, depending on how the data changes, each fault state is linked to several fault states at the next level. Fault states whose cause has been identified are linked to a recovery method.
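
The "data card" view of the inference knowledge base 2 can be sketched as a small tree structure. The following Python is only an illustration under stated assumptions: the class name `FaultCard`, its fields, the helper `candidates`, and every fault state, inspection name, and recovery text in the example are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class FaultCard:
    """One hypothetical 'data card': a fault state and how to confirm it."""
    name: str                        # fault state described by this card
    check: str                       # assumed id of the inspection that confirms it
    recovery: Optional[str] = None   # set only when the card identifies a cause
    children: List["FaultCard"] = field(default_factory=list)  # more detailed states

    def is_cause(self) -> bool:
        # A card whose cause is known is linked to a recovery method.
        return self.recovery is not None


def candidates(card: FaultCard) -> List[FaultCard]:
    """Return the more detailed fault states to hypothesize once `card` is confirmed."""
    return list(card.children)


# Illustrative hierarchy from a rough state down to detailed states.
root = FaultCard(
    name="terminal cannot connect",
    check="ask_user_connection_status",
    children=[
        FaultCard("line parameter mismatch", "check_line_speed",
                  recovery="set the line speed to the defined value"),
        FaultCard("physical line fault", "check_line_trace",
                  children=[FaultCard("cable disconnected", "check_carrier",
                                      recovery="reconnect the cable")]),
    ],
)

print([c.name for c in candidates(root)])
```

Keeping the recovery method on the card itself mirrors the statement that fault states with an identified cause are linked to a recovery method; a real knowledge base could equally store that link in a separate table.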

The state inspection unit 3 collects network information as instructed by the inference unit 1. Collection methods include asking the user about the situation, inspecting line traces, and checking parameters needed to use the devices, such as system configuration definitions. Each item of collected data is compared with the usage conditions recorded in the inspection knowledge base 4 (parameters, line trace logic rules, operating procedures, and so on) to determine whether it is correct, and the result is sent to the inference unit 1.
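
The comparison performed by the state inspection unit 3 can be sketched as a lookup of collected values against expected ones. This is a minimal sketch, assuming the usage conditions can be expressed as simple key and value pairs; the function name `inspect`, the result layout, and the parameter names and values are hypothetical.

```python
def inspect(collected: dict, conditions: dict) -> dict:
    """Compare collected network information with predetermined usage conditions.

    Returns {"ok": bool, "mismatches": {key: (expected, actual)}}.
    A key missing from `collected` is reported with an actual value of None.
    """
    mismatches = {
        key: (expected, collected.get(key))
        for key, expected in conditions.items()
        if collected.get(key) != expected
    }
    return {"ok": not mismatches, "mismatches": mismatches}


# Hypothetical usage conditions for one device (illustrative values only).
inspection_kb = {"line_speed_bps": 9600, "parity": "even"}
collected = {"line_speed_bps": 4800, "parity": "even"}

result = inspect(collected, inspection_kb)
print(result)
```

Line trace rules or operating procedures would need richer checks than equality; the sketch covers only the parameter case.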

FIG. 2 is a diagram showing the operation of each function of the present invention.

In FIG. 2, in operation A, a report that a fault has occurred reaches the inference unit 1 from the user 5 or the target network device 6 through the state inspection unit 3. In operation B, the inference unit 1 instructs the state inspection unit 3 to conduct an initial interview with the user 5 or the target network device 6. In operation C, the state inspection unit 3 conducts the initial interview as instructed. In operation D, the user 5 and the target network device 6 send their answers to the questions to the inference unit 1 through the state inspection unit 3. In operation E, the inference unit 1 forms a hypothesis about the fault state and instructs the state inspection unit 3 to confirm that state. In operation F, the state inspection unit 3 collects the relevant data from the user 5 or the target network device 6 and inspects it.

In operation G, the state inspection unit 3 sends the inspection result to the inference unit 1. In operation H, once the inference unit 1 has identified the cause and confirmed the recovery method, it notifies the user 5 or the target network device 6 of the recovery method through the state inspection unit 3.
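
Operations A to H can be read as a message exchange among the user or device, the state inspection unit, and the inference unit. The sketch below traces a single pass through that exchange. It is an illustration only: the function `run_exchange`, its callback parameters, and all message texts are assumptions, and operations A, D, and H are logged end to end although the text says they are relayed through the state inspection unit.

```python
from typing import Callable, List, Tuple

Message = Tuple[str, str, str, str]  # (operation, sender, receiver, content)


def run_exchange(answer: Callable[[str], str],
                 check: Callable[[str], bool]) -> List[Message]:
    """Trace one pass through operations A to H and return the message log."""
    log: List[Message] = []
    user, insp, inf = "user/device", "state inspection unit", "inference unit"

    log.append(("A", user, inf, "fault report"))           # relayed via the inspection unit
    log.append(("B", inf, insp, "conduct initial interview"))
    question = "What symptom do you observe?"
    log.append(("C", insp, user, question))
    symptom = answer(question)
    log.append(("D", user, inf, symptom))                   # relayed via the inspection unit
    hypothesis = "fault state suggested by: " + symptom
    log.append(("E", inf, insp, "confirm " + hypothesis))
    log.append(("F", insp, user, "collect related data"))
    confirmed = check(hypothesis)
    log.append(("G", insp, inf, "confirmed" if confirmed else "rejected"))
    if confirmed:
        log.append(("H", inf, user, "recovery method"))    # relayed via the inspection unit
    return log


log = run_exchange(lambda q: "terminal cannot connect", lambda h: True)
for op, src, dst, content in log:
    print(f"{op}: {src} -> {dst}: {content}")
```

In the described system, operations E to G repeat with more detailed hypotheses until a cause is found; the sketch stops after one round to keep the message order visible.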

FIG. 3 shows the diagnostic procedure of the present invention as a flowchart.

The procedure is divided into three phases: an initial interview, which asks questions to establish roughly how the fault occurred; cause investigation, which repeatedly generates and verifies hypotheses to find the cause; and recovery.

In the initial interview, diagnostic steps P1 and P2 collect information from the user and the target network device, as described above with reference to FIG. 2, and use it to classify the fault roughly.

In the cause investigation, diagnostic steps P3 to P6 repeatedly generate and verify hypotheses to find the cause.

In recovery, diagnostic step P7 refers to the inference knowledge base used by the inference unit and presents the recovery method for the cause to the user or the target network device.
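
The three phases (initial interview, cause investigation, recovery) can be sketched as a loop that narrows a rough fault state down to one with a known recovery method. This is a minimal sketch under several assumptions: the tables `CHILDREN` and `RECOVERY`, their contents, and the function `diagnose` are hypothetical; the mapping of phases to steps P1 to P7 follows the text only at the level of phases; and the loop takes the first confirmed hypothesis at each level without backtracking, which the patent does not specify.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical knowledge base: fault state -> more detailed candidate states.
# The example hierarchy is acyclic, so the loop below terminates.
CHILDREN: Dict[str, List[str]] = {
    "cannot connect": ["parameter error", "line fault"],
    "line fault": ["cable disconnected", "modem failure"],
}

# Fault states whose cause is identified, with their recovery methods.
RECOVERY: Dict[str, str] = {
    "parameter error": "correct the parameter value",
    "cable disconnected": "reconnect the cable",
    "modem failure": "replace the modem",
}


def diagnose(initial_state: str, verify: Callable[[str], bool]) -> Optional[str]:
    """Return a recovery method, or None if no hypothesis could be confirmed."""
    state = initial_state               # rough classification from the initial interview
    while state not in RECOVERY:        # cause investigation: hypothesize, then verify
        confirmed = next(
            (h for h in CHILDREN.get(state, []) if verify(h)), None
        )
        if confirmed is None:
            return None                 # every candidate hypothesis was rejected
        state = confirmed               # descend to the more detailed fault state
    return RECOVERY[state]              # recovery: present the method for the cause


# Pretend the state inspection unit confirms exactly these fault states.
observed = {"line fault", "cable disconnected"}
print(diagnose("cannot connect", lambda h: h in observed))
```

Here `verify` stands in for the round trip through the state inspection unit; in the described system each call would trigger data collection and comparison against the inspection knowledge base.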

Thus, the present invention is a network fault diagnosis system that investigates the cause of faults occurring in a network by a process similar to the one an expert would be expected to follow, and so enables effective fault recovery.

[Effect of the invention]

As explained above, the network fault diagnosis system of the present invention has the effect that fault recovery can be performed automatically.

[Brief explanation of the drawings]

FIG. 1 is a block diagram showing an embodiment of the present invention, FIG. 2 is a diagram showing the operation of each function of the present invention, and FIG. 3 is a flowchart showing the diagnostic procedure of the present invention.

A, B, ..., H: operations; P1, P2, ..., P7: diagnostic steps; 1: inference unit; 2: inference knowledge base; 3: state inspection unit; 4: inspection knowledge base; 5: user; 6: target network device.

Claims (1)

[Claims]

A network fault diagnosis system for a network that consists of computers, terminal equipment, exchanges, and communication equipment and is intended to transmit information electrically, the system addressing network faults arising from physical failures in the devices or transmission paths that make up the network, incorrect operation by users of the network devices, malfunction of network devices caused by defects in computer programs written to use the network devices, or malfunction of devices caused by incorrectly set parameter values for operating the devices, the system being characterized by comprising: an inference unit that, in order to investigate the cause of such a fault and recover from it, forms a hypothesis about the cause of the fault, instructs a state inspection unit to inspect network information in order to verify the hypothesis, forms a more detailed hypothesis based on the inspection result obtained by the state inspection unit, repeats this process of instructing inspection of information, determines the cause as a result, and presents a recovery method corresponding to the cause; and the state inspection unit, which collects the network information as instructed by the inference unit, determines whether the collected information matches predetermined usage conditions of each device, and sends the determination result to the inference unit.
JP63316036A 1988-12-13 1988-12-13 Network fault diagnostic system Pending JPH02159636A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63316036A JPH02159636A (en) 1988-12-13 1988-12-13 Network fault diagnostic system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63316036A JPH02159636A (en) 1988-12-13 1988-12-13 Network fault diagnostic system

Publications (1)

Publication Number Publication Date
JPH02159636A true JPH02159636A (en) 1990-06-19

Family

ID=18072545

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63316036A Pending JPH02159636A (en) 1988-12-13 1988-12-13 Network fault diagnostic system

Country Status (1)

Country Link
JP (1) JPH02159636A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06314209A (en) * 1992-07-01 1994-11-08 Hitachi Inf Syst Ltd Inference method and inference system for fault restoration time based on example
WO2008007442A1 (en) * 2006-07-14 2008-01-17 Fujitsu Limited System management program, system management device and system management method
WO2008007443A1 (en) * 2006-07-14 2008-01-17 Fujitsu Limited System management program, system management device and system management method

Similar Documents

Publication Publication Date Title
US5394543A (en) Knowledge based machine initiated maintenance system
US5404503A (en) Hierarchical distributed knowledge based machine inititated maintenance system
US4847795A (en) System for diagnosing defects in electronic assemblies
Preece Evaluating verification and validation methods in knowledge engineering
US5018075A (en) Unknown response processing in a diagnostic expert system
JPH01267742A (en) System for diagnosing trouble
WO2014173276A1 (en) Method and system for judging reliability of dcs man-machine interfaces through hra
JPS6014303A (en) Knowledge-based diagnosis system
CN114860518A (en) Detection method and system of function safety system, electronic equipment and storage medium
JPH09205429A (en) Network fault diagnostic device, fault prediction device, and its diagnostic and prediction method
JPH02159636A (en) Network fault diagnostic system
Kapadia SymCure: A model-based approach for fault management with causal directed graphs
Eldh et al. Robustness testing of mobile telecommunication systems: A case study on industrial practice and challenges
JPS62175060A (en) Automatic trouble diagnosing system of electronic exchange
JP2637774B2 (en) Operation expert system
Vermesau Quality assessment of knowledge-based software: some certification considerations
Inozemtseva Understanding the software fault introduction process
JPH02159637A (en) Network fault diagnostic data describing system
JPS6022211A (en) Fault diagnosing device
Holmström et al. Experimental evaluation of a diagnostic rule-based expert system for the nuclear industry
JPH03151730A (en) Network fault diagnostic system
CN117734793A (en) Fault detection method, fault detection device, electronic device and storage medium
CN117938708A (en) Vehicle-mounted network testing method and device, storage medium, processor and vehicle
CN115114156A (en) Method and system for analyzing process failure state and computer equipment
CN112799899A (en) Equivalent fault injection method based on extended correlation model