JP2005182647A

JP2005182647A - Abnormality detector for apparatus

Info

Publication number: JP2005182647A
Application number: JP2003425547A
Authority: JP
Inventors: Ikuo Terasawa; 生郎寺澤
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2003-12-22
Filing date: 2003-12-22
Publication date: 2005-07-07

Abstract

<P>PROBLEM TO BE SOLVED: To provide an abnormality detector for an apparatus which learns even abnormality patterns which are not preliminarily registered as abnormality detection conditions in real time and utilizes a data mining method to detect the abnormalities of the apparatus. <P>SOLUTION: In a learning part 21, new learning results are generated from apparatus operation information 11 received from an electronic apparatus 1 and past learning results 31, which are stored in a storage part 3. During operation, operation information 11 of the electronic apparatus 1 being an abnormality detection object and the learning results 31 are compared in an analysis section 22 to represent the degree of difference as a numerical value, and the numerical value is stored as analysis results 32 in the storage part 3. A detecting/informing part 23 refers to the analysis results 32 to perform comparison with a threshold. When data showing an abnormality of the electronic apparatus 1 are found as comparison results, an output part 4 is informed that the abnormality occurs in the electronic apparatus 1. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、予め異常を検知するための条件として登録されていない異常パターンに対しても、リアルタイムで学習し、データマイニングの手法を利用して機器の異常を検知する機器の異常検知装置に関する。 The present invention relates to a device abnormality detection apparatus that learns in real time even an abnormality pattern that is not registered as a condition for detecting an abnormality in advance and detects a device abnormality using a data mining technique.

近年、様々な電子機器や、これらを組み合わせたシステムの内部構成は、高度な処理を行うために複雑になっているが、その複雑さ故に、予期せぬトラブル、例えば停電等により、しばしば障害、故障が発生することがある。或いは、処理すべきデータの型がプログラムに適合しない、例えば、数値処理に対して、文字列が混在することにより処理不能になる、などの理由により、システムに障害が発生することもある。
また、ごく一部の障害が、電子機器やシステムの全体に、致命的な障害をもたらすこともある。
そのため、障害はできる限り早く検知しなければならない。そして、このような異常を検知するための装置は、現在の電子機器やシステムにおいては、必要不可欠なものとなっている。 In recent years, various electronic devices and the internal configuration of a system that combines these have become complicated in order to perform advanced processing, but due to its complexity, troubles often occur due to unexpected troubles such as power outages, Failure may occur. Alternatively, the system may fail because the type of data to be processed is not compatible with the program, for example, it becomes impossible to process due to a mixture of character strings for numerical processing.
In addition, a small number of failures may cause fatal failures to the entire electronic device or system.
Therefore, faults must be detected as soon as possible. And an apparatus for detecting such an abnormality is indispensable in current electronic devices and systems.

様々な異常に対する対策を施した従来の技術として、例えば、ログ情報の解析作業に要する時間を減少させ、種々のデータ形式とファイル・システム上の偏在性とを有するログ情報を統合し、既知の異常ではない、新たな異常を示すログ情報を抽出することができ、システム管理者が文字による膨大な量のログ情報の中から注目するべきログ情報を迅速に把握することが可能なログ情報解析装置等に関する発明がある。これにより、管理者は異常が発生した機器に対して、ログ情報を解析する際に、迅速なログ解析を行うことが可能となっている（特許文献１参照）。 As a conventional technique for taking measures against various abnormalities, for example, it reduces the time required for log information analysis work, integrates log information having various data formats and file system uneven distribution, Log information analysis that can extract log information indicating a new abnormality that is not abnormal, and allows the system administrator to quickly grasp the log information to be noticed from a huge amount of log information in characters There are inventions related to devices and the like. Thereby, the administrator can perform a quick log analysis when analyzing log information for a device in which an abnormality has occurred (see Patent Document 1).

また、例えば、消費者により入力されたデータの中から異常なローデータを検出する異常入力検出装置に関する発明がある。これにより、異常なデータを的確に排除することが可能となっている（特許文献２参照）。 Further, for example, there is an invention related to an abnormal input detection device that detects abnormal raw data from data input by a consumer. Thereby, it is possible to exclude abnormal data accurately (see Patent Document 2).

また、数多くの統計的データから外れ値／異常値を検出する技術として、ルール学習やニューラルネットワークを用いた教師（ラベル）付き学習が主流であったが、ラベル付きデータが入手できない場合には、ルール学習やニューロを用いたラベル付き学習は適応できないが、最近では、ラベル無しのデータでも外れ値／異常値を検出する技術が紹介されている（非特許文献１参照）。
特開２００１−３５６９３９号公報特開２００３−１３２０７５号公報山西健司，竹内純一，教師無しデータからの外れ値フィルタリングルールの発見，２００１年情報論的学習理論ワークショップ（２００１） Moreover, as a technique for detecting outliers / abnormal values from a large number of statistical data, rule learning or learning with a label using a neural network has been the mainstream, but if labeled data is not available, Although rule learning and labeled learning using a neuron cannot be applied, recently, a technique for detecting an outlier / abnormal value even with unlabeled data has been introduced (see Non-Patent Document 1).
JP 2001-356939 A JP 2003-132075 A Kenji Yamanishi, Junichi Takeuchi, Discovery of outlier filtering rules from unsupervised data, 2001 Information theory learning theory workshop (2001)

従来の機器異常を検知するための装置は、ログなどの機器から得られる情報に対して、検出するための条件を機器毎に登録しておき、機器から得られる情報が異常とする条件と一致することによって、異常の発生を判断していた。
しかしながら従来の技術では、予め異常とするパターンを作成して、機器に登録する必要があり、そのためには、異常となる状態を想定しておく必要がある。異常パターンを作成する作業は、作業者の高い熟練度を要するが、全てのパターンを網羅することが、現実的に困難である。従って、その想定から外れたものは、異常が発生したとしても検出することができなかった。 Conventional devices for detecting device abnormalities register conditions for detection for each device with respect to information obtained from devices such as logs, and the information that is obtained from devices matches the conditions for abnormalities. By doing so, the occurrence of abnormality was judged.
However, in the conventional technique, it is necessary to create a pattern to be abnormal in advance and register it in the device. For that purpose, it is necessary to assume a state in which an abnormality occurs. The operation of creating an abnormal pattern requires a high level of skill of the operator, but it is practically difficult to cover all patterns. Therefore, what deviated from the assumption could not be detected even if an abnormality occurred.

このような問題を回避するためには、異常となるパターンを設計する作業の際に、全ての場合を網羅する必要があるが、設計者にとって非常に負担が大きい作業である。
また機器の利用環境が変わり、機器から得られる情報に影響があるような場合には、パターンを新たに作成する必要があった。
そのため、機器異常を検知するための装置には、予め異常を検知するための条件を登録しなくても、機器で生成されたログを学習する機能を有することが望まれていた。 In order to avoid such a problem, it is necessary to cover all cases when designing an abnormal pattern, but this is a very heavy work for the designer.
In addition, when the usage environment of the device changes and the information obtained from the device is affected, it is necessary to create a new pattern.
Therefore, it has been desired that an apparatus for detecting a device abnormality has a function of learning a log generated by the device without registering conditions for detecting the abnormality in advance.

更に、従来の技術にもあるように、システム管理者は、出力された膨大なログファイルから特徴的な情報、例えば、機器の障害を示す情報を取得するために、データマイニングの手法等を用いてログ解析を行い、機器の異常等を把握していた。
しかしながら、ログファイルに記録されている情報は、正常動作の状態（正常動作の履歴）を示す情報が大部分であり、目的の情報（障害等を示す情報）を効率良く取得することが困難であった。
そのため、システム管理者に対して、全てのログ情報を提供するのではなく、データマイニングの手法を用いることによって特徴的な情報（機器の異常を示す情報等）のみを抽出し、管理者に提供（通知）する機能が望まれていた。 Further, as in the prior art, the system administrator uses a data mining method or the like to acquire characteristic information from the output log file, for example, information indicating a device failure. Log analysis was conducted to understand device abnormalities.
However, the information recorded in the log file is mostly information indicating the state of normal operation (normal operation history), and it is difficult to efficiently acquire the target information (information indicating a failure or the like). there were.
Therefore, not all log information is provided to the system administrator, but only characteristic information (information indicating device malfunction, etc.) is extracted and provided to the administrator by using a data mining technique. A function to (notify) was desired.

本発明は上記事情を鑑みてなされたものであり、予め異常を検知するための条件として登録されていない異常パターンに対しても、リアルタイムで学習し、更に、データマイニングの手法を利用して機器の異常を検知する機器の異常検知装置を提供することを目的とする。 The present invention has been made in view of the above circumstances, and learns in real time even an abnormal pattern that is not registered in advance as a condition for detecting an abnormality, and further uses a data mining technique to provide a device. An object of the present invention is to provide an apparatus for detecting an abnormality of an apparatus for detecting an abnormality of the apparatus.

前記課題を解決するために、請求項１記載の発明は、機器の運用状態を監視し、異常を検知する監視装置と、前記監視装置によって監視される機器とを含み構成されることを特徴とする機器の異常検知装置であって、前記監視装置は、前記機器の動作によって生成される運用状態を示す機器運用情報を受信し、前記機器運用情報を学習し、その内容を学習結果として記憶する学習手段と、前記機器運用情報と前記学習結果とを比較して解析し、その内容を解析結果として記憶する解析手段と、前記解析結果から前記機器の異常を検知する異常検知手段と、前記機器の異常を検知した旨を出力する出力手段とを有することを特徴とする。 In order to solve the above-mentioned problem, the invention described in claim 1 includes a monitoring device that monitors an operation state of a device and detects an abnormality, and a device monitored by the monitoring device. An apparatus for detecting an abnormality of a device, wherein the monitoring device receives device operation information indicating an operation state generated by operation of the device, learns the device operation information, and stores the contents as a learning result A learning means; an analysis means for comparing and analyzing the device operation information and the learning result; storing the contents as an analysis result; an abnormality detecting means for detecting an abnormality of the device from the analysis result; Output means for outputting the fact that the abnormality is detected.

請求項２記載の発明は、前記機器は、自身の動作によって生成される運用状態を示す機器運用情報を採取し、前記監視装置に送信する情報採取送信手段を有することを特徴とする。 The invention described in claim 2 is characterized in that the device has information collection and transmission means for collecting device operation information indicating an operation state generated by its own operation and transmitting the device operation information to the monitoring device.

請求項３記載の発明は、前記情報採取送信手段は、前記機器運用情報の中から、情報採取用設定ファイルで指定された情報のみを採取する情報採取手段を有することを特徴とする。 The invention described in claim 3 is characterized in that the information collection / transmission means includes information collection means for collecting only the information specified in the information collection setting file from the device operation information.

請求項４記載の発明は、前記監視装置は、前記情報採取用設定ファイルの内容を変更する第１の内容変更手段を有することを特徴とする。 The invention described in claim 4 is characterized in that the monitoring device has a first content changing means for changing the content of the information collection setting file.

請求項５記載の発明は、前記第１の内容変更手段を、外部から指示する第１の指示手段を有することを特徴とする。 The invention described in claim 5 is characterized in that the first content changing means has first instruction means for instructing from the outside.

請求項６記載の発明は、前記学習手段は、設定ファイルで指定された項目について学習し、学習結果として記憶する第２の学習手段を有することを特徴とする。 The invention according to claim 6 is characterized in that the learning means has second learning means for learning about an item specified in the setting file and storing it as a learning result.

請求項７記載の発明は、前記解析手段は、設定ファイルで指定された項目について解析し、解析結果として記憶する第２の解析手段を有することを特徴とする。 The invention described in claim 7 is characterized in that the analysis means has second analysis means for analyzing items specified in the setting file and storing them as analysis results.

請求項８記載の発明は、前記異常検知手段は、前記解析結果の値と、設定ファイルで指定された閾値とを比較することにより、異常であるか否かを判定する判定手段を有することを特徴とする。 The invention according to claim 8 is characterized in that the abnormality detection means has a determination means for determining whether or not there is an abnormality by comparing the value of the analysis result with a threshold value specified in a setting file. Features.

請求項９記載の発明は、前記監視装置は、前記設定ファイルの内容を変更する第２の内容変更手段を有することを特徴とする。 The invention according to claim 9 is characterized in that the monitoring device has a second content changing means for changing the content of the setting file.

請求項１０記載の発明は、前記第２の内容変更手段を、外部から指示する第２の指示手段を有することを特徴とする。 A tenth aspect of the invention is characterized in that the second content changing means includes second instruction means for instructing from the outside.

本発明によれば、機器の運用状態を監視し、異常を検知する監視装置と、前記監視装置によって監視される機器とを含み構成されることを特徴とする機器の異常検知装置であって、前記監視装置は、前記機器の動作によって生成される運用状態を示す機器運用情報を受信し、前記機器運用情報を学習し、その内容を学習結果として記憶する学習手段と、前記機器運用情報と前記学習結果とを比較して解析し、その内容を解析結果として記憶する解析手段と、前記解析結果から前記機器の異常を検知する異常検知手段と、前記機器の異常を検知した旨を出力する出力手段とを有することにより、予め異常を検知するための条件を登録すること無しに、データマイニングの手法を利用して機器の異常を検知することが可能となる。 According to the present invention, there is provided a device abnormality detection device characterized by including a monitoring device for monitoring an operation state of the device and detecting an abnormality, and a device monitored by the monitoring device, The monitoring device receives device operation information indicating an operation state generated by the operation of the device, learns the device operation information, and stores the contents as a learning result; the device operation information; An analysis unit that compares and analyzes the learning result and stores the content as an analysis result, an abnormality detection unit that detects an abnormality of the device from the analysis result, and an output that indicates that the abnormality of the device is detected By using the data mining technique, it is possible to detect an abnormality of the device without registering a condition for detecting the abnormality in advance.

近年、様々な電子機器や、これらを組み合わせたシステムの内部構成は、高度な処理を行うために複雑になっている。また、システムに用いられる機器も多種に渡るため、以下の実施例で示すように、サーバコンピュータのような汎用的な情報処理装置を監視装置として用いて、更に、システム管理者がサーバにアクセスするためのパーソナルコンピュータ等の情報処理装置を用いて、電子機器を監視するのが最も好ましい実施の形態である。 In recent years, various electronic devices and the internal configuration of a system combining these have become complicated in order to perform advanced processing. In addition, since there are a wide variety of devices used in the system, as shown in the following embodiments, a general-purpose information processing device such as a server computer is used as a monitoring device, and the system administrator accesses the server. It is the most preferred embodiment to monitor an electronic device using an information processing apparatus such as a personal computer.

次に、添付図面を参照しながら、本発明の実施形態を説明する。
図１は、異常検知装置の概要を示す図である。
異常検知装置は、異常検知の対象となる電子機器１と、前記電子機器１の異常を検知する検知部２と、情報を記憶する記憶部３と、検知結果を表示するためのディスプレイなどの出力部４を含み構成される。
異常検知の対象となる電子機器１は、自身の運用結果のログなどの機器運用情報１１をファイル等の形式で有する。
記憶部３には、今までの電子機器１の運用情報１１を蓄積した学習結果３１と、現時点での電子機器１の運用情報１１を解析した解析結果３２がある。 Next, embodiments of the present invention will be described with reference to the accompanying drawings.
FIG. 1 is a diagram showing an outline of the abnormality detection device.
The abnormality detection device includes an electronic device 1 that is an abnormality detection target, a detection unit 2 that detects an abnormality of the electronic device 1, a storage unit 3 that stores information, and an output such as a display for displaying a detection result. The unit 4 is configured.
The electronic device 1 that is the target of abnormality detection has device operation information 11 such as a log of its operation results in the form of a file or the like.
The storage unit 3 has a learning result 31 in which the operation information 11 of the electronic device 1 so far is accumulated and an analysis result 32 in which the operation information 11 of the electronic device 1 at the present time is analyzed.

検知部２の学習部２１では、機器運用情報１１を学習結果３１に反映させるために、機器運用情報１１と学習結果３１を読み込んで処理し、その結果を記憶部３の学習結果３１に戻す。
解析部２２では、機器運用情報１１と学習結果３１を比較して解析し、その結果を記憶装置３の解析結果３２に格納する。
検知・通知部２３では、解析結果３２を検索し、異常を検知した場合はその旨を出力部４に出力することにより管理者に通知する。 The learning unit 21 of the detection unit 2 reads and processes the device operation information 11 and the learning result 31 in order to reflect the device operation information 11 in the learning result 31, and returns the result to the learning result 31 of the storage unit 3.
In the analysis unit 22, the device operation information 11 and the learning result 31 are compared and analyzed, and the result is stored in the analysis result 32 of the storage device 3.
The detection / notification unit 23 searches the analysis result 32 and, when an abnormality is detected, notifies the administrator by outputting the fact to the output unit 4.

図２は、異常検知装置の動作の流れについて説明したフローチャートである。
異常検知の監視対象となる電子機器１は、運用中に運用ログやセキュリティログなどの機器運用情報１１を出力する。更に、学習部２１と解析部２２は、予め設定された時間に動作を行う（ステップＳ１）。
学習部２１は、学習結果３１から得られる既存の統計モデルＡ１を基にして、機器運用情報１１から得られる解析対象データＡ２の内容を追加した、新たな確率分布の計算処理を行う。計算した結果である新たな統計モデルＡ３は、新しい学習結果として学習結果３１を置き換える（ステップＳ２）。 FIG. 2 is a flowchart illustrating the operation flow of the abnormality detection device.
The electronic device 1 to be monitored for abnormality detection outputs device operation information 11 such as an operation log and a security log during operation. Furthermore, the learning unit 21 and the analysis unit 22 operate at a preset time (step S1).
Based on the existing statistical model A1 obtained from the learning result 31, the learning unit 21 performs a new probability distribution calculation process to which the content of the analysis target data A2 obtained from the device operation information 11 is added. The new statistical model A3, which is the calculated result, replaces the learning result 31 as a new learning result (step S2).

次に解析部２２は、学習結果３１から得られる新たな統計モデルＡ３を基にして、情報１１から得られる解析対象データＡ２を構成する各要素について、スコアを計算する。
ここで、スコアとは、基となる統計モデルと、解析対象のデータの中の評価対象の要素を加えて得られる統計モデルとの統計的距離を数値で表したものである（詳しくは、非特許文献１を参照のこと）。
従って、スコアの数値が大きければ大きいほど、その内容は統計モデルＡ３との統計的外れ度合い（揺らぎ）が大きいということになる。ステップＳ４で得られた、解析対象データＡ２を構成する各要素に対するスコアＡ４を解析結果３２として出力する（ステップＳ４）。 Next, the analysis unit 22 calculates a score for each element constituting the analysis target data A2 obtained from the information 11, based on the new statistical model A3 obtained from the learning result 31.
Here, the score is a numerical value representing the statistical distance between the statistical model that is the basis and the statistical model that is obtained by adding the elements to be evaluated in the data to be analyzed. (See Patent Document 1).
Therefore, the larger the numerical value of the score, the greater the degree of statistical deviation (fluctuation) of the content from the statistical model A3. The score A4 for each element constituting the analysis target data A2 obtained in step S4 is output as the analysis result 32 (step S4).

検知・通知部は解析結果３２に納められたスコアの中で、予め設定された閾値以上の値のものがあるかどうか検索する（ステップＳ５）。
予め設定された閾値以上のスコアが存在する場合（ステップＳ５／Ｙｅｓ）は、機器に異常が発生した場合であると判断し、電子メール等予め決められている手順で管理者に通知する（ステップＳ６）。 The detection / notification unit searches the score stored in the analysis result 32 for a value greater than or equal to a preset threshold value (step S5).
If there is a score equal to or higher than a preset threshold value (step S5 / Yes), it is determined that an abnormality has occurred in the device, and the administrator is notified in accordance with a predetermined procedure such as e-mail (step). S6).

図３は、異常検知装置のより詳細な構成を示した図である。
異常検知装置は、本図に示すように、監視対象となる電子機器１、監視装置２００、管理者用端末３００から構成される。
監視対象機器１において、情報採取処理部１１０では、機器で生成されるアクセスログ、ＳＹＳＬＯＧ、イベントログ等の各種ログを採取して、それらをＣＳＶデータに変換し、それを監視装置２００にネットワークを介してＦＴＰで送信し、監視装置２００上の抽出済情報ファイルに追加する。
なお、情報採取用設定ファイル１２０には、情報採取処理部１１０の動作で必要となるパラメータ等が記述されている。
例えば、送信先の監視装置の情報や、アクセスログ、ＳＹＳＬＯＧ、イベントログ等の各種ログ情報の中で、どのログ情報を採取するか、或いは、前記電子機器１を構成するどの内部ユニットについてログ情報を採取するか、などの情報採取処理部の動作で必要となる様々なパラメータが記述されている。 FIG. 3 is a diagram showing a more detailed configuration of the abnormality detection device.
As shown in the figure, the abnormality detection device includes an electronic device 1 to be monitored, a monitoring device 200, and an administrator terminal 300.
In the monitoring target device 1, the information collection processing unit 110 collects various logs such as an access log, SYSLOG, and event log generated by the device, converts them into CSV data, and connects the network to the monitoring device 200. And is added to the extracted information file on the monitoring apparatus 200.
The information collection setting file 120 describes parameters and the like necessary for the operation of the information collection processing unit 110.
For example, which log information is collected from various types of log information such as information on a transmission destination monitoring device, access log, SYSLOG, and event log, or log information about which internal unit constituting the electronic device 1 Various parameters necessary for the operation of the information collection processing unit, such as whether to collect data, are described.

監視装置２００において、情報加工処理部２１０では、監視対象の電子機器１上にある情報採取処理部２１０から受け取った抽出済情報ファイルのＣＳＶデータの中から、設定ファイル２８０にて桁位置で指定されている時間属性の項目について、学習解析がしやすいように秒に換算して数値化し、新たにＣＳＶデータを作成し、加工済情報ファイルに追加する。
ここで、「秒換算して数値化する」とは、ある基準時間からどの程度時間が経過したかを、秒で示すことである。例えば、基準時間を平成１５年１２月１日の０時００分００秒とすれば、平成１５年１２月１日の０時１０分１０秒という時間は、６１０と数値化される。
ただし、「秒換算」に限らず、より短い時間単位、例えば、１／１０秒単位、１／１００秒単位、ミリ秒（１／１０００秒）単位、などで数値化するという方法もある。 In the monitoring device 200, the information processing unit 210 is designated at the digit position in the setting file 280 from the CSV data of the extracted information file received from the information collection processing unit 210 on the electronic device 1 to be monitored. The time attribute item is converted into seconds for easy learning analysis, digitized, newly created CSV data, and added to the processed information file.
Here, “converting to a numerical value in terms of seconds” means indicating how much time has elapsed from a certain reference time in seconds. For example, if the reference time is 00:00:00 on December 1, 2003, the time of 0:10:10 on December 1, 2003 is quantified as 610.
However, the method is not limited to “second conversion”, and there is a method of quantifying in a shorter time unit, for example, 1/10 second unit, 1/100 second unit, millisecond (1/1000 second) unit, or the like.

学習処理部２２０では、統計ファイルを更新するために参照し、更に、前記加工済情報ファイルのＣＳＶデータの全項目の中から、設定ファイル２８０で指定された項目について、今までの統計ファイルに情報を追加して、新たな統計ファイルとして出力する。
解析処理部２３０では、学習結果と、設定ファイル２８０で指定された分析対象ＣＳＶデータの中の項目について、統計ファイルとの統計的距離を計算して、その異常度合いを数値化して解析結果ファイルに出力する。 The learning processing unit 220 refers to update the statistical file, and further, information on the items specified in the setting file 280 from all items of the CSV data of the processed information file is stored in the statistical file so far. And output as a new statistics file.
The analysis processing unit 230 calculates the statistical distance between the learning result and the items in the analysis target CSV data specified in the setting file 280 from the statistical file, quantifies the degree of abnormality, and generates the analysis result file. Output.

ここで、「設定ファイル２８０で指定された項目」とは、電子機器１の様々な動作に関わる項目を意味する。例えば、電子機器１として、高速データ通信を実現するデータ通信機器であれば、膨大なログ情報の中から、データのトラフィックに関する情報を取得するのが好ましいので、「トラフィック情報について処理する」旨の設定を行えばよい。
より具体的には、データのトラフィックを左右する要因として、回線スピード、データ通信待ちの状態であるデータが一時的に格納されるバッファ領域の状態などの複数の情報についてデータを取得すれば、データマイニングの手法を用いることにより、解析、異常検知することが可能である。 Here, “items specified in the setting file 280” mean items related to various operations of the electronic device 1. For example, if the electronic device 1 is a data communication device that realizes high-speed data communication, it is preferable to acquire information relating to data traffic from a large amount of log information. You only have to set it.
More specifically, if the data is acquired for multiple information such as the line speed and the buffer area status where data that is waiting for data communication is temporarily stored as factors that influence data traffic, By using a mining technique, analysis and abnormality detection can be performed.

或いは、電子機器の全体を制御するＣＰＵの監視をする場合は、処理能力（１秒あたりの処理スピード）や、発熱による温度上昇及び冷却による温度低下などの温度情報を取得するのが好ましいので、「処理能力について処理する」、或いは「温度情報について処理する」旨の設定を行えばよい。
或いは、「全てのログ情報について処理する」というような設定を行い、総合的に解析、異常検知することも可能である。 Alternatively, when monitoring the CPU that controls the entire electronic device, it is preferable to acquire temperature information such as processing capacity (processing speed per second) and temperature rise due to heat generation and temperature drop due to cooling. It is only necessary to make a setting of “processing for processing capacity” or “processing for temperature information”.
Alternatively, a setting such as “process all log information” can be performed to comprehensively analyze and detect an abnormality.

検知処理部２４０では、設定ファイル２８０から閾値を読み込み、解析結果ファイルの中の異常度合いを表した数値が閾値以上になっているものを検索し、検索の結果を検知結果ファイルとして出力する。
通知処理部２５０では、検知結果ファイルの中のデータをｅ−ＭａｉｌやＳＮＭＰ（Simple Network Management Protocol）を使用して、管理者端末３００に通知する。 The detection processing unit 240 reads a threshold value from the setting file 280, searches the analysis result file for a numerical value representing the degree of abnormality being equal to or greater than the threshold value, and outputs the search result as a detection result file.
The notification processing unit 250 notifies the administrator terminal 300 of data in the detection result file using e-Mail or SNMP (Simple Network Management Protocol).

また、前記情報加工処理部２１０、学習処理部２２０、解析処理部２３０、検知処理部２４０、通知処理部２５０を、一連の流れで自動的に実行するようにするために、実行制御部２６０で管理する。
また、管理機能２７０は、設定ファイル２８０を編集する機能をＷｅｂサーバ上で動作する機能として提供する。そのため、監視装置２００は、サーバコンピュータ等の汎用性の高い情報処理装置を用いるのが一般的である。 In addition, in order to automatically execute the information processing processing unit 210, the learning processing unit 220, the analysis processing unit 230, the detection processing unit 240, and the notification processing unit 250 in a series of flows, the execution control unit 260 to manage.
The management function 270 also provides a function for editing the setting file 280 as a function that operates on the Web server. Therefore, the monitoring device 200 generally uses a highly versatile information processing device such as a server computer.

管理者用端末３００では、Ｗｅｂブラウザ３１０から管理機能に接続して、監視装置２００上の設定ファイル２８０と、監視対象の電子機器１上の情報採取用設定ファイル１２０を編集することが可能である。
メーラ３２０は、監視装置２００の通知処理部２５０からの通知のメールを受信する。
なお、管理者用端末３００としては、パーソナルコンピュータ等の汎用性の高い情報処理装置を用いるのが一般的である。 The administrator terminal 300 can connect to the management function from the Web browser 310 and edit the setting file 280 on the monitoring device 200 and the information collection setting file 120 on the electronic device 1 to be monitored. .
The mailer 320 receives the notification mail from the notification processing unit 250 of the monitoring device 200.
As the administrator terminal 300, a highly versatile information processing apparatus such as a personal computer is generally used.

また、監視装置からの異常検知を通知する手段、或いは通知された内容を出力する手段は、ｅ−ＭａｉｌやＳＮＭＰに限らず、専用のプログラムを用いて、異常を検知する度に監視装置から管理者用端末に対して、画面上でポップアップ通知したり、または、警告音のような音声を発することにより通知するというような手段も考えられる。 The means for notifying the abnormality detection from the monitoring apparatus or the means for outputting the notified contents is not limited to e-Mail or SNMP, and is managed from the monitoring apparatus every time an abnormality is detected using a dedicated program. It is also conceivable to notify the user terminal by giving a pop-up notification on the screen or by issuing a sound such as a warning sound.

また、上記に示した実施形態は、本発明の好適な実施の一例である。しかしながら、本発明の実施形態は、これらに限定されるものではなく、本発明の要旨を逸脱しない範囲内で、様々な変形実施が可能である。
例えば、本実施例では、監視対象の電子機器１上に、機器運用情報１１を採取し、監視装置２００に送信する手段を設けているが、監視装置２００から電子機器１上の機器運用情報１１を参照し、取得する（ダウンロードする）ような手段でも構わない。 Moreover, embodiment shown above is an example of suitable implementation of this invention. However, the embodiments of the present invention are not limited to these, and various modifications can be made without departing from the scope of the present invention.
For example, in this embodiment, there is provided means for collecting device operation information 11 on the electronic device 1 to be monitored and transmitting it to the monitoring device 200. However, the device operation information 11 on the electronic device 1 from the monitoring device 200 is provided. It is possible to refer to and acquire (download).

（効果）
以上の説明から明らかなように、機器の運用状態を監視し、異常を検知する監視装置と、前記監視装置によって監視される機器とを含み構成されることを特徴とする機器の異常検知装置であって、前記監視装置は、前記機器の動作によって生成される運用状態を示す機器運用情報を受信し、前記機器運用情報を学習し、その内容を学習結果として記憶する学習手段と、前記機器運用情報と前記学習結果とを比較して解析し、その内容を解析結果として記憶する解析手段と、前記解析結果から前記機器の異常を検知する異常検知手段と、前記機器の異常を検知した旨を出力する出力手段とを有することにより、予め異常を検知するための条件を登録すること無しに、データマイニングの手法を利用して機器の異常を検知し、管理者はモニタ等の出力手段を通じて機器の異常を把握することが可能となる。 (effect)
As is apparent from the above description, an apparatus abnormality detection apparatus comprising: a monitoring apparatus that monitors an operation state of an apparatus and detects an abnormality; and an apparatus monitored by the monitoring apparatus. The monitoring device receives device operation information indicating an operation state generated by the operation of the device, learns the device operation information, and stores the contents as a learning result, and the device operation Information and the learning result are compared and analyzed, an analysis means for storing the contents as an analysis result, an abnormality detection means for detecting an abnormality of the device from the analysis result, and the fact that the abnormality of the device is detected By having an output means for outputting, without registering conditions for detecting an abnormality in advance, an apparatus abnormality is detected using a data mining technique. It is possible to grasp an abnormality of the device through the power unit.

また、機器運用情報を採取し、送信する際に、前記機器運用情報の中から、情報採取用設定ファイルで指定された情報のみを採取する手段を有することにより、必要な情報のみを採取し、効率よく解析し、異常検知をすることが可能となる。 In addition, when collecting and transmitting device operation information, only necessary information is collected from the device operation information by having means for collecting only the information specified in the information collection setting file, It is possible to analyze efficiently and detect anomalies.

また、前記情報採取用設定ファイルの内容を変更する際に、外部から指示することにより、システム管理者が、必要に応じてその設定を変更することが可能となる。 In addition, when changing the contents of the information collection setting file, the system administrator can change the settings as necessary by giving an external instruction.

また、設定ファイルで指定された項目について学習し、学習結果として記憶するので、効率のよい学習が可能となり、更に、効率のよい解析処理、異常検知処理を行うことが可能となる。 Further, since the items specified in the setting file are learned and stored as learning results, efficient learning is possible, and more efficient analysis processing and abnormality detection processing can be performed.

また、設定ファイルで指定された項目について解析し、解析結果として記憶するので、効率のよい解析が可能となり、更に、効率のよい異常検知を行うことが可能となる。 Further, since the items specified in the setting file are analyzed and stored as analysis results, efficient analysis can be performed, and furthermore, efficient abnormality detection can be performed.

また、前記設定ファイルの内容を変更する際に、外部から指示することにより、システム管理者が、必要に応じてその設定を変更することが可能となる。
例えば、閾値を大きくすることにより、異常検知の感度を大きくすることが可能であり、逆に、閾値を小さくすれば、異常検知の感度を小さくすることが可能である。
このように、システム管理者が状況に応じて異常検知装置の動作設定を変更することが可能となる。 In addition, when changing the contents of the setting file, the system administrator can change the settings as necessary by giving an external instruction.
For example, it is possible to increase the sensitivity of abnormality detection by increasing the threshold value. Conversely, if the threshold value is decreased, the sensitivity of abnormality detection can be decreased.
In this way, the system administrator can change the operation setting of the abnormality detection device according to the situation.

また、従来の技術では、電子機器の製造者が異常の状態を想定（エラーの設計）していたが、本発明により、異常の状態を想定する作業（エラーの設計作業）が不要となる。 In the conventional technique, the manufacturer of the electronic device assumes an abnormal state (error design). However, according to the present invention, an operation that assumes an abnormal state (error design work) becomes unnecessary.

異常検知装置の概要を示す図である。It is a figure which shows the outline | summary of an abnormality detection apparatus. 異常検知装置の動作の流れについて説明したフローチャートである。It is the flowchart explaining the flow of operation | movement of the abnormality detection apparatus. 図３は、異常検知装置の詳細な構成を示した図である。FIG. 3 is a diagram showing a detailed configuration of the abnormality detection device.

Explanation of symbols

１異常検知対象機器（電子機器）
２検知部
３記憶部
４出力部
１１機器運用情報
２１学習部
２２解析部
２３検知・通知部
３１学習結果
３２解析結果
１１０情報採取処理部
１２０情報採取用設定ファイル
２００監視装置
２１０情報加工処理部
２２０学習処理部
２３０解析処理部
２４０検知処理部
２５０通知処理部
２６０実行制御部
２７０管理機能
２８０設定ファイル
３００管理者用端末
３１０Ｗｅｂブラウザ
３２０メーラ
Ａ１既存の統計モデル
Ａ２解析対象データ
Ａ３新たな統計モデル
Ａ４スコアデータ 1 Anomaly detection target equipment (electronic equipment)
2 Detection Unit 3 Storage Unit 4 Output Unit 11 Device Operation Information 21 Learning Unit 22 Analysis Unit 23 Detection / Notification Unit 31 Learning Result 32 Analysis Result 110 Information Collection Processing Unit 120 Information Collection Setting File 200 Monitoring Device 210 Information Processing Processing Unit 220 Learning processing unit 230 Analysis processing unit 240 Detection processing unit 250 Notification processing unit 260 Execution control unit 270 Management function 280 Setting file 300 Administrator terminal 310 Web browser 320 Mailer A1 Existing statistical model A2 Analysis target data A3 New statistical model A4 Score data

Claims

A monitoring device that monitors the operational status of the device and detects anomalies;
An apparatus for detecting an abnormality of an apparatus, comprising an apparatus monitored by the monitoring apparatus,
The monitoring device receives device operation information indicating an operation state generated by the operation of the device, learns the device operation information, and stores the contents as a learning result;
Analyzing and analyzing the device operation information and the learning result, and storing the contents as an analysis result;
An abnormality detection means for detecting an abnormality of the device from the analysis result;
An apparatus abnormality detection apparatus comprising: output means for outputting a notification that an abnormality of the apparatus has been detected.

2. The apparatus abnormality detection device according to claim 1, further comprising information collection and transmission means for collecting device operation information indicating an operation state generated by its own operation and transmitting the device operation information to the monitoring device.

3. The apparatus abnormality detection device according to claim 2, wherein the information collection and transmission unit includes an information collection unit that collects only the information specified in the information collection setting file from the device operation information.

4. The apparatus abnormality detection device according to claim 3, wherein the monitoring device includes first content changing means for changing the content of the information collection setting file.

The apparatus according to claim 4, further comprising a first instruction unit that instructs the first content changing unit from outside.

6. The device abnormality detection according to claim 1, wherein the learning unit includes a second learning unit that learns about an item specified in a setting file and stores the learning result as a learning result. apparatus.

The apparatus according to claim 1, wherein the analysis unit includes a second analysis unit that analyzes an item specified in a setting file and stores the result as an analysis result. apparatus.

The abnormality detection unit includes a determination unit that determines whether or not there is an abnormality by comparing the value of the analysis result with a threshold value specified in a setting file. The apparatus abnormality detection device according to any one of the above.

9. The apparatus abnormality detection device according to claim 6, wherein the monitoring device includes a second content changing unit that changes the content of the setting file.

10. The apparatus abnormality detection device according to claim 9, further comprising second instruction means for instructing the second content changing means from outside.