JP2011192097A

JP2011192097A - Failure detection method and information processing system using the same

Info

Publication number: JP2011192097A
Application number: JP2010058618A
Authority: JP
Inventors: Tatsuya Kameyama; 達也亀山; Mitsuhiro Imai; 光洋今井; Junichi Kimura; 淳一木村
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2010-03-16
Filing date: 2010-03-16
Publication date: 2011-09-29

Abstract

<P>PROBLEM TO BE SOLVED: To provide a failure detection method for reducing false failure determination, in failure determination using a data mining method. <P>SOLUTION: The system includes a first learning means and a second learning means. The first learning means receives operation information indicating a feature of device operating state from a plurality of devices having different operation structures, performs learning by use of a statistic means from the operation information, and updates and stores learning data. The second learning means updates and stores a threshold for each operation structure from operation structures identifying the devices of different configurations and operation information obtained from devices of the same operation structure in the first learning means. A failure level analysis means calculates a level of failure from the received operation information and the learning data stored by the first learning means, and a failure determination means compares the level of failure with the threshold corresponding to the operation configuration of the device of the received operation information, the threshold being stored by the second learning means, and determines whether the level of failure is an abnormal value or not. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、特に機器のハードウェア構成の追加・削除・更新などによる変更や、機器上で実行されるソフトウェアの追加・削除・更新などによる変更により、機器を構成する稼働構成の変化や、新たな稼働構成の機器の追加に対して、異常の検知を可能とする異常検知方法およびそれを用いた情報処理システムに関する。 In particular, the present invention can be applied to changes in the operating configuration of a device, new changes due to changes due to addition / deletion / update of the hardware configuration of the device, and changes due to addition / deletion / update of software executed on the device. The present invention relates to an abnormality detection method capable of detecting an abnormality with respect to addition of a device having a different operation configuration and an information processing system using the abnormality detection method.

従来、統計的手法を用いた学習により統計モデルを作成し、稼働情報と統計モデルから統計的距離を計算して異常度を求め、設定ファイルで指定したしきい値を使って異常度が異常であるか否かを判断する方法があった（例えば、特許文献１参照）。 Conventionally, a statistical model is created by learning using statistical methods, the statistical distance is calculated from the operation information and the statistical model, the degree of abnormality is obtained, and the degree of abnormality is abnormal using the threshold value specified in the configuration file. There was a method of determining whether or not there is (see, for example, Patent Document 1).

また、従来、機器の種類毎に異常を判定するしきい値を予め設定して、稼働情報が異常か否かを判断する方法があった（例えば、特許文献２参照）。 Conventionally, there has been a method of setting a threshold value for determining an abnormality for each type of device in advance and determining whether or not the operation information is abnormal (see, for example, Patent Document 2).

さらに、従来、異常を検出する正常動作モデルの動的な学習方法があった（例えば、特許文献３参照）。 Furthermore, conventionally, there has been a dynamic learning method of a normal operation model for detecting an abnormality (see, for example, Patent Document 3).

特開２００５−１８２６４７号公報JP 2005-182647 A 特開２０００−１８１７６１号公報JP 2000-181761 A 特開２００８−１２９７１４号公報JP 2008-129714 A

近年、ソフトウェアにより多様な機能を実現するマイクロプロセッサの登場により、マイクロプロセッサが工業、民生を問わずほとんどの機器に搭載されている。マイクロプロセッサの性能向上にともない、実行可能なソフトウェアの規模が大きくなり、より多機能な機器が実現できるようになっている。 In recent years, with the advent of microprocessors that realize various functions by software, microprocessors are installed in almost all devices regardless of industry or consumer. As the performance of microprocessors has improved, the scale of executable software has increased and more multifunctional devices can be realized.

さらに、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）の搭載によりマルチタスク、マルチスレッドによるアプリケーションの実現が可能となり、多種多様なアプリケーションがＯＳ管理下で同時に実行できるようになっている。 Furthermore, by installing an OS (Operating System), it is possible to realize multitasking and multithreaded applications, and various applications can be executed simultaneously under OS management.

例えば、電話・データ通信・ストリーミング放送の統合したマルチメディアサービスを実現するＮＧＮ（ＮｅｘｔＧｅｎｅｒａｔｉｏｎＮｅｔｗｏｒｋ、次世代通信網）上で、加入者宅に設置されるＨＧＷ（ＨｏｍｅＧａｔｅｗａｙ）は、ＯＳＧｉ（ＯｐｅｎＳｅｒｖｉｃｅｓＧａｔｅｗａｙｉｎｉｔｉａｔｉｖｅ）フレームワーク技術を利用することで遠隔からアプリケーションの、インストール・起動・停止・アンインストールが可能となり、多様なサービスが実現できる。 For example, an HGW (Home Gateway) installed at a subscriber's home on an NGN (Next Generation Network) that realizes an integrated multimedia service of telephone, data communication, and streaming broadcasting is an OSGi (Open Services). Application (Gateway initiative) framework technology can be used to remotely install, start, stop, and uninstall applications, thereby realizing various services.

このような機器において、ソフトウェアの高機能化の反面、複数のソフトウェアの動作に起因する異常が問題となっている。ソフトウェアの高機能化大規模化に伴い、ユーザ環境の多様化、他社ソフトウェアとの相性、さらに、年々新しくソフトウェアをクラッシュさせる悪意のあるウィルスソフトウェアなど、予想が困難な異常の発生が益々高くなっている。このような背景において、機器の稼働中の異常（障害）を検出することが重要になっている。 In such a device, although software functions are enhanced, anomalies caused by the operation of a plurality of softwares are problematic. As software becomes more sophisticated and larger, the occurrence of abnormalities that are difficult to predict, such as diversified user environments, compatibility with other companies' software, and malicious virus software that crashes new software year by year, is increasing. Yes. In such a background, it is important to detect an abnormality (failure) during operation of the device.

例えば特許文献１において、学習部は、電子機器から受信した機器運用情報を既存の統計モデルを基にして確率分布計算処理を行い新たな統計モデルを作成し、解析部は、機器運用情報と統計モデルから統計的距離を計算して解析結果として各データのスコアを出力し、検知・通知部は、設定ファイルに予め設定されたしきい値以上のスコアが存在するか判断し、設定されたしきい値以上であれば異常があるとして管理者に電子メールで通知するようになっている。 For example, in Patent Document 1, a learning unit creates a new statistical model by performing a probability distribution calculation process on the basis of an existing statistical model for device operation information received from an electronic device. The statistical distance is calculated from the model and the score of each data is output as the analysis result. The detection / notification unit determines whether there is a score higher than the preset threshold in the setting file. If the threshold is exceeded, the administrator is notified by e-mail that there is an abnormality.

例えば特許文献２において、端末監視装置は、端末毎の障害監視情報を読み込み、端末の種類に応じたしきい値を用いて、障害監視情報としきい値を比較することにより障害の予測および検出を行うようになっている。 For example, in Patent Document 2, the terminal monitoring apparatus reads failure monitoring information for each terminal, and uses a threshold value according to the type of the terminal to compare the failure monitoring information with the threshold value, thereby predicting and detecting a failure. To do.

例えば特許文献３において、異常検知装置は、予め静的な解析に基づいて得た解析正常動作モデルを事前に用意し、解析正常動作モデルを用いて異常検知を行い。異常検知の結果を用い、学習に基づく正常動作モデルを用いた異常検知の判定結果を照合させながら正常動作モデルを学習するようになっている。 For example, in Patent Document 3, the abnormality detection device prepares in advance an analysis normal operation model obtained based on static analysis in advance, and performs abnormality detection using the analysis normal operation model. Using the result of abnormality detection, the normal operation model is learned while collating the determination result of abnormality detection using the normal operation model based on learning.

ＯＳＧｉフレームワークを導入したＨＧＷシステムのような環境では、システムの運用中に新たな機器の追加、機器のハードウェア構成の変更、実行されるアプリケーションの変更など稼働構成が変更される。特定の稼働構成における複数のソフトウェアの動作に関連する異常を予め予想するには、事前に同一の稼働構成において全てのソフトウェアの組合せにおける異常を調べる必要がある。しかしながら、例えば他社が作成したソフトウェアとの組合せにおける異常を全て調べる事は困難である。このため、機器におけるソフトウェアの実行に伴う機器の稼働中の状態の特徴を表す稼働情報から、統計的手法を用いた学習により統計モデルを作成し、稼働情報と統計モデルから統計的距離を計算して異常度を求め、異常度が異常か否かを判断する必要がある。しかしながら、全く新しい稼働構成の機器の稼働情報では、新しい構成の機器の稼働情報を使った学習がされていないため、誤った統計モデルを使って異常度を求め、異常度が異常か否かを誤って判断してしまう。 In an environment such as an HGW system in which the OSGi framework is introduced, an operation configuration such as addition of a new device, change of a hardware configuration of a device, or change of an application to be executed is changed during the operation of the system. In order to predict in advance an abnormality related to the operation of a plurality of software in a specific operation configuration, it is necessary to check in advance an abnormality in a combination of all software in the same operation configuration. However, it is difficult to examine all abnormalities in combination with software created by other companies, for example. For this reason, a statistical model is created by learning using statistical methods from the operational information that represents the characteristics of the operational state of the device associated with the execution of software on the device, and the statistical distance is calculated from the operational information and the statistical model. Therefore, it is necessary to determine the degree of abnormality and determine whether the degree of abnormality is abnormal. However, since the operation information of a device with a completely new operation configuration is not learned using the operation information of a device with a new configuration, the degree of abnormality is obtained using an incorrect statistical model, and whether or not the abnormality degree is abnormal is determined. Make a mistake.

これに対して、特許文献１では、統計的手法を用いた学習により統計モデルを作成し、稼働情報と統計モデルから統計的距離を計算して異常度を求め、設定ファイルで指定したしきい値を使って異常度が異常であるか否かを判断する方法が開示されているものの、新しい構成の機器の稼働情報による異常の判断に必要な構成が記載されていない。 On the other hand, in Patent Document 1, a statistical model is created by learning using a statistical method, the statistical distance is calculated from the operation information and the statistical model, the degree of abnormality is obtained, and the threshold value specified in the setting file Although a method for determining whether or not the degree of abnormality is abnormal is disclosed using, a configuration necessary for determining an abnormality based on operation information of a newly configured device is not described.

特許文献２では、機器の種類毎に異常を判定するしきい値を予め設定して、稼働情報が異常か否かを判断する方法が開示されているものの、統計的手法を用いた学習により統計モデルを作成していないので、事前に全ての機器の種類毎に適切なしきい値を設定するというコストや手間を強いることになる。 Although Patent Document 2 discloses a method of setting a threshold value for determining an abnormality for each type of device in advance and determining whether or not the operation information is abnormal, statistics are obtained by learning using a statistical method. Since a model has not been created, the cost and labor of setting an appropriate threshold value for each type of device in advance is imposed.

特許文献３では、異常を検出する正常動作モデルの動的な学習方法が開示されているものの、予め静的な解析に基づいて得た解析正常動作モデルを用意する必要がある。 Although Patent Document 3 discloses a dynamic learning method for a normal operation model for detecting an abnormality, it is necessary to prepare an analysis normal operation model obtained based on static analysis in advance.

図１は、本特許との違いを例示するために特許文献１の主要部の構成を例示するブロック図である。 FIG. 1 is a block diagram illustrating the configuration of the main part of Patent Document 1 in order to illustrate the difference from this patent.

稼働情報取得手段１５は、機器の運用時の稼働情報を収集し、稼働情報データ２１に記憶する。学習手段３０は、稼働情報データ２０から統計的手法を用いた学習を行い、学習された学習データを学習データ４０に記憶する。解析手段５１は、前記稼働情報から、前記学習データから統計的距離のスコアを出力する。異常判定手段６１は、予めしきい値情報７１に記憶されたしきい値と前記スコアを比較し、異常か否かを判断する。 The operation information acquisition unit 15 collects operation information during operation of the device and stores it in the operation information data 21. The learning unit 30 performs learning using the statistical method from the operation information data 20 and stores the learned data in the learning data 40. The analysis unit 51 outputs a statistical distance score from the learning data based on the operation information. The abnormality determination means 61 compares the score with the threshold value stored in advance in the threshold information 71 to determine whether or not there is an abnormality.

図２は、本特許との違いを例示するために特許文献２の主要部の構成を例示するブロック図である。稼働情報取得手段１５は、機器の運用時の稼働情報を収集し、稼働情報データ２０に記憶し、稼働情報を出力する。 FIG. 2 is a block diagram illustrating the configuration of the main part of Patent Document 2 in order to illustrate the difference from this patent. The operation information acquisition unit 15 collects operation information during operation of the device, stores the operation information in the operation information data 20, and outputs the operation information.

稼働構成取得手段１７は、機器の種類を識別し、予め機器毎のしきい値を記憶したしきい値情報７１から、前記稼働情報を収集した機器のしきい値を出力する。異常判定手段６１は、前記稼働情報と前記しきい値を比較し、異常か否かを判断する。 The operation configuration acquisition unit 17 identifies the type of device, and outputs the threshold value of the device that has collected the operation information from the threshold information 71 that stores the threshold value for each device in advance. The abnormality determination unit 61 compares the operation information with the threshold value and determines whether or not there is an abnormality.

本発明の目的は、機器の稼働構成の変更（ハードウェアおよびソフトウェアの変更、更新、追加、削除など）が必要に応じて随時行われる機器の稼働中の状態の特徴的な変化をとらえた稼働情報を用いた異常検知において、統計モデルを使った異常検知を行いながら機器の稼働情報から統計モデルを学習し、稼働情報と統計モデルから統計的距離を計算して求めた異常度が異常か否か判別するしきい値において、異常検知を行いながら統計モデルの学習量に応じて稼働構成毎のしきい値を学習して異常検知をおこなう異常検出方法および情報処理システムを提供することにある。 The object of the present invention is to change the operating configuration of a device (change, update, addition, deletion, etc. of hardware and software) as needed, and to detect characteristic changes in the operating state of the device. In anomaly detection using information, the statistical model is learned from the operation information of the device while detecting the anomaly using the statistical model, and the degree of abnormality obtained by calculating the statistical distance from the operation information and the statistical model is abnormal. An object of the present invention is to provide an anomaly detection method and an information processing system for detecting anomalies by detecting an anomaly according to the learning amount of a statistical model while detecting anomalies.

本発明の前記目的と新規な特徴は本明細書の記述及び添付図面から明らかになるであろう。 The above objects and novel features of the present invention will become apparent from the description of the present specification and the accompanying drawings.

本願において開示される発明のうち代表的なものについて簡単に説明すれば下記の通りである。 A typical one of the inventions disclosed in the present application will be briefly described as follows.

すなわち、本発明の異常検出方法は、複数の機器の稼働状態を監視し、前記機器の異常を検出する異常検出方法であって、前記稼働状態を示す稼働情報を前記機器から収集し、前記稼働情報を学習して学習結果を記憶する第１の学習ステップと、前記機器の稼働時の構成からなる稼働構成を収集して、前記第１の学習ステップにおいて前記稼働構成に対応した前記機器から収集した前記稼働情報の学習量に応じた前記稼働構成毎のしきい値を学習して稼働構成に対応したしきい値を記憶する第２の学習ステップと、前記機器から収集した前記稼働情報と前記学習結果とを比較して解析し、その内容を異常度として出力する解析ステップと、前記異常度と、前記解析ステップで前記異常度を求めた前記機器と同一の前記稼働構成に対応したしきい値を比較して、前記異常度が異常か否の値を示すかを判断する異常判定ステップとを有することを特徴とする。 That is, the abnormality detection method of the present invention is an abnormality detection method for monitoring an operation state of a plurality of devices and detecting an abnormality of the device, collecting operation information indicating the operation state from the device, and A first learning step for learning information and storing a learning result; and an operating configuration comprising a configuration during operation of the device is collected and collected from the device corresponding to the operating configuration in the first learning step. A second learning step of learning a threshold value corresponding to the operational configuration by learning a threshold value corresponding to the operational configuration according to the learned amount of the operational information, the operational information collected from the device, and the The analysis step of comparing and analyzing the learning result, and outputting the content as an abnormality level, the abnormality level, and the threshold corresponding to the same operation configuration as the device for which the abnormality level was obtained in the analysis step By comparing the values, the abnormality degree and having an abnormality determination step of determining whether indicating the value of the abnormal or not.

また、本発明の情報処理システムは、複数の機器と、前記機器の稼働状態を監視すると共に前記機器の異常を検出する異常監視装置とを備えて成る情報処理システムであって、前記稼働状態を示す稼働情報を前記機器から収集し、前記稼働情報を学習して学習結果を記憶する第１の学習処理部と、前記機器の稼働時の構成からなる稼働構成を収集して、前記第１の学習ステップにおいて前記稼働構成に対応した前記機器から収集した前記稼働情報の学習量に応じた前記稼働構成毎のしきい値を学習して稼働構成に対応したしきい値を記憶する第２の学習処理部と、前記機器から収集した前記稼働情報と前記学習結果とを比較して解析し、その内容を異常度として出力する解析処理部と、前記異常度と、前記解析ステップで前記異常度を求めた前記機器と同一の前記稼働構成に対応したしきい値を比較して、前記異常度が異常か否の値を示すかを判断する異常判定処理部とを前記異常監視装置に備えることを特徴とする。 An information processing system according to the present invention is an information processing system comprising a plurality of devices and an abnormality monitoring device that monitors an operation state of the device and detects an abnormality of the device. A first learning processing unit that collects operation information to be collected from the device, learns the operation information, and stores a learning result; and collects an operation configuration including a configuration at the time of operation of the device, 2nd learning which learns the threshold value for every said operating configuration according to the learning amount of the said operating information collected from the said apparatus corresponding to the said operating configuration in a learning step, and memorize | stores the threshold value corresponding to an operating configuration A processing unit, an analysis processing unit that compares and analyzes the operation information collected from the device and the learning result, and outputs the content as an abnormality level, the abnormality level, and the abnormality level in the analysis step Sought The abnormality monitoring apparatus includes an abnormality determination processing unit that compares a threshold value corresponding to the same operation configuration as the device and determines whether the abnormality degree indicates a value of abnormality. To do.

尚、本発明の情報処理システムにおいては、前記解析処理部と前記異常判定処理部とを前記異常監視装置の代わりに前記機器に備えるよう構成してもよい。 In the information processing system of the present invention, the analysis processing unit and the abnormality determination processing unit may be provided in the device instead of the abnormality monitoring device.

本発明によれば、機器の運用中に稼働情報を使って学習モデルを学習できるので、予め試験環境では検出できない異常の検出が、学習した学習モデルを使うことででき、さらに新設、機能の追加、削除、更新などによる機器の構成が変更されても学習済みの学習モデルを使った異常の判定を誤検出する確率を低減することができる。 According to the present invention, the learning model can be learned using the operation information during operation of the device. Therefore, the abnormality that cannot be detected in advance in the test environment can be detected by using the learned learning model. Even if the configuration of the device is changed due to deletion, update, etc., the probability of erroneously detecting an abnormality determination using a learned learning model can be reduced.

特許文献１の主要部の構成を例示するブロック図である。10 is a block diagram illustrating a configuration of a main part of Patent Document 1. FIG. 特許文献２の主要部の構成を例示するブロック図である。10 is a block diagram illustrating a configuration of a main part of Patent Document 2. FIG. 本発明の一実施の形態の概要を例示するブロック図である。It is a block diagram which illustrates the outline | summary of one embodiment of this invention. データマイニングによる異常度を求める方法の概要を例示する図である。It is a figure which illustrates the outline | summary of the method of calculating | requiring the abnormality degree by data mining. クラスタ範囲を求める方法の概要を例示する図である。It is a figure which illustrates the outline | summary of the method of calculating | requiring a cluster range. 学習済の稼働構成の機器における稼働情報の特徴ベクトルの遷移の概要を例示する図である。It is a figure which illustrates the outline | summary of the transition of the feature vector of the operation information in the apparatus of the learned operation configuration. 未学習の稼働構成の機器における稼働情報の特徴ベクトルの遷移の概要を例示する図である。It is a figure which illustrates the outline | summary of the transition of the feature vector of the operation information in the apparatus of an unlearned operation configuration. 学習済の稼働構成の機器における正規化された異常度の変化の一例を例示する説明図である。It is explanatory drawing which illustrates an example of the change of the normalized abnormality degree in the apparatus of the learned operation structure. 未学習の稼働構成の機器の稼働数の変化の概要を例示する説明図である。It is explanatory drawing which illustrates the outline | summary of the change of the operation number of the apparatus of an unlearned operation structure. 未学習の稼働構成の機器の稼働情報を用いて学習する学習量の変化の概要を例示する説明図である。It is explanatory drawing which illustrates the outline | summary of the change of the learning amount learned using the operation information of the apparatus of an unlearned operation structure. 未学習の稼働構成の機器の稼働情報から異常と判断するしきい値設定を例示する説明図である。It is explanatory drawing which illustrates the threshold value setting judged to be abnormal from the operation information of the apparatus of an unlearned operation configuration. 未学習の稼働構成の機器の稼働情報から異常と判断する他のしきい値設定を例示する説明図である。It is explanatory drawing which illustrates the other threshold value setting judged to be abnormal from the operation information of the apparatus of an unlearned operation configuration. 本発明の一実施の形態である情報処理システムの構成を例示するブロック図である。It is a block diagram which illustrates the composition of the information processing system which is one embodiment of the present invention. 本発明の一実施の形態である異常監視装置の構成を例示するブロック図である。1 is a block diagram illustrating a configuration of an abnormality monitoring apparatus according to an embodiment of the present invention. 本発明の一実施の形態であるＨＧＷの構成を例示するブロック図である。It is a block diagram which illustrates the composition of HGW which is one embodiment of the present invention. 本発明の一実施の形態である異常監視装置とＨＧＷの構成を例示するブロック図である。It is a block diagram which illustrates the composition of an abnormality monitoring device and HGW which are one embodiment of the present invention. 本発明の一実施の形態である学習処理（６１０）の動作を例示するフローチャート図である。It is a flowchart figure which illustrates the operation | movement of the learning process (610) which is one embodiment of this invention. 本発明の一実施の形態である稼働構成登録処理（６３０）の動作を例示するフローチャート図である。It is a flowchart figure which illustrates operation | movement of the operation | movement structure registration process (630) which is one embodiment of this invention. 本発明の一実施の形態であるしきい値学習処理（６４０）の動作を例示するフローチャート図である。It is a flowchart figure which illustrates the operation | movement of the threshold value learning process (640) which is one embodiment of this invention. 本発明の一実施の形態であるしきい値更新処理（ステップＳ２３０）の動作を例示するフローチャート図である。It is a flowchart figure which illustrates the operation | movement of the threshold value update process (step S230) which is one embodiment of this invention. 本発明の一実施の形態である他のしきい値更新処理（ステップＳ２３０）の動作を例示するフローチャート図である。It is a flowchart figure which illustrates the operation | movement of the other threshold value update process (step S230) which is one embodiment of this invention. 本発明の一実施の形態である異常判定処理（６２０）の動作を例示するフローチャート図である。It is a flowchart figure which illustrates the operation | movement of the abnormality determination process (620) which is one embodiment of this invention. 本発明の一実施の形態であるハード構成情報１８３を例示する図である。It is a figure which illustrates the hardware configuration information 183 which is one embodiment of this invention. 本発明の一実施の形態であるアプリ情報１８２を例示する図である。It is a figure which illustrates the application information 182 which is one embodiment of this invention. 本発明の一実施の形態である顧客情報１８１を例示する図である。It is a figure which illustrates the customer information 181 which is one embodiment of this invention. 本発明の一実施の形態である稼働構成データ８５の一例を例示する図である。It is a figure which illustrates an example of the operation structure data 85 which is one embodiment of this invention. 本発明の一実施の形態であるしきい値データ７０を例示する図である。It is a figure which illustrates the threshold value data 70 which is one embodiment of this invention. 本発明の一実施の形態である学習データ４０を例示する図である。It is a figure which illustrates the learning data 40 which is one embodiment of this invention. 本発明の一実施の形態である機器稼働情報１５５を例示する図である。It is a figure which illustrates the apparatus operation information 155 which is one embodiment of this invention. 本発明の一実施の形態である稼働情報データ２０を例示する図である。It is a figure which illustrates the operation information data 20 which is one embodiment of this invention. 本発明の一実施の形態である図１６の動作の流れを例示するシーケンス図である。FIG. 17 is a sequence diagram illustrating an operation flow of FIG. 16 as an embodiment of the present invention. 他の構成における動作の流れを例示するシーケンス図である。It is a sequence diagram which illustrates the flow of operation in other composition.

本発明は、運用中に、異なる働構成からなる複数の機器から、機器の稼働中の状態の特徴を表す稼働情報を受信し、稼働情報から統計的な手法を用いた学習を行い、学習データを更新して保存する第１の学習手段と、異なる構成の機器を識別する稼働構成と、第１の学習手段において同一の稼働構成の機器から得た稼働情報を使った学習量から、稼働構成毎のしきい値を更新して保存する第２の学習手段を備えることを特徴とする。 The present invention receives operation information representing characteristics of the operating state of a device from a plurality of devices having different operation configurations during operation, performs learning using a statistical method from the operation information, and learns data The first learning means for updating and storing the operation configuration, the operation configuration for identifying devices having different configurations, and the learning amount using the operation information obtained from the devices having the same operation configuration in the first learning means, A second learning means for updating and storing each threshold value is provided.

さらに、異常度解析手段は、受信した稼働情報と、第１の学習手段が保存した学習データから異常度を算出し、異常度と受信した稼働情報の機器の稼働構成に対応する第２の学習手段が保存したしきい値を比較し、異常度が異常を示す値か否かを判断する異常判定手段を備えることを特徴とする。 Further, the abnormality level analysis means calculates the abnormality degree from the received operation information and the learning data stored by the first learning means, and performs second learning corresponding to the operation configuration of the equipment of the abnormality degree and the received operation information. An abnormality determining means is provided for comparing the threshold values stored by the means and determining whether or not the degree of abnormality is a value indicating abnormality.

以下、本発明の好適な実施の形態について、図面を参照して詳細に説明する。なお、実施の形態を説明する図において、同一部には原則として同一の符号を付し、その繰り返し説明は省略する。 DESCRIPTION OF EXEMPLARY EMBODIMENTS Hereinafter, preferred embodiments of the invention will be described in detail with reference to the drawings. Note that in the drawings illustrating the embodiment, the same components are denoted by the same reference symbols in principle, and the repetitive description thereof is omitted.

以下一実施の形態について詳述する。 Hereinafter, an embodiment will be described in detail.

まず、本発明の代表的な実施の形態について、その概要を実施例１として説明する。 First, an outline of a typical embodiment of the present invention will be described as a first embodiment.

本発明の概要を図１と図２に対峙して図３に例示する。図３は、本発明の一実施の形態の概要を例示するブロック図である。 The outline of the present invention is illustrated in FIG. 3 in contrast to FIGS. FIG. 3 is a block diagram illustrating an outline of an embodiment of the present invention.

図３において、稼働情報取得手段１５は、任意の数の機器において、動作中に取得できる各種稼働情報を収集し稼働情報データに格納するように動作する。
稼働構成取得手段１６は、稼働情報取得手段１５と供に、任意の数の機器の構成を示す稼働構成を収集するように動作する。 In FIG. 3, the operation information acquisition unit 15 operates to collect various operation information that can be acquired during operation and store it in operation information data in an arbitrary number of devices.
The operation configuration acquisition unit 16 operates together with the operation information acquisition unit 15 to collect operation configurations indicating configurations of an arbitrary number of devices.

学習手段３０は、稼働情報データ２０の稼働情報から統計的手法を用いて統計モデルを作成し、学習データ４０に格納するように動作する。 The learning unit 30 operates to create a statistical model from the operation information of the operation information data 20 by using a statistical method and store it in the learning data 40.

管理手段８０は、稼働構成取得手段１６で収集された稼働構成情報において稼働構成データ８５に未登録の稼働構成情報を稼働構成データに登録するように動作する。 The management unit 80 operates to register the operation configuration information not registered in the operation configuration data 85 in the operation configuration information collected by the operation configuration acquisition unit 16 in the operation configuration data.

しきい値学習手段９０は、稼働構成データ８５の稼働構成毎に学習手段３０で学習に使われた稼働情報の数である学習量を測定し、学習量に応じたしきい値を計算し、求めたしきい値でしきい値データ７０を更新するように動作する。 The threshold learning unit 90 measures a learning amount that is the number of operation information used for learning by the learning unit 30 for each operation configuration of the operation configuration data 85, calculates a threshold corresponding to the learning amount, The threshold data 70 is updated with the obtained threshold value.

異常度解析手段５０は、稼働情報取得手段１５で収集した稼働情報と学習データ４０の統計モデルとの統計的距離を異常度として出力するように動作する。 The abnormality degree analysis unit 50 operates to output the statistical distance between the operation information collected by the operation information acquisition unit 15 and the statistical model of the learning data 40 as the degree of abnormality.

異常判定手段は、異常度解析手段５０の出力である異常度と、しきい値データ７０の、異常度を求めた稼働情報を出力した機器の稼働構成に基づくしきい値とを比較し、異常度が異常か否かを示すものか判断するように動作する。 The abnormality determination means compares the abnormality degree output from the abnormality degree analysis means 50 with the threshold value based on the operation configuration of the device that outputs the operation information for which the abnormality degree is obtained, in the threshold data 70, It operates to determine whether the degree indicates an abnormality.

学習処理（６００）は、稼働情報データ２０、学習手段３０、学習データ４０を含む。稼働構成登録処理（６３０）は、管理手段８０，稼働構成データ８５を含む。しきい値学習処理（６４０）は、しきい値学習手段９０としきい値データ７０を含む。異常判定処理（６２０）は、異常度解析手段５０と異常判定手段６０を含む。 The learning process (600) includes operation information data 20, learning means 30, and learning data 40. The operation configuration registration process (630) includes management means 80 and operation configuration data 85. The threshold learning process (640) includes threshold learning means 90 and threshold data 70. The abnormality determination process (620) includes an abnormality degree analysis unit 50 and an abnormality determination unit 60.

次に、学習手段３０における統計的手法を用いた学習データ４０の作成の一例を、図４を用いて説明する。図４は、データマイニングによる異常度を求める方法の概要を例示する図である。学習データ４０は、j個のクラスタω_jの情報（平均ベクトルｍ、標準偏差σ）と、クラスタしきい値Thからなる。図４に例示したｊ個のクラスタω_jは、例えば、データマイニング手法において、データをグループ分けするクラスタ分析方法を利用し、任意の数の機器より収集した稼働情報から例えばp次元の特徴ベクトルxを抽出し、多量の特徴ベクトルからクラスタ分析を行うことで求められる。 Next, an example of creation of learning data 40 using a statistical method in the learning means 30 will be described with reference to FIG. FIG. 4 is a diagram illustrating an outline of a method for obtaining the degree of abnormality by data mining. The learning data 40 includes information (average vector m, standard deviation σ) of _j clusters ω _j and a cluster threshold Th. The j clusters ω _j illustrated in FIG. 4 are obtained from, for example, a p-dimensional feature vector x based on operation information collected from an arbitrary number of devices by using a cluster analysis method for grouping data in a data mining method. Is extracted, and cluster analysis is performed from a large number of feature vectors.

クラスタω_iのクラスタしきい値を求める方法の一例を、図４と図５を用いて説明する。図５は、クラスタ範囲を求める方法の概要を例示する図である。クラスタしきい値はクラスタ範囲とする。例えば、学習に用いた稼働情報の特徴から求められた特徴ベクトルの分布が、クラスタω_iにおいて標準偏差をσ_iとしてN(m_i, σ_i ²)の正規分布に従うとすれば、図５に例示するようにN(m_i, σ_i ²)の正規分布において、予め設定した棄却率をαとした確率点をクラスタω_iにおけるクラスタしきい値とすることができる。 An example of a method for obtaining the cluster threshold value of the cluster ω _i will be described with reference to FIGS. 4 and 5. FIG. 5 is a diagram illustrating an outline of a method for obtaining a cluster range. The cluster threshold is the cluster range. For example, if the distribution of feature vectors obtained from the features of the operation information used for learning follows a normal distribution of N (m _i , σ _i ² ) with a standard deviation of σ _i in cluster ω _i , FIG. As illustrated, in the normal distribution of N (m _i , σ _i ² ), a probability point with a preset rejection rate α can be set as the cluster threshold value in the cluster ω _i .

また、例えば、クラスタω_iに属する学習に供する特徴ベクトルの中で、平均値m_iからの統計的距離が最も遠い特徴ベクトルをクラスタしきい値とすることができる。 Further, for example, among the feature vectors used for learning belonging to the cluster ω _i , the feature vector having the longest statistical distance from the average value m _i can be set as the cluster threshold value.

次に、異常度解析手段５０において、異常度を求める方法の一例を、図４を用いて説明する。図４において、i番目のクラスタをωｉ、クラスタω_iの平均ベクトルをm_iとすれば、平均ベクトルm_iと特徴ベクトルxとの統計的距離が求まる。この統計的距離が最も平均ベクトルm_iに近い値がクラスタω_iに属する異常度と判断することができる。 Next, an example of a method for obtaining the degree of abnormality in the degree of abnormality analysis means 50 will be described with reference to FIG. In FIG. 4, .omega.i the i-th cluster, if an average vector of the cluster omega _i and m _i, statistical distance between the mean vector m _i and the feature vector x is obtained. It can be determined that the value of the statistical distance closest to the average vector m _i is the degree of abnormality belonging to the cluster ω _i .

次に、異常判定手段６０において異常か否かを求める方法について図６乃至図１２を用いて説明する。 Next, a method for determining whether there is an abnormality in the abnormality determination means 60 will be described with reference to FIGS.

まず、学習済の稼働構成の機器の稼働情報を使用した異常判別方法について図６と図８を用いて説明する。図６は、学習済の稼働構成の機器における稼働情報の特徴ベクトルの遷移の概要を例示する図である。図８は、学習済の稼働構成の機器における正規化された異常度の変化の一例を例示する説明図である。 First, an abnormality determination method using the operation information of the learned operation configuration device will be described with reference to FIGS. 6 and 8. FIG. 6 is a diagram illustrating an outline of the transition of feature vectors of operation information in a learned operation configuration device. FIG. 8 is an explanatory diagram illustrating an example of a change in the normalized degree of abnormality in a device having a learned operation configuration.

図６は説明を簡単にするため２次元の特徴ベクトルを例示する。特徴ベクトルは、図６に例示するように各クラスタ間を遷移する。正常な稼働状態では、稼働情報の特徴ベクトルの分布は各クラスタ範囲内に分布する。しかし、特徴ベクトルが各クラスタ範囲の外に遷移した場合は、クラスタ範囲をクラスタしきい値として稼働情報が異常な状態を示すと判断することができる。 FIG. 6 illustrates a two-dimensional feature vector for ease of explanation. The feature vector transitions between the clusters as illustrated in FIG. In a normal operating state, the distribution of feature vectors of operating information is distributed within each cluster range. However, when the feature vector transitions outside each cluster range, it can be determined that the operation information indicates an abnormal state using the cluster range as a cluster threshold value.

図６の各クラスタしきい値が異なる値のため、異常度が属するクラスタのクラスタしきい値でしきい値が１になるように異常度を正規化し、正規化された異常度を求めることができる。したがって、図８において、正規化された異常度がしきい値１を超える時を異常と判断することができる。 Since each cluster threshold value in FIG. 6 is a different value, it is possible to normalize the abnormality level so that the threshold value becomes 1 with the cluster threshold value of the cluster to which the abnormality level belongs, and obtain the normalized abnormality level. it can. Therefore, in FIG. 8, when the normalized abnormality degree exceeds the threshold value 1, it can be determined as an abnormality.

次に、未学習の稼働構成の機器の稼働情報を使用した異常判別方法について図７と図９乃至図１２を用いて説明する。図７は、未学習の稼働構成の機器における稼働情報の特徴ベクトルの遷移の概要を例示する図である。図９は、未学習の稼働構成の機器の稼働数の変化の概要を例示する説明図である。図１０は、未学習の稼働構成の機器の稼働情報を用いて学習する学習量の変化の概要を例示する説明図である。図１１は、未学習の稼働構成の機器の稼働情報から異常と判断するしきい値設定を例示する説明図である。図１２は、未学習の稼働構成の機器の稼働情報から異常と判断する他のしきい値設定を例示する説明図である。図７は説明を簡単にするため２次元の特徴ベクトルを例示する。 Next, an abnormality determination method using operation information of an unlearned operation configuration device will be described with reference to FIGS. 7 and 9 to 12. FIG. 7 is a diagram illustrating an outline of the transition of the feature vector of the operation information in an unlearned operation configuration device. FIG. 9 is an explanatory diagram illustrating an overview of changes in the number of operating devices having an unlearned operating configuration. FIG. 10 is an explanatory diagram illustrating an overview of changes in the learning amount that is learned using the operation information of an unlearned operation configuration device. FIG. 11 is an explanatory diagram illustrating threshold setting for determining an abnormality from the operation information of an unlearned operation configuration device. FIG. 12 is an explanatory diagram illustrating another threshold setting for determining an abnormality from the operation information of an unlearned operation configuration device. FIG. 7 illustrates a two-dimensional feature vector for ease of explanation.

未学習の稼働情報を使用して学習した場合、学習前に対して新しいクラスタが追加されることが予想しえる。例えば、未学習の稼働情報より求まる特徴ベクトルは、図７において各クラスタの円の外に遷移する可能性もあり得る。特定の構成における稼働情報に対して未学習の段階では、異常とも新たなクラスタに属することになることもあり得る。 When learning is performed using unlearned operation information, a new cluster can be expected to be added before learning. For example, the feature vector obtained from the unlearned operation information may possibly move outside the circle of each cluster in FIG. In an unlearned stage with respect to the operation information in a specific configuration, both abnormalities may belong to a new cluster.

未学習の稼働構成の機器の稼働数と時刻の関係は、図９に例示することができる。未学習の稼働構成の機器の稼働情報の学習量は、図１０に例示するように稼働数の積分で増加する。したがって、稼働構成毎のしきい値の設定方法は、例えば図１１に例示するように学習量が少ないときは、予め設定した最大のしきい値とし、学習量が予め指定した値以上になればしきい値を１とすることができる。また、稼働構成毎の他のしきい値の設定方法は、図１２に例示するように、しきい値を学習量の引数とする関数として求め、学習量が予め指定した値以上になればしきい値を１とすることができる。 The relationship between the number of operating devices having an unlearned operating configuration and time can be illustrated in FIG. The learning amount of the operation information of the device having the unlearned operation configuration increases as the operation number is integrated as illustrated in FIG. Therefore, the threshold value setting method for each operational configuration is, for example, when the learning amount is small as illustrated in FIG. 11, the maximum threshold value set in advance is set, and the learning amount is equal to or greater than a predetermined value. The threshold can be 1. Further, as shown in FIG. 12, another threshold value setting method for each operation configuration is obtained as a function using the threshold value as an argument of the learning amount, and the learning amount is not less than a predetermined value. The threshold can be 1.

次に、本発明の他の一実施の形態である実施例２に係る情報処理システムの構成を、図１３を用いて説明する。図１３は、本発明の一実施の形態である情報処理システムの構成を例示するブロック図である。 Next, the configuration of an information processing system according to Example 2, which is another embodiment of the present invention, will be described with reference to FIG. FIG. 13 is a block diagram illustrating a configuration of an information processing system according to an embodiment of this invention.

当該情報処理システムは、インターネット１２０と、例えばＮＧＮ（ＮｅｘｔＧｅｎｅｒａｔｉｏｎＮｅｔｗｏｒｋ）等の加入者線ネットワーク１１０の双方に接続された加入者線基地局１００と、加入者線ネットワークに接続された複数の加入者宅を有する構成となっている。 The information processing system includes a subscriber line base station 100 connected to both the Internet 120, a subscriber line network 110 such as NGN (Next Generation Network), and a plurality of subscribers connected to the subscriber line network. It has a configuration with a home.

加入者線基地局１００は、プログラム配信装置１０１、ゲートウェイ装置１０２、異常監視装置１０３、を有する構成となっている。 The subscriber line base station 100 includes a program distribution device 101, a gateway device 102, and an abnormality monitoring device 103.

加入者宅１３０は、ＨＧＷ１３１、センサ装置１３４、情報家電１３５、を有する構成となっている。 The subscriber home 130 has a configuration including an HGW 131, a sensor device 134, and an information home appliance 135.

プログラム配信装置１０１、異常監視装置１０３は、加入者線ネットワーク１１０に接続しており、所定の手順にしたがって情報の送受信を行うことができる。 The program distribution apparatus 101 and the abnormality monitoring apparatus 103 are connected to the subscriber line network 110 and can send and receive information according to a predetermined procedure.

ゲートウェイ装置１０２は、インターネット１２０と加入者線ネットワーク１１０の双方に接続しており、インターネット１２０に接続された機器と、加入者線ネットワーク１１０に接続された機器との情報の送受信を行うためのゲートウェイとしての機能を有する。 The gateway device 102 is connected to both the Internet 120 and the subscriber line network 110, and is a gateway for transmitting and receiving information between a device connected to the Internet 120 and a device connected to the subscriber line network 110. As a function.

ＨＧＷ１３１は、加入者線ネットワーク１１０とホームネットワーク１３２の双方に接続しており、加入者線ネットワーク１１０に接続された機器と、ホームネットワーク１３２に接続された機器との情報の送受信を行うためのゲートウェイとしての機能を有する。 The HGW 131 is connected to both the subscriber line network 110 and the home network 132, and is a gateway for transmitting and receiving information between a device connected to the subscriber line network 110 and a device connected to the home network 132. As a function.

センサ装置１３４、情報家電１３５は、有線または無線で構成されるホームネットワーク１３２と接続し、所定の手順にしたがって情報の送受信を行うことができる。 The sensor device 134 and the information home appliance 135 are connected to a home network 132 configured by wire or wireless, and can transmit and receive information according to a predetermined procedure.

次に、本実施例における異常監視装置１０３とＨＧＷ１３１のハードウェア構成の一例を図１４，図１５を用いて説明する。図１４は、本発明の一実施の形態である異常監視装置１０３の構成を例示するブロック図である。 Next, an example of the hardware configuration of the abnormality monitoring apparatus 103 and the HGW 131 in the present embodiment will be described with reference to FIGS. FIG. 14 is a block diagram illustrating the configuration of the abnormality monitoring apparatus 103 according to the embodiment of this invention.

図１４において、異常監視装置１０３は、ＣＰＵ３００、通信ＩＦ３０１、不揮発性記憶装置３０２、メインメモリ３０３、不揮発性メモリ３０４、を有する構成となっており、これらはそれぞれバス３０５と接続され、所定の手順にしたがって情報の送受信を行うことができる。 In FIG. 14, the abnormality monitoring device 103 has a configuration including a CPU 300, a communication IF 301, a non-volatile storage device 302, a main memory 303, and a non-volatile memory 304, which are each connected to a bus 305 and have a predetermined procedure. The information can be transmitted and received according to

不揮発性メモリ３０４にはブートプログラムが記憶されており、また、不揮発性記憶装置３０２には各種プログラムが記憶されている。異常監視装置１０３が起動すると、不揮発性メモリ３０４に記憶されたブートプログラムによって不揮発性記憶装置３０２から各種プログラムがメインメモリ３０３へと読み出される。ＣＰＵ３００はメインメモリ３０３に読み出された各種プログラムを実行することにより情報を処理し、通信ＩＦ３０１等による情報の送受信等を行うことができる。 The non-volatile memory 304 stores a boot program, and the non-volatile storage device 302 stores various programs. When the abnormality monitoring device 103 is activated, various programs are read from the nonvolatile storage device 302 to the main memory 303 by a boot program stored in the nonvolatile memory 304. The CPU 300 can process information by executing various programs read to the main memory 303, and can perform transmission / reception of information by the communication IF 301 or the like.

不揮発性記憶装置３０２には、上述のように、ＣＰＵ３００がメインメモリ３０３に読み出して実行するための各種プログラムが記憶されており、例えばＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）、ＳＤＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）、光ディスク（ＯｐｔｉｃａｌＤｉｓｋＤｒｉｖｅ）等によって実現することができる。通信ＩＦ３０１は、ネットワークカード等により実現することができる。通信ＩＦ３０１は、加入者線ネットワーク１１０と接続しており、加入者線ネットワーク１１０と接続する機器との間で情報の送受信を行うことができる。 As described above, the nonvolatile storage device 302 stores various programs for the CPU 300 to read into the main memory 303 and execute them. For example, an HDD (Hard Disk Drive), an SDD (Solid State Drive), an optical disk ( (Optical Disk Drive) and the like. The communication IF 301 can be realized by a network card or the like. The communication IF 301 is connected to the subscriber line network 110, and can transmit and receive information to and from devices connected to the subscriber line network 110.

図１５は、本発明の一実施の形態であるＨＧＷ１３１の構成を例示するブロック図である。ＨＧＷ１３１は、ＣＰＵ３１０、メインメモリ３１３、不揮発性メモリ３１４、センサ装置３１５、異常対策装置３１６、第１の通信ＩＦ３１１、第２の通信ＩＦ３１２を有する構成となっており、これらはそれぞれバス３１７と接続され、所定の手順にしたがって情報の送受信を行うことができる。 FIG. 15 is a block diagram illustrating the configuration of the HGW 131 which is an embodiment of the present invention. The HGW 131 includes a CPU 310, a main memory 313, a nonvolatile memory 314, a sensor device 315, an abnormality countermeasure device 316, a first communication IF 311, and a second communication IF 312, which are connected to a bus 317. Information can be transmitted and received according to a predetermined procedure.

不揮発性メモリ３１４にはブートプログラム、各種プログラムが記憶されており、ＨＧＷ１３１が起動すると不揮発性メモリ３１４に記憶されたブートプログラムによって不揮発性メモリ３１４から各種プログラムがメインメモリ３１３へと読み出される。ＣＰＵ３１０はメインメモリ３１３に読み出された各種プログラムを実行することにより情報を処理し、第１の通信ＩＦ３１１、第２の通信ＩＦ３１２等による情報の送受信等を行う。 The nonvolatile memory 314 stores a boot program and various programs. When the HGW 131 is activated, the various programs are read from the nonvolatile memory 314 to the main memory 313 by the boot program stored in the nonvolatile memory 314. The CPU 310 processes information by executing various programs read to the main memory 313, and performs transmission / reception of information through the first communication IF 311, the second communication IF 312 and the like.

センサ装置３１５は、各種プログラムのＣＰＵ３１０による実行に伴い発生する各種状態の変動を取得することができる。異常対策装置３１６は、異常の検出または異常の予兆を検出したときに指示により、異常の復旧または異常が起こらないよう対策を行う。 The sensor device 315 can acquire various state fluctuations that occur as the CPU 310 executes various programs. The abnormality countermeasure device 316 takes measures to recover from an abnormality or prevent an abnormality from occurring when an abnormality is detected or a sign of an abnormality is detected.

第１の通信ＩＦ３１１は、ネットワークカード等により実現することができる。第２の通信ＩＦ３１２は、ネットワークカード等により実現することができる。第１の通信ＩＦ３１１は、加入者線ネットワーク１１０と接続しており、加入者線ネットワーク１１０と接続する機器との間で情報の送受信を行うことができる。第２の通信ＩＦ３１２は、ホームネットワーク１３２と接続しており、ホームネットワーク１３２に接続する機器との間で情報の送受信を行うことができる。 The first communication IF 311 can be realized by a network card or the like. The second communication IF 312 can be realized by a network card or the like. The first communication IF 311 is connected to the subscriber line network 110 and can transmit and receive information to and from devices connected to the subscriber line network 110. The second communication IF 312 is connected to the home network 132 and can send and receive information to and from devices connected to the home network 132.

なお、上述した異常監視装置１０３およびＨＧＷ１３１の構成は、図１４および図１５に例示する構成に限定されないことは当然である。例えば、ＨＧＷ１３１において、センサ装置３１５および異常対策装置３１６は、全てソフトウェアプログラムによって実現され、ＣＰＵ３１０で実行されるような場合は、センサ装置３１５および異常対策装置３１６を有さない構成となる。この場合、当該ソフトウェアプログラムは不揮発性メモリ３１４に記憶され、メインメモリ３１３上に読み出されてＣＰＵ３１０によって実行される。 Naturally, the configurations of the abnormality monitoring apparatus 103 and the HGW 131 described above are not limited to the configurations illustrated in FIGS. 14 and 15. For example, in the HGW 131, the sensor device 315 and the abnormality countermeasure device 316 are all realized by a software program, and when executed by the CPU 310, the sensor device 315 and the abnormality countermeasure device 316 are not included. In this case, the software program is stored in the nonvolatile memory 314, read onto the main memory 313, and executed by the CPU 310.

プログラム配信装置１０１のハードウェア構成は例示しないが、少なくとも１台以上のコンピュータ（ＣＰＵ、メインメモリ、不揮発性記憶装置、入力装置、出力装置、通信ＩＦ等を含む）から構成されている。不揮発性記憶装置からメインメモリに読み出され、ＣＰＵによって実行される各種プログラムは、例えば不揮発性記憶装置に格納された各種プログラムを、例えばＯＳＧｉフレームワークの技術を利用してＨＧＷ１３１からの要求により配信するプログラムが実装されている。また、不揮発性記憶装置には、ＨＧＷ１３１で実行されるプログラムが記憶される。 Although the hardware configuration of the program distribution apparatus 101 is not illustrated, the program distribution apparatus 101 includes at least one computer (including a CPU, a main memory, a nonvolatile storage device, an input device, an output device, a communication IF, and the like). Various programs that are read from the non-volatile storage device to the main memory and executed by the CPU are, for example, various programs stored in the non-volatile storage device distributed by request from the HGW 131 using the OSGi framework technology, for example. The program to be implemented is implemented. In addition, a program executed by the HGW 131 is stored in the nonvolatile storage device.

ゲートウェイ装置１０２のハードウェア構成は例示しないが、少なくとも１台以上のコンピュータ（ＣＰＵ、メインメモリ、不揮発性記憶装置、入力装置、出力装置、通信ＩＦ等を含む）から構成されている。不揮発性記憶装置からメインメモリに読み出され、ＣＰＵによって実行される各種プログラムは、例えばＨＧＷ１３１とインターネット１２０との間で各種インターネットプロトコルに従ったデータの送受を仲介するプログラムが実装されている。 Although the hardware configuration of the gateway device 102 is not illustrated, the gateway device 102 includes at least one computer (including a CPU, a main memory, a nonvolatile storage device, an input device, an output device, a communication IF, and the like). Various programs that are read from the non-volatile storage device to the main memory and executed by the CPU are implemented with, for example, programs that mediate transmission / reception of data according to various Internet protocols between the HGW 131 and the Internet 120.

情報家電１３５のハードウェア構成は例示しないが、少なくとも１台以上のコンピュータ（ＣＰＵ、メインメモリ、不揮発性記憶装置、入力装置、出力装置、通信ＩＦ等を含む）から構成されている。不揮発性記憶装置からメインメモリに読み出され、ＣＰＵによって実行される各種プログラムは、例えばローカルネットワーク１３２を経由してＨＧＷ１３１を介してインターネット１２０上の各種サーバ装置と接続し、サーバ装置が提供する各種サービスを実現するプログラムが実装されている。 Although the hardware configuration of the information home appliance 135 is not illustrated, it is configured from at least one computer (including a CPU, a main memory, a nonvolatile storage device, an input device, an output device, a communication IF, and the like). Various programs that are read from the non-volatile storage device to the main memory and executed by the CPU are connected to various server devices on the Internet 120 via the HGW 131 via the local network 132, for example, and are provided by the server devices. A program that implements the service is implemented.

センサ装置１３４のハードウェア構成は例示しないが、少なくとも１台以上のコンピュータ（ＣＰＵ、メインメモリ、不揮発性メモリ、入力装置、出力装置、通信ＩＦ等を含む）から構成されている。不揮発性メモリからメインメモリに読み出され、ＣＰＵによって実行される各種プログラムは、例えばローカルネットワーク１３２を経由してＨＧＷ１３１を介してインターネット１２０上の各種サーバ装置と接続し、サーバ装置が提供する各種サービスを実現するために必要な温度情報や位置情報など各種情報を取得して送信するプログラムが実装されている。 Although the hardware configuration of the sensor device 134 is not illustrated, the sensor device 134 includes at least one computer (including a CPU, a main memory, a nonvolatile memory, an input device, an output device, a communication IF, and the like). Various programs read from the nonvolatile memory to the main memory and executed by the CPU are connected to various server devices on the Internet 120 via the HGW 131 via the local network 132, for example, and various services provided by the server device A program for acquiring and transmitting various information such as temperature information and position information necessary for realizing the above is installed.

次に、本発明の一実施の形態である異常監視装置１０３とＨＧＷ１３１の各部の動作を図１６，図１７乃至図２２のフローチャート図と図３１のシーケンス図を用いて説明する。なお、図３は図１６の主要な機能を図示したものであり必要に応じて括弧を付して参照する。 Next, the operation of each part of the abnormality monitoring apparatus 103 and the HGW 131 according to an embodiment of the present invention will be described with reference to the flowcharts of FIGS. 16, 17 to 22, and the sequence diagram of FIG. FIG. 3 illustrates the main functions of FIG. 16 and is referred to with parentheses as necessary.

図１７乃至図２２は、図１６で説明する主要な処理のフローチャート図であり、図１７は、学習処理（６１０）の動作を例示するフローチャート図である。図１８は、稼働構成登録処理（６３０）の動作を例示するフローチャート図である。図１９は、しきい値学習処理（６４０）の動作を例示するフローチャート図である。図２０は、しきい値更新処理（ステップＳ２３０）の動作を例示するフローチャート図である。図２１は、他のしきい値更新処理（ステップＳ２３０）の動作を例示するフローチャート図である。図２２は、異常判定処理（６２０）の動作を例示するフローチャート図である。図３１は、図１６の動作の流れを例示するシーケンス図である。 FIGS. 17 to 22 are flowcharts of main processes described in FIG. 16, and FIG. 17 is a flowchart illustrating the operation of the learning process (610). FIG. 18 is a flowchart illustrating the operation of the operation configuration registration process (630). FIG. 19 is a flowchart illustrating the operation of the threshold learning process (640). FIG. 20 is a flowchart illustrating the operation of the threshold update process (step S230). FIG. 21 is a flowchart illustrating the operation of another threshold value update process (step S230). FIG. 22 is a flowchart illustrating the operation of the abnormality determination process (620). FIG. 31 is a sequence diagram illustrating the operation flow of FIG.

図２３乃至図３０は、図１６において各部で記憶される主要な情報の一例を例示したものである。図２３は、ハード構成情報１８３を例示する図である。図２４は、アプリ情報１８２を例示する図である。図２５は、顧客情報１８１を例示する図である。図２６は、稼働構成データ８５の一例を例示する図である。図２７は、しきい値データ７０を例示する図である。図２８は、学習データ４０を例示する図である。図２９は、機器稼働情報１５５を例示する図である。図３０は、稼働情報データ２０を例示する図である。 23 to 30 illustrate an example of main information stored in each unit in FIG. FIG. 23 is a diagram illustrating hardware configuration information 183. FIG. 24 is a diagram illustrating the application information 182. As illustrated in FIG. FIG. 25 is a diagram illustrating customer information 181. FIG. 26 is a diagram illustrating an example of the operation configuration data 85. FIG. 27 is a diagram illustrating threshold data 70. FIG. 28 is a diagram illustrating the learning data 40. FIG. 29 is a diagram illustrating device operation information 155. FIG. 30 is a diagram illustrating the operation information data 20.

ＨＧＷ１３１において、稼働アプリ情報１４０は、予め登録されたアプリケーションを記憶し、実行手段１４５は、稼働アプリ情報１４０に登録されたアプリケーションを実行するアプリケーション実行処理（６００）を行い、稼働情報収集手段１５０は、実行手段１４５で実行されたアプリケーション実行にともなう状態の変化、例えば物理メモリ使用量の変動量、実行スレッド数、割り込みの頻度などの動的な稼働情報を収集し、機器稼働情報１５５に記憶する、稼働情報収集処理（６０５）を行う。 In the HGW 131, the operation application information 140 stores an application registered in advance, the execution unit 145 performs an application execution process (600) for executing the application registered in the operation application information 140, and the operation information collection unit 150 Collecting dynamic operation information such as changes in the state associated with the execution of the application executed by the execution unit 145, for example, the amount of change in physical memory usage, the number of execution threads, and the frequency of interrupts, and stores them in the device operation information 155 Then, operation information collection processing (605) is performed.

機器構成管理手段１６０は、機器のユーザ情報、ハードウェア構成、ソフトウェア構成、稼働アプリ情報１４０に登録されたアプリケーションなどの静的な稼働構成を収集し、機器構成情報１６５に記憶する、稼働構成収集処理（６０７）を行う。 The device configuration management unit 160 collects a static operation configuration such as an application registered in the user information, hardware configuration, software configuration, and operation application information 140 of the device and stores it in the device configuration information 165. Processing (607) is performed.

異常対策実行手段１７０は、アプリケーション実行時の異常が検知された時に、異常情報１７５を受信し、例えば機器の再起動や異常源と予想されるアプリケーションの実行を停止する等の異常対策処理（６５０）を実行する。 The abnormality countermeasure execution means 170 receives the abnormality information 175 when an abnormality is detected during execution of the application, and performs abnormality countermeasure processing (650, for example, restarting the device or stopping execution of the application that is expected to be an abnormality source). ).

異常監視装置１０３において、稼働情報取得手段１５は、必要に応じてＨＧＷ１３１から機器稼働情報１５５を受信し、稼働情報データ２０に蓄積する。 In the abnormality monitoring apparatus 103, the operation information acquisition unit 15 receives the device operation information 155 from the HGW 131 as necessary and accumulates it in the operation information data 20.

構成情報登録処理（６３０）において、学習手段３０は、稼働情報データ２０に蓄積された稼働情報が統計的学習に必要な量になるまで待機し（ステップＳ１００）、稼働情報が十分な量となった時、データマイニング等の統計的手法を用いて、蓄積された稼働情報データ２０の特徴ベクトルを複数のクラスタに分類し、クラスタ毎の平均、標準偏差、クラスタしきい値からなる新しい統計データを作成し（ステップＳ１１０）、新しい統計データで学習データ４０を更新する（ステップＳ１２０）。 In the configuration information registration process (630), the learning unit 30 waits until the operation information accumulated in the operation information data 20 reaches an amount necessary for statistical learning (step S100), and the operation information becomes a sufficient amount. When a statistical method such as data mining is used, the feature vectors of the accumulated operation information data 20 are classified into a plurality of clusters, and new statistical data including the average, standard deviation, and cluster threshold value for each cluster is obtained. Create (step S110), and update the learning data 40 with new statistical data (step S120).

稼働構成取得手段１６は、必要に応じてＨＧＷ１３１の機器稼働構成１６５を取得する。 The operation configuration acquisition unit 16 acquires the device operation configuration 165 of the HGW 131 as necessary.

稼働構成登録処理（６３０）において、管理手段１８０は、機器稼働構成１６５と、予め登録した顧客情報１８１、アプリ情報１８２，ハード構成情報１８３との照合を行い、稼働構成データ８５に未登録な機器稼働構成１６５が取得されるまで待機し（ステップＳ１３０）、未登録の機器稼働構成１６５を稼働構成データ８５に追加登録する（ステップＳ１４０）。 In the operation configuration registration process (630), the management unit 180 collates the device operation configuration 165 with the previously registered customer information 181, application information 182, and hardware configuration information 183, and unregistered devices in the operation configuration data 85. The system waits until the operating configuration 165 is acquired (step S130), and additionally registers the unregistered device operating configuration 165 in the operating configuration data 85 (step S140).

しきい値学習処理（６４０）において、しきい値学習手段９０は、学習手段３０による学習が実行されるまで待機し（ステップＳ２００）、学習手段３０による学習が実行されると、学習手段３０における稼働構成毎に稼働情報の学習量を計測し（ステップＳ２２０）、学習量に応じたしきい値更新処理を実行する（ステップＳ２３０）。しきい値データ７０の更新が必要な全てのしきい値が更新されるまでステップＳ２２０とＳ２３０の処理を繰り返す（ステップＳ２１０）。 In the threshold value learning process (640), the threshold value learning means 90 waits until learning by the learning means 30 is executed (step S200), and when learning by the learning means 30 is executed, The learning amount of the operation information is measured for each operation configuration (step S220), and threshold value update processing corresponding to the learning amount is executed (step S230). Steps S220 and S230 are repeated until all threshold values that require updating of threshold data 70 are updated (step S210).

しきい値更新処理（ステップＳ２３０）の一例を図２０のフローチャート図を用いて説明する。しきい値更新処理（ステップＳ２３０）は、例えば、予め設定した学習量と比較し（ステップＳ３００）、学習量が設定した値より大きい時はしきい値を基準値の１とし、学習量が小さい時は予め設定した１よりも大きいしきい値Ｔｈ_maxに設定し、設定したしきい値によりしきい値データ７０を更新する（ステップ３３０）。本動作によりしきい値は、図１１に例示するように設定される。 An example of the threshold update process (step S230) will be described with reference to the flowchart of FIG. The threshold update process (step S230) is compared with, for example, a preset learning amount (step S300). When the learning amount is larger than the set value, the threshold is set to 1 as a reference value, and the learning amount is small. The time is set to a threshold value Th _max larger than a preset value 1, and the threshold value data 70 is updated with the set threshold value (step 330). With this operation, the threshold value is set as illustrated in FIG.

他のしきい値更新処理（ステップＳ２３０）の一例を図２１のフローチャート図を用いて説明する。しきい値更新処理（ステップＳ２３０）は、例えば、学習量を引数とするしきい値を求める関数式により仮のしきい値を算出し（ステップＳ３４０）、仮のしきい値が基準値１以下の場合は、しきい値を基準値の１に設定し（Ｓ３６０），それ以外の場合は、仮のしきい値をしきい値とし（ステップＳ３７０）、設定したしきい値によりしきい値データ７０を更新する（ステップＳ３８０）。本動作によりしきい値は、図１２に例示されるように設定される。 An example of another threshold value update process (step S230) will be described with reference to the flowchart of FIG. In the threshold value update process (step S230), for example, a temporary threshold value is calculated by a function expression for obtaining a threshold value using the learning amount as an argument (step S340), and the temporary threshold value is equal to or less than the reference value 1 In this case, the threshold value is set to the reference value 1 (S360). In other cases, the temporary threshold value is set as the threshold value (step S370). 70 is updated (step S380). With this operation, the threshold value is set as illustrated in FIG.

異常判定処理（６２０）の一例を図２２のフローチャート図を用いて説明する。異常判定処理（６２０）において、異常度解析手段５０は、稼働情報取得手段１５が機器稼働情報を取得するまで待機し（ステップＳ４００）、機器稼働情報から１件ずつ順次稼働情報を取得し（ステップＳ４０５）、取得する稼働情報が無くなればステップＳ４００に戻る。取得した稼働情報から特徴ベクトル抽出し、特徴ベクトルと学習データ４０に登録された複数のクラスタの平均との統計的距離を異常度として求め（ステップＳ４１０）、異常度が最小となる異常度と、異常度の属するクラスタを求める（ステップＳ４２０）。 An example of the abnormality determination process (620) will be described with reference to the flowchart of FIG. In the abnormality determination process (620), the abnormality level analysis unit 50 stands by until the operation information acquisition unit 15 acquires the device operation information (step S400), and sequentially acquires operation information one by one from the device operation information (step S400). S405) If there is no operation information to be acquired, the process returns to step S400. A feature vector is extracted from the acquired operation information, a statistical distance between the feature vector and an average of a plurality of clusters registered in the learning data 40 is obtained as an abnormality level (step S410), and an abnormality level at which the abnormality level is minimized, A cluster to which the degree of abnormality belongs is obtained (step S420).

異常判定手段６０は、学習データ４０に登録された異常度の属するクラスタのクラスタしきい値より異常度の正規化を行い正規化された異常度を求め（ステップＳ４３０）、しきい値データ７０から異常度を求めた稼働情報を取得したＨＧＷ１３１と同一の稼働構成におけるしきい値を取得し（ステップＳ４４０）、前記正規化された異常度と前記しきい値とを比較し（ステップ４５０）、正規化された異常度がしきい値より大きいのであれば、異常と判定する（ステップＳ４６０）。 The abnormality determination means 60 normalizes the abnormality degree from the cluster threshold value of the cluster to which the abnormality degree registered in the learning data 40 belongs to obtain the normalized abnormality degree (step S430). A threshold value in the same operating configuration as the HGW 131 that acquired the operation information for which the degree of abnormality is obtained is acquired (step S440), and the normalized abnormality level is compared with the threshold value (step 450). If the normalized abnormality degree is larger than the threshold value, it is determined that there is an abnormality (step S460).

一実施の形態において、異常対策管理手段６５は、異常判定手段６０で異常と判定された場合、異常と判断された稼働情報含む機器稼働情報１５５を送信したＨＧＷ１３１に対し、異常の種類に応じた対策方法１７５を送信する。 In one embodiment, the abnormality countermeasure management unit 65 responds to the type of abnormality to the HGW 131 that has transmitted the device operation information 155 including the operation information determined to be abnormal when the abnormality determination unit 60 determines that there is an abnormality. The countermeasure method 175 is transmitted.

また、図１６は、１台のＨＧＷ１３１と１台の異常監視装置１０３からなる構成を例示したが、ＨＧＷの数を制限するのもではない。また、異常監視装置１０３は、本発明の一実施の形態における、学習処理（６００）、稼働構成登録処理（６３０）、しきい値学習処理（６４０）、異常判定処理（６２０）を複数のハードウェアで分散して実行することを制限するものではない。 FIG. 16 exemplifies a configuration including one HGW 131 and one abnormality monitoring apparatus 103, but the number of HGWs is not limited. In addition, the abnormality monitoring apparatus 103 performs a learning process (600), an operation configuration registration process (630), a threshold learning process (640), and an abnormality determination process (620) according to an embodiment of the present invention. It does not restrict the execution in a distributed manner.

また、図１６は、異常判定処理（６２０）を異常監視装置１０３で行うことを例示したが、例えば、図３２の他の構成における動作の流れを例示するシーケンス図に示すように、学習データ４０としきい値データ７０をＨＧＷ１３１に送信することにより、異常判定処理（６２０）をＨＧＷ１３１で実行してもよい。本発明は、各処理の配置を制限するものではない。 16 illustrates that the abnormality determination process (620) is performed by the abnormality monitoring apparatus 103. For example, as illustrated in a sequence diagram illustrating the flow of operations in another configuration of FIG. By transmitting the threshold data 70 to the HGW 131, the abnormality determination process (620) may be executed by the HGW 131. The present invention does not limit the arrangement of each process.

また、一実施の形態において、学習手段３０において、学習データ４０を作成する方法を例示したが、学習データ４０を作成する方法を制限するものではない。 Further, in the embodiment, the method of creating the learning data 40 in the learning unit 30 is exemplified, but the method of creating the learning data 40 is not limited.

また、一実施の形態において、学習データ４０は、学習手段３０により新しい学習データで学習データを更新する方法を例示したが、学習手段３０は、学習データ４０を参照して学習することも可能であり、また、予め新しい稼働構成に対して別の手段で予め学習データを作成し、学習データ４０を更新することも、もちろん可能である。 In the embodiment, the learning data 40 is exemplified by a method of updating the learning data with new learning data by the learning means 30, but the learning means 30 can also learn by referring to the learning data 40. In addition, it is of course possible to previously create learning data and update the learning data 40 in advance by another means for a new operating configuration.

また、一実施の形態において、異常度解析手段５０において、異常度を求める方法を例示したが、異常度を求める方法を制限するものではない。 Further, in the embodiment, the method for obtaining the degree of abnormality in the abnormality degree analyzing means 50 is exemplified, but the method for obtaining the degree of abnormality is not limited.

また、一実施の形態において、しきい値学習手段９０において、しきい値の値を求める方法を例示したが、しきい値の値を求める方法を制限するものではない。 In the embodiment, the threshold learning means 90 has exemplified the method for obtaining the threshold value, but the method for obtaining the threshold value is not limited.

また、一実施の形態において、クラスタ範囲を求める方法を例示したがクラスタ範囲を求める方法を制限するものではない。 Further, in the embodiment, the method for obtaining the cluster range is illustrated, but the method for obtaining the cluster range is not limited.

また、一実施の形態において、異常度を正規化してしきい値と比較する方法を例示したが、これに限定されず、稼働構成のしきい値と求めた異常度が属するクラスタのクラスタしきい値に応じて、異常度のしきい値を設定してもよい。 Further, in the embodiment, the method of normalizing the degree of abnormality and comparing it with the threshold is exemplified, but the present invention is not limited to this, and the cluster threshold of the cluster to which the threshold of the operating configuration and the obtained degree of abnormality belong. Depending on the value, a threshold value of the degree of abnormality may be set.

また、一実施の形態において、図２６に例示する稼働構成の分類は、ハード構成と登録アプリの組合せを例示したが、これに限定されず、例えばハード構成と稼働中のアプリの組合せを単位とする稼働構成でしきい値を設定してもよい。 In the embodiment, the classification of the operation configuration illustrated in FIG. 26 illustrates the combination of the hardware configuration and the registered application. However, the present invention is not limited to this. For example, the combination of the hardware configuration and the active application is a unit. The threshold value may be set in the operating configuration.

以上、本発明者によってなされた発明を実施の形態に基づき具体的に説明したが、本発明は前記実施の形態に限定されるものではなく、その要旨を逸脱しない範囲で種々変更可能であることはいうまでもない。 As mentioned above, the invention made by the present inventor has been specifically described based on the embodiment. However, the present invention is not limited to the embodiment, and various modifications can be made without departing from the scope of the invention. Needless to say.

以上の説明から明らかなように、機器の運用中に機器から収集した稼働情報を使って学習手段３０により学習モデルを学習できるので、予め試験環境では検出できない異常の検出を学習した学習モデルを用いて行うことができ、さらに、しきい値学習手段９０を設けることにより、稼働構成毎の学習量に応じて稼働構成毎のしきい値を設定することにより、異常判定手段６０において異常度解析手段５０が出力する異常度が異常か否か判断するためのしきい値を、異常度を求めた稼働情報を出力した機器と同一の稼働構成を持つ機器から取得した稼働情報を使った学習量に応じて高く設定することができるので、新設、機能の追加、削除、更新などによる機器の構成が変更されても、学習済みの学習モデルを使った異常の判定を誤検出する確率を低減することが可能になる。 As is clear from the above description, since the learning model 30 can learn the learning model using the operation information collected from the device during the operation of the device, the learning model that has previously learned the detection of an abnormality that cannot be detected in the test environment is used. Furthermore, by providing threshold value learning means 90 and setting a threshold value for each operation configuration according to the learning amount for each operation configuration, the abnormality determination means 60 performs abnormality degree analysis means. The threshold for determining whether or not the abnormality level output by 50 is abnormal is set to the learning amount using the operation information acquired from the device having the same operation configuration as the device that outputs the operation information for which the abnormality level is obtained. Therefore, even if the device configuration is changed due to new installation, function addition, deletion, update, etc., it is likely that anomaly judgment using a learned learning model will be erroneously detected. It is possible to reduce the.

１５稼働情報取得手段
１６稼働構成取得手段
２０稼働情報データ
３０学習手段
４０学習データ
５０異常度解析手段
６０異常判定手段
７０しきい値データ
８０管理手段
８５稼働構成データ
９０しきい値学習手段
６１０学習処理
６２０異常判定処理
６３０稼働構成登録処理
６４０しきい値学習処理 15 operation information acquisition means 16 operation configuration acquisition means 20 operation information data 30 learning means 40 learning data 50 abnormality degree analysis means 60 abnormality determination means 70 threshold data 80 management means 85 operation configuration data 90 threshold learning means 610 learning processing 620 Abnormality determination processing 630 Operation configuration registration processing 640 Threshold value learning processing

Claims

An abnormality detection method for monitoring an operating state of a plurality of devices and detecting an abnormality of the device,
A first learning step of collecting operating information indicating the operating state from the device, learning the operating information, and storing a learning result;
The threshold for each operating configuration according to the learning amount of the operating information collected from the device corresponding to the operating configuration in the first learning step by collecting the operating configuration consisting of the operating configuration of the device A second learning step of learning a value and storing a threshold corresponding to the operating configuration;
Analyzing the operation information collected from the device in comparison with the learning result, and analyzing the output as the degree of abnormality;
Comparing the abnormality level with a threshold value corresponding to the same operating configuration as the device for which the abnormality level was obtained in the analysis step, and determining whether the abnormality level indicates a value of whether or not the abnormality is abnormal An abnormality detection method comprising: a determination step.

In claim 1,
The abnormality detection method characterized in that the second learning step learns the threshold value in synchronization with the first learning step.

In claim 1,
The abnormality detection method characterized in that the analysis step outputs an abnormality degree asynchronously with the first learning step and the second learning step.

An information processing system comprising a plurality of devices and an abnormality monitoring device that monitors an operating state of the devices and detects an abnormality of the devices,
A first learning processing unit that collects operation information indicating the operation state from the device, learns the operation information, and stores a learning result;
The threshold for each operating configuration according to the learning amount of the operating information collected from the device corresponding to the operating configuration in the first learning step by collecting the operating configuration consisting of the operating configuration of the device A second learning processing unit for learning a value and storing a threshold corresponding to the operation configuration;
Analyzing the operation information collected from the device in comparison with the learning result, and outputting the content as an abnormality level;
Comparing the abnormality level with a threshold value corresponding to the same operating configuration as the device for which the abnormality level was obtained in the analysis step, and determining whether the abnormality level indicates a value of whether or not the abnormality is abnormal An information processing system comprising a determination processing unit in the abnormality monitoring device.

In claim 4,
The information processing system, wherein the second learning processing unit learns the threshold value in synchronization with the first learning processing unit.

In claim 4,
The information processing system, wherein the analysis processing unit outputs an abnormality degree asynchronously with the first learning processing unit and the second learning processing unit.

An information processing system comprising a plurality of devices and an abnormality monitoring device that monitors an operating state of the devices and detects an abnormality of the devices,
A first learning processing unit that collects operation information indicating the operation state from the device, learns the operation information, and stores a learning result;
The threshold for each operating configuration according to the learning amount of the operating information collected from the device corresponding to the operating configuration in the first learning step by collecting the operating configuration consisting of the operating configuration of the device A second learning processing unit that learns a value and stores a threshold value corresponding to the operating configuration;
Analyzing the operation information collected from the device in comparison with the learning result, and outputting the content as an abnormality level;
Comparing the abnormality level with a threshold value corresponding to the same operating configuration as the device for which the abnormality level was obtained in the analysis step, and determining whether the abnormality level indicates a value of whether or not the abnormality is abnormal An information processing system comprising a determination processing unit in the device.

In claim 7,
The information processing system, wherein the second learning processing unit learns the threshold value in synchronization with the first learning processing unit.

In claim 7,
The information processing system, wherein the analysis processing unit outputs an abnormality degree asynchronously with the first learning processing unit and the second learning processing unit.