JP6538615B2

JP6538615B2 - Abnormality detection device, abnormality detection method and abnormality detection program

Info

Publication number: JP6538615B2
Application number: JP2016109033A
Authority: JP
Inventors: 山中　章裕; 章裕山中; 中村　吉孝; 吉孝中村; 祥史武市; 弘仁丸山; 慶一郎中川; 明典松尾
Original assignee: Nippon Telegraph and Telephone Corp; NTT TechnoCross Corp
Current assignee: Nippon Telegraph and Telephone Corp; NTT TechnoCross Corp
Priority date: 2016-05-31
Filing date: 2016-05-31
Publication date: 2019-07-03
Anticipated expiration: 2036-05-31
Also published as: JP2017215765A

Description

本発明は、異常検知装置、異常検知方法及び異常検知プログラムに関する。 The present invention relates to an abnormality detection device, an abnormality detection method, and an abnormality detection program.

多数のサーバやルータ等の機器で構成されるシステムでは、各装置のハードディスクやメモリ等のハードウェア異常を検知することが運用上必要になる。情報・通信機器に限らず、多数のセンサを持つ産業機械や自動車などにおいても、温度や加速度等のセンサデータから機械の異常検知をしたいという需要が存在する。また、機器類だけでなく、例えばクレジットカードの利用状況を分析することによる不正使用の検知、情報セキュリティにおけるＤＤｏＳ（Distributed Denial of Service）攻撃の検知分野やマルウェアの検知分野でも、異常検知技術が用いられている。 In a system composed of a large number of servers and devices such as routers, it is necessary for operation to detect hardware abnormalities such as hard disks and memories of each device. Not only in information and communication equipment but also in industrial machines and automobiles having a large number of sensors, there is a demand for detecting machine abnormalities from sensor data such as temperature and acceleration. In addition to equipments, anomaly detection technology is used, for example, in detection of unauthorized use by analyzing credit card usage, detection of Distributed Denial of Service (DDoS) attacks in information security, and detection of malware. It is done.

最も簡単な異常検知の方法として、人間がデータを直接視認することで、データにおける異常を検知するという方法が考えられる。しかし、多数のデータがどのような傾向を示した時に異常であるかを把握するには、相当な習熟が必要になるため、人間の目で異常判定することは困難である。仮に人間の目で異常検知できるとしても、特定の習熟した作業者に依存してしまい、その作業者がいなくなれば異常検知はできなくなってしまう。以上を鑑みると、異常検知の仕組みを何らかの方法で機械的に実行することが必要になる。 As the simplest method of anomaly detection, a method is conceivable in which an anomaly in data is detected by human being directly viewing the data. However, in order to grasp what kind of tendency a large number of data show when it is abnormal, considerable learning is required, so it is difficult for the human eye to make an abnormal judgment. Even if an abnormality can be detected by human eyes, it depends on a specific trained worker, and if the worker disappears, the abnormality can not be detected. In view of the above, it is necessary to mechanically execute the mechanism of anomaly detection in some way.

このような機械的な異常検知の方法として最も簡易なものに、データの値域に閾値を設け、閾値を超えた際に異常であると判定する方法がある。この方法は、適切に機能する場合があるものの、一般的には、妥当な閾値を決定することが困難である。これは、閾値を大きくすると、本来異常として発見したい事象をとり損ねる可能性がある一方で、閾値を小さくすると異常でないのに異常であると判定してしまう事象が増えるためである。 As the simplest method of such mechanical abnormality detection, there is a method of providing a threshold value in the data range and determining an abnormality when the threshold value is exceeded. Although this method may work properly, it is generally difficult to determine a reasonable threshold. This is because increasing the threshold may cause an event that the user originally intended to detect as an anomaly to fail, while decreasing the threshold increases events that are determined to be abnormal although they are not abnormal.

そこで、それまでに出現していないパターンを発見し、異常であると判定する異常検知方法として、広く利用されている方法にＬＯＦが提案されている（例えば、非特許文献１参照）。ＬＯＦは、データ空間内での局所的密度を計算する方法である。 Therefore, LOF has been proposed as a widely used method as an abnormality detection method for detecting a pattern that has not appeared so far and determining that the pattern is abnormal (for example, see Non-Patent Document 1). LOF is a method of calculating local density in data space.

具体的には、ＬＯＦは、新たに得られたデータが、それまでに得られているデータの空間の中で密度の高い箇所に存在する場合は、異常度合いを表す数値を小さく出力する。言い換えると、ＬＯＦは、新たに得られたデータがそれまでにも得られているデータと類似するデータである場合、新たに得られたデータは、異常ではないと判定する。一方、新たに得られたデータが密度の低い箇所に存在する場合、異常度合いを表す数値を大きく出力する。言い換えると、ＬＯＦは、新たに得られたデータがそれまでに得られているデータと類似しないデータである場合、新たに得られたデータは、異常であると判定する。 Specifically, the LOF outputs a small numerical value indicating the degree of abnormality when newly obtained data is present at a high density location in the space of data obtained so far. In other words, the LOF determines that the newly obtained data is not abnormal if the newly obtained data is data similar to the data obtained so far. On the other hand, when the newly obtained data is present at a low density portion, the numerical value representing the degree of abnormality is largely output. In other words, the LOF determines that the newly obtained data is abnormal if the newly obtained data is not similar to the data obtained so far.

また、データ間に何らかの関係性がある場合に、この関係性が崩れたことから異常を検知する方法がある。この方法として、複数種類のデータ間の相関関係が維持されているか否かを分析し、その結果から異常を発見する方法が提案されている。この方法では、データを２組ずつ選び、例えば単回帰により一方から他方を予測する関数を構築し、その予測値が観測値から一定以上離れていることによって相関関係が破壊されているとみなしている。 In addition, there is a method of detecting an abnormality from the fact that the relationship is broken when there is any relationship between data. As this method, a method of analyzing whether or not the correlation between plural types of data is maintained, and finding an abnormality from the result has been proposed. In this method, two sets of data are selected, for example, a function that predicts one from the other by simple regression is constructed, and the correlation is considered to be broken because the predicted value is separated from the observed value by a certain amount or more. There is.

M. Breunig, H. Kriegel, R. Ng， and J. Sander, “ＬＯＦ: Identifying Density-Based Local Outliers”, SIGMOD, Volume 29 Issue 2, 93-104, 2000M. Breunig, H. Kriegel, R. Ng, and J. Sander, “LOF: Identifying Density-Based Local Outliers”, SIGMOD, Volume 29 Issue 2, 93-104, 2000

しかしながら、ＬＯＦでは、データ間に相関がある場合に、相関に従っているデータであって本来正常であるデータであっても、データの集合から外れているデータを異常として検知する、という問題がある。具体的に、図１３〜図１５を参照して、従来技術の問題を説明する。図１３〜図１５は、従来技術に係る異常検知方法を説明するための図である。図１３〜図１５は、データとしてＸ及びＹの組が与えられたとして、座標平面上にその組をプロットしたものである。 However, in the LOF, there is a problem in that, when there is a correlation between data, even if the data is data that follows the correlation and is originally normal, data that is out of the data set is detected as abnormal. Specifically, the problems of the prior art will be described with reference to FIGS. 13 to 15 are diagrams for explaining an abnormality detection method according to the prior art. FIGS. 13 to 15 are plots of the sets on the coordinate plane, assuming that the sets of X and Y are given as data.

図１３では、Ｘ及びＹの組に対する点として、平均ベクトルが（３，３）であり、共分散行列が［［１，０．９］、［０．９，１］］の２次元正規分布に従う点を２００点（白丸）プロットしている。また、右上に位置する点Ｐｂは、(６．５，６．５)の点である。また、左上の点Ｐｒは、（２，５）の点である。 In FIG. 13, two-dimensional normal distributions with mean vectors of (3, 3) and covariance matrices of [[1, 0.9], [0.9, 1]] as points for a set of X and Y 200 points (open circles) are plotted according to points. Further, a point Pb located at the upper right is a point of (6.5, 6.5). The upper left point Pr is the point (2, 5).

この図１３では、次のような、データ間に相関が見られる場合をイメージして、ＸとＹとの組に対する点をプロットしている。例えば、Ｘ及びＹが、アプリケーションサーバＡとアプリケーションサーバＢとのＣＰＵ（Central Processing Unit）使用率をそれぞれ表しているとする。そして、アプリケーションサーバＡとアプリケーションサーバＢとが、Ｗｅｂサーバからのリクエストを均等に受け付けている場合を例とする。 In this FIG. 13, points are plotted against a set of X and Y by imaging the case where there is a correlation between data as follows. For example, it is assumed that X and Y represent central processing unit (CPU) usage rates of the application server A and the application server B, respectively. A case where the application server A and the application server B equally receive requests from the web server is taken as an example.

この場合、Ｗｅｂサーバへのアクセス数が増加し、アプリケーションサーバへのリクエストが増加すると、Ｘ及びＹがともに上昇すると考えられる。逆に、Ｗｅｂサーバへのアクセス数が減少すると、Ｘ及びＹはともに下降すると考えられる。図１３において、白丸は、正常な状態のデータの例である。そして、点Ｐｂは、相関関係を維持したままで、それまでには存在していなかった値をとった場合の例である。また、点Ｐｒは、相関関係が崩れた場合の例である。以下、正常な状態のデータである白丸が先に与えられ、続いて異常検知対象データとして、点Ｐｂ及び点Ｐｒが与えられた状況を考える。 In this case, when the number of accesses to the web server increases and the requests to the application server increase, it is considered that both X and Y rise. Conversely, if the number of accesses to the web server decreases, it is thought that both X and Y fall. In FIG. 13, white circles are examples of data in a normal state. And, the point Pb is an example in the case of taking a value which did not exist until then while maintaining the correlation. Further, the point Pr is an example in the case where the correlation is broken. In the following, it is assumed that a white circle, which is data of a normal state, is given first, and then, a point Pb and a point Pr are given as abnormality detection target data.

このうち、点Ｐｂは、正常な状態において成り立つ相関関係を維持しているため、本来、異常でないと判定すべき場合がある。一方、点Ｐｒでは正常な状態において成り立つ相関関係には従っていないため、異常であると判定すべき場合がある。しかしながら、ＬＯＦではデータ間の関係性を考慮していないため、白丸の密度が低い点に存在する点Ｐｂ及び点Ｐｒは、いずれも異常度合いを表わす数値が大きく出力されてしまう。すなわち、異常であると判定すべきではない点Ｐｂにおいて、異常であると判定されてしまう問題がある。 Among these, since the point Pb maintains the correlation that holds in the normal state, it may be determined that it is not abnormal originally. On the other hand, since the point Pr does not follow the correlation established in the normal state, it may be determined as abnormal. However, since the LOF does not take into consideration the relationship between the data, the points Pb and Pr existing at the points where the density of the white circles is low will both output a large numerical value indicating the degree of abnormality. That is, there is a problem that the point Pb is determined to be abnormal at a point Pb that should not be determined to be abnormal.

一方、複数種類データ間の相関関係が維持されているか否かを分析し、その結果から異常を発見する方法では、図１４や図１５に示すように、データ間の関係性が複数あり、複雑な関係が見られる場合に異常を検知することが難しいという問題がある。 On the other hand, in the method of analyzing whether or not the correlation between plural types of data is maintained and finding an abnormality from the result, as shown in FIG. 14 and FIG. There is a problem that it is difficult to detect an abnormality when a certain relationship is observed.

例えば、図１４の白丸は、平均ベクトルが（３，３）、共分散行列が［［１，０．９９］，［０．９９，１］］の２次元正規分布に従う２００点（データ群Ｒｂ）と、これらを（π／５）だけ反時計回りに回転させた点（データ群Ｒｂ´）である。図１４の例では、データ間の関係性が２つあり、検知データをいずれの相関関係と比較すればよいか判断が難しい。そして、この図１４では、白丸で表示された点の相関係数は「０．０７」と小さい。したがって、関係性が崩れたことから異常を検知する方法でも、図１４の例では、正常な状態のデータに限っても相関が見られないと判断するため、異常検知を適切に実行することが難しい。 For example, the white circles in FIG. 14 have a mean vector of (3, 3) and a covariance matrix of 200 points according to a two-dimensional normal distribution of [[1, 0.99], [0.99, 1]] (data group Rb And a point (data group Rb ') obtained by rotating them counterclockwise by (.pi. / 5). In the example of FIG. 14, there are two relationships between data, and it is difficult to determine which correlation the detected data should be compared with. And in this FIG. 14, the correlation coefficient of the point displayed by the white circle is as small as "0.07." Therefore, even in the method of detecting an abnormality because the relationship is broken, in the example shown in FIG. difficult.

そして、図１５では、正常な状態が、データで見ると３つの離れた群を構成している場合のイメージを示す。図１５中の白丸は正常な状態のデータである。点Ｐｇは、異常判定対象のデータである。また、直線Ｌｂは、正常な白丸のみから単回帰直線（具体的には、「Ｙ＝０．９１７５Ｘ＋０．１０８１」の関係を有する。）を計算し、図１５中にプロットしたものである。図１５中、左下の白丸の群（データ群Ｒｃ）は、Ｘ及びＹが、それぞれ平均が「０．５」、標準偏差が「０．１」の正規分布に従う乱数をとった１００点をプロットしたものである。右の直線Ｌｂの上の群（データ群Ｒｄ）は、Ｘは平均「３」、標準偏差「０．１」、Ｙは平均「４」、標準偏差「０．１」の正規分布に従う乱数をとった２０点をプロットしたものである。右の直線Ｌｂの下の群（データ群Ｒｅ）は、Ｘは平均「４」、標準偏差「０．１」、Ｙは平均「３」、標準偏差「０．１」の正規分布に従う乱数をとった２０点をプロットしたものである。なお、白丸の相関係数は「０．９２３１」という高い値をとっている。 And in FIG. 15, the normal state shows the image at the time of seeing in data, when it comprises three distant groups. White circles in FIG. 15 are data in a normal state. The point Pg is data as an abnormality determination target. Further, the straight line Lb is obtained by calculating a simple regression line (specifically, having a relationship of “Y = 0.9175X + 0.1081”) only from normal white circles, and plotting it in FIG. In FIG. 15, the lower left white circle group (data group Rc) plots 100 points in which X and Y each have a random distribution according to a normal distribution with an average of “0.5” and a standard deviation of “0.1”. It is The group above the straight line Lb on the right (data group Rd) is a random number that follows the normal distribution with an average "3", a standard deviation "0.1", and an average "4" and a standard deviation "0.1". It is a plot of the 20 points taken. The lower group (data group Re) on the right straight line Lb is a random number that follows the normal distribution with an average "4", a standard deviation "0.1", and an average "3" and a standard deviation "0.1". It is a plot of the 20 points taken. In addition, the correlation coefficient of the white circle has a high value of "0.9231".

ここで、図１５に示す点Ｐｇは、明らかに白丸の３つのデータ群Ｒｃ〜Ｒｄから大きく離れている。しかしながら、単回帰直線の直線Ｌｂには近い位置であるため、点Ｐｇと直線Ｌｂとの比較を行うだけでは、点Ｐｇが直線Ｌｂに対応する相関から崩れていると判断することはできない。すなわち、点Ｐｇは、正常な状態のデータから見れば離れた箇所に存在するものであるが、関係性が崩れたことから異常を検知する方法では、この点Ｐｇを異常であると検知することができない。 Here, the point Pg shown in FIG. 15 is clearly far away from the three white circle data groups Rc to Rd. However, since the position is close to the straight line Lb of the simple regression line, it can not be determined that the point Pg is broken from the correlation corresponding to the straight line Lb simply by comparing the point Pg and the straight line Lb. That is, although the point Pg exists in a distant place when viewed from data in a normal state, the method P for detecting an abnormality is that the point Pg is detected as an abnormality because the relationship is broken. I can not

本発明は、上記に鑑みてなされたものであって、データ間の関係性に基づいた検知対象データの異常検知を精度よく実行することができる異常検知装置、異常検知方法及び異常検知プログラムを提供することを目的とする。 The present invention has been made in view of the above, and provides an abnormality detection apparatus, an abnormality detection method, and an abnormality detection program that can accurately execute abnormality detection of detection target data based on the relationship between data. The purpose is to

上述した課題を解決し、目的を達成するために、本発明に係る異常検知装置は、データ間の関係性に基づいて、検知対象である検知データの、データ間の関係性からの乖離を表す乖離値ベクトルと、比較対象である比較データの、データ間の関係性からの乖離を表す乖離値ベクトルと、を計算する乖離値ベクトル計算部と、検知データの乖離値ベクトルの、比較データの乖離値ベクトルの集合からの離散度合を、異常を示す度合として計算する異常度計算部と、離散度合が所定の閾値を超えた場合に検知データは異常であることを判定する異常判定部と、を有する。 In order to solve the problems described above and to achieve the object, the abnormality detection device according to the present invention represents the deviation of the detection data to be detected from the relationship between the data based on the relationship between the data. Deviation value of the comparison data of the deviation value vector calculation unit that calculates deviation value vector and deviation value vector that represents deviation from the relationship between the comparison data to be compared and deviation value vector of the detection data An abnormality degree calculation unit that calculates the degree of discreteness from the set of value vectors as the degree of abnormality, and an abnormality determination unit that determines that the detected data is abnormal when the degree of discreteness exceeds a predetermined threshold value; Have.

本発明によれば、データ間の関係性に基づき、検知対象データの異常検知を精度よく実行することができる。 According to the present invention, abnormality detection of detection target data can be accurately performed based on the relationship between data.

図１は、実施の形態１に係る異常検知装置の構成の一例を示すブロック図である。FIG. 1 is a block diagram showing an example of the configuration of the abnormality detection device according to the first embodiment. 図２は、実施の形態１における処理対象のデータ構成の一例を示す図である。FIG. 2 is a diagram showing an example of a data configuration of a processing target in the first embodiment. 図３は、図１に示す乖離値ベクトル計算部が行う乖離値ベクトルの計算処理の例を示す図である。FIG. 3 is a diagram showing an example of the calculation process of the difference value vector performed by the difference value vector calculation unit shown in FIG. 図４は、図１に示す乖離値ベクトル計算部が行う乖離値ベクトルの計算処理の他の例を示す図である。FIG. 4 is a diagram showing another example of the calculation process of the difference value vector performed by the difference value vector calculation unit shown in FIG. 図５は、図１に示す異常度計算部による異常度計算処理を説明する図である。FIG. 5 is a diagram for explaining abnormality degree calculation processing by the abnormality degree calculation unit shown in FIG. 図６が、図１に示す異常度計算部による異常度計算処理の他の例を説明する図である。FIG. 6 is a diagram for explaining another example of abnormality degree calculation processing by the abnormality degree calculation unit shown in FIG. 図７は、図１に示す異常検知装置が実行する異常検知処理の処理手順を示すフローチャートである。FIG. 7 is a flowchart showing the procedure of the abnormality detection process performed by the abnormality detection device shown in FIG. 図８は、実施の形態１の異常検知処理を説明する図である。FIG. 8 is a diagram for explaining the abnormality detection process according to the first embodiment. 図９は、実施の形態３に係る異常度計算処理を説明する図である。FIG. 9 is a diagram for explaining abnormality degree calculation processing according to the third embodiment. 図１０は、実施の形態３に係る異常度計算処理を説明する図である。FIG. 10 is a diagram for explaining abnormality degree calculation processing according to the third embodiment. 図１１は、実施の形態３に係る異常度計算処理及び異常判定処理を説明する図である。FIG. 11 is a diagram for explaining abnormality degree calculation processing and abnormality determination processing according to the third embodiment. 図１２は、プログラムが実行されることにより、異常検知装置が実現されるコンピュータの一例を示す図である。FIG. 12 is a diagram illustrating an example of a computer in which an abnormality detection device is realized by execution of a program. 図１３は、従来技術に係る異常検知方法を説明するための図である。FIG. 13 is a diagram for explaining an abnormality detection method according to the prior art. 図１４は、従来技術に係る異常検知方法を説明するための図である。FIG. 14 is a diagram for explaining an abnormality detection method according to the prior art. 図１５は、従来技術に係る異常検知方法を説明するための図である。FIG. 15 is a view for explaining an abnormality detection method according to the prior art.

以下、図面を参照して、本発明の一実施の形態を詳細に説明する。なお、この実施の形態により本発明が限定されるものではない。また、図面の記載において、同一部分には同一の符号を付して示している。 Hereinafter, an embodiment of the present invention will be described in detail with reference to the drawings. The present invention is not limited by the embodiment. Further, in the description of the drawings, the same portions are denoted by the same reference numerals.

［実施の形態１］
まず、第一の実施形態について説明する。以下の実施形態では、第一の実施形態に係る異常検知装置の構成、異常検知装置による処理の流れを説明する。 First Embodiment
First, the first embodiment will be described. In the following embodiments, the configuration of the abnormality detection apparatus according to the first embodiment and the flow of processing by the abnormality detection apparatus will be described.

［異常検知装置の構成］
図１は、実施の形態１に係る異常検知装置１０の構成の一例を示すブロック図である。図１に示すように、第一の実施形態に係る異常検知装置１０は、通信処理部１１、制御部１２及び記憶部１３を有する。 [Configuration of abnormality detection device]
FIG. 1 is a block diagram showing an example of the configuration of the abnormality detection apparatus 10 according to the first embodiment. As shown in FIG. 1, the abnormality detection device 10 according to the first embodiment includes a communication processing unit 11, a control unit 12, and a storage unit 13.

通信処理部１１は、接続される端末装置２０との間でやり取りする各種情報に関する通信を制御する。例えば、通信処理部１１は、比較対象であるデータ、検知対象となるデータ、及び、検知対象となるデータに対する異常検知処理の要求を端末装置２０から受信する。また、例えば、通信処理部１１は、異常検知処理の処理結果を端末装置２０に対して送信する。 The communication processing unit 11 controls communication regarding various information exchanged with the terminal device 20 to be connected. For example, the communication processing unit 11 receives, from the terminal device 20, data to be compared, data to be detected, and a request for abnormality detection processing for data to be detected. Also, for example, the communication processing unit 11 transmits the processing result of the abnormality detection processing to the terminal device 20.

制御部１２は、各種の処理手順などを規定したプログラム及び所要データを格納するための内部メモリを有し、これらによって種々の処理を実行する。例えば、制御部１２は、ＣＰＵやＭＰＵ（Micro Processing Unit）などの電子回路である。制御部１２は、関係性推定部１２１、乖離値ベクトル計算部１２２、異常度計算部１２３及び異常判定部１２４を有する。 The control unit 12 has a program defining various processing procedures and the like, and an internal memory for storing required data, and executes various processing by these. For example, the control unit 12 is an electronic circuit such as a CPU or a micro processing unit (MPU). The control unit 12 includes a relationship estimation unit 121, a difference value vector calculation unit 122, an abnormality degree calculation unit 123, and an abnormality determination unit 124.

関係性推定部１２１は、データ間の関係性を推定し、データ間の関係性を示すパラメータを算出する。例えば、与えられたデータについて、データ間の関係性が式として与えられているものの、パラメータに相当するものが未定である場合に、パラメータを推定する。具体的には、データＸとデータＹとの関係性が、「Ｙ＝ａＸ＋ｂ」という単回帰であることは与えられているが、「ａ」及び「ｂ」が不明な場合に、関係性推定部１２１は、与えられたデータを基に「ａ」及び「ｂ」を推定する。この場合、関係性推定部１２１は、それを出力した機器等が正常な状態のデータ、言い換えると、異常な状態のデータを含まないデータを、パラメータ推定のために用いることが望ましい。 The relationship estimation unit 121 estimates the relationship between data, and calculates a parameter indicating the relationship between data. For example, for the given data, although the relationship between the data is given as an equation, the parameter is estimated when the equivalent of the parameter is undecided. Specifically, although it is given that the relationship between data X and data Y is a simple regression "Y = aX + b", relationship estimation is made when "a" and "b" are unknown. The part 121 estimates "a" and "b" based on the given data. In this case, it is desirable that the relationship estimation unit 121 use, for parameter estimation, data in a normal state of the device or the like that has output the data, that is, data that does not include abnormal state data.

乖離値ベクトル計算部１２２は、データ間の関係性に基づき、関係性からの乖離を表す乖離値ベクトルを計算する。実施の形態１では、乖離値ベクトル計算部１２２は、データ間の関係性に基づいて、検知対象である検知データの、データ間の関係性からの乖離を表す乖離値ベクトルを計算する。そして、乖離値ベクトル計算部１２２は、データ間の関係性に基づいて、比較対象である比較データの、データ間の関係性からの乖離を表す乖離値ベクトルを計算する。すなわち、乖離値ベクトル計算部１２２は、データ間に何らかの関係性が見られる場合に、その関係性からの乖離を示す値を、検知データ及び比較データについて計算する。なお、乖離値ベクトル計算部１２２は、データ間の乖離値ベクトルを、記憶部１３（後述）の乖離値ベクトル記憶部１３１に記憶させてもよい。また、乖離値ベクトル計算部１２２は、関係性推定部１２１が算出したデータ間の関係性を示すパラメータを、データ間の関係性に適用し、データ間の乖離値ベクトルを計算する。なお、このデータ間の関係性は、予め与えられたものであってもよい。 The divergence value vector calculation unit 122 calculates a divergence value vector representing the divergence from the relationship based on the relationship between the data. In the first embodiment, the divergence value vector calculation unit 122 calculates, based on the relationship between data, a divergence value vector representing a divergence from the relationship between data of detection data to be detected. Then, the divergence value vector calculation unit 122 calculates, based on the relationship between the data, a divergence value vector representing the divergence from the relationship between the comparison data to be compared. That is, when there is any relationship between the data, the difference value vector calculation unit 122 calculates a value indicating the difference from the relationship for the detection data and the comparison data. The divergence value vector calculation unit 122 may store the divergence value vector between the data in the divergence value vector storage unit 131 of the storage unit 13 (described later). Further, the difference value vector calculation unit 122 applies a parameter indicating the relationship between the data calculated by the relationship estimation unit 121 to the relationship between the data, and calculates a difference value vector between the data. The relationship between the data may be given in advance.

異常度計算部１２３は、検知データの乖離値ベクトルの、比較データの乖離値ベクトルの集合からの離散度合を計算する。具体的には、異常度計算部１２３は、検知データの乖離値ベクトルの、比較データの乖離値ベクトルの集合からの、空間的な距離や密度などに基づき、離散度合を計算する。この異常度計算部１２３が計算した離散度合は、異常を示す異常度として、異常判定部１２４（後述）における判定において用いられる。なお、異常度計算部１２３は、乖離値ベクトル記憶部１３１（後述）が記憶する乖離値ベクトルを用いて離散度合を計算してもよい。 The abnormality degree calculation unit 123 calculates the degree of discreteness of the divergence value vector of the detection data from the set of divergence value vectors of the comparison data. Specifically, the abnormality degree calculation unit 123 calculates the degree of discreteness based on the spatial distance, the density, and the like from the set of the divergent value vectors of the comparison data of the divergent value vector of the detected data. The degree of discreteness calculated by the abnormality degree calculating unit 123 is used in the determination in the abnormality determining unit 124 (described later) as the abnormality degree indicating the abnormality. Note that the abnormality degree calculation unit 123 may calculate the degree of discreteness using a divergence value vector stored in the divergence value vector storage unit 131 (described later).

異常判定部１２４は、異常度計算部１２３が計算した離散度合が所定の閾値を超えた場合に、検知データは異常であることを判定する。異常判定部１２４は、離散度合が所定の閾値以下である場合に、検知データは正常であることを判定する。異常判定部１２４による判定結果は、異常検知結果として、通信処理部１１を介して、例えば、端末装置２０に出力される。 The abnormality determination unit 124 determines that the detection data is abnormal when the degree of discreteness calculated by the abnormality degree calculation unit 123 exceeds a predetermined threshold. The abnormality determination unit 124 determines that the detection data is normal when the degree of discreteness is equal to or less than a predetermined threshold. The determination result by the abnormality determination unit 124 is output to, for example, the terminal device 20 via the communication processing unit 11 as an abnormality detection result.

記憶部１３は、ＲＡＭ（Random Access Memory）、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、又は、ハードディスク、光ディスク等の記憶装置によって実現され、異常検知装置１０を動作させる処理プログラムや、処理プログラムの実行中に使用されるデータなどが記憶される。記憶部１３は、乖離値ベクトル計算部１２２が計算した乖離値ベクトルを記憶する乖離値ベクトル記憶部１３１を有する。 The storage unit 13 is realized by a semiconductor memory device such as a random access memory (RAM) or a flash memory, or a storage device such as a hard disk or an optical disk, and a processing program for operating the abnormality detection apparatus 10 Data used during the execution of are stored. The storage unit 13 includes a divergence value vector storage unit 131 that stores the divergence value vector calculated by the divergence value vector calculation unit 122.

［処理対象のデータの例］
次に、異常検知装置１０における処理対象のデータの例について説明する。図２は、異常検知装置１０における処理対象のデータ構成の一例を示す図である。 [Example of data to be processed]
Next, an example of data to be processed in the abnormality detection device 10 will be described. FIG. 2 is a view showing an example of the data configuration of the processing target in the abnormality detection apparatus 10. As shown in FIG.

図２に示すように、例えば、検知データとして、「Ｘ^１」〜「Ｘ^Ｎ」というＮ種類のデータが与えられたとする。図２において、「ｘ^ｎ _ｍ」は、ｎ番目のデータのｍにおける観測値である。添字の「ｍ」は、観測された地点や時点を意味する。例えば、「Ｘ^１」〜「Ｘ^Ｎ」がユーザ「１」からユーザ「Ｎ」を示し、データの要素が商品「ｍ」の購入の有無を表す場合、添字が等しいデータは、同一商品の購入の有無を意味する。或いは、「Ｘ^１」〜「Ｘ^Ｎ」がサーバ「１」からサーバ「Ｎ」のＣＰＵ使用率を示し、データの要素が時点「ｍ」におけるＣＰＵ使用率を表す場合、添字が等しいデータは、観測時点が等しいことを意味する。 As shown in FIG. 2, for example, it is assumed that N types of data “X ¹ ” to “X ^N ” are given as detection data. In FIG. 2, “x ⁿ _m ” is an observation value at m of the n-th data. The subscript "m" means the observed point or time. For example, when “X ¹ ” to “X ^N ” indicate user “N” from user “1” and the data element indicates the presence or absence of purchase of the product “m”, data having the same subscript is purchase of the same product Means the presence or absence of Alternatively, if “X ¹ ” to “X ^N ” indicate the CPU utilization of server “1” to server “N” and the data element represents the CPU utilization at time “m”, data with the same subscript is It means that observation time is equal.

［関係性推定部の処理］
関係性推定部１２１は、「Ｘ^１」〜「Ｘ^Ｎ」の間に成り立つ関係性を推定し、「Ｘ^１」〜「Ｘ^Ｎ」間の関係性を示すパラメータを算出する。 [Processing of relationship estimation unit]
The relationship estimating unit 121 estimates the relationship that holds between the "X ^1" - "X ^N", calculates a parameter indicating the relationship between "X ^1" - "X ^N".

この場合、関係性推定部１２１は、それを出力した機器等が正常な状態のデータ、言い換えると、異常な状態のデータを含まないデータを、パラメータ算出のために用いることが望ましい。正常な状態のデータであるか否かは、機器が正常であったことから判断してもよい。また、正常な状態のデータであるか否かは、データを見て、異常値に相当するものを含まないことなどを基に、人間が視認して判断してもよい。これは、異常判定部１２４での異常検知において、「正常と異なる」ことを「異常」とみなすという指標を用いるためである。さらに、関係性推定部１２１において、正常なデータのみを用いて、正常な状態のデータに成り立つ関係式を推定することで、異常検知の感度を向上させることが期待できる。 In this case, it is desirable that the relationship estimation unit 121 use, for parameter calculation, data in a normal state of the device or the like that has output the data, that is, data that does not include abnormal state data. Whether the data is normal or not may be determined from the fact that the device was normal. In addition, whether or not the data is in a normal state may be determined by visual recognition based on the fact that the data is not included and the data corresponding to the abnormal value is not included. This is because, in the abnormality detection in the abnormality determination unit 124, an index that “different from normal” is regarded as “abnormal” is used. Furthermore, the sensitivity of abnormality detection can be expected to be improved by estimating the relational expression that holds for data in a normal state using only normal data in the relationship estimation unit 121.

例えば、データ間の関係性が式として与えられているものの、パラメータに相当するものが未定である場合に、関係性推定部１２１は、このパラメータを推定する。具体的には、データＸとデータＹとの関係性が、「Ｙ＝ａＸ＋ｂ」という単回帰であることは与えられているが、「ａ」及び「ｂ」が不明な場合に、関係性推定部１２１は、与えられたデータを基に「ａ」及び「ｂ」を推定する。 For example, although the relationship between data is given as a formula, when the thing equivalent to a parameter is undecided, the relationship estimation part 121 estimates this parameter. Specifically, although it is given that the relationship between data X and data Y is a simple regression "Y = aX + b", relationship estimation is made when "a" and "b" are unknown. The part 121 estimates "a" and "b" based on the given data.

また、関係性推定部１２１は、データ間の関係性として、例えば、「Ｘ^１」〜「Ｘ^Ｎ」のいずれかを目的変数、残りを説明変数とする重回帰式が与えられる場合、この重回帰式のパラメータを求める。また、関係性推定部１２１は、データ間の関係性として、「Ｘ^１」〜「Ｘ^Ｎ」の中から２組ずつを選択し、２組ごとに一方を目的変数とし、他方を説明変数とする単回帰式が与えられる場合、この単回帰式のパラメータを求める。 In addition, as the relationship between the data, for example, when the multiple regression equation in which ^{one of} “X ¹ ” to “X ^N ” is an objective variable and the rest is an explanatory variable is given as the relationship between the data, Find the regression equation parameters. Further, the relationship estimation unit 121 selects two sets of “X ¹ ” to “X ^N ” as the relationship between data, and sets one as an objective variable for each two sets, and the other as an explanatory variable. If a single regression equation is given, the parameters of this single regression equation are determined.

或いは、関係性推定部１２１は、データ間の関係性として、各「Ｘ^Ｎ」が系列データであり、過去のデータから将来を予測する自己回帰式またはベクトル自己回帰式が与えられる場合、この自己回帰式またはベクトル自己回帰式のパラメータを求めてもよい。添え字の「１，２，…，ｍ，…」に順序性がある場合、例えば、前述のサーバのＣＰＵ使用率がデータである例などの場合である。 Alternatively, in the case where each “X ^N ” is series data and an autoregressive equation or a vector autoregression equation for predicting the future from past data is given as the relationship between data, the relationship estimation unit 121 performs this autocorrelation. Parameters of regression equation or vector autoregression equation may be obtained. In the case where the suffixes “1, 2,..., M,...” Have orderness, for example, this is the case where the CPU utilization of the above-mentioned server is data.

また、データ間の関係性は、混合分布モデルでモデリングしてあってもよいし、より複雑な非線形な関係を表す式で示されたものであってもよい。例えば、何らかの確率分布を用いて、データ間の関係性を表してもよい。例えば、データ間の関係性が「Ｋ」個のクラスタを持つ、混合分布モデルにより表現される場合、関係性推定部１２１は、（１）式に示す関係式のパラメータを求める。 Further, the relationship between data may be modeled by a mixture distribution model, or may be represented by an equation representing a more complicated non-linear relationship. For example, some probability distribution may be used to represent the relationship between data. For example, when the relationship between data is expressed by a mixed distribution model having “K” number of clusters, the relationship estimation unit 121 obtains a parameter of the relational expression shown in equation (1).

なお、「Ｘ^１」〜「Ｘ^Ｎ」の間に成り立つ関係性が予め与えられている場合には、本実施の形態では、関係性推定部１２１の算出処理を省略することができる。 In the present embodiment, when the relationship that holds between “X ¹ ” and “X ^N ” is given in advance, the calculation process of the relationship estimation unit 121 can be omitted.

［乖離値ベクトル計算部の処理］
続いて、乖離値ベクトル計算部１２２は、「Ｘ^１」〜「Ｘ^Ｎ」および「Ｘ^１」〜「Ｘ^Ｎ」の間に成り立つ関係性から、乖離値ベクトルを計算する。図３は、乖離値ベクトル計算部１２２が行う乖離値ベクトルの計算処理の例を示す図である。 [Process of deviation value vector calculation unit]
Subsequently, the divergence value vector calculation unit 122 calculates the divergence value vector from the relationship established between “X ¹ ” to “X ^N ” and “X ¹ ” to “X ^N ”. FIG. 3 is a diagram illustrating an example of calculation processing of the divergence value vector performed by the divergence value vector calculation unit 122.

図３に示す例では、乖離値ベクトルは、添字「ｍ」ごとに計算されるものとしている。また、この例では、図３に示すデータ「Ｘ^１」〜「Ｘ^Ｎ」が与えられ、この「Ｘ^１」〜「Ｘ^Ｎ」の間に成り立つ関係性を、「Ｆ（Ｘ^１，Ｘ^２，・・・，Ｘ^Ｎ)＝０」としている。この例では、関係式が、「Ｘ^１」〜「Ｘ^Ｎ」のいずれかを目的変数、残りを説明変数とする重回帰式であることをイメージしている。 In the example shown in FIG. 3, the divergence value vector is calculated for each subscript "m". Further, in this example, data “X ¹ ” to “X ^N ” shown in FIG. 3 are given, and the relationship established between “X ¹ ” to “X ^N ” is “F (X ¹ , X ² ,..., X ^N ) = 0. In this example, it is assumed that the relational expression is a multiple regression that uses any one of “X ¹ ” to “X ^N ” as a target variable and the rest as an explanatory variable.

具体的には、図３における「Ｘ^１の乖離値」は、添字「ｍ」における観測値「ｘ^１ _ｍ」と、「ｍ」において観測されたデータから、関係性を用いて推定される「Ｘ^１」の値「Ｆ（ｘ^１ _ｍ，ｘ^２ _ｍ，・・・，ｘ^Ｎ _ｍ)」と、の差である。また、「Ｘ^２の乖離値」は、添字「ｍ」における観測値「ｘ^２ _ｍ」と、「ｍ」において観測されたデータから、関係性を用いて推定される「Ｘ^２」の値「Ｆ（ｘ^１ _ｍ，ｘ^２ _ｍ，・・・，ｘ^Ｎ _ｍ)」と、の差である。乖離値ベクトル計算部１２２は、図３に示す計算処理を行うことによって、「Ｘ^１」〜「Ｘ^Ｎ」の各乖離値を計算する。そして、図３に示すように、乖離値ベクトル計算部１２２は、共通の添字「ｍ」を持つ複数種類のデータから計算した乖離値を、乖離値ベクトルとして出力する。 Specifically, the “deviation value of X ¹ ” in FIG. 3 is estimated using the relationship from the observed value “x ¹ _m ” at the subscript “m” and the data observed at “m” This is the difference between the value of X ^{1 and} the value “F (x ¹ _m , x ² _m ,..., X ^N _m )”. Further, "divergence values of X ^2" is the observed value of the subscript "m" and "x ² _m" from the observed data in the "m", the value of "X ^2" which is estimated using the relationship " ^{_{^{_{F (x 1 m, x 2}}}} m, ···, x N m) and "a difference. The difference value vector calculation unit 122 calculates each difference value of “X ¹ ” to “X ^N ” by performing the calculation process shown in FIG. 3. Then, as shown in FIG. 3, the divergence value vector calculation unit 122 outputs, as a divergence value vector, a divergence value calculated from a plurality of types of data having the common subscript “m”.

また、データ間の関係性は「Ｆ（Ｘ^１，Ｘ^２，・・・，Ｘ^Ｎ)＝０」のように、全てのデータに対して一つの関係性が与えられる場合だけでなく、データの組に対して与えられる場合もある。例えば、「Ｘ^１」〜「Ｘ^Ｎ」の中から２組ずつを選択し、２組ごとに一方を目的変数、他方を説明変数とする単回帰式で関連性が与えられる場合である。そこで、図４を参照して、この場合における乖離値ベクトルの計算処理について説明する。図４は、乖離値ベクトル計算部１２２が行う乖離値ベクトルの計算処理の他の例を示す図である。 Also, the relationship between data is not limited to the case where one relationship is given to all data, such as “F (X ¹ , X ² ,..., X ^N ) = 0”, but also data It may be given for a set of For example, in the case where two sets are selected from “X ¹ ” to “X ^N ”, and relevance is given by a single regression equation in which one set is an objective variable and the other set is an explanatory variable. Then, with reference to FIG. 4, the calculation process of the divergence value vector in this case is demonstrated. FIG. 4 is a diagram showing another example of the calculation processing of the divergence value vector performed by the divergence value vector calculation unit 122. As shown in FIG.

図４に示すように、例えば、「Ｘ^１のＸ^２から見た乖離値」は、添字「ｍ」における観測値「ｘ^１ _ｍ」と、「ｍ」において観測されたデータから、「Ｘ^１」と「Ｘ^２」の関係性「Ｆ^１２」を用いて推定される「Ｘ^１」の値「Ｆ^１２(ｘ^１ _ｍ，ｘ^２ _ｍ)」と、の差を乖離値である。乖離値ベクトル計算部１２２は、図４に示す計算処理を行うことによって、「Ｘ^１のＸ^２から見た乖離値」〜「Ｘ^Ｎ−１のＸ^Ｎから見た乖離値」を計算する。そして、乖離値ベクトル計算部１２２は、図４に示すように、共通の添字「ｍ」を持つ複数種類のデータから計算した乖離値を、乖離値ベクトルとして出力する。 As shown in FIG. 4, for example, "divergence values viewed from X ² of X ¹ 'is the observed value of the subscript" m "and" x ¹ _m ", from the observed data in the" m "," X ¹ "and" relationship between ^{X 2} "," the value of ^{"X 1"} which is estimated using the ^{F 12} "" ^{^{_{^{F 12 (x 1 m, x}}}} 2 m) ", the difference of a deviation value. The divergence value vector calculation unit 122 calculates “a divergence value viewed from X ^{2 of} X ¹ ” to “a divergence value viewed from X ^N of X ^N ⁻¹ ” by performing the calculation process shown in FIG. 4. Then, as shown in FIG. 4, the divergence value vector calculation unit 122 outputs, as a divergence value vector, divergence values calculated from a plurality of types of data having a common subscript “m”.

なお、図４では特に指定していないが、データ間の関係性が一部の組み合わせに対してのみ成り立つと考えてもよい。例えば、「Ｘ^１」と「Ｘ^２」の間は、相関係数が大きく相関関係が認められるのに対し、「Ｘ^１」と「Ｘ^３」の間は、相関係数が小さく相関関係が認められないような場合である。このような場合、乖離値ベクトル計算部１２２は、関係性が認められるものに対してのみ乖離値を計算すればよい。 Although not particularly specified in FIG. 4, it may be considered that the relationship between data holds for only some combinations. For example, correlation coefficient is large between “X ¹ ” and “X ² ”, while correlation coefficient is small between “X ¹ ” and “X ³ ”. It is a case that is not recognized. In such a case, the divergence value vector calculation unit 122 may calculate the divergence value only for those for which the relationship is recognized.

また、データ間の関係性が（１）式に示す混合分布モデルで表現される場合には、乖離値ベクトル計算部１２２は、以下の（２）式で定義される各クラスタへの帰属度「ｍ_ｋ」を計算する。帰属度が大きい程、そのクラスタへの帰属度が高い、すなわち、そのクラスタの中心点に近いと言える。言い換えると、帰属度が大きい程、そのクラスタへの乖離度が小さいと言える。 Further, when the relationship between data is expressed by the mixed distribution model represented by equation (1), the divergence value vector calculation unit 122 determines the degree of belonging to each cluster defined by the following equation (2). Calculate m _k ". The larger the degree of attribution, the higher the degree of attribution to the cluster, that is, the closer to the center point of the cluster. In other words, it can be said that the greater the degree of attribution, the smaller the degree of deviation from that cluster.

この場合、乖離値ベクトル計算部１２２は、（２）式を用いて計算した帰属度「ｍ_ｋ」に対し、乖離値ベクトルとして、「（ｍ_１，ｍ_２，・・・，ｍ_Ｋ）」を求める。或いは、乖離値ベクトル計算部１２２は、（２）式を用いて計算した帰属度「ｍ_ｋ」に対し、乖離値ベクトルとして、「−logΣπ^ｋP（ｘ｜θ^ｋ）」のように、負の対数尤度を計算してもよい。なお、この場合には、乖離値ベクトルは、１次元となる。 In this case, the divergence value vector calculation unit 122 sets “(m ₁ , m ₂ ,..., M _K )” as the divergence value vector with respect to the degree of attribution “m _k ” calculated using equation (2). Ask for Alternatively, the divergence value vector calculation unit 122 may use a negative value vector such as “−logΣπ ^k P (x | θ ^k )” as the divergence value vector with respect to the degree of membership “m _k ” calculated using equation (2). The log likelihood of may be calculated. In this case, the divergence value vector is one-dimensional.

乖離値ベクトル計算部１２２は、上記に示したような計算処理を行うことによって、データ間の関係性に基づいて検知データ及び比較データのデータ間の関係性からの乖離を表す乖離値ベクトルを計算する。なお、一般的には、比較データは複数存在する。もちろん、比較データは、一つでもよい。 By performing the calculation processing as described above, the divergence value vector calculation unit 122 calculates the divergence value vector representing the divergence from the relationship between the detection data and the comparison data based on the relationship between the data. Do. Generally, a plurality of comparison data exist. Of course, the comparison data may be one.

［異常度計算部の処理］
次に、異常度計算部１２３の処理について説明する。異常度計算部１２３は、乖離値ベクトル計算部１２２が、検知データから計算した乖離値ベクトルと、比較データから計算した乖離値ベクトルと、を用いて、離散度合を計算する。 [Processing of abnormality degree calculation unit]
Next, the process of the abnormality degree calculator 123 will be described. The abnormality degree calculation unit 123 calculates the degree of discreteness using the divergence value vector calculated from the detection data and the divergence value vector calculated from the comparison data by the divergence value vector calculation unit 122.

具体的には、異常度計算部１２３は、検知データの乖離値ベクトルが、比較データの乖離値ベクトルの集合から、空間的にどのくらい離れているかを計算する。この場合、異常度計算部１２３は、例えばｋ−ＮＮ（k−nearest neighbor method）法を用いて、検知データの乖離値ベクトルが、比較データの乖離値ベクトルの集合から、空間的にどのくらい離れているかを計算する。図５を参照して、異常度計算部１２３が、ｋ−ＮＮ法を用いて、離散度合を計算した場合について説明する。 Specifically, the abnormality degree calculation unit 123 calculates how far apart the deviation value vector of the detection data is from the set of the deviation value vectors of the comparison data. In this case, using the k-NN (k-nearest neighbor method) method, for example, the anomalous degree calculating unit 123 determines how far away the divergence value vector of the detection data is from the set of divergence value vectors of comparison data. Calculate if A case where the abnormality degree calculator 123 calculates the degree of discreteness using the k-NN method will be described with reference to FIG.

図５は、異常度計算部１２３による異常度計算処理を説明する図である。図５は、乖離値ベクトルの次元１〜次元３に対し、乖離値ベクトル計算部１２２が比較データ及び検知データから計算した各乖離値ベクトルをプロットした図である。図５において、原点近傍に位置する白丸のデータ群Ｒ１は、比較データから計算された乖離値ベクトルに対応する。点Ｐ１は、検知データから計算した乖離値ベクトルに対応する（図５の（１）参照）。ｋ−ＮＮ法では、検知データの乖離値に対応する点Ｐ１から見て、ｋ番目に近い点Ｐｋまでの距離を、異常度（離散度合）として計算する（図５の（２）参照）。ここで、「ｋ」は、パラメータであり、ヒューリスティックスを用いて設定される。 FIG. 5 is a diagram for explaining abnormality degree calculation processing by the abnormality degree calculation unit 123. As shown in FIG. FIG. 5 is a diagram in which each divergence value vector calculated by the divergence value vector calculation unit 122 from the comparison data and the detection data is plotted with respect to the dimensions 1 to 3 of the divergence value vector. In FIG. 5, a white circle data group R1 located near the origin corresponds to the divergence value vector calculated from the comparison data. The point P1 corresponds to the divergence value vector calculated from the detection data (see (1) in FIG. 5). In the k-NN method, the distance to the k-th closest point Pk as calculated from the point P1 corresponding to the deviation value of the detection data is calculated as the abnormality degree (discrete degree) (see (2) in FIG. 5). Here, "k" is a parameter and is set using heuristics.

また、異常度計算部１２３は、検知データの乖離値ベクトルが、比較データの乖離値ベクトルの集合から、どのくらい空間的に疎な位置に存在するかを計算してもよい。この場合、異常度計算部１２３は、例えば、ＬＯＦを用いて検知データの乖離値ベクトルが、比較データの乖離値ベクトルの集合から、空間的にどのくらい離れているかを計算する。 Further, the abnormality degree calculating unit 123 may calculate how much the spatially separated position of the difference value vector of the detection data exists from the set of difference value vectors of the comparison data. In this case, the abnormality degree calculation unit 123 calculates, for example, how far the divergence value vector of the detected data is spatially separated from the set of the divergence value vectors of the comparison data using the LOF.

図６は、異常度計算部１２３による異常度計算処理の他の例を説明する図である。ＬＯＦは、空間内での局所的密度を計算する手法である。図６の白丸のデータ群Ｒ１は、比較データの乖離値ベクトルに対応する点の集まりであり、点Ｐ１は、検知データから計算した乖離値ベクトルに対応する点である（図６の（１）参照）。 FIG. 6 is a diagram for explaining another example of abnormality degree calculation processing by the abnormality degree calculation unit 123. As shown in FIG. LOF is a method of calculating local density in space. The white circle data group R1 in FIG. 6 is a group of points corresponding to the divergence value vector of the comparison data, and the point P1 is a point corresponding to the divergence value vector calculated from the detection data ((1) in FIG. 6). reference).

具体的には、検知データの乖離値（点Ｐ１）から見て、ｋ番目までに近い点Ｐｋまでの距離の平均を、それらｋ番目の点Ｐｋから見てｍ番目までに近い点Ｐｍまでの距離の平均で割った値を異常度（離散度合）として計算する（図６の（２）参照）。例えば、データ群Ｒ１の密度の高い位置に、検知データの乖離値ベクトルに対応する点Ｐ１があった場合には、点Ｐ１からｋ番目に近い点Ｐｋまでの距離の平均が小さくなり、点Ｐｋから点Ｐｍまでの距離も小さくなるため、離散度合は小さくなる。一方、データ群Ｒ１の密度の低い位置に点Ｐ１があった場合には、点Ｐ１からｋ番目に近い点Ｐｋまでの距離の平均が大きくなるため、離散度合は小さくなる。なお、「ｋ」及び「ｍ」は、パラメータであり、ヒューリスティックスを用いて設定される。 Specifically, the average of the distances to the points Pk close to the k-th point from the deviation value (point P1) of the detected data is the points to the point Pm near the m-th points from the k-th point Pk A value divided by the average of the distances is calculated as an abnormality (discrete degree) (see (2) in FIG. 6). For example, if the point P1 corresponding to the divergence value vector of the detection data is at a high density position of the data group R1, the average of the distance from the point P1 to the point Pk closest to the kth becomes small, and the point Pk Since the distance from the point to point Pm also decreases, the degree of discreteness decreases. On the other hand, when the point P1 is at a position where the density of the data group R1 is low, the average of the distance from the point P1 to the point Pk closest to the k-th becomes large, so the degree of discreteness decreases. Note that “k” and “m” are parameters and are set using heuristics.

また、比較データは、正常な状態のデータに限定することで、後述する異常判定部１２４の異常検知精度を高めることができる。「正常な状態のデータ」の定義は前述の通りである。また、正常な状態のデータのみを比較データとした場合、乖離値ベクトルは、空間的には局所に集中することに注意しておく。例えば図５及び図６において説明した方法を用いて乖離値ベクトルを計算すると、空間的には原点近傍に乖離値ベクトルが集中する。また、（２）式に示す帰属度「ｍ_ｋ」に対し、乖離値ベクトルを（ｍ_１，ｍ_２，・・・，ｍ_Ｋ）で定義した場合は、乖離値ベクトルは、空間的にはＫ個のクラスタに集中する。 Further, by limiting the comparison data to data in a normal state, it is possible to enhance the abnormality detection accuracy of the abnormality determination unit 124 described later. The definition of "normal state data" is as described above. In addition, when only data in a normal state is used as comparison data, it should be noted that the divergence value vector is spatially concentrated locally. For example, when the divergence value vector is calculated using the method described in FIGS. 5 and 6, the divergence value vector is spatially concentrated near the origin. When the divergence value vector is defined as (m ₁ , m ₂ ,..., M _K ) with respect to the degree of membership “m _k ” shown in the equation (2), the divergence value vector is spatially Concentrate on K clusters.

また、検知データと比較データとが別々に与えられる場合がある。例えば、ある特定の過去１日分の複数のサーバのＣＰＵ使用率を比較データとして異常検知装置１０に届き、検知データは、異常検知装置１０の運用時に逐次的に届くような場合である。このような場合、比較データの乖離値ベクトルを、検知対象のデータが届くたびに計算し直すことは計算リソース上、効率的ではない。そこで、記憶部１３の乖離値ベクトル記憶部１３１は、このような場合に比較データの乖離値を再計算する必要がないように、比較データの乖離値ベクトルを記憶しておく。乖離値ベクトル記憶部１３１を利用する場合、異常度計算部１２３は、検知データの乖離値ベクトルと、乖離値ベクトル記憶部１３１が記憶する乖離値ベクトルと、を比較する。 Also, detection data and comparison data may be provided separately. For example, the CPU usage rates of a plurality of servers for a specific past one day are delivered to the abnormality detection apparatus 10 as comparison data, and the detection data is sequentially delivered when the abnormality detection apparatus 10 is operated. In such a case, it is not efficient in terms of computational resources to recalculate the difference value vector of the comparison data each time the data to be detected arrives. Therefore, the difference value vector storage unit 131 of the storage unit 13 stores the difference value vector of the comparison data so that the difference value of the comparison data does not need to be recalculated in such a case. When the divergence value vector storage unit 131 is used, the abnormality degree calculation unit 123 compares the divergence value vector of the detection data with the divergence value vector stored in the divergence value vector storage unit 131.

そして、乖離値ベクトル記憶部１３１を利用する場合も、比較データとして、正常な状態のデータから計算した乖離値ベクトルのみを記憶させることで、異常検知精度を高めることができる。 Also in the case where the divergence value vector storage unit 131 is used, the abnormality detection accuracy can be enhanced by storing only the divergence value vector calculated from the data in the normal state as the comparison data.

［異常判定部の処理］
異常判定部１２４は、異常度計算部１２３が計算した離散度合が所定の閾値を超えた場合に、検知データは異常であることを判定する。異常判定部１２４は、離散度合が所定の閾値以下である場合に、検知データは正常であることを判定する。 [Process of abnormality determination unit]
The abnormality determination unit 124 determines that the detection data is abnormal when the degree of discreteness calculated by the abnormality degree calculation unit 123 exceeds a predetermined threshold. The abnormality determination unit 124 determines that the detection data is normal when the degree of discreteness is equal to or less than a predetermined threshold.

ここで、判定の基準となる閾値は、予め設定されたものである。或いは、テストデータがある場合は、テストデータ中の特定のデータ、すなわち、異常が発生した際のデータにおける異常度を閾値として設定してもよい。または、テストデータにおける異常度が、適当な確率分布に従うと考え、その上位５％或いは上位１％などの値を閾値として設定してもよい。異常判定部１２４による判定結果は、異常検知結果として、通信処理部１１を介して、例えば、端末装置２０に出力される。 Here, the threshold that is the reference of the determination is a preset one. Alternatively, when there is test data, a specific data in the test data, that is, an abnormality degree in data when an abnormality occurs may be set as a threshold. Alternatively, it may be considered that the degree of abnormality in the test data follows an appropriate probability distribution, and a value such as the top 5% or the top 1% may be set as the threshold. The determination result by the abnormality determination unit 124 is output to, for example, the terminal device 20 via the communication processing unit 11 as an abnormality detection result.

［異常検知処理の流れ］
次に、異常検知装置１０が実行する異常検知処理について説明する。図７は、異常検知装置１０が実行する異常検知処理の処理手順を示すフローチャートである。 [Flow of anomaly detection processing]
Next, the abnormality detection process performed by the abnormality detection device 10 will be described. FIG. 7 is a flowchart showing the procedure of the abnormality detection process performed by the abnormality detection apparatus 10.

まず、異常検知装置１０では、関係性推定部１２１が、入力されたデータに対して、データ間の関係性を推定し、データ間の関係性を示すパラメータを算出する関係性推定処理を行う（ステップＳ１）。関係性推定部１２１は、それを出力した機器等が正常な状態のデータ、言い換えると、異常な状態のデータを含まないデータを、パラメータ推定のために用いる。データ間に成り立つ関係性が予め与えられている場合には、本ステップＳ１を省略することができる。 First, in the abnormality detection device 10, the relationship estimation unit 121 performs a relationship estimation process of estimating the relationship between the data with respect to the input data, and calculating a parameter indicating the relationship between the data (see FIG. Step S1). The relationship estimation unit 121 uses, for parameter estimation, data of a normal state of the device or the like that has output the data, that is, data that does not include abnormal state data. This step S1 can be omitted if the relationship established between the data is given in advance.

そして、乖離値ベクトル計算部１２２は、データ間の関係性に基づいて、検知対象である検知データの集合及び比較データの集合におけるデータ間の乖離値ベクトルを計算する乖離値ベクトル計算処理を実行する（ステップＳ２）。ここで、乖離値ベクトル計算部１２２は、比較データが予め与えられている場合、該比較データの集合におけるデータ間の乖離値ベクトルを計算して、乖離値ベクトル記憶部１３１に記憶する。 Then, the divergence value vector calculation unit 122 executes the divergence value vector calculation processing for calculating the divergence value vector between the data in the set of detection data to be detected and the comparison data based on the relationship between the data. (Step S2). Here, when the comparison data is given in advance, the difference value vector calculation unit 122 calculates a difference value vector between data in the set of comparison data, and stores the difference value vector in the difference value vector storage unit 131.

続いて、異常度計算部１２３は、検知データの乖離値ベクトルの、比較データの乖離値ベクトルの集合からの離散度合を、異常度として計算する異常度計算処理を行う（ステップＳ３）。なお、異常度計算部１２３は、比較データの乖離値ベクトルが予め計算されて乖離値ベクトル記憶部１３１に記憶されている場合、乖離値ベクトル記憶部１３１から比較データの乖離値ベクトルを読み出して、比較データの乖離値ベクトルの集合を取得する。 Subsequently, the abnormality degree calculation unit 123 performs abnormality degree calculation processing for calculating the degree of discreteness of the deviation value vector of the detection data from the set of deviation value vectors of the comparison data as the abnormality degree (step S3). When the difference value vector of the comparison data is calculated in advance and stored in the difference value vector storage unit 131, the abnormality degree calculation unit 123 reads out the difference value vector of the comparison data from the difference value vector storage unit 131, Obtain a set of divergence value vectors of comparison data.

そして、異常判定部１２４は、異常度計算部１２３が計算した離散度合を基に、検知データが異常であるか否かを判定する異常判定処理を行う（ステップＳ４）。この場合、異常判定部１２４は、異常度計算部１２３が計算した離散度合が所定の閾値を超えた場合に、検知データは異常であることを判定する。一方、異常判定部１２４は、離散度合が所定の閾値以下である場合に、検知データは正常であることを判定する。異常判定部１２４は、判定結果を異常検知結果として、通信処理部１１を介して端末装置２０に出力し、異常検知処理を終了する。 Then, the abnormality determination unit 124 performs abnormality determination processing to determine whether the detected data is abnormal based on the discrete degree calculated by the abnormality degree calculation unit 123 (step S4). In this case, the abnormality determination unit 124 determines that the detection data is abnormal when the degree of discreteness calculated by the abnormality degree calculation unit 123 exceeds a predetermined threshold. On the other hand, the abnormality determination unit 124 determines that the detection data is normal when the degree of discreteness is equal to or less than a predetermined threshold. The abnormality determination unit 124 outputs the determination result as the abnormality detection result to the terminal device 20 via the communication processing unit 11, and ends the abnormality detection process.

［異常検知処理の具体例］
図８は、実施の形態１の異常検知処理を説明する図である。図８は、データとして、Ｘ及びＹの組が与えられたとして、座標平面上にその組をプロットしたものである。図８の白丸は、正常な状態の比較データに対応する。また、点Ｐｂは、相関関係を維持したままで、それまでには存在していなかった値をとった場合の例である。点Ｐｒは、相関関係が崩れた場合の例である。また、正常である比較データ（図８の白丸）を基に、Ｘ及びＹの関係性として、直線Ｌｔで示される「Ｙ＝ａＸ＋ｂ」という単回帰が与えられている。 [Specific example of abnormality detection processing]
FIG. 8 is a diagram for explaining the abnormality detection process according to the first embodiment. FIG. 8 is a plot of the set on the coordinate plane, given the set of X and Y as data. The white circles in FIG. 8 correspond to comparison data in a normal state. In addition, the point Pb is an example in the case where the correlation is maintained and a value which has not existed until then is taken. The point Pr is an example when the correlation is broken. Moreover, based on the comparison data (white circle in FIG. 8) that is normal, a simple regression “Y = aX + b” indicated by a straight line Lt is given as the relationship between X and Y.

図８の示す点Ｐｂは、直線Ｌｔ上に位置し、正常である場合に成り立つ相関関係を維持しているため、正常であることが想定される。ここで、従来用いられていたＬＯＦでは、データ間の関係性を考慮しておらず、白丸の密度が低い点に存在する点Ｐｂ及び点Ｐｒは、いずれも異常であると検知される。 The point Pb shown in FIG. 8 is assumed to be normal because it is located on the straight line Lt and maintains the correlation established when it is normal. Here, in the LOF conventionally used, the relationship between the data is not taken into consideration, and the point Pb and the point Pr existing at the point where the density of the white circles is low are detected as abnormal.

これに対し、本実施の形態１では、データ間の関係性に基づいて、検知データの集合におけるデータ間の乖離値ベクトルと、比較データの集合におけるデータ間の乖離値ベクトルと、を計算し、検知データの乖離値ベクトルの、比較データの乖離値ベクトルの集合からの離散度合を、異常度として異常判定を行う。 On the other hand, in the first embodiment, the difference value vector between data in the set of detection data and the difference value vector between data in the set of comparison data are calculated based on the relationship between the data, An abnormality determination is performed with the degree of discrepancies from the set of deviation value vectors of comparison data as the deviation value vector of detection data as the abnormality degree.

例えば、点Ｐｂが検知データである場合を例とする。この点Ｐｂは、直線Ｌｔ上に位置するため、点Ｐｂに示す「Ｘ，Ｙ」は、直線Ｌｔで示される「Ｙ＝ａＸ＋ｂ」の関係を有していると言える。したがって、この点Ｐｂに示す「Ｘ，Ｙ」について、直線Ｌｔで示される「Ｙ＝ａＸ＋ｂ」に対する乖離値ベクトルを計算し、その乖離値ベクトルの、正常である比較データ（白丸）の乖離値ベクトルの集合からの離散度合を計算すると、ほぼ０となり、点Ｐｂは正常であることを検知できる。 For example, the case where point Pb is detection data is taken as an example. Since this point Pb is located on the straight line Lt, it can be said that "X, Y" shown at the point Pb has a relationship of "Y = aX + b" shown by the straight line Lt. Therefore, for "X, Y" shown at this point Pb, the divergence value vector for "Y = aX + b" shown by straight line Lt is calculated, and the divergence value vector of the comparison data (white circle) of the divergence value vector that is normal. When the discrete degree from the set of is calculated, it becomes almost 0, and it can be detected that the point Pb is normal.

一方、点Ｐｒが検知データである場合について説明する。この点Ｐｒは、直線Ｌｔから離れているため、点Ｐｒに示す「Ｘ，Ｙ」は、「Ｙ＝ａＸ＋ｂ」の関係を有していないと言える。したがって、この点Ｐｒに示す「Ｘ，Ｙ」の、直線Ｌｔで示される「Ｙ＝ａＸ＋ｂ」に対する乖離値ベクトルを計算し、その乖離値ベクトルの、正常である比較データ（白丸）の乖離値ベクトルの集合からの離散度合を計算すると、その値は大きくなり、Ｐｒは異常であることを検知できる。 On the other hand, the case where point Pr is detection data will be described. Since this point Pr is far from the straight line Lt, it can be said that "X, Y" shown at the point Pr does not have the relationship of "Y = aX + b". Therefore, the divergence value vector for "Y = aX + b" indicated by straight line Lt of "X, Y" shown at this point Pr is calculated, and the divergence value vector of the comparison data (white circle) which is normal for that divergence value vector. If we calculate the degree of discreteness from the set of, the value becomes large and we can detect that Pr is anomalous.

このように、異常検知装置１０は、乖離値ベクトルという概念を導入し、検知対象のデータの乖離値ベクトルと、正常である比較データの乖離値ベクトルの集合との空間的な距離や密度に基づき離散度合（異常度）を計算し、検知データの異常の有無を判定する。したがって、異常検知装置１０は、データ間に相関がある場合に、相関に乗っているが、比較データの集合から外れた、正常であると想定できるデータ（例えば、点Ｐｂ）を、正常であると検知することができる。 Thus, the anomaly detection apparatus 10 introduces the concept of a divergence value vector, and based on the spatial distance and density of the divergence value vector of the data to be detected and the set of divergence value vectors of comparison data that is normal. The discreteness degree (abnormality degree) is calculated to determine the presence or absence of abnormality in the detected data. Therefore, when there is a correlation between the data, the abnormality detection device 10 is normal when the data (for example, the point Pb) which is on the correlation but deviates from the set of comparison data and can be assumed to be normal is normal. Can be detected.

また、図１４や図１５のように、データ間に単なる相関関係でない、複雑な関係性が見られる場合であっても、データ間の関係性からの乖離値ベクトルという概念により、異常検知を精度よく実行することができる。 Also, as shown in FIG. 14 and FIG. 15, even in the case where complex relationships between data are not found to be mere correlations, the accuracy of abnormality detection can be achieved by the concept of the difference value vector from the relationship between the data. It can be done well.

［実施の形態１の効果］
上記のように、実施の形態１では、乖離値ベクトルという概念を導入し、データ間の関係性に基づいて計算した、検知データの乖離値ベクトルと、正常である比較データの乖離値ベクトルの集合との空間的な距離や密度に基づき離散度合（異常度）を計算し、検知データの異常の有無を判定するため、データ間の関係性に基づいた検知対象データの異常検知を精度よく実行することができる。 [Effect of Embodiment 1]
As described above, in the first embodiment, the concept of the difference value vector is introduced, and a set of difference value vectors of detected data and a set of difference value vectors of comparison data that are normal calculated based on the relationship between data. To calculate the degree of abnormality (the degree of abnormality) based on the spatial distance and density of the object and to determine the presence or absence of the abnormality in the detection data, accurately execute the abnormality detection of the detection target data based on the relationship between the data be able to.

［実施の形態２］
次に、実施の形態２について説明する。実施の形態２では、離散度合として、検知データ及び比較データの乖離値ベクトルに基づいたマハラノビス距離を計算し、異常の有無を判定する。なお、実施の形態２に係る異常検知装置は、図１に示す異常検知装置１０と同等の構成を有する。 Second Embodiment
Next, the second embodiment will be described. In the second embodiment, the Mahalanobis distance based on the difference value vector of the detection data and the comparison data is calculated as the discrete degree, and the presence or absence of abnormality is determined. The abnormality detection apparatus according to the second embodiment has the same configuration as the abnormality detection apparatus 10 shown in FIG.

実施の形態２では、実施の形態１と同様に、乖離値ベクトル計算部１２２が、データ間の関係性に基づいて、検知対象である検知データの集合におけるデータ間の乖離値ベクトルを計算する。そして、乖離値ベクトル計算部１２２は、比較データの集合におけるデータ間の乖離値ベクトルを計算する。なお、実施の形態１と同様に、乖離値ベクトル計算部１２２は、比較データが予め与えられている場合、該比較データの集合におけるデータ間の乖離値ベクトルを計算して、乖離値ベクトル記憶部１３１に記憶してもよい。 In the second embodiment, as in the first embodiment, the divergence value vector calculation unit 122 calculates the divergence value vector between data in the set of detection data to be detected based on the relationship between the data. Then, the divergence value vector calculation unit 122 calculates a divergence value vector between data in the set of comparison data. As in the first embodiment, when the comparison data is given in advance, the divergence value vector calculation unit 122 calculates the divergence value vector between the data in the set of comparison data, and calculates the divergence value vector storage unit. It may be stored in 131.

そして、異常度計算部１２３は、比較データの乖離値ベクトルが多次元正規分布に従うと仮定し、それらの乖離値ベクトルの平均と共分散行列とを計算する。続いて、異常度計算部１２３は、（３）式で定義されるマハラノビス距離を計算し、このマハラノビス距離を離散度合（異常度）として出力する。 Then, assuming that the difference value vectors of the comparison data follow the multidimensional normal distribution, the abnormality degree calculation unit 123 calculates an average and a covariance matrix of the difference value vectors. Subsequently, the abnormality degree calculation unit 123 calculates the Mahalanobis distance defined by the equation (3), and outputs this Mahalanobis distance as a discrete degree (abnormality degree).

異常判定部１２４は、異常度計算部１２３が計算したマハラノビス距離が一定の閾値を超えた場合に、検知データは異常であることを判定する。一方、異常判定部１２４は、異常度計算部１２３が計算したマハラノビス距離が一定の閾値以下である場合には、検知データは正常であることを判定する。なお、本実施の形態２では、比較データの乖離値ベクトルが多次元正規分布に従うと仮定しており、この場合、乖離値ベクトルのマハラノビス距離は近似的にｘ二乗分布に従うため、ｘ二乗分布に基づき閾値を決定することができる。このような方法は、ホテリングのＴ^２検定と呼ばれている（「竹内啓，統計学辞典 P112，東洋経済新聞社，1989」参照）。 The abnormality determination unit 124 determines that the detection data is abnormal when the Mahalanobis distance calculated by the abnormality degree calculation unit 123 exceeds a certain threshold. On the other hand, the abnormality determination unit 124 determines that the detection data is normal when the Mahalanobis distance calculated by the abnormality degree calculation unit 123 is equal to or less than a certain threshold. In the second embodiment, it is assumed that the divergence value vector of the comparison data conforms to the multidimensional normal distribution, and in this case, the Mahalanobis distance of the divergence value vector approximates to the x-square distribution. The threshold can be determined on the basis of this. Such a method is called Hotelling's T ² test (see "Takeuchi Uchi, Statistical Dictionary P112, Toyo Keizai Shimbun, 1989").

［実施の形態２の効果］
このように、実施の形態２においては、離散度合として、マハラノビス距離を計算し、計算したマハラノビス距離と所定の閾値との比較結果によって、異常の有無を判定する。マハラノビス距離は、データ間の関係性に基づく検知データ及び比較データの乖離値ベクトルを基に計算されたものであるため、実施の形態２は、実施の形態１と同様に、データ間の関係性に基づいた検知対象データの異常検知を精度よく実行することができる。 [Effect of Embodiment 2]
As described above, in the second embodiment, the Mahalanobis distance is calculated as the degree of discreteness, and the presence or absence of abnormality is determined based on the comparison result of the calculated Mahalanobis distance and a predetermined threshold. Since the Mahalanobis distance is calculated based on the difference value vector of the detection data and the comparison data based on the relationship between the data, in the second embodiment, the relationship between the data is the same as the first embodiment. It is possible to accurately execute abnormality detection of detection target data based on.

［実施の形態３］
次に、実施の形態３について説明する。この実施の形態３に係る異常検知装置は、図１に示す異常検知装置１０と同等の構成を有する。 Third Embodiment
Next, the third embodiment will be described. The abnormality detection device according to the third embodiment has the same configuration as the abnormality detection device 10 shown in FIG.

また、実施の形態１と同様に、乖離値ベクトル計算部１２２は、データ間の関係性に基づいて、検知対象である検知データの集合におけるデータ間の乖離値ベクトルを計算する。そして、乖離値ベクトル計算部１２２は、比較データの集合におけるデータ間の乖離値ベクトルを計算する。なお、実施の形態１と同様に、乖離値ベクトル計算部１２２は、比較データが予め与えられている場合、該比較データの集合におけるデータ間の乖離値ベクトルを計算して、乖離値ベクトル記憶部１３１に記憶してもよい。そこで、次に、異常度計算部１２３の処理を説明する。 Further, as in the first embodiment, the divergence value vector calculation unit 122 calculates the divergence value vector between data in the set of detection data to be detected based on the relationship between the data. Then, the divergence value vector calculation unit 122 calculates a divergence value vector between data in the set of comparison data. As in the first embodiment, when the comparison data is given in advance, the divergence value vector calculation unit 122 calculates the divergence value vector between the data in the set of comparison data, and calculates the divergence value vector storage unit. It may be stored in 131. Therefore, next, the process of the abnormality degree calculator 123 will be described.

［異常度計算部の処理］
実施の形態３では、異常度計算部１２３は、さらに、One-class Support Vector Machine（以下「Ｏｎｅ-ｃｌａｓｓＳＶＭ」と略す。詳しくは、「B. Scholkopf, J. C. Platt, J. Shawe-Taylor， A. J. Smola， and R. C. Williamson, “Estimating the Support of a High-Dimensional Distribution”, Neural Computation， 13(7):1443-1471， 2001．」参照。）の概念に基づいて、比較データの乖離値ベクトルの集合を含む領域を推定する。 [Processing of abnormality degree calculation unit]
In the third embodiment, the abnormality degree calculation unit 123 is further abbreviated as One-class Support Vector Machine (hereinafter referred to as "One-class SVM". For details, see "B. Scholkopf, JC Platt, J. Shawe-Taylor, AJ. Based on the concept of “Smola, and RC Williamson,“ Estimating the Support of High-Dimensional Distribution ”, Neural Computation, 13 (7): 1443-1471, 2001. Estimate the area that contains

具体的に、図９を参照して、離散度合（異常度）を求める処理について説明する。図９は、実施の形態３に係る異常度計算処理を説明する図である。図９は、データの乖離値ベクトルを所定の高次元空間に写像したものである。 Concretely, with reference to FIG. 9, the process which calculates | requires a discrete degree (abnormality degree) is demonstrated. FIG. 9 is a diagram for explaining abnormality degree calculation processing according to the third embodiment. FIG. 9 is a map of deviation value vectors of data in a predetermined high-dimensional space.

異常度計算部１２３は、Ｏｎｅ-ｃｌａｓｓＳＶＭに基づき、正常データである比較データの乖離値ベクトルを、高次元空間（図９ではφの次元１及び次元２）に写像する。そして、異常度計算部１２３は、写像した比較データの乖離値ベクトルに対応する点の、原点からの距離（マージン）が最大化するような平面（超平面）を求める。この平面は、図９の例では、超平面Ｌｅとして示している。この超平面Ｌｅは、正常である比較データの乖離値ベクトルの集合の境界に対応するものであり、実際には、写像した比較データの乖離値ベクトルを示す点は、超平面Ｌｅよりも原点側でない方に位置する。 Based on the One-class SVM, the degree-of-abnormality calculation unit 123 maps the difference value vector of the comparison data, which is normal data, to a high-dimensional space (dimension 1 and dimension 2 of φ in FIG. 9). Then, the abnormality degree calculation unit 123 obtains a plane (hyperplane) such that the distance (margin) from the origin of the point corresponding to the difference value vector of the mapped comparison data is maximized. This plane is shown as a hyperplane Le in the example of FIG. The hyperplane Le corresponds to the boundary of the set of divergence value vectors of comparison data that is normal, and in fact, the point indicating the divergence value vector of the mapped comparison data is on the origin side of the hyperplane Le It is not located.

続いて、異常度計算部１２３は、検知対象データの乖離値ベクトルを、比較データに対して写像した高次元空間と同じ高次元空間に写像する。例えば、図９に示すように、写像した検知データの乖離値ベクトルを示す各点は、超平面Ｌｅから見て、原点側にある群Ｒ２と、原点側にない群Ｒ３とに分けられる。異常度計算部１２３は、写像した検知データの乖離値ベクトルに対応する点が、超平面Ｌｅから見て原点側にあるか否かを基に、離散度合（異常度）を計算する。 Subsequently, the abnormality degree calculation unit 123 maps the difference value vector of the detection target data into the same high-dimensional space as the high-dimensional space mapped to the comparison data. For example, as shown in FIG. 9, each point indicating the difference value vector of the mapped detection data is divided into a group R2 on the origin side and a group R3 not on the origin side when viewed from the hyperplane Le. The degree-of-abnormality calculation unit 123 calculates the degree of discreteness (degree of abnormality) based on whether the point corresponding to the difference value vector of the mapped detection data is on the origin side as viewed from the hyperplane Le.

そこで、異常度計算部１２３における計算処理を、図１０を参照して、説明する。図１０は、実施の形態３に係る異常度計算処理を説明する図である。まず、比較データの乖離値ベクトルを「ｅ_１，ｅ_２，・・・，ｅ_Ｍ」とする。この比較データの乖離値ベクトルに対し、図１０に示す式Ｇ（（Ａ）式参照）を、（Ｂ）式及び（Ｃ）式に示す条件下で最小化する最小化問題を解く。なお、記号「＜，＞」は内積を表す。 Therefore, the calculation process in the abnormality degree calculation unit 123 will be described with reference to FIG. FIG. 10 is a diagram for explaining abnormality degree calculation processing according to the third embodiment. First, the difference value vector of the comparison data is set to “e ₁ , e ₂ ,..., E _M ”. With respect to the difference value vector of the comparison data, a minimization problem is solved in which the equation G (see equation (A) shown in FIG. 10) is minimized under the conditions shown in equations (B) and (C). The symbol "<,>" represents an inner product.

この問題では、超平面として、各データの（「φ（ベクトルｅ_ｍ）」の距離を、「ｄ_ｍ」としたときに、最も小さいｄ_ｍを最大化する超平面を求めようとしている。言い換えると、最も超平面に近いデータまでの距離を最大化する、超平面のパラメータ「ベクトルｗ」と「ρ」を求めようとしている。 This problem, as hyperplane, the distance of each data ( "phi (vector e _m)", when the "d _m", and attempts to find a hyperplane that maximizes the smallest d _m. In other words And we try to find the hyperplane parameters "vector w" and "p" which maximize the distance to the data closest to the hyperplane.

この最小化問題は、以下の（４）式に示す「Ｌ」を最小化するＬａｇｒａｎｇｅの未定乗数法により解くことができる。この（４）式の１行目は、Ｇそのものであり、（４）式の２行目については、図１０の（Ｂ）式に示す制約条件を反映し、（４）式の３行目については、図１０の（Ｃ）式に示す制約条件を反映する。 This minimization problem can be solved by Lagrange's undetermined multiplier method which minimizes “L” shown in the following equation (4). The first line of the equation (4) is G itself, and the second line of the equation (4) reflects the constraint shown in the equation (B) of FIG. 10, and the third line of the equation (4) 10 reflects the constraint shown in the equation (C) of FIG.

図１１は、実施の形態３に係る異常度計算処理及び異常判定処理を説明する図である。異常度計算部１２３は、異常データの乖離値ベクトルを「ｅ’」としたとき、図１１に示す式ｆ（ｅ’）によって、検知データの乖離値ベクトル「ｅ’」に対する異常度を計算する。式ｆ（ｅ’）は、（４）式で求めたパラメータを適用し、比較データの乖離値ベクトル「ｅ_ｍ」を高次元空間に写像した点と、検知データの乖離値ベクトル「ｅ’」を高次元空間に写像した点との距離に基づいた異常度を計算するものである。異常度計算部１２３は、この式ｆ（ｅ’）を用いた計算を行うことによって、写像した検知データの乖離値ベクトルに対応する点が、超平面Ｌｅから見て原点側にあるか否かを示す異常度を求めることができる。 FIG. 11 is a diagram for explaining abnormality degree calculation processing and abnormality determination processing according to the third embodiment. The abnormality degree calculation unit 123 calculates the abnormality degree with respect to the deviation value vector “e ′” of the detection data by the expression f (e ′) shown in FIG. 11 when the deviation value vector of the abnormality data is “e ′”. . Formula f (e ') is (4) by applying the parameter obtained by the formula, and that the mapping divergence value vector of the comparison data to "e _m' in high-dimensional space, divergence value vector of the detection data" e '" To calculate the degree of anomaly based on the distance between the point mapped to the high-dimensional space. Whether the point corresponding to the difference value vector of the mapped detection data is on the origin side with respect to the hyperplane Le or not by performing calculation using this equation f (e ′) Can be determined.

このように、異常度計算部１２３は、上述のＯｎｅ-ｃｌａｓｓＳＶＭに従い、写像した検知データの乖離値ベクトルを示す点が、平面（例えば、超平面Ｌｅ）から見て原点側にあるか、或いは、平面から見て原点側にないかを、式ｆ（ｅ’）を用いて計算する。 As described above, according to the above-described One-class SVM, the abnormality degree calculation unit 123 determines whether the point indicating the difference value vector of the mapped detection data is on the origin side as viewed from the plane (for example, the hyperplane Le) or Using the equation f (e '), it is calculated whether or not it is on the origin side as viewed from the plane.

［異常判定部の処理］
そして、実施の形態３では、異常判定部１２４は、写像した検知データの乖離値ベクトルを示す点が、平面から見て原点側にある場合には、該検知データは異常であると判定する。一方、異常判定部１２４は、写像した検知データの乖離値ベクトルを示す点が、平面から見て原点側にない場合には、正常であると判定する。例えば、異常判定部１２４は、図９に示す写像した検知データの乖離値ベクトルを示す各点のうち、超平面Ｌｅから見て、原点側にある群Ｒ２については、検知データは異常であると判定する。一方、異常判定部１２４は、超平面Ｌｅから見て、原点側にない群Ｒ３については、検知データは正常であると判定する（図９の枠Ｂ１参照）。 [Process of abnormality determination unit]
Then, in the third embodiment, when the point indicating the deviation value vector of the mapped detection data is on the origin side as viewed from the plane, the abnormality determination unit 124 determines that the detection data is abnormal. On the other hand, when the point indicating the difference value vector of the mapped detection data is not on the origin side as viewed from the plane, the abnormality determination unit 124 determines that the point is normal. For example, the abnormality determination unit 124 determines that the detection data is abnormal for the group R2 on the origin side as viewed from the hyperplane Le among the points indicating the difference value vector of the mapped detection data illustrated in FIG. 9. judge. On the other hand, the abnormality determination unit 124 determines that the detection data is normal for the group R3 which is not on the origin side as viewed from the hyperplane Le (see the frame B1 in FIG. 9).

ここで、異常判定部１２４は、検知データに対し式（ｅ’）で求めた異常度と、前述の未定乗数法（（４）式）によって求めた超平面に対応するパラメータ（ρチルダ）と、を比較することによって、検知データの異常の有無を判定する。すなわち、図１１に示すように、異常判定部１２４は、検知データについての異常度ｆ（ｅ’）が、（４）式からパラメータ（ρチルダ）よりも小さい場合には、検知データが超平面Ｌｅよりも原点側にあると判断して、該検知データは異常であると判定する。一方、異常判定部１２４は、検知データについての異常度ｆ（ｅ’）が、パラメータ（ρチルダ）よりも大きい場合には、検知データが超平面Ｌｅよりも原点側にないと判断して、正常であると判定する。 Here, the abnormality determination unit 124 determines the degree of abnormality obtained by the equation (e ′) for the detection data, and the parameter (ρ tilda) corresponding to the hyperplane obtained by the above-described undetermined multiplier method (equation (4)) The presence or absence of abnormality of detection data is determined by comparing. That is, as shown in FIG. 11, when the abnormality degree f (e ′) of the detection data is smaller than the parameter (チル tilde) from the equation (4), the abnormality determination unit 124 determines that the detection data is a hyperplane. It is determined that the detected data is abnormal by determining that it is closer to the origin than Le. On the other hand, when the degree of abnormality f (e ′) of the detection data is larger than the parameter (ρ tilde), the abnormality determination unit 124 determines that the detection data is not on the origin side with respect to the hyperplane Le. Determine that it is normal.

［実施の形態３の効果］
このように、実施の形態３においては、比較データの乖離値ベクトルを高次元空間に写像して原点からの距離が最大化する超平面を求める。そして、実施の形態３では、検知データの乖離値ベクトルを高次元空間に写像した場合に該写像した乖離値ベクトルに対応する点が、超平面から見て原点側にあるか否かを基に異常度を計算して、異常の有無を判定する。すなわち、実施の形態３においても、実施の形態１と同様に、データ間の関係性に基づいて計算した、検知データの乖離値ベクトルと、正常である比較データの乖離値ベクトルの集合との距離によって、検知データの異常の有無を判定しているため、データ間の関係性に基づいた検知対象データの異常検知を精度よく実行することができる。 [Effect of Third Embodiment]
As described above, in the third embodiment, the difference value vector of the comparison data is mapped to the high-dimensional space to obtain a hyperplane at which the distance from the origin is maximized. Then, in the third embodiment, when the divergence value vector of the detection data is mapped to the high-dimensional space, based on whether or not the point corresponding to the mapped divergence value vector is on the origin side with respect to the hyperplane. The degree of abnormality is calculated to determine the presence or absence of an abnormality. That is, also in the third embodiment, as in the first embodiment, the distance between the difference value vector of the detected data and the set of difference value vectors of the comparison data which is normal, calculated based on the relationship between the data. Since the presence or absence of abnormality of detection data is determined by this, abnormality detection of detection object data based on the relationship between data can be performed accurately.

［実施形態のシステム構成について］
図１に示した異常検知装置１０の各構成要素は機能概念的なものであり、必ずしも物理的に図示のように構成されていることを要しない。すなわち、異常検知装置１０の機能の分散および統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散または統合して構成することができる。 [About the system configuration of the embodiment]
Each component of the abnormality detection device 10 illustrated in FIG. 1 is functionally conceptual, and does not necessarily have to be physically configured as illustrated. That is, the specific form of the distribution and integration of the functions of the abnormality detection apparatus 10 is not limited to that illustrated, but all or a part thereof may be functionally or physically in any unit depending on various loads, usage conditions, etc. Can be distributed or integrated.

また、異常検知装置１０においておこなわれる各処理は、全部または任意の一部が、ＣＰＵ（Central Processing Unit）およびＣＰＵにより解析実行されるプログラムにて実現されてもよい。また、異常検知装置１０においておこなわれる各処理は、ワイヤードロジックによるハードウェアとして実現されてもよい。 In addition, each process performed in the abnormality detection apparatus 10 may be realized by all or any part of a CPU (Central Processing Unit) and a program analyzed and executed by the CPU. Further, each process performed in the abnormality detection apparatus 10 may be realized as hardware by wired logic.

また、実施形態において説明した各処理のうち、自動的におこなわれるものとして説明した処理の全部または一部を手動的に行うこともできる。もしくは、手動的におこなわれるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上述および図示の処理手順、制御手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて適宜変更することができる。 Further, among the processes described in the embodiment, all or part of the processes described as being automatically performed can be manually performed. Alternatively, all or part of the processing described as being performed manually may be performed automatically by a known method. In addition, the information including the above-described and illustrated processing procedures, control procedures, specific names, various data and parameters can be appropriately changed unless otherwise specified.

［プログラム］
図１２は、プログラムが実行されることにより、異常検知装置１０が実現されるコンピュータの一例を示す図である。コンピュータ１０００は、例えば、メモリ１０１０、ＣＰＵ１０２０を有する。また、コンピュータ１０００は、ハードディスクドライブインタフェース１０３０、ディスクドライブインタフェース１０４０、シリアルポートインタフェース１０５０、ビデオアダプタ１０６０、ネットワークインタフェース１０７０を有する。これらの各部は、バス１０８０によって接続される。 [program]
FIG. 12 is a diagram illustrating an example of a computer in which the abnormality detection device 10 is realized by executing a program. The computer 1000 includes, for example, a memory 1010 and a CPU 1020. The computer 1000 also includes a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface 1070. These units are connected by a bus 1080.

メモリ１０１０は、ＲＯＭ（Read Only Memory）１０１１及びＲＡＭ１０１２を含む。ＲＯＭ１０１１は、例えば、ＢＩＯＳ（Basic Input Output System）等のブートプログラムを記憶する。ハードディスクドライブインタフェース１０３０は、ハードディスクドライブ１０９０に接続される。ディスクドライブインタフェース１０４０は、ディスクドライブ１１００に接続される。例えば磁気ディスクや光ディスク等の着脱可能な記憶媒体が、ディスクドライブ１１００に挿入される。シリアルポートインタフェース１０５０は、例えばマウス１１１０、キーボード１１２０に接続される。ビデオアダプタ１０６０は、例えばディスプレイ１１３０に接続される。 The memory 1010 includes a ROM (Read Only Memory) 1011 and a RAM 1012. The ROM 1011 stores, for example, a boot program such as a BIOS (Basic Input Output System). The hard disk drive interface 1030 is connected to the hard disk drive 1090. Disk drive interface 1040 is connected to disk drive 1100. For example, a removable storage medium such as a magnetic disk or an optical disk is inserted into the disk drive 1100. The serial port interface 1050 is connected to, for example, a mouse 1110 and a keyboard 1120. The video adapter 1060 is connected to, for example, the display 1130.

ハードディスクドライブ１０９０は、例えば、ＯＳ１０９１、アプリケーションプログラム１０９２、プログラムモジュール１０９３、プログラムデータ１０９４を記憶する。すなわち、異常検知装置１０の各処理を規定するプログラムは、コンピュータ１０００により実行可能なコードが記述されたプログラムモジュール１０９３として実装される。プログラムモジュール１０９３は、例えばハードディスクドライブ１０９０に記憶される。例えば、異常検知装置１０における機能構成と同様の処理を実行するためのプログラムモジュール１０９３が、ハードディスクドライブ１０９０に記憶される。なお、ハードディスクドライブ１０９０は、ＳＳＤ（Solid State Drive）により代替されてもよい。 The hard disk drive 1090 stores, for example, an OS 1091, an application program 1092, a program module 1093, and program data 1094. That is, a program that defines each process of the abnormality detection apparatus 10 is implemented as a program module 1093 in which a code executable by the computer 1000 is described. The program module 1093 is stored, for example, in the hard disk drive 1090. For example, the hard disk drive 1090 stores a program module 1093 for executing the same processing as the functional configuration of the abnormality detection apparatus 10. The hard disk drive 1090 may be replaced by a solid state drive (SSD).

また、上述した実施の形態の処理で用いられる設定データは、プログラムデータ１０９４として、例えばメモリ１０１０やハードディスクドライブ１０９０に記憶される。そして、ＣＰＵ１０２０が、メモリ１０１０やハードディスクドライブ１０９０に記憶されたプログラムモジュール１０９３やプログラムデータ１０９４を必要に応じてＲＡＭ１０１２に読み出して実行する。 The setting data used in the process of the above-described embodiment is stored as program data 1094 in, for example, the memory 1010 or the hard disk drive 1090. Then, the CPU 1020 reads out the program module 1093 and the program data 1094 stored in the memory 1010 and the hard disk drive 1090 to the RAM 1012 as needed, and executes them.

なお、プログラムモジュール１０９３やプログラムデータ１０９４は、ハードディスクドライブ１０９０に記憶される場合に限らず、例えば着脱可能な記憶媒体に記憶され、ディスクドライブ１１００等を介してＣＰＵ１０２０によって読み出されてもよい。あるいは、プログラムモジュール１０９３及びプログラムデータ１０９４は、ネットワーク（ＬＡＮ、ＷＡＮ等）を介して接続された他のコンピュータに記憶されてもよい。そして、プログラムモジュール１０９３及びプログラムデータ１０９４は、他のコンピュータから、ネットワークインタフェース１０７０を介してＣＰＵ１０２０によって読み出されてもよい。 The program module 1093 and the program data 1094 are not limited to being stored in the hard disk drive 1090, and may be stored in, for example, a removable storage medium and read by the CPU 1020 via the disk drive 1100 or the like. Alternatively, the program module 1093 and the program data 1094 may be stored in another computer connected via a network (LAN, WAN, etc.). The program module 1093 and the program data 1094 may be read by the CPU 1020 from another computer via the network interface 1070.

以上、本発明者によってなされた発明を適用した実施の形態について説明したが、本実施の形態による本発明の開示の一部をなす記述及び図面により本発明は限定されることはない。すなわち、本実施の形態に基づいて当業者等によりなされる他の実施の形態、実施例及び運用技術等は全て本発明の範疇に含まれる。 Although the embodiment to which the invention made by the inventor is applied has been described above, the present invention is not limited by the description and the drawings that form a part of the disclosure of the present invention according to the present embodiment. That is, all other embodiments, examples, operation techniques and the like made by those skilled in the art based on the present embodiment are included in the scope of the present invention.

１０異常検知装置
１１通信処理部
１２制御部
１３記憶部
１２１関係性推定部
１２２乖離値ベクトル計算部
１２３異常度計算部
１２４異常判定部
１３１乖離値ベクトル記憶部 DESCRIPTION OF SYMBOLS 10 abnormality detection apparatus 11 communication processing part 12 control part 13 memory | storage part 121 relationship estimation part 122 difference value vector calculation part 123 abnormality degree calculation part 124 abnormality determination part 131 difference value vector storage part

Claims

Based on the relationship between the data, the difference between the observed value of the detected data to be detected and the value estimated from the observed value based on the relationship between the data represented by the multiple regression equation A divergence value representing a divergence that is the difference between the representing value value vector representing, the observation value of the comparison data to be compared, and the value estimated from the observation value based on the relationship between the data represented by the multiple regression equation A deviation value vector calculation unit that calculates a vector,
An abnormality degree calculation unit that calculates the degree of discreteness of the deviation value vector of the detection data from the set of the deviation value vectors of the comparison data as the degree of abnormality.
An abnormality determination unit that determines that the detection data is abnormal when the degree of discreteness exceeds a predetermined threshold;
An abnormality detection device characterized by having.

  Based on the relationship between the data, the difference between the observed value of the detected data to be detected and the value estimated from the observed value based on the relationship between the data represented by the simple regression equation A divergence value representing a divergence that is the difference between the representing value value vector representing, the observation value of the comparison data to be compared, and the value estimated from the observation value based on the relationship between the data represented by the simple regression equation A deviation value vector calculation unit that calculates a vector,
  An abnormality degree calculation unit that calculates the degree of discreteness of the deviation value vector of the detection data from the set of the deviation value vectors of the comparison data as the degree of abnormality.
An abnormality determination unit that determines that the detection data is abnormal when the degree of discreteness exceeds a predetermined threshold;
  An abnormality detection device characterized by having.

  Based on the relationship between the data, the difference between the observed value of the detected data to be detected and the value estimated from the observed value based on the relationship between the data represented by the autoregressive equation A divergence value representing a divergence that is the difference between the representing value value vector representing, the observed value of the comparison data to be compared, and the value estimated from the observed value based on the relationship between the data represented by the autoregressive equation A deviation value vector calculation unit that calculates a vector,
  An abnormality degree calculation unit that calculates the degree of discreteness of the deviation value vector of the detection data from the set of the deviation value vectors of the comparison data as the degree of abnormality.
An abnormality determination unit that determines that the detection data is abnormal when the degree of discreteness exceeds a predetermined threshold;
  An abnormality detection device characterized by having.

The apparatus further includes a divergence value vector storage unit that stores the divergence value vector calculated by the divergence value vector calculation unit,
The abnormality detection device according to any one of claims 1 to 3, wherein the abnormality degree calculation unit calculates the degree of discreteness using a divergence value vector stored in the divergence value vector storage unit.

The information processing apparatus further includes a relationship estimation unit that estimates a relationship between the data and calculates a parameter indicating the relationship between the data,
The said difference value vector calculation part is characterized by applying the parameter which shows the relationship between the said data which the said relationship estimation part calculated to the relationship between the said data, and calculating the said difference value vector. The abnormality detection device according to any one of 1 to 4 .

The abnormality degree calculating unit, wherein the said deviation value vector of the detection data, from the set of divergence value vector of the comparative data, based on the spatial distance and density, and calculates the discrete degree The abnormality detection device according to any one of Items 1 to 5 .

The abnormality degree calculation unit calculates an average of the difference value vectors of the comparison data and a covariance matrix of the difference value vectors of the comparison data, and calculates a difference value vector of the detection data and a difference value vector of the comparison data. 6. The Mahalanobis distance determined based on the average of R and the covariance matrix of the difference value vector of the comparison data is calculated as the degree of discreteness according to any one of claims 1 to 5 . Abnormality detection device.

The abnormality degree calculation unit obtains a plane where the distance from the origin at a point corresponding to the mapped difference value vector is maximized when the difference value vector of the comparison data is mapped to the high dimensional space, and When the difference value vector is mapped to the high-dimensional space, the degree of discreteness is calculated based on whether or not a point corresponding to the mapped difference value vector is on the origin side as viewed from the plane. The abnormality detection device according to any one of claims 1 to 5 .

An anomaly detection method executed by an anomaly detection apparatus that detects the presence or absence of an anomaly with respect to a set of detection data to be detected,
Based on the relationship between the data, the difference between the observed value of the detected data to be detected and the value estimated from the observed value based on the relationship between the data represented by the multiple regression equation A divergence value representing a divergence that is the difference between the representing value value vector representing, the observation value of the comparison data to be compared, and the value estimated from the observation value based on the relationship between the data represented by the multiple regression equation Calculating a vector, and
Calculating the degree of discrepancies of the deviation value vector of the detection data from the set of the deviation value vectors of the comparison data as a degree indicating an abnormality;
Determining that the detection data is abnormal if the degree of discreteness exceeds a predetermined threshold;
An abnormality detection method characterized by including:

  An anomaly detection method executed by an anomaly detection apparatus that detects the presence or absence of an anomaly with respect to a set of detection data to be detected,
  Based on the relationship between the data, the difference between the observed value of the detected data to be detected and the value estimated from the observed value based on the relationship between the data represented by the simple regression equation A divergence value representing a divergence that is the difference between the representing value value vector representing, the observation value of the comparison data to be compared, and the value estimated from the observation value based on the relationship between the data represented by the simple regression equation Calculating a vector, and
  Calculating the degree of discrepancies of the deviation value vector of the detection data from the set of the deviation value vectors of the comparison data as a degree indicating an abnormality;
Determining that the detection data is abnormal if the degree of discreteness exceeds a predetermined threshold;
  An abnormality detection method characterized by including:

  An anomaly detection method executed by an anomaly detection apparatus that detects the presence or absence of an anomaly with respect to a set of detection data to be detected,
  Based on the relationship between the data, the difference between the observed value of the detected data to be detected and the value estimated from the observed value based on the relationship between the data represented by the autoregressive equation A divergence value representing a divergence that is the difference between the representing value value vector representing, the observed value of the comparison data to be compared, and the value estimated from the observed value based on the relationship between the data represented by the autoregressive equation Calculating a vector, and
  Calculating the degree of discrepancies of the deviation value vector of the detection data from the set of the deviation value vectors of the comparison data as a degree indicating an abnormality;
Determining that the detection data is abnormal if the degree of discreteness exceeds a predetermined threshold;
  An abnormality detection method characterized by including:

Based on the relationship between the data, the difference between the observed value of the detected data to be detected and the value estimated from the observed value based on the relationship between the data represented by the multiple regression equation A divergence value representing a divergence that is the difference between the representing value value vector representing, the observation value of the comparison data to be compared, and the value estimated from the observation value based on the relationship between the data represented by the multiple regression equation Calculating a vector, and
Calculating the degree of discrepancies of the deviation value vector of the detection data from the set of the deviation value vectors of the comparison data as a degree indicating an abnormality;
Determining that the detection data is abnormal if the degree of discreteness exceeds a predetermined threshold;
An anomaly detection program to make a computer execute.

  Based on the relationship between the data, the difference between the observed value of the detected data to be detected and the value estimated from the observed value based on the relationship between the data represented by the simple regression equation A divergence value representing a divergence that is the difference between the representing value value vector representing, the observation value of the comparison data to be compared, and the value estimated from the observation value based on the relationship between the data represented by the simple regression equation Calculating a vector, and
  Calculating the degree of discrepancies of the deviation value vector of the detection data from the set of the deviation value vectors of the comparison data as a degree indicating an abnormality;
Determining that the detection data is abnormal if the degree of discreteness exceeds a predetermined threshold;
  An anomaly detection program to make a computer execute.

  Based on the relationship between the data, the difference between the observed value of the detected data to be detected and the value estimated from the observed value based on the relationship between the data represented by the autoregressive equation A divergence value representing a divergence that is the difference between the representing value value vector representing, the observed value of the comparison data to be compared, and the value estimated from the observed value based on the relationship between the data represented by the autoregressive equation Calculating a vector, and
  Calculating the degree of discrepancies of the deviation value vector of the detection data from the set of deviation value vectors of the comparison data as a degree indicating an abnormality;
Determining that the detection data is abnormal if the degree of discreteness exceeds a predetermined threshold;
  An anomaly detection program to make a computer execute.