JP2020098373A

JP2020098373A - Abnormality detection device and program and method for the same

Info

Publication number: JP2020098373A
Application number: JP2018235116A
Authority: JP
Inventors: 智雨宮; Satoshi Amemiya; 秀城阿部; Hideki Abe; 史弥牧原; Fumiya Makihara
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2018-12-17
Filing date: 2018-12-17
Publication date: 2020-06-25
Anticipated expiration: 2038-12-17
Also published as: JP7238378B2

Abstract

To provide an abnormality detection device which can find where and in what time zone an event contributing to an abnormality occurred.SOLUTION: An abnormality detection device generates a failure sign model for a normal data constellation of teacher data; derives a threshold value for abnormality detection by calculating a first abnormality score from the teacher data; generates a first regression model whose explanatory variable is an abnormal data constellation of teacher data and whose objective variable is the first abnormality score about the abnormal data constellation and calculates a first abnormality contribution; calculates a second abnormality score for the failure sign model of actual measurement data outputted from plural respective sensors installed in plural positions of monitoring objects; using the threshold value for abnormality detection, gets a new second abnormality score of the actual measurement data in an abnormal section where the new second abnormal scores among the actual measurement data are equal to or more than the threshold value; generates a second regression model whose explanatory variable is the actual measurement data in the abnormal section and whose objective variable is the new second abnormal score about the actual measurement data in the abnormal section; and calculates a second abnormality contribution by the detection data of the plural sensors from the second regression model.SELECTED DRAWING: Figure 7

Description

本発明は、異常検出装置、異常検出プログラム、及び、異常検出方法に関する。 The present invention relates to an abnormality detection device, an abnormality detection program, and an abnormality detection method.

従来より、要因分析データから事象の結果を表す目的変数の時系列データと、事象の要因を表す説明変数の時系列データとを取得する取得手段と、前記目的変数の時系列データに基づき、複数の目的変数基準値を設定する基準値設定手段と、を備える要因分析装置がある。前記設定された複数の目的変数基準値、及び、前記取得された説明変数の時系列データを学習させ、前記目的変数基準値ごとに、前記目的変数基準値と前記説明変数との関係式を生成し、前記生成された関係式のうち、前記説明変数の係数、及び、前記係数に対応する前記説明変数を抽出する影響度算出手段と、前記抽出された係数を影響度として出力し、さらに、前記抽出された説明変数に関連する説明変数名を出力する出力手段と、をさらに備える（例えば、特許文献１参照）。 Conventionally, acquisition means for acquiring time-series data of an objective variable representing the result of an event from factor analysis data, and time-series data of an explanatory variable representing a factor of an event, and a plurality of acquisition units based on the time-series data of the objective variable. There is a reference value setting means for setting the reference value of the objective variable of 1. The plurality of objective variable reference values that have been set and time series data of the acquired explanatory variables are learned, and a relational expression between the objective variable reference value and the explanatory variable is generated for each objective variable reference value. Then, among the generated relational expressions, a coefficient of the explanatory variable, and an influence degree calculating unit that extracts the explanatory variable corresponding to the coefficient, and the extracted coefficient is output as an influence degree. An output unit that outputs the explanatory variable name related to the extracted explanatory variable is further included (see, for example, Patent Document 1).

国際公開２０１６／０７９９７２号International publication 2016/079972

ところで、従来の要因分析装置は、説明変数名で表される対象物が正常な状態であるか異常な状態であるかを区別していない。また、異常な状態である場合に、説明変数名で表される対象物が異常な状態に寄与する度合を求めることを開示していない。 By the way, the conventional factor analysis device does not distinguish whether the object represented by the explanatory variable name is in a normal state or an abnormal state. Further, it does not disclose obtaining the degree to which the object represented by the explanatory variable name contributes to the abnormal state in the abnormal state.

そこで、異常に寄与する事象が発生した箇所と時間帯を把握できる異常検出装置、異常検出プログラム、及び、異常検出方法を提供することを目的とする。 Therefore, it is an object of the present invention to provide an abnormality detection device, an abnormality detection program, and an abnormality detection method capable of grasping a place and a time zone in which an event contributing to abnormality occurs.

本発明の実施の形態の異常検出装置は、監視対象物の複数の箇所における正常及び異常な事象をそれぞれ時系列的に表す正常データ群及び異常データ群を含む教師データのうちの正常データ群の正規分布を表す故障予兆モデルを生成する予兆モデル生成部と、前記教師データから第１異常スコアを算出する第１異常スコア算出部と、前記第１異常スコアに基づき、異常検知の閾値を導出する閾値導出部と、前記教師データの異常データ群を説明変数とし、前記異常データ群についての前記第１異常スコアを目的変数とする第１回帰モデルを生成する第１回帰モデル生成部と、前記第１回帰モデルから前記複数の箇所における事象の第１異常寄与度を算出する第１寄与度算出部と、前記監視対象物の複数の箇所にそれぞれ設けられる複数のセンサから出力される実測データの前記故障予兆モデルに対する第２異常スコアを算出し、前記異常検知の閾値を用いて、前記実測データのうち前記第２異常スコアが前記閾値以上の異常区間の実測データについての第２異常スコアを求める第２異常スコア算出部と、前記異常区間の実測データを説明変数とし、前記異常区間の実測データについての第２異常スコアを目的変数とする第２回帰モデルを生成する第２回帰モデル生成部と、
前記第２回帰モデルから前記複数のセンサの検出データの第２異常寄与度を算出する第２寄与度算出部とを含む。 The abnormality detection device according to the embodiment of the present invention, a normal data group of the normal data group and the abnormal data group representing a normal and abnormal events in a plurality of locations of the monitored object respectively in a time series of normal data group of the teacher data. A predictive model generator that generates a failure predictive model that represents a normal distribution, a first abnormality score calculator that calculates a first abnormality score from the teacher data, and a threshold for abnormality detection based on the first abnormality score. A threshold value derivation unit, a first regression model generation unit that generates a first regression model in which the abnormal data group of the teacher data is an explanatory variable, and the first abnormal score of the abnormal data group is an objective variable; A first contribution degree calculation unit that calculates a first abnormal contribution degree of an event at the plurality of locations from one regression model, and the measurement data output from a plurality of sensors provided at the plurality of locations of the monitored object, respectively. A second anomaly score for the failure sign model is calculated, and a second anomaly score of the actual measurement data of the abnormal section in which the second anomaly score is equal to or higher than the threshold value is calculated using the anomaly detection threshold value; 2 an abnormality score calculation unit, a second regression model generation unit that generates a second regression model that uses the actual measurement data of the abnormal section as an explanatory variable and the second abnormality score of the actual measurement data of the abnormal section as an objective variable,
And a second contribution calculating unit that calculates a second abnormality contribution of the detection data of the plurality of sensors from the second regression model.

異常に寄与する事象が発生した箇所と時間帯を把握できる異常検出装置、異常検出プログラム、及び、異常検出方法を提供することができる。 It is possible to provide an abnormality detection device, an abnormality detection program, and an abnormality detection method capable of grasping the place and time zone where an event contributing to abnormality has occurred.

実施の形態の異常検出装置１００を示す図である。It is a figure which shows the abnormality detection apparatus 100 of embodiment. 異常検出装置１００を実現するコンピュータシステム２０の斜視図である。2 is a perspective view of a computer system 20 that realizes the abnormality detection device 100. FIG. コンピュータシステム２０の本体部２１内の要部の構成を説明するブロック図である。3 is a block diagram illustrating a configuration of a main part in a main body 21 of the computer system 20. FIG. 異常検出装置１００の構成を示す図である。It is a figure which shows the structure of the abnormality detection apparatus 100. 予兆モデル生成部１１２が生成する故障予兆モデルを示す図である。It is a figure which shows the failure predictive model which the predictive model production|generation part 112 produces|generates. 複数の決定木を用いた回帰モデルと、説明変数ｘ及び目的変数ｙの関係とを示す図である。It is a figure which shows the regression model using a some decision tree, and the relationship of the explanatory variable x and the objective variable y. ６個のセンサ５０によって取得される温度データと異常スコアを示す図である。It is a figure which shows the temperature data and abnormality score acquired by six sensors 50. 回帰モデル生成部１１５が作成するランダムフォレスト回帰モデルの推定値と真値との相関を示す図である。It is a figure which shows the correlation of the estimated value and true value of the random forest regression model which the regression model generation part 115 produces. センサ１〜６の温度データの正常区間と予兆区間とについて求めた異常寄与度を示す図である。It is a figure which shows the abnormal contribution degree calculated|required about the normal area and the sign area of the temperature data of sensors 1-6. 異常検出装置１００の制御装置１１０が実行するフローチャートを示す図である。3 is a diagram showing a flowchart executed by a control device 110 of the abnormality detection device 100. FIG. 故障予兆モデル及びテストデータの関係と異常スコアを示す図である。It is a figure which shows the relationship between a failure sign model and test data, and an abnormality score.

以下、本発明の異常検出装置、異常検出プログラム、及び、異常検出方法を適用した実施の形態について説明する。 Hereinafter, an embodiment to which the abnormality detection device, the abnormality detection program, and the abnormality detection method of the present invention are applied will be described.

＜実施の形態＞
図１は、実施の形態の異常検出装置１００を示す図である。異常検出装置１００には、複数のセンサ５０が接続されている。複数のセンサ５０は、一例として光ファイバケーブル１０に沿って設けられており、光ファイバケーブル１０の温度を検出する温度センサである。センサ５０が検出した温度を表すデータ（温度データ）は、異常検出装置１００に入力される。センサ５０によって取得される温度データには、タイムスタンプが付与される。すなわち、温度データには、検出された時刻を表す時刻データが関連付けられる。 <Embodiment>
FIG. 1 is a diagram showing an abnormality detection device 100 according to an embodiment. A plurality of sensors 50 are connected to the abnormality detection device 100. The plurality of sensors 50 are, for example, provided along the optical fiber cable 10 and are temperature sensors that detect the temperature of the optical fiber cable 10. Data representing the temperature detected by the sensor 50 (temperature data) is input to the abnormality detection device 100. A time stamp is added to the temperature data acquired by the sensor 50. That is, the temperature data is associated with time data representing the detected time.

異常検出装置１００は、複数のセンサ５０から入力される温度データに基づき、複数のセンサ５０のうちのどのセンサ５０によって検出された温度データに異常があるかどうかを検出する。複数のセンサ５０が設けられる光ファイバケーブル１０は、異常検出装置１００によって監視が行われる監視対象物の一例である。 The abnormality detection device 100 detects, based on the temperature data input from the plurality of sensors 50, which of the plurality of sensors 50 the temperature data detected by the sensor 50 is abnormal. The optical fiber cable 10 provided with the plurality of sensors 50 is an example of an object to be monitored by the abnormality detection device 100.

図２は、異常検出装置１００を実現するコンピュータシステム２０の斜視図である。図２に示すコンピュータシステム２０は、本体部２１、ディスプレイ２２、キーボード２３、マウス２４、及びモデム２５を含む。 FIG. 2 is a perspective view of a computer system 20 that realizes the abnormality detection device 100. The computer system 20 shown in FIG. 2 includes a main body 21, a display 22, a keyboard 23, a mouse 24, and a modem 25.

本体部２１は、ＣＰＵ（Central Processing Unit：中央演算装置）、ＨＤＤ（Hard Disk Drive：ハードディスクドライブ）、及びディスクドライブ等を内蔵する。ディスプレイ２２は、本体部２１からの指示により画面２２Ａ上に処理結果等を表示する。ディスプレイ２２は、例えば、液晶モニタであればよい。キーボード２３は、コンピュータシステム２０に種々の情報を入力するための入力部である。マウス２４は、ディスプレイ２２の画面２２Ａ上の任意の位置を指定する入力部である。モデム２５は、外部のデータベース等にアクセスして他のコンピュータシステムに記憶されているプログラム等をダウンロードする。 The main body 21 includes a CPU (Central Processing Unit), a HDD (Hard Disk Drive), a disk drive, and the like. The display 22 displays processing results and the like on the screen 22A according to an instruction from the main body 21. The display 22 may be, for example, a liquid crystal monitor. The keyboard 23 is an input unit for inputting various information to the computer system 20. The mouse 24 is an input unit for designating an arbitrary position on the screen 22A of the display 22. The modem 25 accesses an external database or the like to download a program or the like stored in another computer system.

コンピュータシステム２０に異常検出装置１００としての機能を持たせるプログラムは、ディスク２７等の可搬型記録媒体に格納されるか、モデム２５等の通信装置を使って他のコンピュータシステムの記録媒体２６からダウンロードされ、コンピュータシステム２０に入力されてコンパイルされる。 The program that causes the computer system 20 to function as the abnormality detection device 100 is stored in a portable recording medium such as a disk 27 or downloaded from a recording medium 26 of another computer system using a communication device such as a modem 25. And is input to the computer system 20 to be compiled.

コンピュータシステム２０に異常検出装置１００としての機能を持たせるプログラム（異常検出プログラム）は、コンピュータシステム２０を異常検出装置１００として動作させる。このプログラムは、例えばディスク２７等のコンピュータ読み取り可能な記録媒体に格納されていてもよい。コンピュータ読み取り可能な記録媒体は、ディスク２７、ＩＣカードメモリ、フロッピー（登録商標）ディスク等の磁気ディスク、光磁気ディスク、ＣＤ−ＲＯＭ、ＵＳＢ(Universal Serial Bus)メモリ等の可搬型記録媒体に限定されるものではない。コンピュータ読み取り可能な記録媒体は、モデム２５又はＬＡＮ等の通信装置を介して接続されるコンピュータシステムでアクセス可能な各種記録媒体を含む。 A program (abnormality detection program) that causes the computer system 20 to function as the abnormality detection device 100 causes the computer system 20 to operate as the abnormality detection device 100. This program may be stored in a computer-readable recording medium such as the disk 27, for example. The computer-readable recording medium is limited to a portable recording medium such as a disk 27, an IC card memory, a magnetic disk such as a floppy (registered trademark) disk, a magneto-optical disk, a CD-ROM, a USB (Universal Serial Bus) memory, or the like. Not something. The computer-readable recording medium includes various recording media accessible by a computer system connected via a communication device such as a modem 25 or a LAN.

図３は、コンピュータシステム２０の本体部２１内の要部の構成を説明するブロック図である。本体部２１は、バス３０によって接続されたＣＰＵ３１、ＲＡＭ又はＲＯＭ等を含むメモリ部３２、ディスク２７用のディスクドライブ３３、及びハードディスクドライブ（ＨＤＤ）３４を含む。 FIG. 3 is a block diagram illustrating a configuration of a main part in the main body 21 of the computer system 20. The main body 21 includes a CPU 31 connected by a bus 30, a memory 32 including a RAM or a ROM, a disk drive 33 for the disk 27, and a hard disk drive (HDD) 34.

なお、コンピュータシステム２０は、図２及び図３に示す構成のものに限定されず、各種周知の要素を付加してもよく、又は代替的に用いてもよい。 The computer system 20 is not limited to the configuration shown in FIGS. 2 and 3, and various well-known elements may be added or alternatively used.

図４は、異常検出装置１００の構成を示す図である。異常検出装置１００は、制御装置１１０、操作部１２０、表示部１３０を含む。 FIG. 4 is a diagram showing the configuration of the abnormality detection device 100. The abnormality detection device 100 includes a control device 110, an operation unit 120, and a display unit 130.

制御装置１１０は、主制御部１１１、予兆モデル生成部１１２、異常スコア算出部１１３、閾値導出部１１４、回帰モデル生成部１１５、寄与度算出部１１６、及びメモリ１１７を有する。 The control device 110 includes a main control unit 111, a predictive model generation unit 112, an abnormality score calculation unit 113, a threshold derivation unit 114, a regression model generation unit 115, a contribution degree calculation unit 116, and a memory 117.

主制御部１１１、予兆モデル生成部１１２、異常スコア算出部１１３、閾値導出部１１４、回帰モデル生成部１１５、寄与度算出部１１６は、制御装置１１０を実現するコンピュータによって実現される機能を表したものであり、メモリ１１７は、制御装置１１０を実現するコンピュータのメモリを機能的に表したものである。 The main control unit 111, the predictive model generation unit 112, the abnormality score calculation unit 113, the threshold value derivation unit 114, the regression model generation unit 115, and the contribution degree calculation unit 116 represent the functions realized by the computer that realizes the control device 110. The memory 117 is a functional representation of the memory of the computer that implements the control device 110.

主制御部１１１は、制御装置１１０を統括する処理部であり、予兆モデル生成部１１２、異常スコア算出部１１３、閾値導出部１１４、回帰モデル生成部１１５、寄与度算出部１１６が実行する処理以外の処理を実行する。なお、主制御部１１１が行う具体的な処理については、図１０のフローチャートを用いて後述する。 The main control unit 111 is a processing unit that controls the control device 110, and is other than the processes executed by the predictive model generation unit 112, the abnormality score calculation unit 113, the threshold derivation unit 114, the regression model generation unit 115, and the contribution degree calculation unit 116. The process of is executed. The specific processing performed by the main control unit 111 will be described later with reference to the flowchart of FIG.

予兆モデル生成部１１２は、光ファイバケーブル１０の複数のセンサ５０で検出された温度データを時系列的に表す教師データのうちの正常データ群の正規分布を表す故障予兆モデルを生成する。故障予兆モデルは、光ファイバケーブル１０の故障の予兆を検出（分析）するために用いるモデルである。正常データ群とは、正常データの集合であり、正常データのデータセット（データ群）である。 The predictive model generation unit 112 generates a predictive failure model that represents a normal distribution of normal data groups in the teacher data that represents the temperature data detected by the plurality of sensors 50 of the optical fiber cable 10 in time series. The failure sign model is a model used to detect (analyze) a sign of failure of the optical fiber cable 10. The normal data group is a set of normal data, and is a data set (data group) of normal data.

教師データは、正常な温度データと異常な温度データを含む。正常な温度データは正常データの一例であり、異常な温度データは異常データの一例である。教師データは、正常な温度データと異常な温度データが検出されたセンサ５０が特定されていればよく、センサ５０の実測データ（実測値）であってもよく、人工的に作成したデータであってもよい。なお、異常な温度データとは、温度値が所定値より大きい温度データである。所定値は、一例として、光ファイバケーブル１０の異常温度の下限値に設定すればよい。 The teacher data includes normal temperature data and abnormal temperature data. Normal temperature data is an example of normal data, and abnormal temperature data is an example of abnormal data. The teacher data only needs to specify the sensor 50 in which the normal temperature data and the abnormal temperature data have been detected, may be actual measurement data (actual measurement value) of the sensor 50, or is artificially created data. May be. The abnormal temperature data is temperature data whose temperature value is larger than a predetermined value. For example, the predetermined value may be set to the lower limit value of the abnormal temperature of the optical fiber cable 10.

異常スコア算出部１１３は、教師データから異常スコアを算出する。また、異常スコア算出部１１３は、光ファイバケーブル１０に設けられる複数のセンサ５０から出力される実測の温度データの故障予兆モデルに対する異常スコアを算出し、閾値導出部１１４によって導出される異常検知の閾値を用いて、実測の温度データのうち異常スコアが閾値以上の異常区間の実測データについての異常スコアを求める。異常区間とは、温度データが異常である期間（時間帯）である。異常スコア算出部１１３は、第１異常スコア算出部及び第２異常スコア算出部の一例である。 The abnormality score calculation unit 113 calculates an abnormality score from the teacher data. Further, the abnormality score calculation unit 113 calculates an abnormality score for the failure sign model of the actually measured temperature data output from the plurality of sensors 50 provided in the optical fiber cable 10, and detects the abnormality detected by the threshold value deriving unit 114. Using the threshold value, the abnormality score of the actual measurement data of the abnormal section in which the abnormality score is equal to or higher than the threshold value is obtained from the measured temperature data. The abnormal section is a period (time zone) in which the temperature data is abnormal. The abnormality score calculation unit 113 is an example of a first abnormality score calculation unit and a second abnormality score calculation unit.

閾値導出部１１４は、異常スコア算出部１１３によって算出される異常スコアに基づき、異常検知の閾値を導出する。 The threshold derivation unit 114 derives an abnormality detection threshold based on the abnormality score calculated by the abnormality score calculation unit 113.

回帰モデル生成部１１５は、教師データの異常データ群を説明変数とし、異常データ群についての異常スコアを目的変数とするランダムフォレスト回帰モデルを生成する。また、回帰モデル生成部１１５は、異常区間の実測データを説明変数とし、異常区間の実測データについての異常スコアを目的変数とするランダムフォレスト回帰モデルを生成する。回帰モデル生成部１１５は、第１回帰モデル生成部及び第２回帰モデル生成部の一例である。なお、ランダムフォレスト回帰モデルは、非線形の回帰モデルである。 The regression model generation unit 115 generates a random forest regression model in which the abnormal data group of the teacher data is the explanatory variable and the abnormal score of the abnormal data group is the objective variable. Further, the regression model generation unit 115 generates a random forest regression model in which the actual measurement data of the abnormal section is the explanatory variable and the abnormality score of the actual measurement data of the abnormal section is the objective variable. The regression model generation unit 115 is an example of a first regression model generation unit and a second regression model generation unit. The random forest regression model is a non-linear regression model.

寄与度算出部１１６は、教師データに基づくランダムフォレスト回帰モデルから複数のセンサ５０によって取得される温度データの異常寄与度を算出する。また、寄与度算出部１１６は、実測データに基づくランダムフォレスト回帰モデルから複数のセンサ５０によって取得される温度データの異常寄与度を算出する。寄与度算出部１１６は、第１寄与度算出部及び第２寄与度算出部の一例である。 The contribution calculating unit 116 calculates the abnormal contribution of the temperature data acquired by the plurality of sensors 50 from the random forest regression model based on the teacher data. Further, the contribution degree calculation unit 116 calculates the abnormality contribution degree of the temperature data acquired by the plurality of sensors 50 from the random forest regression model based on the actual measurement data. The contribution degree calculation unit 116 is an example of a first contribution degree calculation unit and a second contribution degree calculation unit.

教師データに基づく異常寄与度は、複数のセンサ５０の各々によって取得される温度データが異常データの発生に与える寄与度であり、実測データに基づく異常寄与度は、複数のセンサ５０の各々によって取得される温度データが異常データの発生に与える寄与度である。 The anomalous contribution rate based on the teacher data is the degree of contribution of the temperature data acquired by each of the plurality of sensors 50 to the generation of the anomaly data, and the anomaly contribution rate based on the actual measurement data is acquired by each of the plurality of sensors 50. It is the degree of contribution that the temperature data made gives to the occurrence of abnormal data.

教師データに基づく異常寄与度及び実測データに基づく異常寄与度は、ジニ係数の減少率である。このため、教師データに基づく異常寄与度及び実測データに基づく異常寄与度は、それぞれ、教師データに基づくランダムフォレスト回帰モデル及び教師データに基づくランダムフォレスト回帰モデルから得られるデータの不均等度の減少度合である。 The degree of abnormality contribution based on the teacher data and the degree of abnormality contribution based on the actual measurement data are reduction rates of the Gini coefficient. Therefore, the anomalous contribution rate based on the teacher data and the anomalous contribution rate based on the actual measurement data are calculated based on the reduction degree of the data non-uniformity obtained from the random forest regression model based on the teacher data and the random forest regression model based on the teacher data, respectively. Is.

ジニ係数は、データが不均等であるほど大きな値を取る。このため、ジニ係数の減少率で表される温度データの教師データに基づく異常寄与度及び実測データに基づく異常寄与度は、ある温度データが異常な温度データの発生に寄与する度合を表す。 The Gini coefficient takes a larger value as the data is more uneven. Therefore, the anomalous contribution rate based on the teacher data of the temperature data and the anomalous contribution rate based on the actual measurement data represented by the reduction rate of the Gini coefficient represent the degree to which certain temperature data contributes to the generation of abnormal temperature data.

メモリ１１７は、制御装置１１０が実行する処理に必要なプログラム及びデータ等を格納するとともに、センサ５０から受信した温度データを格納する。また、メモリ１１８は、異常検出装置１００が処理を行う上で必要なその他のデータ等を格納する。 The memory 117 stores programs, data, and the like necessary for processing executed by the control device 110, and also stores temperature data received from the sensor 50. The memory 118 also stores other data necessary for the abnormality detection device 100 to perform processing.

操作部１２０は、例えば、キーボード及びマウス等である。表示部１３０は、例えば、ディスプレイパネル等である。 The operation unit 120 is, for example, a keyboard and a mouse. The display unit 130 is, for example, a display panel or the like.

次に、予兆モデル生成部１１２が生成する故障予兆モデルについて説明する。図５は、予兆モデル生成部１１２が生成する故障予兆モデルを示す図である。図５では、光ファイバケーブル１０（図１参照）の複数のセンサ５０で検出された温度データを時系列的に表す教師データのうちの正常データ群の各データを×印で示す。 Next, the failure predictive model generated by the predictive model generator 112 will be described. FIG. 5 is a diagram showing a failure predictive model generated by the predictive model generator 112. In FIG. 5, each data of the normal data group of the teacher data that represents the temperature data detected by the plurality of sensors 50 of the optical fiber cable 10 (see FIG. 1) in time series is indicated by an X mark.

故障予兆モデルは、このような教師データのうちの正常データ群の正規分布を表すため、図５に点線で示す楕円は、正規分布の確率密度が一定になる等高線である。また、ここでは横軸がセンサ１、縦軸がセンサ２という二次元平面で示すが、実際はセンサ５０の数LだけあるL次元空間で正規分布が表される。 Since the failure sign model represents the normal distribution of the normal data group in such teacher data, the ellipse shown by the dotted line in FIG. 5 is a contour line where the probability density of the normal distribution is constant. Further, here, the horizontal axis is the sensor 1 and the vertical axis is the sensor 2, which is shown by a two-dimensional plane.

ここで、データ群ｘ、平均μに対して、教師データのうちの正常データ群の正規分布を表す故障予兆モデルp(x|μ，Σ)は、次式（１）で表すことができる。右辺のN(x|μ，Σ)は正規分布を表す。 Here, the failure sign model p(x|μ, Σ) that represents the normal distribution of the normal data group of the teacher data with respect to the data group x and the average μ can be expressed by the following equation (1). N(x|μ, Σ) on the right side represents the normal distribution.

また、正規分布の中心から距離が離れるほど正常値ではなく異常値に近づくため、ここでは正規分布の中心からの距離を異常スコアとして取り扱う。異常スコアa(x)は、次式（２）で表すことができる。 Further, as the distance from the center of the normal distribution approaches an abnormal value instead of a normal value, the distance from the center of the normal distribution is treated as an abnormal score here. The abnormality score a(x) can be expressed by the following equation (2).

また、閾値を用いて異常スコアが異常を表しているかどうかを判定するには、次式（３）を用いればよい。 Further, in order to determine whether or not the abnormality score represents abnormality using the threshold value, the following equation (3) may be used.

次に、ランダムフォレスト回帰モデルの回帰係数の算出方法について説明する。ランダムフォレスト回帰モデルの回帰係数とは、異常寄与度のことである。 Next, a method of calculating the regression coefficient of the random forest regression model will be described. The regression coefficient of the random forest regression model is the degree of abnormal contribution.

一般的な線形回帰モデルyは次式（４）で表される。 A general linear regression model y is represented by the following equation (4).

ここで、データＸ，重みｗ、結果Ｙを次のようにする。 Here, the data X, the weight w, and the result Y are set as follows.

ｙは、データＸから得られる結果である。ＸはＭ×Ｎ行列とする。ｗはＭ次元ベクトルとする。また、Xの要素Ｘ_ｊをＮ次元ベクトルとする。ｙもＮ次元ベクトルとする。これはＸ、ｙがＮ個のデータ群を含むことを意味する。なお、これを行列として表現すると次式（６）のようになる。 y is the result obtained from the data X. X is an M×N matrix. w is an M-dimensional vector. Further, the element X _j of X is an N-dimensional vector. Let y also be an N-dimensional vector. This means that X and y include N data groups. If this is expressed as a matrix, the following expression (6) is obtained.

ここで、Ｘ_０＝１、Ｘ_１＝ｘ、Ｘ_２＝ｘ^２・・・とすれば、線形回帰モデルｙは、次式（７）の多項式回帰モデルで表すことができる。ここで太字の１は要素がすべて１のベクトルを表す。 Here, if X ₀ =1, X ₁ =x, X ₂ =x ^2, ..., The linear regression model y can be represented by the polynomial regression model of the following equation (7). Here, the bold 1 represents a vector in which all the elements are 1.

次に、回帰係数を算出する。N個のデータ群を用意したときに、現実的には式（４）のように等号にはならずに、次式（８）のように近似式で表すことになる。 Next, the regression coefficient is calculated. When N data groups are prepared, they are not expressed as equal signs in reality as in Expression (4), but are represented by approximate expressions as in Expression (8) below.

そのため、ｙとｗ^Ｔとが最も等しくなるようなｗの推定値（ｗハット）を探す問題を解くことになる。この「最も等しくなる」というのを数式で表現すると、次式（９）に示すように二乗和誤差関数を最小化(最小二乗法)するということと同義となる。 Therefore, the problem of finding an estimated value of w (w hat) such that y and w ^T are most equal is solved. When this “being equal to” is expressed by a mathematical expression, it is synonymous with minimizing the square sum error function (least square method) as shown in the following expression (9).

最小化するということは式（９）をｗで微分して、0となるところが最小であるため、微分して0として整理すると、次式（１０）ように回帰係数（偏回帰係数）を算出できる。 Minimizing means that the equation (9) is differentiated by w and the point where it becomes 0 is the minimum, so if it is differentiated and rearranged as 0, the regression coefficient (partial regression coefficient) is calculated as the following equation (10). it can.

多項式の場合はXに適宜、多項式の要素を代入すればよい。この回帰係数をここでは異常寄与度としている。異常寄与度を重要度として捉えてもよい。 In the case of a polynomial, the elements of the polynomial may be substituted for X as appropriate. This regression coefficient is defined as the degree of abnormal contribution here. The degree of abnormality contribution may be regarded as the degree of importance.

次に、図６を用いてランダムフォレスト回帰について説明する。図６は、複数の決定木を用いた回帰モデルと、説明変数ｘ及び目的変数ｙの関係とを示す図である。 Next, the random forest regression will be described with reference to FIG. FIG. 6 is a diagram showing a regression model using a plurality of decision trees and the relationship between the explanatory variable x and the objective variable y.

ランダムフォレスト回帰とは、複数の決定木を用いた回帰モデルの作成手法のことである。ここで決定木とは図６（Ａ）に示すように説明変数ｘの値を分岐させ、目的変数ｙを推定するモデルのことを指す。これによって、図６（Ｂ）の点線で示す回帰モデルが得られる。 Random forest regression is a method of creating a regression model using multiple decision trees. Here, the decision tree refers to a model in which the value of the explanatory variable x is branched and the target variable y is estimated as shown in FIG. 6(A). As a result, the regression model shown by the dotted line in FIG. 6B is obtained.

ランダムフォレスト回帰では、教師データのデータ群からランダムにサブデータ群を取得して、サブデータ群毎に決定木を作る。このとき、サブデータ群の数だけ決定木ができる。すると、図６（Ｂ）の点線で示される回帰モデルも決定木の数だけ得られるが、複数の決定木の出力結果の平均値を得ることで、より精度の高い回帰モデルを作ることができる。これがランダムフォレスト回帰の特徴の一つである。 In random forest regression, a sub-data group is randomly acquired from the data group of the teacher data, and a decision tree is created for each sub-data group. At this time, there are as many decision trees as there are sub-data groups. Then, although the regression model shown by the dotted line in FIG. 6B can be obtained by the number of decision trees, a more accurate regression model can be created by obtaining the average value of the output results of a plurality of decision trees. .. This is one of the characteristics of random forest regression.

ここで決定木（回帰木）に関する重要なポイントの一つに、枝を増やすことで複雑な木を作ることができるということである。ここで枝とは、Yes-Noの分岐のことで、この数が増えると木も複雑となり、結果として図６（Ｂ）の回帰モデルもより複雑で表現能力の高いものができる。 One of the important points about the decision tree (regression tree) here is that you can create a complex tree by increasing the number of branches. Here, a branch is a Yes-No branch, and as this number increases, the tree becomes complicated, and as a result, the regression model in FIG. 6B can be made more complicated and have high expressive power.

一方で、複雑で表現能力が高いほど、決定木（回帰木）に用いたデータに過剰にフィットした回帰モデルができてしまう（これを過学習という）。すると、別の新規で得たデータに対しては、作った決定木（回帰木）が全く合わない可能性が出てくる。つまり、過剰に複雑で表現能力が高いと未知のデータに対しては性能が低下する可能性がでてくる。したがって、表現能力の度合いを調整する必要があり、ある基準（分割基準）を用いて表現能力を調整してもっとも高性能な決定木（回帰木）を選ぶ必要がある。この基準となる統計量として、よく使われるのがジニ係数（ジニ不純度、不純度）の減少率となる。 On the other hand, the more complex and expressive the result, the more the regression model that fits the data used for the decision tree (regression tree) will be created (this is called overlearning). Then, there is a possibility that the created decision tree (regression tree) does not match the newly obtained data. In other words, if the data is overly complex and has high expressiveness, the performance may deteriorate for unknown data. Therefore, it is necessary to adjust the degree of expression ability, and it is necessary to adjust the expression ability using a certain criterion (division criterion) and select the highest performance decision tree (regression tree). As a statistic that serves as a reference, a reduction rate of the Gini coefficient (Gini impurity, impurity) is often used.

ジニ係数の定義式を次式（１１）に示す。 The definition equation of the Gini coefficient is shown in the following equation (11).

ここでCはカテゴリ（分類）の数、Nは教師データ数、n_iはカテゴリiに属する教師データ数、tはノードとなる。ノードとは、決定木（回帰木）の節のことで、ある条件に対してYesなのかNoなのかを判断する部分となる（図６（Ａ）参照）。 Here, C is the number of categories (classification), N is the number of teacher data, n _i is the number of teacher data belonging to category i, and t is a node. A node is a node of a decision tree (regression tree), which is a part for determining whether a certain condition is Yes or No (see FIG. 6A).

式（１１）の意味は、あるノードtにはどれくらいのカテゴリが含まれるかということを表す(不純度という)。多数のカテゴリがあれば不純度が高く、カテゴリが少なければ不純度が少ないという。不純度が少なければ、ジニ係数G(t)は0に近づき、不純度が多いと1に近づく。たとえば、n_i=Nであるとき、ジニ係数G(t)は0となるが、これはカテゴリがiしかない場合で、不純度がないということになる。またこれと類似した指標にエントロピーがある。 The meaning of Expression (11) represents how many categories are included in a certain node t (called impure). If there are a large number of categories, the purity is high, and if there are few categories, the purity is low. If the impurity is low, the Gini coefficient G(t) approaches 0, and if the impurity is high, the Gini coefficient G(t) approaches 1. For example, when n _i =N, the Gini coefficient G(t) is 0, which means that there are only i categories and there is no impurity. An index similar to this is entropy.

ジニ係数の減少率は次式（１２）で表現する。 The reduction rate of the Gini coefficient is expressed by the following equation (12).

ここで、G(t_B)は分岐前のノードt_Bのジニ係数、G(t_L)は分岐後の左ノードのジニ係数、G(t_R)は分岐後の右ノードのジニ係数となる。w_Lとw_R分岐後のノードの重み（分岐前に対するデータ量の割合）を表す。式（１２）は分岐前のジニ係数と分岐後の左右のノードのジニ係数の合計との差を計算している。そのため、式（１２）は分岐した時の不純度が低くなっていれば大きくなるので、式（１２）の意味は不純度の減少率を表し、うまくデータを分割（カテゴリに分類）できているか否かを表す指標となる。この指標を用いることで不純度を最大限減らす組を探し、最適な分割を定量的に探索することができる。 Where G(t _B ) is the Gini coefficient of the node t _B before the branch, G(t _L ) is the Gini coefficient of the left node after the branch, and G(t _R ) is the Gini coefficient of the right node after the branch. .. w _L and w _R Represents the weight of the node after branching (ratio of data amount before branching). Expression (12) calculates the difference between the Gini coefficient before branching and the sum of the Gini coefficients of the left and right nodes after branching. Therefore, since the expression (12) becomes large if the impurity at the time of branching is low, the meaning of the expression (12) represents the reduction rate of the impurity, and whether the data is successfully divided (classified into categories). It is an index that shows whether or not. By using this index, it is possible to search for a group that reduces the impurity to the maximum extent and to quantitatively search for the optimal division.

そして、ジニ係数の減少率を用いることで変数の重要度（寄与度）も分かる。変数が複数ある場合、どの変数を分割したかで、どの程度の不純度が減少するかが式（１２）を用いることで算出できる。したがって、変数に対するジニ係数（不純度）の減少率の度合いによって、変数の重要度（寄与度）を表現できる。 Then, the importance (contribution) of the variable can be known by using the reduction rate of the Gini coefficient. When there are a plurality of variables, it is possible to calculate how much the impurity is reduced by dividing the variable by using Expression (12). Therefore, the degree of importance (contribution) of a variable can be expressed by the degree of reduction rate of the Gini coefficient (impurity) with respect to the variable.

図７は、６個のセンサ５０によって取得される温度データと異常スコアを示す図である。ここでは、６個のセンサ５０をセンサ１〜６として区別する。図７（Ａ）において、横軸は時間を表し、縦軸は温度（単位なし）を表す。 FIG. 7 is a diagram showing temperature data and abnormality scores acquired by the six sensors 50. Here, the six sensors 50 are distinguished as the sensors 1 to 6. In FIG. 7A, the horizontal axis represents time and the vertical axis represents temperature (no unit).

図７（Ａ）には、センサ１〜６で取得された温度データの時系列的な変化の様子を示す。センサ１〜６の各々によって時系列的に取得された複数の温度データは、温度データのデータ群である。正常区間は、異常が生じていない区間であり、異常が生じていない期間（時間帯）である。また、予兆区間は、故障の予兆が生じている区間であり、破線の円で囲むようにセンサ３、４で予兆が生じている。 FIG. 7A shows how the temperature data acquired by the sensors 1 to 6 changes in time series. The plurality of temperature data acquired in time series by each of the sensors 1 to 6 is a data group of temperature data. The normal section is a section in which no abnormality has occurred, and is a period (time zone) in which no abnormality has occurred. The sign section is a section in which a sign of failure has occurred, and the sign is generated in the sensors 3 and 4 so as to be surrounded by a broken line circle.

予兆区間は、一例として温度の変動の幅が正常区間の２倍以上になる区間である。正常区間と予兆区間との間には、間隔がある。なお、予兆区間の後に故障が生じる故障区間が続く場合がある。ここでは、異常とは、故障の予兆と故障とを包含する概念である。 The predictive section is, for example, a section in which the range of temperature fluctuation is twice or more the normal section. There is an interval between the normal section and the predictive section. Note that a failure section in which a failure occurs may follow the predictive section. Here, the abnormality is a concept including a sign of failure and a failure.

図７（Ｂ）は、センサ３の正常区間の始まりから予兆区間の終わりまでの温度データから求めた異常スコアを示す。異常スコアは、異常スコア算出部１１３によって求められる。正常区間における異常スコアに比べて、予兆区間の異常スコアが高くなっていることが分かる。 FIG. 7B shows an abnormality score obtained from the temperature data from the beginning of the normal section of the sensor 3 to the end of the precursor section. The abnormality score is obtained by the abnormality score calculation unit 113. It can be seen that the abnormality score in the sign segment is higher than the abnormality score in the normal segment.

図８は、回帰モデル生成部１１５が作成するランダムフォレスト回帰モデルの推定値と真値との相関を示す図である。 FIG. 8 is a diagram showing the correlation between the estimated value and the true value of the random forest regression model created by the regression model generation unit 115.

回帰モデル生成部１１５は、教師データの異常データ群を説明変数とし、異常データ群についての異常スコアを目的変数とするランダムフォレスト回帰モデルと、異常区間の実測データを説明変数とし、異常区間の実測データについての異常スコアを目的変数とするランダムフォレスト回帰モデルとを生成するが、ここでは、説明のために、図７（Ａ）の正常区間と予兆区間とについて作成したランダムフォレスト回帰モデルの推定値と真値との相関について説明する。 The regression model generation unit 115 uses the abnormal data group of the teacher data as an explanatory variable, the random forest regression model having the abnormal score of the abnormal data group as an objective variable, and the actual measurement data of the abnormal section as the explanatory variable, and measures the abnormal section. A random forest regression model in which the anomaly score for the data is used as the objective variable is generated, but here, for the sake of explanation, the estimated value of the random forest regression model created for the normal interval and the predictive interval in FIG. 7A. The correlation between and the true value will be described.

図８（Ａ）は、正常区間の時系列的な温度データについて作成したランダムフォレスト回帰モデルで得られる推定値（横軸）と真値（縦軸）とを示す図である。Ｒ２スコアは０．９５であり、相関が高いことが分かる。 FIG. 8A is a diagram showing an estimated value (horizontal axis) and a true value (vertical axis) obtained by a random forest regression model created for time-series temperature data in a normal section. The R2 score is 0.95, which shows that the correlation is high.

図８（Ｂ）は、予兆区間の時系列的な温度データについて作成したランダムフォレスト回帰モデルで得られる推定値（横軸）と真値（縦軸）とを示す図である。Ｒ２スコアは０．９７であり、相関が高いことが分かる。 FIG. 8B is a diagram showing an estimated value (horizontal axis) and a true value (vertical axis) obtained by the random forest regression model created for the time series temperature data of the predictive section. The R2 score is 0.97, which shows that the correlation is high.

図９は、センサ１〜６の温度データの正常区間と予兆区間とについて求めた異常寄与度を示す図である。ハッチングで示す棒グラフは、センサ１〜６の正常区間の温度データについて求めた異常寄与度を表し、白抜きの棒グラフは、センサ１〜６の異常区間の温度データについて求めた異常寄与度を表す。 FIG. 9 is a diagram showing the degree of abnormality contribution obtained for the normal section and the sign section of the temperature data of the sensors 1 to 6. The hatched bar graphs represent the anomalous contributions calculated for the temperature data in the normal sections of the sensors 1 to 6, and the open bar graphs represent the anomalous contributions found in the temperature data of the sensors 1 to 6 in the abnormal sections.

正常区間の温度データについて求めた異常寄与度は、図７に示すセンサ１〜６の各々の正常区間の温度データを説明変数とし、センサ１〜６の各々の正常区間の温度データから求めた異常スコアを目的変数とするランダムフォレスト回帰モデルから算出したものである。 The abnormality contribution obtained for the temperature data of the normal section is an abnormality obtained from the temperature data of the normal section of each of the sensors 1 to 6 using the temperature data of each of the sensors 1 to 6 shown in FIG. 7 as explanatory variables. It is calculated from a random forest regression model in which the score is the objective variable.

異常区間の温度データについて求めた異常寄与度は、図７に示すセンサ１〜６の各々の異常区間の温度データを説明変数とし、センサ１〜６の各々の異常区間の温度データから求めた異常スコアを目的変数とするランダムフォレスト回帰モデルから算出したものである。 The abnormality contribution obtained for the temperature data of the abnormal section is an abnormality obtained from the temperature data of each abnormal section of the sensors 1 to 6 using the temperature data of each abnormal section of the sensors 1 to 6 shown in FIG. 7 as an explanatory variable. It is calculated from a random forest regression model in which the score is the objective variable.

正常区間の異常寄与度は、センサ１〜６で大きな差はないが、異常区間の異常寄与度は、センサ１、２、５、６に比べてセンサ３、４の値が大きくなっている。このことから、上述のようにして異常区間について求めた異常寄与度を用いれば、異常な温度データを取得したセンサ（ここではセンサ３、４）を特定することができる。温度データには、時刻データが関連付けられているので、どのセンサでいつ異常な温度が検出されたかを検出することができる。 The abnormal contributions in the normal section are not significantly different between the sensors 1 to 6, but the abnormal contributions in the abnormal section are larger in the sensors 3 and 4 than in the sensors 1, 2, 5, and 6. From this, the sensor (here, the sensors 3 and 4) that has acquired the abnormal temperature data can be specified by using the abnormality contribution rate obtained for the abnormal section as described above. Since the time data is associated with the temperature data, it is possible to detect which sensor has detected an abnormal temperature and when.

図１０は、異常検出装置１００の制御装置１１０が実行するフローチャートを示す図である。図１０に示すフローチャートは、実施の形態の異常検出プログラムを実行することによって実現される。また、実施の形態の異常検出プログラムを実行することにより、実施の形態の異常検出方法が実現される。なお、ステップＳ１２の説明では、図１１を用いる。図１１は、故障予兆モデル及びテストデータの関係と異常スコアを示す図である。 FIG. 10 is a diagram showing a flowchart executed by the control device 110 of the abnormality detection device 100. The flowchart shown in FIG. 10 is realized by executing the abnormality detection program according to the embodiment. Further, by executing the abnormality detection program of the embodiment, the abnormality detection method of the embodiment is realized. Note that FIG. 11 is used in the description of step S12. FIG. 11 is a diagram showing the relationship between the failure sign model and the test data and the abnormality score.

フローがスタートすると、主制御部１１１は、教師データＸを取得する（ステップＳ１）。教師データＸは、Ｍ次元×Ｎ次元の行列であり、多次元ベクトルの集合である。Ｍ、Ｎは２以上の任意の整数である。 When the flow starts, the main control unit 111 acquires the teacher data X (step S1). The teacher data X is an M-dimensional×N-dimensional matrix, which is a set of multidimensional vectors. M and N are arbitrary integers of 2 or more.

教師データＸは、複数（Ｋ個）の正常データ群Ｘ₁ ^ok〜Ｘ_K ^okと複数の異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngを含む。正常データ群Ｘ₁ ^ok〜Ｘ_K ^okと異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngとは、それぞれ、図７に示すセンサ１〜６の各々が取得する温度データのように、時系列的に並べられた複数のデータ群を含む。このように、正常データ群Ｘ₁ ^ok〜Ｘ_K ^okと異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngとは、それぞれ、多次元ベクトルの集合になっている。 The teacher data X includes a plurality (K) of normal data groups X ₁ ^{ok to} X _K ^ok and a plurality of abnormal data groups X _K+1 ^{ng to} X _N ^ng . The normal data group X ₁ ^{ok to} X _K ^ok and the abnormal data group X _K+1 ^{ng to} X _N ^ng are time-sequential like the temperature data acquired by each of the sensors 1 to 6 shown in FIG. It includes a plurality of data groups arranged in. As described above, the normal data group X ₁ ^{ok to} X _K ^ok and the abnormal data group X _K+1 ^{ng to} X _N ^ng are each a set of multidimensional vectors.

予兆モデル生成部１１２は、教師データＸから正常データ群Ｘ_i ^okを抽出し、式（１）に基づき正常データ群Ｘ_i ^okの正規分布を表す故障予兆モデルを生成する（ステップＳ２）。ステップＳ２で故障予兆モデルの生成に用いられる正常データ群Ｘ_i ^okは、ｉが１からＫのうちの１つである。ステップＳ２の処理では、図５に示すような正規分布のデータが生成される。 The predictive model generation unit 112 extracts the normal data group X _i ^ok from the teacher data X and generates a failure predictive model representing the normal distribution of the normal data group X _i ^ok based on the equation (1) (step S2). In the normal data group X _i ^ok used for generating the failure sign model in step S2, i is one of 1 to K. In the process of step S2, data of normal distribution as shown in FIG. 5 is generated.

異常スコア算出部１１３は、異常スコアａを算出するための式（２）を設定する（ステップＳ３）。より具体的には、異常スコア算出部１１３は、式（２）を表すデータをメモリ１１７から読み出す。 The abnormality score calculation unit 113 sets the equation (2) for calculating the abnormality score a (step S3). More specifically, the abnormality score calculation unit 113 reads out the data representing the equation (2) from the memory 117.

異常スコア算出部１１３は、教師データＸのうち、ステップＳ２で故障予兆モデルを生成した正常データ群Ｘ_i ^ok以外のＫ−１個の正常データ群Ｘ_i ^okと、教師データＸに含まれるすべての複数の異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngとについて、異常スコアを算出する（ステップＳ４）。異常スコアは、データ群毎に算出される。 Anomaly score calculation unit 113, the teacher of the data X, and a sign of failure model normal data group generated the X _i ^ok other K-1 pieces of normal data group X _i ^ok in step S2, all contained in the training data X An abnormality score is calculated for the plurality of abnormal data groups X _K+1 ^{ng to} X _N ^ng (step S4). The abnormality score is calculated for each data group.

閾値導出部１１４は、ステップＳ４で算出された異常スコアに基づき、異常検知の閾値を導出する（ステップＳ５）。閾値は、異常スコアの所定レベルの値を有し、Ｋ−１個の正常データ群Ｘ_i ^okから算出された異常スコアと、異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngから算出された異常スコアとを分離する最適な値に設定される。閾値の値を変えながら、Ｋ−１個の正常データ群Ｘ_i ^okから算出されたＫ−１個の異常スコアと、異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngから算出された複数の異常スコアとを分離する最適な閾値が導出される。閾値は１つの最適な値に設定される。 The threshold derivation unit 114 derives an abnormality detection threshold based on the abnormality score calculated in step S4 (step S5). The threshold has a value of a predetermined level of the abnormality score, and the abnormality score calculated from the K−1 normal data group X _i ^ok and the abnormality calculated from the abnormal data group X _K+1 ^{ng to} X _N ^ng It is set to the optimum value that separates from the score. While changing the threshold value, K-1 abnormal scores calculated from the K-1 normal data group X _i ^ok and a plurality of abnormalities calculated from the abnormal data group X _K+1 ^{ng to} X _N ^ng. An optimal threshold is derived that separates the scores from. The threshold is set to one optimal value.

異常スコア算出部１１３は、異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngの各々について、異常スコアを算出する（ステップＳ６）。なお、ステップＳ６で異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngの各々について算出する異常スコアは、ステップＳ４において異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngの各々について算出する異常スコアと同じ値を有する。このため、ステップＳ６では異常スコアを算出せずに、ステップＳ４で異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngの各々について算出した異常スコアを取得してもよい。 The abnormality score calculation unit 113 calculates an abnormality score for each of the abnormality data groups X _K+1 ^{ng to} X _N ^ng (step S6). The abnormality score calculated for each of the abnormal data groups X _K+1 ^{ng to} X _N ^ng in step S6 is the same as the abnormality score calculated for each of the abnormal data groups X _K+1 ^{ng to} X _N ^ng in step S4. Has a value. Therefore, the abnormality score calculated in step S4 for each of the abnormality data groups X _K+1 ^{ng to} X _N ^ng may be acquired without calculating the abnormality score in step S6.

回帰モデル生成部１１５は、異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngの各々を説明変数とし、ステップＳ５で異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngについて算出された異常スコアの各々を目的変数とするランダムフォレスト回帰モデルを生成する（ステップＳ７）。説明変数である異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngに対応する目的変数は、異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngの各々について算出された異常スコアである。 The regression model generation unit 115 sets each of the abnormal data groups X _K+1 ^{ng to} X _N ^ng as an explanatory variable, and sets each of the abnormal scores calculated for the abnormal data groups X _K+1 ^{ng to} X _N ^ng in step S5. A random forest regression model as an objective variable is generated (step S7). The objective variable corresponding to the abnormal data group X _K+1 ^{ng to} X _N ^ng which is an explanatory variable is the abnormal score calculated for each of the abnormal data group X _K+1 ^{ng to} X _N ^ng .

寄与度算出部１１６は、ステップＳ６で生成された各ランダムフォレスト回帰モデルから式（１０）に基づいて異常寄与度を算出する（ステップＳ８）。以上の処理によって、異常データ群Ｘ_K+1 ^ng〜Ｘ_N ^ngについてランダムフォレスト回帰モデルが生成され、異常寄与度を算出する処理が終了する（エンド）。 The contribution calculating unit 116 calculates the abnormal contribution based on the equation (10) from each random forest regression model generated in step S6 (step S8). Through the above processing, a random forest regression model is generated for the abnormal data group X _K+1 ^{ng to} X _N ^ng , and the processing for calculating the abnormal contribution rate ends (END).

次に、センサ５０から実際に取得された温度データを含むテストデータＹについて異常寄与度を求める処理について説明する。テストデータＹは、センサ５０で検出された実測値の温度データを含む。 Next, a description will be given of the process of obtaining the abnormality contribution rate for the test data Y including the temperature data actually acquired from the sensor 50. The test data Y includes temperature data of actual measurement values detected by the sensor 50.

フローがスタートすると、主制御部１１１は、テストデータＹを取得する（ステップＳ１１）。テストデータＹは、教師データＸと同様に、Ｍ次元×Ｎ次元の行列であり、多次元ベクトルの集合である。Ｍ、Ｎは２以上の任意の整数である。なお、テストデータＹと教師データＸでは、Ｍ、Ｎの値が異なっていてもよい。 When the flow starts, the main control unit 111 acquires the test data Y (step S11). Like the teacher data X, the test data Y is an M-dimensional×N-dimensional matrix and is a set of multidimensional vectors. M and N are arbitrary integers of 2 or more. The test data Y and the teacher data X may have different values of M and N.

異常スコア算出部１１３は、テストデータＹに含まれる複数のデータ群のうちの１つに含まれる複数の温度データをステップＳ２で求めた故障予兆モデルに当て嵌め、各温度データの異常スコアを算出し、ステップＳ５で導出した閾値を用いて正常な温度データであるか、異常な温度データであるかを判定する（ステップＳ１２）。 The abnormality score calculation unit 113 applies the plurality of temperature data included in one of the plurality of data groups included in the test data Y to the failure sign model obtained in step S2 to calculate the abnormality score of each temperature data. Then, it is determined whether the temperature data is normal or abnormal by using the threshold value derived in step S5 (step S12).

ステップＳ１２では、図１１に示すように、温度データがステップＳ２で求めた故障予兆モデルに当て嵌められる。図１１には、図５で×印で示した正常データ群の各データは省略し、等高線のみを示す。また、図１１には、テストデータＹに含まれる複数のデータ群のうちの１つに含まれる複数の温度データのうちの１つを×印で示す。図１１のように３本の等高線のうちの最も外側の等高線よりも外側に位置するデータは、異常な温度データである。 In step S12, as shown in FIG. 11, the temperature data is fitted to the failure sign model obtained in step S2. In FIG. 11, each data of the normal data group shown by X in FIG. 5 is omitted, and only the contour lines are shown. In addition, in FIG. 11, one of the plurality of temperature data included in one of the plurality of data groups included in the test data Y is indicated by an X mark. As shown in FIG. 11, the data located outside the outermost contour line of the three contour lines is abnormal temperature data.

ステップＳ１２の処理は、テストデータＹに含まれる複数のデータ群を１つずつ抽出して、テストデータＹに含まれるすべてのデータ群について行われる。 The process of step S12 is performed for all the data groups included in the test data Y by extracting a plurality of data groups included in the test data Y one by one.

異常スコア算出部１１３は、ステップＳ１２の処理結果に基づいて、テストデータＹに含まれる複数のデータ群に含まれる異常な温度データを含む区間を特定し、特定した区間に含まれる温度データの異常スコアを算出する（ステップＳ１３）。テストデータＹに含まれる複数のデータ群に含まれる異常な温度データを含む区間を特定することは、例えば、図７（Ａ）のセンサ３、４の予兆区間が特定されることを意味する。温度データには、センサ（１〜６のいずれか）によって検出された時刻を表す時刻データが関連付けられているため、ステップＳ１３で特定される区間は、どのセンサ（１〜６のいずれか）によって、いつ検出されたかが分かっている。 The abnormality score calculation unit 113 specifies a section including abnormal temperature data included in a plurality of data groups included in the test data Y based on the processing result of step S12, and detects an abnormality in the temperature data included in the specified section. A score is calculated (step S13). Specifying the section including the abnormal temperature data included in the plurality of data groups included in the test data Y means, for example, specifying the predictive section of the sensors 3 and 4 in FIG. 7A. Since the temperature data is associated with the time data representing the time detected by the sensor (any of 1 to 6), the section specified in step S13 depends on which sensor (any of 1 to 6). , I know when it was detected.

回帰モデル生成部１１５は、ステップＳ１３で特定した区間に含まれる異常な温度データ（異常な温度データ群）を説明変数とし、ステップＳ１３で特定された区間について算出された異常スコアを目的変数とするランダムフォレスト回帰モデルを生成する（ステップＳ１４）。回帰モデル生成部１１５は、ステップＳ１３で特定された区間が複数ある場合には、複数の区間の各々についてランダムフォレスト回帰モデルを生成する。 The regression model generation unit 115 sets the abnormal temperature data (abnormal temperature data group) included in the section identified in step S13 as an explanatory variable, and sets the abnormality score calculated for the section identified in step S13 as an objective variable. A random forest regression model is generated (step S14). When there are a plurality of sections identified in step S13, the regression model generation unit 115 generates a random forest regression model for each of the plurality of sections.

寄与度算出部１１６は、ステップＳ１４で生成された各ランダムフォレスト回帰モデルから式（１０）に基づいて異常寄与度を算出する（ステップＳ１５）。以上の処理によって、ステップＳ１３で特定された区間（異常な温度データを含む区間）についてランダムフォレスト回帰モデルが生成され、異常寄与度が算出される。これは、例えば、図９にセンサ１〜６について示す白抜きの棒グラフで示されるように、特定された区間の異常寄与度の分布が分かることになる。 The contribution calculating unit 116 calculates the abnormal contribution based on the equation (10) from each random forest regression model generated in step S14 (step S15). Through the above processing, the random forest regression model is generated for the section specified in step S13 (section including abnormal temperature data), and the abnormality contribution rate is calculated. This means that, for example, as shown by the white bar graphs shown for the sensors 1 to 6 in FIG. 9, the distribution of the abnormal contribution degree in the specified section can be known.

以上のように、実施の形態によれば、センサ５０で検出された実測値の温度データを含むテストデータＹを教師データのうちの正常データの正規分布を表す故障予兆モデルに当て嵌め、教師データから求めた正常データの異常スコアと異常データの異常スコアを分離する閾値を用いて、異常が発生している区間を検出する。 As described above, according to the embodiment, the test data Y including the temperature data of the actual measurement value detected by the sensor 50 is applied to the failure sign model representing the normal distribution of the normal data of the teacher data to obtain the teacher data. A section in which an abnormality has occurred is detected by using a threshold that separates the abnormality score of the normal data and the abnormality score of the abnormal data obtained from the above.

そして、異常が発生している区間に含まれるデータ（データ群）を説明変数とし、異常が発生している区間に含まれるデータ（データ群）について算出された異常スコアを目的変数とするランダムフォレスト回帰モデルを生成し、各区間の異常寄与度を算出する。 Then, a random forest in which the data (data group) included in the section in which the abnormality has occurred is the explanatory variable, and the anomaly score calculated for the data (data group) included in the section in which the abnormality has occurred is the target variable A regression model is generated and the abnormal contribution of each section is calculated.

このため、複数のセンサ５０のうちのどのセンサ５０によって取得された温度データのどの区間で異常に寄与する事象が発生しているかを把握することができる。この異常とは、例えば、故障の予兆である。 Therefore, it is possible to understand in which section of the temperature data acquired by which of the plurality of sensors 50 the event contributing to the abnormality has occurred. This abnormality is, for example, a sign of failure.

したがって、異常に寄与する事象が発生した箇所（複数のセンサ５０のうちのいずれか）と、異常に寄与する事象が発生した時間帯とを把握できる異常検出装置１００を提供することができる。 Therefore, it is possible to provide the abnormality detection device 100 capable of grasping the place (any one of the plurality of sensors 50) where the event contributing to the abnormality has occurred and the time zone in which the event contributing to the abnormality has occurred.

なお、以上では、非線形の回帰モデルであるランダムフォレスト回帰モデルを用いる形態について説明したが、線形の回帰モデルを用いてもよい。線形の回帰モデルとしては、線形回帰モデル、リッジ回帰モデル、又はLasso回帰モデルを用いればよい。また、これらの場合に、異常寄与度は偏重回帰係数として求めればよい。 In addition, although the form which uses the random forest regression model which is a nonlinear regression model was demonstrated above, you may use a linear regression model. A linear regression model, a ridge regression model, or a Lasso regression model may be used as the linear regression model. Further, in these cases, the abnormality contribution may be obtained as a weighted regression coefficient.

以上、本発明の例示的な実施の形態の異常検出装置、異常検出プログラム、及び、異常検出方法について説明したが、本発明は、具体的に開示された実施の形態に限定されるものではなく、特許請求の範囲から逸脱することなく、種々の変形や変更が可能である。 Although the abnormality detection device, the abnormality detection program, and the abnormality detection method according to the exemplary embodiment of the present invention have been described above, the present invention is not limited to the specifically disclosed embodiments. Various modifications and changes can be made without departing from the scope of the claims.

以上の実施の形態に関し、さらに以下の付記を開示する。
（付記１）
監視対象物の複数の箇所における正常及び異常な事象をそれぞれ時系列的に表す正常データ群及び異常データ群を含む教師データのうちの正常データ群の正規分布を表す故障予兆モデルを生成する予兆モデル生成部と、
前記教師データから第１異常スコアを算出する第１異常スコア算出部と、
前記第１異常スコアに基づき、異常検知の閾値を導出する閾値導出部と、
前記教師データの異常データ群を説明変数とし、前記異常データ群についての前記第１異常スコアを目的変数とする第１回帰モデルを生成する第１回帰モデル生成部と、
前記第１回帰モデルから前記複数の箇所における事象の第１異常寄与度を算出する第１寄与度算出部と、
前記監視対象物の複数の箇所にそれぞれ設けられる複数のセンサから出力される実測データの前記故障予兆モデルに対する第２異常スコアを算出し、前記異常検知の閾値を用いて、前記実測データのうち前記第２異常スコアが前記閾値以上の異常区間の実測データについての第２異常スコアを求める第２異常スコア算出部と、
前記異常区間の実測データを説明変数とし、前記異常区間の実測データについての第２異常スコアを目的変数とする第２回帰モデルを生成する第２回帰モデル生成部と、
前記第２回帰モデルから前記複数のセンサの検出データの第２異常寄与度を算出する第２寄与度算出部と
を含む、異常検出装置。
（付記２）
前記第１異常寄与度は、前記複数の箇所の各々における前記監視対象物の所定の事象が前記異常データ群の発生に与える寄与度であり、
前記第２異常寄与度は、前記複数のセンサの各々の検出データが前記異常データ群の発生に与える寄与度である、付記１記載の異常検出装置。
（付記３）
前記第１異常寄与度及び前記第２異常寄与度は、それぞれ、前記第１回帰モデル及び前記第２回帰モデルから得られるデータの不均等度の減少度合である、付記１又は２記載の異常検出装置。
（付記４）
前記第１回帰モデル及び前記第２回帰モデルは、線形の回帰モデル又は非線形の回帰モデルである、付記１乃至３のいずれか一項記載の異常検出装置。
（付記５）
前記線形の回帰モデルは、線形回帰モデル、リッジ回帰モデル、及びLasso回帰モデルのいずれか１つであり、
前記第１異常寄与度及び前記第２異常寄与度は、偏重回帰係数である、付記４記載の異常検出装置。
（付記６）
前記非線形の回帰モデルは、ランダムフォレスト回帰モデルであり、
前記第１異常寄与度及び前記第２異常寄与度は、ジニ係数の減少率である、付記４記載の異常検出装置。
（付記７）
監視対象物の複数の箇所における正常及び異常な事象をそれぞれ時系列的に表す正常データ群及び異常データ群を含む教師データのうちの正常データ群の正規分布を表す故障予兆モデルを生成することと、
前記教師データから第１異常スコアを算出することと、
前記第１異常スコアに基づき、異常検知の閾値を導出することと、
前記教師データの異常データ群を説明変数とし、前記異常データ群についての前記第１異常スコアを目的変数とする第１回帰モデルを生成することと、
前記第１回帰モデルから前記複数の箇所における事象の第１異常寄与度を算出することと、
前記監視対象物の複数の箇所にそれぞれ設けられる複数のセンサから出力される実測データの前記故障予兆モデルに対する第２異常スコアを算出し、前記異常検知の閾値を用いて、前記実測データのうち前記第２異常スコアが前記閾値以上の異常区間の実測データについての第２異常スコアを求めることと、
前記異常区間の実測データを説明変数とし、前記異常区間の実測データについての第２異常スコアを目的変数とする第２回帰モデルを生成することと、
前記第２回帰モデルから前記複数のセンサの検出データの第２異常寄与度を算出することと
を含む処理をコンピュータに実行させる、異常検出プログラム。
（付記８）
監視対象物の複数の箇所における正常及び異常な事象をそれぞれ時系列的に表す正常データ群及び異常データ群を含む教師データのうちの正常データ群の正規分布を表す故障予兆モデルを生成することと、
前記教師データから第１異常スコアを算出することと、
前記第１異常スコアに基づき、異常検知の閾値を導出することと、
前記教師データの異常データ群を説明変数とし、前記異常データ群についての前記第１異常スコアを目的変数とする第１回帰モデルを生成することと、
前記第１回帰モデルから前記複数の箇所における事象の第１異常寄与度を算出することと、
前記監視対象物の複数の箇所にそれぞれ設けられる複数のセンサから出力される実測データの前記故障予兆モデルに対する第２異常スコアを算出し、前記異常検知の閾値を用いて、前記実測データのうち前記第２異常スコアが前記閾値以上の異常区間の実測データについての第２異常スコアを求めることと、
前記異常区間の実測データを説明変数とし、前記異常区間の実測データについての第２異常スコアを目的変数とする第２回帰モデルを生成することと、
前記第２回帰モデルから前記複数のセンサの検出データの第２異常寄与度を算出することと
を含む、異常検出方法。 The following supplementary notes will be disclosed regarding the above-described embodiment.
(Appendix 1)
A predictive model that generates a failure predictive model that represents the normal distribution of normal data groups in the teacher data that includes normal data groups and abnormal data groups that represent normal and abnormal events at multiple locations of the monitored object, respectively. A generator,
A first abnormality score calculator that calculates a first abnormality score from the teacher data;
A threshold derivation unit that derives a threshold for abnormality detection based on the first abnormality score;
A first regression model generation unit that generates a first regression model in which the abnormal data group of the teacher data is an explanatory variable and the first abnormality score for the abnormal data group is an objective variable;
A first contribution calculating unit that calculates a first abnormal contribution of an event at the plurality of locations from the first regression model;
A second abnormality score of the actual measurement data output from a plurality of sensors respectively provided at a plurality of locations of the monitoring target with respect to the failure sign model is calculated, and the second abnormality score is used by using the abnormality detection threshold value. A second abnormality score calculation unit that obtains a second abnormality score for actual measurement data in an abnormal section in which the second abnormality score is equal to or greater than the threshold value;
A second regression model generation unit that generates a second regression model in which the actual measurement data of the abnormal section is an explanatory variable and the second abnormality score of the actual measurement data of the abnormal section is an objective variable;
A second contribution degree calculation unit that calculates a second abnormality contribution degree of the detection data of the plurality of sensors from the second regression model.
(Appendix 2)
The first abnormality contribution rate is a contribution rate that a predetermined event of the monitoring target in each of the plurality of places gives to the occurrence of the abnormality data group,
2. The abnormality detection device according to appendix 1, wherein the second abnormality contribution rate is a contribution rate that each detection data of the plurality of sensors gives to the generation of the abnormality data group.
(Appendix 3)
Anomaly detection according to appendix 1 or 2, wherein the first anomalous contribution rate and the second anomalous contribution rate are reduction degrees of unevenness of data obtained from the first regression model and the second regression model, respectively. apparatus.
(Appendix 4)
4. The abnormality detection device according to any one of appendices 1 to 3, wherein the first regression model and the second regression model are linear regression models or non-linear regression models.
(Appendix 5)
The linear regression model is any one of a linear regression model, a ridge regression model, and a Lasso regression model,
The abnormality detection device according to attachment 4, wherein the first abnormality contribution rate and the second abnormality contribution rate are weighted regression coefficients.
(Appendix 6)
The non-linear regression model is a random forest regression model,
5. The abnormality detection device according to attachment 4, wherein the first abnormality contribution rate and the second abnormality contribution rate are reduction rates of Gini coefficients.
(Appendix 7)
Generating a failure sign model representing a normal distribution of a normal data group of teacher data including a normal data group and an abnormal data group that respectively represent normal and abnormal events at a plurality of locations of a monitoring target in time series; ,
Calculating a first abnormality score from the teacher data,
Deriving a threshold for abnormality detection based on the first abnormality score;
Generating a first regression model having an abnormal data group of the teacher data as an explanatory variable and the first abnormal score for the abnormal data group as an objective variable;
Calculating a first anomalous contribution rate of events at the plurality of locations from the first regression model;
A second abnormality score of the actual measurement data output from a plurality of sensors respectively provided at a plurality of locations of the monitoring target with respect to the failure sign model is calculated, and the second abnormality score is used by using the abnormality detection threshold value. Obtaining a second abnormality score for actual measurement data in an abnormal section in which the second abnormality score is equal to or greater than the threshold value;
Generating a second regression model having the measured data of the abnormal section as an explanatory variable and the second abnormal score of the measured data of the abnormal section as an objective variable;
An anomaly detection program that causes a computer to execute a process including: calculating a second anomalous contribution degree of detection data of the plurality of sensors from the second regression model.
(Appendix 8)
Generating a failure sign model representing a normal distribution of a normal data group of teacher data including a normal data group and an abnormal data group that respectively represent normal and abnormal events at a plurality of locations of a monitoring target in time series; ,
Calculating a first abnormality score from the teacher data,
Deriving a threshold for abnormality detection based on the first abnormality score;
Generating a first regression model having an abnormal data group of the teacher data as an explanatory variable and the first abnormal score for the abnormal data group as an objective variable;
Calculating a first anomalous contribution rate of events at the plurality of locations from the first regression model;
A second abnormality score of the actual measurement data output from a plurality of sensors respectively provided at a plurality of locations of the monitoring target with respect to the failure sign model is calculated, and the abnormality detection threshold is used to calculate the second of the actual measurement data. Obtaining a second abnormality score for actual measurement data in an abnormal section in which the second abnormality score is equal to or greater than the threshold value;
Generating a second regression model using the actual measurement data of the abnormal section as an explanatory variable and the second abnormality score of the actual measurement data of the abnormal section as an objective variable;
Calculating a second degree of abnormality contribution of detection data of the plurality of sensors from the second regression model.

１０光ファイバケーブル
５０センサ
１００異常検出装置
１１０制御装置
１１１主制御部
１１２予兆モデル生成部
１１３異常スコア算出部
１１４閾値導出部
１１５回帰モデル生成部
１１６寄与度算出部
１１７メモリ
１２０操作部
１３０表示部 10 Optical Fiber Cable 50 Sensor 100 Abnormality Detection Device 110 Control Device 111 Main Control Unit 112 Predictive Model Generation Unit 113 Abnormality Score Calculation Unit 114 Threshold Derivation Unit 115 Regression Model Generation Unit 116 Contribution Calculation Unit 117 Memory 120 Operation Unit 130 Display Unit

Claims

A predictive model that generates a failure predictive model that represents the normal distribution of normal data groups in the teacher data that includes normal data groups and abnormal data groups that represent normal and abnormal events at multiple locations of the monitored object, respectively. A generator,
A first abnormality score calculator that calculates a first abnormality score from the teacher data;
A threshold derivation unit that derives a threshold for abnormality detection based on the first abnormality score;
A first regression model generation unit that generates a first regression model in which the abnormal data group of the teacher data is an explanatory variable and the first abnormality score for the abnormal data group is an objective variable;
A first contribution calculating unit that calculates a first abnormal contribution of an event at the plurality of locations from the first regression model;
A second abnormality score of the actual measurement data output from a plurality of sensors respectively provided at a plurality of locations of the monitoring target with respect to the failure sign model is calculated, and the second abnormality score is used by using the abnormality detection threshold value. A second abnormality score calculation unit that obtains a second abnormality score for actual measurement data in an abnormal section in which the second abnormality score is equal to or greater than the threshold value;
A second regression model generation unit that generates a second regression model in which the actual measurement data of the abnormal section is an explanatory variable and the second abnormality score of the actual measurement data of the abnormal section is an objective variable;
A second contribution degree calculation unit that calculates a second abnormality contribution degree of the detection data of the plurality of sensors from the second regression model.

The first abnormality contribution rate is a contribution rate that a predetermined event of the monitoring target in each of the plurality of places gives to the occurrence of the abnormality data group,
The abnormality detection device according to claim 1, wherein the second abnormality contribution rate is a contribution rate that the detection data of each of the plurality of sensors contributes to the generation of the abnormality data group.

The abnormality according to claim 1 or 2, wherein the first abnormality contribution rate and the second abnormality contribution rate are reduction degrees of inequality of data obtained from the first regression model and the second regression model, respectively. Detection device.

The abnormality detection device according to any one of claims 1 to 3, wherein the first regression model and the second regression model are linear regression models or non-linear regression models.

The linear regression model is any one of a linear regression model, a ridge regression model, and a Lasso regression model,
The abnormality detection device according to claim 4, wherein the first abnormality contribution rate and the second abnormality contribution rate are weighted regression coefficients.

The non-linear regression model is a random forest regression model,
The abnormality detection device according to claim 4, wherein the first abnormality contribution rate and the second abnormality contribution rate are reduction rates of Gini coefficients.

Generating a failure sign model representing a normal distribution of a normal data group of teacher data including a normal data group and an abnormal data group that respectively represent normal and abnormal events at a plurality of locations of a monitoring target in time series; ,
Calculating a first abnormality score from the teacher data,
Deriving a threshold for abnormality detection based on the first abnormality score;
Generating a first regression model having an abnormal data group of the teacher data as an explanatory variable and the first abnormal score for the abnormal data group as an objective variable;
Calculating a first anomalous contribution rate of events at the plurality of locations from the first regression model;
A second abnormality score of the actual measurement data output from a plurality of sensors respectively provided at a plurality of locations of the monitoring target with respect to the failure sign model is calculated, and the second abnormality score is used by using the abnormality detection threshold value. Obtaining a second abnormality score for actual measurement data in an abnormal section in which the second abnormality score is equal to or greater than the threshold value;
Generating a second regression model having the measured data of the abnormal section as an explanatory variable and the second abnormal score of the measured data of the abnormal section as an objective variable;
An anomaly detection program that causes a computer to execute a process including: calculating a second anomalous contribution degree of detection data of the plurality of sensors from the second regression model.

Generating a failure sign model representing a normal distribution of a normal data group of teacher data including a normal data group and an abnormal data group that respectively represent normal and abnormal events at a plurality of locations of a monitoring target in time series; ,
Calculating a first abnormality score from the teacher data,
Deriving a threshold for abnormality detection based on the first abnormality score;
Generating a first regression model having an abnormal data group of the teacher data as an explanatory variable and the first abnormal score for the abnormal data group as an objective variable;
Calculating a first anomalous contribution rate of events at the plurality of locations from the first regression model;
A second abnormality score of the actual measurement data output from a plurality of sensors respectively provided at a plurality of locations of the monitoring target with respect to the failure sign model is calculated, and the second abnormality score is used by using the abnormality detection threshold value. Obtaining a second abnormality score for actual measurement data in an abnormal section in which the second abnormality score is equal to or greater than the threshold value;
Generating a second regression model having the measured data of the abnormal section as an explanatory variable and the second abnormal score of the measured data of the abnormal section as an objective variable;
Calculating a second degree of anomaly contribution of the detection data of the plurality of sensors from the second regression model.