JP7423898B2

JP7423898B2 - Information processing device, determination device, model learning method

Info

Publication number: JP7423898B2
Application number: JP2019049848A
Authority: JP
Inventors: サンタナアダモ; 賢哉村上
Original assignee: Fuji Electric Co Ltd
Current assignee: Fuji Electric Co Ltd
Priority date: 2019-03-18
Filing date: 2019-03-18
Publication date: 2024-01-30
Anticipated expiration: 2039-03-18
Also published as: JP2020154406A; JP7435866B2; JP2023065698A

Description

特許法第３０条第２項適用発行日：平成３０年９月１１日集会名：平成３０年計測自動制御学会年次大会（ＳＩＣＥＡｎｎｕａｌＣｏｎｆｅｒｅｎｃｅ２０１８）公開者：アダモサンタナ、村上賢哉、飯坂達也、松井哲郎、福山良和公開された発明の内容：アダモサンタナ、他４名が、ショーケースの異常検知技術について公開した。Article 30, Paragraph 2 of the Patent Act applies Publication date: September 11, 2018 Meeting name: SICE Annual Conference 2018 (SICE Annual Conference 2018) Publisher: Adamo Santana, Kenya Murakami, Tatsuya Iizaka, Tetsuro Matsui, Yoshikazu Fukuyama Details of the disclosed invention: Adamo Santana and four others disclosed anomaly detection technology for showcases.

本発明は、情報処理装置、判定装置、およびモデルの学習方法に関する。 The present invention relates to an information processing device, a determination device, and a model learning method.

複数のデータから異常値を検出する際には、正常値と、異常値とを教師データとして構築されたモデルが用いられることがある（例えば、特許文献１）。 When detecting abnormal values from a plurality of data, a model constructed using normal values and abnormal values as training data is sometimes used (for example, Patent Document 1).

特開２０１９－１６２０９号公報JP 2019-16209 Publication

ところで、異常値を教師データとして用いるためには、異常値のサンプルを多く収集する必要がある。しかしながら、一般に異常値は、例えば、機器やシステムが正常に動作していない場合に出力される値等であるため、異常値のサンプルを多く収集することは難しい。 By the way, in order to use abnormal values as training data, it is necessary to collect many samples of abnormal values. However, since abnormal values are generally values that are output when a device or system is not operating normally, it is difficult to collect many samples of abnormal values.

本発明は、上記のような従来の問題に鑑みてなされたものであって、異常値のサンプルを学習に用いることなく、異常値を検出するための技術を提供することを目的とする。 The present invention has been made in view of the conventional problems as described above, and an object of the present invention is to provide a technique for detecting abnormal values without using samples of abnormal values for learning.

前述した課題を解決する主たる本発明は、所定の属性に属する複数のデータの夫々に対し、第１の値または第２の値を割り当てる割当部と、前記複数のデータのうち前記第１の値が割り当てられた第１データと、前記複数のデータのうち前記第２の値が割り当てられた第２データと、を教師データとして、入力データが前記所定の属性に属するか否かを判定するためのモデルの学習を行う学習部と、を含むこと情報処理装置である。 The main present invention for solving the above-mentioned problems includes: an allocation unit that allocates a first value or a second value to each of a plurality of data belonging to a predetermined attribute; to determine whether the input data belongs to the predetermined attribute using first data to which the value is assigned and second data to which the second value is assigned among the plurality of data as teacher data. The information processing apparatus includes a learning unit that performs learning of the model.

本発明によれば、異常値のサンプルを学習に用いることなく、異常値を検出するための技術を提供することができる。 According to the present invention, it is possible to provide a technique for detecting abnormal values without using samples of abnormal values for learning.

異常検知システム１０の構成を示す図である。1 is a diagram showing the configuration of an abnormality detection system 10. FIG. 学習装置２０のハードウェア構成の一例を示す図である。2 is a diagram showing an example of a hardware configuration of a learning device 20. FIG. データセット４１の一例を示す図である。4 is a diagram showing an example of a data set 41. FIG. 学習装置２０に実現される機能ブロックの一例を示す図である。2 is a diagram showing an example of functional blocks implemented in the learning device 20. FIG. 学習装置２０で実行される処理の一例を示すフローチャートである。3 is a flowchart illustrating an example of processing executed by the learning device 20. FIG. データセット４３の一例を示す図である。4 is a diagram showing an example of a data set 43. FIG. 関数ｙについて説明するための図である。FIG. 3 is a diagram for explaining a function y. 関数ｙについて説明するための図である。FIG. 3 is a diagram for explaining a function y. 判定装置２１のハードウェア構成の一例を示す図である。2 is a diagram showing an example of a hardware configuration of a determination device 21. FIG. 判定装置２１に実現される機能ブロックの一例を示す図である。2 is a diagram showing an example of functional blocks implemented in a determination device 21. FIG. 判定装置２１で実行される処理の一例を示すフローチャートである。3 is a flowchart illustrating an example of processing executed by the determination device 21. FIG. 判定処理Ｓ２０２で実行される内容を説明するための図である。FIG. 6 is a diagram for explaining the content executed in determination processing S202. ショーケース３００の異常検知の実証例を示す図である。3 is a diagram showing a demonstration example of abnormality detection of a showcase 300. FIG.

本明細書及び添付図面の記載により、少なくとも以下の事項が明らかとなる。 From the description of this specification and the attached drawings, at least the following matters will become clear.

＝＝＝＝＝本実施形態＝＝＝＝＝
＜＜＜異常検知システム１０の構成＞＞＞
図１は、本発明の一実施形態である異常検知システム１０の構成を示す図である。異常検知システム１０は、例えば、商業施設に設置されたショーケース３００の異常を検知するためのシステムであり、学習装置２０、判定装置２１を含む。 =====This embodiment =====
<<<Configuration of anomaly detection system 10>>>
FIG. 1 is a diagram showing the configuration of an abnormality detection system 10 that is an embodiment of the present invention. The abnormality detection system 10 is, for example, a system for detecting an abnormality in a showcase 300 installed in a commercial facility, and includes a learning device 20 and a determination device 21.

ショーケース３００は、例えば、食品等を冷却し、保管するためのケースである。ショーケース３００には、ショーケース３００の状態を観測するセンサ３１０が、例えば１０個取り付けられている。なお、図１では、便宜上、１０個のセンサ３１０は、１つのブロックとして描かれている。 The showcase 300 is, for example, a case for cooling and storing foods and the like. For example, ten sensors 310 are attached to the showcase 300 to observe the state of the showcase 300. Note that in FIG. 1, the ten sensors 310 are depicted as one block for convenience.

そして、異常検知システム１０は、１０個のセンサの夫々から出力されるデータｘ１～ｘ１０の値が、異常値となると、ショーケース３００の異常を検知する。なお、ここでは、「異常値」の例として、物理的な異常を要因とする異常の値や数値的な異常の値、またセンサ異常を起因とした異常値、また、異常判定ではないようなシステムに適用する場合には、通常とは異なる挙動を表す値のことが挙げられる。また、以下、ショーケース３００の動作が正常である際のデータを、「正常データ」または「正常なデータ」と称し、ショーケース３００の動作が異常である際のデータを、「異常データ」または「異常なデータ」と称する。 Then, the abnormality detection system 10 detects an abnormality in the showcase 300 when the values of data x1 to x10 output from each of the ten sensors become abnormal values. Here, examples of "abnormal values" include abnormal values caused by physical abnormalities, numerical abnormal values, abnormal values caused by sensor abnormalities, and abnormal values that are not determined to be abnormal. When applied to a system, it refers to values that represent unusual behavior. Further, hereinafter, data when the operation of the showcase 300 is normal is referred to as "normal data" or "normal data", and data when the operation of the showcase 300 is abnormal is referred to as "abnormal data" or "normal data". This is called "abnormal data."

学習装置２０（情報処理装置）は、正常なデータｘ１～ｘ１０に基づいて、ショーケース３００に異常が有るか否かを判定するためのモデル、つまり、データｘ１～ｘ１０の異常値を検出するためのモデルを機械学習によって構築する。 The learning device 20 (information processing device) is a model for determining whether or not there is an abnormality in the showcase 300 based on normal data x1 to x10, that is, for detecting abnormal values of data x1 to x10. Build a model using machine learning.

判定装置２１は、運転中のショーケース３００から出力されるデータｘ１～ｘ１０と、学習装置２０で構築されたモデルとに基づいて、ショーケース３００に異常が有るか否かを判定する。なお、学習装置２０と、判定装置２１とは、ネットワーク２５を介して接続されている。 The determination device 21 determines whether or not there is an abnormality in the showcase 300 based on data x1 to x10 output from the showcase 300 in operation and the model constructed by the learning device 20. Note that the learning device 20 and the determining device 21 are connected via a network 25.

＜＜＜学習装置２０について＞＞＞
＝＝学習装置２０の構成＝＝
図２は、学習装置２０のハードウェア構成の一例を示す図である。学習装置２０は、ＣＰＵ（Central Processing Unit）３０、メモリ３１、記憶装置３２、入力装置３３、表示装置３４、及び通信装置３５を含むコンピュータである。 <<<About the learning device 20>>>
==Configuration of learning device 20==
FIG. 2 is a diagram showing an example of the hardware configuration of the learning device 20. The learning device 20 is a computer including a CPU (Central Processing Unit) 30, a memory 31, a storage device 32, an input device 33, a display device 34, and a communication device 35.

ＣＰＵ３０は、メモリ３１や記憶装置３２に格納されたプログラムを実行することにより、学習装置２０における様々機能を実現する。 The CPU 30 implements various functions in the learning device 20 by executing programs stored in the memory 31 and the storage device 32.

メモリ３１は、例えばＲＡＭ（Random-Aaccess Mmemory）等であり、プログラムやデータ等の一時的な記憶領域として用いられる。 The memory 31 is, for example, a RAM (Random-Acess Memory) or the like, and is used as a temporary storage area for programs, data, and the like.

記憶装置３２は、ＣＰＵ３０によって実行あるいは処理される制御プログラム４０やデータセット４１等の各種のデータを格納する不揮発性の記憶装置である。 The storage device 32 is a nonvolatile storage device that stores various data such as a control program 40 and a data set 41 that are executed or processed by the CPU 30.

制御プログラム４０は、学習装置２０が有する各種機能を実現するためのプログラムを総称しており、例えば、ＯＳ（Operating System）等を含む。 The control program 40 is a general term for programs for realizing various functions of the learning device 20, and includes, for example, an OS (Operating System).

データセット４１は、図３に示すように、ショーケース３００からセンサで取得するデータｘ１～ｘ１０である。ここで、データｘ１は、例えば、ショーケース３００の所定の場所に取り付けられた温度センサからの出力であり、データｘ２は、ショーケース３００内のコンプレッサの圧力を計測する圧力センサからの出力である。また、データｘ１０は、例えば、コンプレッサの冷媒の流量を計測する流量計からの出力である。なお、データｘ３～ｘ９についても、ｘ１，ｘ２等と同様であるため、ここでは詳細な説明は省略する。また、データセット４１は、予め記憶装置３２に格納されていることとする。 The data set 41, as shown in FIG. 3, is data x1 to x10 acquired from the showcase 300 by a sensor. Here, data x1 is, for example, an output from a temperature sensor attached to a predetermined location of showcase 300, and data x2 is an output from a pressure sensor that measures the pressure of a compressor inside showcase 300. . Furthermore, the data x10 is, for example, the output from a flow meter that measures the flow rate of refrigerant in the compressor. Note that the data x3 to x9 are also similar to x1, x2, etc., so a detailed explanation will be omitted here. Further, it is assumed that the data set 41 is stored in the storage device 32 in advance.

学習モデル４２は、ショーケース３００に異常が有るか否かを、データｘ１～ｘ１０から判別するための判別式、あるいは数式（以下、まとめて「関数ｙ」と称する。）を含んで構成される。学習モデル４２の学習が行われると、“関数ｙ”の係数等が調整され、異常検出の精度が変化する。なお、本実施形態において、学習モデル４２の“関数ｙ”は、ｙ＝ｆ（ｘ１，ｘ２，～，ｘ１０）と表される。 The learning model 42 is configured to include a discriminant or mathematical formula (hereinafter collectively referred to as "function y") for determining whether or not there is an abnormality in the showcase 300 from data x1 to x10. . When the learning model 42 is trained, the coefficients of the "function y" and the like are adjusted, and the accuracy of abnormality detection changes. Note that in this embodiment, the "function y" of the learning model 42 is expressed as y=f(x1, x2, ~, x10).

入力装置３３は、ユーザによるコマンドやデータの入力を受け付ける装置であり、キーボード、タッチパネルディスプレイ上でのタッチ位置を検出するタッチセンサなどの入力インタフェースを含む。 The input device 33 is a device that receives commands and data input by the user, and includes an input interface such as a keyboard and a touch sensor that detects a touch position on a touch panel display.

表示装置３４は、例えばディスプレイなどの装置であり、通信装置３５は、ネットワーク２５を介して、判定装置２１や他のコンピュータと各種プログラムやデータの受け渡しを行う。 The display device 34 is, for example, a device such as a display, and the communication device 35 exchanges various programs and data with the determination device 21 and other computers via the network 25.

＝＝機能ブロック＝＝
図４は、学習装置２０に実現される機能ブロックの一例を示す図である。学習装置２０のＣＰＵ３０が、制御プログラム４０を実行することにより、学習装置２０には、取得部５０、割当部５１、及び学習部５２が実現される。 ==Functional block==
FIG. 4 is a diagram showing an example of functional blocks implemented in the learning device 20. When the CPU 30 of the learning device 20 executes the control program 40, the learning device 20 implements an acquisition section 50, an allocation section 51, and a learning section 52.

取得部５０は、記憶装置３２に格納されたデータセット４１を取得し、割当部５１は、データセット４１に含まれるデータに、“０（第１の値）”または“１（第２の値）”の２値をランダムに割り当てる。ここで、“０”または“１”は、教師あり学習の機械学習を行うために必要な「ラベル」に相当する。つまり、割当部５１は、データセット４１に含まれるデータに対し、０”または“１”の“ラベル”をランダムに付与して教師データを生成していることになる。 The acquisition unit 50 acquires the data set 41 stored in the storage device 32, and the allocation unit 51 assigns “0 (first value)” or “1 (second value)” to the data included in the data set 41. )" are randomly assigned. Here, "0" or "1" corresponds to a "label" necessary for performing supervised machine learning. In other words, the allocation unit 51 generates the teacher data by randomly assigning a "label" of "0" or "1" to the data included in the data set 41.

学習部５２は、割当部５１により“０”が割り当てられたデータＤ１と、“１”が割り当てられたデータＤ２とを教師データとし、学習モデル４２の学習を行う。具体的には、学習部５２は、データＤ１，Ｄ２に対し、例えば、サポートベクター回帰を実行することにより“関数ｆ”を求める。 The learning unit 52 uses the data D1 assigned with “0” by the assignment unit 51 and the data D2 assigned with “1” as teacher data, and performs learning of the learning model 42. Specifically, the learning unit 52 calculates the "function f" by performing support vector regression on the data D1 and D2, for example.

なお、ここでは、学習部５２は、サポートベクター回帰を実行することとしたが、他の非線形の回帰分析（例えば、非線形最小二乗法、多項式回帰）を用いても良く、非線形分類（例えば、サポートベクターマシン）を実行しても良い。また、例えば、学習モデル４２としてニューラルネットワークモデルを用いても良く、そのような場合、学習部５２は、ニューラルネットワークモデルのパラメータを、教師データ（データＤ１，Ｄ２）に基づいて定めることになる。この結果、本実施形態における学習モデル４２は、非線形の回帰モデルまたは、非線形の分類モデルとなる。以下、各機能ブロックが実行する処理の一例を、図５を参照しつつ説明する。 Although the learning unit 52 executes support vector regression here, other nonlinear regression analysis (for example, nonlinear least squares method, polynomial regression) may be used, and nonlinear classification (for example, support vector regression) may be used. vector machine). Further, for example, a neural network model may be used as the learning model 42, and in such a case, the learning unit 52 determines the parameters of the neural network model based on the teacher data (data D1, D2). As a result, the learning model 42 in this embodiment becomes a nonlinear regression model or a nonlinear classification model. An example of processing executed by each functional block will be described below with reference to FIG. 5.

＜＜学習処理Ｓ１０＞＞
まず、取得部５０は、記憶装置３２に格納されたデータセット４１を取得する（Ｓ２０）。そして、割当部５１は、データセット４１に含まれるデータに、“０”または“１”をランダムに割り当てる（Ｓ２１：割当処理）。例えば、割当部５１は、データセット４１の時刻ｔ１のデータ（ｘ１，～，ｘ１０）に、“０”を対応させて記憶させ、データセット４１を更新する。この結果、図６に示すように、記憶装置３２において、値が割り当てられた後のデータセット４３の時刻ｔ１のデータは、ラベルとして“０”が付与されたデータ（ｘ１，～，ｘ１０，ｙ）＝（３０．１，～，３５．２，０）となる。 <<Learning process S10>>
First, the acquisition unit 50 acquires the data set 41 stored in the storage device 32 (S20). Then, the allocation unit 51 randomly allocates "0" or "1" to the data included in the data set 41 (S21: allocation process). For example, the allocation unit 51 updates the data set 41 by storing “0” in association with the data (x1, to x10) at time t1 in the data set 41. As a result, as shown in FIG. 6, in the storage device 32, the data at time t1 of the data set 43 after the value is assigned is the data (x1, ~, x10, y )=(30.1,~,35.2,0).

また、割当部５１は、データセット４１の時刻ｔ２のデータ（ｘ１，～，ｘ１０）に、“１”を対応させて記憶させ、データセット４１を更新する。以下、他の時刻においても、時刻ｔ１，ｔ２と同様であるため、詳細な説明は省略する。 Further, the allocation unit 51 stores "1" in association with the data (x1, to x10) at time t2 of the data set 41, and updates the data set 41. Hereinafter, since the other times are similar to times t1 and t2, detailed explanation will be omitted.

そして、学習部５２は、割当部５１により“０”が割り当てられたデータＤ１（第１データ）と、“１”が割り当てられたデータＤ２（第２データ）とを教師データとし、学習モデル４２の学習を行い、“関数ｆ”を求める（Ｓ２２：学習処理）。 Then, the learning unit 52 uses the data D1 (first data) to which “0” is assigned by the assignment unit 51 and the data D2 (second data) to which “1” is assigned as teacher data, and uses the learning model 42 as teacher data. is learned and the "function f" is determined (S22: learning process).

図７及び図８は、“関数ｆ”について説明するための図である。なお、本実施形態では、データセット４１の各時刻のデータは、１０個のデータ（ｘ１～ｘ１０）を含むが、便宜上、図７ではデータがｘ１の１個の場合の“関数ｆ”を説明し、図８ではデータがｘ１，ｘ２の２個の場合の“関数ｆ”を説明する。 7 and 8 are diagrams for explaining the "function f". In this embodiment, the data at each time in the data set 41 includes 10 pieces of data (x1 to x10), but for convenience, the “function f” when there is only one piece of data x1 will be explained in FIG. However, in FIG. 8, the "function f" when there are two pieces of data x1 and x2 will be explained.

図７において、変数ｘ１の正常なデータは、例えば“ａ０”～“ａ１”の範囲Ａに含まれることとする。また、範囲Ａのデータｘ１のうち、“０”が割り当てられたデータＤ１は、ｙ＝０の直線上にプロットされ、“１”が割り当てられたデータＤ２は、ｙ＝１の直線上にプロットされる。 In FIG. 7, normal data of variable x1 is assumed to be included in range A from "a0" to "a1", for example. Also, among the data x1 in range A, data D1 to which "0" is assigned is plotted on the straight line of y = 0, and data D2 to which "1" is assigned is plotted on the straight line of y = 1. be done.

このような場合に、学習部５２により、非線形の回帰分析（または、非線形の分類）が実行されると、例えば、“関数ｆｆ（ｘ１）”の出力（値）ｙは、データｘ１が“ａ０”～“ａ１”の範囲において、“０”～“１”の間で変化する。また、“関数ｆ（ｘ１）”の出力ｙは、データｘ１が“ａ０”より小さい領域では、“１”より大きくなり、データｘ１が“ａ１”より大きい領域では、“０”より小さくなる。 In such a case, when the learning unit 52 executes a nonlinear regression analysis (or nonlinear classification), for example, the output (value) y of the “function ff(x1)” is determined by the fact that the data x1 is “a0 ” to “a1”, it changes between “0” and “1”. Further, the output y of the "function f(x1)" becomes greater than "1" in a region where data x1 is smaller than "a0", and becomes smaller than "0" in a region where data x1 is larger than "a1".

図８では、正常なデータ（ｘ１，ｘ２）は、例えば、ｘ１軸とｘ２軸とを含むｘ１－ｘ２平面の領域Ｂに含まれることとする。領域Ｂのデータ（ｘ１，ｘ２）のうち、“０”が割り当てられたデータＤ１は、ｙ＝０のｘ１－ｘ２平面の領域Ｃにプロットされ、“１”が割り当てられたデータＤ２は、ｙ＝１のｘ１－ｘ２平面の領域Ｄにプロットされる。 In FIG. 8, normal data (x1, x2) is assumed to be included in region B of the x1-x2 plane including, for example, the x1 axis and the x2 axis. Among the data (x1, x2) in area B, data D1 to which "0" is assigned is plotted in area C of the x1-x2 plane where y=0, and data D2 to which "1" is assigned is plotted in y is plotted in area D of the x1-x2 plane where =1.

このような場合に、学習部５２により、非線形の回帰分析（または、非線形の分離）が実行されると、例えば、領域Ｃ，Ｄの間で“０”～“１”の値で変化し、領域Ｃ，Ｄ以外の領域では大きく値が変化する“関数ｆ（ｘ１，ｘ２）”が求められる。 In such a case, when the learning unit 52 executes nonlinear regression analysis (or nonlinear separation), for example, the value changes between regions C and D between "0" and "1", In areas other than areas C and D, a "function f(x1, x2)" whose value changes greatly is determined.

つまり、処理Ｓ２２で求められる“関数ｆ（ｘ１，～，ｘ１０）”は、正常データが入力されると、例えば“０”～“１”の間の値を出力し、異常データが入力されると、“０”～“１”から大きく離れた値を出力する関数である。このような“関数ｆ”を用いることにより、データｘ１～ｘ１０の“異常値”が把握できるため、ショーケース３００の異常検知が可能となる。 In other words, the "function f(x1, ~, x10)" found in process S22 outputs a value between "0" and "1" when normal data is input, and when abnormal data is input. This is a function that outputs a value far away from "0" to "1". By using such a "function f", the "abnormal values" of the data x1 to x10 can be ascertained, making it possible to detect an abnormality in the showcase 300.

＜＜＜判定装置２１について＞＞＞
＝＝判定装置２１の構成＝＝
図９は、判定装置２１のハードウェア構成の一例を示す図である。判定装置２１は、ＣＰＵ７０、メモリ７１、記憶装置７２、入力装置７３、表示装置７４、及び通信装置７５を含むコンピュータである。なお、判定装置２１のハードウェア構成は、学習装置２０のハードウェア構成と同様であるため、ここでは詳細な説明は省略する。 <<<About the determination device 21>>>
==Configuration of determination device 21==
FIG. 9 is a diagram showing an example of the hardware configuration of the determination device 21. The determination device 21 is a computer including a CPU 70, a memory 71, a storage device 72, an input device 73, a display device 74, and a communication device 75. Note that the hardware configuration of the determination device 21 is similar to the hardware configuration of the learning device 20, so a detailed explanation will be omitted here.

記憶装置７２（記憶部）は、学習モデル４２、判定プログラム８０、及び判定データ８１を記憶する。学習モデル４２は、学習装置２０で構築されたモデルである。 The storage device 72 (storage unit) stores the learning model 42, the determination program 80, and the determination data 81. The learning model 42 is a model constructed by the learning device 20.

判定プログラム８０は、制御プログラム４０と同様に、判定装置２１が有する各種機能を実現するためのプログラムを総称している。 Similar to the control program 40, the determination program 80 is a general term for programs for realizing various functions of the determination device 21.

判定データ８１は、ショーケース３００に異常が有るか否かを判定した判定結果を示すデータである。 The determination data 81 is data indicating the determination result of determining whether or not there is an abnormality in the showcase 300.

＝＝機能ブロック＝＝
図１０は、判定装置２１に実現される機能ブロックの一例を示す図である。判定装置２１のＣＰＵ７０が、判定プログラム８０を実行することにより、判定装置２１には、取得部１００、計算部１０１、及び判定部１０２が実現される。 ==Functional block==
FIG. 10 is a diagram showing an example of functional blocks implemented in the determination device 21. As shown in FIG. When the CPU 70 of the determination device 21 executes the determination program 80, the determination device 21 implements an acquisition unit 100, a calculation unit 101, and a determination unit 102.

取得部１００は、センサ３１０から出力されるデータｘ１～ｘ１０を、所定間隔毎（例えば、３０秒毎）に取得する。 The acquisition unit 100 acquires data x1 to x10 output from the sensor 310 at predetermined intervals (for example, every 30 seconds).

計算部１０１は、取得部１００が取得したデータ（以下、「取得データ」と称する。）と、記憶装置７２に記憶された学習モデル４２とに基づいて、学習モデル４２を示す“関数ｆ”の出力（関数ｆの値）を計算する。 The calculation unit 101 calculates a “function f” indicating the learning model 42 based on the data acquired by the acquisition unit 100 (hereinafter referred to as “acquired data”) and the learning model 42 stored in the storage device 72. Calculate the output (value of function f).

判定部１０２は、計算部１０１が計算した“関数ｆ”の値が、例えば“０”より小さい閾値Ｔ１（第１閾値）と、“１”より大きい閾値Ｔ２（第２閾値）との間の範囲Ｆに入る場合、取得データが正常データであると判定し、範囲Ｆに入らない場合、取得データが異常データであると判定する。以下、各機能ブロックの詳細を、判定装置２１で実行される判定処理とともに説明する。 The determination unit 102 determines whether the value of the “function f” calculated by the calculation unit 101 is between, for example, a threshold T1 (first threshold) smaller than “0” and a threshold T2 (second threshold) larger than “1”. If it falls within range F, it is determined that the acquired data is normal data, and if it does not fall within range F, it is determined that the acquired data is abnormal data. The details of each functional block will be explained below along with the determination process executed by the determination device 21.

＜＜判定処理Ｓ１００＞＞
まず、図１１に示すように、取得部１００は、センサ３１０からのデータｘ１～ｘ１０を取得する（Ｓ２００）。そして、計算部１０１は、取得データ（ｘ１，～，ｘ１０）と、学習モデル４２として得られた関数ｆ（ｘ１，～，ｘ１０））と、に基づいて、“関数ｆ”の出力ｙを計算する（Ｓ２０１）。 <<Determination process S100>>
First, as shown in FIG. 11, the acquisition unit 100 acquires data x1 to x10 from the sensor 310 (S200). Then, the calculation unit 101 calculates the output y of the “function f” based on the acquired data (x1, ~, x10) and the function f (x1, ~, x10)) obtained as the learning model 42. (S201).

判定部１０２は、“出力ｙ”の値が、閾値Ｔ１～Ｔ２の範囲Ｆに入るか否かを判定する（Ｓ２０２）。図１２は、処理Ｓ２０２で実行される内容を説明するための図である。なお、“関数ｆ”の値は、１０個のデータｘ１～ｘ１０に基づいて定まるが、図１２では、便宜上、１つの変数（データｘ１）を用いて説明している。 The determining unit 102 determines whether the value of "output y" falls within the range F of threshold values T1 to T2 (S202). FIG. 12 is a diagram for explaining the contents executed in process S202. Note that although the value of the "function f" is determined based on ten pieces of data x1 to x10, one variable (data x1) is used for explanation in FIG. 12 for convenience.

計算された“出力ｙ”値が、例えば図１２の“Ｐ１（Ｔ１＜Ｐ１＜Ｔ２）”である場合、つまり、“出力ｙ”の値が範囲Ｆに入る場合（Ｓ２０２：Ｙｅｓ）、判定部１０２は、取得データは、正常データであると判定する（Ｓ２０３）。 If the calculated "output y" value is, for example, "P1 (T1<P1<T2)" in FIG. 12, that is, if the value of "output y" falls within range F (S202: Yes), the determination unit 102 determines that the acquired data is normal data (S203).

一方、計算された“関数ｆ”値が、例えば図１２の“Ｐ２（Ｔ１＞Ｐ２”）である場合、つまり、“出力ｙ”の値が範囲Ｆに入らない場合（Ｓ２０２：Ｎｏ）、判定部１０２は、取得データは、異常データであると判定する（Ｓ２０４）。そして、判定部１０２は、処理Ｓ２０３，２０４の判定結果を、記憶装置７２に格納し、判定データ８１を更新する（Ｓ２０５）。 On the other hand, if the calculated "function f" value is, for example, "P2 (T1>P2") in FIG. 12, that is, if the value of "output y" does not fall within range F (S202: No), the judgment The unit 102 determines that the acquired data is abnormal data (S204). Then, the determination unit 102 stores the determination results of processes S203 and 204 in the storage device 72, and updates the determination data 81 (S205).

＜＜実証結果＞＞
図１３は、ショーケース３００に実際に故障（例えば、冷媒漏れ）が発生した際の、判定装置２１の判定結果の一例である。＜＜Demonstration results＞＞
FIG. 13 is an example of the determination result of the determination device 21 when a failure (for example, refrigerant leak) actually occurs in the showcase 300.

まず、１月～２月においては、ショーケース３００は正常に動作しているため、計算部１０１の計算結果である“出力ｙ”の値は、基本的に範囲Ｆに入っている。このため、この期間においては、取得データ（ｘ１～ｘ１０）が、異常データであるとの判定結果は基本的に出力されていない。なお、１月～２月の間に１回だけ“出力ｙ”の値が閾値Ｔ１となり、異常検出がされているが、これはノイズ等の影響によるものである。 First, in January and February, the showcase 300 operates normally, so the value of "output y", which is the calculation result of the calculation unit 101, basically falls within the range F. Therefore, during this period, the determination result that the acquired data (x1 to x10) is abnormal data is basically not output. Note that the value of "output y" reached the threshold value T1 only once between January and February, and an abnormality was detected, but this was due to the influence of noise and the like.

そして、２月に入り、ショーケース３００のコンプレッサー（不図示）から冷媒漏れが発生すると、計算部１０１の計算結果が、範囲Ｆを超える回数が増加する。この結果、特に２月中旬以降、判定部１０２が、取得データ（ｘ１～ｘ１０）が、異常データであるとの判定結果を出力する回数も増加する。 Then, in February, when refrigerant leaks from the compressor (not shown) of the showcase 300, the number of times the calculation result of the calculation unit 101 exceeds the range F increases. As a result, especially after mid-February, the number of times the determination unit 102 outputs a determination result that the acquired data (x1 to x10) is abnormal data also increases.

２月末に、冷媒漏れが修理されると、ショーケース３００は再び正常に動作するため、計算部１０１の計算結果は、範囲Ｆに収まる。この結果、判定部１０２は、取得データ（ｘ１～ｘ１０）は、正常データであるとの判定結果を出力する。 When the refrigerant leak is repaired at the end of February, the showcase 300 operates normally again, so the calculation result of the calculation unit 101 falls within range F. As a result, the determination unit 102 outputs a determination result that the acquired data (x1 to x10) is normal data.

＝＝＝まとめ＝＝＝
以上、本実施形態の異常検知システム１０について説明した。学習装置２０の割当部５１は、正常なデータを含むデータセット（所定の属性に属する複数のデータ）に対し、“０”または“１”を割り当てて、教師データを生成する（例えば、処理Ｓ２１）。そして、学習部５２は、入力されるデータｘ１～ｘ１０（入力データ）が、正常なデータ（所定の属性のデータ）であるか否かを判定するための学習モデル４２の学習を、教師データを用いて行う（例えば、処理Ｓ２２）。このため、本実施形態では、異常なサンプルデータを用いることなく、異常なデータ（異常値）を検出するための学習モデル４２を構築できる。さらに、本実施形態では、学習モデル４２の学習を行う際に、様々な回帰、分類を用いることができる。このため、一般的な異常値検出方法（例えば、One Class SVM）を用いる場合と比較して、学習モデル４２が適用される分野等に合わせ、最適な学習モデル４２を構築できる。 ===Summary===
The abnormality detection system 10 of this embodiment has been described above. The allocation unit 51 of the learning device 20 allocates "0" or "1" to a dataset containing normal data (a plurality of data belonging to a predetermined attribute) to generate teacher data (for example, in step S21 ). Then, the learning unit 52 uses the teacher data to perform the learning of the learning model 42 to determine whether the input data x1 to x10 (input data) is normal data (data with a predetermined attribute). (for example, processing S22). Therefore, in this embodiment, the learning model 42 for detecting abnormal data (abnormal values) can be constructed without using abnormal sample data. Furthermore, in this embodiment, various regressions and classifications can be used when learning the learning model 42. Therefore, compared to the case where a general abnormal value detection method (for example, One Class SVM) is used, an optimal learning model 42 can be constructed according to the field to which the learning model 42 is applied.

また、例えば、割当部５１は、データセット４１に含まれるデータに、“０”または“１”を割り当てる際、所定のパターンに従って２値を割り当てても良い。なお、「所定のパターン」とは、例えば、“０”を１００個割り当てた後に、“１”を１００個割り当てる等、割り当てる値の個数や順番を決めて割り当てることをいう。一般に、データセット４１に含まれるデータは、例えば時系列で取得されたデータであることが多い。したがって、割り当てる値の個数や順番を決めると、ノイズ等の影響を受けたデータの多くに同じ値が割り当てられることがある。本実施形態では、割当部５１は、ランダムに２値を割り当てるため、データがノイズの影響を受けることを抑制できる。 Further, for example, when assigning "0" or "1" to the data included in the data set 41, the assigning unit 51 may assign binary values according to a predetermined pattern. Note that the "predetermined pattern" refers to the assignment after determining the number and order of the values to be assigned, such as assigning 100 "0"s and then assigning 100 "1s". Generally, the data included in the data set 41 is often data acquired in time series, for example. Therefore, when the number and order of values to be assigned are determined, the same value may be assigned to much of the data affected by noise or the like. In this embodiment, the allocation unit 51 randomly allocates binary values, so that data can be prevented from being influenced by noise.

また、学習モデル４２は、線形の分類モデル、または線形の回帰モデルであっても良い。ただし、線形のモデルが用いられた場合、一般に、正常なデータが入力された際の“出力ｙ”の値と、異常なデータが入力された際の“出力ｙ”の値との差が小さくなる。本実施形態では、学習モデル４２として、非線形の分類モデルまたは、非線形の回帰モデルを用いている。したがって、一般に、正常なデータが入力された際の“出力ｙ”の値と、異常なデータが入力された際の“出力ｙ”の値との差が大きくなるため、精度良く異常データ（異常値）を検出できる。 Further, the learning model 42 may be a linear classification model or a linear regression model. However, when a linear model is used, the difference between the "output y" value when normal data is input and the "output y" value when abnormal data is input is generally small. Become. In this embodiment, a nonlinear classification model or a nonlinear regression model is used as the learning model 42. Therefore, in general, the difference between the value of "output y" when normal data is input and the value of "output y" when abnormal data is input is large, so abnormal data (abnormal value) can be detected.

また、判定装置２１は、正常なデータで構築された学習モデル４２を用いることにより、異常なデータの有無を判定できる。したがって、異常なデータを多く収集することが困難な場合であっても、異常なデータの検知が可能となる。 Moreover, the determination device 21 can determine the presence or absence of abnormal data by using the learning model 42 constructed using normal data. Therefore, even if it is difficult to collect a large amount of abnormal data, abnormal data can be detected.

上記の実施形態は、本発明の理解を容易にするためのものであり、本発明を限定して解釈するためのものではない。また、本発明は、その趣旨を逸脱することなく、変更や改良され得るとともに、本発明にはその等価物が含まれるのはいうまでもない。 The above-described embodiments are provided to facilitate understanding of the present invention, and are not intended to be interpreted as limiting the present invention. Further, the present invention can be modified and improved without departing from the spirit thereof, and it goes without saying that the present invention includes equivalents thereof.

例えば、本実施形態のデータは、ショーケース３００のセンサ３１０から出力されるデータｘ１～ｘ１０であったが、これに限られない。例えば、２値を割り当てるデータは、他の機器に設けられたセンサの出力や、サーバのアクセスログ等のデータであっても良い。このようなデータであっても、モデルの学習を行うことにより、異常値を検出することができる。 For example, the data in this embodiment is data x1 to x10 output from the sensor 310 of the showcase 300, but is not limited thereto. For example, the data to which binary values are assigned may be data such as the output of a sensor provided in another device or an access log of a server. Even with such data, abnormal values can be detected by learning the model.

また、割当部５１は、データセット４１に含まれるデータに、“０”または“１”を割り当てることとしたが、これらの値に限られない。例えば、割当部５１は、データに“－１”または“１”等、異なる２つの値を割り当てればよい。 Further, although the allocation unit 51 allocates "0" or "1" to the data included in the data set 41, the allocation unit 51 is not limited to these values. For example, the allocation unit 51 may allocate two different values, such as "-1" or "1", to the data.

また、範囲Ｆを定める閾値Ｔ１は、“０”より小さく、閾値Ｔ２は“１”より大きいこととしたが、これに限られない。例えば、閾値Ｔ１は、“０”以上であり、閾値Ｔ２は“１”以下であっても良い。このような場合であっても、異常値を検出することは可能である。 Further, although the threshold value T1 that defines the range F is smaller than "0" and the threshold value T2 is larger than "1", the present invention is not limited to this. For example, the threshold value T1 may be greater than or equal to "0", and the threshold value T2 may be less than or equal to "1". Even in such a case, it is possible to detect abnormal values.

１０異常検知システム
２０学習装置
２１判定装置
２５ネットワーク
３０，７０ＣＰＵ
３１，７１メモリ
３２，７２記憶装置
３３，７３入力装置
３４，７４表示装置
３５，７５通信装置
４０制御プログラム
４１，４３データセット
４２学習モデル
５０，１００取得部
５１割当部
５２学習部
８０判定プログラム
８１判定データ
１０１計算部
１０２判定部

10 Anomaly detection system 20 Learning device 21 Determination device 25 Network 30, 70 CPU
31, 71 Memory 32, 72 Storage device 33, 73 Input device 34, 74 Display device 35, 75 Communication device 40 Control program 41, 43 Data set 42 Learning model 50, 100 Acquisition unit 51 Allocation unit 52 Learning unit 80 Judgment program 81 Judgment data 101 Calculation section 102 Judgment section

Claims

an assignment unit that assigns a first value or a second value less than or equal to the first value to each of the plurality of data belonging to an attribute indicating normal data;
The first data to which the first value is assigned among the plurality of data and the second data to which the second value is assigned among the plurality of data are used as training data, and the input data is set to the normal state. If the input data belongs to an attribute indicating normal data, a value included in a predetermined range of less than or equal to the first value and greater than or equal to the second value is output, and if the input data does not belong to the attribute indicating normal data, the a learning unit that trains the model to output values that are not included in a predetermined range;
An information processing device comprising:

The information processing device according to claim 1,
The allocation department is
Randomly assigning the first value or the second value to each of the plurality of data;
An information processing device characterized by:

The information processing device according to claim 1 or 2,
the model is a non-linear classification model or a non-linear regression model;
An information processing device characterized by:

The computer is
an assignment process of assigning a first value or a second value less than or equal to the first value to each of a plurality of data belonging to an attribute indicating normal data;
The first data to which the first value is assigned among the plurality of data and the second data to which the second value is assigned among the plurality of data are used as training data, and the input data is set to the normal state. If the input data belongs to an attribute indicating normal data, a value included in a predetermined range of less than or equal to the first value and greater than or equal to the second value is output, and if the input data does not belong to the attribute indicating normal data, the a learning process that trains the model to output values that are not included in a predetermined range;
A method for training the model to perform.

Learning using first data to which a first value is assigned among a plurality of data belonging to an attribute indicating normal data, and second data to which a second value is assigned from among the plurality of data as training data. a storage unit that stores a model for determining whether the input data belonging to the attribute indicating the normal data;
a calculation unit that calculates a value for determining whether the input data belongs to the attribute indicating the normal data, based on the input data and the model;
If the value is included in a predetermined range determined by a first threshold value that is less than or equal to the first value and a second threshold value that is greater than or equal to the second value, the input data belongs to the attribute indicating the normal data. a determining unit that determines that the input data does not belong to the attribute indicating the normal data if the value is not included in the predetermined range;
A determination device comprising: