JP7246330B2

JP7246330B2 - How to monitor monitored data

Info

Publication number: JP7246330B2
Application number: JP2020006408A
Authority: JP
Inventors: 敬子大久保; 敏明鈴木
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2020-01-17
Filing date: 2020-01-17
Publication date: 2023-03-27
Anticipated expiration: 2040-01-17
Also published as: JP2021114121A

Description

本発明は、監視対象データを監視する方法に関する。 The present invention relates to a method for monitoring monitored data.

製造業や店舗、倉庫などの様々な作業現場においては、ＩｏＴ（ＩｎｔｅｒｎｅｔｏｆＴｈｉｎｇｓ）化が進み、装置やヒトなどに設置された各種のセンサから様々なデータが収集され、蓄積されている。あらかじめ取得した安定稼動時のデータを学習し、監視時間帯のデータと学習したデータを比較することにより状態監視を行う技術が知られている。また、近年ではオートエンコーダを用いて異常を検知する方法が画像や通信などの分野で提案されている。オートエンコーダを用いた異常検知では、安定稼働時のデータを学習してオートエンコーダを生成し、監視時間帯のデータと、監視時間帯のデータを前記生成したオートエンコーダに入力して得られる出力データとの差異を基に異常を検出する。 In various work sites such as manufacturing industries, stores, and warehouses, the Internet of Things (IoT) is progressing, and various data are collected and accumulated from various sensors installed in devices, people, and the like. A technology is known in which state monitoring is performed by learning data obtained in advance during stable operation and comparing the data during the monitoring period with the learned data. Also, in recent years, methods of detecting anomalies using autoencoders have been proposed in the fields of images and communications. In the anomaly detection using an autoencoder, the autoencoder is generated by learning the data during stable operation, and the output data obtained by inputting the monitoring time period data and the monitoring time period data into the generated autoencoder. Detect anomalies based on the difference between

特許文献１では、オートエンコーダの復元誤差（例えば、入力データｘと出力データｙの平均二乗誤差）を異常度として算出し、予め設定した閾値を超えている場合に、異常を検知する。 In Patent Literature 1, the recovery error of the autoencoder (for example, the mean square error between input data x and output data y) is calculated as the degree of anomaly, and an anomaly is detected when it exceeds a preset threshold.

特開２０１９－４９７７８号公報JP 2019-49778 A

特許文献１では、複数の要素から構成される入力データｘと出力データｙに対して、一つの異常度を算出しており、入力データの要素毎の振れ幅を考慮していない。このため、平常時における値の振れ幅が要素毎に異なる場合に、個々の振れ幅に対応して状態を識別することは困難である。 In Patent Document 1, one degree of abnormality is calculated for input data x and output data y composed of a plurality of elements, and the amplitude of each element of the input data is not taken into consideration. For this reason, when the fluctuation width of the value in the normal state differs for each element, it is difficult to identify the state corresponding to each fluctuation width.

本発明はこのような背景に鑑みてなされたもので、その一態様の目的は、複数の要素から構成されるデータ群について、平常時における値の振れ幅が要素間で異なる場合でも、個々の振れ幅に対応して評価可能な技術を提供することにある。 The present invention has been made in view of such a background, and an object of one aspect thereof is to provide a data group composed of a plurality of elements, even if the fluctuation range of values in normal times differs between elements, individual To provide a technology that can be evaluated corresponding to the amplitude.

本発明の一態様は、計算機システムが監視対象データを監視する方法であって、前記計算機システムは分析モデルを格納し、前記分析モデルは、複数の要素で構成される入力データに対して前記入力データと同じ要素数の復元データを出力し、入力された平常データと前記平常データに対する復元データとの差分が小さくなるように学習されており、前記方法は、前記計算機システムが、複数の差分分布生成用平常データと前記複数の差分分布生成用平常データを前記分析モデルに入力して得られる復元データとの、要素毎の差分の分布を生成し、監視対象データと前記監視対象データを前記分析モデルに入力して得られる復元データとの要素毎の差分と、前記要素毎の差分の分布との比較を行い、前記比較を基に、要素毎の指標値を算出する。 One aspect of the present invention is a method for monitoring monitoring target data by a computer system, wherein the computer system stores an analysis model, and the analysis model is used for the input data composed of a plurality of elements. The computer system outputs restored data having the same number of elements as the data, and learns so that the difference between the input normal data and the restored data with respect to the normal data is small. generating a difference distribution for each element between the normal data for generation and the restored data obtained by inputting the plurality of normal data for generating difference distribution into the analysis model, and analyzing the monitoring target data and the monitoring target data; A difference for each element from restored data obtained by inputting to the model is compared with the distribution of the difference for each element, and an index value for each element is calculated based on the comparison.

上記のように構成された本発明の一態様によれば、複数の要素から構成される入力データについて、要素毎の振れ幅を考慮した評価が可能になる。 According to one aspect of the present invention configured as described above, input data composed of a plurality of elements can be evaluated in consideration of the amplitude of each element.

状態監視システムを含むシステムの概略構成図である。1 is a schematic configuration diagram of a system including a state monitoring system; FIG. 入力データの変動の一例を示す図である。It is a figure which shows an example of a change of input data. 状態監視システムの構成要素として用いることが可能な情報処理装置の一例を示す図である。It is a figure which shows an example of the information processing apparatus which can be used as a component of a condition-monitoring system. データ格納装置の機能及びデータの概要を示す図である。3 is a diagram showing an overview of functions and data of a data storage device; FIG. データ群としての測定データの一例である。It is an example of measurement data as a data group. データ分析装置の機能及びデータの概要を示す図である。It is a figure which shows the function of a data-analysis apparatus, and the outline|summary of data. 前処理後データの一例である。It is an example of post-preprocessing data. 差分分布データの一例である。It is an example of difference distribution data. 分析結果データの一例である。It is an example of analysis result data. 学習時の処理を示すシーケンス図である。FIG. 10 is a sequence diagram showing processing during learning; 学習処理を示すフローチャートである。4 is a flowchart showing learning processing; オートエンコード前後のデータの要素毎の差の算出方法の概略を示す図である。FIG. 4 is a diagram showing an outline of a method of calculating a difference for each element of data before and after auto-encoding; 状態監視時の分析処理を示すシーケンス図である。FIG. 10 is a sequence diagram showing analysis processing during state monitoring; 分析処理を示すフローチャートである。4 is a flowchart showing analysis processing; 要素毎の差分と、差分の標準化後の値、及び異常発生確率との関係を示す図である。FIG. 10 is a diagram showing the relationship between the difference for each element, the standardized value of the difference, and the probability of occurrence of an abnormality; パラメータ設定画面の一例を示す図である。It is a figure which shows an example of a parameter setting screen. 分析結果表示画面の一例を示す図である。It is a figure which shows an example of an analysis result display screen. エッジ側で分析処理を行う状態監視システムの概略構成図である。1 is a schematic configuration diagram of a state monitoring system that performs analysis processing on an edge side; FIG. リアルタイムな分析処理を示すシーケンス図である。FIG. 4 is a sequence diagram showing real-time analysis processing; 入力データの二次元配列化の概略を示す図である。It is a figure which shows the outline of two-dimensional arraying of input data. 二次元配列化された入力データを示す図である。It is a figure which shows the input data arranged two-dimensionally. 二次元配列データのオートエンコード前後の差分の算出方法の概略を示す図である。FIG. 4 is a diagram showing an outline of a method of calculating a difference between before and after auto-encoding two-dimensional array data; 入力データの二次元配列化の一例を示す図である。It is a figure which shows an example of two-dimensional arraying of input data. 入力データの二次元配列化の他の例を示す図である。FIG. 10 is a diagram showing another example of two-dimensional arraying of input data; 入力データの二次元配列化の他の例を示す図である。FIG. 10 is a diagram showing another example of two-dimensional arraying of input data; 入力データの二次元配列化の他の例を示す図である。FIG. 10 is a diagram showing another example of two-dimensional arraying of input data; 入力データを二以上の領域に分割する例を示す図である。It is a figure which shows the example which divides|segments input data into two or more area|regions. 分析結果の表示の一例を示す図である。FIG. 10 is a diagram showing an example of display of analysis results; 分析結果の表示の他の例を示す図である。FIG. 10 is a diagram showing another example of display of analysis results; 異常種別の判定結果の表示の一例を示す図である。FIG. 10 is a diagram showing an example of display of a determination result of an abnormality type; 状態の識別結果の表示の一例を示す図である。FIG. 10 is a diagram showing an example of a display of state identification results; 状態の識別結果の表示の一例を示す図である。FIG. 10 is a diagram showing an example of a display of state identification results; 状態の識別結果の表示の一例を示す図である。FIG. 10 is a diagram showing an example of a display of state identification results; オートエンコーダの構成の一例を示す図である。It is a figure which shows an example of a structure of an autoencoder. 一以上のオートエンコーダに対するオートエンコード前後の差分の分布の比較の一例を示す図である。FIG. 10 is a diagram showing an example of comparison of difference distributions before and after autoencoding for one or more autoencoders;

以下、図面に基づいて、本発明の実施の形態を説明する。本実施形態に係わる状態監視方法、又は状態監視システムは、二つ以上の要素から構成される入力データについて、平常時の値の振れ幅が要素毎に異なるケースにおいても、要素毎の振れ幅に対応した状態監視を行う。本実施形態に係わる状態監視技術は、例えば、製造、倉庫、店舗などにおけるＩｏＴ（ＩｎｔｅｒｎｅｔｏｆＴｈｉｎｇｓ）を利用できる。 BEST MODE FOR CARRYING OUT THE INVENTION An embodiment of the present invention will be described below based on the drawings. The state monitoring method or state monitoring system according to the present embodiment provides input data composed of two or more elements. Perform corresponding condition monitoring. The state monitoring technology according to the present embodiment can use IoT (Internet of Things) in manufacturing, warehouses, stores, and the like, for example.

図１～図１７を用いて第一実施例を説明する。本実施例では、例えば工場機器に設置された各種センサから収集したデータを分析し、安定稼動時（即ち平常時）に収集された平常データと監視時間帯に収集されたデータとの比較により、監視対象の機器が学習済みの平常時と異なる状態であることを検知する例を説明する。本実施例では、平常時と異なる状態を、異常（即ち故障や障害、及びその予兆）として検出する。 A first embodiment will be described with reference to FIGS. 1 to 17. FIG. In this embodiment, for example, data collected from various sensors installed in factory equipment is analyzed, and normal data collected during stable operation (that is, normal time) is compared with data collected during the monitoring period to obtain An example of detecting that the device to be monitored is in a state different from the learned normal state will be described. In this embodiment, a state different from the normal state is detected as an anomaly (that is, a failure or failure and its sign).

図１は、本実施例における状態監視システム１を含むシステムの例を示す。状態監視システム１は、例えばデータ分析装置１０２、データ格納装置１０１、及び入出力装置１０４を含んでいる。分析対象１０５又はその周辺には、センサ１０３が設置されている。そして、データ格納装置１０１、データ分析装置１０２、入出力装置１０４、センサ１０３は、通信ネットワーク１０６を介して接続されている。これにより、データ分析システム１は、センサ１０３から測定データを収集して分析する。 FIG. 1 shows an example of a system including a condition monitoring system 1 in this embodiment. The condition monitoring system 1 includes a data analysis device 102, a data storage device 101, and an input/output device 104, for example. A sensor 103 is installed on or around the analysis target 105 . Data storage device 101 , data analysis device 102 , input/output device 104 and sensor 103 are connected via communication network 106 . Thereby, the data analysis system 1 collects and analyzes measurement data from the sensor 103 .

通信ネットワーク１０６は、例えば、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）、ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）、インターネット、イントラネット、専用線、携帯電話網、光ファイバ等の、有線又は無線の通信基盤により構成される。 The communication network 106 is composed of a wired or wireless communication infrastructure such as a LAN (Local Area Network), a WAN (Wide Area Network), the Internet, an intranet, a dedicated line, a mobile phone network, an optical fiber, or the like.

なお、入出力装置１０４については、図１に示すようにネットワーク１０６を介して各機器（即ち、センサ１０３、データ格納装置１０１、及びデータ分析装置１０２）に接続される装置であってもよいし、各機器（即ち、センサ１０３、データ格納装置１０１、及びデータ分析装置１０２）に直接接続される装置であってもよい。 Note that the input/output device 104 may be a device connected to each device (that is, the sensor 103, the data storage device 101, and the data analysis device 102) via the network 106 as shown in FIG. , devices that are directly connected to each device (that is, the sensor 103, the data storage device 101, and the data analysis device 102).

分析対象１０５は、例えば、製品の製造工場等に設けられている工作機械や製造装置（例えば、プレス機、ＮＣ工作機械、ロボットアーム、成形機、加硫機、又は３次元プリンタなど）である。分析対象１０５は、工作機械又は製造装置に限らず、例えば、電車、電車に関連する設備、照明装置、内燃機関、エレベータ、エスカレータ、又は空調装置などでもよい。分析対象１０５は、必ずしも通信ネットワーク１０６に接続されていなくてもよい。 The analysis target 105 is, for example, a machine tool or manufacturing device (for example, a press machine, an NC machine tool, a robot arm, a molding machine, a vulcanizing machine, or a three-dimensional printer) installed in a product manufacturing factory. . The analysis target 105 is not limited to machine tools or manufacturing equipment, and may be, for example, trains, equipment related to trains, lighting devices, internal combustion engines, elevators, escalators, or air conditioners. The analysis target 105 does not necessarily have to be connected to the communication network 106 .

センサ１０３は、例えば分析対象１０５に設けられてもよいし、分析対象１０５の近傍に設けられてもよい。状態監視システム１は、分析対象１０５に連動する他の装置（不図示）に設けられたセンサ１０３を利用することもできる。 The sensor 103 may be provided on the analysis target 105 or may be provided in the vicinity of the analysis target 105, for example. The state monitoring system 1 can also use the sensor 103 provided in another device (not shown) that works in conjunction with the analysis target 105 .

センサ１０３の種類は問わない。センサ１０３は例えば、振動センサ、速度センサ、加速度センサ、電流センサ、電圧センサ、温度センサ、湿度センサ、圧力センサ、荷重センサ、トルクセンサ、ひずみセンサ、位置センサ、角度センサ、ジャイロセンサ、音センサ、光センサ、ガスセンサ、距離センサ、又はカラーセンサ等である。 The type of sensor 103 does not matter. The sensor 103 includes, for example, a vibration sensor, a speed sensor, an acceleration sensor, a current sensor, a voltage sensor, a temperature sensor, a humidity sensor, a pressure sensor, a load sensor, a torque sensor, a strain sensor, a position sensor, an angle sensor, a gyro sensor, a sound sensor, An optical sensor, a gas sensor, a distance sensor, a color sensor, or the like.

データ格納装置１０１は、例えば通信ネットワーク１０６を介してセンサ１０３から出力されるデータを取得する。以下、データ格納装置１０１が取得するデータ(即ち、センサデータ)のことを取得データと称する。この取得データは「データ群」又は「データ系列」の例であり、測定データと呼ぶこともできる。ただし、取得データは、時系列のデータに限定されない。データ格納装置１０１は、取得データを格納し、データ分析装置１０２からの要求に応じて、要求されたデータをデータ分析装置１０２へ送信する。 The data storage device 101 acquires data output from the sensor 103 via the communication network 106, for example. Data acquired by the data storage device 101 (that is, sensor data) is hereinafter referred to as acquired data. This acquired data is an example of a "data group" or "data series" and can also be referred to as measurement data. However, the acquired data is not limited to time-series data. The data storage device 101 stores acquired data and transmits requested data to the data analysis device 102 in response to a request from the data analysis device 102 .

データ分析装置１０２は、センサ１０３から取得されたセンサデータに対して後述の学習処理、及び分析処理を行うことにより、機器の状態を監視し、異常を検出する。入出力装置１０４は、ユーザがデータ格納装置１０１及びデータ分析装置１０２にアクセスするためのユーザインタフェースを提供する。ユーザは、入出力装置１０４を介して、データ格納装置１０１及びデータ分析装置１０２に対して、データ入力や各種設定、及び分析処理を行うことができる。ユーザは、入出力装置１０４を介して、データ格納装置１０１及びデータ分析装置１０２が出力する情報を参照したり取得したりすることができる。 The data analysis device 102 performs learning processing and analysis processing, which will be described later, on sensor data acquired from the sensor 103 to monitor the state of the device and detect an abnormality. The input/output device 104 provides a user interface for users to access the data storage device 101 and the data analysis device 102 . Via the input/output device 104 , the user can input data, make various settings, and perform analysis processing with respect to the data storage device 101 and the data analysis device 102 . A user can refer to or acquire information output from the data storage device 101 and the data analysis device 102 via the input/output device 104 .

なお、データ格納装置１０１及びデータ分析装置１０２の入出力は、それぞれが有する入力デバイス及び出力デバイスを介して行われてもよい。データ格納装置１０１、データ分析装置１０２、及び入出力装置１０４は、いずれも情報処理装置である。これらは独立したハードウェアであってもよいし、これらのうちの二つ以上が共通のハードウェアで構成されていてもよい。 Input/output to/from the data storage device 101 and the data analysis device 102 may be performed via their respective input devices and output devices. The data storage device 101, the data analysis device 102, and the input/output device 104 are all information processing devices. These may be independent hardware, or two or more of them may be configured with common hardware.

またこれらの装置は、それぞれ、その全部又は一部が、例えばクラウドシステムにおけるクラウドサーバのような仮想的な情報処理資源を用いて構成されていてもよい。また、センサ１０３、データ格納装置１０１、及びデータ分析装置１０２の所有者はそれぞれ異なっていてもよいし、これらのうちの二つ以上の所有者が同じであってもよい。例えば、分析対象１０５の所有者がセンサ１０３を所有し、分析技術の所有者がデータ分析装置１０２を所有し、データ格納装置１０１の所有者がセンサの所有者と分析技術の所有者の仲介となり、データの収集、及び管理を行ってもよい。 All or part of each of these devices may be configured using virtual information processing resources such as a cloud server in a cloud system. Moreover, the owners of the sensor 103, the data storage device 101, and the data analysis device 102 may be different, or two or more of them may be owned by the same person. For example, the owner of the analysis object 105 owns the sensor 103, the owner of the analysis technology owns the data analysis device 102, and the owner of the data storage device 101 acts as an intermediary between the owner of the sensor and the owner of the analysis technology. , may collect and manage data.

また、分析対象１０５及びセンサ１０３は二台以上あっても構わないし、各センサ１０３の所有者が異なっていても構わない。また、状態監視システム１は、データ格納装置１０１、及びデータ分析装置１０２を二台以上含み、分散処理やデータの分散管理などを行っても構わない。 Further, there may be two or more analysis targets 105 and sensors 103, and each sensor 103 may be owned by a different person. Moreover, the state monitoring system 1 may include two or more data storage devices 101 and data analysis devices 102 to perform distributed processing and distributed management of data.

図２は、分析対象データの各要素と測定値の関係を示したイメージ図である。例えば、振動センサ、加速度センサ、又は音センサなどにより時系列に測定された測定データに対してスペクトル密度を用いて分析を実施するケースでは、予め定めた単位時間毎に、例えば高速フーリエ変換を用いてスペクトル密度を算出する。 FIG. 2 is an image diagram showing the relationship between each element of analysis target data and measured values. For example, in the case of performing analysis using spectral density on measurement data measured in time series by a vibration sensor, an acceleration sensor, or a sound sensor, for example, a fast Fourier transform is used for each predetermined unit time. to calculate the spectral density.

グラフ２０１は周波数とスペクトル密度を示すグラフとなり、要素ＩＤは周波数に、測定値はスペクトル密度に対応する。グラフ２０１は、上限値２１１、下限値２１２、正常範囲２１３及び平均値２１４を示す。平常時であれば、単位時間毎に類似するスペクトル密度分布が得られるが、スペクトル密度の正常範囲（即ち振れ幅）２１３は周波数により異なるケースがある。振れ幅が異なる要因としては、機器そのものの特性によるケースや、外的要因（例えば温度、湿度、近接する装置の稼働状況、又はその他の要因）によるケースが考えられる。 A graph 201 is a graph showing frequency and spectral density, with element IDs corresponding to frequencies and measured values corresponding to spectral densities. Graph 201 shows upper limit 211 , lower limit 212 , normal range 213 and average 214 . In normal times, similar spectral density distributions are obtained for each unit time, but there are cases where the normal range (ie, amplitude) 213 of the spectral density differs depending on the frequency. Factors that cause the amplitude to vary include the characteristics of the device itself, and external factors (for example, temperature, humidity, operating conditions of nearby devices, or other factors).

図３は、データ分析装置１０２、データ格納装置１０１、及び入出力装置１０４の各装置として用いることが可能な情報処理装置のハードウェア構成例である。図３に示すように、情報処理装置３００は、例えばプロセッサ３０１、主記憶装置３０２、補助記憶装置３０３、入力装置３０４、出力装置３０５、及び通信装置３０６を備える。これらは図示しないバス等の通信手段を介して互いに通信可能に接続されている。 FIG. 3 is a hardware configuration example of an information processing device that can be used as each device of the data analysis device 102 , the data storage device 101 , and the input/output device 104 . As shown in FIG. 3, the information processing device 300 includes a processor 301, a main memory device 302, an auxiliary memory device 303, an input device 304, an output device 305, and a communication device 306, for example. These are communicably connected to each other via communication means such as a bus (not shown).

プロセッサ３０１は、例えば、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）やＭＰＵ（ＭｉｃｒｏＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）を用いて構成される。プロセッサ３０１が、主記憶装置３０２に格納されているコンピュータプログラムを読み出して実行することにより、データ分析装置１０２、データ格納装置１０１、及び入出力装置１０４の様々な機能が実現される。 The processor 301 is configured using, for example, a CPU (Central Processing Unit) or an MPU (Micro Processing Unit). Various functions of the data analysis device 102, the data storage device 101, and the input/output device 104 are realized by the processor 301 reading and executing the computer programs stored in the main storage device 302. FIG.

主記憶装置３０２は、コンピュータプログラムやデータを記憶する装置であり、例えば、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、及び不揮発性半導体メモリ等である。 The main memory device 302 is a device that stores computer programs and data, and is, for example, a ROM (Read Only Memory), a RAM (Random Access Memory), a nonvolatile semiconductor memory, or the like.

補助記憶装置３０３は、例えば、ハードディスクドライブ、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）、光学式記憶媒体（即ち、ＣＤ（ＣｏｍｐａｃｔＤｉｓｃ）、及びＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）等）、ストレージシステム、ＩＣカード（ＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔＣａｒｄ）、ＳＤメモリカード、等の記録媒体の読取／書込装置、及びクラウドサーバの記憶領域等である。補助記憶装置３０３に格納されているコンピュータプログラムやデータは、主記憶装置３０２に随時読み込まれる。 The auxiliary storage device 303 includes, for example, a hard disk drive, an SSD (Solid State Drive), an optical storage medium (that is, a CD (Compact Disc), a DVD (Digital Versatile Disc), etc.), a storage system, and an IC card (Integrated Circuit Card). ), a read/write device for recording media such as an SD memory card, and a storage area of a cloud server. Computer programs and data stored in the auxiliary storage device 303 are read into the main storage device 302 as needed.

入力装置３０４は、例えば、キーボード、マウス、タッチパネル、カードリーダ、又は音声入力デバイス等である。出力装置３０５は、ユーザに処理経過や処理結果等の各種情報を提供するユーザインタフェースである。出力装置３０５は、例えば、画面表示装置（即ち、液晶モニタ、ＬＣＤ（ＬｉｑｕｉｄＣｒｙｓｔａｌＤｉｓｐｌａｙ）、又はグラフィックカード等）、音声出力装置（即ち、スピーカ等）、又は印字装置等である。なお、例えば、情報処理装置３００が、通信装置３０６を介して他の装置との間で情報を入出力してもよい。 The input device 304 is, for example, a keyboard, mouse, touch panel, card reader, voice input device, or the like. The output device 305 is a user interface that provides various information such as processing progress and processing results to the user. The output device 305 is, for example, a screen display device (ie, liquid crystal monitor, LCD (Liquid Crystal Display), graphic card, etc.), an audio output device (ie, speaker, etc.), or a printing device. For example, the information processing device 300 may input/output information to/from another device via the communication device 306 .

通信装置３０６は、ＬＡＮやインターネット等の通信手段を介した他の装置との間の通信を実現する有線方式又は無線方式の通信インタフェースである。通信装置３０６は、例えば、ＮＩＣ（ＮｅｔｗｏｒｋＩｎｔｅｒｆａｃｅＣａｒｄ）、無線通信モジュール、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）モジュール、又はシリアル通信モジュール等である。 The communication device 306 is a wired or wireless communication interface that realizes communication with another device via a communication means such as a LAN or the Internet. The communication device 306 is, for example, a NIC (Network Interface Card), a wireless communication module, a USB (Universal Serial Bus) module, or a serial communication module.

なお、データ格納装置１０１、データ分析装置１０２、及び入出力装置１０４が備える各種機能は、例えば、プロセッサ３０１が、主記憶装置３０２に読みだされているコンピュータプログラム（補助記憶装置３０３から主記憶装置３０２に読み出されるコンピュータプログラムを含む。）を実行することにより実現される。主記憶装置３０２、補助記憶装置３０３及びそれらの組み合わせは、それぞれ非一過性記憶媒体を含む記憶装置である。データ分析装置１０２、データ格納装置１０１、及び入出力装置１０４のそれぞれの機能は、１又は複数の情報処理装置３００に実装することができ、一つの情報処理装置３００が異なる装置の機能を含んでよい。このように、情報監視システムは、１以上のプロセッサ及び１以上の記憶装置を含む計算機システムで構成できる。 Various functions provided by the data storage device 101, the data analysis device 102, and the input/output device 104 are, for example, a computer program read by the processor 301 into the main storage device 302 (from the auxiliary storage device 303 to the main storage device). 302 includes a computer program). The main storage device 302, auxiliary storage device 303, and combinations thereof are storage devices each including a non-transitory storage medium. Each function of the data analysis device 102, the data storage device 101, and the input/output device 104 can be implemented in one or a plurality of information processing devices 300, and one information processing device 300 may include the functions of different devices. good. Thus, the information monitoring system can be configured with a computer system including one or more processors and one or more storage devices.

図４に、データ格納装置１０１が備える主な機能、及びデータ格納装置１０１が記憶する主なデータを示す。データ格納装置１０１は、データ記録機能４０１及びデータ読み出し機能４０２を備えており、測定データ４０３を記録する。データ記録機能４０１は、センサ１０３から受信したデータを測定データ４０３へ記録する。データ読み出し機能４０２は、データ分析装置１０２から指定されたデータを測定データ４０３から読出して、データ分析装置１０２へ送信する。 FIG. 4 shows main functions of the data storage device 101 and main data stored in the data storage device 101 . The data storage device 101 has a data recording function 401 and a data reading function 402 and records measurement data 403 . A data recording function 401 records the data received from the sensor 103 into the measurement data 403 . Data reading function 402 reads data specified by data analysis device 102 from measurement data 403 and transmits the data to data analysis device 102 .

図５に、測定データ４０３の一例を示す。データ格納装置１０１及びデータ分析装置１０２が記録する各データは、例えばテーブル形式で記録される。測定データ４０３は、例えば、センサ識別子５０１、測定日時５０２、及び測定値５０３の各項目を有するレコードを格納する。センサ識別子５０１には、センサ１０３の識別子が記録される。センサ識別子５０１は、測定を行ったセンサ１０３を特定するための情報である。センサ識別子５０１は例えば、センサ１０３が設置されている建物の名前、階、部屋番号、装置名称、センサ種類、又は通し番号などが単一で、又は組み合わされて生成された識別子でもよい。 An example of the measurement data 403 is shown in FIG. Each data recorded by the data storage device 101 and the data analysis device 102 is recorded, for example, in a table format. The measurement data 403 stores, for example, a record having items of a sensor identifier 501 , measurement date and time 502 , and a measurement value 503 . An identifier of the sensor 103 is recorded in the sensor identifier 501 . The sensor identifier 501 is information for identifying the sensor 103 that performed the measurement. The sensor identifier 501 may be, for example, the name of the building in which the sensor 103 is installed, the floor, the room number, the device name, the sensor type, the serial number, or the like, or an identifier generated by combining them.

測定日時５０２には、データが測定された日時が記録される。日時は必要に応じ、時間、分、秒、又は秒未満の単位まで記録してもよい。測定値５０３には、センサ１０３が測定した値が記録される。なお、本例は測定データの一例であり、機器の動作角度や、基点からの経過時間や距離などを併せて記録しても構わない。 The measurement date and time 502 records the date and time when the data was measured. The date and time may be recorded in hours, minutes, seconds, or fractions of a second, as appropriate. A value measured by the sensor 103 is recorded in the measured value 503 . Note that this example is just an example of measurement data, and the operating angle of the device, the elapsed time and distance from the base point, and the like may also be recorded.

図６に、データ分析装置１０２が備える主な機能、及びデータ分析装置１０２が格納する主なデータを示す。データ分析装置１０２は、例えばパラメータ設定機能６０１、データ前処理機能６０２、モデル生成機能６０３、モデル演算機能６０４、分布算出機能６０５、異常確率算出機能６０６、及び結果表示機能６０７を有する。また、データ分析装置１０２は、前処理後データ６０８、差分分布データ６０９、分析結果データ６１０、及び分析モデル６１１を格納する。 FIG. 6 shows main functions provided by the data analysis device 102 and main data stored by the data analysis device 102 . The data analysis device 102 has, for example, a parameter setting function 601, a data preprocessing function 602, a model generation function 603, a model calculation function 604, a distribution calculation function 605, an abnormality probability calculation function 606, and a result display function 607. The data analysis device 102 also stores post-preprocessing data 608 , difference distribution data 609 , analysis result data 610 , and an analysis model 611 .

パラメータ設定機能６０１は、学習、及び監視時に必要なパラメータの設定を行うための画面を表示し、入力された情報を読み込む。データ前処理機能６０２は、データ格納装置１０１から受信した測定データに対して、前処理を行い、前処理後データ６０８に記録する。前処理は、例えば、所定の単位時間当たりの平均値、中間値、最頻値、実効値、最大値、又は最小値などの算出や、スペクトル密度の算出などであり、更にこれらを組合せて、例えば単位時間ごとにスペクトル密度を算出し、複数のスペクトル密度データを用いて周波数毎に平均値を算出しても良い。また、移動平均値などによる平滑化を行ってもよい。 A parameter setting function 601 displays a screen for setting parameters necessary for learning and monitoring, and reads input information. The data preprocessing function 602 performs preprocessing on the measurement data received from the data storage device 101 and records the preprocessed data 608 . Preprocessing is, for example, calculation of average value, median value, mode value, effective value, maximum value, or minimum value per predetermined unit time, calculation of spectral density, etc. Further, by combining these, For example, spectral density may be calculated for each unit time, and an average value may be calculated for each frequency using a plurality of spectral density data. Also, smoothing may be performed using a moving average value or the like.

モデル生成機能６０３は、前処理後のデータを入力とするオートエンコーダを生成して分析モデル６１１に記録する。オートエンコーダは、入力データから要素数を削減した中間データを生成し、中間データから入力データと同じ要素数の復元データを生成して出力する。なお、オートエンコーダと異なるタイプの、入力データと同じ要素数の復元データを生成する分析モデルを使用してもよい。 A model generation function 603 generates an autoencoder that receives preprocessed data as an input and records it in an analysis model 611 . An autoencoder generates intermediate data with the number of elements reduced from input data, and generates and outputs restored data with the same number of elements as the input data from the intermediate data. Note that an analysis model that generates restored data with the same number of elements as the input data, which is of a type different from the autoencoder, may be used.

モデル演算機能６０４は、前処理後のデータをモデル生成機能６０３によって生成されたオートエンコーダに入力してオートエンコード後のデータを出力する。分布算出機能６０５は、学習データと、モデル演算機能６０４によって出力されたオートエンコード後のデータとの比較を基に、オートエンコード前後の差分の分布を要素毎に生成し、差分分布データ６０９に記録する。 The model computation function 604 inputs the preprocessed data to the autoencoder generated by the model generation function 603 and outputs the autoencoded data. A distribution calculation function 605 generates a difference distribution before and after auto-encoding for each element based on comparison between the learning data and the data after auto-encoding output by the model calculation function 604, and records it in the difference distribution data 609. do.

異常確率算出機能６０６は、前処理後の監視対象データと、該監視対象データをオートエンコーダに入力して得られる出力データとの比較を基に、オートエンコード前後の差分を要素毎に算出し、前述のオートエンコード前後の差分の分布との比較により、異常発生確率を算出し、分析結果データ６１０に記録する。結果表示機能６０７は、算出した異常発生確率を入出力装置１０４、又は出力装置３０５に表示する。 The abnormality probability calculation function 606 calculates the difference between before and after autoencoding for each element based on comparison between the preprocessed monitoring target data and the output data obtained by inputting the monitoring target data to the autoencoder, By comparing with the distribution of the difference before and after the auto-encoding described above, the probability of abnormality occurrence is calculated and recorded in the analysis result data 610 . A result display function 607 displays the calculated abnormality occurrence probability on the input/output device 104 or the output device 305 .

図７に、前処理後データ６０８の一例を示す。前処理後データ６０８はセンサ識別子７０１、測定日時７０２、及び前処理後の値７０３を含む。前処理後の値は、１～ｎの要素ＩＤと対応して記録される。要素ＩＤは、例えば周波数や、機器の動作角度、又は基点からの経過時間又は距離などに対応する。前処理後データの値７０３（Ｄｉ＿１, Ｄｉ＿２, …Ｄｉ＿ｎ）が入力データの１エントリに相当する。例えばスペクトル密度を算出するケースでは、所定の単位時間毎に、各周波数に対応するスペクトル密度が記録される。 FIG. 7 shows an example of post-preprocessing data 608 . The preprocessed data 608 includes a sensor identifier 701 , a measurement date and time 702 , and a preprocessed value 703 . The values after preprocessing are recorded corresponding to element IDs from 1 to n. The element ID corresponds to, for example, the frequency, the operating angle of the device, or the elapsed time or distance from the base point. The preprocessed data value 703 (Di_1, Di_2, . . . Di_n) corresponds to one entry of the input data. For example, in the case of calculating the spectral density, the spectral density corresponding to each frequency is recorded for each predetermined unit time.

測定日時７０２は、例えば単位時間の最初の時刻である。また測定の際に、基点からの距離や経過時間、又は回転機器の動作角度などが測定値と合わせて記録されるケースでは、機器の動作一回毎に、距離、経過時間、又は動作角度などの軸の所定の値に対応する測定データを７０３に記録してもよい。この場合、例えば動作開始日時を測定日時として記録する。動作回毎のデータの抽出や所定の値に対応する測定データの算出は、データ前処理機能６０２が行う。 The measurement date and time 702 is, for example, the first time of the unit time. In addition, when measuring the distance from the reference point, the elapsed time, or the operating angle of the rotating equipment, etc., are recorded together with the measured values, the distance, elapsed time, or operating angle, etc., will be recorded for each operation of the equipment. Measured data corresponding to predetermined values of the axis of may be recorded in 703 . In this case, for example, the operation start date and time is recorded as the measurement date and time. The data preprocessing function 602 extracts data for each operation and calculates measurement data corresponding to a predetermined value.

図８に、差分分布データ６０９の一例を示す。差分分布データ６０９は、センサ識別子８０１、標準偏差８０２、及び９５％値８０３を含む。標準偏差８０２、及び９５％値８０３は、入力データの要素毎に記録される。また、９５％値８０３以外にも、平均値や９０％値などが記録されても構わない。分析モデル毎に１エントリの記録となる。図８は、例えば、センサ毎に一つの分析モデルを生成する例であり、センサ毎に１エントリの記録となっている。 FIG. 8 shows an example of the difference distribution data 609. As shown in FIG. Difference distribution data 609 includes sensor identifier 801 , standard deviation 802 and 95% value 803 . A standard deviation 802 and a 95% value 803 are recorded for each element of the input data. Besides the 95% value 803, an average value, a 90% value, etc. may be recorded. One entry is recorded for each analysis model. FIG. 8 shows, for example, an example in which one analysis model is generated for each sensor, and one entry is recorded for each sensor.

図９に、分析結果データ６１０の一例を示す。分析結果データ６１０は、センサ識別子９０１、日時９０２、異常確率９０３、差の標準化９０４を含む。 FIG. 9 shows an example of analysis result data 610. As shown in FIG. Analysis result data 610 includes sensor identifier 901 , date and time 902 , abnormality probability 903 , and difference standardization 904 .

図１０に、学習シーケンスの一例を示す。なお、センサ１０３とデータ格納装置１０１間、及びデータ格納装置１０１とデータ分析装置１０２間の通信は、通信ネットワーク１０６を介して行われるが、図１０では簡単のため通信ネットワーク１０６は省略する。 FIG. 10 shows an example of a learning sequence. Communication between the sensor 103 and the data storage device 101 and communication between the data storage device 101 and the data analysis device 102 are performed via the communication network 106, but the communication network 106 is omitted in FIG. 10 for simplicity.

センサ１０３は、予め設定された間隔で分析対象１０５について測定し（Ｓ１００１）、測定結果（例えば、センサ識別子、測定日時、及び測定値を含む）をデータ格納装置１０１へ送信する（Ｓ１００２）。データ格納装置１０１のデータ記録機能４０１は、センサ１０３から受信したデータを測定データ４０３の一部として記録する（Ｓ１００３）。センサ１０３による測定とデータ記録機能４０１による測定データの記録とは、センサ１０３とデータ記録機能４０１により、随時実行される。 The sensor 103 measures the analysis target 105 at preset intervals (S1001), and transmits the measurement results (including, for example, the sensor identifier, the date and time of measurement, and the measured value) to the data storage device 101 (S1002). The data recording function 401 of the data storage device 101 records the data received from the sensor 103 as part of the measurement data 403 (S1003). Measurement by the sensor 103 and recording of measurement data by the data recording function 401 are performed by the sensor 103 and the data recording function 401 as needed.

一方、データ分析装置１０２のパラメータ設定機能６０１は、パラメータ設定画面を出力装置に表示させる（Ｓ１００４）。状態監視システム１を利用するユーザは、入力装置３０４からパラメータ設定画面に、学習に使用する情報を入力する。パラメータ設定機能６０１は、入力装置３０４から入力された学習指示を受け付け（Ｓ１００５）、指示された学習期間のデータをデータ格納装置１０１に要求する（Ｓ１００６）。 On the other hand, the parameter setting function 601 of the data analysis device 102 displays the parameter setting screen on the output device (S1004). A user who uses the condition monitoring system 1 inputs information to be used for learning from the input device 304 to the parameter setting screen. The parameter setting function 601 accepts a learning instruction input from the input device 304 (S1005), and requests data of the instructed learning period from the data storage device 101 (S1006).

データ格納装置１０１のデータ読み出し機能４０２は、データ要求を受け付けると測定データ４０３から該当データを抽出し（Ｓ１００７）、データ分析装置１０２へ送信する（Ｓ１００８）。データ分析装置１０２は、データ格納装置１０１から学習データを受信すると、学習処理を実行する（Ｓ１００９）。 When the data request is received, the data reading function 402 of the data storage device 101 extracts the relevant data from the measurement data 403 (S1007) and transmits it to the data analysis device 102 (S1008). Upon receiving the learning data from the data storage device 101, the data analysis device 102 executes learning processing (S1009).

図１１は、学習処理Ｓ１００９のフローの一例を示す。データ前処理機能６０２は、データ格納装置１０１から受信した学習データに所定のデータ前処理を行い、前処理後データ６０８に記録する（Ｓ１１０２）。ここで、所定のデータ前処理は、例えば、データの平滑化、代表値算出、又はフーリエ展開などである。 FIG. 11 shows an example of the flow of the learning process S1009. The data preprocessing function 602 performs predetermined data preprocessing on the learning data received from the data storage device 101, and records it in preprocessed data 608 (S1102). Here, the predetermined data preprocessing is, for example, data smoothing, representative value calculation, Fourier expansion, or the like.

モデル生成機能６０３は、前処理後の学習データをモデル生成用データと差分分布生成用データに分割する（Ｓ１１０３）。本分割は、例えば、モデル生成用データをランダムに抽出し、残りを差分分布生成用データとする。ランダムに抽出するのが差分分布生成用データであり、残りをモデル生成用データとしてもよい。また、分割の比は、１対１を想定するが、オートエンコーダの学習、及び差分の分布生成が十分に行える範囲であれば、１対１でなくても構わない。 The model generation function 603 divides the preprocessed learning data into model generation data and difference distribution generation data (S1103). For this division, for example, data for model generation is randomly extracted, and the remainder is used as difference distribution generation data. The difference distribution generation data may be randomly extracted, and the rest may be model generation data. Also, although the division ratio is assumed to be 1:1, it does not have to be 1:1 as long as the autoencoder can learn and generate a difference distribution sufficiently.

次に、モデル生成機能６０３は、モデル生成用データを学習することによりオートエンコーダを生成し、生成したオートエンコーダを分析モデル６１１に記録する（Ｓ１１０４）。モデル演算機能６０４は、差分分布生成用データを分析モデル６１１に記録されたオートエンコーダに入力し、出力データを得る（Ｓ１１０５）。 Next, the model generation function 603 generates an autoencoder by learning model generation data, and records the generated autoencoder in the analysis model 611 (S1104). The model calculation function 604 inputs the difference distribution generation data to the autoencoder recorded in the analysis model 611 to obtain output data (S1105).

分布算出機能６０５は、後述する差分分布算出処理により、前処理後データの要素毎に差分の分布を算出し（Ｓ１１０６）、標準偏差、及び９５％値を差分分布データ６０９に記録する（Ｓ１１０７）。なお、９５％値は一つの例であり、例えば、９０％値や８０％値であっても構わないし、これらを併記しても構わない。 The distribution calculation function 605 calculates the difference distribution for each element of the preprocessed data by the difference distribution calculation process described later (S1106), and records the standard deviation and 95% value in the difference distribution data 609 (S1107). . Note that the 95% value is just one example, and for example, the 90% value and the 80% value may be used, and these values may be written together.

図１２は、差分分布算出処理Ｓ１１０６のイメージを示す図である。分布算出機能６０５は、差分分布算出用の各データ１２０１と、対応するオートエンコード後の出力データ１２０２の差分の絶対値を要素毎に算出する（１２０３）。ここで、「Ｄｉｆｆ（ｉ）ｍ」を、ｍ番目の差分分布生成用データと、対応する出力データとの、要素ｉの差の絶対値とする。 FIG. 12 is a diagram showing an image of the difference distribution calculation processing S1106. The distribution calculation function 605 calculates the absolute value of the difference between each data 1201 for differential distribution calculation and the corresponding auto-encoded output data 1202 for each element (1203). Here, let “Diff(i)m” be the absolute value of the difference of the element i between the m-th difference distribution generation data and the corresponding output data.

分布算出機能６０５は、各ｉに対して、全てのｍに対する「Ｄｉｆｆ(ｉ)ｍ」を収集し、標準偏差（σｉ）、及び９５％値（Ｖｉ）を算出し、差分分布データ６０９に記録する（１２０４）。なお、９５％値は一つの例であり、例えば９０％値や、８５％値でも構わないし、複数の値を算出して記録しても構わない。なお、本例では要素毎の差として絶対値を用いたが、例えば、差の二乗値を絶対値の代わりに用いても構わない。 Distribution calculation function 605 collects "Diff(i)m" for all m for each i, calculates standard deviation (σi) and 95% value (Vi), and records in difference distribution data 609 (1204). Note that the 95% value is just one example, and for example, a 90% value or an 85% value may be used, or a plurality of values may be calculated and recorded. Although the absolute value is used as the difference for each element in this example, for example, the square value of the difference may be used instead of the absolute value.

図１３を用いて分析シーケンスを説明する。図１３では、定期的に（即ち、所定の時間間隔で）分析を実施する例を示す。センサ１０３は所定の間隔で測定を実施し（Ｓ１３０１）、データ格納装置１０１に測定データを送信する（Ｓ１３０２）。データ格納装置１０１のデータ記録機能４０１は受信した測定データを測定データ４０３に記録する（Ｓ１３０３）。本処理は学習時の測定（Ｓ１００１）、及び測定データ記録（Ｓ１００３）と同じであり、随時実行される。 An analysis sequence will be described with reference to FIG. FIG. 13 shows an example in which analysis is performed periodically (that is, at predetermined time intervals). The sensor 103 performs measurements at predetermined intervals (S1301) and transmits measurement data to the data storage device 101 (S1302). The data recording function 401 of the data storage device 101 records the received measurement data in the measurement data 403 (S1303). This processing is the same as the measurement during learning (S1001) and the measurement data recording (S1003), and is executed as needed.

データ分析装置１０２の異常確率算出機能６０６は、定期的に（即ち、所定の時間間隔で）分析を開始し（Ｓ１３０４）、データ格納装置１０１に分析対象期間のデータを要求する（Ｓ１３０５）。分析対象期間とは、例えば前回の分析日時から現在の日時までである。 The abnormality probability calculation function 606 of the data analysis device 102 periodically (that is, at predetermined time intervals) starts analysis (S1304), and requests data of the analysis target period from the data storage device 101 (S1305). The analysis target period is, for example, from the previous analysis date and time to the current date and time.

データ格納装置１０１のデータ読み出し機能４０２は、受信したデータ要求に従い、該当するデータを測定データ４０３から抽出し（Ｓ１３０６）、データ分析装置１０２に送信する（Ｓ１３０７）。異常確率算出機能６０６は、データ格納装置１０１から分析対象期間のデータを受信すると、後述する分析処理を実行し（Ｓ１３０８）、分析結果を入出力装置１０４、又は出力装置３０５で表示する（Ｓ１３０９）。 The data reading function 402 of the data storage device 101 extracts the relevant data from the measurement data 403 according to the received data request (S1306) and transmits it to the data analysis device 102 (S1307). The abnormality probability calculation function 606, upon receiving the data of the analysis target period from the data storage device 101, executes the analysis processing described later (S1308), and displays the analysis result on the input/output device 104 or the output device 305 (S1309). .

図１４に分析処理Ｓ１３０８のフローの一例を示す。データ前処理機能６０２は、取得したデータに所定の前処理を実施する（Ｓ１４０２）。ここで所定の前処理とは、図１１に示す学習時の処理フローにおける前処理と同様の処理である。モデル演算機能６０４は、前記前処理後のデータを分析モデル６１１に記録されたオートエンコーダに入力して、出力データを得る（Ｓ１４０３）。 FIG. 14 shows an example of the flow of analysis processing S1308. The data preprocessing function 602 performs predetermined preprocessing on the acquired data (S1402). Here, the predetermined preprocessing is the same processing as the preprocessing in the processing flow during learning shown in FIG. 11 . The model calculation function 604 inputs the preprocessed data to the autoencoder recorded in the analysis model 611 to obtain output data (S1403).

異常確率算出機能６０６は、前記前処理後のデータと、オートエンコード後の出力データを比較して、任意の要素i（０＜ｉ＜＝ｎ）に対して差分Ｄｉｆｆ（i）を算出する（Ｓ１４０４）。次に異常確率算出機能６０６は、要素毎に算出した差分Ｄｉｆｆ（ｉ）を下記数式１により標準化する（Ｓ１４０５）。
Ｄｓ＿ｉ＝ｍａｘ（Ｄｉｆｆ（i）―Ｖｉ,０）／σｉ／Ｂ＊１００（数式１）
ここで、σｉ及びＶｉは、それぞれ差分分布データ６０９に記録した差分の標準偏差、及び９５％値である。また、Ｂはパラメータ設定時に設定する閾値である。 The abnormality probability calculation function 606 compares the preprocessed data with the auto-encoded output data, and calculates a difference Diff(i) for an arbitrary element i (0<i<=n) ( S1404). Next, the abnormality probability calculation function 606 standardizes the difference Diff(i) calculated for each element using the following formula 1 (S1405).
Ds_i=max(Diff(i)-Vi,0)/σi/B*100 (Equation 1)
Here, σi and Vi are the standard deviation and 95% value of the difference recorded in the difference distribution data 609, respectively. Also, B is a threshold value set when parameters are set.

次に異常確率算出機能６０６は異常確率Ｏ＿ｉを、下記数式２により算出する（Ｓ１４０６）。
Ｏ＿ｉ＝ｍｉｎ（ＤＳ＿ｉ，１００）（数式２） Next, the abnormality probability calculation function 606 calculates the abnormality probability O_i by the following Equation 2 (S1406).
O_i=min(DS_i, 100) (Equation 2)

次に、異常確率算出機能６０６は、分析結果（即ち、差分の標準化値（Ｄｓ＿i）、及び異常発生確率（Ｏ＿ｉ））を分析結果データ６１０に記録する（Ｓ１４０７）。そして、異常確率とアラート閾値を比較し（Ｓ１４０８）、異常確率が所定のアラート閾値よりも大きければ（Ｓ１４０８にてＹＥＳ）アラートをあげる（Ｓ１４０９）。 Next, the abnormality probability calculation function 606 records the analysis result (that is, the standardized value of the difference (Ds_i) and the abnormality occurrence probability (O_i)) in the analysis result data 610 (S1407). Then, the abnormality probability and the alert threshold are compared (S1408), and if the abnormality probability is greater than a predetermined alert threshold (YES in S1408), an alert is issued (S1409).

なお、図１３に示す分析シーケンスは、所定の間隔で定期的に分析処理を行う例を示したが、分析開始は、状態監視システム１のユーザにより、明示的に指示されても構わない。データ分析装置は、分析開始指示を受け付けると、分析を開始する。 Although the analysis sequence shown in FIG. 13 shows an example in which analysis processing is periodically performed at predetermined intervals, the user of the state monitoring system 1 may explicitly instruct the start of analysis. The data analysis device starts analysis upon receiving the analysis start instruction.

図１５に、差分Ｄｉｆｆ（ｉ）と、標準化された差分（Ｄｓ＿ｉ）、及び異常確率（Ｏ＿ｉ）の関係を示す。差分Ｄｉｆｆ（i）が平常時の９５％値（Ｖｉ）以下であれば正常範囲に収まっており、異常確率は０％であると考える。そして、差分Ｄｉｆｆ（i）が平常時の９５％値（Ｖｉ）を超えて上昇すると異常確率も線形に上昇する。また、差分Ｄｉｆｆ（i）が平常時の９５％値（Ｖｉ）に標準偏差（σｉ）のＢ倍を加えた値以上になると（即ち、差分Ｄｉｆｆ（ｉ）＞＝Ｖｉ＋σｉ＊Ｂ）、異常確率（Ｏ＿ｉ）が１００％であると考える。なお、標準化した差分（Ｄｓ＿ｉ）は、Ｄｉｆｆ（ｉ）＞＝Ｖｉ＋σｉ＊Ｂの領域でも線形に上昇する。 FIG. 15 shows the relationship between the difference Diff(i), the standardized difference (Ds_i), and the abnormality probability (O_i). If the difference Diff(i) is less than the 95% value (Vi) of the normal time, it is considered to be within the normal range and the probability of abnormality is 0%. Then, when the difference Diff(i) rises above the 95% value (Vi) of the normal time, the abnormality probability also rises linearly. Also, when the difference Diff(i) is equal to or greater than the value obtained by adding B times the standard deviation (σi) to the normal 95% value (Vi) (that is, the difference Diff(i)>=Vi+σi*B), the probability of abnormality Consider (O_i) to be 100%. Note that the standardized difference (Ds_i) increases linearly even in the region of Diff(i)>=Vi+σi*B.

図１６はパラメータ設定機能が表示するパラメータ設定画面の一例である。パラメータ設定画面１６００は、学習データ１６０１、データ前処理１６０２、異常判定パラメータ１６０３、決定ボタン１６０４、及びキャンセルボタン１６０５などを表示する。状態監視システム１のユーザは、入力装置３０４を用いて各種パラメータを入力する。 FIG. 16 is an example of a parameter setting screen displayed by the parameter setting function. The parameter setting screen 1600 displays learning data 1601, data preprocessing 1602, abnormality determination parameters 1603, an OK button 1604, a cancel button 1605, and the like. A user of the condition monitoring system 1 uses the input device 304 to input various parameters.

ユーザは、学習データ１６０１では、分析対象とするセンサ１６０６、及び学習対象期間１６０７を指定する。ユーザは、データ前処理１６０２では、データの前処理方法１６０８、及び前処理に必要な各種のパラメータ（図１６の例では単位時間１６０９）を指定する。データ前処理１６０２は、前処理方法１６０８への入力を読み込んだのちに、必要に応じて付属する一つ以上のパラメータ１６０９（図１６の例では単位時間）を表示してもよい。また、前処理は実施しなくても構わない。 The user designates a sensor 1606 to be analyzed and a learning target period 1607 in the learning data 1601 . In data preprocessing 1602, the user designates a data preprocessing method 1608 and various parameters required for preprocessing (unit time 1609 in the example of FIG. 16). After data preprocessing 1602 reads the input to preprocessing method 1608, it may optionally display one or more accompanying parameters 1609 (units of time in the example of FIG. 16). Also, the pretreatment may not be performed.

ユーザは、異常判定パラメータ１６０３では、正常範囲１６１０、閾値１６１１、及びアラート閾値１６１２を入力する。ユーザは、正常範囲１６１０には、差分の分布生成時、及び異常確率算出時に使用した差分の基準となる値を入力する。本実施例では、正常範囲を９５％として説明したが、正常範囲は他の値であっても構わない。例えば、ユーザは、９０％値、８５％値、平均値などを指定し、差分分布算出処理は、対応する値を算出して、これを差分分布データ６０９に記録しても構わない。閾値１６１１は、異常確率算出時の数式１で使う閾値Ｂである。アラート閾値１６１２は、分析処理におけるアラート要否を判定するための閾値である。 The user inputs a normal range 1610 , a threshold 1611 , and an alert threshold 1612 in the abnormality determination parameters 1603 . In the normal range 1610, the user inputs a difference reference value used when generating the difference distribution and when calculating the abnormality probability. Although the normal range is set to 95% in this embodiment, the normal range may be any other value. For example, the user may specify a 90% value, an 85% value, an average value, etc., and the difference distribution calculation process may calculate the corresponding values and record them in the difference distribution data 609 . A threshold 1611 is the threshold B used in Equation 1 when calculating the probability of abnormality. The alert threshold 1612 is a threshold for determining whether an alert is required in analysis processing.

なお、パラメータ設定画面は、図１６に示す例よりも多くの項目を表示してもよいし、少ない項目を表示してもよい。また、データ前処理１６０２、及び異常判定パラメータ１６０３はデフォルト値が設定されていても構わない。 The parameter setting screen may display more items than the example shown in FIG. 16, or may display fewer items. Default values may be set for the data preprocessing 1602 and the abnormality determination parameter 1603 .

図１７は結果表示機能６０７が表示する分析結果表示画面の一例である。分析結果表示画面１７００は、例えばパラメータ１７０１、決定ボタン１７０６、キャンセルボタン１７０７、及び異常確率１７０８を表示する。 FIG. 17 shows an example of an analysis result display screen displayed by the result display function 607. As shown in FIG. The analysis result display screen 1700 displays parameters 1701, a decision button 1706, a cancel button 1707, and an abnormality probability 1708, for example.

パラメータ１７０１は、分析に使用したセンサ１７０２、正常範囲１７０３、閾値１７０４、及び分析結果の画面への表示期間１７０５を表示する。異常確率１７０８は、測定日時と要素ＩＤの組に対応する異常確率を、異常確率と色を対応付けて表示する。ユーザは、色相の違いや濃淡（即ち明度の違い）等により、異常確率を知ることが出来る。要素ＩＤは、例えば周波数、基点からの時間や距離、又は装置の動作角度などである。また、異常確率が高い時には、明示的な警告を表示しても構わない。 The parameters 1701 display the sensor 1702 used for analysis, the normal range 1703, the threshold 1704, and the display period 1705 of the analysis result on the screen. The abnormality probability 1708 displays the abnormality probability corresponding to the combination of the measurement date and time and the element ID by associating the abnormality probability with the color. The user can know the abnormality probability from the difference in hue and shade (that is, the difference in lightness). The element ID is, for example, frequency, time or distance from the base point, or operating angle of the device. Also, when the probability of abnormality is high, an explicit warning may be displayed.

また、パラメータ１７０１に示すパラメータは、再設定できるようになっていても構わない。再設定が可能なケースにおいては、これらのパラメータが入力されて決定ボタン１７０６が押されると、分布算出機能６０５は必要に応じて入力された正常範囲に対応する値（Ｖｉ）を再計算する。また、異常確率算出機能６０６も数式１及び２の再計算を行う。結果表示機能６０７は、新たに算出された異常確率を用いて異常確率１７０８を更新する。これにより異常検知のセンシビティを調整することが可能になる。 Also, the parameters shown in the parameter 1701 may be resettable. In cases where resetting is possible, when these parameters are entered and the decision button 1706 is pressed, the distribution calculation function 605 recalculates the values (Vi) corresponding to the entered normal ranges as necessary. Further, the abnormality probability calculation function 606 also recalculates Equations 1 and 2. The result display function 607 updates the abnormality probability 1708 using the newly calculated abnormality probability. This makes it possible to adjust the sensitivity of anomaly detection.

なお、分析結果表示画面は、図１７に示す例よりも多くの項目を表示してもよいし、少ない項目を表示してもよい。 The analysis result display screen may display more items than the example shown in FIG. 17, or may display fewer items.

本実施例では、センサ１０３により測定されたデータは、データ格納装置１０１に格納された後、学習処理、又は分析処理の開始時に、データ分析装置１０２によりデータ格納装置１０１から読み出される。これに代えて、例えば、センサ１０３により測定されたデータは外部記憶メディア（不図示）に格納されてもよい。データ分析装置１０２は、外部記憶メディアから測定データを読み出して、学習、及び分析を実施することもできる。また、測定データはセンサ１０３からデータ分析装置１０２へ直接送信されてもよい。 In this embodiment, data measured by the sensor 103 is stored in the data storage device 101 and then read out from the data storage device 101 by the data analysis device 102 at the start of learning processing or analysis processing. Alternatively, for example, data measured by sensor 103 may be stored in an external storage medium (not shown). The data analysis device 102 can also read measurement data from an external storage medium and perform learning and analysis. Measured data may also be transmitted directly from the sensor 103 to the data analysis device 102 .

また、本監視システムは、必ずしもデータ前処理機能６０２を備える必要はなく、前処理済みのデータがデータ分析装置１０２に入力されてもよい。例えば、複雑な前処理を要するデータにおいては、図示しない計算機において予め前処理を行った上で本状態監視システム１に入力することにより、本状態監視システム１を利用してもよい。 Also, the monitoring system does not necessarily have the data preprocessing function 602 , and preprocessed data may be input to the data analysis device 102 . For example, data that requires complicated preprocessing may be preprocessed in a computer (not shown) before being input to the state monitoring system 1 so that the state monitoring system 1 can be used.

また、本実施例において各データを示す一例としてテーブル表記を用いたが、これは各データの記録方法をテーブル型に制限するものではなく、データの記録は、リスト、又はチェーン等、様々な方法を用いてもよい。また、記録される要素は、例えば数字、記号、又は数式等、様々な形で表現されてもよい。 Also, in this embodiment, table notation is used as an example of indicating each data, but this does not limit the recording method of each data to the table type, and data can be recorded in various ways such as list or chain. may be used. Also, the recorded elements may be represented in various forms, such as numbers, symbols, or mathematical expressions.

また、本実施例において各データは単一のセンサにより収集された例を示したが、例えば、入力データの各要素が二つ以上のセンサによって得られたものでも構わないし、要素ＩＤがセンサＩＤに対応しても構わない。所定の順序で並んだ要素で構成される入力データについて学習、及び分析を行うことにより、センサ毎の測定値の振れ幅に応じた異常検知が可能になる。 Also, in this embodiment, each data is collected by a single sensor, but for example, each element of the input data may be obtained by two or more sensors. may correspond to By learning and analyzing input data composed of elements arranged in a predetermined order, it is possible to detect anomalies according to the amplitude of measurement values for each sensor.

また、差の標準化９０４を可視化して表示しても構わない。異常確率が１００％を超える要素ＩＤの中から、Ｄｓ＿ｉが特に大きい要素を検出し、故障個所の特定に用いることも可能である。また、要素の異常確率９０３又は差の標準化９０４と、異常の種類や故障個所を結びつけて記録しておけば、異常確率９０３又は差の標準化９０４が高い要素を特定した後に、異常の種類や故障の箇所を推定することが可能になる。また、数式１に示す閾値Ｂは、要素毎に設定しても構わない。 Also, the normalization of the difference 904 may be visualized and displayed. It is also possible to detect an element with a particularly large Ds_i from among the element IDs whose abnormality probability exceeds 100%, and use it to identify the failure location. Further, if the abnormality probability 903 or difference standardization 904 of the element and the abnormality type or failure location are linked and recorded, after identifying the element with the abnormality probability 903 or the difference standardization 904 high, the abnormality type or failure can be estimated. Also, the threshold value B shown in Equation 1 may be set for each element.

このように構成される本実施例によれば、入力データの個々の要素の振れ幅が異なるケースにおいても、状態監視システム１は個々の要素の振れ幅に対応して異常を検知することが可能な状態監視の実施が可能になる。 According to this embodiment configured in this way, even in cases where the amplitudes of the individual elements of the input data differ, the condition monitoring system 1 can detect abnormalities corresponding to the amplitudes of the individual elements. state monitoring can be performed.

また、本方法では学習データを分割し、一方の学習データを用いてオートエンコーダを生成し、もう一方の学習データを用いてオートエンコーダの入出力データの要素毎の差の分布を算出し、この分布を用いて監視対象データの異常判定を行う。このため、オートエンコーダに要求される復元の精度は必ずしも高くなく、学習データの数が少ないときでも分析モデルの生成が可能になる。 In addition, in this method, the learning data is divided, one of the learning data is used to generate an autoencoder, and the other learning data is used to calculate the distribution of the difference for each element of the input/output data of the autoencoder. Abnormality determination of monitoring target data is performed using the distribution. For this reason, the restoration accuracy required for the autoencoder is not necessarily high, and an analysis model can be generated even when the number of learning data is small.

本実施例では、データ分析装置をエッジとクラウドにそれぞれ配し、エッジ側のデータ分析装置で分析を行う例を示す。図１８は、本実施例におけるシステム構成の概略を示す図である。エッジにエッジ側データ分析装置１８０１が配置され、クラウドにクラウド側データ分析装置１８０２が配置されている。エッジ側データ分析装置１８０１とクラウド側データ分析装置１８０２は、ネットワーク１０６を介して通信可能である。 In this embodiment, an example will be shown in which data analysis devices are arranged on the edge and the cloud, respectively, and analysis is performed by the data analysis device on the edge side. FIG. 18 is a diagram showing an outline of the system configuration in this embodiment. An edge-side data analysis device 1801 is arranged on the edge, and a cloud-side data analysis device 1802 is arranged on the cloud. The edge side data analysis device 1801 and the cloud side data analysis device 1802 can communicate via the network 106 .

エッジ側データ分析装置１８０１は状態監視システム１のデータ格納装置１０１が備える機能及びデータと、データ分析装置１０２が備える機能及びデータの一部又は全てを備える。センサ１０３、及びエッジ側データ分析装置１８０１は、例えば同一の建物内や、同一企業のネットワーク内など、物理的に、又はネットワーク的に近いエリアに設置され、ネットワーク１８０３を介して接続される。なお、学習シーケンスは実施例１と同様であり、分析の前に行われる。 The edge-side data analysis device 1801 has some or all of the functions and data provided by the data storage device 101 of the state monitoring system 1 and the functions and data provided by the data analysis device 102 . The sensor 103 and the edge-side data analysis device 1801 are installed in close physical or network areas, such as in the same building or in the same company's network, and are connected via a network 1803 . Note that the learning sequence is the same as in Example 1 and is performed before analysis.

図１９に本構成における分析シーケンスを示す。クラウド側データ分析装置１８０２は、学習済みの分析モデル６１１（即ちオートエンコーダ）、及び差分分布データ６０９をエッジ側データ分析装置１８０１に送信する（Ｓ１９０１）。エッジ側データ分析装置１８０１は、受信したオートエンコーダ及び差分分布データを、それぞれ分析モデル６１１、及び差分分布データ６０９に記録する（Ｓ１９０２）。 FIG. 19 shows an analysis sequence in this configuration. The cloud-side data analysis device 1802 transmits the learned analysis model 611 (that is, autoencoder) and difference distribution data 609 to the edge-side data analysis device 1801 (S1901). The edge-side data analysis device 1801 records the received autoencoder and difference distribution data in the analysis model 611 and the difference distribution data 609, respectively (S1902).

センサ１０３は、所定の間隔でデータを測定し（Ｓ１９０３）、測定データをエッジ側データ分析装置１８０１へ送信する（Ｓ１９０４）。エッジ側データ分析装置１８０１のデータ記録機能４０１は受信した測定データを測定データ４０３に記録する（Ｓ１９０５）。次に、エッジ側データ分析装置１８０１の異常確率算出機能６０６は実施例１と同様の分析処理を実行し（Ｓ１９０６）、エッジ側データ分析装置１８０１の結果表示機能６０７が結果を表示する（Ｓ１９０７）。 The sensor 103 measures data at predetermined intervals (S1903), and transmits the measured data to the edge side data analysis device 1801 (S1904). The data recording function 401 of the edge side data analysis device 1801 records the received measurement data in the measurement data 403 (S1905). Next, the abnormality probability calculation function 606 of the edge side data analysis device 1801 executes analysis processing similar to that of the first embodiment (S1906), and the result display function 607 of the edge side data analysis device 1801 displays the result (S1907). .

本実施例によれば、エッジ側データ分析装置１８０１によるリアルタイムな分析が可能になり、異常発生から検知までの時間が削減できる。また、データ分析装置１０２をエッジ側に備えることにより、監視対象データを外部に送ることなく分析を行うことが可能になる。また、ネットワーク１０６へのデータの流入量を削減することも可能である。また、オートエンコーダの生成はクラウド側データ分析装置１８０２で行うため、エッジ側データ分析装置１８０１はＤｅｅｐＬｅａｒｎｉｎｇの実行に必要な高い計算能力を有することなくエッジ側でのリアルタイムな分析が可能になる。 According to this embodiment, real-time analysis by the edge-side data analysis device 1801 becomes possible, and the time from the occurrence of an abnormality to detection can be reduced. Also, by providing the data analysis device 102 on the edge side, it becomes possible to analyze the monitored data without sending it to the outside. It is also possible to reduce the amount of data flowing into the network 106 . In addition, since the cloud-side data analysis device 1802 generates autoencoders, the edge-side data analysis device 1801 can perform real-time analysis on the edge side without having the high computing power required to execute Deep Learning.

なお、本状態監視方法はオンプレミス型の運用を行っても構わない。オンプレミス型の運用では、エッジ側のデータ分析装置１８０１で学習処理（即ち分析モデル生成、及び差分の分布算出）を行う。これにより、学習用のデータを含め、測定データを外部に出すことなく学習及び分析処理を行うことが可能になる。 Note that this state monitoring method may be operated on-premises. In the on-premise type operation, the data analysis device 1801 on the edge side performs learning processing (that is, analysis model generation and difference distribution calculation). This makes it possible to perform learning and analysis processing without exposing measurement data, including data for learning.

実施例１では、前処理後の入力データの要素数が固定された一次元配列で表されるケースについて、オートエンコーダを用いた分析例を示した。本実施例では、多様な入力データに対応可能な分析例を示す。 In Example 1, an analysis example using an autoencoder was shown for a case in which input data after preprocessing is represented by a one-dimensional array with a fixed number of elements. In this embodiment, an analysis example that can deal with various input data will be shown.

例えば入力データの各要素が（Ｘｉ，Ｙｉ）組で表され、Ｘｉが周波数、機器の回転角度、又は基点からの距離や時間であり、ＹｉがＸｉに紐づけられた測定値であるケースでは、全ての入力データで同じＸｉが存在するとは限らず、また全ての入力データの入力データ数が同じになるとも限らない。また、一つの入力データに同じＸｉが存在することもあり得る。 For example, each element of the input data is represented by a (Xi, Yi) pair, where Xi is the frequency, the rotation angle of the device, or the distance or time from the base point, and Yi is the measured value linked to Xi. , the same Xi does not necessarily exist in all the input data, and the number of input data in all the input data does not necessarily become the same. Also, the same Xi may exist in one piece of input data.

また、例えばＸｉの領域が広域であり、単位区間毎に代表値を算出して要素数を削減し、これを入力データとして活用するケースにおいて、Ｙｉの振れ幅が大きく、平均値、最大値、又は最小値などの一つの値では表現しきれない可能性も考えられる。そこで、本実施例では入力データを２次元配列データに変換することにより、様々なケースの入力データに対応可能な分析例を示す。 Also, for example, when the area of Xi is wide, the representative value is calculated for each unit interval to reduce the number of elements, and this is used as input data, the fluctuation range of Yi is large, the average value, the maximum value, Alternatively, there is a possibility that it cannot be expressed by a single value such as the minimum value. Therefore, in this embodiment, by converting the input data into two-dimensional array data, an analysis example that can deal with input data in various cases will be shown.

以下に、入力データを二次元配列データに変換する方法、及び二次元配列データに本方式を適用する際の分析方法を示す。なお、本実施例における説明は、実施例１と異なる箇所を中心に行う。 A method of converting input data into two-dimensional array data and an analysis method when applying this method to two-dimensional array data are described below. Note that the description of this embodiment will focus on the points that are different from those of the first embodiment.

データ分析装置１０２は、図示しない二次元配列化機能を有する。データ分析装置１０２は、図１１に示す前処理Ｓ１１０２、及び図１４に示す前処理Ｓ１４０２の後で、前処理後データの二次元配列化処理を行う The data analysis device 102 has a two-dimensional arraying function (not shown). After the preprocessing S1102 shown in FIG. 11 and the preprocessing S1402 shown in FIG. 14, the data analysis device 102 performs two-dimensional array processing of the preprocessed data.

図２０を用いて、入力データの二次元配列化処理を説明する。図７に示す前処理データの測定日時Ｔ１に着目し、前処理後の値（Ｄ１＿１，Ｄ１＿２，．．．，Ｄ１＿ｎ）を二次元配列化する例を示す。なお、要素ＩＤの１～ｎは、所定の数値Ｆ１～Ｆｎに対応し、前処理後の値は数値Ｆ１～Ｆｎに対応して記録されたものとする。 The two-dimensional array processing of input data will be described with reference to FIG. Focusing on the measurement date and time T1 of the preprocessed data shown in FIG. 7, an example of two-dimensionally arraying the values (D1_1, D1_2, . Element IDs 1 to n correspond to predetermined numerical values F1 to Fn, and values after preprocessing are recorded corresponding to numerical values F1 to Fn.

グラフ２００１は、測定日時Ｔ１のデータをグラフ化したものである。画像２００２は、横軸がＦ１～Ｆｎ、縦軸がｙ＿ｍｉｎ～ｙ＿ｍａｘの範囲であらわされたグラフ２００１を、ｎ×Ｙ＿ｈｉｇｈのサイズの画像にマッピングしたものである。画像２００２の各要素（ｉ，ｋ）は、任意のｉ（０＜ｉ＜＝ｎ）、及び任意のｋ（０＜ｋ＜＝Ｙ＿ｈｉｇｎ）について、下記の数式３及び４で表される。下式において、０は画像における黒、１は画像における白を示す。
（ｉ，ｋ）＝１（ｋ＝ｙｉ）（数式３）
（ｉ，ｋ）＝０（ｋ≠ｙｉ）（数式４） A graph 2001 is a graph of data on the measurement date and time T1. An image 2002 is obtained by mapping the graph 2001 represented by the range of F1 to Fn on the horizontal axis and the range of y_min to y_max on the vertical axis to an image of size n×Y_high. Each element (i, k) of the image 2002 is represented by Equations 3 and 4 below for arbitrary i (0<i<=n) and arbitrary k (0<k<=Y_hign). In the formula below, 0 indicates black in the image and 1 indicates white in the image.
(i, k)=1 (k=yi) (Formula 3)
(i, k)=0 (k≠yi) (Formula 4)

ここで、ｙｉは以下の数式５により定義する。
ｙｉ＝ｉｎｔ｛（Ｄ１＿ｉ－ｙ＿ｍｉｎ）／（ｙ＿ｍａｘ－ｙ＿ｍｉｎ）
＊Ｙ＿ｈｉｇｈ｝（数式５）
但し、ｙｉ＜１の時、ｙｉ＝１、また、ｙｉ＞Ｙ＿ｈｉｇｈの時、ｙｉ＝Ｙ＿ｈｉｇｈとする。 Here, yi is defined by Equation 5 below.
yi = int {(D1_i−y_min)/(y_max−y_min)
*Y_high} (Formula 5)
However, when yi<1, yi=1, and when yi>Y_high, yi=Y_high.

ここで、ｎは入力データの要素数であり、二次元配列データの一次元目のサイズである。また、Ｙ＿ｈｉｇｈは二次元配列データの二次元目のサイズであり、例えば、パラメータ設定時に設定する。また、ｉｎｔは、小数点以下の切り上げ、切り捨て、又は四捨五入などの所定の手段により、実数を整数に直す処理である。上記数式３～５により得られた各要素（ｉ，ｋ）が二次元配列データの各要素となる。 Here, n is the number of elements of the input data and the size of the first dimension of the two-dimensional array data. Y_high is the size of the second dimension of the two-dimensional array data, and is set, for example, when setting parameters. Also, int is a process of converting a real number into an integer by a predetermined means such as rounding up, rounding down, or rounding off after the decimal point. Each element (i, k) obtained by Equations 3 to 5 above becomes each element of the two-dimensional array data.

図２１に、二次元配列データのイメージを示す。二次元配列データは、データ分析装置１０２に記録される。なお、二次元配列データを入力及び出力とするオートエンコーダについては、出力層の活性化関数として、例えばシグモイド関数を用いることにより、オートエンコーダからの出力データの各要素を０～１で出力することができる。 FIG. 21 shows an image of two-dimensional array data. The two-dimensional array data are recorded in the data analysis device 102 . In addition, for an autoencoder that inputs and outputs two-dimensional array data, each element of the output data from the autoencoder is output as 0 to 1 by using, for example, a sigmoid function as the activation function of the output layer. can be done.

図２２は、学習処理フローにおける、二次元配列データに対する差分分布算出処理Ｓ１１０６のイメージを示す図である。分布算出機能６０５は、差分分布算出用の各データ２２０１と、対応するオートエンコード後の出力データ２２０２の差分の絶対値２２０３を要素毎（即ち、二次元配列データ２１０１及び２１０２のマス目毎）に算出する。 FIG. 22 is a diagram showing an image of difference distribution calculation processing S1106 for two-dimensional array data in the learning processing flow. The distribution calculation function 605 calculates the absolute value 2203 of the difference between each data 2201 for difference distribution calculation and the corresponding auto-encoded output data 2202 for each element (that is, for each square of the two-dimensional array data 2101 and 2102). calculate.

ここで、「Ｄｉｆｆ（ｉ，ｋ）ｍ」を、ｍ番目の差分分布生成用データと、対応する出力データとの、要素（ｉ，ｋ）の差の絶対値とする。次に、分布算出機能６０５は、任意のｉ及びｍについて、差分の列毎の合計値「Ｄｉｆｆ(ｉ)ｍ」２２０４を数式６により算出する。
Ｄｉｆｆ(ｉ)ｍ＝Σ_k=1 ^Y_highＤｉｆｆ（ｉ，ｋ）ｍ（数式６） Here, "Diff(i, k)m" is the absolute value of the difference in element (i, k) between the m-th difference distribution generation data and the corresponding output data. Next, the distribution calculation function 605 calculates the total value “Diff(i)m” 2204 for each column of differences for arbitrary i and m using Equation (6).
Diff(i)m=Σk ₌₁ ^Y_high Diff(i,k)m (Formula 6)

次に、分布算出機能６０５は各ｉに対して「Ｄｉｆｆ(ｉ)ｍ」の分布２２０５を求め、標準偏差（σｉ）、及び９５％値（Ｖｉ）を算出し、差分分布データ６０９に記録する。なお、９５％値は一つの例であり、例えば９０％値や、８５％値でも構わないし、複数の値を算出して記録しても構わない。 Next, the distribution calculation function 605 obtains the distribution 2205 of "Diff(i)m" for each i, calculates the standard deviation (σi) and the 95% value (Vi), and records them in the differential distribution data 609. . Note that the 95% value is just one example, and for example, a 90% value or an 85% value may be used, or a plurality of values may be calculated and recorded.

分析処理フローにおける処理Ｓ１４０４では、分析対象の二次元配列データと、オートエンコード後の出力データを比較して、任意のｉ（０＜ｉ＜＝ｎ）に対して（即ち、二次元配列の列毎に）、差分の合計値Ｄｉｆｆ（ｉ）を学習時と同様の方法、即ち数式６を用いて算出する。差分の標準化、及び異常確率は、実施例１と同様に数式１及び数式２により算出する。以上により、入力データを二次元配列データ化して、分析を行うことが可能になる。 In the process S1404 in the analysis process flow, the two-dimensional array data to be analyzed is compared with the output data after autoencoding, and for any i (0<i<=n) (that is, the column of the two-dimensional array ), the total value Diff(i) of the differences is calculated using the same method as during learning, that is, using Equation (6). The normalization of the difference and the abnormality probability are calculated by Equations 1 and 2 as in the first embodiment. As described above, input data can be converted into two-dimensional array data and analyzed.

工場機器では、例えばプレス機のように同じ動作を繰り返し行う機器が多く、測定データにも類似する波形が繰り返し出現することが多い。グラフ２００１の横軸としては、例えば回転装置の軸の角度を用いることも可能であるし、基点からの時間や距離などを用いることも可能である。入力データの二次元配列化について、更に例を示して説明する。 Among factory equipment, there are many equipment such as a press that repeats the same operation, and similar waveforms often appear repeatedly in measurement data. As the horizontal axis of the graph 2001, for example, it is possible to use the angle of the axis of the rotating device, or it is also possible to use time or distance from the base point. The two-dimensional arraying of input data will be further described with an example.

図２３Ａ～２３Ｄは、それぞれ、二次元配列化の例２３０１～２３０４を示す。画像２３０１～２３０４は、それぞれ、二次元配列データを可視化したものである。図２３Ａの画像２３０１に示すように、二次元配列データは列ごとに一つの要素が１（即ち画像の対応する列において、一つのマスが白）であっても構わない。 Figures 23A-23D show two-dimensional array examples 2301-2304, respectively. Images 2301 to 2304 are visualizations of two-dimensional array data. As shown in the image 2301 of FIG. 23A, the two-dimensional array data may have one element of 1 per column (that is, one cell in the corresponding column of the image may be white).

図２３Ｂ～２３Ｄの画像２３０２～２３０４に示すように、列ごとに複数の要素が１（即ち画像の対応する列において、複数のマスが白）であっても構わない。また、すべての要素が０の列（即ち、画像の対応する一列がすべて黒）があっても構わない。 As shown in images 2302-2304 of FIGS. 23B-23D, multiple elements per column may be 1 (ie, multiple cells are white in corresponding columns of the image). Also, there may be columns with all elements zero (ie, the corresponding column of the image is all black).

（Ａ）
図２３Ａの画像２３０１は、数式３～５による二次元配列データ化の画像である。列ごとに一つの要素が１（即ち、画像では白）になるように入力データを二次元配列にマッピングした例である。 (A)
An image 2301 in FIG. 23A is an image converted into two-dimensional array data according to Equations 3-5. This is an example in which input data is mapped to a two-dimensional array so that one element in each column is 1 (that is, white in the image).

（Ｂ）
二次元配列データは、必ずしも列ごとに一つの要素が１である必要はない。例えば、数式３、及び数式４を下記数式７及び８に変更しても、二次元配列を得ることが可能である。
（ｉ，ｋ）＝１（ｋ＜＝ｙｉ）（数式７）
（ｉ，ｋ）＝０（ｋ＞ｙｉ）（数式８） (B)
Two-dimensional array data does not necessarily have one element of 1 for each column. For example, a two-dimensional array can be obtained by changing Equations 3 and 4 to Equations 7 and 8 below.
(i, k)=1 (k<=yi) (Formula 7)
(i, k)=0 (k>yi) (Formula 8)

数式７及び８を用いて算出した二次元配列は、図２３Ｂの画像２３０２で可視化される。 The two-dimensional array calculated using Equations 7 and 8 is visualized in image 2302 of Figure 23B.

（Ｃ）
二次元配列の一次元目（即ち横軸）のデータ数と前処理後データの要素数は必ずしも一致する必要はない。また、入力データの各要素は、（ｘ、ｙ）の組で記録されても構わない。 (C)
The number of data in the first dimension (that is, the horizontal axis) of the two-dimensional array and the number of elements in the preprocessed data do not necessarily match. Also, each element of the input data may be recorded as a set of (x, y).

入力データの二次元配列化の一例として、回転装置の動作角度と測定値（例えば、ひずみセンサや荷重センサなどから得られる値）が合わせて測定され、入力データのｉ番目の要素が（角度ｉ、測定値ｉ）の組で記録されるケースにおける入力データの二次元配列化を説明する。ここで任意のｉについて、角度ｉは機器の動作角度であり、０＜＝角度ｉ＜３６０である。 As an example of a two-dimensional array of input data, the operating angle of the rotating device and the measured value (for example, the value obtained from a strain sensor or a load sensor) are measured together, and the i-th element of the input data is (angle i , the two-dimensional arraying of the input data in the case of being recorded in sets of measurements i). where angle i is the operating angle of the instrument, 0<=angle i<360, for any i.

データ前処理機能は、測定データを所定の時間で区切り、複数の入力データに分割する。二次元配列化機能は、サイズが３６０×Ｙ＿ｈｉｇｈである二次元配列を用意し、すべての要素を０と設定する。次に、二次元配列化機能は入力データの各要素（角度ｉ、測定値ｉ）について、対応する二次元配列の要素の位置（Ｘｉ、Ｙｉ）を算出する。Ｘｉは角度ｉの小数点以下を切り捨てて整数に変換し、１を加えることにより得られる。また、Ｙｉは数式５により得られる。 The data preprocessing function divides the measurement data into a plurality of pieces of input data by dividing the measurement data by a predetermined time. The two-dimensional array function prepares a two-dimensional array with a size of 360×Y_high and sets all elements to zero. Next, the two-dimensional array function calculates the position (Xi, Yi) of the corresponding two-dimensional array element for each element (angle i, measured value i) of the input data. Xi is obtained by truncating the decimal part of the angle i, converting it to an integer, and adding 1 to it. Also, Yi is obtained by Equation (5).

二次元配列化機能は、任意のｉについて、（Ｘｉ、Ｙｉ）＝１と設定する。本ケースにおいては、図２３Ｃの画像２３０３に示すように、一つの列に二つ以上の白いマスが存在したり、一列すべてが黒くなることもある。 The two-dimensional array function sets (Xi, Yi)=1 for any i. In this case, as shown in image 2303 of FIG. 23C, there may be two or more white cells in one column, or an entire column may be black.

本方法によれば、入力データの要素数が可変のケースにおいても二次元配列化が可能になる。また、入力データの要素が、軸となるデータの昇順、又は降順に並んでいないケースであっても二次元配列化が可能になる。 According to this method, two-dimensional arraying is possible even when the number of elements of input data is variable. In addition, even if the elements of the input data are not arranged in ascending or descending order of the axis data, two-dimensional arraying is possible.

（Ｄ）
（Ａ）の例では、要素１～ｎに対して二次元配列の一次元目の次数をｎとして、入力データの要素と二次元配列の一次元目を１対１でマッピングした。しかし、実際のデータにおいてはｎが大きく、１対１のマッピングが不可能なケースも存在する。そこで、入力データの二次元配列化の一例として、横軸（即ち、周波数、時間、又は距離など）方向の圧縮方法を示す。 (D)
In the example of (A), the elements of the input data and the first dimension of the two-dimensional array are mapped on a one-to-one basis, with n being the first dimension of the two-dimensional array for the elements 1 to n. However, there are cases where n is large in actual data and one-to-one mapping is impossible. Therefore, as an example of two-dimensional arraying of input data, a compression method in the direction of the horizontal axis (that is, frequency, time, distance, etc.) will be shown.

例えば、所定の単位区間ごとに平均値を算出し、平均値を基に二次元配列化を行うことが可能である。例えば、３６０度の角度に対して、１０度ごとに平均値を算出し、二次元配列の一次元目の次数を３６（＝３６０／１０）とすれば、３６×Ｙ＿ｈｉｇｈのサイズの二次元配列化が可能である。 For example, it is possible to calculate an average value for each predetermined unit interval and perform two-dimensional arraying based on the average value. For example, if the average value is calculated every 10 degrees for an angle of 360 degrees, and the order of the first dimension of the two-dimensional array is 36 (=360/10), then a two-dimensional array with a size of 36 x Y_high is possible.

しかし、測定値の振れ幅が大きいケースでは、平均値、最大値などの一つの値では特性を表現しきれないことも考えられる。この場合、例えば単位区間ごとに最小値及び最大値を算出し、最小値と最大値の間に挟まれる領域を１（即ち画像で白の表示）とすることにより、図２３Ｄの画像２３０４に示すように、測定値の振れ幅に対応した二次元配列化が可能になる。 However, in cases where the amplitude of measured values is large, it is conceivable that a single value, such as an average value or maximum value, cannot fully express the characteristics. In this case, for example, the minimum value and the maximum value are calculated for each unit interval, and the area sandwiched between the minimum value and the maximum value is set to 1 (that is, displayed in white in the image). Thus, it becomes possible to form a two-dimensional array corresponding to the amplitude of the measured values.

なお、所定の単位区間とは必ずしもリニアなスケールで等しくある必要はなく、例えばログスールで表示したときに等しくなるような単位区間であってもよいし、単位区間毎にそれぞれに異なる大きさを予め設定しておいてもよい。これにより、二次元配列のサイズを大きくすることなく入力データを二次元配列データ化することが可能になる。 Note that the predetermined unit interval does not necessarily have to be equal on a linear scale. You can set it. This makes it possible to transform the input data into two-dimensional array data without increasing the size of the two-dimensional array.

図２３Ａ～から２３Ｄでは一つの入力データを一つの二次元配列に変換したが、一つの入力データを必ずしも一つの二次元配列データに変換する必要はなく、例えば、入力データを二以上の領域に分割し、各領域をそれぞれ二次元配列データに変換して分析を行っても構わない。データの軸（即ち、周波数、時間、又は距離など）が広域であるケースや、領域に合わせて分析の粒度を変更したいときなどは、軸（即ち、周波数、時間、又は距離など）を二以上の領域に分割し、分割されたそれぞれの領域について学習、及び分析処理を行ってもよい。また、分割後の各領域は、軸のスケールを個々に選択しても構わない。 Although one input data is converted into one two-dimensional array in FIGS. 23A to 23D, it is not necessary to convert one input data into one two-dimensional array data. It may be divided and each region may be converted into two-dimensional array data for analysis. In cases where the data axis (i.e. frequency, time, distance, etc.) is wide, or when you want to change the granularity of analysis according to the area, set two or more axes (i.e., frequency, time, distance, etc.) regions, and learning and analysis processing may be performed for each of the divided regions. Also, for each region after division, the scale of the axis may be selected individually.

図２４にデータ領域を二つ以上に分割する例を示す。図２４の例では、例えば、軸（即ち、周波数、時間、又は距離など）を１００以下の領域２４０１と１００～１００００の領域２４０２に分割し、１００以下の領域２４０１をリニアスケールで二次元配列２４０３にマッピングし、１００以上の領域２４０２をログスケールで二次元配列２４０４にマッピングする。これにより、分析対象領域に適した粒度で分析を行うことが可能になる。 FIG. 24 shows an example of dividing the data area into two or more. In the example of FIG. 24, for example, an axis (that is, frequency, time, distance, etc.) is divided into a region 2401 of 100 or less and a region 2402 of 100 to 10000, and the region 2401 of 100 or less is linearly scaled in a two-dimensional array 2403 , and maps 100 or more regions 2402 to a two-dimensional array 2404 on a log scale. This makes it possible to perform analysis with a granularity suitable for the region to be analyzed.

また、測定値が小さく、縦軸をログスケールにすることにより値の変動が分かりやすくなるケースでは、分析時に測定値の対数値を用いてもよい。 In addition, in the case where the measured value is small and the variation of the value can be easily understood by using a log scale for the vertical axis, the logarithmic value of the measured value may be used at the time of analysis.

次に、数式５で使用したｙ＿ｍａｘ及びｙ＿ｍｉｎの算出方法を示す。これらのパラメータは、パラメータ設定時に明示的に設定できるようになっていてもよいし、算出アルゴリズムにより算出されてもよい。以下に、ｙ＿ｍａｘ及びｙ＿ｍｉｎを算出するアルゴリズムの一例を示す。 Next, a method of calculating y_max and y_min used in Equation 5 will be described. These parameters may be explicitly set at the time of parameter setting, or may be calculated by a calculation algorithm. An example of an algorithm for calculating y_max and y_min is shown below.

二次元配列化機能は、学習データを読み込み、測定値の最大値（以下、Ｔｍｐ＿ｍａｘと表記する）及び最小値（以下、Ｔｍｐ＿ｍｉｎ）を検出し、その平均値（以下、Ｔｍｐ＿ａｖｅ）を求める。下記の数式９及び１０により、ｙ＿ｍａｘ及びｙ＿ｍｉｎを定義する。
ｙ＿ｍａｘ＝Ｔｍｐ＿ａｖｅ＋α＊（Ｔｍｐ＿ｍａｘ－Ｔｍｐ＿ａｖｅ）（数式９）
ｙ＿ｍｉｎ＝Ｔｍｐ＿ａｖｅ－α＊（Ｔｍｐ＿ａｖｅ－Ｔｍｐ＿ｍｉｎ）（数式１０） The two-dimensional array function reads learning data, detects the maximum value (hereinafter referred to as Tmp_max) and minimum value (hereinafter referred to as Tmp_min) of measured values, and obtains the average value (hereinafter referred to as Tmp_ave). Equations 9 and 10 below define y_max and y_min.
y_max=Tmp_ave+α*(Tmp_max−Tmp_ave) (Formula 9)
y_min=Tmp_ave−α*(Tmp_ave−Tmp_min) (Formula 10)

ここで、αは予め定めた正の定数である。αはパラメータ設定時に設定されてもよいし、デフォルト値（例えば、α＝１．２など）が設定されてもよい。また、ｙ＿ｍａｘ及びｙ＿ｍｉｎは１０の階乗やその整数倍などで近似してもよい。 Here, α is a predetermined positive constant. α may be set when parameters are set, or may be set to a default value (eg, α=1.2). Also, y_max and y_min may be approximated by the factorial of 10 or its integral multiple.

測定値をログスケールにして二次元配列にマッピングする時は、ｙ＿ｍａｘ及びｙ＿ｍｉｎは、測定値の対数値を利用して算出し、分析も測定値の対数を用いて行うことができる。 When the measurements are log-scaled and mapped onto a two-dimensional array, y_max and y_min can be calculated using the logarithms of the measurements, and the analysis can also be performed using the logarithms of the measurements.

上記数式９及び１０により、ｙ＿ｍａｘ及びｙ＿ｍｉｎを明示的に指定せずに算出することが可能になる。 Equations 9 and 10 above allow y_max and y_min to be calculated without explicitly specifying them.

なお、入力データの二次元配列化は上記に挙げた例に限定されるものではない。また、例えば二次元配列化は、データの平滑化など様々な前処理を行った後で実施してもよいし、包絡線を算出し、算出した包絡線を二次元配列データにマッピングしてもよい。また、上側の包絡線と下側の包絡線とをそれぞれに算出し、挟まれた領域を１としてもよい。また、二次元配列化の方法、及び各種パラメータは、それぞれパラメータ設定画面において指定できるようになっていても構わない。 Note that the two-dimensional arraying of input data is not limited to the examples given above. In addition, for example, two-dimensional arraying may be performed after performing various preprocessing such as data smoothing, or an envelope may be calculated and the calculated envelope may be mapped to two-dimensional array data. good. Alternatively, the upper envelope and the lower envelope may be calculated separately, and 1 may be assigned to the sandwiched region. Also, the two-dimensional arraying method and various parameters may be specified on the parameter setting screen.

上記のように二次元配列化を行うことにより、様々な入力データに対応することが可能になり、入力データの特徴を活かした分析が可能になる。上記方法によれば、入力データの要素数は固定でも良いし、異なっていても構わない。 By forming a two-dimensional array as described above, it becomes possible to deal with various types of input data, and it becomes possible to perform analysis that takes advantage of the characteristics of the input data. According to the above method, the number of elements of the input data may be fixed or may vary.

実施例３では、入力データの各要素が二つの値、即ち（ｘ、ｙ）の組で表されるデータを二次元配列データに変換して異常を検出する方法を示した。本実施例では、入力データをｎ次元配列に変換して分析を行う例を示す。 In Example 3, a method of detecting an abnormality by converting data in which each element of input data is represented by a set of two values, ie, (x, y), into two-dimensional array data was shown. This embodiment shows an example of converting input data into an n-dimensional array for analysis.

例えば、入力データの各要素が三つの値、即ち（ｘ、ｙ、ｚ）の組で構成されるデータを三次元配列に変換して異常を検出するケースでは、任意のｉ及びｈ、即ち任意の（ｘｉ、ｙｈ）の組について、例えば数式５と同様の計算によりｚ＿ｉｈを算出し、以下に従い３次元配列を生成する。
（ｉ，ｈ，ｋ）＝１（ｋ＝ｚ＿ｉｈ）（数式１１）
（ｉ，ｈ，ｋ）＝０（ｋ≠ｚ＿ｉｈ）（数式１２） For example, in the case where each element of the input data consists of a set of three values, i.e. (x, y, z), is converted into a three-dimensional array to detect anomalies, arbitrary i and h, i.e. arbitrary (xi, yh), z_ih is calculated by, for example, the same calculation as Equation 5, and a three-dimensional array is generated according to the following.
(i, h, k)=1 (k=z_ih) (Formula 11)
(i, h, k)=0 (k≠z_ih) (Formula 12)

実施例３と同様に要素毎（即ち、（ｉ，ｈ，ｋ）のマス毎）にオートエンコーダへの入力データと出力データの差の絶対値を算出し、オートエンコード前後の差分として（ｘｉ、ｙｈ）の組（即ち１次元空間）毎に差分の合計値を算出し、これを用いて分析処理を行うことにより、（ｘｉ、ｙｈ）の組毎の異常確率を算出することが可能になる。また、オートエンコード前後の差分として、ｘｉ、又はｙｈを固定して得られる２次元空間上の差分の合計値を用いて異常確率を算出してもよい。例えば、ｘｉを固定する場合、ｘｉ毎に全てのｙｈの差分の合計値を算出することにより、ｘｉ毎の異常確率が算出される。これにより２次元以下の任意の空間に対する異常確率が算出可能である。 As in the third embodiment, the absolute value of the difference between the input data and the output data to the autoencoder is calculated for each element (that is, each square of (i, h, k)), and the difference before and after autoencoding is (xi, yh) (i.e., one-dimensional space) by calculating the total value of the differences and performing analysis processing using this, it becomes possible to calculate the abnormality probability for each pair of (xi, yh) . Further, as the difference before and after auto-encoding, the sum of the differences in the two-dimensional space obtained by fixing xi or yh may be used to calculate the abnormality probability. For example, when xi is fixed, the abnormality probability for each xi is calculated by calculating the total value of the differences of all yh for each xi. This makes it possible to calculate the anomaly probability for any space of two dimensions or less.

本方法を用いることにより、例えば、振動センサから得られた測定データについて、ｘを経過時間、ｙを周波数、ｚを周波数密度として、基点からの周波数密度の時間変動を分析することが可能である。また、例えば直線上に所定の間隔で設置された振動センサや音センサから収集した測定データについて、ｘをセンサの位置（即ち、センサＩＤに相当）、ｙを周波数、ｚをスペクトル密度として、状態監視を行うことも可能である。また、例えば平面や曲面上に等間隔で圧力センサやひずみセンサ等を設置し、（ｘ、ｙ）をセンサの位置、ｚを測定値として状態監視を行うことも可能である。このように、本方法は様々なケースに適用が可能である。 By using this method, for example, for measurement data obtained from a vibration sensor, it is possible to analyze the time variation of the frequency density from a starting point, where x is the elapsed time, y is the frequency, and z is the frequency density. . For example, for measurement data collected from vibration sensors and sound sensors installed at predetermined intervals on a straight line, x is the position of the sensor (that is, equivalent to the sensor ID), y is the frequency, and z is the spectral density. Monitoring is also possible. It is also possible to install pressure sensors, strain sensors, etc. on a flat or curved surface at regular intervals, and to monitor the condition using (x, y) as the sensor position and z as the measured value. Thus, this method can be applied to various cases.

図２５Ａ及び２５Ｂは異常確率の表示例を示す図である。分析結果の表示方法として、例えば図２５Ａに示すように、日時と空間の組に対して異常確率を表示することが可能である。異常確率は、任意の空間に対して表示してもよいし、条件を指定して抽出した一部の空間に対して表示してもよい。例えば、異常確率の高い空間を抽出して表示しても構わない。 25A and 25B are diagrams showing display examples of anomaly probabilities. As a display method of the analysis result, for example, as shown in FIG. 25A, it is possible to display the abnormality probability for a combination of date and time. The anomaly probability may be displayed for an arbitrary space, or may be displayed for a part of the space extracted by designating conditions. For example, a space with a high probability of abnormality may be extracted and displayed.

また、（ｘ、ｙ）の組に対して異常確率を表示してもよい。例えば、ｘがセンサＩＤ、ｙが周波数に対応するケースにおいて、図２５Ｂに示すように、横軸をセンサＩＤ、縦軸を周波数として異常確率を表示することができる。これらの表示により、状態監視システム１のユーザは、異常確率が高い箇所を容易に検出することが可能になる。 Also, anomaly probability may be displayed for a set of (x, y). For example, in a case where x corresponds to the sensor ID and y corresponds to the frequency, as shown in FIG. 25B, the abnormality probability can be displayed with the sensor ID on the horizontal axis and the frequency on the vertical axis. These displays allow the user of the condition monitoring system 1 to easily detect locations with a high probability of abnormality.

なお、本方法は入力データの各要素が４つ以上の値の組で表されるケースにも拡張が可能であり、入力データの各要素がｎ個の値の組で構成されるデータをｎ次元配列データに変換して分析を行うことが可能になる。なお、異常確率は、固定する要素の選択に応じて、ｎ－１次元以下の任意の空間に対して算出することが可能である。固定された要素の組（１要素からなる組を含む）毎に差分の合計値が算出される。これにより、多様な入力データについても空間毎の特徴（即ち振れ幅）に考慮した分析が可能になる。 Note that this method can be extended to the case where each element of the input data is represented by a set of four or more values. It becomes possible to convert to dimensional array data and perform analysis. It should be noted that the abnormality probability can be calculated for an arbitrary space of n-1 dimensions or less according to the selection of elements to be fixed. A sum of differences is calculated for each set of fixed elements (including a set consisting of one element). As a result, it is possible to analyze various types of input data while taking into account the characteristics of each space (that is, the amplitude).

入力データをｎ次元配列データに変換する複数の方法を用意し、ユーザによる変換方法の指定を受け付けてもよい。データ分析装置１０２は、入出力装置１０４において指定された方法により、入力データをｎ次元配列データに変換する。これにより、様々な種類のデータをより適切にｎ次元配列データに変換できる。 A plurality of methods for converting input data into n-dimensional array data may be prepared, and user designation of the conversion method may be accepted. The data analysis device 102 converts input data into n-dimensional array data by a method specified by the input/output device 104 . As a result, various types of data can be more appropriately converted into n-dimensional array data.

実施例３で説明したオートエンコード前後の差分の算出の際に、ｐｏｏｌｉｎｇ及びｕｎｐｏｏｌｉｎｇを行う例を示す。 An example of performing pooling and unpooling when calculating the difference before and after auto-encoding described in the third embodiment is shown.

分布算出機能６０５は、図２２の画像２２０１（即ち、二次元配列化された入力データ）、及び画像２２０２（即ち、オートエンコーダからの出力データ）のそれぞれに、ｐｏｏｌｉｎｇを行う。ここでは、例えばフィルタサイズを３×３、スライド幅を１としてｍａｘ－ｐｏｏｌｉｎｇを行う。 The distribution calculation function 605 performs pooling on each of the image 2201 (that is, two-dimensionally arrayed input data) and the image 2202 (that is, output data from the autoencoder) in FIG. Here, max-pooling is performed with a filter size of 3×3 and a slide width of 1, for example.

次に、分布算出機能６０５はｐｏｏｌｉｎｇ後の入力データ、及び出力データの差分を要素毎に算出し、差分の配列を求める。次に、分布算出機能６０５は算出した差分の配列に対してｕｎｐｏｏｌｉｎｇを行う。なお、ｕｎｐｏｏｌｉｎｇは先に実行したｍａｘ－ｐｏｏｌｉｎｇと同じフィルタサイズ、及びスライド幅を用いる。これを要素毎の差分２２０３として、以降は実施例１の処理と同様の処理を行い、列毎に差分の合計値を算出する。これらの処理は、学習処理における差分分布算出時（Ｓ１１０６）、及び分析処理における処理（Ｓ１４０４）の双方で実施する。 Next, the distribution calculation function 605 calculates the difference between the input data after pooling and the output data for each element, and obtains an array of the difference. Next, the distribution calculation function 605 unpools the array of calculated differences. Note that unpooling uses the same filter size and slide width as the previously executed max-pooling. Using this as the difference 2203 for each element, the same processing as in the first embodiment is performed thereafter to calculate the total value of the difference for each column. These processes are performed both when calculating the difference distribution in the learning process (S1106) and in the analysis process (S1404).

なお、フィルタサイズ、及びスライド幅は自由に設定してよい。また、二次元配列データと同様に、ｎ次元配列データについてもｐｏｏｌｉｎｇ及びｕｎｐｏｏｌｉｎｇを用いて分析を行ってもよい。 Note that the filter size and slide width may be set freely. Also, similar to two-dimensional array data, n-dimensional array data may also be analyzed using pooling and unpooling.

本実施例によれば、ｐｏｏｌｉｎｇ及びｕｎｐｏｏｌｉｎｇを行うことにより、測定時の揺らぎを吸収した分析が可能になる。以上により、多様な入力データに対して適切に分析を行うことが可能になる。 According to this embodiment, by performing pooling and unpooling, it is possible to perform analysis that absorbs fluctuations during measurement. As described above, it is possible to appropriately analyze various types of input data.

実施例１では、安定稼働時、即ち平常時のデータを用いてオートエンコーダを生成した。更に、監視時間帯のデータにおいて、実施例１の方法で異常と判定されたデータを収集して学習し、新たな分析モデルを生成してもよい。 In Example 1, an autoencoder was generated using data during stable operation, that is, during normal operation. Further, in the monitoring time period data, the data determined to be abnormal by the method of the first embodiment may be collected and learned to generate a new analysis model.

異常確率算出機能６０６は、異常と判定された入力データ（以下、異常判定データと記す）を、図示しない異常データに記録する。この際、例えば、異常の種類も併せて記録する。モデル生成機能６０３は、同じ種類の異常に分類される異常判定データを読み出し、実施例１と同様に、オートエンコーダ２及び差分分布２を生成し、図示しない異常モデル、及び異常差分分布データに記録する。これらは、異常の種類毎に生成する。 The abnormality probability calculation function 606 records input data determined to be abnormal (hereinafter referred to as abnormality determination data) as abnormality data (not shown). At this time, for example, the type of abnormality is also recorded. The model generation function 603 reads the abnormality determination data classified into the same type of abnormality, generates the autoencoder 2 and the difference distribution 2 in the same manner as in the first embodiment, and records them in the abnormality model and the abnormality difference distribution data (not shown). do. These are generated for each type of anomaly.

異常モデルが記録されているケースにおいては、異常確率算出機能６０６は、実施例１の異常判定処理（Ｓ１４０８）において異常と判定された入力データをオートエンコーダ２に入力し、オートエンコード前後の差分aを算出し、差分aを差分分布２と比較する。本実施例では、実施例１の数式２によって得られる異常度（Ｏ＿ｉ）を、各状態（即ち正常、及び異常）との一致具合を示す指標値と考える。差分aがいずれかの異常の種類に近い場合、該異常の種類を分析中のデータの異常の種類と考えることができる。 In the case where an abnormality model is recorded, the abnormality probability calculation function 606 inputs the input data determined to be abnormal in the abnormality determination process (S1408) of the first embodiment to the autoencoder 2, and calculates the difference a before and after autoencoding. is calculated, and the difference a is compared with the difference distribution 2. In this embodiment, the degree of abnormality (O_i) obtained by Equation 2 of Embodiment 1 is considered as an index value indicating the degree of matching with each state (that is, normal and abnormal). If the difference a is close to any anomaly type, then that anomaly type can be considered the anomaly type of the data under analysis.

図２６は、異常と判定されたデータ（即ち分析対象データ）を、分析モデル、及び一つ以上の異常モデルにより分析した結果を示す一例である。各要素について、分析モデル（又は異常モデル）による分析の結果、指標値が０％の時は分析モデル（又は異常モデル）と一致、指標値が１００％の時は分析モデル（又は異常モデル）と不一致として異常種類判定２６０１を表示する。 FIG. 26 is an example showing the results of analyzing data determined to be abnormal (that is, data to be analyzed) using an analysis model and one or more abnormal models. For each element, as a result of analysis by the analytical model (or abnormal model), when the index value is 0%, it matches the analytical model (or abnormal model), and when the index value is 100%, it is the analytical model (or abnormal model). The abnormality type determination 2601 is displayed as mismatch.

分析対象データの異常の種類が既知（即ち、異常モデル生成済み）であれば、いずれかの異常モデルと分析結果が一致し、異常の種類を推測することができる。但し、例えば複合要因の異常であれば（例えば、異常２と異常３が同時に発生）、分析対象データの分析結果は、異常２の特徴を示す要素では異常２に一致し、異常３の特徴を示す要素では異常３に一致することも考えられる。 If the type of anomaly in the data to be analyzed is known (that is, an anomaly model has already been generated), one of the anomaly models will match the analysis result, and the type of anomaly can be inferred. However, if, for example, anomaly has multiple factors (for example, anomaly 2 and anomaly 3 occur at the same time), the analysis results of the analysis target data will match anomaly 2 in the elements that show the characteristics of anomaly 2, and the characteristics of anomaly 3. It is also conceivable that the shown element matches anomaly 3.

これにより、異常種類判定２６０１において、分析対象データの分析結果と各異常モデルとの比較を基に、異常の種類の推定が可能である。なお、異常種類判定２６０１は一つの監視対象データに対して表示してもよいし、複数の監視対象データに対して表示しても構わない。 As a result, in the abnormality type determination 2601, the abnormality type can be estimated based on the comparison between the analysis result of the analysis target data and each abnormality model. Note that the abnormality type determination 2601 may be displayed for one monitoring target data, or may be displayed for a plurality of monitoring target data.

また、要素ＩＤは周波数、機器の回転角度、又は基点からの時間や距離に対応してもよいし、二つ以上のデータ系列を識別するＩＤであってもよい。また、データ系列は、二つ以上のセンサにより収集されても構わない。また、平常時のデータを活用して生成したオートエンコーダを用いた分析と、異常判定データを用いて生成したオートエンコーダ２を用いた分析を並列に実施しても構わない。 Also, the element ID may correspond to the frequency, the rotation angle of the device, or the time or distance from the base point, or may be an ID that identifies two or more data series. Also, the data series may be collected by more than one sensor. Also, the analysis using the autoencoder generated by utilizing the normal data and the analysis using the autoencoder 2 generated using the abnormality determination data may be performed in parallel.

また、監視対象機器と同型の機器の監視によって得られた前記分析モデル、及び異常モデルを入手して活用してもよい。また、複数の同型機器の監視により、異常種類が同一である異常データを収集及び学習して異常モデルを生成してもよい。これにより、収集が難しい異常データについても、効率的な収集が可能になる。本実施例によれば、異常発生時に異常の種類を推測することが可能になる。 Further, the analysis model and the abnormality model obtained by monitoring the equipment of the same type as the equipment to be monitored may be obtained and utilized. Further, an abnormality model may be generated by collecting and learning abnormality data of the same type of abnormality by monitoring a plurality of devices of the same type. This enables efficient collection of even difficult-to-collect abnormal data. According to this embodiment, it is possible to guess the type of abnormality when an abnormality occurs.

本実施例では、平常時の状態が二つ以上あるケースに対して状態を識別する例を示す。学習時の処理において、データ分析装置１０２は、実施例１と同様の処理により、状態毎にオートエンコーダ、及び要素毎の差分の分布を生成して記録する。監視時の処理においては、データ分析装置１０２は、監視対象データを実施例１と同様の処理により状態毎のオートエンコーダ及び差分の分布を用いて分析し、状態毎、かつ要素毎に数式２を用いて指標値を算出する。これにより、各状態との一致具合は要素毎に数値化される。 In the present embodiment, an example of identifying states for a case where there are two or more normal states is shown. In the processing during learning, the data analysis device 102 generates and records an autoencoder for each state and the difference distribution for each element by the same processing as in the first embodiment. In the monitoring process, the data analysis device 102 analyzes the monitoring target data using the autoencoder and difference distribution for each state by the same processing as in the first embodiment, and calculates Equation 2 for each state and for each element. to calculate the index value. As a result, the degree of matching with each state is quantified for each element.

更に、異常時のデータも収集されたケースにおいては、平常状態と同様に異常状態のオートエンコーダ及び要素毎の差分の分布を生成し、生成した異常状態のオートエンコーダ、及び差分の分布を用いて要素毎に指標値を算出しても構わない。 Furthermore, in the case where data in an abnormal state is also collected, an autoencoder in an abnormal state and a distribution of differences for each element are generated in the same way as in the normal state, and the generated autoencoder in an abnormal state and the distribution of differences are used to An index value may be calculated for each element.

図２７は、状態識別結果の画像の一例２７０１である。実施例６と同様に、各状態に対して監視対象データの要素毎の指標値が表示される。これにより、監視対象データの状態を識別することが可能になる。また、いずれの状態とも異なる場合、未知の状態であると識別することができる。図２８は、状態識別結果の画像の別の一例２８０１である。図２８に示すように、各状態との要素毎の指標値を時系列に表示しても構わない。 FIG. 27 is an example 2701 of the image of the state identification result. As in the sixth embodiment, an index value for each element of monitoring target data is displayed for each state. This makes it possible to identify the state of the monitored data. Also, if it is different from any state, it can be identified as an unknown state. FIG. 28 is another example 2801 of a state identification result image. As shown in FIG. 28, index values for each state and each element may be displayed in chronological order.

図２９は、状態識別結果の画像の他の一例２９０１である。例えば、状態毎に任意の日時について全要素の指標値の平均を算出し、これを各状態である確率として表示しても構わない。また、要素毎に重みを付けて平均値を算出しても構わない。 FIG. 29 is another example 2901 of the state identification result image. For example, an average index value of all elements may be calculated for an arbitrary date and time for each state and displayed as the probability of being in each state. Alternatively, each element may be weighted to calculate the average value.

本実施例によれば、状態が複数存在するケースについて、監視対象データがどの状態であるのかを識別することが可能になる。 According to this embodiment, it is possible to identify which state the monitoring target data is in when there are multiple states.

なお、本実施例は入力データをｎ次元配列に変換して分析を行うケースにも適用可能である。状態識別結果は、例えば指定した空間について、図２７や図２８と同様に表示することができる。但し、この時、要素ＩＤの代わりに空間の識別子を用いる。 This embodiment can also be applied to the case of converting input data into an n-dimensional array for analysis. The state identification result can be displayed, for example, in the designated space in the same manner as in FIGS. However, at this time, a space identifier is used instead of the element ID.

本実施例では、オートエンコーダ生成時のパラメータの選択例を示す。なお、本実施例では実施例１と異なる処理を中心に説明を行う。 This embodiment shows an example of parameter selection when generating an autoencoder. In addition, in the present embodiment, the explanation will focus on the processing different from that of the first embodiment.

図３０にオートエンコーダのネットワーク構成の一例を示す。本実施例ではオートエンコーダは入力ノード群３００２、中間ノード群３００３、及び出力ノード群３００４を含む。入力ノード群３００２、及び出力ノード群３００４におけるノード数Ｎはオートエンコーダへの入力要素数であり、実施例１の要素数(即ちｎ)、又は実施例２の二次元配列の要素数（即ちｎ×Ｙ＿ｈｉｇｈ）に相当する。中間ノード群３００３の１層分（即ち、中間ノード群３００３の縦一列）のノード数Ｍは、１＜Ｍ＜Ｎを満たす整数である。 FIG. 30 shows an example of the network configuration of the autoencoder. In this example, the autoencoder includes input nodes 3002 , intermediate nodes 3003 , and output nodes 3004 . The number of nodes N in the input node group 3002 and the output node group 3004 is the number of input elements to the autoencoder. xY_high). The number M of nodes in one layer of the intermediate node group 3003 (that is, one vertical row of the intermediate node group 3003) is an integer that satisfies 1<M<N.

本実施例では、オートエンコーダの中間ノード群３００３の１層分の個数Ｍ（以下、中間ノード数Ｍと記す）、及び学習量Ｌの二つを、オートエンコーダ生成時のパラメータの例として説明を行う。ここで学習量Ｌは、例えばモデル生成機能６０３がオートエンコーダ生成時にモデル生成用データを繰り返して学習する回数とする。 In this embodiment, the number M of the intermediate node group 3003 of the autoencoder for one layer (hereinafter referred to as the number of intermediate nodes M) and the learning amount L will be described as examples of parameters when generating the autoencoder. conduct. Here, the learning amount L is, for example, the number of times the model generation function 603 repeats and learns model generation data when generating an autoencoder.

中間ノード数Ｍ、及び学習量Ｌは、例えばパラメータ設定画面１６００において入力するようにしてもよい。中間ノード数Ｍの値は、例えば、入力ノード数Ｎの２分の１、４分の１、というように入力ノード数との比で設定してもよい。 The number of intermediate nodes M and the amount of learning L may be input on the parameter setting screen 1600, for example. The value of the number M of intermediate nodes may be set in proportion to the number of input nodes, such as 1/2 or 1/4 of the number N of input nodes.

モデル生成機能６０３は、中間ノード数Ｍ、及び学習量Ｌの任意の組み合わせ（Ｍｕ、Ｌｖ）について、オートエンコーダを生成する。モデル演算機能６０４は、生成された各オートエンコーダに差分分布生成用データを入力し、それぞれ出力データを得る。分布算出機能６０５は、中間ノード数Ｍ、及び学習量Ｌの組み合わせ（Ｍｕ、Ｌｖ）毎に、要素毎(但し、二次元配列を用いる際は列毎。以下も同様。)に差分を算出する。 The model generation function 603 generates an autoencoder for an arbitrary combination (Mu, Lv) of the number of intermediate nodes M and the amount of learning L. The model calculation function 604 inputs difference distribution generation data to each generated autoencoder and obtains output data. The distribution calculation function 605 calculates the difference for each element (however, for each column when using a two-dimensional array; the same applies hereinafter) for each combination (Mu, Lv) of the number of intermediate nodes M and the amount of learning L. .

自然界においては、非負である多くの分布が対数正規分布で近似されることが知られている。そこで、分布算出機能６０５は、算出した各差分の分布と対数正規分布との比較を行う。具体的には、分布算出機能６０５は要素毎に差分の対数値の分布の正規性を確認する。 In nature, it is known that many non-negative distributions are approximated by the lognormal distribution. Therefore, the distribution calculation function 605 compares the calculated distribution of each difference with the logarithmic normal distribution. Specifically, the distribution calculation function 605 confirms the normality of the distribution of logarithmic values of differences for each element.

例えば、分布算出機能６０５は、シャピロ‐ウィルク検定を用いて、中間ノード数Ｍ、及び学習量Ｌの組み合わせ（Ｍｕ、Ｌｖ）毎に、要素毎に差分の対数値の分布の検定統計量を算出する。次に、分布算出機能６０５は、要素毎に算出した検定統計量の平均値を算出する。 For example, the distribution calculation function 605 uses the Shapiro-Wilk test to calculate the test statistic of the distribution of the logarithm of the difference for each element for each combination (Mu, Lv) of the number of intermediate nodes M and the amount of learning L. do. Next, the distribution calculation function 605 calculates the average value of the test statistics calculated for each element.

図３１に中間ノード数Ｍと学習量Ｌの組み合わせ毎に算出した検定統計量の平均値のイメージを示す。ある中間ノード数Ｍｑ、及び学習量Ｌｐの組み合わせ（Ｍｑ，Ｌｐ）において検定統計量の平均値が十分に大きい場合、各要素の差分の対数値の分布は正規分布に近づく。 FIG. 31 shows an image of the average value of test statistics calculated for each combination of the number of intermediate nodes M and the amount of learning L. In FIG. When the mean value of the test statistic is sufficiently large in a certain combination (Mq, Lp) of the number of intermediate nodes Mq and the amount of learning Lp, the distribution of the logarithmic value of the difference of each element approaches the normal distribution.

分布算出機能６０５は、検定統計量の平均値があらかじめ設定した閾値を超える（Ｍｑ，Ｌｐ）の組み合わせがある時、この組み合わせを選択する。検定統計量の平均値が閾値を超える（Ｍｑ，Ｌｐ）の組み合わせが複数存在する場合、例えば、検定統計量の平均値が最大となる組み合わせを選択してもよい。 When there is a combination of (Mq, Lp) in which the average value of test statistics exceeds a preset threshold value, the distribution calculation function 605 selects this combination. If there are multiple combinations of (Mq, Lp) for which the average value of the test statistic exceeds the threshold, for example, the combination that maximizes the average value of the test statistic may be selected.

分布算出機能６０５は、選択した中間ノード数Ｍと学習量Ｌの組み合わせ（Ｍｑ，Ｌｐ）について、対応するオートエンコーダを分析モデル６１１に記録し、任意の要素ｉに対して差分の対数値の分布の平均値Ｍ＿Ｌｎ（ｉ）、及び標準偏差σ＿Ｌｎ（ｉ）を算出し、差分分布データ６０９に記録する。 The distribution calculation function 605 records the corresponding autoencoder in the analysis model 611 for the selected combination (Mq, Lp) of the number of intermediate nodes M and the amount of learning L, and calculates the distribution of the logarithmic value of the difference for an arbitrary element i. The average value M_Ln(i) and the standard deviation σ_Ln(i) of are calculated and recorded in the difference distribution data 609 .

次に、分析処理を説明する。異常確率算出機能６０６は、任意の要素ｉに対して、オートエンコード前後の差分の対数値Ｄｉｆｆ＿Ｌｎ（ｉ）を算出する。次に異常確率算出機能６０６は、任意の要素ｉに対して差分の対数値Ｄｉｆｆ＿Ｌｎ（ｉ）を、平均値Ｍ＿Ｌｎ（ｉ）と標準偏差σ＿Ｌｎ（ｉ）を用いて数式１３により標準化する。
Ｄｉｓｔ（ｉ）＝（Ｄｉｆｆ＿Ｌｎ（ｉ）－Ｍ＿Ｌｎ（ｉ））／σ＿Ｌｎ（ｉ）
（数式１３） Next, analysis processing will be described. The abnormality probability calculation function 606 calculates the logarithmic value Diff_Ln(i) of the difference before and after auto-encoding for an arbitrary element i. Next, the abnormality probability calculation function 606 standardizes the logarithmic value Diff_Ln(i) of the difference for an arbitrary element i by Equation 13 using the average value M_Ln(i) and the standard deviation σ_Ln(i).
Dist(i)=(Diff_Ln(i)-M_Ln(i))/σ_Ln(i)
(Formula 13)

標準正規分布においては、ある値ｚ以上が生じる確率は既知であり（即ち、標準正規分布表）、Ｄｉｓｔ（ｉ）について、その稀有性を、標準正規分布表を用いて得ることができる。 In the standard normal distribution, the probability of occurrence of a value z or greater is known (ie, the standard normal distribution table), and for Dist(i) its rarity can be obtained using the standard normal distribution table.

結果表示機能６０７は、測定日時と要素ＩＤ（即ちｉ）の組に対応するＤｉｓｔ（ｉ）を、Ｄｉｓｔ（ｉ）と色を対応付けて、分析結果表示画面１７００の異常確率１７０８の代わりに表示してもよい。この時、数式１３において、Ｄｉｓｔ（ｉ）＜０の時は、Ｄｉｓｔ（ｉ）＝０としてもよい。 The result display function 607 displays Dist(i) corresponding to a set of measurement date and element ID (that is, i) in place of the abnormality probability 1708 on the analysis result display screen 1700 by associating Dist(i) with a color. You may At this time, in Equation 13, when Dist(i)<0, Dist(i)=0 may be set.

このように差分の分布が対数正規分布に近づくケースにおいては、差分の対数値の分布を正規分布として扱うことにより、任意の要素ｉについて、その稀有性を標準正規分布表に基づいて算出することも可能である。 In such a case where the distribution of the difference approaches the lognormal distribution, by treating the distribution of the logarithmic value of the difference as a normal distribution, the rarity of any element i can be calculated based on the standard normal distribution table. is also possible.

なお、図３０に示したオートエンコーダは一例であり、この構成に制限されるものではなく、例えば、中間ノードの層の数を増やすことも可能である。また、本実施例では中間ノード数Ｍ、及び学習量Ｌをパラメータとして変動させる例を示したが、他のパラメータを変動させてもよく、例えば中間ノードの層の数をパラメータとして変動させても構わない。 Note that the autoencoder shown in FIG. 30 is just an example, and the configuration is not limited to this, and for example, it is possible to increase the number of intermediate node layers. Also, in this embodiment, an example in which the number of intermediate nodes M and the learning amount L are varied as parameters is shown, but other parameters may be varied, for example, the number of layers of intermediate nodes may be varied as a parameter. I do not care.

以上、本発明について実施の形態に基づき具体的に説明したが、本発明は上記の実施の形態に限定されるものではなく、その要旨を逸脱しない範囲で種々変更可能である。例えば、上記の実施の形態は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また上記実施形態の構成の一部について、他の構成の追加、削除、又は置換をすることができる。 Although the present invention has been specifically described above based on the embodiments, the present invention is not limited to the above embodiments, and can be variously modified without departing from the scope of the invention. For example, the above embodiments have been described in detail in order to explain the present invention in an easy-to-understand manner, and are not necessarily limited to those having all the described configurations. In addition, addition, deletion, or replacement of other configurations can be made to part of the configurations of the above embodiments.

また上記の各構成、機能、及び処理手段などは、それらの一部又は全部を、例えば、集積回路で設計する等によりハードウェアで実現してもよい。また上記の各構成、機能等は、プロセッサが夫々の機能を実現するプログラムを解釈し、実行することによりソフトウェアで実現してもよい。各機能を実現するプログラム、テーブル、及びファイル等の情報は、メモリやハードディスク、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等の記録装置、又はＩＣカード、ＳＤカード、ＤＶＤ等の記録媒体に置くことができる。 Further, each of the above configurations, functions, processing means, and the like may be realized by hardware, for example, by designing a part or all of them using an integrated circuit. Further, each of the above configurations, functions, etc. may be realized by software by a processor interpreting and executing a program for realizing each function. Information such as programs, tables, and files that implement each function can be stored in recording devices such as memories, hard disks, SSDs (Solid State Drives), or recording media such as IC cards, SD cards, and DVDs.

また各図において、制御線や情報線は説明上必要と考えられるものを示しており、必ずしも実装上の全ての制御線や情報線を示しているとは限らない。例えば、実際にはほとんど全ての構成が相互に接続されていると考えてもよい。 Further, in each drawing, control lines and information lines are those considered to be necessary for explanation, and not all control lines and information lines for mounting are necessarily shown. For example, it may be considered that almost all structures are interconnected in practice.

また以上に説明した情報処理装置の各機能、及び各データベースの配置形態は一例に過ぎない。各機能、及び各データベースの配置形態は、各情報処理装置がハードウェアやソフトウェアの性能、処理効率、及び通信効率等の観点から最適な配置形態に変更し得る。 Also, each function of the information processing apparatus described above and the arrangement form of each database are merely examples. The layout of each function and each database can be changed to an optimum layout from the viewpoint of hardware and software performance, processing efficiency, communication efficiency, and the like for each information processing apparatus.

また前述した各データベースの構成は、リソースの効率的な利用、処理効率向上、アクセス効率向上、及び検索効率向上等の観点から柔軟に変更し得る。 Also, the configuration of each database described above can be flexibly changed from the viewpoint of efficient use of resources, improvement of processing efficiency, improvement of access efficiency, improvement of search efficiency, and the like.

また以上に説明したデータの記録形態は、テーブル構造に限るものではなく、夫々のデータを適切に関連付けて記憶できればよく、キュー構造やリスト構造等、他の構造で記録されてもよい。また値の記録方法として数値の他に、数式等を用いてもよい。また夫々のテーブルに記録される各項目は、用途に応じて異なってもよく、例として記載した項目に限定されない。またデータは複数の情報処理装置に分散配置されていてもよい。 The data recording format described above is not limited to the table structure, and may be recorded in other structures such as a queue structure and a list structure as long as each data can be appropriately associated and stored. In addition to numerical values, numerical formulas and the like may be used as a value recording method. Also, each item recorded in each table may be different depending on the application, and is not limited to the items described as examples. Also, the data may be distributed to a plurality of information processing devices.

また以上において、要素の数（例えば、個数、数値、量、及び範囲等）に言及する場合、とくに明示した場合及び原理的に明らかに特定の数に限定される場合等を除いて、その特定の数に限定されるものではなく、特定の数以上でも以下でもよい。また以上の説明において、その構成要素（例えば、各機能、データベース、及び要素ステップ等）は、特に明示した場合及び原理的に明らかに必須であると考えられる場合等を除き、必ずしも必須のものではない。 In addition, in the above, when referring to the number of elements (e.g., number, numerical value, amount, range, etc.), the specific is not limited to the number of , and may be greater than or less than a specific number. Also, in the above explanation, the components (for example, each function, database, element step, etc.) are not necessarily essential unless otherwise specified or clearly considered essential in principle. do not have.

また以上に示した各処理は、監視対象や目的に応じて、リアルタイム処理、又はバッチ処理のいずれで実行されてもよい。また以上に示した各実施形態は夫々を独立して適用してもよいし、複数の実施形態の全部又は一部を組み合せて適用してもよい。 Also, each of the processes described above may be executed either in real time or in batch depending on the object to be monitored and the purpose. Moreover, each embodiment shown above may apply each independently, and may apply combining all or one part of several embodiment.

また、分析対象例として、工場の機器をあげたが、必ずしも工場の機器や装置に限るものではなく、例えば、線路、鉄橋、又はトンネルの天井などに設置したセンサ（例えば振動センサ、音センサ、又はひずみセンサなど）において、電車が通過する際の測定データを分析して状態を識別する、又は異常を検出するなど、工場以外の用途への適用も考えられる。 In addition, as an example of the analysis target, the equipment in the factory was given, but it is not necessarily limited to the equipment and devices in the factory. or strain sensors, etc.), applications other than factories, such as analyzing measurement data when a train passes to identify the state or detect anomalies, are also conceivable.

また、各実施例においてはセンサを使用してデータを収集する例を示したが、センサの使用に限定する必要はなく、例えば、サーバ負荷のように、センサを使用せずに収集した値や、画像データなどを対象としてもよい。 Also, in each embodiment, an example of collecting data using a sensor has been shown, but it is not necessary to limit the use of the sensor. , image data, and the like.

１：状態監視システム、１０１：テータ格納装置、１０２：データ分析装置、１０３：センサ、１０４：入出力装置、１０５：分析対象、６０９：差分分布データ、６１１：分析モデル 1: Condition monitoring system, 101: Data storage device, 102: Data analysis device, 103: Sensor, 104: Input/output device, 105: Analysis object, 609: Difference distribution data, 611: Analysis model

Claims

A method for a computer system to monitor monitored data, comprising:
the computer system stores an analytical model;
The analysis model outputs restored data with the same number of elements as the input data for input data composed of a plurality of elements, so that the difference between the input normal data and the restored data for the normal data is small. is learned by
The method comprises: the computer system comprising:
generating a distribution of differences for each element between a plurality of normal data for generating a difference distribution and restored data obtained by inputting the plurality of normal data for generating a difference distribution into the analysis model;
comparing the difference for each element between the monitoring target data and the restored data obtained by inputting the monitoring target data into the analysis model and the distribution of the difference for each element;
A method of calculating an index value for each element based on the comparison.

2. The method of claim 1, wherein
The analysis model generates intermediate data with a reduced number of elements from the input data, generates and outputs the restored data of the input data from the intermediate data,
The method comprises: the computer system comprising:
Acquire multiple normal data for training,
dividing the plurality of normal data for learning into a plurality of normal data for model generation and the plurality of normal data for difference distribution generation;
The plurality of normal data for model generation are input to the analysis model, and the analysis model is adjusted so that each difference between the plurality of normal data for model generation and the restored data of the plurality of normal data for model generation is reduced. how to learn

2. The method of claim 1, wherein
The method according to claim 1, wherein each element of the input data is a value associated with a predetermined value and arranged in a predetermined order.

2. The method of claim 1, wherein
A method, wherein each element of the input data is a set of a first value and a second value associated with the first value.

2. The method of claim 1, wherein
The input data of the analysis model is n-dimensional array data in which n is an integer of 2 or more,
The analysis model generates intermediate data with a reduced number of elements from the input data, generates and outputs restored data with the same number of elements as the n-dimensional array data from the intermediate data,
The method comprises: the computer system comprising:
Acquiring a plurality of normal data for learning, which consists of a plurality of elements, each element consisting of n values,
converting the plurality of normal data for learning into an n-dimensional array data group;
dividing the n-dimensional array data group into a plurality of n-dimensional array data for model generation and a plurality of n-dimensional array data for difference distribution generation;
inputting the plurality of n-dimensional array data for model generation into the analysis model, learning the analysis model so that each difference between the plurality of n-dimensional array data for model generation and corresponding restored data becomes small;
inputting each of the plurality of n-dimensional array data for difference distribution generation into the analysis model to generate restored data for each of the plurality of n-dimensional array data for difference distribution generation;
calculating a difference between each of the plurality of n-dimensional array data for difference distribution generation and each of the restored data of the plurality of n-dimensional arrays for difference distribution generation, for each element of the n-dimensional array, and element for each space of dimension less than n calculating the total value of the differences calculated for each space, and generating a distribution of the total value of the differences for each of the spaces;
converting the monitoring target data into n-dimensional array data and inputting it to the analysis model to generate restored data of the monitoring target data;
calculating, for each element of the n-dimensional array, a difference between the monitoring target data after the n-dimensional arraying and the restored data of the monitoring target data, and calculating a total value of the differences for each element for each of the spaces; calculating an index value for each space based on a comparison with the distribution of the total value of the differences for each space;
A method of outputting the calculated index value.

6. The method of claim 5, wherein the computer system comprises:
Pooling the input data to the analysis model and the restored data, calculating an n-dimensional array for each,
calculating the difference between the input data after the pooling and the restored data for each element to calculate an n-dimensional array of the difference;
unpooling the n-dimensional array of the differences to calculate an n-dimensional array;
calculating the total value for each space of dimensions less than n for the n-dimensional array obtained by the unpooling;
A method of calculating the index value using a total value calculated for each space of the n-dimensional array obtained by the unpooling.

6. The method of claim 5, wherein the computer system comprises:
dividing the input data to the analysis model into a plurality of regions;
A method of calculating an index value by converting each of the divided regions into n-dimensional array data.

2. The method of claim 1, wherein
The method, wherein the analytical model is an autoencoder.

2. The method of claim 1, wherein the computer system comprises:
Set the threshold for abnormality judgment,
A method of determining abnormality of the index value calculated for each of the elements according to the threshold value.

2. The method of claim 1, wherein the computer system comprises:
generating the analysis model for each state with respect to a plurality of states;
generating a distribution for each element of the difference for each state using the analysis model;
inputting the monitoring target data into each analysis model for each state and calculating the difference for each element from the restored data;
calculating an index value for each element for each state based on a comparison between the difference for each element calculated for each state and the distribution of the difference for each element calculated for each state;
The method of outputting the calculated index value.

2. The method of claim 1, wherein
A method, wherein the computer system displays the index value calculated for each element in association with a color.

2. The method of claim 1, wherein the computer system comprises:
generating an analytical model for each combination of one or more parameters,
generating a logarithm distribution of the difference for each element for each analysis model of the analysis model;
Selecting a combination of parameters that makes the distribution of the logarithmic value of the difference close to a normal distribution,
selecting an analysis model corresponding to the selected combination of parameters from the analysis models;
The logarithm of the difference for each element between the monitoring target data and the restored data obtained by inputting the monitoring target data into the selected analysis model, and the logarithm of the difference for each element corresponding to the selected analysis model. make a comparison with the distribution,
A method of calculating an index value for each of the elements based on the comparison.

A computer system for monitoring monitoring target data,
one or more processors;
one or more storage devices,
the one or more storage devices store analytical models;
The analysis model outputs restored data with the same number of elements as the input data for input data composed of a plurality of elements, so that the difference between the input normal data and the restored data for the normal data is small. is learned by
the one or more processors
generating a distribution of differences for each element between a plurality of normal data for generating a difference distribution and restored data obtained by inputting the plurality of normal data for generating a difference distribution into the analysis model;
comparing the difference for each element between the monitoring target data and the restored data obtained by inputting the monitoring target data into the analysis model and the distribution of the difference for each element;
A computer system that calculates an index value for each element based on the comparison.

14. The computer system of claim 13,
The analysis model generates intermediate data with a reduced number of elements from the input data, generates and outputs the restored data of the input data from the intermediate data,
The one or more processors
Acquire multiple normal data for training,
dividing the plurality of normal data for learning into a plurality of normal data for model generation and the plurality of normal data for difference distribution generation;
The plurality of normal data for model generation are input to the analysis model, and the analysis model is adjusted so that each difference between the plurality of normal data for model generation and the restored data of the plurality of normal data for model generation is reduced. A computer system that learns

14. The computer system of claim 13,
The input data of the analysis model is n-dimensional array data in which n is an integer of 2 or more,
The analysis model generates intermediate data with a reduced number of elements from the input data, generates and outputs restored data with the same number of elements as the n-dimensional array data from the intermediate data,
The one or more processors
Acquiring a plurality of normal data for learning, which consists of a plurality of elements, each element consisting of n values,
converting the plurality of normal data for learning into an n-dimensional array data group;
dividing the n-dimensional array data group into a plurality of n-dimensional array data for model generation and a plurality of n-dimensional array data for difference distribution generation;
inputting the plurality of n-dimensional array data for model generation into the analysis model, learning the analysis model so that each difference between the plurality of n-dimensional array data for model generation and corresponding restored data becomes small;
inputting each of the plurality of n-dimensional array data for difference distribution generation into the analysis model to generate restored data for each of the plurality of n-dimensional array data for difference distribution generation;
calculating a difference between each of the plurality of n-dimensional array data for difference distribution generation and each of the restored data of the plurality of n-dimensional arrays for difference distribution generation, for each element of the n-dimensional array, and element for each space of dimension less than n calculating the total value of the differences calculated for each space, and generating a distribution of the total value of the differences for each of the spaces;
converting the monitoring target data into n-dimensional array data and inputting it to the analysis model to generate restored data of the monitoring target data;
calculating, for each element of the n-dimensional array, a difference between the monitoring target data after the n-dimensional arraying and the restored data of the monitoring target data, and calculating a total value of the differences for each element for each of the spaces; calculating an index value for each space based on a comparison with the distribution of the total value of the differences for each space;
A computer system that outputs the calculated index value.