JP7311480B2

JP7311480B2 - Information processing system and information processing method

Info

Publication number: JP7311480B2
Application number: JP2020201209A
Authority: JP
Inventors: 翼高橋
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2017-06-19
Filing date: 2020-12-03
Publication date: 2023-07-19
Anticipated expiration: 2037-06-19
Also published as: JP7468586B2; JP2022174161A; JP2021047892A

Description

本発明は、データの処理に関し、特に、時系列データを処理する情報処理装置などに関する。 The present invention relates to data processing, and more particularly to an information processing apparatus that processes time-series data.

ネットワークへの不正侵入を検知する装置、又は、工場における温度センサーなどのセンサー装置（以下、まとめて「検知装置」と呼ぶ）は、監視対象（ネットワーク又は工場）における所定の属性の状態又は値を観測する。属性とは、温度、圧力、流量、若しくは、振動などの物理的な値、又は、データの送信、受信、所定のメッセージの作成などの動作の状態である。以下、観測した値及び観測した状態をまとめて「観測値」と呼ぶ。さらに、検知装置は、単一の時間での観測ではなく、通常は、連続して（シーケンスに）観測を継続する。連続した観測を基に、検知装置は、観測した属性の値（観測値）と、観測した時間などの時間に関連する情報（タイムスタンプ）と関連付けて含むデータを、時々刻々と生成する。つまり、検知装置は、監視対象を観測し、複数の時間における観測値とタイムスタンプとを含むデータ（以下、「時系列データ」又は「シーケンスデータ」と呼ぶ）を生成する。そして、検知装置は、生成したシーケンスデータを分析することで、監視対象を把握する。ここで監視対象の把握とは、例えば、監視対象が正常であるか否の判別、又は、監視対象における疑わしい挙動の検出などである。 Devices that detect unauthorized network intrusions or sensor devices such as temperature sensors in factories (hereinafter collectively referred to as “detection devices”) detect the state or value of a predetermined attribute in the monitored object (network or factory). Observe. Attributes are physical values such as temperature, pressure, flow rate, or vibration, or operational states such as sending or receiving data or creating a predetermined message. Hereinafter, an observed value and an observed state are collectively referred to as an "observed value". Further, the sensing device typically continues to make observations continuously (in a sequence) rather than making observations at a single time. Based on the continuous observations, the sensing device generates data moment by moment that includes the value of the observed attribute (observed value) in association with time-related information (time stamp) such as the time of observation. That is, the detection device observes the monitored object and generates data including observation values and time stamps at a plurality of times (hereinafter referred to as "time-series data" or "sequence data"). Then, the detection device grasps the monitoring target by analyzing the generated sequence data. Here, grasping the monitored object means, for example, determination of whether the monitored object is normal or detection of suspicious behavior in the monitored object.

シーケンスデータの例としては、Ｔｗｉｔｔｅｒなどのミニブログ（つぶやき）、プロキシサーバ若しくは監視サーバのログ、又は、侵入検知（ＩＤＳ：ＩｎｔｒｕｓｉｏｎＤｅｔｅｃｔｉｏｎＳｙｓｔｅｍ）のアラートログなどがある。 Examples of sequence data include micro-blogs (tweets) on Twitter and the like, proxy server or monitoring server logs, and intrusion detection (IDS: Intrusion Detection System) alert logs.

このようなシーケンスデータ（時系列データ）を処理する技術は、いろいろな分野に適用されている（例えば、特許文献１ないし３）。 Techniques for processing such sequence data (time-series data) are applied to various fields (for example, Patent Documents 1 to 3).

特許文献１は、脳波を解析する技術を開示している。 Patent Literature 1 discloses a technique for analyzing electroencephalograms.

特許文献２は、正常稼働状態のデータを学習する技術を開示している。 Patent Literature 2 discloses a technique for learning data in a normal operating state.

特許文献３は、正常パターンを自動抽出する技術を開示している。 Patent Literature 3 discloses a technique for automatically extracting normal patterns.

シーケンスデータには、監視対象の様々な挙動が、入り混じっている。例えば、シーケンスデータには、時間的な定常性（ｃｏｎｓｔａｎｃｙ）を持つ挙動と、時間的な定常性のない挙動（以下、「ランダムな挙動」と呼ぶ）とが、入り混じっている場合がある。時間的な定常性を持つ挙動とは、例えば、「特定の曜日及び／又は時間帯によく発現する挙動」又は「毎日発現する挙動」である。また、ランダムな挙動とは、「発現する時間又は位置を特定できない挙動」である。 The sequence data contains a mixture of different behaviors to be monitored. For example, sequence data may contain a mixture of behavior with temporal constancy and behavior without temporal constancy (hereinafter referred to as “random behavior”). A behavior having temporal constancy is, for example, “a behavior that often appears on a specific day of the week and/or during a specific time period” or “a behavior that occurs every day”. Random behavior is "behavior whose manifestation time or position cannot be specified".

コンピュータ・ネットワーク上での不正な攻撃（サイバー攻撃）に対する防御（サイバーセキュリティ）において、時間的な定常性を持つ挙動は、通常の業務に関連する動作に該当する。また、ランダムな挙動は、標的型攻撃などのサイバー攻撃、機器の故障、又は、何らかの異常に該当する。 In defense (cybersecurity) against unauthorized attacks (cyberattacks) on computer networks, behavior with temporal stability corresponds to operations related to normal business. Also, random behavior corresponds to cyberattacks such as targeted attacks, equipment failures, or some kind of anomaly.

時間的な定常性を持つ挙動及びランダムな挙動のそれぞれに対応したパターンを検出することは、アラート（警報）を発報するセキュリティ機器において、通常は生じない異常（ａｎｏｍａｌｙ）を検出する上で重要である。 Detecting patterns corresponding to both temporally stationary behavior and random behavior is important in detecting anomalies that do not normally occur in security devices that issue alerts. is.

このように、シーケンスデータの分析において、時間的な定常性を持つ挙動とランダムな挙動とのパターンを明確に分けたい、という要求がある。 Thus, in the analysis of sequence data, there is a demand to clearly separate patterns of temporally stationary behavior and random behavior.

データ中の主要なパターンを検出する手法として、各種の手法が提案されている。例えば、このような手法として、主成分分析（ＰＣＡ：ＰｒｉｎｃｉｐａｌＣｏｍｐｏｎｅｎｔＡｎａｌｙｓｉｓ）又は特異値分解（ＳＶＤ：ＳｉｎｇｕｌａｒＶａｌｕｅＤｅｃｏｍｐｏｓｉｔｉｏｎ）などの行列分解が知られている。これらの手法は、シーケンスデータに対しても適用することができる。しかしながら、行列分解は、時間的な定常性を持つ挙動であるか、ランダムな挙動であるかを、区別できない。また、時間的な定常性を持つ挙動のように頻度の高いパターンが多数ある場合、行列分解は、ランダムな挙動を雑音と見なしてしまい、ランダムな挙動をパターンとして検出できない可能性がある。 Various techniques have been proposed as techniques for detecting major patterns in data. For example, matrix decomposition such as principal component analysis (PCA) or singular value decomposition (SVD) is known as such a method. These techniques can also be applied to sequence data. However, matrix decomposition cannot distinguish between temporally stationary behavior and random behavior. In addition, when there are many patterns with high frequency, such as behavior with temporal stationarity, the matrix decomposition may regard the random behavior as noise and may not be able to detect the random behavior as a pattern.

非特許文献１に記載の技術は、「タイムスタンプ，属性値」からなるデータを、周期に基づいて分割及び積層して行列形式のシーケンスデータを作成する。さらに、非特許文献１に記載の技術は、行列形式のシーケンスデータを基に、「周期数，周期内タイムスタンプ，属性値」から成るテンソル（ｔｅｎｓｏｒ）を生成する。このように周期に基づいて積層されたテンソル、つまり時間軸方向に積層されたテンソルには、時間方向における新たな冗長性が生まれる。非特許文献１に記載の技術は、このテンソルにテンソル分解を適用して、周期的なパターン（時間的な定常性を持つ挙動に対応したパターン）を検出する。加えて、非特許文献１に記載の技術は、時間的な定常性を持つ挙動から外れる挙動を、異常値として検出する。 The technique described in Non-Patent Document 1 divides and stacks data consisting of "timestamp and attribute value" based on the period to create sequence data in a matrix format. Furthermore, the technique described in Non-Patent Document 1 generates a tensor consisting of "the number of cycles, the time stamp within the cycle, and the attribute value" based on sequence data in a matrix format. In this way, tensors stacked based on periods, that is, tensors stacked in the direction of the time axis, create new redundancy in the direction of time. The technique described in Non-Patent Document 1 applies tensor decomposition to this tensor to detect periodic patterns (patterns corresponding to behavior with temporal stationarity). In addition, the technique described in Non-Patent Literature 1 detects behavior that deviates from temporally stationary behavior as an abnormal value.

上記のような行列又はテンソルにおける計算は、計算量が多く、さらに、結果の値が一つの値に収束しない場合がある。そこで、計算量を軽くするため、及び、結果を収束させるために、制約が用いられる。このような計算における制約として、正則化が、広く用いられている。このような正則化としては、スパース（Ｓｐａｒｓｅ）正則化がある。スパース正則化とは、多くの変数において、その値を０とする制約である。 Calculations on matrices or tensors such as those described above are computationally expensive and, moreover, the resulting values may not converge to a single value. Therefore, constraints are used to reduce the amount of computation and to converge the results. Regularization is widely used as a constraint in such computations. Such regularization includes sparse regularization. Sparse regularization is a constraint that sets the value to 0 in many variables.

非特許文献２は、スパース正則化の一種であるＧｒｏｕｐＬａｓｓｏ正則化を開示している。ＧｒｏｕｐＬａｓｓｏ正則化は、変数のグループに対して、そのグループに属する変数を同時に０につぶす、すなわち変数の値が疎になるように仕向ける作用を持つ正則化である。 Non-Patent Document 2 discloses Group Lasso regularization, which is a kind of sparse regularization. Group Lasso regularization is a regularization that has the effect of squeezing variables belonging to a group to 0 at the same time, that is, making the values of variables sparse.

非特許文献３は、ＧｒｏｕｐＬａｓｓｏを用いて、主成分分析における所定数（ｋ）の上位コンポーネント（上位ｋコンポーネント）を含む変数のグループを生成する。主成分分析では、コンポーネントは、上位になるほど、データの主要な成分を持つ。そのため、上記の変数のグループ（上位ｋコンポーネント）は、多くのデータが共通に持つ特徴を多く含んだ密なパターン、換言すると、多くのデータにおいて発現するパターンの推定となる。 Non-Patent Document 3 uses Group Lasso to generate a variable group containing a predetermined number (k) of top components (top k components) in principal component analysis. In principal component analysis, the higher the component, the more dominant the data. Therefore, the above group of variables (top k components) is a dense pattern containing many common features of a lot of data, in other words, an estimation of a pattern that appears in a lot of data.

非特許文献４は、ＧｒｏｕｐＬａｓｓｏ正則化を用いて、主成分分析で得られるパターンが異常値に頑健であるように仕向ける技術を開示している。 Non-Patent Document 4 discloses a technique of using Group Lasso regularization to make the pattern obtained by principal component analysis robust against outliers.

国際公開第２０１２／１３３１８５号WO2012/133185 特開２０１４－１４９８４０号公報JP 2014-149840 A 特開２０１１－２４７６９６号公報JP 2011-247696 A

Tsubasa Takahashi, Bryan Hooi, Christos Faloutsos, "AutoCyclone: Automatic Mining of Cyclic Online Activities with Robust Tensor Factorization", Proceedings of the 26th International Conference on World Wide Web, April 03-07, 2017, pp. 213-221Tsubasa Takahashi, Bryan Hooi, Christos Faloutsos, "AutoCyclone: Automatic Mining of Cyclic Online Activities with Robust Tensor Factorization", Proceedings of the 26th International Conference on World Wide Web, April 03-07, 2017, pp. 213-221 Ming Yuan, Yi Lin, "Model selection and estimation in regression with grouped variables", Journal of the Royal Statistical Society: Series B (Statistical Methodology), Volume 68, Issue 1, February 2006, pp.49-67Ming Yuan, Yi Lin, "Model selection and estimation in regression with grouped variables", Journal of the Royal Statistical Society: Series B (Statistical Methodology), Volume 68, Issue 1, February 2006, pp.49-67 Ruoyi Jiang, Hongliang Fei, Jun Huan, "Anomaly localization for network data streams with graph joint sparse PCA", KDD '11 Proceeding of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, August 21-24, 2011, pp. 886-894Ruoyi Jiang, Hongliang Fei, Jun Huan, "Anomaly localization for network data streams with graph joint sparse PCA", KDD '11 Proceeding of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, August 21-24, 2011, pp. 886-894 Gonzalo Mateos, Georgios B. Giannakis, "Robust PCA as Bilinear Decomposition With Outlier-Sparsity Regularization", IEEE Transactions on Signal Processing, Volume 60, Issue10, Oct. 2012, pp.5176-5190Gonzalo Mateos, Georgios B. Giannakis, "Robust PCA as Bilinear Decomposition With Outlier-Sparsity Regularization", IEEE Transactions on Signal Processing, Volume 60, Issue 10, Oct. 2012, pp.5176-5190

しかしながら、非特許文献１に記載の技術は、異常値に含まれるパターンを抽出することができない。 However, the technique described in Non-Patent Document 1 cannot extract patterns included in abnormal values.

また、非特許文献１に記載の技術は、入力データに時間方向の冗長性を持たせることで、周期的なパターンが抽出されやすいように工夫している。しかし、非特許文献１に記載の技術では、パターンを抽出するためのテンソル分解において、必ずしも、時間的な定常性を持つ挙動のパターンだけを抽出するように仕向けられた制約が加えられてはいない。そのため、非特許文献１に記載の技術では、ランダムな挙動であってもその頻度が高い場合、又は、ランダムな挙動に対応した観測値が大きい場合などでは、時間的な定常性を持つ挙動のパターンに、ランダムな挙動のパターンが混じってしまう可能性がある。 In addition, the technique described in Non-Patent Document 1 is devised so that periodic patterns can be easily extracted by providing input data with redundancy in the time direction. However, in the technique described in Non-Patent Document 1, the tensor decomposition for extracting patterns is not necessarily restricted to extract only behavior patterns with temporal stationarity. . Therefore, in the technique described in Non-Patent Document 1, even if the behavior is random, when the frequency is high, or when the observed value corresponding to the random behavior is large, behavior with temporal stationarity Patterns can be mixed with patterns of random behavior.

また、非特許文献３に開示された上位ｋコンポーネントは、単にデータセット全体における特徴的なパターンが集められたグループである。上位ｋコンポーネントは、必ずしも、時間的な定常性を持つ挙動のパターンの推定となるわけではない。 Also, the top k component disclosed in Non-Patent Document 3 is simply a group of characteristic patterns in the entire data set. The top k-components are not necessarily estimates of patterns of behavior that are temporally stationary.

非特許文献２は、ＧｒｏｕｐＬａｓｓｏ正則化を開示する文献であり、時間的な定常性を持つ挙動に対応したパターンとランダムな挙動とに対応したパターンとを区別する技術を開示していない。 Non-Patent Document 2 is a document that discloses Group Lasso regularization, and does not disclose a technique for distinguishing between a pattern corresponding to temporally stationary behavior and a pattern corresponding to random behavior.

非特許文献４は、主成分分析で得られるパターンにおける異常値に対する頑健性を向上する技術を開示している。しかし、非特許文献４は、時間的な定常性を持つ挙動に対応したパターンとランダムな挙動とに対応したパターンとを区別する技術を開示していない。 Non-Patent Document 4 discloses a technique for improving robustness against abnormal values in patterns obtained by principal component analysis. However, Non-Patent Document 4 does not disclose a technique for distinguishing patterns corresponding to temporally stationary behaviors from patterns corresponding to random behaviors.

特許文献１ないし３は、上記の時間的な定常性を持つ挙動のパターンとランダムな挙動のパターンとを区別する技術を開示していない。 Patent Literatures 1 to 3 do not disclose a technique for distinguishing between a behavior pattern having temporal stationarity and a random behavior pattern.

このように、特許文献１ないし３、及び、非特許文献１ないし４は、時間的な定常性を持つ挙動に対応したパターンとランダムな挙動に対応したパターンとを抽出できないという問題点があった。 Thus, Patent Literatures 1 to 3 and Non-Patent Literatures 1 to 4 have the problem that patterns corresponding to temporally stationary behaviors and patterns corresponding to random behaviors cannot be extracted. .

本発明の目的は、上記問題点を解決し、時間的な定常性を持つ挙動に対応したパターンと、ランダムな挙動に対応したパターンとを抽出する情報処理装置などを提供することにある。 SUMMARY OF THE INVENTION It is an object of the present invention to solve the above problems and to provide an information processing apparatus that extracts a pattern corresponding to temporally stationary behavior and a pattern corresponding to random behavior.

本発明の一形態における情報処理方法は、監視対象設備の異常を検知するための情報として、ランダムパターンを検出する報処理方法であって、データ提供装置が、工場内の監視対象設備の観測値を取得し、複数の時間と、時間における観測値とを第１のデータとして提供し、情報処理装置が、第１のデータを基に第１のデータを時間において積層した第２のデータを構築し、第２のデータを基に第１のデータにおける時間的な定常性を持つ観測値の組合せである定常パターンを抽出し、第１のデータと定常パターンとの時間における差分を生成し、差分を基に時間的な定常性を持たない観測値の組合せであるランダムパターンを抽出し、ランダムパターンを送信し、表示装置が、ランダムパターンを受信し、ランダムパターンを出力する。 An information processing method according to one aspect of the present invention is an information processing method for detecting a random pattern as information for detecting an abnormality in a monitoring target facility, wherein a data providing device detects an observed value of a monitoring target facility in a factory and provides a plurality of times and observation values in time as first data, and the information processing device builds second data by layering the first data in time based on the first data Then, based on the second data, extract a steady pattern that is a combination of observation values with temporal stationarity in the first data, generate a difference in time between the first data and the steady pattern, A random pattern, which is a combination of observed values that do not have temporal stationarity, is extracted based on , the random pattern is transmitted, and the display device receives the random pattern and outputs the random pattern.

本発明の一形態における情報処理システムは、監視対象設備の異常を検知するための情報として、ランダムパターンを検出する情報処理システムであって、工場内の監視対象設備の観測値を取得する取得手段と、複数の時間と、時間における観測値とを第１のデータとして提供する提供手段とを含むデータ提供装置と、第１のデータを基に第１のデータを時間において積層した第２のデータを構築し、第２のデータを基に第１のデータにおける時間的な定常性を持つ観測値の組合せである定常パターンを抽出する定常パターン抽出手段と、第１のデータと定常パターンとの時間における差分を生成する差分生成手段と、差分を基に時間的な定常性を持たない観測値の組合せであるランダムパターンを抽出するランダムパターン抽出手段と、ランダムパターンを送信する送信手段とを含む情報処理装置と、ランダムパターンを受信する受信手段と、ランダムパターンを出力する表示手段とを含む表示装置とを含む。 An information processing system according to one aspect of the present invention is an information processing system that detects a random pattern as information for detecting an abnormality in a monitored facility, and is an acquisition means for acquiring an observed value of the monitored facility in a factory. a data providing device including a plurality of times and a providing means for providing observation values in time as first data; and second data obtained by layering the first data in time based on the first data and a steady pattern extracting means for extracting a steady pattern that is a combination of observed values having temporal stationarity in the first data based on the second data, and the time between the first data and the steady pattern information including difference generation means for generating the difference in the difference, random pattern extraction means for extracting a random pattern that is a combination of observed values that do not have temporal stationarity based on the difference, and transmission means for transmitting the random pattern A display device including a processing device, receiving means for receiving the random pattern, and display means for outputting the random pattern.

本発明に基づけば、時間的な定常性を持つ挙動に対応したパターンとランダムな挙動に対応したパターンとを抽出できる。 According to the present invention, it is possible to extract a pattern corresponding to temporally stationary behavior and a pattern corresponding to random behavior.

図１は、本発明における第１の実施形態に係る情報処理装置の構成の一例を示すブロック図である。FIG. 1 is a block diagram showing an example of the configuration of an information processing apparatus according to the first embodiment of the invention. 図２は、第１の実施形態に係る情報処理装置の動作の一例を示すフローチャートである。2 is a flowchart illustrating an example of the operation of the information processing apparatus according to the first embodiment; FIG. 図３は、シーケンスデータの一例を示す図である。FIG. 3 is a diagram showing an example of sequence data. 図４は、サブデータの一例を示す図である。FIG. 4 is a diagram showing an example of sub-data. 図５は、構築されたテンソルの一例を示す図である。FIG. 5 is a diagram showing an example of a constructed tensor. 図６は、テンソル分解に基づいて得られたパターンの一例を示す図である。FIG. 6 is a diagram showing an example of patterns obtained based on tensor decomposition. 図７は、図６における定常パターンの一例を示す図である。FIG. 7 is a diagram showing an example of a regular pattern in FIG. 6. FIG. 図８は、サブデータにおける個別差分を示す図である。FIG. 8 is a diagram showing individual differences in sub-data. 図９は、図８に示されている個別差分を行列化した差分を示す図である。FIG. 9 is a diagram showing differences obtained by matrixing the individual differences shown in FIG. 図１０は、図９に示されている差分に対して行列分解を適用した結果の一例を示す図である。FIG. 10 shows an example of the result of applying matrix decomposition to the difference shown in FIG. 図１１は、ハードウェア構成の一例を示すブロック図である。FIG. 11 is a block diagram showing an example of the hardware configuration. 図１２は、第１の実施形態に係る情報処理システムの構成の一例を示す図である。FIG. 12 is a diagram illustrating an example of the configuration of an information processing system according to the first embodiment;

以下、図面を参照して、本発明における実施形態について説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

各図面は、本発明の実施形態を説明するものである。ただし、本発明は、各図面の記載に限られるわけではない。また、各図面の同様の構成には、同じ番号を付し、その繰り返しの説明を省略する場合がある。また、以下の説明に用いる図面において、本発明の説明に関係しない部分の構成については、記載を省略し、図示しない場合もある。 Each drawing illustrates an embodiment of the invention. However, the present invention is not limited to the description of each drawing. In addition, the same numbers are assigned to the same configurations in each drawing, and repeated description thereof may be omitted. In addition, in the drawings used for the following description, the description of the configuration of the portion that is not related to the description of the present invention may be omitted and may not be illustrated.

まず、本実施形態の説明における用語について整理する。 First, the terms used in the description of this embodiment will be sorted out.

「観測値」とは、監視対象における所定の挙動に対応した属性について観測された値である。観測値は、直接的に観測された値に限られず、間接的に観測された値（例えば、センサーが検出した値を変換した値、又は、複数の観測値を基に算出された値）でもよい。 An “observed value” is a value observed for an attribute corresponding to a predetermined behavior in the monitored object. Observed values are not limited to directly observed values, but also indirectly observed values (e.g., values converted from values detected by sensors, or values calculated based on multiple observed values). good.

監視対象（以下、「ソース」と呼ぶ場合もある）は、１つに限られず、複数でもよい。また、各監視対象における観測対象の属性は、１つに限られず、複数でもよい。さらに、属性は、全ての監視対象において同じである必要はない。属性は、少なくとも一部の監視対象において異なっていてもよい。 The number of monitoring targets (hereinafter sometimes referred to as "sources") is not limited to one, and may be plural. Also, the number of attributes of the observation target in each monitoring target is not limited to one, and may be plural. Furthermore, the attributes do not have to be the same for all targets. The attributes may be different in at least some monitored objects.

「タイムスタンプ」とは、観測値が観測された時間に関する情報である。つまり、タイムスタンプとは、観測における時間に関する情報である。タイムスタンプは、単一の時間ではなく、所定の幅を持った時間範囲でもよい。 A "timestamp" is information about the time when an observation was observed. In other words, a timestamp is information about the time in an observation. A time stamp may be a time range with a predetermined width instead of a single time.

観測値とタイムスタンプは、関連付けられている。 Observations and timestamps are associated.

各実施形態における情報処理装置は、観測値とタイムスタンプとを含むデータを受信し、以下で説明する動作を実行する。ただし、このデータは、少なくとも複数のタイムスタンプにおけるデータを含む。つまり、このデータは、シーケンスデータ（時系列データ）である。さらに、このデータは、各タイムスタンプにおいて複数の観測値を含む。ここで、複数の観測値とは、少なくとも監視対象又は属性のいずれかが複数となっている観測値のことである。なお、このデータは、一部の観測値が欠損したデータでもよい。 The information processing apparatus in each embodiment receives data including observation values and time stamps, and performs operations described below. However, this data includes data at least at multiple timestamps. That is, this data is sequence data (time-series data). Further, this data includes multiple observations at each timestamp. Here, multiple observation values are observation values for which at least one of the monitoring target and the attribute is multiple. Note that this data may be data in which some observed values are missing.

つまり、シーケンスデータは、複数の時間における複数の観測値を含むデータである。 In other words, sequence data is data that includes multiple observations at multiple times.

＜第１の実施形態＞
以下、図面を参照して、第１の実施形態について説明する。 <First embodiment>
A first embodiment will be described below with reference to the drawings.

［構成の説明］
まず、第１の実施形態に係る情報処理装置１００の構成について、図面を参照して説明する。 [Description of configuration]
First, the configuration of the information processing apparatus 100 according to the first embodiment will be described with reference to the drawings.

図１は、本発明における第１の実施形態に係る情報処理装置１００の構成の一例を示すブロック図である。図１に示されているように、情報処理装置１００は、定常パターン抽出部１０２と、差分生成部１０４と、ランダムパターン抽出部１０６とを含む。 FIG. 1 is a block diagram showing an example configuration of an information processing apparatus 100 according to the first embodiment of the present invention. As shown in FIG. 1, the information processing apparatus 100 includes a regular pattern extraction section 102, a difference generation section 104, and a random pattern extraction section .

定常パターン抽出部１０２は、シーケンスデータ（以下、「シーケンスデータＸ」、又は、単に「Ｘ」と示す場合もある）を取得する。 The regular pattern extraction unit 102 acquires sequence data (hereinafter sometimes referred to as “sequence data X” or simply “X”).

定常パターン抽出部１０２におけるシーケンスデータＸの取得元は、任意である。例えば、定常パターン抽出部１０２は、図示しない外部の装置からシーケンスデータＸを受信してもよい。あるいは、定常パターン抽出部１０２は、図示しない記憶部から、シーケンスデータＸを読み出してもよい。 The acquisition source of the sequence data X in the regular pattern extraction unit 102 is arbitrary. For example, the regular pattern extraction unit 102 may receive sequence data X from an external device (not shown). Alternatively, the regular pattern extraction unit 102 may read the sequence data X from a storage unit (not shown).

そして、定常パターン抽出部１０２は、シーケンスデータＸを、シーケンスデータＸにおけるタイムスタンプに基づいて複数のデータに分割する。以下、分割された各データを「サブデータ」と呼ぶ。 The regular pattern extraction unit 102 then divides the sequence data X into a plurality of pieces of data based on the time stamps in the sequence data X. FIG. Each piece of divided data is hereinafter referred to as "sub-data".

そして、定常パターン抽出部１０２は、分割したサブデータを積層したデータ（以下、テンソルと呼ぶ）を構築する。 Then, the regular pattern extraction unit 102 constructs data (hereinafter referred to as a tensor) in which the divided sub-data are layered.

このように、テンソルは、サブデータを積層して構築される。また、サブデータのオーダーは、シーケンスデータＸのオーダーと同じである。そのため、構築されるテンソルのオーダーは、「シーケンスデータＸのオーダー＋１」となる。例えば、シーケンスデータＸのオーダーがｍ次の場合、テンソルのオーダーは、ｍ＋１次となる。より具体的には、例えば、シーケンスデータＸが行列の場合、ｍは「２」である。そのため、この場合、テンソルのオーダーは、「３＝２＋１」である。なお、このようなテンソルのデータ形式は、シーケンスデータが持っている時間的冗長性を、より明確に表現したデータ形式である。 Thus, a tensor is built by layering sub-data. Also, the order of the sub-data is the same as the order of the sequence data X. Therefore, the order of the constructed tensor is "the order of the sequence data X + 1". For example, if the order of the sequence data X is m, the order of the tensor is m+1. More specifically, for example, m is "2" when the sequence data X is a matrix. Therefore, in this case, the order of the tensor is "3=2+1". Such a tensor data format is a data format that more clearly expresses the temporal redundancy that sequence data has.

なお、シーケンスデータ及びテンソルは、同様の観測値を含むデータの集合である。そこで、以下、シーケンスデータを「第１のデータ」と、テンソルを「第２のデータ」と呼ぶ場合もある。 Note that sequence data and tensors are collections of data containing similar observations. Therefore, hereinafter, the sequence data may be called "first data", and the tensor may be called "second data".

定常パターン抽出部１０２は、このテンソルを対象として、時間的な定常性を持つ挙動のパターン（以下、「定常パターン（ｃｏｎｓｔａｎｔｐａｔｔｅｒｎ）」と呼ぶ）を抽出する。以下の説明において、定常パターンを、「定常パターンＰ」、又は、単に「Ｐ」と示す場合もある。 The constant pattern extraction unit 102 extracts a behavior pattern having temporal continuity (hereinafter referred to as a “constant pattern”) from this tensor. In the following description, the regular pattern may be indicated as "regular pattern P" or simply as "P".

シーケンスデータＸは、観測値の組合せの集合である。定常パターンＰは、シーケンスデータＸにおける時間的な定常性を持つパターンである。ここで、パターンとは、観測値の組合せである。つまり、定常パターンＰとは、シーケンスデータＸにおける時間的な定常性を持つ観測値の組合せである。 Sequence data X is a set of observation value combinations. A constant pattern P is a pattern in the sequence data X that has temporal continuity. Here, a pattern is a combination of observed values. In other words, the steady pattern P is a combination of observed values in the sequence data X that have temporal constancy.

具体的には、定常パターン抽出部１０２は、定常パターンＰの抽出において、時間的な定常性を備えているデータに対応したパターンが抽出されるように制約を付したテンソル因子分解（ｔｅｎｓｏｒｆａｃｔｏｒｉｚａｔｉｏｎ）を用いる。 Specifically, in extracting the stationary pattern P, the stationary pattern extraction unit 102 performs tensor factorization with restrictions so that patterns corresponding to data having temporal stationarity are extracted. Use

そして、定常パターン抽出部１０２は、定常パターンＰを差分生成部１０４に送信する。 The regular pattern extractor 102 then transmits the regular pattern P to the difference generator 104 .

定常パターン抽出部１０２は、抽出した定常パターンＰを図示しない外部の装置に出力してもよい。あるいは、定常パターン抽出部１０２は、抽出した定常パターンＰを図示しない情報処理装置１００の内部の処理部に送信してもよい。あるいは、定常パターン抽出部１０２は、抽出した定常パターンＰを図示しない記憶部に保存してもよい。 The regular pattern extraction unit 102 may output the extracted regular pattern P to an external device (not shown). Alternatively, the regular pattern extraction unit 102 may transmit the extracted regular pattern P to a processing unit inside the information processing apparatus 100 (not shown). Alternatively, the regular pattern extraction unit 102 may store the extracted regular pattern P in a storage unit (not shown).

差分生成部１０４は、シーケンスデータＸから定常パターンＰに対応する部分を取り除いた差分Δを抽出する。本実施形態の説明では、差分生成部１０４は、差分Δを、シーケンスデータＸと同じ形式を用いて、抽出する。例えば、シーケンスデータＸがｍ次のオーダーのテンソルの場合、差分Δはｍ次のオーダーのテンソルである。あるいは、シーケンスデータＸが行列の場合、差分Δは行列である。ただし、差分生成部１０４は、他の形式を用いてもよい。 The difference generator 104 extracts the difference Δ obtained by removing the portion corresponding to the regular pattern P from the sequence data X. FIG. In the description of this embodiment, the difference generator 104 extracts the difference Δ using the same format as the sequence data X. For example, if the sequence data X is a tensor of order m, the difference Δ is a tensor of order m. Alternatively, if the sequence data X is a matrix, the difference Δ is a matrix. However, the difference generator 104 may use another format.

差分生成部１０４は、差分Δをランダムパターン抽出部１０６に送信する。 Difference generation section 104 transmits difference Δ to random pattern extraction section 106 .

なお、差分生成部１０４は、定常パターン抽出部１０２と同様にシーケンスデータＸを取得してもよい。あるいは、差分生成部１０４は、定常パターンＰと同様に、定常パターン抽出部１０２からシーケンスデータＸを受信してもよい。 Note that the difference generator 104 may acquire the sequence data X in the same manner as the steady pattern extractor 102 . Alternatively, the difference generation section 104 may receive the sequence data X from the regular pattern extraction section 102 in the same manner as the regular pattern P.

ランダムパターン抽出部１０６は、差分Δを基に時間的な定常性を持たない観測値の組合せであるランダムパターン（ｒａｎｄｏｍｐａｔｔｅｒｎ）を抽出する。以下、ランダムパターンを、「ランダムパターンＺ」、又は、単に「Ｚ」と示す場合もある。 The random pattern extraction unit 106 extracts a random pattern, which is a combination of observed values that do not have temporal stationarity, based on the difference Δ. Hereinafter, the random pattern may be indicated as "random pattern Z" or simply "Z".

ランダムパターン抽出部１０６は、抽出したランダムパターンＺを図示しない外部の装置に出力してもよい。あるいは、ランダムパターン抽出部１０６は、抽出したランダムパターンＺを図示しない情報処理装置１００の内部の処理部に送信してもよい。あるいは、ランダムパターン抽出部１０６は、抽出したランダムパターンＺを図示しない記憶部に保存してもよい。 The random pattern extraction unit 106 may output the extracted random pattern Z to an external device (not shown). Alternatively, the random pattern extraction unit 106 may transmit the extracted random pattern Z to a processing unit inside the information processing apparatus 100 (not shown). Alternatively, the random pattern extraction unit 106 may store the extracted random pattern Z in a storage unit (not shown).

次に、図面を参照して、情報処理装置１００の動作について説明する。 Next, operations of the information processing apparatus 100 will be described with reference to the drawings.

図２は、第１の実施形態に係る情報処理装置１００の動作の一例を示すフローチャートである。 FIG. 2 is a flow chart showing an example of the operation of the information processing apparatus 100 according to the first embodiment.

まず、定常パターン抽出部１０２は、シーケンスデータＸを基に、テンソルを構築する（ステップＳ１１）。詳細には、まず、定常パターン抽出部１０２は、シーケンスデータＸにおけるタイムスタンプに基づいて、シーケンスデータＸをサブデータに分割する。そして、定常パターン抽出部１０２は、分割したサブデータを積層して、テンソルを構築する。 First, the regular pattern extraction unit 102 constructs a tensor based on the sequence data X (step S11). Specifically, first, the regular pattern extraction unit 102 divides the sequence data X into sub-data based on the time stamps in the sequence data X. FIG. Then, the regular pattern extraction unit 102 builds a tensor by stacking the divided sub-data.

図３は、シーケンスデータＸの一例を示す図である。図３に示されているシーケンスデータＸは、（ｓｒｃ，ｄａｙ，ｍｓｇ）から構成されるタプル（ｔｕｐｌｅ）から成る。なお、タプルとは、複数の構成要素（図３では、ｓｒｃ、ｄａｙ、ｍｓｇ）からなる組の総称である。 FIG. 3 is a diagram showing an example of sequence data X. As shown in FIG. The sequence data X shown in FIG. 3 consists of a tuple consisting of (src, day, msg). Note that a tuple is a general term for a set consisting of a plurality of components (src, day, msg in FIG. 3).

図３において、ｓｒｃは、監視対象、つまり、データの情報源（ｓｏｕｒｃｅ）に関する情報である。例えば、ｓｒｃは、ソースのアドレスである。例えば、シーケンスデータＸがコンピュータ・ネットワークのログの場合、ソースのアドレスは、ソースのＩＰ（ＩｎｔｅｒｎｅｔＰｒｏｔｏｃｏｌ）アドレスである。 In FIG. 3, src is information about the monitored object, that is, the data source. For example, src is the address of the source. For example, if the sequence data X is a computer network log, the source address is the IP (Internet Protocol) address of the source.

ｄａｙは、時間に関する情報、つまりタイムスタンプである。ｄａｙは、例えば、日付及び／又は時間である。 day is information about time, that is, a time stamp. day is, for example, date and/or time.

ｍｓｇは、シーケンスデータＸにおける属性の種類を示す情報である。例えば、ｍｓｇは、所定のメッセージの種類（例えば、障害内容）の区別を示す情報である。 msg is information indicating the type of attribute in the sequence data X. FIG. For example, msg is information indicating a distinction between predetermined message types (for example, failure content).

図３の場合、各セルは、それぞれのメッセージを何度受け取ったかを示している。例えば、図３の最も左の最上位のセルは、ソースｓｒｃ１が、タイムスタンプｄａｙ１に受け取ったｍｓｇ１の回数である。 In the case of FIG. 3, each cell indicates how many times each message has been received. For example, the leftmost top cell in FIG. 3 is the number of times msg1 was received by source src1 on timestamp day1.

図３において、横線を付したセル及び網掛けを付したセルは、０でない値が入っていることを示している。ただし、図３において、回数は省略している。 In FIG. 3, cells with horizontal lines and shaded cells indicate values other than zero. However, in FIG. 3, the number of times is omitted.

さらに、図３において、横線を付したセルは、時間的な定常性を持つ挙動に対応したパターン（定常パターンＰ）を構成するセルである。網掛けを付したセルは、ランダムな挙動に対応したパターン（ランダムパターンＺ）を構成するセルである。 Furthermore, in FIG. 3, cells with horizontal lines are cells that form a pattern (stationary pattern P) corresponding to a behavior having temporal stationarity. A shaded cell is a cell that constitutes a pattern (random pattern Z) corresponding to random behavior.

ただし、第１の実施形態において、各セルは、事前に、定常パターンＰとランダムパターンＺとして把握されている必要はない。図３における区別は、以降の説明の分かりやすくするためである。また、以降の説明において、説明を分かりやすくするため、行列分解及びテンソル分解で生じる誤差等は、省略している。 However, in the first embodiment, each cell does not need to be grasped in advance as a regular pattern P and a random pattern Z. FIG. The distinction in FIG. 3 is for facilitating the understanding of the following description. Also, in the following description, errors and the like that occur in matrix decomposition and tensor decomposition are omitted for the sake of clarity.

なお、図３において、ソース及び属性は、１次元のデータとして示されている。ただし、これは説明の便宜のためである。本実施形態の具体的な処理においては、ソース及び／又は属性は、多次元のデータでもよい。 Note that in FIG. 3, the sources and attributes are shown as one-dimensional data. However, this is for convenience of explanation. In specific processing of this embodiment, the sources and/or attributes may be multi-dimensional data.

定常パターン抽出部１０２は、図３のシーケンスデータＸを、時間情報ｄａｙ（タイムスタンプ）に基づいて、サブデータに分割する。 The regular pattern extraction unit 102 divides the sequence data X in FIG. 3 into sub-data based on the time information day (time stamp).

図４は、サブデータの一例を示す図である。図４に示されているように、サブデータは、時間情報ｄａｙ（タイムスタンプ）に関して分割されている。 FIG. 4 is a diagram showing an example of sub-data. As shown in FIG. 4, the sub-data is divided with respect to time information day (time stamp).

さらに定常パターン抽出部１０２は、図４に示されているサブデータを、時間方向に積層することで、テンソルを構築する。 Furthermore, the regular pattern extraction unit 102 constructs a tensor by layering the sub-data shown in FIG. 4 in the time direction.

図５は、構築されたテンソルの一例を示す図である。 FIG. 5 is a diagram showing an example of a constructed tensor.

定常パターン抽出部１０２は、テンソルを基に、定常パターンＰを抽出する（ステップＳ１３）。詳細には、定常パターン抽出部１０２は、対象となるテンソルに対して、時間的な定常性を持つ挙動のパターンが抽出されるように制約を付したテンソル因子分解を用いる。 The steady pattern extraction unit 102 extracts a steady pattern P based on the tensor (step S13). More specifically, the stationary pattern extraction unit 102 uses tensor factorization, which is constrained so that patterns of behavior with temporal stationarity are extracted from the target tensor.

図６は、テンソル因子分解に基づいて得られたパターンの一例を示す図である。図６は、定常パターンＰを左側に、ランダムパターンＺを右側に示している。 FIG. 6 shows an example of a pattern obtained based on tensor factorization. FIG. 6 shows a regular pattern P on the left and a random pattern Z on the right.

図６は、パターンを分かりやすくするため、パターンをテンソルの各軸のグループに分けて示している。例えば、左端のパターンは、横方向のセルの集合であるｍｓｇ軸のデータと、縦方向のセルの集合であるｓｒｃ軸のデータと、斜めのセルの集合がｄａｙ軸（タイムスタンプ、つまり時間軸）のデータとを含む。図６に示されているように、定常パターンＰは、時間軸（斜めのセル）が密（データがある、又は、ほぼ全体にある状態）となっている。一方、ランダムパターンは、時間軸（斜めのセル）が疎（データがない、又は、ほとんどない状態）となっている。 FIG. 6 shows the patterns divided into groups for each axis of the tensor to make the patterns easier to understand. For example, the pattern on the far left has data on the msg axis, which is a set of cells in the horizontal direction, data on the src axis, which is a set of cells in the vertical direction, and a set of diagonal cells on the day axis (time stamp, that is, the time axis). ) data and As shown in FIG. 6, the stationary pattern P has a dense time axis (diagonal cells) (there is data or there is data almost entirely). On the other hand, in the random pattern, the time axis (diagonal cells) is sparse (no or almost no data).

図６に示されている定常パターンＰを得るためのテンソル因子分解の具体例を説明する。 A specific example of tensor factorization for obtaining the stationary pattern P shown in FIG. 6 will be described.

定常パターン抽出部１０２は、時間的な定常性を持つ挙動のパターンを抽出するため、制約としてスパース正則化を適用したテンソル因子分解を用いる。より具体的には、定常パターン抽出部１０２は、スパース正則化として、ＧｒｏｕｐＬａｓｓｏ正則化を、テンソルのｄａｙ軸（時間軸）に対して付加する。 The stationary pattern extraction unit 102 uses tensor factorization to which sparse regularization is applied as constraints in order to extract patterns of behavior with temporal stationarity. More specifically, the steady pattern extraction unit 102 adds Group Lasso regularization as sparse regularization to the day axis (time axis) of the tensor.

ＧｒｏｕｐＬａｓｓｏ正則化は、スパース正則化の一種であり、グループとして設定した変数の組を同時に０につぶす効果を持つ。この効果（変数の組を同時に０につぶす効果）は、組となっている変数が同程度に小さい値を持つときに生じる。換言すると、定常的な値の出現がグループとして設定した変数の組に所定回数以上ある場合、ＧｒｏｕｐＬａｓｓｏ正則化において、そのグループの少なくとも一部の変数は、０につぶれない。また、定常性が高く常に値が出現する変数の組は、全てにおいて０につぶれない。 Group Lasso regularization is a kind of sparse regularization, and has the effect of collapsing a set of variables set as a group to 0 at the same time. This effect (the effect of collapsing a set of variables to 0 simultaneously) occurs when the variables in the set have similarly small values. In other words, when a set of variables set as a group has a steady value occurrence more than a predetermined number of times, at least some of the variables in the group do not collapse to 0 in Group Lasso regularization. Also, a set of variables with high stationarity and values that always appear does not collapse to 0 at all.

定常パターン抽出部１０２は、このようなＧｒｏｕｐＬａｓｓｏ正則化をテンソル因子分解に付与することで、定常パターンＰのグループと、ランダムパターンＺのグループとに対して異なる効果を実現できる。具体的には、定常パターン抽出部１０２は、ＧｒｏｕｐＬａｓｓｏ正則化を用いて、ランダムパターンＺのグループの変数及びランダムに発生するノイズを０につぶす。すなわち、定常パターン抽出部１０２は、ＧｒｏｕｐＬａｓｓｏ正則化を用いて、ランダムパターンＺがパターンとして検出されないようにする。このとき、定常パターンＰにおいても、多くの変数は、０につぶれる。しかし、定常パターンＰにおいて、少なくとも一部の変数は、０につぶれない。 The stationary pattern extraction unit 102 can achieve different effects for the stationary pattern P group and the random pattern Z group by applying such Group Lasso regularization to the tensor factorization. Specifically, the stationary pattern extraction unit 102 uses Group Lasso regularization to reduce the group variables and randomly generated noise of the random pattern Z to zero. That is, the regular pattern extraction unit 102 uses Group Lasso regularization to prevent the random pattern Z from being detected as a pattern. At this time, even in the stationary pattern P, many variables collapse to zero. However, in the stationary pattern P, at least some variables do not collapse to 0.

定常パターン抽出部１０２は、定常パターンＰとして、常に０でない変数（非ゼロ変数）のパターンを抽出すればよい。ただし、定常パターン抽出部１０２が抽出するパターンは、上記に限定されない。例えば、定常パターン抽出部１０２は、定常パターンＰとして、非ゼロ変数の出現回数、又は、非ゼロ変数の比率が所定の閾値より多いパターンを抽出してもよい。得られるパターンは、時間的な定常性を持つ挙動のパターン（定常パターンＰ）となる。 The regular pattern extraction unit 102 may extract, as the regular pattern P, patterns of variables that are not always 0 (non-zero variables). However, the pattern extracted by the regular pattern extraction unit 102 is not limited to the above. For example, the regular pattern extraction unit 102 may extract, as the regular pattern P, a pattern in which the number of appearances of non-zero variables or the ratio of non-zero variables is greater than a predetermined threshold. The resulting pattern is a pattern of behavior (stationary pattern P) having temporal constancy.

図７は、図６における定常パターンＰの一例を示す図である。 FIG. 7 is a diagram showing an example of the regular pattern P in FIG. 6. As shown in FIG.

ＧｒｏｕｐＬａｓｓｏ正則化を用いた方法として、Ｓｏｆｔ－ｔｈｒｅｓｈｏｌｄｉｎｇが知られている。Ｓｏｆｔ－ｔｈｒｅｓｈｏｌｄｉｎｇを用いる場合、変数のグループｇに属する値ｖのＧｒｏｕｐＬａｓｓｏ正則化を用いたスパースとなる推定値（ｖ_ｅ）は、以下の数式１を用いて導出される。
［数式１］

Soft-thresholding is known as a method using Group Lasso regularization. With soft-thresholding, a sparse estimate (v _e ) using Group Lasso regularization of values v belonging to a group g of variables is derived using Equation 1 below.
[Formula 1]

ここで、「（・）_＋」は、「ｍａｘ（・，０）」である。つまり、「（・）_＋」は、括弧内の値が正の場合、括弧内の値であり、括弧内の値が負の値の場合、０である。また、||・||_２は、Ｌ２ノルムである。λは、正則化の効果を決定するパラメータであり、予め設定されている値である。つまり、グループｇのＬ２ノルムが所定の値より小さい場合、推定値（Ｖ_ｅ）は、０となる。 Here, "(·) ₊ " is "max(·, 0)". That is, "(·) ₊ " is the value within the brackets if the value within the brackets is positive and 0 if the value within the brackets is negative. ||·|| ₂ is the L2 norm. λ is a parameter that determines the effect of regularization and is a preset value. That is, the estimated value (V _e ) is zero when the L2 norm of group g is smaller than a predetermined value.

続いて、差分生成部１０４が、サブデータと定常パターンＰとの差分（以下、「個別差分Δ^＊」と呼ぶ）を抽出する（ステップＳ１５）。サブデータは、時間的に分割されたデータである。そのため、個別差分Δ^＊は、時間的に分割された差分である。 Subsequently, the difference generator 104 extracts the difference between the sub data and the regular pattern P (hereinafter referred to as "individual difference Δ ^* ") (step S15). Sub-data is temporally divided data. Therefore, the individual differences Δ ^* are temporally divided differences.

図８は、サブデータにおける個別差分Δ^＊の一例を示す図である。 FIG. 8 is a diagram showing an example of individual differences Δ ^* in sub-data.

続いて、差分生成部１０４は、個別差分Δ^＊に対して所定の処理を施して、シーケンスデータＸと同じ形式（テンソルの形式）のデータである差分Δを生成する（ステップＳ１７）。なお、シーケンスデータＸが、図３に示すような行列の場合、この処理は、一般的に行列化（Ｍａｔｒｉｃｉｚａｔｉｏｎ）と呼ばれる処理となる。差分Δは、シーケンスデータＸと定常パターンＰとの時間における差分である。 Subsequently, the difference generation unit 104 performs a predetermined process on the individual difference Δ ^* to generate a difference Δ that is data in the same format (tensor format) as the sequence data X (step S17). When the sequence data X is a matrix as shown in FIG. 3, this process is generally called matricization. The difference Δ is the difference in time between the sequence data X and the regular pattern P. As shown in FIG.

図９は、図８に示されている個別差分Δ^＊を行列化した差分Δを示す図である。 FIG. 9 is a diagram showing differences Δ obtained by matrixing the individual differences Δ ^* shown in FIG. 8 .

そして、ランダムパターン抽出部１０６が、差分Δを基にランダムパターンＺを抽出する（ステップＳ１９）。ランダムパターン抽出部１０６が用いる手法は、任意である。例えば、ランダムパターン抽出部１０６は、差分Δに対して、スパースとなる推定値を用いた行列分解、頻出アイテム集合抽出、又はクラスタリングなどの分析手法を用いて、ランダムパターンＺを抽出する。ここで、行列分解は、例えば、特異値分解又は非負値行列分解（Ｎｏｎ－ｎｅｇａｔｉｖｅＭａｔｒｉｘＦａｃｔｏｒｉｚａｔｏｎ）などである。 Then, the random pattern extraction unit 106 extracts the random pattern Z based on the difference Δ (step S19). The method used by the random pattern extraction unit 106 is arbitrary. For example, the random pattern extraction unit 106 extracts a random pattern Z from the difference Δ using an analysis method such as matrix decomposition using sparse estimated values, frequent item set extraction, or clustering. Here, matrix decomposition is, for example, singular value decomposition or non-negative matrix factorization.

図１０は、図９に示されている差分Δに対してスパースとなる推定値を用いた行列分解を適用した結果の一例を示す図である。ランダムパターン抽出部１０６は、差分Δを、二つの行列（ＵとＶ）に分割する。 FIG. 10 is a diagram showing an example of the result of applying matrix decomposition using sparse estimates to the difference Δ shown in FIG. Random pattern extraction section 106 divides difference Δ into two matrices (U and V).

図１０において、行列Ｖは、ランダムパターンＺの集合（ＰＲ１ないしＰＲ４）である。行列Ｖの各行が、ランダムパターンＺである。以下、各ランダムパターンＺを区別して説明する必要がない場合、ランダムパターンＺが複数（集合となる）場合を含めてランダムパターンＺと呼ぶ。 In FIG. 10, matrix V is a set of random patterns Z (PR1 to PR4). Each row of matrix V is a random pattern Z. Hereinafter, when it is not necessary to distinguish each random pattern Z, the random pattern Z will be referred to as a random pattern Z, including the case where there are a plurality of random patterns Z (in a set).

また、行列Ｕは、ランダムパターンＺの発現位置（図１０ではｓｒｃ１ないし５）、時間（図１０ではｄａｙ１ないし５）、及び発現するランダムパターンＺ（図１０ではＰＲ１ないし４）を示す。具体的には、行列Ｕの縦線を付したセルにおいて、ランダムパターンＺが発現する。以下の説明では、行列Ｕを「発現情報」とも呼ぶ。 Also, the matrix U indicates the appearance position of the random pattern Z (src1 to 5 in FIG. 10), the time (days 1 to 5 in FIG. 10), and the random pattern Z to appear (PR1 to 4 in FIG. 10). Specifically, the random pattern Z appears in the cells of the matrix U marked with vertical lines. In the following description, the matrix U is also called "expression information".

例えば、行列Ｕの３行目の発現情報は、ソースｓｒｃ３の時間ｄａｙ１において、３番目のランダムパターンＺ（ＰＲ３）が発現したことを示す。 For example, the expression information in the third row of matrix U indicates that the third random pattern Z (PR3) was expressed at time day1 of source src3.

なお、図１０において、ｒは、抽出されたランダムパターンＺの数である。ｒは、データの属性数（例えば、図３の列数）及びテンソルのランク（階数）に依存する。図９に示されている差分Δは、属性として４つのｍｓｇを持つ。そのため、ランダムパターンＺの数（ｒ）の最大は、「４」である。図１０は、ｒの最大数までランダムパターンＺを抽出した例を示している。ただし、ｒの値は、最大数に限られない。ｒの値は、予め、１以上最大数以下の任意の数が設定されていればよい。 In FIG. 10, r is the number of random patterns Z extracted. r depends on the number of attributes of the data (eg number of columns in FIG. 3) and the rank of the tensor. The difference Δ shown in FIG. 9 has four msg attributes. Therefore, the maximum number (r) of random patterns Z is "4". FIG. 10 shows an example of extracting random patterns Z up to the maximum number of r. However, the value of r is not limited to the maximum number. Any number from 1 to the maximum number may be set in advance as the value of r.

なお、定常パターン抽出部１０２は、定常パターンＰを抽出する。そのため、定常パターン抽出部１０２の対象となるデータ（今の場合、シーケンスデータＸ）は、時間的な冗長性を持っている必要がある。 Note that the regular pattern extraction unit 102 extracts the regular pattern P. FIG. Therefore, the data (sequence data X in this case) to be processed by the steady pattern extraction unit 102 must have temporal redundancy.

これに対し、ランダムパターン抽出部１０６は、ランダムパターンＺ（時間的な定常性を持たないデータに対応するパターン）を抽出する。そのため、ランダムパターン抽出部１０６の対象となるデータは、必ずしも時間的な冗長性を持っている必要はない。時間的な定常性を持たない挙動は、発現頻度が低く、発現位置も不規則になる場合が多い。そのため、ランダムパターン抽出部１０６の対象となるデータは、多くのタプルを含むデータであることが望ましい。そのため、ランダムパターン抽出部１０６は、差分Δに加え、他のデータを用いてもよい。 On the other hand, the random pattern extraction unit 106 extracts a random pattern Z (a pattern corresponding to data without temporal stationarity). Therefore, the data targeted by the random pattern extraction unit 106 does not necessarily have temporal redundancy. Behaviors that do not have temporal stationarity often occur at low frequency and at irregular positions. Therefore, it is desirable that the data to be processed by the random pattern extraction unit 106 is data containing many tuples. Therefore, the random pattern extraction unit 106 may use other data in addition to the difference Δ.

［効果の説明］
このように、第１の実施形態に係る情報処理装置１００は、時間的な定常性を持つ挙動に対応したパターンと、ランダムな挙動に対応したパターンとを抽出するとの効果を得ることができる。 [Explanation of effect]
In this manner, the information processing apparatus 100 according to the first embodiment can obtain the effect of extracting a pattern corresponding to a behavior having temporal stationarity and a pattern corresponding to a random behavior.

その理由は、次のとおりである。 The reason is as follows.

定常パターン抽出部１０２が、複数の時間における複数の観測値を含むシーケンスデータＸ（第１のデータ）を基にシーケンスデータＸ（第１のデータ）を時間において積層したテンソル（第２のデータ）を構築する。そして、定常パターン抽出部１０２が、テンソル（第２のデータ）を基にシーケンスデータＸ（第１のデータ）における時間的な定常性を持つ観測値の組合せである定常パターンＰを抽出する。そして、差分生成部１０４が、シーケンスデータＸ（第１のデータ）と定常パターンＰとの時間における差分Δを生成する。そして、ランダムパターン抽出部１０６が、差分Δを基に時間的な定常性を持たない観測値の組合せであるランダムパターンＺを抽出する。 The steady pattern extraction unit 102 generates a tensor (second data) obtained by layering the sequence data X (first data) in time based on the sequence data X (first data) including a plurality of observed values at a plurality of times. to build. Based on the tensor (second data), the steady pattern extraction unit 102 extracts a steady pattern P, which is a combination of observed values having temporal continuity in the sequence data X (first data). Then, the difference generator 104 generates a difference Δ between the sequence data X (first data) and the regular pattern P in time. Based on the difference Δ, the random pattern extraction unit 106 extracts a random pattern Z, which is a combination of observation values that do not have temporal stationarity.

このように、情報処理装置１００において、まず、定常パターン抽出部１０２が、複数の時間における複数の観測値を含むシーケンスデータＸを時間において積層したテンソルを構築する。そのため、構築されたテンソルにおいて、時間的な冗長性が、明確となっている。そして、定常パターン抽出部１０２は、テンソルを基に定常パターンＰを抽出する。つまり、定常パターン抽出部１０２は、時間的な冗長性を明確となったテンソルを用いて定常パターンＰを抽出するため、時間的な定常性を持つ挙動に対応した定常パターンＰを抽出することができる。 As described above, in the information processing apparatus 100, first, the regular pattern extraction unit 102 constructs a tensor in which sequence data X including a plurality of observed values at a plurality of times are layered in time. So the temporal redundancy is evident in the constructed tensor. Then, the regular pattern extraction unit 102 extracts the regular pattern P based on the tensor. That is, since the regular pattern extraction unit 102 extracts the regular pattern P using a tensor with clear temporal redundancy, it is possible to extract the regular pattern P corresponding to the behavior having temporal stationarity. can.

そして、差分生成部１０４が、シーケンスデータＸと定常パターンＰとの時間的な差として差分Δを生成する。定常パターンＰは、時間的な冗長性を用いて抽出されている。そのため、差分Δは、時間的な冗長性を考慮した結果として、時間的な定常性を持たないデータとなる。 Then, the difference generator 104 generates the difference Δ as the temporal difference between the sequence data X and the steady pattern P. FIG. A stationary pattern P is extracted using temporal redundancy. Therefore, the difference Δ becomes data without temporal stationarity as a result of considering temporal redundancy.

そして、ランダムパターン抽出部１０６が、差分Δを用いて時間的な定常性を持たないランダムパターンＺを抽出する。差分Δは、時間的な冗長性を考慮して生成されたデータである。そのため、ランダムパターン抽出部１０６は、時間的な冗長性を考慮した結果として、時間的な定常性を持たない挙動に対応したランダムパターンＺを抽出することができる。 Then, the random pattern extraction unit 106 extracts a random pattern Z that does not have temporal stationarity using the difference Δ. The difference Δ is data generated considering temporal redundancy. Therefore, the random pattern extraction unit 106 can extract a random pattern Z corresponding to a behavior that does not have temporal stationarity as a result of considering temporal redundancy.

このように、情報処理装置１００は、時間的な定常性を持つ挙動に対応した定常パターンＰと、ランダムな挙動に対応したランダムパターンＺとを抽出できる。 In this way, the information processing apparatus 100 can extract a steady pattern P corresponding to behavior with temporal constancy and a random pattern Z corresponding to random behavior.

［ハードウェアの構成］
以上の説明した情報処理装置１００は、次のように構成される。例えば、情報処理装置１００の各構成部は、ハードウェア回路で構成されてもよい。あるいは、情報処理装置１００において、各構成部は、ネットワークを介して接続した複数の装置を用いて構成されてもよい。あるいは、情報処理装置１００は、複数の構成部を１つのハードウェアで構成されてもよい。 [Hardware configuration]
The information processing apparatus 100 described above is configured as follows. For example, each component of the information processing apparatus 100 may be configured by a hardware circuit. Alternatively, in the information processing apparatus 100, each component may be configured using a plurality of devices connected via a network. Alternatively, the information processing apparatus 100 may have a plurality of components configured by one piece of hardware.

あるいは、情報処理装置１００は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）と、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）と、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）とを含むコンピュータ装置として実現されてもよい。情報処理装置１００は、上記構成に加え、さらに、入出力接続回路（ＩＯＣ：ＩｎｐｕｔａｎｄＯｕｔｐｕｔＣｉｒｃｕｉｔ）を含むコンピュータ装置として実現されてもよい。あるいは、情報処理装置１００は、上記構成に加え、さらに、ネットワークインターフェース回路（ＮＩＣ：ＮｅｔｗｏｒｋＩｎｔｅｒｆａｃｅＣｉｒｃｕｉｔ）を含むコンピュータ装置として実現されてもよい。 Alternatively, the information processing apparatus 100 may be realized as a computer device including a CPU (Central Processing Unit), a ROM (Read Only Memory), and a RAM (Random Access Memory). The information processing device 100 may be implemented as a computer device that further includes an input/output connection circuit (IOC: Input and Output Circuit) in addition to the above configuration. Alternatively, the information processing apparatus 100 may be realized as a computer apparatus that further includes a network interface circuit (NIC: Network Interface Circuit) in addition to the above configuration.

図１１は、情報処理装置１００のハードウェアの構成の一例を示す情報処理装置６００の構成を示すブロック図である。 FIG. 11 is a block diagram showing the configuration of an information processing device 600 showing an example of the hardware configuration of the information processing device 100. As shown in FIG.

情報処理装置６００は、ＣＰＵ６１０と、ＲＯＭ６２０と、ＲＡＭ６３０と、内部記憶装置６４０と、ＩＯＣ６５０と、ＮＩＣ６８０とを含み、コンピュータ装置を構成している。 The information processing device 600 includes a CPU 610, a ROM 620, a RAM 630, an internal storage device 640, an IOC 650, and a NIC 680, and constitutes a computer device.

ＣＰＵ６１０は、ＲＯＭ６２０からプログラムを読み込む。そして、ＣＰＵ６１０は、読み込んだプログラムに基づいて、ＲＡＭ６３０と、内部記憶装置６４０と、ＩＯＣ６５０と、ＮＩＣ６８０とを制御する。そして、ＣＰＵ６１０を含むコンピュータ装置は、これらの構成を制御し、図１に示されている、定常パターン抽出部１０２と、差分生成部１０４と、ランダムパターン抽出部１０６としての各機能を実現する。 The CPU 610 reads programs from the ROM 620 . Then, the CPU 610 controls the RAM 630, the internal storage device 640, the IOC 650, and the NIC 680 based on the read program. A computer device including the CPU 610 controls these configurations and implements the functions of the regular pattern extraction unit 102, the difference generation unit 104, and the random pattern extraction unit 106 shown in FIG.

ＣＰＵ６１０は、各機能を実現する際に、ＲＡＭ６３０及び／又は内部記憶装置６４０を、プログラムの一時記憶として使用してもよい。 The CPU 610 may use the RAM 630 and/or the internal storage device 640 as temporary storage of programs when implementing each function.

また、ＣＰＵ６１０は、コンピュータで読み取り可能にプログラムを記憶した記録媒体７００が含むプログラムを、図示しない記録媒体読み取り装置を用いて読み込んでもよい。あるいは、ＣＰＵ６１０は、ＮＩＣ６８０を介して、図示しない外部の装置からプログラムを受け取り、ＲＡＭ６３０に保存して、保存したプログラムを基に動作してもよい。 Further, the CPU 610 may read the program included in the recording medium 700 storing the computer-readable program using a recording medium reading device (not shown). Alternatively, the CPU 610 may receive a program from an external device (not shown) via the NIC 680, store it in the RAM 630, and operate based on the stored program.

ＲＯＭ６２０は、ＣＰＵ６１０が実行するプログラム及び固定的なデータを記憶する。ＲＯＭ６２０は、例えば、Ｐ－ＲＯＭ（Ｐｒｏｇｒａｍｍａｂｌｅ－ＲＯＭ）又はフラッシュＲＯＭである。 The ROM 620 stores programs executed by the CPU 610 and fixed data. The ROM 620 is, for example, a P-ROM (Programmable-ROM) or a flash ROM.

ＲＡＭ６３０は、ＣＰＵ６１０が実行するプログラムと、データ（例えば、前述した図４ないし図１０に示されているパターンのデータ及び他の作業用のデータ）とを一時的に記憶する。ＲＡＭ６３０は、例えば、Ｄ－ＲＡＭ（Ｄｙｎａｍｉｃ－ＲＡＭ）である。 The RAM 630 temporarily stores programs executed by the CPU 610 and data (for example, pattern data shown in FIGS. 4 to 10 and other work data). The RAM 630 is, for example, a D-RAM (Dynamic-RAM).

内部記憶装置６４０は、情報処理装置６００が長期的に保存するデータ及びプログラムを記憶する。また、内部記憶装置６４０は、ＣＰＵ６１０の一時記憶装置として動作してもよい。内部記憶装置６４０は、例えば、ハードディスク装置、光磁気ディスク装置、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）又はディスクアレイ装置である。 The internal storage device 640 stores data and programs that the information processing device 600 saves for a long time. Moreover, the internal storage device 640 may operate as a temporary storage device for the CPU 610 . The internal storage device 640 is, for example, a hard disk device, a magneto-optical disk device, an SSD (Solid State Drive), or a disk array device.

ここで、ＲＯＭ６２０と内部記憶装置６４０は、不揮発性（ｎｏｎ－ｔｒａｎｓｉｔｏｒｙ）の記録媒体である。一方、ＲＡＭ６３０は、揮発性（ｔｒａｎｓｉｔｏｒｙ）の記録媒体である。そして、ＣＰＵ６１０は、ＲＯＭ６２０、内部記憶装置６４０、又は、ＲＡＭ６３０に記憶されているプログラムを基に動作可能である。つまり、ＣＰＵ６１０は、不揮発性記録媒体及び／又は揮発性記録媒体を用いて動作可能である。 Here, the ROM 620 and the internal storage device 640 are non-transitory recording media. On the other hand, the RAM 630 is a volatile (transitory) recording medium. The CPU 610 can operate based on programs stored in the ROM 620 , the internal storage device 640 , or the RAM 630 . That is, CPU 610 can operate using a non-volatile recording medium and/or a volatile recording medium.

ＩＯＣ６５０は、ＣＰＵ６１０と、入力機器６６０及び表示機器６７０とのデータを仲介する。ＩＯＣ６５０は、例えば、ＩＯインターフェースカード又はＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）カードである。さらに、ＩＯＣ６５０は、ＵＳＢのような有線に限らず、無線を用いてもよい。 IOC 650 mediates data between CPU 610 and input device 660 and display device 670 . The IOC 650 is, for example, an IO interface card or a USB (Universal Serial Bus) card. Furthermore, the IOC 650 is not limited to a wired connection such as USB, and may use a wireless connection.

入力機器６６０は、情報処理装置６００の操作者からの入力指示を受け取る機器である。入力機器６６０は、例えば、キーボード、マウス又はタッチパネルである。 The input device 660 is a device that receives input instructions from the operator of the information processing device 600 . The input device 660 is, for example, a keyboard, mouse or touch panel.

表示機器６７０は、情報処理装置６００の操作者に情報を表示する機器である。表示機器６７０は、例えば、液晶ディスプレイである。表示機器６７０は、定常パターンＰ及び／又はランダムパターンＺを表示してもよい。 The display device 670 is a device that displays information to the operator of the information processing device 600 . The display device 670 is, for example, a liquid crystal display. The display device 670 may display the regular pattern P and/or the random pattern Z.

ＮＩＣ６８０は、ネットワークを介した図示しない外部の装置とのデータのやり取りを中継する。ＮＩＣ６８０は、例えば、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）カードである。さらに、ＮＩＣ６８０は、有線に限らず、無線を用いてもよい。ＮＩＣ６８０は、定常パターン抽出部１０２の一部として、シーケンスデータＸを受信してもよい。あるいは、ＮＩＣ６８０は、定常パターン抽出部１０２及び／又はランダムパターン抽出部１０６の一部として、定常パターンＰ及び／又はランダムパターンＺを出力してもよい。 The NIC 680 relays data exchange with an external device (not shown) via the network. The NIC 680 is, for example, a LAN (Local Area Network) card. Furthermore, the NIC 680 is not limited to wired, and may be wireless. The NIC 680 may receive the sequence data X as part of the regular pattern extractor 102 . Alternatively, the NIC 680 may output the regular pattern P and/or the random pattern Z as part of the regular pattern extractor 102 and/or the random pattern extractor 106 .

このように構成された情報処理装置６００は、情報処理装置１００と同様の効果を得ることができる。 The information processing device 600 configured in this way can obtain the same effects as the information processing device 100 .

その理由は、情報処理装置６００のＣＰＵ６１０が、プログラムに基づいて情報処理装置１００と同様の機能を実現できるためである。 The reason is that the CPU 610 of the information processing device 600 can implement the same functions as the information processing device 100 based on the program.

［システムの説明］
次に、図面を参照して、本実施形態の情報処理装置１００を含む情報処理システム１０について、説明する。 [Description of the system]
Next, the information processing system 10 including the information processing apparatus 100 of this embodiment will be described with reference to the drawings.

図１２は、第１の実施形態に係る情報処理システム１０の構成の一例を示す図である。情報処理システム１０は、第１の実施形態に係る情報処理装置１００と、監視対象２００と、表示装置３００とを含む。 FIG. 12 is a diagram showing an example of the configuration of the information processing system 10 according to the first embodiment. The information processing system 10 includes an information processing device 100 according to the first embodiment, a monitoring target 200, and a display device 300. FIG.

監視対象２００は、情報処理装置１００が処理対象であるシーケンスデータＸの提供元である。監視対象２００は、任意である。例えば、監視対象２００は、所定の工場でもよい。この場合、シーケンスデータＸは、観測値として、工場内の各設備における属性（温度、圧力、流量、又は、振動など）を含む。あるいは、監視対象２００は、複数のコンピュータを含むコンピュータ・ネットワークである。この場合、シーケンスデータＸは、例えば、観測値として、ネットワークにおいて送信及び受信されるデータ（例えば、パケット）及びコンピュータの状態を含むログである。図１２は、これらの場合を例示している。 The monitoring target 200 is a provider of the sequence data X that the information processing apparatus 100 is to process. The monitored object 200 is arbitrary. For example, the monitored object 200 may be a predetermined factory. In this case, the sequence data X includes attributes (temperature, pressure, flow rate, vibration, etc.) of each facility in the factory as observed values. Alternatively, monitored object 200 is a computer network that includes multiple computers. In this case, the sequence data X is, for example, a log containing, as observations, the data (eg packets) sent and received in the network and the state of the computer. FIG. 12 illustrates these cases.

情報処理装置１００は、監視対象２００からシーケンスデータＸを取得する。そして、情報処理装置１００は、上記の動作を基に、ランダムパターンＺを抽出する。そして、情報処理装置１００は、ランダムパターンＺを表示装置３００に送信する。 The information processing device 100 acquires the sequence data X from the monitored object 200 . Then, the information processing apparatus 100 extracts the random pattern Z based on the above operation. The information processing device 100 then transmits the random pattern Z to the display device 300 .

表示装置３００は、受信したランダムパターンＺを表示する。その結果、情報処理システム１０の利用者は、時間的な定常性を持たない観測値のパターン（ランダムパターンＺ）を把握することができる。 The display device 300 displays the random pattern Z received. As a result, the user of the information processing system 10 can comprehend a pattern of observed values (random pattern Z) that does not have temporal stationarity.

なお、情報処理装置１００は、表示装置３００に、他の情報、例えば、発現位置情報、定常パターンＰ、及び／又は、シーケンスデータＸを送信してもよい。この場合、表示装置３００は、ランダムパターンＺに加え、発現情報、定常パターンＰ、及び／又は、シーケンスデータＸを表示する。 Note that the information processing device 100 may transmit other information such as expression position information, regular pattern P, and/or sequence data X to the display device 300 . In this case, the display device 300 displays the expression information, the steady pattern P, and/or the sequence data X in addition to the random pattern Z.

例えば、表示装置３００がランダムパターンＺと定常パターンＰとを表示した場合、情報処理システム１０の利用者は、定常パターンＰとの比較を基に、ランダムパターンＺをより適切に把握することができる。 For example, when the display device 300 displays the random pattern Z and the regular pattern P, the user of the information processing system 10 can grasp the random pattern Z more appropriately based on the comparison with the regular pattern P. .

あるいは、例えば、表示装置３００がランダムパターンＺと発現情報とを表示した場合、情報処理システム１０の利用者は、ランダムパターンＺに加え、ランダムパターンＺの発現位置及び時間、並びに、ランダムパターンＺの種別を把握することができる。 Alternatively, for example, when the display device 300 displays the random pattern Z and the occurrence information, the user of the information processing system 10 can display the random pattern Z, the occurrence position and time of the random pattern Z, and the random pattern Z. The type can be grasped.

なお、表示装置３００は、情報処理装置１００の外部の装置に限られず、情報処理装置１００に含まれていてもよい。 Note that the display device 300 is not limited to a device external to the information processing device 100 and may be included in the information processing device 100 .

以上、実施形態を参照して本願発明を説明したが、本願発明は上記実施形態に限定されるものではない。本願発明の構成及び詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。 Although the present invention has been described with reference to the embodiments, the present invention is not limited to the above embodiments. Various changes that can be understood by those skilled in the art can be made to the configuration and details of the present invention within the scope of the present invention.

１０情報処理システム
１００情報処理装置
１０２定常パターン抽出部
１０４差分生成部
１０６ランダムパターン抽出部
２００監視対象
３００表示装置
６００情報処理装置
６１０ＣＰＵ
６２０ＲＯＭ
６３０ＲＡＭ
６４０内部記憶装置
６５０ＩＯＣ
６６０入力機器
６７０表示機器
６８０ＮＩＣ
７００記録媒体 REFERENCE SIGNS LIST 10 information processing system 100 information processing device 102 regular pattern extraction unit 104 difference generation unit 106 random pattern extraction unit 200 monitoring target 300 display device 600 information processing device 610 CPU
620 ROMs
630 RAM
640 internal storage device 650 IOC
660 Input device 670 Display device 680 NIC
700 recording medium

Claims

An information processing system that detects random patterns as information for detecting anomalies in monitored equipment,
Acquisition means for acquiring observed values of monitored equipment in the factory;
a data providing device comprising a plurality of times and providing means for providing said observed values at said times as first data;
constructing second data by stacking the first data in the time based on the first data, and based on the second data, the observation having temporal stationarity in the first data steady pattern extracting means for extracting a steady pattern that is a combination of values;
difference generating means for generating a difference in time between the first data and the steady pattern;
Random pattern extraction means for extracting the random pattern, which is a combination of the observed values without temporal stationarity, based on the difference;
an information processing device comprising a transmission means for transmitting the random pattern;
receiving means for receiving the random pattern;
display means for outputting the random pattern; and a display device including
The first data is data having temporal redundancy,
The random pattern is data containing a tuple,
the first data includes a tuple consisting of a plurality of information sources, the time, and a plurality of attribute types in each of the information sources;
The steady pattern extracting means constructs the second data in which the information source and the attribute type are layered at the time, and constructs the second data having the temporal constancy of the information source and the attribute type. extract the pattern,
The difference generation means generates the difference between the information source and the type of the attribute divided in time,
The random pattern extracting means extracts the random pattern including a tuple including the information source, the time, and the attribute type.
Information processing system.

The information processing system according to claim 1, wherein said observed value includes at least one of temperature, pressure, flow rate, and vibration in said monitored equipment.

The transmission means further transmits the steady pattern,
3. The information processing system according to claim 1, wherein said display means further displays said regular pattern.

The first data includes position information where the observed value is expressed,
The transmission means further transmits the location information,
4. The information processing system according to any one of claims 1 to 3, wherein said display means further displays said position information.

5. The information processing system according to claim 1, wherein said display means displays said random pattern together with expression information.

An information processing method for detecting a random pattern as information for detecting anomalies in monitored equipment,
The data providing device
Acquire the observed values of the monitored equipment in the factory,
providing a plurality of times and said observations at said times as first data;
The information processing device
constructing second data by stacking the first data in the time based on the first data, and based on the second data, the observation having temporal stationarity in the first data Extracting stationary patterns that are combinations of values,
generating a difference in time between the first data and the steady pattern;
Extracting the random pattern, which is a combination of the observed values without temporal stationarity based on the difference,
transmitting the random pattern;
the display device
receiving the random pattern;
outputting the random pattern;
The first data is data having temporal redundancy,
The random pattern is data containing a tuple,
The random pattern is data containing a tuple,
the first data includes a tuple consisting of a plurality of information sources, the time, and a plurality of attribute types in each of the information sources;
The information processing device is
Constructing the second data in which the information source and the attribute type are layered in the time, extracting the steady pattern having temporal constancy of the information source and the attribute type,
generating the deltas of the types of the sources and the attributes, separated in time;
extracting said random pattern containing a tuple containing said source, said time and said attribute type;
Information processing methods.

The information processing method according to claim 6, wherein the observed value includes at least one of temperature, pressure, flow rate, and vibration in the monitored equipment.

The information processing device further transmits the steady pattern,
The information processing method according to claim 6 or 7, wherein the display device further displays the steady pattern.

The first data includes position information where the observed value is expressed,
The information processing device further transmits the location information,
The information processing method according to any one of claims 6 to 8, wherein the display device further displays the position information.

The information processing method according to any one of claims 6 to 9, wherein the display device displays the random pattern together with expression information.