JP4762088B2

JP4762088B2 - Process abnormality diagnosis device

Info

Publication number: JP4762088B2
Application number: JP2006235312A
Authority: JP
Inventors: 理山中; 卓巳小原; 勝也山本
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2006-08-31
Filing date: 2006-08-31
Publication date: 2011-08-31
Anticipated expiration: 2026-08-31
Also published as: JP2008059270A

Description

本発明は、特に上下水処理プロセス、排水処理プロセス、浄水プロセス、または化学プロセスなどのプロセス系の異常診断を行なうためのプロセス異常診断装置に関する。 The present invention relates to a process abnormality diagnosis apparatus for performing abnormality diagnosis of a process system such as a water and sewage treatment process, a wastewater treatment process, a water purification process, or a chemical process.

近年、上下水道の水量プロセスおよび水質プロセスや、化学プロセスなどのプロセス系においては、制御目的の多目的化や多様化に伴って、プロセス制御方法として、従来用いられていたＰＩＤ制御方法やシーケンス制御方法に加えて、様々なアドバンスト制御方法（高度に改良された制御方法）が用いられるようになってきている。例えば、モデル予測制御に代表されるアドバンストな自動制御や、プロセスシミュレータを利用した自動制御などを適用した方法である。 In recent years, in process systems such as water and sewage water quantity processes, water quality processes, and chemical processes, PID control methods and sequence control methods that have been conventionally used as process control methods in accordance with the diversification and diversification of control purposes. In addition, various advanced control methods (highly improved control methods) have come to be used. For example, there is a method to which advanced automatic control represented by model predictive control or automatic control using a process simulator is applied.

これらのアドバンスト制御方法を実現するためには、通常状態（正常状態）だけでなく、様々な異常状態を検出し、それに対する対処の手順を明確にしておく必要がある。例えば、プロセスの運用管理においては、プロセスの状態や操作量を計測するためのセンサの故障、アクチュエータなどの機器の故障、突発的な負荷変動（突発外乱）、プロセスの処理異常、制御系の異常などの様々な異常状況に対して適切に対処する必要がある。すなわち、プロセス異常診断装置は、プロセス系において、各種のアドバンストな自動制御の実現やプロセスの完全な自動化の実現を支えるための重要な基礎技術である。 In order to realize these advanced control methods, it is necessary to detect not only the normal state (normal state) but also various abnormal states and clarify the procedure for dealing with them. For example, in process operation management, sensor failure for measuring process status and operation amount, failure of equipment such as actuator, sudden load fluctuation (sudden disturbance), process processing error, control system error It is necessary to appropriately deal with various abnormal situations such as That is, the process abnormality diagnosis apparatus is an important basic technology for supporting various advanced automatic controls and complete process automation in the process system.

このような観点から、異常診断機能を有するプラントワイド最適プロセス制御が提案されている（例えば、特許文献１を参照）。この異常診断機能は、最適制御を実現するために必要最小限の構成からなる。これに対して、よりアドバンストなプロセス異常診断方法としては、多変量統計解析手法を用いた多変量統計的プロセス監視（ＭＳＰＣ：Multi-Variate Statistical Process Control）と呼ばれる方法が知られている（例えば、非特許文献１を参照）。ＭＳＰＣは、ケモメトリクス手法と呼ばれることもあり、化学プロセスの分野で発展してきた異常診断方法である。このＭＳＰＣの中で、最も基本的であり、かつよく利用される手法として、主成分分析（ＰＣＡ：Principal Component Analysis）に基づいた異常診断方法が広く用いられている。 From such a viewpoint, plant-wide optimum process control having an abnormality diagnosis function has been proposed (see, for example, Patent Document 1). This abnormality diagnosis function has a minimum configuration necessary for realizing optimal control. On the other hand, as a more advanced process abnormality diagnosis method, a method called multivariate statistical process control (MSPC) using a multivariate statistical analysis method is known (for example, (Refer nonpatent literature 1). MSPC is sometimes called a chemometrics method, and is an abnormality diagnosis method developed in the field of chemical processes. Among the MSPCs, an abnormality diagnosis method based on principal component analysis (PCA) is widely used as the most basic and frequently used method.

ＰＣＡに基づいた異常診断では、通常、（１）プロセスの異常や状況変化の検出、（２）その要因となる計測変数の特定、（３）真の要因の特定、（４）その状況への対策という手順をとる。この一連の手順において、最初に行うプロセスの異常や状況変化の検出をできる限り素早く、また正確に行うことが、この方法の実用化に際してキーポイントとなる。この理由としては、（１）異常等の状況変化は時間が経過してから検出しても意味が無いことが多く、また時間が経過してからであれば、このような監視システムが無くても人間が状況変化を察知できる。また、（２）異常等の要因特定は、異常等のイベントが生じてから時間が経過してしまうとなかなか行うことができないが，異常等のイベントが生じた直後であれば比較的容易に行える。さらに、（３）異常等の要因特定処理は、正しく異常などの検出ができれば、一連の数学的処理によって行えるが、異常と正常の違いをその限界付近で精度よく見極めることは極めて困難である。 In abnormality diagnosis based on PCA, usually, (1) detection of process abnormality or situation change, (2) identification of measurement variable that causes it, (3) identification of true factor, (4) identification of the situation Take a procedure called countermeasures. In this series of procedures, it is a key point in practical use of this method to detect the abnormality of the process performed first and the change of the situation as quickly and accurately as possible. The reason for this is that (1) situation changes such as abnormalities are often meaningless even if detected after the passage of time, and there is no such monitoring system after the passage of time. Even humans can detect changes in the situation. (2) Although it is difficult to specify the cause of an abnormality or the like after an event such as an abnormality has occurred, it can be relatively easily performed immediately after the occurrence of an event such as an abnormality. . Furthermore, (3) a factor specifying process such as an abnormality can be performed by a series of mathematical processes if the abnormality can be detected correctly, but it is extremely difficult to accurately determine the difference between the abnormality and the normal near the limit.

このように、プロセス異常を素早く正確に検出することが重要であるにもかかわらず、従来のプロセス異常診断方法では、プロセス異常を正確に素早く行うことができない場合がしばしば発生する。
特開２００４−１７１５３１号公報加納学、“主成分分析”、２００２年５月第２版、京都大学大学院工学研究科化学工学専攻プロセスシステム工学研究室、［２００５年８月１１日検索］、インターネット＜URL:http://www-pse.cheme.kyoto-u.ac.jp/kano/document/text-PCA.pdf＞ Thus, although it is important to detect a process abnormality quickly and accurately, the conventional process abnormality diagnosis method often fails to perform a process abnormality accurately and quickly.
JP 2004-171531 A Manabu Kano, “principal component analysis”, May 2002, 2nd edition, Kyoto University Graduate School of Engineering, Department of Chemical Engineering, Process System Engineering Laboratory, [Search August 11, 2005], Internet <URL: http: // www-pse.cheme.kyoto-u.ac.jp/kano/document/text-PCA.pdf>

前述のようなＰＣＡに基づいた異常診断方法においても、必ずしもプロセスの異常を正確に検出することができるとは限らない。異常検出の誤りのタイプには、大別して以下のような２タイプがある。即ち、本当（実際）は異常であるのに、正常であると診断する誤りのタイプ（以下、Ａタイプ）と、本当（実際）は正常であるのに、異常であると診断する誤りのタイプ（以下、Ｂタイプ）がある。 Even in the abnormality diagnosis method based on PCA as described above, it is not always possible to accurately detect a process abnormality. There are two types of error detection errors as follows. That is, an error type (hereinafter referred to as A type) that is diagnosed as normal even though true (actual) is abnormal, and an error type that is diagnosed as abnormal even though true (actual) is normal (Hereinafter referred to as B type).

プロセスの異常診断において、正確な異常検出を行うには、単に異常検出に対する感度を高めるだけでは不十分であり、Ａタイプの誤りとＢタイプの誤りのトレードオフを適切に調整することが重要である。Ａタイプの誤りとＢタイプの誤りを共に極力減らすことが理想ではあるが、実際上では、正常と異常の境界域では、異常と正常の重複があることが多い。このため、通常では、Ａタイプの誤りを減少させようとすると、Ｂタイプの誤りが増大し、逆にＢタイプの誤りを減少させようとすると、Ａタイプの誤りが増大することが確認されている。 In order to accurately detect abnormalities in process abnormality diagnosis, it is not sufficient to simply increase sensitivity to abnormality detection, and it is important to appropriately adjust the trade-off between type A errors and type B errors. is there. Although it is ideal to reduce both A-type errors and B-type errors as much as possible, in practice, there are many cases where there is an overlap between normal and abnormal at the boundary between normal and abnormal. For this reason, it has been confirmed that, normally, an attempt to reduce the A type error increases the B type error, and conversely an attempt to reduce the B type error increases the A type error. Yes.

そこで、本発明の目的は、Ａタイプの誤りとＢタイプの誤りのトレードオフの適正化を図り、正確な異常検出と共に、実際的なプロセス監視に適切な異常診断性能を実現したプロセス異常診断装置を提供することにある。 Therefore, an object of the present invention is to optimize the trade-off between an A type error and a B type error, and realize an abnormality diagnosis performance suitable for actual process monitoring together with accurate abnormality detection. Is to provide.

本発明の観点に従ったプロセス異常診断装置は、対象プロセスの状態または操作量を計測する計測手段と、前記計測手段から計測結果を示す時系列データを収集して保存するデータ保存手段と、前記データ保存手段に保存された前記時系列データから、プロセス監視モデルを作成するために使用し、前記対象プロセスの状態を示すプロセスデータを取り出すプロセスデータ抽出手段と、前記プロセスデータ抽出手段により取り出されたプロセスデータを正規化するための正規化モデルを生成する正規化モデル生成手段と、前記正規化モデルにより正規化されたプロセスデータに対する主成分分析方式によるプロセス監視モデルを構築して、当該正規化プロセスデータの異常検出用データとして、Ｑ統計量またはホテリングＴ^２統計量の統計量データを算出し、当該統計量データに対する前記計測手段により計測している計測項目（計測変数）の寄与量を算出するプロセス監視モデル手段と、前記統計量データに含まれる異常データと正常データとの割合に関する事前情報であって、前記寄与量に対する前記事前情報を生成する事前情報生成手段と、前記事前情報及び前記プロセス監視モデル手段により算出される前記統計量データの寄与量を使用して、前記統計量データの異常状態と正常状態の判定を行うための閾値を決定する閾値決定手段と、前記閾値決定手段により決定された閾値及び前記統計量データを使用して、前記対象プロセスの異常を診断する異常診断手段とを備えた構成である。 A process abnormality diagnosis apparatus according to an aspect of the present invention includes a measurement unit that measures a state or an operation amount of a target process, a data storage unit that collects and stores time-series data indicating measurement results from the measurement unit, and Process data extraction means for extracting process data indicating the state of the target process, used to create a process monitoring model from the time series data stored in the data storage means, and extracted by the process data extraction means A normalization model generation means for generating a normalization model for normalizing the process data, and a process monitoring model based on a principal component analysis method for the process data normalized by the normalization model is constructed, and the normalization process Q statistics or Hotelling T ² statistics statistics for data anomaly detection A process monitoring model unit that calculates data and calculates a contribution amount of a measurement item (measurement variable) measured by the measurement unit with respect to the statistical data; and abnormal data and normal data included in the statistical data a priori information about the percentage, the a priori information generation means for generating the prior information for the contribution amount, using the contribution of the statistic data which is calculated by the preliminary information and the process monitor model means , A threshold value determining means for determining a threshold value for determining an abnormal state and a normal state of the statistical data, and using the threshold value determined by the threshold value determining means and the statistical data, the target process abnormality And an abnormality diagnosis means for diagnosing the above.

本発明の別の観点としては、正常データに異常データが混在している可能性があり、少なくとも半数以上の正常データを含み、かつ、異常データと正常データの識別をデータの絶対値に対する閾値で決定できるような時系列データに対して、異常データと正常データの割合を決定する手段として、明らかに正常なデータと明らかに異常データを予め除外して異常データと正常データの閾値の存在範囲を限定する閾値存在範囲限定手段と、前記閾値存在範囲限定手段で決定した閾値存在範囲のデータに対して、クラスタリング法を用いて閾値を設定する閾値設定手段とを備えたプロセス異常診断装置である。 As another aspect of the present invention, there is a possibility that abnormal data is mixed with normal data, including at least half of normal data, and distinguishing abnormal data from normal data using a threshold value for the absolute value of the data. As a means of determining the ratio of abnormal data and normal data for time series data that can be determined, the normal data and clearly abnormal data are excluded in advance, and the existence range of the threshold of abnormal data and normal data is determined. A process abnormality diagnosis apparatus comprising: a threshold existence range limiting unit for limiting; and a threshold setting unit configured to set a threshold for the data of the threshold existence range determined by the threshold existence range limiting unit using a clustering method.

この装置は、換言すれば正常データ及び異常データのクラスタリング決定装置である。本装置によると、正常データと異常データが混在する任意の時系列データを単純にクラスタリングで２分割する様な方法では、クラスタリングを適用する対象データの性質によっては適切な閾値とは考えられない様な閾値を設定することが生じうるのに対し、予め正常データと異常データを分離する閾値の存在領域を絞り込むことによって、最悪な場合でも閾値が存在し得る領域に閾値を設定することができ、より高信頼に異常データと正常データをクラスタリングすることができる。 In other words, this apparatus is a clustering determination apparatus for normal data and abnormal data. According to this device, a method that simply divides any time-series data in which normal data and abnormal data are mixed into two by clustering may not be considered an appropriate threshold depending on the nature of the target data to which clustering is applied. However, it is possible to set a threshold in an area where the threshold can exist even in the worst case by narrowing down the existing area of the threshold that separates normal data and abnormal data in advance. It is possible to cluster abnormal data and normal data with higher reliability.

特に、異常・正常を判断する閾値の設定やプロセスデータを正規化する正規化パラメータ（スケーリングパラメータ）の設定は、異常検出用データ（例：Ｑ統計量やＨｏｔｅｌｌｉｎｇのＴ２統計量等）やスケーリングを行いたいデータ（例：プロセス計測データ）の中から異常データと正常データの割合を推定して異常データと正常データを適切にクラスタリングすることに帰着されるため、あるデータの異常データと正常データの割合を適切にクラスタリングすることができる。また、より具体的に、ＰＣＡを用いたＭＳＰＣによる異常診断システムにおいて、（１）異常・正常の判断に対する感度を調整するための、ＰＣＡでプロセスの異常診断や状況判断を行うＱ統計量及びＨｏｔｅｌｌｉｎｇのＴ２統計量と呼ばれる統計量の閾値の設定方法、（２）異常の要因に対する感度を調整するためのプロセスデータを正規化するパラメータの設定方法を適切に行なうことができる。 In particular, the setting of a threshold value for judging abnormality / normality and the setting of a normalization parameter (scaling parameter) for normalizing process data include anomaly detection data (eg, Q statistic, Hotelling T2 statistic, etc.) and scaling. Since the ratio of abnormal data and normal data is estimated from the data (eg process measurement data) to be performed and the abnormal data and normal data are appropriately clustered, The proportions can be appropriately clustered. More specifically, in an abnormality diagnosis system based on MSPC using PCA, (1) Q statistics for performing process abnormality diagnosis and situation determination with PCA and Hotling for adjusting sensitivity to abnormality / normality determination (2) A parameter setting method for normalizing process data for adjusting sensitivity to abnormal factors can be appropriately performed.

本発明によれば、特にＰＣＡに基づいた異常診断において、Ａタイプの誤りとＢタイプの誤りのトレードオフの適正化を図り、正確な異常検出と共に、実際的なプロセス監視に適切な異常診断性能を実現したプロセス異常診断装置を提供できる。 According to the present invention, particularly in abnormality diagnosis based on PCA, the trade-off between an A-type error and a B-type error is optimized, and an abnormality diagnosis performance suitable for actual process monitoring along with accurate abnormality detection. Can provide a process abnormality diagnostic apparatus.

以下図面を参照して、本発明の実施形態を説明する。 Embodiments of the present invention will be described below with reference to the drawings.

［第１の実施形態］
（システム構成）
図１は、第１の実施形態に関するプロセス異常診断装置の構成を示すブロック図である。 [First Embodiment]
(System configuration)
FIG. 1 is a block diagram illustrating a configuration of a process abnormality diagnosis apparatus according to the first embodiment.

本実施形態に関するプロセス異常診断装置は、例えば窒素及びリンの除去を目的とした下水処理プロセスなどのプロセス１を監視対象とするプロセス監視システムに用いられる。なお、本実施形態のプロセス異常診断装置は、監視対象のプロセス１としては、下水処理プロセスに限定されるものではなく、化学プロセス等の他のプロセスにも適用できる。 The process abnormality diagnosis apparatus according to the present embodiment is used in a process monitoring system that monitors process 1 such as a sewage treatment process for the purpose of removing nitrogen and phosphorus, for example. Note that the process abnormality diagnosis apparatus of the present embodiment is not limited to the sewage treatment process as the process 1 to be monitored, and can be applied to other processes such as a chemical process.

下水処理プロセス１は、最初沈澱池１０１、第１嫌気槽１０２、第２好気槽１０３、第３無酸素槽１０４、第４好気槽１０５、及び最終沈澱池１０６を有する。また、下水処理プロセス１は、引き抜き流量センサを含む最初沈澱池余剰汚泥引き抜きポンプ１１１、投入量センサを含む酢酸系有機物を供給する酢酸投入ポンプ１１２、ステップ流入量センサを含むステップ流入ポンプ１１３、供給空気流量センサを含む第２好気槽に酸素を供給するブロワ１１４、投入量センサを含む炭素源（有機物）を供給する炭素源投入ポンプ１１５、循環流量センサを含む循環ポンプ１１６、返送流量センサを含む返送汚泥ポンプ１１７、および供給空気流量センサを含む第４好気槽に酸素を供給するブロワ１１８、投入量センサを含む凝集剤投入ポンプ１１９、及び引き抜き流量センサを含む最終沈澱池余剰汚泥引き抜きポンプ１１１０のそれぞれを、アクチュエータおよびその操作量センサ群として有する。 The sewage treatment process 1 includes a first sedimentation tank 101, a first anaerobic tank 102, a second aerobic tank 103, a third anaerobic tank 104, a fourth aerobic tank 105, and a final sedimentation tank 106. In addition, the sewage treatment process 1 includes an initial sedimentation pond excess sludge extraction pump 111 including an extraction flow sensor, an acetic acid input pump 112 that supplies acetic acid-based organic matter including an input amount sensor, a step inflow pump 113 that includes a step inflow sensor, and a supply A blower 114 for supplying oxygen to a second aerobic tank including an air flow sensor, a carbon source input pump 115 for supplying a carbon source (organic matter) including an input sensor, a circulation pump 116 including a circulation flow sensor, and a return flow sensor A return sludge pump 117 including a blower 118 for supplying oxygen to a fourth aerobic tank including a supply air flow rate sensor, a flocculant input pump 119 including an input amount sensor, and a final sedimentation tank surplus sludge extraction pump including an extraction flow rate sensor Each of 1110 has an actuator and its operation amount sensor group.

さらに、下水処理プロセス１は、流入下水量を計測する下水流入量センサ１２１、流入下水に含まれる全窒素量を計測する流入ＴＮセンサ１２２、流入下水に含まれる全リン量を計測する流入ＴＰセンサ１２３、流入下水に含まれる有機物量を計測する流入ＵＶセンサあるいは流入ＣＯＤセンサ１２４、第１嫌気槽１０２のリン酸濃度を計測するリン酸センサ１２５、第２好気槽１０３のアンモニア濃度を計測するアンモニアセンサ１２６、第２好気槽１０３の溶存酸素濃度を計測するＤＯセンサ１２７、及び第３無酸素槽１０４の硝酸濃度を計測する硝酸センサ１２８を有する。 Further, the sewage treatment process 1 includes a sewage inflow sensor 121 that measures the inflow sewage amount, an inflow TN sensor 122 that measures the total nitrogen amount contained in the inflow sewage, and an inflow TP sensor that measures the total phosphorus amount contained in the inflow sewage. 123, an inflow UV sensor or inflow COD sensor 124 that measures the amount of organic matter contained in the inflowing sewage, a phosphoric acid sensor 125 that measures the phosphoric acid concentration in the first anaerobic tank 102, and an ammonia concentration in the second aerobic tank 103 It has the ammonia sensor 126, the DO sensor 127 that measures the dissolved oxygen concentration in the second aerobic tank 103, and the nitric acid sensor 128 that measures the nitric acid concentration in the third anaerobic tank 104.

また、下水処理プロセス１は、第１嫌気槽１０２、第２好気槽１０３、第３無酸素槽１０４、第４好気槽１０５の少なくとも1ヶ所の槽で活性汚泥量を計測するＭＬＳＳセンサ１２９、第４好気槽１０５のリン酸濃度を計測するリン酸センサ１２１０、第４好気槽１０５の溶存酸素濃度を計測するＤＯセンサ１２１１、第４好気槽１０５のアンモニア濃度を計測するアンモニアセンサ１２１２、最終沈澱池１０６から引き抜かれる汚泥量のＭＬＳＳ濃度を計測するＭＬＳＳセンサ１２１３、最終沈澱池１０６から放流される放流水のＳＳ濃度を計測するＳＳセンサ１２１４、放流下水量を計測する下水放流量センサ１２１５、放流下水に含まれる全窒素量を計測する放流ＴＮセンサ１２１６、放流下水に含まれる全リン量を計測する放流ＴＰセンサ１２１７、および放流下水に含まれる有機物量を計測する放流ＵＶセンサあるいは放流ＣＯＤセンサ１２１８のそれぞれをプロセスセンサとして有する。 The sewage treatment process 1 includes an MLSS sensor 129 that measures the amount of activated sludge in at least one of the first anaerobic tank 102, the second aerobic tank 103, the third anaerobic tank 104, and the fourth aerobic tank 105. A phosphoric acid sensor 1210 that measures the phosphoric acid concentration in the fourth aerobic tank 105, a DO sensor 1211 that measures the dissolved oxygen concentration in the fourth aerobic tank 105, and an ammonia sensor that measures the ammonia concentration in the fourth aerobic tank 105 1212, MLSS sensor 1213 that measures the MLSS concentration of the sludge amount drawn from the final sedimentation basin 106, SS sensor 1214 that measures the SS concentration of effluent discharged from the final sedimentation basin 106, and sewage discharge flow rate that measures the amount of sewage discharged Sensor 1215, discharge TN sensor 1216 for measuring the total amount of nitrogen contained in the discharged sewage, discharge TP cell for measuring the total amount of phosphorus contained in the discharged sewage Each of the sensor 1217 and the discharge UV sensor or the discharge COD sensor 1218 for measuring the amount of organic matter contained in the discharged sewage is a process sensor.

ここで、前述の各種アクチュエータ１１１〜１１１０は、所定の周期で動作している。また、各種アクチュエータ１１１〜１１１０の操作量センサ群、および各種プロセスセンサ１２１〜１２１８は所定の周期で計測を行っている。 Here, the above-described various actuators 111 to 1110 operate at a predetermined cycle. In addition, the operation amount sensor group of the various actuators 111 to 1110 and the various process sensors 121 to 1218 perform measurement at a predetermined cycle.

プロセス異常診断装置は、プロセス１に設けられている各種のセンサから、プロセスの状態又は操作量の計測結果であるプロセスデータ（時系列データ）を所定の周期で収集し、保存するプロセス計測データ収集・保存部２を有する。 The process abnormality diagnosis apparatus collects process data (time-series data), which is a measurement result of a process state or an operation amount, from various sensors provided in the process 1 at a predetermined cycle and stores it. A storage unit 2 is included.

さらに、プロセス異常診断装置は、プロセス監視モデル構築用プロセスデータ抽出部（以下単にプロセスデータ抽出部と表記）３と、プロセス監視モデル構築・供給部（以下単にモデル構築部と表記）４と、プロセス異常判定基準設定・供給部（以下単に異常判定基準設定部と表記）５と、プロセス監視部６と、プロセス異常診断部７と、プロセス異常要因センサ特定部８と、プロセス異常要因推定部９、及びユーザインターフェース部１０とを有する。 Furthermore, the process abnormality diagnosis apparatus includes a process data extraction unit for process monitoring model construction (hereinafter simply referred to as process data extraction part) 3, a process monitoring model construction / supply part (hereinafter simply referred to as model construction part) 4, An abnormality determination criterion setting / supply unit (hereinafter simply referred to as an abnormality determination criterion setting unit) 5, a process monitoring unit 6, a process abnormality diagnosis unit 7, a process abnormality factor sensor specifying unit 8, a process abnormality factor estimation unit 9, And a user interface unit 10.

プロセスデータ抽出部３は、プロセス計測データ収集・保存部２に保存された各種の時系列データの中から、プロセス監視モデルを構築するために、主に正常状態（通常状態）のデータを中心とした所定の期間に亘るプロセスデータを抽出する。モデル構築部４は、プロセスデータ正規化モデル生成部４１および異常検出モデル生成部４２を含む。プロセスデータ正規化モデル生成部４１は、プロセスデータ正規化用事前情報入力部（以下単に正規化用事前情報入力部と表記）４１１及びパラメータ決定部４１２を含む。 The process data extraction unit 3 mainly focuses on normal state (normal state) data in order to build a process monitoring model from various time series data stored in the process measurement data collection / storage unit 2. Process data over a predetermined period is extracted. The model construction unit 4 includes a process data normalization model generation unit 41 and an abnormality detection model generation unit 42. The process data normalization model generation unit 41 includes a process data normalization prior information input unit (hereinafter simply referred to as normalization prior information input unit) 411 and a parameter determination unit 412.

正規化用事前情報入力部４１１は、プロセスデータ抽出部３で取り出した各種のプロセスデータ（xi(t),t=1,2,---l=所定の期間，i=1,2,---28）に対して、これらのプロセスデータに含まれる正常データと異常データの割合に対する事前情報を入力する。また、パラメータ決定部４１２は、正規化用事前情報入力部４１１から入力される事前情報に従って、各種のプロセスデータを正規化するための式［(xi(t)-ai)/bi, xi(t)はi番目のプロセスデータ］のシフトパラメータ（ai）とスケーリングパラメータ（bi）を決定する。シフトパラメータ（ai）は、i番目のプロセスデータに対するシフトを表す定数である。また、スケーリングパラメータ（bi）は、i番目のプロセスデータに対するスケーリングを表す定数である。 The prior information input unit 411 for normalization uses various process data (xi (t), t = 1,2, --- l = predetermined period, i = 1,2,-) extracted by the process data extraction unit 3 --28), input prior information on the ratio of normal data and abnormal data included in these process data. In addition, the parameter determination unit 412 uses the equations [(xi (t) -ai) / bi, xi (t) for normalizing various process data according to the prior information input from the normalization prior information input unit 411. ) Determines the shift parameter (ai) and scaling parameter (bi) of the i-th process data]. The shift parameter (ai) is a constant representing a shift for the i-th process data. The scaling parameter (bi) is a constant representing scaling for the i-th process data.

異常検出モデル生成部４２は、プロセスデータ正規化モデル生成部４１で定義した正規化モデルによりプロセスデータを正規化した正規化プロセスデータに対して、前述の主成分分析（ＰＣＡ)を施す。これにより、ＰＣＡのローディング行列（負荷行列）とスコア行列を算出し、これらを用いて定義されるＱ統計量およびホテリング（Hotelling）のＴ^２統計量を計算するための計算式（モデル）を設定する。 The abnormality detection model generation unit 42 performs the above-described principal component analysis (PCA) on the normalized process data obtained by normalizing the process data using the normalization model defined by the process data normalization model generation unit 41. As a result, a PCA loading matrix (load matrix) and score matrix are calculated, and formulas (models) for calculating Q statistics and Hotelling T ² statistics defined using these are set. To do.

異常判定基準設定部５は、閾値決定用事前情報入力部５１及び閾値決定部５２を含む。閾値決定用事前情報入力部５１は、プロセスデータ正規化モデル生成部４１の正規化モデルにより正規化された正規化プロセスデータを、異常検出モデル生成部４２により生成したＱ統計量およびホテリングのＴ^２統計量の計算式（モデル）を使用することで、当該各統計量に含まれる正常データと異常データの割合に対する事前情報を入力する。閾値決定部５２は、閾値決定用事前情報入力部５１から入力される事前情報に従って、Ｑ統計量およびホテリングのＴ^２統計量における正常と異常の閾値を設定する。 The abnormality determination criterion setting unit 5 includes a threshold determination prior information input unit 51 and a threshold determination unit 52. The threshold determination prior information input unit 51 uses the Q statistic generated by the anomaly detection model generation unit 42 and the hotelling T ² for the normalized process data normalized by the normalization model of the process data normalization model generation unit 41. By using a statistic calculation formula (model), prior information on the ratio of normal data and abnormal data included in each statistic is input. The threshold determination unit 52 sets normal and abnormal thresholds in the Q statistic and the hotelling T ² statistic according to the prior information input from the threshold determination prior information input unit 51.

プロセス監視部６は、モデル構築部４から供給された正規化モデルとプロセス監視モデルとを用いて、プロセス計測データ収集・保存部２から、例えばオンラインで供給されるプロセスデータに対してＱ統計量およびホテリングのＴ^２統計量を計算して、プロセス１の正常状態または異常状態を監視する。プロセス異常診断部７は、プロセス監視部６で監視しているＱ統計量およびホテリングのＴ^２統計量に対して、異常判定基準設定部５で設定した異常判定の閾値を用いて、プロセス計測データ収集・保存部２から供給されたプロセスデータの正常または異常を判断する。 The process monitoring unit 6 uses the normalized model and the process monitoring model supplied from the model construction unit 4 to, for example, Q statistics for the process data supplied online from the process measurement data collection / storage unit 2. And calculate the Hotelling T ² statistic to monitor the normal or abnormal state of process 1. Process error diagnostic portion 7, with respect to T ² statistic Q statistic and Hotelling monitored by the process monitoring unit 6, using a threshold of abnormality determination set by the failure determination reference setting unit 5, process measurement data Whether the process data supplied from the collection / storage unit 2 is normal or abnormal is determined.

さらに、プロセス異常要因センサ特定部８は、プロセス異常診断部７によりプロセス１が通常状態（正常）でないと診断されたときに、その要因となる少なくとも一つ以上のセンサ１１〜１Ｎを特定する。プロセス異常要因推定部９は、プロセス異常要因センサ特定部８によって特定された情報に基づいて、予め知識処理などを行っておくことにより，何故そのようなセンサ情報が出力されるかを推定する。ユーザインターフェース部１０は、各種の情報を表示出力するための表示部を含み、プロセス監視部６、プロセス異常診断部７、プロセス異常要因センサ特定部８、プロセス異常要因推定部９からの少なくとも一つ以上の情報をオペレータに通知する。 Furthermore, when the process abnormality diagnosis unit 7 diagnoses that the process 1 is not in a normal state (normal), the process abnormality factor sensor identification unit 8 identifies at least one or more sensors 11 to 1N that cause the process 1. The process abnormality factor estimation unit 9 estimates why such sensor information is output by performing knowledge processing or the like in advance based on the information specified by the process abnormality factor sensor specifying unit 8. The user interface unit 10 includes a display unit for displaying and outputting various types of information, and at least one of the process monitoring unit 6, the process abnormality diagnosis unit 7, the process abnormality factor sensor specifying unit 8, and the process abnormality factor estimation unit 9. The above information is notified to the operator.

（第１の実施形態の作用）
以下、本実施形態の作用効果を説明する。 (Operation of the first embodiment)
Hereinafter, the effect of this embodiment is demonstrated.

まず、下水処理プロセスなどのプロセス１では、各種のセンサ１１〜１Ｎにより、所定の周期でプロセスの状態や操作に関する情報が計測されている。プロセス計測データ収集・保存部２は、当該各種のセンサ１１〜１Ｎからの計測結果であるプロセスデータを所定の周期で収集し、予め決められたフォーマットに従って時系列データとして保存する。 First, in the process 1 such as a sewage treatment process, information on the state and operation of the process is measured at predetermined intervals by various sensors 11 to 1N. The process measurement data collection / storage unit 2 collects process data as measurement results from the various sensors 11 to 1N at a predetermined cycle, and stores the process data as time series data according to a predetermined format.

プロセスデータ抽出部３は、プロセス計測データ収集・保存部２に保存された各種の時系列データの中から、プロセス監視モデルを構築するために、主に正常状態（通常状態）のデータを中心とした所定の期間に亘る一連のプロセスデータを抽出する。換言すれば、プロセスデータ抽出部３は、例えばプロセス１の操作量やプロセス量の正常範囲に属しているデータが主要な構成要素となるように、一連のプロセスデータを取り出す。 The process data extraction unit 3 mainly focuses on normal state (normal state) data in order to build a process monitoring model from various time series data stored in the process measurement data collection / storage unit 2. A series of process data over a predetermined period is extracted. In other words, the process data extraction unit 3 extracts a series of process data so that, for example, data that belongs to the operation amount of the process 1 or the normal range of the process amount becomes a main component.

ここで、プロセスデータ抽出部３において、予め正常データのみを取り出すことは、不可能あるいは困難な場合が多い。即ち、予め正常データのみを取り出すことができるということは、少なくとも、プロセス監視モデル構築用データに対しては異常診断が完了し、正常であると判断されていることを意味するからである。従って、プロセスデータ抽出部３は、予め異常データと正常データの割合に関して推測された事前情報を取得して、ある程度の異常データを含むプロセスデータを抽出する。この事前情報としては、例えば、以下のような情報である。 Here, in the process data extraction unit 3, it is often impossible or difficult to extract only normal data in advance. That is, the fact that only normal data can be extracted in advance means that abnormality diagnosis has been completed for at least process monitoring model construction data and it is determined that the data is normal. Therefore, the process data extraction unit 3 obtains prior information preliminarily estimated with respect to the ratio between abnormal data and normal data, and extracts process data including a certain amount of abnormal data. The prior information is, for example, the following information.

（１）抽出したデータの中には、殆ど異常データは含まれておらず、含まれていたとしても数点程度である。 (1) The extracted data contains almost no abnormal data. Even if it is included, there are only a few points.

（２）抽出したデータの中には、異常データは少し含まれていて、大体Ｘ％程度である。 (2) The extracted data includes a little abnormal data, which is about X%.

（３）抽出したデータの中には、異常データは数％以上含まれているが、割合に関しては不明である。 (3) The extracted data contains abnormal data of several percent or more, but the ratio is unknown.

プロセスデータ抽出部３は、以上のような事前情報（１）〜（３）を取得できない場合には、抽出したデータの中の異常データと正常データの割合に関しては不明という判断を実行する。但し、抽出したデータの中に、例えば数十％以上の異常データが含まれている場合、プロセス監視モデル自身を構築することができなくなるので、プロセスデータ抽出部３は、主たる構成要素が正常データであるプロセスデータを抽出する必要がある。また、抽出するデータの中には、正常動作範囲内のなるべく多様なデータが混在していることが好ましい。 If the prior data (1) to (3) as described above cannot be acquired, the process data extraction unit 3 determines that the ratio of abnormal data to normal data in the extracted data is unknown. However, if the extracted data contains, for example, several tens of percent or more abnormal data, the process monitoring model itself cannot be constructed. Therefore, the process data extraction unit 3 is configured so that the main component is normal data. It is necessary to extract process data. In addition, it is preferable that as much data as possible within the normal operation range is mixed in the data to be extracted.

さらに、プロセスデータの中に、欠測データや異常データ（アウトライアも含む）、あるいはノイズがのっているデータが含まれている場合には、プロセスデータ抽出部３は、予めこれらのデータに前処理を実行する前処理機能を有することが好ましい。アウトライアとは、伝送異常などにより時々突発的に生じる異常データを意味する。前処理機能としては、具体的には、例えば、ノイズを除去するためのＦＩＲフィルタ機能やＩＩＲフィルタ機能などのデジタルフィルタ機能である。アウトライアに対しては、メジアンフィルタ機能やウェーブレットフィルタ機能などを用いて、データの中から本当のプロセスデータであると思われるデータを抽出する。 Furthermore, if the process data includes missing data, abnormal data (including outliers), or data with noise, the process data extraction unit 3 preliminarily stores these data. It is preferable to have a preprocessing function for executing preprocessing. The outlier means abnormal data that occasionally occurs suddenly due to a transmission abnormality or the like. Specifically, the pre-processing function is, for example, a digital filter function such as an FIR filter function or an IIR filter function for removing noise. For outliers, the median filter function, wavelet filter function, etc. are used to extract data that is considered to be real process data from the data.

プロセスデータ抽出部３は、抽出したプロセスデータをモデル構築部４に送る。モデル構築部４では、プロセスデータ正規化モデル生成部４１は、各種センサ１１〜１Ｎにより計測されたプロセスデータを正規化するための計算式（正規化モデル）を、プロセスデータ抽出部３で抽出したプロセスデータを用いて決定する。このとき、正規化モデル生成部４１では、まず、正規化用事前情報入力部４１１が、プロセスデータ抽出部３で取り出した各種のプロセスデータに関する事前情報を入力する。この事前情報は、プロセスデータに含まれる正常データと異常データの割合に対する情報である。正規化用事前情報入力部４１１は、例えば図２に示すような画面情報を、ユーザインターフェース部１０の表示部に表示して、プロセス監視モデルを構築するユーザに、例えば図２に示すような４つの事前情報（ａ）〜（ｄ）から選択させる。ここで、ユーザが事前情報（ｄ）を選択した場合には、さらに、プロセスデータの中に混在している異常データの概略的な割合（％）を同時に入力することを促す。 The process data extraction unit 3 sends the extracted process data to the model construction unit 4. In the model construction unit 4, the process data normalization model generation unit 41 uses the process data extraction unit 3 to extract a calculation formula (normalization model) for normalizing the process data measured by the various sensors 11 to 1N. Determine using process data. At this time, in the normalization model generation unit 41, the normalization prior information input unit 411 first inputs prior information regarding various process data extracted by the process data extraction unit 3. This prior information is information on the ratio of normal data and abnormal data included in the process data. The normalization prior information input unit 411 displays screen information such as that shown in FIG. 2 on the display unit of the user interface unit 10, for example, to the user who builds the process monitoring model, for example, as shown in FIG. The two pieces of prior information (a) to (d) are selected. Here, when the user selects prior information (d), the user is further prompted to simultaneously input a rough ratio (%) of abnormal data mixed in the process data.

パラメータ決定部４１２は、正規化用事前情報入力部４１１から入力される事前情報に基づいて、各種のプロセスデータの正規化に必要なシフトパラメータ（ａｉ）、及びスケーリングパラメータ（ｂｉ）の値を、以下に示す手順により決定する。 The parameter determination unit 412 determines the values of the shift parameter (ai) and the scaling parameter (bi) necessary for normalization of various process data based on the prior information input from the normalization prior information input unit 411. It is determined by the following procedure.

正規化用事前情報入力部４１１から、図２に示す事前情報（ａ）が入力されると、事前情報は何も無いため、パラメータ決定部４１２は、なるべく異常値に左右されないロバストな正規化手法を採用した場合のパラメータを決定する。 When the prior information (a) shown in FIG. 2 is input from the normalization prior information input unit 411, since there is no prior information, the parameter determination unit 412 is a robust normalization method that is as independent as possible from abnormal values. Determine the parameters when.

具体的には、ロバスト標本平均、あるいはメジアンとロバスト標本標準偏差を用いる。ここで、ロバスト標本平均やロバスト標本標準偏差とは、予めプロセスデータの最大値の付近及び最小値の付近の例えば０．５％程度のデータを取り除いた上で求めた標本平均と標本標準偏差である。従って、パラメータ決定部４１２は、予め上下限値付近のいくつかのデータを除いた上で、シフトパラメータ（ａｉ）を「ai=1/N(Σxi(k))」というプロセスデータの平均値、あるいは全データの中央値（メジアン）と決定する。ここで、メジアンは、それ自身異常値に影響されにくいロバストな統計量なのでそのまま用いる。ロバスト標本平均またはメジアンのいずれかを用いるかは、データの基準値を０にすることをより優先するか、またはロバスト性をより優先するかの判断に基づく。 Specifically, a robust sample average or median and robust sample standard deviation is used. Here, the robust sample average and the robust sample standard deviation are the sample average and the sample standard deviation obtained after removing, for example, about 0.5% of data in the vicinity of the maximum value and the minimum value of the process data in advance. is there. Therefore, the parameter determination unit 412 removes some data in the vicinity of the upper and lower limit values in advance and sets the shift parameter (ai) as an average value of process data “ai = 1 / N (Σxi (k))”, Alternatively, the median value (median) of all data is determined. Here, the median is used as it is because it is a robust statistic that is hardly affected by an abnormal value. Whether to use the robust sample average or the median is based on the determination of whether to give priority to setting the data reference value to 0 or to give priority to robustness.

統計量として標本平均は、メジアンよりもロバスト性が著しく低い。これは、次の様な簡単な例から明白である。例えば、「Ａ＝1.5, 2.3, 1.3, 1.8, 2.0, 1.7, 1.9, 2.1, 2.2, 1.6, 1.4」というプロセスデータがあり、何らかの理由で異常データ「10」が混入して、「Ａｂ＝1.5, 2.3, 1.3, 1.8, 2.0, 1.7, 1.9, 2.1, 10, 1.6, 1.4」という様に計測されたと想定する。この場合、本当のデータＡの標本平均は「1.8」であり、メジアンも「1.8」である。しかし、計測されたデータＡｂの標本平均は、「2.5」となるが、メジアンは「1.8」のままである。このように、標本平均という統計量は、メジアンという統計量よりもロバスト性が弱く、異常データに対して大きく左右される。これを避けるために、プロセスデータの上下限付近のいくつかのデータを取り除いて計算するロバスト標本平均を用いることができる。しかし、上下限付近の何％のデータを取り除くかを、異常データに対する事前情報なしに決定することは困難である。この点においてメジアンは、極端に多くの異常データが存在したり、上限側あるいは下限側に極端に多量の異常データが存在しない限りはあまり異常データに左右されない。しかし、そもそもスケーリングでは、スケーリングされた後のデータの平均がおおよそゼロになる様に変換することが目的であるため、この点ではロバスト標本平均を用いる方が好ましい。このため、設計者がロバスト性を優先するか、スケーリング後の基準値を０に近くすることを優先するかによって、メジアンあるいはロバスト標本平均を使うことを決定することになる。 As a statistic, the sample mean is significantly less robust than the median. This is clear from the following simple example. For example, there is process data of “A = 1.5, 2.3, 1.3, 1.8, 2.0, 1.7, 1.9, 2.1, 2.2, 1.6, 1.4”, and abnormal data “10” is mixed for some reason, and “Ab = 1.5 , 2.3, 1.3, 1.8, 2.0, 1.7, 1.9, 2.1, 10, 1.6, 1.4 ”. In this case, the sample average of the true data A is “1.8”, and the median is also “1.8”. However, the sample average of the measured data Ab is “2.5”, but the median remains “1.8”. As described above, the statistical quantity called the sample average is less robust than the statistical quantity called the median, and greatly depends on the abnormal data. To avoid this, it is possible to use a robust sample average that is calculated by removing some data near the upper and lower limits of the process data. However, it is difficult to determine what percentage of data near the upper and lower limits is to be removed without prior information on abnormal data. In this respect, the median is not greatly affected by abnormal data unless there is an extremely large amount of abnormal data or an extremely large amount of abnormal data on the upper limit side or the lower limit side. However, since the purpose of scaling is to convert so that the average of the scaled data becomes approximately zero, it is preferable to use a robust sample average in this respect. For this reason, it is decided to use the median or the robust sample average depending on whether the designer gives priority to robustness or gives priority to making the scaled reference value close to zero.

一方、スケーリングパラメータは，予め上下限付近のいくつかのデータを除いた上で、下記式（１）に示すような標本標準偏差を採用する。

On the other hand, as the scaling parameter, after removing some data in the vicinity of the upper and lower limits in advance, a sample standard deviation as shown in the following formula (1) is adopted.

ここで、分散（標準偏差の２乗）を計算する場合には、不偏分散とするために「１／Ｎ」ではなく「１／（Ｎ−１）」とすることが多い。但し、不偏分散が最も良い推定であるとは限らないので、「１／Ｎ」または「１／（Ｎ−１）」とする。また、不偏分散の平方根は、不偏標準偏差ではなく、不偏標準偏差とするために別の係数を乗ずる場合もある。但し、データ数が十分に多ければ、結果として得られる値には殆ど差はない。 Here, when calculating the variance (square of the standard deviation), it is often set to “1 / (N−1)” instead of “1 / N” in order to obtain unbiased variance. However, since unbiased variance is not always the best estimate, it is set to “1 / N” or “1 / (N−1)”. Further, the square root of the unbiased variance may be multiplied by another coefficient in order to obtain an unbiased standard deviation instead of the unbiased standard deviation. However, if the number of data is sufficiently large, there is almost no difference in the resulting values.

前記の標本標準偏差を採用する理由は次の通りである。そもそもスケーリングとは、各種のセンサ１１〜１Ｎで計測される様な異なる物理量を、同じ尺度で扱うためにデータの動作範囲を揃えるものである。従って、各センサ１１〜１Ｎで計測している物理量の動作範囲が分かっているのであれば、その動作範囲（データレンジ）をそのままスケーリングパラメータとすることが一番素直であり、最も良い方法であると考えられる。しかし、データレンジという統計量は、異常値に対して極めて敏感である。即ち、少しでも異常データが混入すると、その値によってデータレンジは大幅に変化してしまう。例えば、前記の例では、プロセスデータＡのデータレンジは１（最小値1.3，最大値2.3）であるが、Ａｂのデータレンジは8.7（最小値1.3，最大値10）となってしまう。このために、前述した様に、プロセスデータの上下限付近のいくつかのデータを取り除いて計算することが考えられるが、上下限付近の何％のデータを取り除くかを、異常データに対する事前情報なしに決定することは困難である。 The reason for adopting the sample standard deviation is as follows. In the first place, the scaling is to align the operation range of data in order to handle different physical quantities as measured by various sensors 11 to 1N with the same scale. Therefore, if the operation range of the physical quantity measured by each sensor 11 to 1N is known, it is the most straightforward and best method to use the operation range (data range) as a scaling parameter as it is. it is conceivable that. However, the data range statistic is extremely sensitive to outliers. In other words, if abnormal data is mixed in even a little, the data range will change greatly depending on the value. For example, in the above example, the data range of the process data A is 1 (minimum value 1.3, maximum value 2.3), but the data range of Ab is 8.7 (minimum value 1.3, maximum value 10). For this reason, as described above, it may be possible to calculate by removing some data near the upper and lower limits of the process data, but there is no prior information on abnormal data as to what percentage of data near the upper and lower limits is removed. It is difficult to determine.

このようなデータレンジと比較すると、標本標準偏差はロバストな統計量であり、データの動作範囲を示す一つの指標となるため、通常スケーリングパラメータとしてよく利用される。ここで、仮にデータが正規分布に従うならば、標本標準偏差の３倍で99.7％のデータをカバーするので、標本標準偏差の３倍程度がデータレンジに相当する。しかし、データが正規分布に従うかどうかは分からないので、データレンジと標本標準偏差の対応関係は一般に不明である。但し、スケーリングは複数の異なる物理量同士を同じ指標で見るために実施するものであるから、標本標準偏差とデータレンジを対応づける必要は特にない。また、標本標準偏差は、データレンジと比較するとロバストな統計量であるが、異常データにある程度左右されるため、好ましくはロバスト標本標準偏差を用いる。 Compared to such a data range, the sample standard deviation is a robust statistic and is an index indicating the operating range of data, so it is often used as a normal scaling parameter. Here, if the data follow a normal distribution, 99.7% of the data is covered by 3 times the sample standard deviation, so about 3 times the sample standard deviation corresponds to the data range. However, since it is not known whether the data follows a normal distribution, the correspondence between the data range and the sample standard deviation is generally unknown. However, since the scaling is performed in order to see a plurality of different physical quantities with the same index, it is not particularly necessary to associate the sample standard deviation with the data range. In addition, the sample standard deviation is a robust statistic compared to the data range. However, since it depends to some extent on abnormal data, the robust sample standard deviation is preferably used.

また仮に、正規化用事前情報入力部４１１から、図２に示す事前情報（ｂ）が入力されると、パラメータ決定部４１２は、予め殆ど全てのデータが正常であることが分かっているため、その情報（ｂ）を利用してパラメータを設定する。具体的には、パラメータ決定部４１２は、シフトパラメータについては、前述した理由により、予め殆ど全てのデータが正常であるとわかっているため、標本平均を用いる。但し、多少の異常値が混入していることを想定して、上下限付近の例えば０．５％程度のデータを除いたロバスト標本平均を用いても良い。 Further, if the prior information (b) shown in FIG. 2 is input from the normalization prior information input unit 411, the parameter determination unit 412 knows that almost all data is normal in advance. The parameter is set using the information (b). Specifically, the parameter determination unit 412 uses a sample average for the shift parameter because it is known in advance that almost all data is normal for the reasons described above. However, assuming that some abnormal values are mixed, a robust sample average excluding data of about 0.5% near the upper and lower limits may be used.

一方、パラメータ決定部４１２は、スケーリングパラメータについては、予め殆ど全てのデータが正常であるということが分かっているので、念のためプロセスデータの最大値の付近及び最小値の付近の例えば０．５％程度のデータを取り除いた上で、そのプロセスデータの動作範囲を直接スケーリングパラメータとする。 On the other hand, since the parameter determination unit 412 knows that almost all data is normal in advance for the scaling parameter, for example, the vicinity of the maximum value and the minimum value of the process data, for example, 0.5 After removing about% data, the operation range of the process data is directly set as a scaling parameter.

また仮に、正規化用事前情報入力部４１１から、図２に示す事前情報（ｃ）が入力されると、パラメータ決定部４１２は、異常データと正常データが混在し，異常データの割合は分からないため、その情報（ｃ）を利用して、予め正常データと異常データの閾値をクラスタリングの方法を用いて決定しておく。クラスタリングとは、複数の異なる集合（クラス）に属していると考えられるデータが混在している場合に、各々のデータがどのクラスに属するかを決定する方法の総称である。この場合、正常データと異常データの２つのクラスに属しているデータが混在していると想定されるので、２クラスのクラスタリング手法を用いる。 Further, if the prior information (c) shown in FIG. 2 is input from the normalization prior information input unit 411, the parameter determination unit 412 has a mixture of abnormal data and normal data, and the ratio of abnormal data is unknown. Therefore, using the information (c), threshold values for normal data and abnormal data are determined in advance using a clustering method. Clustering is a general term for a method for determining which class each data belongs to when data considered to belong to a plurality of different sets (classes) are mixed. In this case, since it is assumed that data belonging to two classes of normal data and abnormal data are mixed, a two-class clustering method is used.

ここで、事前情報（ａ）の場合にも、このような２クラスのクラスタリング手法を適用できる様に思われるが、実際にはそうではない。この理由は、２クラスのクラスタリング手法は、正常データと異常データの２つのクラスのデータが混在している状況下で効果的に働くものであるため、もし事前情報が何もない場合に、２クラスのクラスタリング手法を適用した場合、データに殆ど異常データが混入していない場合に正常データを強引に２つに分割してしまうことになるためである。 Here, even in the case of prior information (a), it seems that such a two-class clustering method can be applied, but this is not the case. This is because the two-class clustering method works effectively in a situation where normal data and abnormal data are mixed, so if there is no prior information, 2 This is because when the class clustering method is applied, normal data is forcibly divided into two when abnormal data is hardly mixed in the data.

このようなクラスタリング手法は、様々な方法が開発されているので、任意の適切なものを用いれば良いが、例えば、「大津の閾値決定法」と呼ばれる一種の判別分析手法を用いることができる。「大津の閾値決定法」とは、２つの異なるクラスに属するデータが混合している際、２つのクラス同士のクラス間分散が最大となる様に閾値を設定する方法である。換言すれば、各々のクラスのクラス内の分散の平均値が、最小になる様に閾値を設定する方法である。すなわち、２つのクラスに属するデータを、お互いになるべく遠ざけるように閾値を決定する方法である。具体的には、以下の式（２）を最小化する様な閾値を求めることである。

Since various methods have been developed for such a clustering method, any appropriate method may be used. For example, a kind of discriminant analysis method called “Otsu's threshold determination method” can be used. The “Otsu's threshold determination method” is a method of setting a threshold value so that the inter-class variance between two classes is maximized when data belonging to two different classes are mixed. In other words, the threshold value is set so that the average value of the variance within each class is minimized. That is, this is a method of determining a threshold value so that data belonging to two classes are as far as possible from each other. Specifically, a threshold value that minimizes the following expression (2) is obtained.

ここで、ω1とω2は、各々、クラス１とクラス２に属するデータの割合（クラス発生確率）である。σ1とσ2は、各々、クラス１とクラス２のクラス内の分散である。これを、図３に示す。 Here, ω1 and ω2 are the ratios (class occurrence probabilities) of data belonging to class 1 and class 2, respectively. σ 1 and σ 2 are the variances in class 1 and class 2, respectively. This is shown in FIG.

この大津の閾値決定法は、２つのクラスのデータが各々正規分布に従っているという条件、２つのクラスに属するデータ割合に著しい差がないという条件、さらに２つのクラスの正規分布の分散が等しいという条件下での閾値の最尤推定値となることが示されている。ここで、最尤推定値とは、統計的推定法に含まれる最尤推定法による推定値である。 This Otsu threshold determination method is based on the condition that the two classes of data each follow a normal distribution, the condition that there is no significant difference in the proportion of data belonging to the two classes, and the condition that the variances of the normal distributions of the two classes are equal. It is shown that this is the maximum likelihood estimate of the threshold below. Here, the maximum likelihood estimated value is an estimated value by the maximum likelihood estimation method included in the statistical estimation method.

通常では、正常データの割合は、異常データの割合よりかなり大きいため、前記の２つのクラスに属するデータ割合に著しい差がないという条件を満たさない。このように２つのクラスに属するデータ割合に著しい差がある場合には、前記式（２）の代わりに、以下の式（３）を最小化することが好ましい。

Usually, since the ratio of normal data is considerably larger than the ratio of abnormal data, the condition that there is no significant difference in the ratio of data belonging to the two classes is not satisfied. Thus, when there is a significant difference in the ratio of data belonging to the two classes, it is preferable to minimize the following expression (3) instead of the expression (2).

ここで、「ω1・σ1＋ω2・σ2」の最小化は、「log(ω1・σ1＋ω2・σ2)」の最小化と同じ事であることに注意すれば、式（３）は式（２）に対して、「-ω1・log(ω1) -ω2・log(ω2)」という補正項を加えたものになっている。これを以下では、修正大津の閾値決定法と呼ぶことにする。 Note that the minimization of “ω1 · σ1 + ω2 · σ2” is the same as the minimization of “log (ω1 · σ1 + ω2 · σ2)”. Thus, a correction term “-ω1 · log (ω1) -ω2 · log (ω2)” is added. Hereinafter, this is referred to as a modified Otsu threshold determination method.

このように、大津の閾値決定法や修正大津の閾値決定法などの２クラスのクラスタリング法を適用することによって、正常データと異常データの閾値を決定することができる。パラメータ決定部４１２は、この決定を、各種のセンサ１１〜１Ｎで計測される全ての計測項目に対して実行する。図４は、この様にして正常データと異常データを分類する閾値４１０を決定の概念を示す図である。図４に示すように、クラスタリング手法によって、プロセス監視モデル構築用のプロセスデータは、正常データと異常データに暫定的に分類できるので、正常データの部分４００のみを用いて、正常データの標本平均をシフトパラメータとし、正常データの標本標準偏差をスケーリングパラメータとする。 In this way, by applying a two-class clustering method such as the Otsu threshold determination method or the modified Otsu threshold determination method, the threshold values for normal data and abnormal data can be determined. The parameter determination unit 412 performs this determination for all measurement items measured by the various sensors 11 to 1N. FIG. 4 is a diagram showing the concept of determining the threshold value 410 for classifying normal data and abnormal data in this way. As shown in FIG. 4, since the process data for constructing the process monitoring model can be provisionally classified into normal data and abnormal data by the clustering method, the sample average of normal data is calculated using only the normal data portion 400. A shift parameter is used, and a sample standard deviation of normal data is used as a scaling parameter.

なお、プロセスデータが上限方向と下限方向の両方向に異常が出る可能性のあるデータである場合には、予めプロセスデータの絶対値を計算した上で、大津の閾値決定法や修正大津の閾値決定法などの２クラスのクラスタリング法を適用する。この場合、結果として、図５に示す様なイメージで正常データと異常データを分類できる。 If the process data is data that may cause anomalies in both the upper limit and lower limit directions, calculate the absolute value of the process data in advance, and then determine the threshold value for Otsu and the threshold value for modified Otsu. Two classes of clustering methods such as the method are applied. In this case, as a result, normal data and abnormal data can be classified with an image as shown in FIG.

さらに、正規化用事前情報入力部４１１から、図２に示す事前情報（ｄ）が入力されると、パラメータ決定部４１２は、正常データが混在している異常データの大よその割合を認識できる。この場合には、前述したように、異常データの割合が追加で入力される。例えば、異常データの割合が５％と入力された場合には、もしプロセスデータが片側方向にしか異常が出ない様なデータである場合には、プロセスデータの最大値（あるいは最小値）から５％のデータを含む点を閾値とする（図４の４１０を参照）。 Furthermore, when the prior information (d) shown in FIG. 2 is input from the normalization prior information input unit 411, the parameter determination unit 412 can recognize the approximate proportion of abnormal data in which normal data is mixed. . In this case, as described above, the ratio of abnormal data is additionally input. For example, when the ratio of abnormal data is entered as 5%, if the process data is such that an abnormality occurs only in one direction, the maximum (or minimum) value of the process data is 5 The threshold value is a point including% data (see 410 in FIG. 4).

また、プロセスデータが両側方向に異常が出る可能性のあるデータである場合には、プロセスデータの最大値および最小値から各々２．５％のデータを含む点を閾値とする（図５の５１０、５１１を参照）。このような操作により、正常データの範囲５００を決定することができるため、正常データと見なされたデータの標本平均をシフトパラメータとし、標本標準偏差をスケーリングパラメータとする。 In addition, when the process data is data that may be abnormal in both directions, the threshold value is a point that includes 2.5% of data from the maximum value and the minimum value of the process data (510 in FIG. 5). 511). As a result of such an operation, the normal data range 500 can be determined. Therefore, the sample average of data regarded as normal data is used as a shift parameter, and the sample standard deviation is used as a scaling parameter.

以上に手順により、パラメータ決定部４１２は、正規化用事前情報入力部４１１から入力される事前情報に基づいて、各種のプロセスデータの正規化に必要なシフトパラメータ（ａｉ）及びスケーリングパラメータ（ｂｉ）の値を決定する。即ち、プロセスデータ正規化モデル生成部４１は、プロセスデータの事前情報に基づいて、正規化モデルによりプロセスデータを正規化する。 According to the procedure described above, the parameter determination unit 412 performs the shift parameter (ai) and scaling parameter (bi) necessary for normalization of various process data based on the prior information input from the normalization prior information input unit 411. Determine the value of. That is, the process data normalization model generation unit 41 normalizes the process data with the normalization model based on the prior information of the process data.

ここで、プロセスデータの適切な正規化と、異常診断との関係について簡単に説明する。正規化という手段は、通常では、値が全く異なる複数の物理量を同じ尺度で評価できるように変換する手段である。この操作が適切でないと、複数の物理量を同じ尺度で評価できないことになり、結果として、異常検出や異常検出をした後の異常要因となるセンサ（＝物理量）の特定を誤ることになる。例えば、Ａという物理量の動作範囲が０〜１０であり、Ｂという物理量の動作範囲が０〜１００である場合に、Ｂを「１／１０」にすることにより、両者を同じ尺度で評価できる。しかし、仮に正規化を間違えて、Ｂをそのまま用いてしまった場合には、ＡはＢに対して非常に小さく評価されることになる。したがって、Ａの量で異常が生じた場合でも、異常が検出されなくなることが多くなる。また、異常が検出された場合であっても、Ｂの方が大きい動作範囲であるため、ほとんどの場合、その異常要因はＢと判断されることになる。このため、プロセスデータの適切な正規化は、異常検出精度や異常要因特定精度を向上させる上で極めて重要である。 Here, the relationship between appropriate normalization of process data and abnormality diagnosis will be briefly described. The means of normalization is usually means for converting a plurality of physical quantities having completely different values so that they can be evaluated on the same scale. If this operation is not appropriate, a plurality of physical quantities cannot be evaluated on the same scale, and as a result, the detection of an abnormality or a sensor (= physical quantity) that becomes an abnormal factor after detecting an abnormality is erroneous. For example, when the operation range of the physical quantity A is 0 to 10 and the operation range of the physical quantity B is 0 to 100, both can be evaluated on the same scale by setting B to “1/10”. However, if normalization is mistaken and B is used as it is, A will be evaluated very small with respect to B. Therefore, even when an abnormality occurs in the amount of A, the abnormality is often not detected. Even when an abnormality is detected, since B is a larger operating range, the abnormality factor is determined to be B in most cases. For this reason, proper normalization of process data is extremely important in improving the abnormality detection accuracy and the abnormality factor identification accuracy.

次に、モデル構築部４では、異常検出モデル生成部４２は、プロセスデータ正規化モデル生成部４１によりプロセス監視モデル構築用のプロセスデータを正規化するパラメータが決定されると、当該パラメータを用いてプロセス監視モデル構築用のプロセスデータを正規化する。そして、異常検出モデル生成部４２は、当該正規化データを利用して、異常検出用モデルを構築する。 Next, in the model construction unit 4, when the parameter for normalizing the process data for constructing the process monitoring model is determined by the process data normalization model generation unit 41, the abnormality detection model generation unit 42 uses the parameter. Normalize process data for building process monitoring models. Then, the abnormality detection model generation unit 42 constructs an abnormality detection model using the normalized data.

具体的には、異常検出モデル生成部４２は、当該正規化データを横（行）方向に各種センサ１１〜１Ｎに対応する変量を設定し、縦（列）方向に時間を考えた行列Ｘに設定する。次に、ＰＣＡを用いて、以下の式（４）に示す様にデータ分解を実行する。

Specifically, the abnormality detection model generation unit 42 sets variables corresponding to the various sensors 11 to 1N in the horizontal (row) direction and sets the normalized data to a matrix X that considers time in the vertical (column) direction. Set. Next, data decomposition is executed using PCA as shown in the following equation (4).

前記式（４）は、元のデータ行列を、図６に示すように分解することと等価である。このように分解されたデータに対して、前記のローディング行列Ｐが、Ｑ統計量やホテリングのＴ^２統計量を計算するための元のモデルとなる。このローディング行列Ｐを、プロセス監視モデルと考えても良いが、実際には、異常検出を行うＱ統計量とホテリングのＴ^２統計量は次式（５），（６）で計算される。

Equation (4) is equivalent to decomposing the original data matrix as shown in FIG. For the data thus decomposed, the loading matrix P becomes the original model for calculating the Q statistic and the Hotelling T ² statistic. The loading matrix P, may be considered as a process monitor model, in practice, T ² statistic Q statistic and Hotelling performing abnormality detection by the following equation (5) is calculated by (6).

ここで、∧は、ＰＣＡ（主成分分析）による各主成分の分散を対角要素として持つ行列であり、分散を正規化していることを意味する。また、Ｉは適当なサイズの単位行列である。また、ｘ（ｔ）は、行列Ｘのｔ番目の要素である。実際のプロセス監視では、このｘ（ｔ）がオンラインで計測されてくるプロセスデータに置き換わって計算されるため、前記式（５），（６）が異常検出モデル生成部４２で生成されるモデルである。 Here, ∧ is a matrix having the variance of each principal component by PCA (principal component analysis) as a diagonal element, and means that the variance is normalized. I is a unit matrix of an appropriate size. X (t) is the t-th element of the matrix X. In actual process monitoring, this x (t) is calculated by replacing it with process data that is measured online, so the above equations (5) and (6) are models generated by the abnormality detection model generation unit 42. is there.

ここで、前記式（５），（６）で定義されるＱ統計量とホテリングのＴ^２統計量の持つ意味について、図７を参照して簡単に説明する。 Here, the meaning of the Q statistic defined by the equations (5) and (6) and the Hotelling T ² statistic will be briefly described with reference to FIG.

ＰＣＡを用いた異常診断方法は、複数の計測変数に含まれる相関情報を利用して異常検出を行う方法である。通常では、プロセスで計測される各変数の間には、何らかの相関があり、相互に完全に独立な情報は少ない。図７は、便宜的に２変数の場合を想定し、２つの変数Ｘ１とＸ２が相関を有する場合、Ｘ１の計測データとＸ２の計測データをＸ１−Ｘ２平面上にプロットした場合を示す。 The abnormality diagnosis method using PCA is a method of performing abnormality detection using correlation information included in a plurality of measurement variables. Normally, there is some correlation between the variables measured in the process, and there is little information that is completely independent of each other. FIG. 7 shows a case where two variables X1 and X2 are correlated for convenience, and the measurement data of X1 and the measurement data of X2 are plotted on the X1-X2 plane.

ここで、例えば、仮にＸ１のセンサが故障した場合、この相関関係が崩れるため、三角で示した様な点にデータがプロットされる。このように、相関関係を表す相関軸からのずれを表す量が、Ｑ統計量の持つ意味である。一方、例えば、プロセスの負荷が急激に高くなった場合などでは、Ｘ１とＸ２は相関を持っているので、Ｘ１とＸ２の値がともに大きくなる（あるいは小さくなる）という事が生じる。このような場合、四角で表した様な点にデータがプロットされる。 Here, for example, if the X1 sensor fails, the correlation is lost, so data is plotted at points as indicated by triangles. Thus, the quantity representing the deviation from the correlation axis representing the correlation is the meaning of the Q statistic. On the other hand, for example, when the process load suddenly increases, X1 and X2 have a correlation, so that the values of X1 and X2 both increase (or decrease). In such a case, the data is plotted at points as represented by squares.

即ち、相関関係は崩れないが、通常の動作範囲以上に値が大きくなったり、小さくなったりする量を測る指標がホテリングのＴ^２統計量である。換言すれば、複数項目のデータ分布のデータが、分布しない方向への乖離を表す指標がＱ統計量である。また、複数項目のデータ分布のデータが、分布している方向への分散を表す量がホテリングのＴ^２統計量である。 That is, although the correlation does not collapse, an index for measuring the amount by which the value increases or decreases beyond the normal operating range is the Hotelling T ² statistic. In other words, the Q statistic is an index representing a deviation in a direction in which the data distribution of a plurality of items is not distributed. Further, the amount of distribution in the direction in which the data of the data distribution of a plurality of items is distributed is the Hotelling T ² statistic.

以上要するに、モデル構築部４では、プロセス正規化モデル生成部４１は、正規化用のシフトパラメータとスケーリングパラメータの値を決定する。異常検出モデル生成部４２は、異常検出を行う際に必要となる計算モデルを決定し、正規化の計算式、Ｑ統計量およびホテリングのＴ^２統計量の計算式を生成する。 In short, in the model construction unit 4, the process normalization model generation unit 41 determines the values of the normalization shift parameter and scaling parameter. The anomaly detection model generation unit 42 determines a calculation model necessary for performing anomaly detection, and generates a normalization calculation formula, a Q statistic, and a hotelling T ² statistic calculation formula.

次に、異常判定基準設定部５は、異常検出モデルである前記式（５），（６）を使用して異常判定を行うための閾値を決定する。この際、プロセス監視モデル構築用データである正規化データＸを用いて、前記式（５），（６）により計算したプロセス監視モデル構築用データのＱ統計量およびホテリングのＴ^２統計量を利用する。以下、異常判定基準設定部５による異常判定基準設定の方法を説明する。 Next, the abnormality determination criterion setting unit 5 determines a threshold value for performing abnormality determination using the equations (5) and (6) which are abnormality detection models. At this time, using the normalized data X which is the process monitoring model construction data, the Q statistic of the process monitoring model construction data and the hotelling T ² statistic calculated by the formulas (5) and (6) are used. To do. Hereinafter, a method of setting an abnormality determination criterion by the abnormality determination criterion setting unit 5 will be described.

まず、閾値決定用事前情報入力部５１は、異常検出モデル生成部４２により生成したＱ統計量およびホテリングのＴ^２統計量の計算式（モデル）を使用することで、当該各統計量に含まれる正常データと異常データの割合に対する事前情報を入力する。なお、Ｑ統計量やホテリングのＴ^２統計量は、異常検出を行うための統計量であるため、これについての事前情報を取得できない場合には、プロセスデータに関する事前情報で代用してもよい。代用する場合には、事前情報入力部４１１から入力される事前情報をそのまま利用することになる。 First, the threshold determination prior information input unit 51 uses the calculation formula (model) of the Q statistic and the hotelling T ² statistic generated by the abnormality detection model generation unit 42 to be included in each statistic. Enter prior information for the ratio of normal data to abnormal data. Since the Q statistic and the hotelling T ² statistic are statistics for detecting anomalies, if prior information about this cannot be acquired, prior information on process data may be substituted. In the case of substitution, the prior information input from the prior information input unit 411 is used as it is.

また、プロセスデータではなく、直接的にＱ統計量やホテリングのＴ^２統計量に関する事前情報を入力したい場合には、図２に示したものと同様の入力用画面情報をＱ統計量とホテリングのＴ^２統計量に対しても作成し、事前情報入力部４１１と全く同様の方法で、当該事前情報を入力する。プロセスデータとは異なり、通常では、Ｑ統計量やホテリングのＴ^２統計量に関する事前情報が明確に分かっていることは少ないが、プロセスで計測されるデータがいくつであっても、当該各統計情報はそれぞれ１つの計２つのデータになるので、プロセス監視モデル構築用データの正規化データＸを用いて計算したＱ統計量とホテリングのＴ^２統計量をプロットすることによって判断しても良い。 Further, instead of the process data directly if you want to enter a pre-information on T ² statistic Q statistic and Hotelling is similar input screen information to that shown in FIG. 2 Q statistic and Hotelling A T ² statistic is also created, and the prior information is input in the same manner as the prior information input unit 411. Unlike process data, in general, prior information on Q statistics and hotelling T ² statistics is rarely known, but each piece of statistical information can be measured regardless of how much data is measured in the process. since become a single total two data respectively, it may be determined by plotting the T ² statistic calculated Q statistic and Hotelling using normalized data X of the data for the construction process monitoring model.

このような可視化を行えば、異常データが混在してそうか否かなどの定性的な情報を取得できることが多い。正規化を行う場合のプロセスデータに対しても可視化を実行して事前情報を取得しても良いが、通常では、プロセスデータは、各種センサ１１〜１Ｎで計測される項目に応じて少なくとも数十変数以上のデータである。従って、このようなデータに対して可視化を実行して、事前情報を得ることは非現実的である。これに対して、Ｑ統計量やホテリングのＴ^２統計量は、プロセスデータの計測変数の数がいくつであっても常に各々１つのデータであるので、可視化によって事前情報を取得しやすい。当該事前情報を取得できるのであれば、閾値決定用事前情報入力部５１は、事前情報入力部４１１と同様の処理を実行する。 If such visualization is performed, it is often possible to acquire qualitative information such as whether or not abnormal data is mixed. Although prior information may be acquired by executing visualization on process data in the case of normalization, the process data is usually at least several tens according to items measured by various sensors 11 to 1N. The data is more than a variable. Therefore, it is unrealistic to obtain prior information by performing visualization on such data. On the other hand, the Q statistic and the Hotelling T ² statistic are always one data regardless of the number of process data measurement variables, and thus it is easy to obtain prior information by visualization. If the prior information can be acquired, the threshold determination prior information input unit 51 performs the same processing as the prior information input unit 411.

次に、閾値決定部５２は、閾値決定用事前情報入力部５１から入力される事前情報に従って、異常判定のための閾値を決定する。 Next, the threshold determination unit 52 determines a threshold for abnormality determination according to the prior information input from the threshold determination prior information input unit 51.

まず、事前情報が何も無い場合（Ｑ統計量やホテリングのＴ^２統計量に関する情報は不明）には、閾値決定部５２は、デフォルトの設定法として、Ｑ統計量の統計的信頼限界値とホテリングのＴ^２統計量に関する統計的信頼限界値（公知の計算式により算出）を用いる。また、事前情報として、殆ど全てのデータが正常であるという情報が入力された場合には、閾値決定部５２は、原理的に、プロセス監視モデル構築用データの正規化データＸを用いて計算したＱ統計量やホテリングのＴ^２統計量の最大値を閾値として決定する。但し、ごく僅かに異常データが含まれている可能性があるので、例えば９９％信頼区間の考え方を採用して、Ｑ統計量およびホテリングのＴ^２統計量の最大値から上位１％の値を閾値として決定する方法の方が好ましい。 First, when there is no prior information (information regarding the Q statistic and the Hotelling T ² statistic is unknown), the threshold determination unit 52 uses the statistical confidence limit value of the Q statistic as a default setting method. statistical confidence limits about the T ² statistic Hotelling a (calculated by a known calculation formula) is used. Further, when information indicating that almost all data is normal is input as prior information, the threshold value determination unit 52, in principle, calculates using the normalized data X of the process monitoring model construction data. The maximum value of the Q statistic and the hotelling T ² statistic is determined as a threshold value. However, since there is a possibility that anomalous data is included in a slight amount, for example, the concept of 99% confidence interval is adopted, and the top 1% value from the maximum value of the Q statistic and the hotelling T ² statistic is selected. The method of determining the threshold value is preferable.

さらに、事前情報として、異常データと正常データが混在しているが異常データの割合は不明であるという情報が入力された場合には、閾値決定部５２は、パラメータ決定部４１２と同様に、例えば、修正大津の閾値決定法などを用いて、Ｑ統計量およびホテリングのＴ^２統計量の閾値を決定する。これは、単に修正大津の閾値決定法などのクラスタリングアルゴリズムを適用するデータを、プロセスデータではなく、正規化データＸを用いて計算したＱ統計量およびホテリングのＴ^２統計量のデータに変更するだけで実行できる。 Further, when information indicating that abnormal data and normal data are mixed but the ratio of abnormal data is unknown is input as prior information, the threshold determination unit 52, for example, similar to the parameter determination unit 412, The threshold value of the Q statistic and the hotelling T ² statistic is determined by using a modified Otsu threshold determination method or the like. This simply changes the data to which a clustering algorithm such as the modified Otsu's threshold determination method is applied to Q statistics calculated using normalized data X and hotelling T ² statistics data instead of process data. It can be executed with.

また、事前情報として、異常データと正常データが混在しており、異常データの大よその割合は分かっているという情報が入力された場合には、閾値決定部５２は、パラメータ決定部４１２と同様の処理を実行する。即ち、閾値決定部５２は、正規化データＸを用いて計算したＱ統計量やホテリングのＴ^２統計量のデータの上位α％（αは事前情報として入力した値）に該当する各値を、各々の統計量の閾値として決定する。 In addition, when information indicating that abnormal data and normal data are mixed and the approximate ratio of abnormal data is known is input as prior information, the threshold determination unit 52 is the same as the parameter determination unit 412. Execute the process. That is, the threshold determination unit 52, a respective value corresponding to the upper alpha% of the data of the calculated Q statistic and Hotelling T ² statistic using normalized data X (alpha is entered as a priori information value), It is determined as a threshold value for each statistic.

以上要するに、異常判定基準設定部５では、閾値決定部５２は、閾値決定用事前情報入力部５１から入力される事前情報に従って、プロセス１の正常と異常を判断する閾値を決定する。以上の一連の操作は、プロセス監視モデルを構築し、異常判断の基準を設定するためのものであり、通常はオフラインで行われる。 In short, in the abnormality determination criterion setting unit 5, the threshold determination unit 52 determines a threshold for determining whether the process 1 is normal or abnormal according to the prior information input from the threshold determination prior information input unit 51. The above series of operations is for constructing a process monitoring model and setting a criterion for abnormality determination, and is normally performed offline.

一方、以下で示す作用は、オンラインで実行される。まず、プロセス監視部６は、モデル構築部４で構築したプロセス監視モデルを使用して、プロセス１の監視を実行する。プロセス監視部６は、監視するプロセス変数として、各種センサ１１〜１Ｎそのものではなく、これらの各センサから新たに生成されたＱ統計量やホテリングのＴ^２統計量である。ここでは、少なくとも、これらのいずれか一方を監視する。通常では、両方を監視し、必要に応じて他の変数、例えば各センサ１１〜１Ｎそのものや、ＰＣＡによって計算されたスコアを平面上にプロットして監視することもできる。 On the other hand, the following actions are executed online. First, the process monitoring unit 6 monitors the process 1 using the process monitoring model constructed by the model construction unit 4. Process monitoring unit 6, a process variable to be monitored, various sensors 11~1N not itself a T ² statistic Q statistic and Hotelling newly generated from each of these sensors. Here, at least one of these is monitored. Usually, both are monitored, and if necessary, other variables, for example, the sensors 11 to 1N themselves and the score calculated by the PCA can be plotted on a plane and monitored.

プロセス監視部６は、前述したＱ統計量とホテリングのＴ^２統計量を、プロセス計測変数収集・保存部２からオンラインで送信される各種センサ１１〜１Ｎの値（ある時刻のこれらのセンサ値からなるベクトルをＸnew(t)とする)と、モデル構築部４で構築したプロセス監視モデルを用いて、下記式（７），（８）に示すような計算を実行する。

The process monitoring unit 6 uses the values of the various sensors 11 to 1N that are transmitted online from the process measurement variable collection / storage unit 2 (from these sensor values at a certain time) as the above-described Q statistics and hotelling T ² statistics. And a process monitoring model constructed by the model construction unit 4 is used to perform calculations as shown in the following formulas (7) and (8).

これらの式（７），（８）と前記式（５），（６）の相違は、単にＱ統計量とホテリングのＴ^２統計量を計算するために入力するデータが異なるだけである。オンラインのプロセス監視では、既に構築したプロセス監視モデルに対して、単にオンラインで計測したプロセスデータを入力すればよい。このように計算されるＱ（Ｘnew(t)）やＴ^２（Ｘnew(t)）はスカラー値であるので、この値をトレンドグラフとして表示することで監視する事ができる。この際、異常判定基準設定部５で設定したＱ統計量やホテリングのＴ^２統計量の閾値（限界レベル：ＣＬ）も同時にプロットすることもできる。 The difference between these formulas (7) and (8) and the above formulas (5) and (6) is simply the difference in the data input to calculate the Q statistic and the Hotelling T ² statistic. In online process monitoring, it is only necessary to input process data measured online to a process monitoring model that has already been established. Since Q (Xnew (t)) and T ² (Xnew (t)) calculated in this way are scalar values, they can be monitored by displaying these values as a trend graph. In this case, failure determination reference Q set by the setting unit 5 statistic and Hotelling T ² statistic threshold (limit level: CL) also may be plotted simultaneously.

次に、プロセス異常診断部７は、Ｑ統計量やホテリングのＴ^２統計量が、異常判定基準設定部５で設定したＱ統計量やホテリングのＴ^２統計量の限界レベル（ＣＬ）を超えた場合に異常と判断する。この際、異常判定基準設定部５で設定したＱ統計量やホテリングのＴ^２統計量の限界レベル以外に、例えば、別途警告レベル（ＷＬ）や注意レベル（ＮＬ）を設けた異常診断ルールに基づいた判断でもよい。 Next, the process error diagnostic portion 7, T ² statistic Q statistic and Hotelling exceeded the abnormal set by criterion setting unit 5 Q statistic and Hotelling T ² statistic limit level (CL) If it is judged abnormal. At this time, in addition to the abnormality determination Q statistic was set by the reference setting unit 5 and the Hotelling T ² statistic threshold level, for example, based on the abnormality diagnostic rule separately provided warning level (WL) and attention level (NL) Judgment may be made.

具体的には、プロセス異常診断部７は、例えば、警告レベル（ＷＬ）を限界レベルの「２／３」と設定し、注意レベル（ＮＬ）を限界レベルの「１／３」と設定して、以下の様な診断ルールで正常と異常を判断する。即ち、（１）１点がＣＬを超えた時、（２）連続する３点のうち、２点以上がＷＬを超えた時、（３）連続する５点のうち４点以上が１σ（標準偏差）を超えた時、（４）連続する８点のデータが単調増加あるいは単調減少する場合、（５）非ランダム性を持つなんらかのパターンが現れた場合に、プロセス異常診断部７は、異常と判断する。 Specifically, for example, the process abnormality diagnosis unit 7 sets the warning level (WL) to “2/3” of the limit level and sets the attention level (NL) to “1/3” of the limit level. The normality and the abnormality are judged by the following diagnostic rule. (1) When one point exceeds CL, (2) Of two consecutive points, two or more points exceed WL, (3) Four or more points among five consecutive points are 1σ (standard) When (deviation) is exceeded, (4) when 8 consecutive data monotonously increase or monotonically decrease, (5) when some pattern with non-randomness appears, the process abnormality diagnosis unit 7 to decide.

ここで、前記の様な複雑な異常診断ルールが必要でない場合や、これによってＢタイプの誤り（後述する図１３，１４を参照）が増加してしまうようであれば、単純に異常判定基準設定部５で設定したＱ統計量やホテリングのＴ^２統計量の閾値を限界レベル（ＣＬ）として異常診断を行なう。要するに、プロセス異常診断部７は、以上の手順によってプロセス１の正常状態と異常状態を判断する。 Here, if a complicated abnormality diagnosis rule as described above is not necessary, or if a B type error (see FIGS. 13 and 14 to be described later) increases due to this, simply set the abnormality criterion. the threshold T ² statistic Q statistic and Hotelling set in part 5 performs an abnormality diagnosis as a limit level (CL). In short, the process abnormality diagnosis unit 7 determines the normal state and abnormal state of the process 1 by the above procedure.

次に、プロセス異常要因センサ特定部８は、プロセス異常診断部７によりプロセス１が異常状態であると診断されたときに、図８に示すように、異常と判断されたＱ統計量あるいはホテリングのＴ^２統計量に対する寄与プロットを行うことにより、その要因となるセンサを特定する。図８は、各バーグラフが各種のセンサ１１〜１Ｎに対応しており、このバーの長さが最も長いものが異常である要因となるセンサ（ここでは、硝酸センサ）を示している。但し、ここでは、必ずしも図８に示す様に一つの要因だけが大きくなっているとは限らない。例えば、図９に示すように、いくつかのセンサが通常でないと考えられる場合がある。このような場合には、プロセス異常要因センサ特定部８は、複数のセンサを要因として特定する。 Next, when the process abnormality diagnosis unit 7 diagnoses that the process 1 is in an abnormal state, the process abnormality factor sensor specifying unit 8 determines the Q statistic or hoteling determined to be abnormal as shown in FIG. By performing a contribution plot with respect to the T ² statistic, the sensor that is the factor is specified. FIG. 8 shows a sensor (here, a nitric acid sensor) in which each bar graph corresponds to various sensors 11 to 1N, and the bar having the longest length is a cause of abnormality. However, only one factor is not necessarily increased here as shown in FIG. For example, as shown in FIG. 9, some sensors may be considered unusual. In such a case, the process abnormality factor sensor specifying unit 8 specifies a plurality of sensors as factors.

次に、プロセス異常要因推定部９は、プロセス異常要因センサ特定部８によって特定されたセンサの情報に基づいて、プロセス１の本当の異常要因を推定する。プロセス異常要因推定部９は、具体的には、例えば過去の経験やシナリオシミュレーションなどに基づいて構築したエキスパートシステムとして推論を実行する。 Next, the process abnormality factor estimation unit 9 estimates the true abnormality factor of the process 1 based on the sensor information identified by the process abnormality factor sensor identification unit 8. Specifically, the process abnormality factor estimation unit 9 executes inference as an expert system constructed based on, for example, past experience or scenario simulation.

具体的には、プロセス異常要因推定部９は、図８に示すように、一つのセンサ（ここでは硝酸センサ）だけが傑出して大きな値を示している場合には、そのセンサの故障であると推定する。また、センサによっては、故障信号を出すことができる様に構成されているものも多いので、そこから得られる故障信号との論理和あるいは論理積などを取ることによって、そのセンサの異常を推定する。図９は、アンモニアを除去する硝化菌という微生物が活動しなくなった硝化阻害と呼ばれる現象が生じた場合の図である。この場合、硝化が行わなくなるために、アンモニアの濃度が高くなり、また硝化菌が活動しなくなるので、その活動に必要な酸素量が小さくなり、結果として溶存酸素濃度が高くなっている。このようなプロセス知識が予め用意されていれば、プロセス異常要因推定部９は、「アンモニア濃度が高く、かつ溶存酸素濃度が高い場合には硝化阻害の可能性がある」などのエキスパートルールに基づいて、その要因を推定する。 Specifically, as shown in FIG. 8, the process abnormality factor estimation unit 9 is a failure of a sensor when only one sensor (here, a nitric acid sensor) shows an outstanding value. Estimated. In addition, some sensors are configured so that a failure signal can be output. Therefore, the abnormality of the sensor is estimated by taking a logical sum or logical product with the failure signal obtained therefrom. . FIG. 9 is a diagram in a case where a phenomenon called nitrification inhibition occurs in which a microorganism called nitrifying bacteria that removes ammonia does not work. In this case, since nitrification is not performed, the concentration of ammonia becomes high, and nitrifying bacteria become inactive, so that the amount of oxygen necessary for the activity becomes small, and as a result, the dissolved oxygen concentration becomes high. If such process knowledge is prepared in advance, the process abnormality factor estimation unit 9 is based on an expert rule such as “there is a possibility of nitrification inhibition when the ammonia concentration is high and the dissolved oxygen concentration is high”. To estimate the factors.

ユーザインターフェース部１０は、プロセス監視部６、プロセス異常診断部７、プロセス異常要因センサ特定部８、プロセス異常要因推定部９からの少なくとも一つ以上の情報をオペレータに通知する。 The user interface unit 10 notifies the operator of at least one information from the process monitoring unit 6, the process abnormality diagnosis unit 7, the process abnormality factor sensor specifying unit 8, and the process abnormality factor estimation unit 9.

（第１の実施形態の効果）
まず、前述したように、従来のＰＣＡに基づいた異常診断方法における問題点を、図１３から図１６を参照して具体的に説明する。 (Effects of the first embodiment)
First, as described above, problems in the conventional abnormality diagnosis method based on PCA will be specifically described with reference to FIGS.

ＰＣＡに基づいた異常診断方法における誤り異常検出として、本当（実際）は異常であるのに、正常であると診断する誤りのＡタイプと、本当（実際）は正常であるのに、異常であると診断する誤りのＢタイプの２タイプがある。図１３は、正常データと異常データの分布の重なりと、誤りタイプ（Ａ，Ｂ）の関係を示す図である。図１４は、誤りタイプの特徴を概念的に示す図である。即ち、Ａタイプは、精度不十分である第２種誤り検出に相当する。また、Ｂタイプは、異常誤り検出である第１種誤り検出に相当する。 Error detection in an abnormality diagnosis method based on PCA is true (actual) is abnormal but type A of an error to diagnose as normal, and true (actual) is normal but abnormal There are two types of error B type. FIG. 13 is a diagram showing the relationship between the distribution of normal data and abnormal data and the error type (A, B). FIG. 14 is a diagram conceptually showing the characteristics of the error type. That is, the A type corresponds to type 2 error detection with insufficient accuracy. The B type corresponds to the first type error detection which is an abnormal error detection.

さらに、図１５は、異常診断アルゴリズムを適用した場合の流入アンモニア濃度の時系列データを示す図であり、Ｂタイプの第１種誤り検出例である。即ち、図１５に示す異常部分１９０は、実際には異常ではなく、異常診断アルゴリズムにより誤って異常であると判断された例である。また、図１６は、異常診断アルゴリズムを適用した場合に、異常データ２００を見落とたＡタイプの第２種誤り検出例である。即ち、実際のプロセスデータからは明らかに異常と判断できるデータ２００を正常であると判断する例であり、異常検出精度が十分でない場合である。 FIG. 15 is a diagram showing time-series data of the inflow ammonia concentration when the abnormality diagnosis algorithm is applied, and is a B type first type error detection example. That is, the abnormal portion 190 shown in FIG. 15 is an example that is not actually abnormal but is erroneously determined to be abnormal by the abnormality diagnosis algorithm. FIG. 16 is an example of A type second type error detection in which the abnormality data 200 is overlooked when the abnormality diagnosis algorithm is applied. That is, this is an example in which the data 200 that can be clearly determined to be abnormal from the actual process data is determined to be normal, and the abnormality detection accuracy is insufficient.

本実施形態のシステムであれば、多変量統計解析を用いた多変量統計的プロセス監視（ＭＳＰＣ）の異常検出の閾値を、プロセスデータに対する事前情報に応じて適切に決定することにより、前記のＡタイプ及びＢタイプの２タイプの異常検出の誤りを減少することが可能である。従って、ＭＳＰＣによるプロセス異常監視の性能を向上させることができる。 In the system according to the present embodiment, the threshold value for detecting anomalies in multivariate statistical process monitoring (MSPC) using multivariate statistical analysis is appropriately determined according to prior information for process data, and the above-described A It is possible to reduce two types of abnormality detection errors of type B and type B. Therefore, the process abnormality monitoring performance by MSPC can be improved.

また、ＭＳＰＣでプロセス監視モデルを構築する際に、複数のプロセスデータを正規化するパラメータをプロセスデータに対する事前情報に応じて適切に決定することにより、異常検出の誤りに加えて、異常要因を特定する際の要因変数の誤りを減少させることができる。従って、ＭＳＰＣによるプロセス異常監視の性能を向上させることができる。 In addition, when building a process monitoring model with MSPC, by appropriately determining parameters for normalizing multiple process data according to prior information on the process data, the cause of the abnormality is identified It is possible to reduce the error of the factor variable when doing. Therefore, the process abnormality monitoring performance by MSPC can be improved.

換言すれば、ＭＳＰＣによるプロセス監視（プロセス異常診断）において、異常検出の閾値をプロセスデータに対する事前知識の量と質に応じて適切に決定することにより、Ａタイプの誤りとＢタイプの誤りのトレードオフの適正化を図ることができ、異常検出性能を向上させることができる。 In other words, in process monitoring (process abnormality diagnosis) by MSPC, the threshold of abnormality detection is appropriately determined according to the amount and quality of prior knowledge of process data, thereby allowing trade between A type error and B type error. It is possible to optimize the off state and improve the abnormality detection performance.

［第２の実施形態］
第２の実施形態に関するシステムは、基本的な構成としては、図１に示すものと同様である。本実施形態のシステムは、前述の第１の実施形態のシステムと比較して、異常検出モデル生成部４２、閾値決定部５２、プロセス監視部６、プロセス異常診断部７、及びプロセス異常要因センサ特定部８のそれぞれの作用が異なる。以下、異なる部分の作用を中心として説明し、同様の作用については説明を省略する。 [Second Embodiment]
The system according to the second embodiment is the same as that shown in FIG. 1 as a basic configuration. Compared with the system of the first embodiment described above, the system of this embodiment is an abnormality detection model generation unit 42, a threshold determination unit 52, a process monitoring unit 6, a process abnormality diagnosis unit 7, and a process abnormality factor sensor identification. Each operation of the part 8 is different. Hereinafter, the description will be focused on the operation of different parts, and description of the same operation will be omitted.

まず、本実施形態のシステムにおいて、プロセス計測データ収集・保存部２、プロセスデータ抽出部３、モデル構築部４におけるプロセスデータ正規化モデル生成部４１の各作用は、前述の第１の実施形態のシステムの場合と同様である。 First, in the system of the present embodiment, the processes of the process measurement data collection / storage unit 2, the process data extraction unit 3, and the process data normalization model generation unit 41 in the model construction unit 4 are the same as those in the first embodiment. It is the same as the case of the system.

異常検出モデル生成部４２は、プロセスデータ正規化モデル生成部４１によりプロセス監視モデル構築用のプロセスデータを正規化するパラメータが決定されると、当該パラメータを用いてプロセス監視モデル構築用のプロセスデータを正規化する。そして、異常検出モデル生成部４２は、当該正規化データを利用して、異常検出用モデルを構築する。 When the parameter for normalizing process data for constructing the process monitoring model is determined by the process data normalization model generating unit 41, the abnormality detection model generating unit 42 uses the parameter to process the process data for constructing the process monitoring model. Normalize. Then, the abnormality detection model generation unit 42 constructs an abnormality detection model using the normalized data.

具体的には、異常検出モデル生成部４２は、当該正規化データを横（行）方向に各種センサ１１〜１Ｎに対応する変量を設定し、縦（列）方向に時間を考えた行列Ｘに設定する。次に、前述と同様に、ＰＣＡを用いて、前記式（５），（６）に示す様に、Ｑ統計量とホテリングのＴ^２統計量を求める。さらに、本実施形態では、異常検出モデル生成部４２は、Ｑ統計量とホテリングのＴ^２統計量に対する各種センサ１１〜１Ｎの寄与量を下記式（９），（１０），（１１）により計算する。

Specifically, the abnormality detection model generation unit 42 sets variables corresponding to the various sensors 11 to 1N in the horizontal (row) direction and sets the normalized data to a matrix X that considers time in the vertical (column) direction. Set. Next, in the same manner as described above, the Q statistic and the hotelling T ² statistic are obtained by using PCA as shown in the equations (5) and (6). Furthermore, in this embodiment, the abnormality detection model generation unit 42 calculates the contribution amount of the various sensors 11 to 1N to the Q statistic and the hotelling T ² statistic by the following equations (9), (10), and (11). To do.

ここで、ｘ(t,n)は、変数ｘのｎ番目の要素の時刻ｔにおける値という意味であり、ｎは各種センサ１１〜１Ｎに対応する。また、Ｆ(:,n)の表記は、行列Ｆのｎ行目を取り出すことを意味する。Ｐ(:,n)についても同様である。 Here, x (t, n) means the value at the time t of the nth element of the variable x, and n corresponds to the various sensors 11 to 1N. The notation F (:, n) means taking out the nth row of the matrix F. The same applies to P (:, n).

この寄与量を計算する計算式（モデル）が、本実施形態でのプロセス監視モデルに相当する。通常のＭＳＰＣでは、前記式（１０），（１１）に示すような寄与量を計算する式は、通常では、次の様に用いられる。まず、前記式（５），（６）で表されるＱ統計量やホテリングのＴ^２統計量により異常の検出を実行し、仮に異常が生じていた場合には、前記式（１０），（１１）に示すような寄与量計算により、どのセンサで計測しているプロセスデータがその異常に寄与しているかを判断する。 A calculation formula (model) for calculating the contribution amount corresponds to the process monitoring model in the present embodiment. In a normal MSPC, the equations for calculating the contribution amount as shown in the equations (10) and (11) are usually used as follows. First, the equation (5), and executes fault detection by T ² statistic Q statistic and Hotelling represented by (6), if the if abnormality has occurred, the equation (10), ( It is determined by which sensor process data measured by the contribution amount calculation as shown in 11) contributes to the abnormality.

これに対して、本実施形態は、この寄与量自身に対して閾値を設定して異常検出を行なう方式である。以下、図１０を参照して説明する。 On the other hand, the present embodiment is a method for detecting an abnormality by setting a threshold value for the contribution amount itself. Hereinafter, a description will be given with reference to FIG.

図１０は、２変数の場合においてＱ統計量に対する寄与量の概念を示す図である。図１０において、符号７０はホテリングのＴ^２統計量を示し、符号７１はＱ統計量を示す。Ｑ統計量は、前述したように、データが分布しない方向への乖離を表す指標である。Ｑ統計量自身は、図１０に示すように、Ｘ１成分による寄与量７２とＸ２成分による寄与量７３とに分解できる。通常のＭＳＰＣでは、前記式（５）のＱ統計量で異常を検出した場合に、前記式（１０）で各成分に分解して異常要因を推定しているが、寄与量自身は異常を検出するかしないかに関わらず、前記式（１０）で表される寄与量は常に計算できることがわかる。従って、常に前記式（１０）で寄与量を計算しておけば、それに対して直接閾値を設定して異常検出を行っても良いことが容易にわかる。 FIG. 10 is a diagram showing the concept of the contribution amount to the Q statistic in the case of two variables. In FIG. 10, reference numeral 70 indicates a Hotelling T ² statistic, and reference numeral 71 indicates a Q statistic. As described above, the Q statistic is an index representing a deviation in a direction in which data is not distributed. As shown in FIG. 10, the Q statistic itself can be decomposed into a contribution amount 72 due to the X1 component and a contribution amount 73 due to the X2 component. In normal MSPC, when an abnormality is detected by the Q statistic of the equation (5), the anomaly factor is estimated by decomposing into each component by the equation (10), but the contribution amount itself detects the abnormality. It can be seen that the contribution amount represented by the equation (10) can always be calculated regardless of whether or not it is performed. Therefore, if the contribution amount is always calculated by the equation (10), it can be easily understood that the abnormality detection may be performed by setting the threshold value directly.

さらに、本実施形態の方式であれば、Ｑ統計量の寄与量を全ての変数に対して総和をとったものがＱ統計量になるので、総和をとったＱ統計量よりもそれを構成する要素で異常検出を行った方が、より精度の高い異常検出が行なうことができる。但し、この場合には、Ｑ統計量やホテリングのＴ^２統計量ではなく、その寄与量で診断を行うことになるため、寄与量に対して適切な閾値を設定しないと、却って異常診断性能が劣化することになる場合がある。本実施形態の方式は、このような場合にも適切な閾値を設定することを狙ったものである。 Furthermore, in the method of the present embodiment, the sum of the contribution amount of the Q statistic for all the variables becomes the Q statistic, so that it is configured rather than the summed Q statistic. More accurate abnormality detection can be performed by performing abnormality detection on elements. However, in this case, diagnosis is performed using the contribution amount, not the Q statistic or the hotelling T ² statistic. Therefore, unless an appropriate threshold is set for the contribution amount, the abnormality diagnosis performance is It may deteriorate. The method of the present embodiment aims to set an appropriate threshold value even in such a case.

以上要するに、異常検出モデル生成部４２は、プロセスデータ正規化モデル生成部４１により正規化用のシフトパラメータとスケーリングパラメータの値が決定されると、異常検出を行う際に必要となる計算モデル（Ｐ，∧）を決定する。さらに、これらの値を用いた正規化の計算式、Ｑ統計量、およびホテリングのＴ^２統計量の計算式を生成する。 In short, when the process data normalization model generation unit 41 determines the normalization shift parameter and scaling parameter values, the abnormality detection model generation unit 42 calculates a calculation model (P , ∧) is determined. Further, a normalization calculation formula, a Q statistic, and a hotelling T ² statistic calculation formula using these values are generated.

次に、異常判定基準設定部５は、異常検出モデルである前記式（１０），（１１）を使用して異常判定を行うための閾値を決定する。この際、プロセス監視モデル構築用データである正規化データＸを用いて、前記式（１０），（１１）により計算したプロセス監視モデル構築用データのＱ統計量およびホテリングのＴ^２統計量の寄与量を利用する。 Next, the abnormality determination criterion setting unit 5 determines a threshold for performing abnormality determination using the equations (10) and (11) that are abnormality detection models. At this time, the contribution of the Q statistic of the process monitoring model construction data and the hotelling T ² statistic calculated by the above formulas (10) and (11) using the normalized data X that is the process monitoring model construction data Use quantity.

まず、閾値決定用事前情報入力部５１は、Ｑ統計量およびホテリングのＴ^２統計量の寄与量に関する事前情報を入力する。なお、この寄与量に関する事前情報を取得できない場合には、プロセスデータに関する事前情報や、Ｑ統計量やホテリングのＴ^２統計量に対する事前情報で代替させてもよい。 First, the threshold determination prior information input unit 51 inputs prior information regarding the contribution amount of the Q statistic and the hotelling T ² statistic. Incidentally, if it can not obtain a priori information about the contribution amount, and prior information about the process data, it may be replaced by a priori information for the T ² statistic Q statistic and Hotelling.

まず、事前情報が何も無い場合（Ｑ統計量やホテリングのＴ^２統計量に関する情報は不明）には、閾値決定部５２は、デフォルトの設定法として、Ｑ統計量の統計的信頼限界値とホテリングのＴ^２統計量に関する統計的信頼限界値（公知の計算式により算出）を用いる。この場合、Ｑ統計量とホテリングのＴ^２統計量の閾値を計算する。通常のＭＳＰＣによる異常診断では、この閾値を越えた場合に異常を検出し、要因特定を行う場合にのみ寄与量を計算する。例えば、（i）最大の寄与量を持つものを要因候補としたり、(ii)寄与量の値が上位3つのものを要因候補としたり、(iii)寄与量の値が閾値に対してある割合を超えるものを異常要因候補とする。 First, when there is no prior information (information regarding the Q statistic and the Hotelling T ² statistic is unknown), the threshold determination unit 52 uses the statistical confidence limit value of the Q statistic as a default setting method. statistical confidence limits about the T ² statistic Hotelling a (calculated by a known calculation formula) is used. In this case, the threshold values of the Q statistic and the Hotelling T ² statistic are calculated. In normal abnormality diagnosis by MSPC, an abnormality is detected when this threshold is exceeded, and the contribution amount is calculated only when the factor is specified. For example, (i) the one with the largest contribution amount is used as a factor candidate, (ii) the top three contribution amount values are used as factor candidates, or (iii) the contribution amount value is a certain percentage of the threshold value. Those that exceed are considered abnormal factor candidates.

ここでは、通常実施される方法(iii)の考え方に従って、Ｑ統計量とホテリングのＴ^２統計量の閾値の例えば８０％を閾値として決定する。通常の異常診断では、ある変数の寄与量が閾値の８０％を超えていたとしても、寄与量の総和であるＱ統計量あるいはホテリングのＴ^２統計量自身が閾値を超えていなければ異常と判断されないことである。一方、寄与量に基づいて異常診断を実行する場合、Ｑ統計量あるいはホテリングのＴ^２統計量自身が閾値を超えていなくても、当該統計量に対する寄与量が閾値のある割合を超えていれば異常と診断される。このため、各種センサ１１〜１Ｎの計測する変数で生じている異常に対する感度を高くすることができるメリットが得られる。 Here, for example, 80% of the threshold values of the Q statistic and the Hotelling T ² statistic are determined as the threshold according to the concept of the method (iii) that is normally performed. In normal abnormality diagnosis, even the contribution of certain variables has exceeded 80% of the threshold, abnormality determination unless T ² statistic own Q statistic or Hotelling is the sum of the contribution amount exceeds the threshold value Is not. On the other hand, when performing an abnormality diagnosis based on the contribution amount, even if the Q statistic or the hoteling T ² statistic itself does not exceed the threshold, the contribution to the statistic exceeds a certain ratio of the threshold. Diagnosed as abnormal. For this reason, the merit which can raise the sensitivity with respect to the abnormality which has arisen with the variable which various sensors 11-1N measure is acquired.

但し、通常のＭＳＰＣの異常診断では、各変数が閾値の８０％などのある割合を超えなくても、総和である統計量が閾値を超えていれば異常と判断される。このため、ここで記載している方法は、各変数が少しずつ正常からずれていているというような状況に対する異常検出精度は却って劣化する場合がある。このような状況に対処するためには、両方の手法を併用すればよい。 However, in normal MSPC abnormality diagnosis, even if each variable does not exceed a certain ratio such as 80% of the threshold value, it is determined as abnormal if the total statistic exceeds the threshold value. For this reason, the method described here may deteriorate the abnormality detection accuracy for a situation where each variable is slightly deviated from normal. In order to cope with such a situation, both methods may be used together.

次に、閾値決定用事前情報入力部５１から、事前情報として、殆ど全てのデータが正常であるという情報が入力された場合には、閾値決定部５２は、原理的に、プロセス監視モデル構築用データの正規化データＸを用いて計算したＱ統計量の寄与量や、ホテリングのＴ^２統計量の寄与量の最大値を閾値として決定する。但し、ごく僅かに異常データが含まれている可能性があるので、例えば９９％信頼区間の考え方を採用して、Ｑ統計量の寄与量およびホテリングのＴ^２統計量の寄与量の最大値から上位１％の値を閾値として決定する方法の方が好ましい。この場合、各種センサ１１〜１Ｎの各寄与量毎に最大値が求まるが、事前情報として殆ど全てのデータが正常であるということが予め知られているため、寄与量毎に決定した最大値の中で最も大きい値を閾値として採用する。 Next, when information indicating that almost all data is normal is input as prior information from the threshold determination prior information input unit 51, the threshold determination unit 52 is in principle configured for process monitoring model construction. contribution amount and using the normalized data X of the calculated Q statistic data, determines the maximum value of T ² statistic contributions of Hotelling as a threshold value. However, there is a possibility that contains very little abnormal data, for example by adopting the idea of a 99% confidence interval, the maximum value of Q statistic contribution amount and Hotelling T ² statistic contribution amount The method of determining the upper 1% value as a threshold is preferable. In this case, the maximum value is obtained for each contribution amount of the various sensors 11 to 1N. However, since it is known in advance that almost all data is normal as prior information, the maximum value determined for each contribution amount is determined. The largest value is adopted as the threshold value.

さらに、事前情報として、異常データと正常データが混在しているが異常データの割合は不明であるという情報が入力された場合には、閾値決定部５２は、例えば、修正大津の閾値決定法などを用いて、Ｑ統計量の寄与量およびホテリングのＴ^２統計量の寄与量の閾値を決定する。但し、異常データと正常データが混合していることが事前情報として入力されている場合でも、各種センサ１１〜１Ｎの全ての計測変数に対して異常データと正常データが混合しているか否かは判断できない。このため、異常データが混合していない計測変数があった場合、その閾値を修正大津の閾値決定法などのクラスタリング法で決定すると、正常データを２分割してしまうことになり、適切な閾値が決定されない。しかし、各種センサ１１〜１Ｎのいずれかの計測変数には、異常データが含まれているはずなので、各計測変数毎に修正大津の閾値決定法などの方法で決定した閾値の中のいずれかの閾値は、正常データと異常データを必ず分離しているはずである。各計測変数毎に決定した閾値の中の最大値は、必ず正常データと異常データを分離している。従って、各計測変数毎に修正大津の閾値決定法などの方法で決定した閾値の中の最大のものを閾値として採用する。 Furthermore, when information indicating that abnormal data and normal data are mixed but the ratio of abnormal data is unknown is input as prior information, the threshold value determination unit 52, for example, a threshold value determination method for the modified Otsu, etc. Is used to determine the threshold for the contribution of the Q statistic and the contribution of the Hotelling T ² statistic. However, whether or not abnormal data and normal data are mixed for all measurement variables of the various sensors 11 to 1N, even if it is input as prior information that the abnormal data and normal data are mixed. I can't judge. For this reason, if there is a measurement variable in which abnormal data is not mixed, if the threshold value is determined by a clustering method such as the modified Otsu threshold value determination method, normal data will be divided into two, and an appropriate threshold value will be obtained. Not determined. However, since any of the measurement variables of the various sensors 11 to 1N should include abnormal data, any one of the threshold values determined by a method such as the threshold determination method of the modified Otsu for each measurement variable is used. The threshold value must always separate normal data and abnormal data. The maximum value among the threshold values determined for each measurement variable always separates normal data and abnormal data. Accordingly, the maximum threshold value determined by a method such as the modified Otsu threshold value determination method for each measurement variable is adopted as the threshold value.

また、事前情報として、異常データと正常データが混在しており、異常データの大よその割合は分かっているという情報が入力された場合には、閾値決定部５２は、正規化データＸを用いて計算したＱ統計量の寄与量やホテリングのＴ^２統計量の寄与量のデータの上位α％（αは事前情報として入力した値）に該当する各値を、予め計算する。仮に、異常データと正常データの割合が全ての計測変数に対してわかっているのであれば、各計測変数毎に計算した閾値を適用してもよい。全ての計測変数に対してではなく、いずれかの計測変数にはおおよそα％の異常が含まれているというような事前情報が得られている場合には、計測変数毎に決定した閾値の中の最大値を閾値として採用する。 Further, when information indicating that abnormal data and normal data are mixed and the approximate ratio of abnormal data is known is input as prior information, the threshold value determination unit 52 uses the normalized data X. Each value corresponding to the upper α% (α is a value input as prior information) of the data of the contribution amount of the Q statistic and the contribution amount of the Hotelling T ² statistic calculated in advance is calculated in advance. If the ratio of abnormal data to normal data is known for all measurement variables, a threshold value calculated for each measurement variable may be applied. If prior information has been obtained that not all measurement variables, but any measurement variable contains approximately α% of abnormalities, the threshold value determined for each measurement variable Is used as the threshold value.

一方、以下で示す作用は、オンラインで実行される。まず、プロセス監視部６は、モデル構築部４で構築したプロセス監視モデルを使用して、プロセス１の監視を実行する。プロセス監視部６は、監視するプロセス変数として、各種センサ１１〜１Ｎから新たに生成されたＱ統計量の寄与量やホテリングのＴ^２統計量の寄与量を直接監視する。即ち、プロセス監視部６は、下記式（１２），（１３），（１４）を使用して、Ｑ統計量の寄与量およびホテリングのＴ^２統計量の寄与量を監視する。

On the other hand, the following actions are executed online. First, the process monitoring unit 6 monitors the process 1 using the process monitoring model constructed by the model construction unit 4. Process monitoring unit 6, a process variable to be monitored, to monitor the contribution of the contribution amount and Hotelling T ² statistic Q statistic newly generated from the various sensors 11~1N directly. That is, the process monitoring unit 6, the following equation (12), (13), (14) is used to monitor the contribution of the contribution amount and Hotelling ^{T 2} statistic Q statistic.

即ち、プロセスの監視対象がＱ統計量やホテリングのＴ^２統計量ではなく、それらの寄与量である。 That is, the process monitoring target is not a Q statistic or a Hotelling T ² statistic, but a contribution amount thereof.

次に、プロセス異常診断部７は、Ｑ統計量やホテリングのＴ^２統計量のいずれかの寄与量が閾値を超えた場合に異常と判断する。プロセス異常要因センサ特定部８は、プロセス異常診断部７により寄与量に基づいて異常診断がなされているため、閾値を超えた寄与量に対応する各種センサ１１〜１Ｎの各計測変数が要因センサであると直接的に特定する。プロセス異常要因推定部９及びユーザインターフェース部１０の動作は、前述の第１の実施形態と全く同様である。 Next, the process abnormality diagnosis unit 7 determines that there is an abnormality when the contribution amount of either the Q statistic or the Hotelling T ² statistic exceeds a threshold value. The process abnormality factor sensor specifying unit 8 is diagnosed on the basis of the contribution amount by the process abnormality diagnosis unit 7, so that each measurement variable of the various sensors 11 to 1N corresponding to the contribution amount exceeding the threshold is a factor sensor. Directly identify that there is. The operations of the process abnormality factor estimation unit 9 and the user interface unit 10 are exactly the same as those in the first embodiment.

以上のように第２の実施形態によれば、通常では、第１に異常の検出、第２に異常要因の特定という手順をとる方法を変更し、異常要因を特定するために計算する量（寄与量）を直接異常検出に用いる方法を採用している。この場合、その異常検出のための閾値を、プロセスデータに対する事前情報に応じて適切に決定することにより、異常検出の誤りを減少できる。これにより、ＭＳＰＣによるプロセス異常監視の性能を向上させる事が可能となる。換言すれば、特にある特定のプロセス計測変数において生じている異常検出性能を向上させることができる。 As described above, according to the second embodiment, usually, the method of firstly detecting an abnormality and secondly specifying the abnormality factor is changed, and the amount calculated to specify the abnormality factor ( (Contribution amount) is directly used for abnormality detection. In this case, errors in abnormality detection can be reduced by appropriately determining the threshold value for detecting the abnormality according to prior information on the process data. As a result, the performance of process abnormality monitoring by MSPC can be improved. In other words, it is possible to improve the abnormality detection performance that occurs particularly in a specific process measurement variable.

［第３の実施形態］
第３の実施形態に関するシステムは、基本的な構成としては、図１に示すものと同様である。本実施形態のシステムは、前述の第１の実施形態のシステムと比較して、異常検出モデル生成部４２、閾値決定部５２、プロセス監視部６、プロセス異常診断部７、及びプロセス異常要因センサ特定部８のそれぞれの作用が異なる。以下、異なる部分の作用を中心として説明し、動揺の作用については説明を省略する。 [Third Embodiment]
The system according to the third embodiment is the same as that shown in FIG. 1 as a basic configuration. Compared with the system of the first embodiment described above, the system of this embodiment is an abnormality detection model generation unit 42, a threshold determination unit 52, a process monitoring unit 6, a process abnormality diagnosis unit 7, and a process abnormality factor sensor identification. Each operation of the part 8 is different. Hereinafter, the description will focus on the action of different parts, and the description of the action of shaking will be omitted.

具体的には、異常検出モデル生成部４２は、当該正規化データを横（行）方向に各種センサ１１〜１Ｎに対応する変量を設定し、縦（列）方向に時間を考えた行列Ｘに設定する。次に、前述と同様に、ＰＣＡを用いて、前記式（５），（６）に示す様に、Ｑ統計量とホテリングのＴ^２統計量を求める。これでプロセス監視モデル構築が完了したが，本実施形態では、異常検出モデル生成部４２は、前記式（５），（６）に示すＱ統計量とホテリングのＴ^２統計量を、各々離散ウェーブレット変換を使って複数の周波数帯域毎のデータに分解する。これは、離散ウェーブレット変換によるＱ統計量とホテリングのＴ^２統計量の分解と、分解された各々のデータを逆離散ウェーブレット変換で再構成することによって実現できる。この操作は、結果的にデジタルフィルタの形式で実現できるので、そのフィルタを「Ｗi，ｉ＝１，---，ｐ（ｐはデータ分割数）」とすると、ウェーブレットによって分解された各統計量は，例えば，下記式（１５），（１６）のように表現できる。

Specifically, the abnormality detection model generation unit 42 sets variables corresponding to the various sensors 11 to 1N in the horizontal (row) direction and sets the normalized data to a matrix X that considers time in the vertical (column) direction. Set. Next, in the same manner as described above, the Q statistic and the hotelling T ² statistic are obtained by using PCA as shown in the equations (5) and (6). Thus, the process monitoring model construction is completed, but in this embodiment, the abnormality detection model generation unit 42 uses the Q statistic and the Hotelling T ² statistic shown in the equations (5) and (6) as discrete wavelets, respectively. Decompose data into multiple frequency bands using transformation. This can be achieved by reconstituting the decomposition of T ² statistic Q statistic and Hotelling by discrete wavelet transform, the decomposition respectively of data to the inverse discrete wavelet transform. As a result, this operation can be realized in the form of a digital filter. Therefore, when the filter is “Wi, i = 1, ---, p (p is the number of data divisions)”, each statistic decomposed by the wavelet Can be expressed as the following equations (15) and (16), for example.

この周波数帯域毎に分解された計算式（モデル）が、本実施形態でのプロセス監視モデルに相当する。図１１は、Ｑ統計量を離散ウェーブレット変換により複数の周波数帯域毎のデータに分解した具体例を示す図である。同図（Ａ）は、Ｑ統計量のグラフである。同図（Ｂ）〜（Ｆ）はそれぞれ、５つの周波数帯域に分割されたＱ統計量を示す図である。ここで、図１１（Ｄ）は、元のＱ統計量を近似する図であり、最も遅い変化を表している。それ以外の同図（Ｂ），（Ｃ），（Ｅ），（Ｆ）はそれぞれ、平均がほぼゼロになるより速い周波数帯域のＱ統計量の分解データを示す図である。図１１において、横軸は速い周波数帯域から遅い周波数帯域の順に付けられている。 The calculation formula (model) decomposed for each frequency band corresponds to the process monitoring model in the present embodiment. FIG. 11 is a diagram illustrating a specific example in which the Q statistic is decomposed into data for each of a plurality of frequency bands by the discrete wavelet transform. FIG. 4A is a graph of Q statistics. FIGS. 7B to 7F are diagrams showing Q statistics divided into five frequency bands. Here, FIG. 11D is a diagram that approximates the original Q statistic, and represents the slowest change. The other figures (B), (C), (E), and (F) are diagrams showing the decomposition data of the Q statistics in the faster frequency band where the average becomes almost zero, respectively. In FIG. 11, the horizontal axis is assigned in order from the fast frequency band to the slow frequency band.

このように周波数帯域毎にデータを分解する意義は、以下の通りである。 The significance of decomposing data for each frequency band as described above is as follows.

図１２は、Ｑ統計量やホテリングのＴ^２統計量ではないが、ある元データを、離散ウェーブレット変換を適用して分解した具体例を示す図である。図１２の矢印の左側の元データには、データの中に高周波の異常が含まれていることが確認できる。この元データを、ウェーブレット変換で分解すると、矢印の右側に示すように、高周波部分の異常として現れる。 FIG. 12 is a diagram illustrating a specific example in which certain original data is decomposed by applying a discrete wavelet transform, although it is not a Q statistic or a Hotelling T ² statistic. In the original data on the left side of the arrow in FIG. 12, it can be confirmed that the data includes a high-frequency abnormality. When this original data is decomposed by wavelet transform, it appears as an abnormality in the high frequency part as shown on the right side of the arrow.

一般的に、ウェーブレット変換を適用するか否かに関わらず、高周波の異常を人間が目視で確認することは比較的容易であるが、図１２の矢印の左側に示すような元データで、異常を機械的に（自動的に）検出する場合、単に閾値を設定するだけでは、異常を検出することは不可能である。一方、図１２の矢印の右側に示すような分解されたデータでは、単純に閾値を設定するだけで異常を検出することができる。このように、ウェーブレット変換は、通常行われる閾値による異常検出という単純な操作を継承したままで、より高度に異常を検出できる機能を有する。 In general, it is relatively easy for a human to visually confirm a high-frequency abnormality regardless of whether or not wavelet transform is applied, but the original data as shown on the left side of the arrow in FIG. Is detected mechanically (automatically), it is impossible to detect an abnormality simply by setting a threshold value. On the other hand, in the decomposed data as shown on the right side of the arrow in FIG. 12, an abnormality can be detected by simply setting a threshold value. As described above, the wavelet transform has a function of detecting an abnormality at a higher level while inheriting a simple operation of detecting an abnormality based on a normal threshold.

本実施形態は、ウェーブレット変換を適用して、Ｑ統計量やホテリングのＴ^２統計量をデータ分解して異常検出を実行するシステムである。この場合、本実施形態では、適切な閾値を設定することにより、異常検出の精度を向上させる構成である。 This embodiment applies a wavelet transform, a system for performing abnormality detection by fragmenting the T ² statistic Q statistic and Hotelling. In this case, the present embodiment is configured to improve the accuracy of abnormality detection by setting an appropriate threshold value.

以上要するに、異常検出モデル生成部４２は、プロセスデータ正規化モデル生成部４１により正規化用のシフトパラメータとスケーリングパラメータの値が決定されると、ウェーブレット変換をＱ統計量やホテリングのＴ^２統計量に適用することにより、異常検出を行う際に必要となる計算モデル（Ｐ，∧，Ｗi）を決定する。さらに、これらの値を用いた正規化の計算式、周波数帯域毎に分解されたＱ統計量、および周波数帯域毎に分解されたホテリングのＴ^２統計量の計算式を生成する。 In short, when the process data normalization model generation unit 41 determines the normalization shift parameter and scaling parameter value, the abnormality detection model generation unit 42 converts the wavelet transform into a Q statistic and a hotelling T ² statistic. By applying to the above, the calculation model (P, ∧, Wi) necessary for detecting the abnormality is determined. Further, a calculation formula for normalization using these values, a Q statistic decomposed for each frequency band, and a calculation formula for Hotelling's T ² statistic decomposed for each frequency band are generated.

次に、異常判定基準設定部５は、異常検出モデルである前記式（１５），（１６）を使用して異常判定を行うための閾値を決定する。この際、プロセス監視モデル構築用データである正規化データＸを用いて、前記式（１５），（１６）により計算したプロセス監視モデル構築用データの周波数帯域毎に分解されたＱ統計量、および周波数帯域毎に分解されたホテリングのＴ^２統計量を利用する。 Next, the abnormality determination criterion setting unit 5 determines a threshold value for performing abnormality determination using the equations (15) and (16), which are abnormality detection models. At this time, using the normalized data X which is the process monitoring model construction data, the Q statistic decomposed for each frequency band of the process monitoring model construction data calculated by the above formulas (15) and (16), and Use Hotelling's T ² statistics broken down by frequency band.

まず、閾値決定用事前情報入力部５１は、周波数帯域毎に分解されたＱ統計量およびホテリングのＴ^２統計量に関する事前情報を入力する。なお、この周波数帯域毎に分解された事前情報を取得できない場合には、プロセスデータに関する事前情報や、Ｑ統計量やホテリングのＴ^２統計量に対する事前情報で代替させてもよい。 First, the threshold value determination prior information input unit 51 inputs a priori information about the Q statistics and T ² statistic Hotelling decomposed for each frequency band. Incidentally, if it can not obtain a priori information decomposed per this frequency band, and prior information about the process data, it may be replaced by a priori information for the T ² statistic Q statistic and Hotelling.

まず、事前情報が何も無い場合（Ｑ統計量やホテリングのＴ^２統計量に関する情報は不明）には、閾値決定部５２は、デフォルトの設定法として、以下の方法を採用する。 First, if prior information is nothing (information about T ² statistic Q statistic and Hotelling unknown), the threshold value determining unit 52, the default setting process, to employ the following method.

図１１に示すように、周波数帯域毎に分解されたＱ統計量やホテリングのＴ^２統計量の中の最も遅い周波数のものは、元のＱ統計量やホテリングのＴ^２統計量の近似になっている。従って、Ｑ統計量の統計的信頼限界値とホテリングのＴ^２統計量に関する統計的信頼限界値、即ちＱ統計量とホテリングのＴ^２統計量の閾値を、そのまま最も遅い周波数帯域に分解されたＱ統計量とホテリングのＴ^２統計量の閾値として利用する。 As shown in FIG. 11, those of the slowest frequency in T ² statistic Q statistic and Hotelling decomposed for each frequency band, turned approximation of T ² statistic original Q statistic and Hotelling ing. Thus, Q statistic statistical confidence limits and statistical confidence limits about the T ² statistic Hotelling, the threshold i.e. Q statistic and Hotelling T ² statistic was degraded as it is the slowest frequency band Q It is used as a threshold for statistics and hotelling T ² statistics.

一方、最も遅い周波数帯域以外の周波数帯域毎に分解されたＱ統計量やホテリングのＴ^２統計量は、図１１に示すように、ゼロ平均の周りに正負方向にばらついた分布を示す。このため、事前情報が何も無いので、通常の統計的プロセス監視で行われる方法に従って、周波数帯域毎に分解されたＱ統計量とホテリングのＴ^２統計量の標本標準偏差（σrob）あるいはロバスト標本標準偏差（σrob）を利用して、その値の３倍である３σあるいは３σrobを閾値として設定する。 On the other hand, T ² statistic slowest decomposed into each frequency band other than the Q statistic and Hotelling, as shown in FIG. 11 shows the variation distribution in the positive and negative directions around the zero mean. For this reason, since there is no prior information, the sample standard deviation (σrob) or robust sample of the Q statistic decomposed for each frequency band and the Hotelling T ² statistic according to the method used in normal statistical process monitoring Using the standard deviation (σrob), 3σ or 3σrob, which is three times the value, is set as a threshold value.

次に、閾値決定用事前情報入力部５１から、事前情報として、殆ど全てのデータが正常であるという情報が入力された場合には、閾値決定部５２は、原理的に、プロセス監視モデル構築用データの正規化データＸを用いて計算した周波数帯域毎に分解されたＱ統計量や、周波数帯域毎に分解されたホテリングのＴ^２統計量の最大値を閾値として決定する。但し、ごく僅かに異常データが含まれている可能性があるので、例えば９９％信頼区間の考え方を採用して、周波数帯域毎に分解されたＱ統計量および周波数帯域毎に分解されたホテリングのＴ^２統計量の最大値から上位１％の値を閾値として決定する方法の方が好ましい。この場合、周波数帯域毎に最大値が求まるが、それぞれの周波数帯域は異なっているので、それぞれの値を周波数帯域毎の閾値として決定する。 Next, when information indicating that almost all data is normal is input as prior information from the threshold determination prior information input unit 51, the threshold determination unit 52 is in principle configured for process monitoring model construction. The Q statistic decomposed for each frequency band calculated using the data normalization data X and the maximum value of the Hotelling T ² statistic decomposed for each frequency band are determined as threshold values. However, since there is a possibility that anomalous data is included in a very small amount, the Q statistic decomposed for each frequency band and the hotelling decomposed for each frequency band, for example, using the concept of 99% confidence interval The method of determining the upper 1% value as the threshold value from the maximum value of the T ² statistic is more preferable. In this case, the maximum value is obtained for each frequency band, but since each frequency band is different, each value is determined as a threshold value for each frequency band.

ここで、前述の第２の実施形態での寄与量による異常検出の場合にも、複数の閾値が計算されたが、寄与量は、Ｑ統計量やホテリングのＴ^２統計量に対する各変数の寄与を意味しているので、同じ基準（閾値）で評価されることが普通である。しかし、本実施形態は、Ｑ統計量やホテリングのＴ^２統計量の異なる周波数帯域の成分によって異常検出を行っているので、本質的には異なる閾値を設定すべきである。 Here, also in the case of abnormality detection by the contribution amount in the second embodiment described above, a plurality of threshold values are calculated, but the contribution amount is the contribution of each variable to the Q statistic and the Hotelling T ² statistic. Therefore, it is common to evaluate with the same standard (threshold value). However, the present embodiment, since the performing abnormality detection by components of different frequency band T ² statistic Q statistic and Hotelling, essentially should set different thresholds.

さらに、事前情報として、異常データと正常データが混在しているが異常データの割合は不明であるという情報が入力された場合には、閾値決定部５２は、例えば、修正大津の閾値決定法などを用いて、周波数帯域毎に分解されたＱ統計量および周波数帯域毎に分解されたホテリングのＴ^２統計量の閾値を決定する。 Furthermore, when information indicating that abnormal data and normal data are mixed but the ratio of abnormal data is unknown is input as prior information, the threshold value determination unit 52, for example, a threshold value determination method for the modified Otsu, etc. Is used to determine the threshold of the Q statistic decomposed for each frequency band and the hotelling T ² statistic decomposed for each frequency band.

但し、異常データと正常データが混合していることが事前情報として入力されている場合でも、周波数帯域毎に分解された統計量に対して異常が混入しているとは限らない。このため、異常データが混入していない周波数帯域があった場合、その閾値を修正大津の閾値決定法などのクラスタリング法で決定すると、正常データを２分割してしまうことになり、適切な閾値が決定されない。しかし、寄与量による異常診断とは異なり、周波数帯域毎に分解されたＱ統計量やホテリングのＴ^２統計量による異常診断では、代表的な閾値を取り出すことができず、各々の計測変数毎に閾値を設定しなければならない。これに対処するためには、修正大津の閾値決定法などのクラスタリング法で決定した閾値と、（a）Ｑ統計量とホテリングのＴ^２統計量に関する情報は不明であるの場合の閾値決定方法で決定した閾値との値を比較し、クラスタリング法で決定した閾値が著しく小さい場合には、（a）の場合の閾値を採用するという方法を採ることができる。例えば、（a）の場合の閾値の５０％を下回る閾値がクラスタリング手法で得られた場合には、(a)の場合の閾値を採用するというルールで対処する。 However, even if it is input as prior information that the abnormal data and the normal data are mixed, the abnormality is not always mixed with the statistical amount decomposed for each frequency band. For this reason, when there is a frequency band in which abnormal data is not mixed, if the threshold is determined by a clustering method such as the modified Otsu's threshold determination method, normal data will be divided into two, and an appropriate threshold is set. Not determined. However, unlike the abnormality diagnosis by the contribution amount, the abnormality diagnosis by T ² statistic of degraded Q statistic and Hotelling for each frequency band, it can not be taken out typical threshold for each respective measurement variable A threshold must be set. To address this, a threshold determined by the clustering method such as modification Otsu threshold determination method, in (a) Q statistic and Hotelling T ² statistic related information in the case of unclear threshold determination method When the threshold value determined by the clustering method is extremely small by comparing the value with the determined threshold value, the method of adopting the threshold value in the case (a) can be adopted. For example, when a threshold value that is less than 50% of the threshold value in the case of (a) is obtained by the clustering method, the rule of adopting the threshold value in the case of (a) is dealt with.

また、事前情報として、異常データと正常データが混在しており、異常データの大よその割合は分かっているという情報が入力された場合には、閾値決定部５２は、正規化データＸを用いて計算したＱ統計量やホテリングのＴ^２統計量を周波数帯域毎に分解したデータの上位α％（αは事前情報として入力した値）に該当する周波数帯域毎に分解されたＱ統計量とホテリングのＴ^２統計量の各値を、予め計算する。仮に、全ての周波数帯域に対しては、異常データの大よその割合がわからず、一部の周波数帯域に対してのみわかっている場合には、わかっていない周波数帯域の閾値は、前述(a)の方法で決定する。また、異常データの割合はおおよそわかっているが、それがどの周波数帯域に属するかはわからない場合には、周波数帯域毎に分解されたＱ統計量とホテリングのＴ^２統計量の各値に基づいて、一旦閾値を決定し、前述(a)の場合の閾値決定法で決定した閾値と比較し、（a）の場合の閾値の５０％を下回る閾値がクラスタリング手法で得られた場合には、(a)の場合の閾値を採用するというルールで対処する。 Further, when information indicating that abnormal data and normal data are mixed and the approximate ratio of abnormal data is known is input as prior information, the threshold value determination unit 52 uses the normalized data X. Q statistic and higher alpha% of the data decomposed into each frequency band T ² statistic Hotelling Q statistic decomposed for each corresponding frequency band in (alpha input value as the prior information) calculated Te and Hotelling Each value of the T ² statistic is calculated in advance. If, for all frequency bands, the approximate proportion of abnormal data is unknown and only known for some frequency bands, the threshold of the unknown frequency band is the same as that described above (a ). The proportion of abnormal data is roughly known, but if it is not known belong to which frequency bands, on the basis of the respective values of the degraded Q statistics and T ² statistic Hotelling for each frequency band , Once the threshold value is determined and compared with the threshold value determined by the threshold value determination method in the case of (a) above, and when a threshold value less than 50% of the threshold value in the case of (a) is obtained by the clustering method, ( Deal with the rule of adopting the threshold in the case of a).

一方、以下で示す作用は、オンラインで実行される。まず、プロセス監視部６は、モデル構築部４で構築したプロセス監視モデルを使用して、プロセス１の監視を実行する。プロセス監視部６は、監視するプロセス変数として、各種センサ１１〜１Ｎから新たに生成されたＱ統計量やホテリングのＴ^２統計量を、ウェーブレット変換で分解した周波数帯域毎のＱ統計量やホテリングのＴ^２統計量を監視する。即ち、プロセス監視部６は、下記式（１７），（１８）を使用して、周波数帯域毎に分解したＱ統計量およびホテリングのＴ^２統計量を監視する。

On the other hand, the following actions are executed online. First, the process monitoring unit 6 monitors the process 1 using the process monitoring model constructed by the model construction unit 4. Process monitoring unit 6, a process variable to be monitored, the T ² statistic Q statistic and Hotelling newly generated from the various sensors 11 to 1N, for each frequency band decomposed by the wavelet transform Q statistic and Hotelling monitoring the T ² statistic. That is, the process monitoring unit 6 uses the following formulas (17) and (18) to monitor the Q statistic decomposed for each frequency band and the hotelling T ² statistic.

即ち、プロセスの監視対象がＱ統計量やホテリングのＴ^２統計量ではなく、それらをウェーブレット変換で分解した周波数帯域毎のＱ統計量やホテリングのＴ^２統計量である。 That is, the process monitoring target is not a Q statistic or a Hotelling T ² statistic, but a Q statistic for each frequency band or a Hotelling T ² statistic obtained by decomposing them by wavelet transform.

次に、プロセス異常診断部７は、Ｑ統計量やホテリングのＴ^２統計量ではなく、それらをウェーブレット変換で分解した周波数帯域毎のＱ統計量やホテリングのＴ^２統計量のいずれか一つが閾値を超えた場合に異常と判断する。プロセス異常要因センサ特定部８、プロセス異常要因推定部９及びユーザインターフェース部１０の動作は、前述の第１の実施形態と全く同様である。 Next, the process abnormality diagnosis unit 7 determines that one of the Q statistic and the hoteling T ² statistic for each frequency band obtained by decomposing the Q statistic and the hotelling T ² statistic by the wavelet transform is a threshold value. If it exceeds, it is judged as abnormal. The operations of the process abnormality factor sensor specifying unit 8, the process abnormality factor estimation unit 9, and the user interface unit 10 are exactly the same as those in the first embodiment.

以上のように第３の実施形態によれば、ＭＳＰＣによるプロセス監視において、異常検出を行うデータを、ウェーブレット変換を用いて分解した周波数帯域毎のＱ統計量やホテリングのＴ^２統計量を使用することにより、異常検出に対する機能を向上させることができる。また、異常検出のための閾値をプロセスデータに対する事前情報に応じて適切に決定することにより、異常検出の誤りを減少熔させることができる。これにより、ＭＳＰＣによるプロセス異常監視の性能を向上させる事が可能となる。 As described above, according to the third embodiment, in the process monitoring by MSPC, the Q statistic for each frequency band and the hotelling T ² statistic obtained by decomposing the data to be subjected to abnormality detection using the wavelet transform are used. Thus, it is possible to improve the function for abnormality detection. In addition, by appropriately determining a threshold value for abnormality detection according to prior information for process data, errors in abnormality detection can be reduced. As a result, the performance of process abnormality monitoring by MSPC can be improved.

以上要するに各実施形態によれば、異常状態を見逃してしまう誤りと正常状態であるのに異常と検出してしまう誤りのトレードオフを適切に調整して、異常検出精度と異常要因特定精度を向上させたプロセス異常診断システムを提供することができる。また、プロセスを監視している監視者（プロセス運転員やプロセス管理者）の主観的な判断を、異常検出精度と異常要因特定精度に適切に反映させた適応的な学習機能を持つプロセス異常診断システムを提供することができる。 In short, according to each embodiment, the error detection accuracy and the error factor identification accuracy are improved by appropriately adjusting the trade-off between an error that misses an abnormal state and an error that is detected as an abnormality even in a normal state. A process abnormality diagnosis system can be provided. In addition, process abnormality diagnosis with an adaptive learning function that appropriately reflects the subjective judgments of process supervisors (process operators and process managers) on abnormality detection accuracy and abnormality factor identification accuracy A system can be provided.

なお、前述の各実施形態のプロセス異常診断装置を適用し、プロセス１のプロセスデータ及び異常診断結果を、サーバが管理するデータベースに蓄積して提供するサービスを行なうプロセス監視システムを構築できる。このシステムは、当該サーバからインターネットや専用通信回線を介して、プロセス１を管理運営している運転員蛾操作するユーザ端末にプロセスデータ及び異常診断結果を提供する。 In addition, by applying the process abnormality diagnosis device of each of the above-described embodiments, it is possible to construct a process monitoring system that provides a service that accumulates and provides process data and abnormality diagnosis results of process 1 in a database managed by a server. This system provides process data and an abnormality diagnosis result to the user terminal operated by the operator who manages and operates the process 1 from the server via the Internet or a dedicated communication line.

なお、本発明は上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合わせにより、種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態にわたる構成要素を適宜組み合わせてもよい。 Note that the present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. In addition, various inventions can be formed by appropriately combining a plurality of components disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, constituent elements over different embodiments may be appropriately combined.

本発明の第１の実施形態に関するプロセス異常診断装置の要部を示すブロック図。The block diagram which shows the principal part of the process abnormality diagnostic apparatus regarding the 1st Embodiment of this invention. 本実施形態に関する正規化用事前情報の入力方法を説明するための図。The figure for demonstrating the input method of the prior information for normalization regarding this embodiment. 本実施形態に関する閾値決定法の概念を説明するための図。The figure for demonstrating the concept of the threshold value determination method regarding this embodiment. 本実施形態に関して、正常データと異常データを分類する閾値を設定するための概念を示す図。The figure which shows the concept for setting the threshold value which classify | categorizes normal data and abnormal data regarding this embodiment. 本実施形態に関して、正常データと異常データを分類する閾値を設定するための概念を示す図。The figure which shows the concept for setting the threshold value which classify | categorizes normal data and abnormal data regarding this embodiment. 本実施形態に関するＰＣＡのデータ分解演算の概念を示す図。The figure which shows the concept of the data decomposition calculation of PCA regarding this embodiment. 本実施形態に関するＱ統計量とホテリングのＴ^２統計量の概念を説明するための図。Diagram for explaining the concept of the Q statistic and T ² statistic of Hotelling according to the embodiment. 本実施形態に関して、特にセンサ故障時でのプロセス異常要因の特定を説明するための図。The figure for demonstrating specification of the process abnormality factor at the time of a sensor failure regarding this embodiment. 本実施形態に関して、特にプロセス異常時でのプロセス異常要因の特定を説明するための図。The figure for demonstrating specification of the process abnormality factor at the time of process abnormality regarding this embodiment especially. 第２の実施形態に関するＱ統計量に対する寄与量の概念を示す図。The figure which shows the concept of the contribution amount with respect to Q statistic regarding 2nd Embodiment. 第３の実施形態に関するＱ統計量を離散ウェーブレット変換により複数の周波数帯域毎のデータに分解した具体例を示す図。The figure which shows the specific example which decomposed | disassembled the Q statistic regarding 3rd Embodiment into the data for every several frequency band by discrete wavelet transform. 第３の実施形態に関する異常検出率の効果を説明するための図。The figure for demonstrating the effect of the abnormality detection rate regarding 3rd Embodiment. 第１の実施形態の効果を説明するための図で、データ分布と誤りタイプとの関係を示す図。The figure for demonstrating the effect of 1st Embodiment, and the figure which shows the relationship between data distribution and an error type. 第１の実施形態の効果を説明するための図で、誤りタイプＡ，Ｂの内容を説明するための図。The figure for demonstrating the effect of 1st Embodiment, and the figure for demonstrating the content of error type A and B. FIG. 第１の実施形態の効果を説明するための図で、正常データを異常と誤検出するための具体例を示す図。The figure for demonstrating the effect of 1st Embodiment, and the figure which shows the specific example for erroneously detecting normal data as abnormality. 第１の実施形態の効果を説明するための図、異常データを見落とす具体例を示す図。The figure for demonstrating the effect of 1st Embodiment, The figure which shows the specific example which overlooks abnormal data.

Explanation of symbols

１…プロセス、２…プロセス計測データ収集・保存部、
３…プロセス監視モデル構築用プロセスデータ抽出部、
４…プロセス監視モデル構築・供給部、５…プロセス異常判定基準設定・供給部、
６…プロセス監視部、７…プロセス異常診断部、８…プロセス異常要因センサ特定部、
９…プロセス異常要因推定部、１０…ユーザインターフェース部、
４１…プロセスデータ正規化モデル生成部、
４２…異常検出モデル生成部、５１…閾値決定用事前情報入力部、５２…閾値決定部。 1 ... Process, 2 ... Process measurement data collection / storage unit,
3 ... Process data extraction unit for process monitoring model construction,
4 ... Process monitoring model construction / supply unit, 5 ... Process abnormality judgment standard setting / supply unit,
6 ... Process monitoring unit, 7 ... Process abnormality diagnosis unit, 8 ... Process abnormality factor sensor identification unit,
9 ... Process abnormality factor estimation unit, 10 ... User interface unit,
41 ... Process data normalization model generation unit,
42: Anomaly detection model generation unit, 51: Threshold determination prior information input unit, 52: Threshold determination unit.

Claims

A measuring means for measuring the state or operation amount of the target process;
Data storage means for collecting and storing time-series data indicating measurement results from the measurement means;
Process data extraction means used to create a process monitoring model from the time series data stored in the data storage means, and to extract process data indicating the state of the target process;
Normalization model generation means for generating a normalization model for normalizing the process data extracted by the process data extraction means;
A process monitoring model based on a principal component analysis method for the process data normalized by the normalization model is constructed, and Q statistics or hotelling T ² statistics statistics data is used as abnormality detection data of the normalized process data. Process monitoring model means for calculating the contribution amount of the measurement item (measurement variable) measured by the measurement means for the statistical data ,
Prior information relating to a ratio between abnormal data and normal data included in the statistical data, and prior information generating means for generating the prior information for the contribution amount ;
Threshold determination means for determining a threshold for determining an abnormal state and a normal state of the statistical data using the prior information and the contribution amount of the statistical data calculated by the process monitoring model means; ,
A process abnormality diagnosis device comprising: an abnormality diagnosis unit that diagnoses an abnormality of the target process using the threshold value determined by the threshold value determination unit and the statistical data.

A measuring means for measuring the state or operation amount of the target process;
Data storage means for collecting and storing time-series data indicating measurement results from the measurement means;
Process data extraction means used to create a process monitoring model from the time series data stored in the data storage means, and to extract process data indicating the state of the target process;
Normalization model generation means for generating a normalization model for normalizing the process data extracted by the process data extraction means;
A process monitoring model based on a principal component analysis method for the process data normalized by the normalization model is constructed, and Q statistics or hotelling T ² statistics statistics data is used as abnormality detection data of the normalized process data. And process monitoring model means for calculating the statistical data decomposed for each frequency band by applying a discrete wavelet transform,
Prior information generation means for generating the prior information for the statistical data decomposed for each frequency band, which is prior information on the ratio between abnormal data and normal data included in the statistical data;
Using the prior information and the statistical data decomposed for each frequency band calculated by the process monitoring model means, a threshold for determining the abnormal state and the normal state of the statistical data is determined. Threshold determination means;
An abnormality diagnosis means for diagnosing an abnormality of the target process using the threshold value determined by the threshold value determination means and the statistical data;
A process abnormality diagnosis apparatus comprising:

The process monitoring model means calculates a contribution amount of the statistic data decomposed for each frequency band by applying a discrete wavelet transform,
The prior information generating means generates the prior information for the contribution amount of the statistical data decomposed for each frequency band,
The threshold value determination means uses the contribution amount of the statistical data decomposed for each frequency band calculated by the prior information and the process monitoring model means to determine an abnormal state and a normal state The process abnormality diagnosis apparatus according to claim 1, wherein a threshold value is determined.