JP2010244149A

JP2010244149A - Data classification method, data classification device, diagnostic method and diagnostic device

Info

Publication number: JP2010244149A
Application number: JP2009089673A
Authority: JP
Inventors: Yoshinari Hori; 嘉成堀; Takao Sekiai; 孝朗関合; Toru Eguchi; 徹江口; Akihiko Yamada; 昭彦山田; Hisahiro Kusumi; 尚弘楠見; Masayuki Fukai; 雅之深井; Satoru Shimizu; 悟清水
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2009-04-02
Filing date: 2009-04-02
Publication date: 2010-10-28

Abstract

<P>PROBLEM TO BE SOLVED: To solve the following problem: when there is a difference in variation of data items to be classified, the number of categories for classifying data increases, thereby increasing calculation cost. <P>SOLUTION: A computer obtains standard deviation of each item of sampled input data, preprocesses the input data based on the obtained standard deviation of each item, selects a prototype of a category closest to the preprocessed input data, evaluates whether or not the selected prototype is proper from relation between the prototype and the input data, and classifies a plurality of pieces of the input data constituted of a plurality of items into a plurality of categories. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、異常検知及び異常診断システムへの適用を目的とした、データ分類方法及び装置に関するものである。 The present invention relates to a data classification method and apparatus intended for application to an abnormality detection and abnormality diagnosis system.

データ分類方法の従来技術には、特許文献１に記載されたデータ分類方法がある。特許文献１に記載された従来技術は、適応共鳴理論（Adaptive Resonance Theory：ＡＲＴ）を用いたデータ分類装置により、予め想定した異常事象以外の異常事象が発生した場合においても、その異常個所を絞り込むことができる診断装置が記載されている。 As a prior art of the data classification method, there is a data classification method described in Patent Document 1. The prior art described in Patent Document 1 narrows down the abnormal location even when an abnormal event other than the abnormal event assumed in advance occurs by a data classification device using Adaptive Resonance Theory (ART). Diagnostic devices that can be described are described.

特開平０５−１６５３７５号公報JP 05-165375 A

しかしながら、従来技術を異常検知に適用した場合、ある条件下では、カテゴリー数が多くなり、計算コストが大きくなる場合がある。 However, when the conventional technique is applied to abnormality detection, the number of categories may increase under certain conditions, and the calculation cost may increase.

本発明の目的は、従来技術と同等のデータ分類性能を維持しながら、より少ないカテゴリーでデータを分類し、計算コストを小さくするデータ分類方法を提供することである。 An object of the present invention is to provide a data classification method for classifying data with fewer categories and reducing calculation costs while maintaining data classification performance equivalent to that of the prior art.

本発明は、コンピュータが、サンプリングした入力データの各項目の標準偏差を求め、求めた各項目の標準偏差に基づいて入力データを前処理し、前処理した入力データに最も近いカテゴリーのプロトタイプを選択し、前記プロトタイプと前記入力データの関係から該選択されたプロトタイプが妥当であるかを評価し、複数項目からなる複数個の入力データを複数個のカテゴリーに分類するデータ分類方法である。 In the present invention, the computer calculates the standard deviation of each item of the sampled input data, preprocesses the input data based on the calculated standard deviation of each item, and selects the prototype of the category closest to the preprocessed input data The data classification method evaluates whether the selected prototype is valid from the relationship between the prototype and the input data, and classifies a plurality of input data composed of a plurality of items into a plurality of categories.

本発明によれば、異常検知や異常診断に適用した場合に、データの特徴に合わせてデータを前処理するため、少ないカテゴリー数でデータを分類することが可能となり計算コストを少なくすることが可能となる。 According to the present invention, when applied to abnormality detection or abnormality diagnosis, data is preprocessed according to the characteristics of the data, so that the data can be classified with a small number of categories, and the calculation cost can be reduced. It becomes.

計測データの一例を示す図。The figure which shows an example of measurement data. 正規化した計測データを示す図。The figure which shows the measurement data normalized. 従来技術によって分類した結果の概要示す図。The figure which shows the outline | summary of the result classified according to the prior art. 本発明の第１の実施例の構成を示す図。The figure which shows the structure of the 1st Example of this invention. データ前処理装置の処理フローを示す図。The figure which shows the processing flow of a data pre-processing apparatus. 本発明の第１の実施例による学習アルゴリズムを示す図。The figure which shows the learning algorithm by 1st Example of this invention. 本発明の第１の実施例による分類結果の概要を示す図。The figure which shows the outline | summary of the classification result by 1st Example of this invention. 本発明の第２の実施例の構成を示す図。The figure which shows the structure of the 2nd Example of this invention. カテゴリー対応マトリックスの例を示す図。The figure which shows the example of a category correspondence matrix. 本発明の第２の実施例による学習アルゴリズムを示す図。The figure which shows the learning algorithm by the 2nd Example of this invention. 判定モードの例を示す図。The figure which shows the example of determination mode. 本発明の第３の実施例の構成を示す図。The figure which shows the structure of the 3rd Example of this invention. 保守データの一例を示す図。The figure which shows an example of maintenance data. データ制御装置のデータ変換アルゴリズムを示す図。The figure which shows the data conversion algorithm of a data control apparatus.

本発明について、各実施例を用いて説明する。 The present invention will be described using each example.

複数個の入力データを複数個のカテゴリーに分類する第一の実施例を提案する。 A first embodiment for classifying a plurality of input data into a plurality of categories is proposed.

また、複数個の入力データを予め設定した複数個のカテゴリーＢに分類する第２の実施例を提案する。 A second embodiment is also proposed in which a plurality of input data is classified into a plurality of preset categories B.

また、複数個の入力データを複数個のカテゴリーに分類する第一の実施例を用いて、設備を診断する第三の実施例を提案する。 A third embodiment for diagnosing equipment is proposed using the first embodiment for classifying a plurality of input data into a plurality of categories.

さらに、複数個の入力データを予め設定した複数個のカテゴリーＢに分類する第２の実施例を用いて、設備を診断する第四の実施例を提案する。 Furthermore, a fourth embodiment for diagnosing equipment using the second embodiment for classifying a plurality of input data into a plurality of preset categories B is proposed.

以下説明する実施例によれば、従来技術と同等のデータ分類性能を維持しながら、より少ないカテゴリーでデータを分類し、計算コストを小さくするデータ分類方法を提供することができる。従来技術を異常検知に適用した場合、ある条件下では、カテゴリー数が多くなり、計算コストが大きくなる場合がある。以下図１〜図３を用いてその一例を説明する。 According to the embodiments described below, it is possible to provide a data classification method for classifying data in fewer categories and reducing the calculation cost while maintaining data classification performance equivalent to that of the prior art. When the conventional technique is applied to abnormality detection, the number of categories may increase under certain conditions and the calculation cost may increase. An example will be described below with reference to FIGS.

図１は、ある機器が正常に稼動しているときに計測した２つの計測値（データＡ，データＢ）のトレンドグラフである。この例は、運転条件１〜運転条件３の３つの運転条件で運転した場合の例である。この計測値を正常時の運転データとして学習させる場合について考える。一般に、計測値の単位，大きさにばらつきがあるため、まず、計測値を正規化する。ここでは、データＡ及びデータＢの値がそれぞれ、［０.０５，０.９５］の範囲に入るように正規化した。 FIG. 1 is a trend graph of two measurement values (data A and data B) measured when a certain device is operating normally. This example is an example in the case where the vehicle is operated under three operation conditions of operation conditions 1 to 3. Consider the case where this measured value is learned as normal operation data. In general, since the units and sizes of measurement values vary, first, the measurement values are normalized. Here, the values of data A and data B are normalized so as to fall within the range of [0.05, 0.95].

図２に、正規化したデータＡとデータＢの関係を示す。データは、運転条件１〜３で３つの固まりになっている。計測値を正規化することで、データＡおよびデータＢの取りうる範囲は同じになるが、データＡに比べて、データＢのばらつきが大きくなり、データ２の方向に細長く伸びたデータ群となる。これは、データＡとデータＢで、データのとりうる範囲とデータのばらつきの比率が異なるためである。 FIG. 2 shows the relationship between normalized data A and data B. The data consists of three clusters under operating conditions 1 to 3. By normalizing the measurement values, the range that can be taken by the data A and the data B becomes the same, but the variation of the data B is larger than that of the data A, and the data group is elongated in the direction of the data 2. . This is because the range of data and the ratio of data variation are different between data A and data B.

次に、このデータを複数のカテゴリーに分類し、機器が正常な状態の領域を定義する。図３に従来技術を用いてデータを分類した結果を示す。破線は、カテゴリーの領域を模式的に現したものである。この例では、データはＡ１〜Ａ８の８つのカテゴリーに分類された。従来技術では、カテゴリーの領域の大きさは可変であるが、領域の形状は常に円形となるため、データの特性により、データ群が細長い形状になる場合には、カテゴリー数が増えてしまう。 Next, this data is classified into a plurality of categories, and an area in which the device is in a normal state is defined. FIG. 3 shows the result of classifying data using the prior art. The broken line schematically represents the category area. In this example, the data was classified into eight categories A1 to A8. In the prior art, the size of the category area is variable, but the shape of the area is always circular. Therefore, the number of categories increases when the data group has an elongated shape due to the characteristics of the data.

第一の実施例に係るデータ分類装置は、複数個の入力データを複数個のカテゴリーに分類するデータ分類装置であって、カテゴリーのプロトタイプと、カテゴリーの領域の大きさを規定する領域決定パラメータを格納する内部データ格納部と、サンプリングした入力データの各項目の分散を求め、求めた各項目の分散が全てある範囲に入るように入力データを前処理するデータ前処理手段と、前記データ前処理手段により処理した入力データに最も近いカテゴリーのプロトタイプを選択するプロトタイプ選択手段と、選択したプロトタイプが妥当であるかを、プロトタイプと入力データの類似度を両者の距離で評価するプロトタイプ評価手段と、新しいプロトタイプを新規に作成するプロトタイプ追加手段と、領域決定パラメータとプロトタイプの少なくとも一方を補正することができる内部データ補正手段を有し、前記プロトタイプ評価手段では、プロトタイプが妥当と判定された場合に、前記内部データ補正手段により、領域決定パラメータとプロトタイプの少なくとも一方を補正し、前記プロトタイプ評価手段でプロトタイプが妥当と判定なかった場合に、プロトタイプ追加手段により新しいプロトタイプを作成することを特徴とするデータ分類装置である。 The data classification apparatus according to the first embodiment is a data classification apparatus that classifies a plurality of input data into a plurality of categories, and includes a prototype of the category and an area determination parameter that defines the size of the category area. An internal data storage unit to store; data preprocessing means for pre-processing input data so as to obtain the variance of each item of sampled input data, and so that all of the calculated distributions of each item fall within a certain range; and the data pre-processing Prototype selection means that selects the prototype of the category closest to the input data processed by the means, prototype evaluation means that evaluates the similarity between the prototype and the input data based on the distance between the two, and a new Prototype addition means for creating a new prototype, area determination parameters and prototype Internal data correction means capable of correcting at least one of the group, and in the prototype evaluation means, when the prototype is determined to be valid, at least one of the region determination parameter and the prototype is obtained by the internal data correction means. The data classification apparatus is characterized in that a new prototype is created by the prototype addition means when the prototype evaluation means does not determine that the prototype is valid.

第二の実施例に係るデータ分類装置は、複数個の入力データを予め設定した複数個のカテゴリーＢに分類するデータ分類装置であって、該カテゴリーＢに関連するカテゴリーＡのプロトタイプと、カテゴリーＡの領域の大きさを規定する領域決定パラメータを格納する内部データ格納部と、カテゴリーＡをカテゴリーＢに変換するためのカテゴリー対応マトリックスを格納するカテゴリー対応マトリックス格納部と、サンプリングした入力データの各項目の分散を求め、求めた各項目の分散が全てある範囲に入るように入力データを前処理するデータ前処理手段と、前記データ前処理手段により処理した入力データに最も近いカテゴリーＡのプロトタイプを選択するプロトタイプ選択手段と、選択したプロトタイプが妥当であるかを、プロトタイプと入力データの類似度から評価するプロトタイプ評価手段と、カテゴリーＡの新しいプロトタイプを新規に作成するプロトタイプ追加手段と、領域決定パラメータとプロトタイプの少なくとも一方、及び前記カテゴリー対応マトリックスを補正する内部データ及びマトリックス補正手段と、該選択されたカテゴリーＡを前記カテゴリー対応マトリックスを用いてカテゴリーＢに変換するカテゴリー変換手段を有し、前記内部データ及びマトリックス補正手段が、前記カテゴリー対応マトリックスに選択されたカテゴリーＡと予め設定したカテゴリーＢの関係を追加する場合において、選択したカテゴリーＡが既に別のカテゴリーＢに対応している場合、対応関係の追加を拒否し、追加拒否された場合には、前記カテゴリーＡのプロトタイプを、前記入力データから遠ざけるように補正し、該領域決定パラメータを、該領域決定パラメータが該カテゴリーＡのプロトタイプと該入力データの距離と等しくなるように補正することを特徴とするデータ分類装置である。 A data classification apparatus according to the second embodiment is a data classification apparatus that classifies a plurality of input data into a plurality of preset category B, and includes a category A prototype related to the category B, a category A An internal data storage section for storing area determination parameters that define the size of the area, a category correspondence matrix storage section for storing a category correspondence matrix for converting category A into category B, and each item of sampled input data Data pre-processing means for pre-processing the input data so that the distribution of all the calculated items falls within a certain range, and a category A prototype closest to the input data processed by the data pre-processing means is selected. Prototype selection means to be used and whether the selected prototype is valid Prototype evaluation means for evaluating from the similarity between the input data and the input data, prototype addition means for newly creating a new prototype of category A, at least one of a region determination parameter and a prototype, and internal data for correcting the category correspondence matrix; Matrix correction means and category conversion means for converting the selected category A into category B using the category correspondence matrix, and the internal data and matrix correction means are selected as the category correspondence matrix A In the case of adding a preset category B relationship, if the selected category A already corresponds to another category B, the addition of the corresponding relationship is rejected, and if the addition is rejected, the category A Protota A data classification device, wherein the region determination parameter is corrected so that the region determination parameter is equal to a distance between the category A prototype and the input data. It is.

第三の実施例に係る設備の診断システムは、設備の診断システムにおいて、第一の発明であるデータ分類装置を有し、該設備の運転データを前記データ分類装置に入力し、分類されたカテゴリーに基づき設備の状態を診断することを特徴とする設備の診断システムである。 The facility diagnosis system according to the third embodiment includes the data classification device according to the first aspect of the facility diagnosis system, and the operation data of the facility is input to the data classification device, and the classified category is obtained. The facility diagnosis system is characterized in that the state of the facility is diagnosed based on the above.

第四の実施例に係る設備の診断システムは第２の発明であるデータ分類装置を有し、設備の状態を教師パターンとして与え、設備の状態と該設備の運転データとの関係を前記データ分類装置により学習し、学習後の前記データ分類装置に未学習の運転データを入力し、該データ分類装置が出力したカテゴリーＢにより機器の状態を診断することを特徴とする設備診断システムである。 A facility diagnosis system according to a fourth embodiment includes the data classification device according to the second aspect of the present invention, which gives the state of the facility as a teacher pattern, and the relationship between the state of the facility and the operation data of the facility is the data classification It is an equipment diagnosis system characterized by learning with a device, inputting unlearned operation data into the data classification device after learning, and diagnosing the state of the device based on category B output by the data classification device.

図４に本発明の第一実施例の概要を示す。本実施例は、入力データをいくつかのカテゴリーに分類するデータ分類装置であり、予め分類したいカテゴリーを教師データとして与えておらず、教師なし分類である。 FIG. 4 shows an outline of the first embodiment of the present invention. The present embodiment is a data classification device that classifies input data into several categories. The category to be classified in advance is not given as teacher data, and is unsupervised classification.

本実施例は、入力データ前処理手段１０，データ分類手段２０、および処理フロー制御手段からなる。また、データ分類手段２０は、データ読込み手段２１，内部データ初期化手段２２，内部データ格納部２３，プロトタイプ選択手段２４，プロトタイプ評価手段２５，プロトタイプ追加手段２６，内部データ補正手段２７からなる。 This embodiment comprises input data preprocessing means 10, data classification means 20, and processing flow control means. The data classification unit 20 includes a data reading unit 21, an internal data initialization unit 22, an internal data storage unit 23, a prototype selection unit 24, a prototype evaluation unit 25, a prototype addition unit 26, and an internal data correction unit 27.

入力データ前処理手段１０では、入力データの各項目によるバラツキを補正するように前処理する。入力データは、一般に、温度，流量，電圧，信号の強度などの物理量であるため、分類する入力データの種類，特性によって、値の大きさ，変動量には、ばらつきが生じる。入力データ前処理手段１０では、これらのばらつきを補正するために、分類する入力データの最大値，最小値および分散を用いて、データを前処理する。データ処理の詳細については、後述する。 The input data preprocessing means 10 performs preprocessing so as to correct variations due to each item of input data. Since the input data is generally a physical quantity such as temperature, flow rate, voltage, and signal intensity, the magnitude of the value and the fluctuation amount vary depending on the type and characteristics of the input data to be classified. In order to correct these variations, the input data preprocessing means 10 preprocesses data using the maximum value, minimum value and variance of the input data to be classified. Details of the data processing will be described later.

次に、データ分類手段２０について説明する。データ読込み手段２１は、処理フロー制御手段３０の指令に応じて、前処理された入力データを読込む。内部データ初期化手段２２は、データ分類処理の最初にのみ実行され、内部データ格納部２３に格納されたデータを入力データに応じて初期化する。 Next, the data classification unit 20 will be described. The data reading means 21 reads preprocessed input data in response to a command from the processing flow control means 30. The internal data initialization unit 22 is executed only at the beginning of the data classification process, and initializes the data stored in the internal data storage unit 23 according to the input data.

内部データ格納部２３には各カテゴリーのプロトタイプと、各カテゴリーの領域を決定するパラメータ（領域決定パラメータ）が格納されている。プロトタイプは各カテゴリーの代表例であり、カテゴリーと１対１に対応する。入力データがＮ次元のデータであった場合、プロトタイプもＮ次元のデータとなる。領域決定パラメータは、各カテゴリーの領域を規定するパラメータであり、カテゴリー毎に定義されている。プロトタイプ選択手段２４は、前処理された入力データと内部データ格納部２３に格納されているプロトタイプを比較し、入力データに最も近いプロトタイプを選択する機能である。どのプロトタイプが入力データに近いかは、入力データとプロトタイプの距離で判断する。すなわち、内部データ格納部２３に格納されているプロトタイプと前処理された入力データの距離を計算し、最も距離が近いプロトタイプを選択する。具体的な処理は後述する。 The internal data storage unit 23 stores a prototype for each category and a parameter for determining the area of each category (area determination parameter). The prototype is a representative example of each category, and corresponds to the category on a one-to-one basis. If the input data is N-dimensional data, the prototype is also N-dimensional data. The area determination parameter is a parameter that defines the area of each category, and is defined for each category. The prototype selection means 24 is a function that compares the preprocessed input data with the prototype stored in the internal data storage unit 23 and selects the prototype closest to the input data. Which prototype is close to the input data is determined by the distance between the input data and the prototype. That is, the distance between the prototype stored in the internal data storage unit 23 and the preprocessed input data is calculated, and the prototype with the closest distance is selected. Specific processing will be described later.

プロトタイプ評価手段２５では、プロトタイプ選択手段２４で選択したプロトタイプが妥当であるかを評価する。選択したプロトタイプは内部データ格納部２３に格納されているものの中では、最も入力データに近いが、十分に近いプロトタイプであるとは限らない。プロトタイプ評価手段２５では、入力データと選択したプロトタイプｊの距離ｒ_mjと領域決定パラメータＲ_jとを比較し、式（１）を満足する場合、選択したプロトタイプｊが妥当であると判断し、入力データが分類されるカテゴリーを決定する。また、同時に内部データ補正手段２７の処理を実施する。 The prototype evaluation unit 25 evaluates whether the prototype selected by the prototype selection unit 24 is valid. The selected prototype is the closest to the input data among those stored in the internal data storage unit 23, but it is not necessarily a sufficiently close prototype. The prototype evaluation means 25 compares the input data, the distance r _mj between the selected prototype j and the region determination parameter R _j, and if the expression (1) is satisfied, determines that the selected prototype j is valid, and inputs Determine the category into which the data is classified. At the same time, the processing of the internal data correction means 27 is performed.

ｒ_mj≦Ｒ_j …（１） r _mj ≦ R _j (1)

式（１）を満足しなかった場合、プロトタイプ追加手段２６の処理を実施する。 If the expression (1) is not satisfied, the processing of the prototype adding means 26 is performed.

プロトタイプ追加手段２６は、プロトタイプ評価手段２５で、選択したプロトタイプが妥当でないと判断された場合に、プロトタイプを追加する。プロトタイプは入力データを代表するものでなければならないため、後述のように入力データの近傍のデータが新たなプロトタイプとして追加される。なお、入力データと全く同じデータをプロトタイプとして追加しても良い。 The prototype adding unit 26 adds a prototype when the prototype evaluating unit 25 determines that the selected prototype is not valid. Since the prototype must represent the input data, data in the vicinity of the input data is added as a new prototype as will be described later. Note that the same data as the input data may be added as a prototype.

内部データ補正手段２７は、選択したプロトタイプが妥当であった場合に、プロトタイプ，領域決定パラメータを補正することで、データの分類特性をより適切にする手段である。したがって、内部データ補正手段２７は、領域決定パラメータ補正手段とプロトタイプ補正手段からなる。領域決定パラメータ補正手段では、カテゴリー領域に含まれるデータの分布状態と領域決定パラメータを比較し、データの分布に対して、領域決定パラメータが大きい場合、領域決定パラメータを小さくする。プロトタイプ補正手段では、プロトタイプを入力データに近づける。近づける度合いはパラメータで調整可能である。プロトタイプは複数の入力データの代表例であり、入力データは逐次入力される。プロトタイプを入力データに近づける処理をすることで、プロトタイプをより適切に設定することが可能となる。 The internal data correction means 27 is a means for making the data classification characteristics more appropriate by correcting the prototype and region determination parameters when the selected prototype is valid. Therefore, the internal data correction unit 27 includes an area determination parameter correction unit and a prototype correction unit. The region determination parameter correction means compares the distribution state of the data included in the category region with the region determination parameter, and decreases the region determination parameter when the region determination parameter is larger than the data distribution. The prototype correction means brings the prototype closer to the input data. The degree of approach can be adjusted by a parameter. The prototype is a representative example of a plurality of input data, and the input data is sequentially input. By processing the prototype closer to the input data, it becomes possible to set the prototype more appropriately.

処理フロー制御手段３０では、データ分類装置における処理フローを制御する。主な機能は、入力データを最初に読込んだ場合に内部データ初期化手段２２を実行することと、プロトタイプ評価手段２５から出力されるカテゴリーの変化から繰り返し処理の打ち切りを判断することである。 The processing flow control means 30 controls the processing flow in the data classification device. The main functions are to execute the internal data initializing means 22 when the input data is first read and to determine the termination of the repetitive processing from the change of the category output from the prototype evaluating means 25.

次に実施例１の処理フローについて詳細に説明する。本実施例では、入力データは２次元データであるが、３次元以上のデータであっても同様の処理となる。また、入力データ（Ｘ，Ｙ）及び教師カテゴリーはＭ組あるとした。また、本データ分類方法は、Ｍ組のデータを繰り返し何度か与えることでデータを適切に分類する手法である。本実施例では、繰り返し回数をＫ回とした。以下、処理フローについて説明する。 Next, the processing flow of the first embodiment will be described in detail. In this embodiment, the input data is two-dimensional data, but the same processing is performed even if the input data is three-dimensional data or more. Further, it is assumed that there are M sets of input data (X, Y) and teacher categories. The data classification method is a method for appropriately classifying data by repeatedly giving M sets of data several times. In this example, the number of repetitions was K. Hereinafter, the processing flow will be described.

最初に、入力データ前処理手段１０の処理フローについて図５を用いて説明する。 First, the processing flow of the input data preprocessing means 10 will be described with reference to FIG.

Step１０１：（Ｘ，Ｙ）のそれぞれについて、最大値と最小値を計算する。例えば、データＸが温度、データＹが流量で、それぞれ、５００℃〜５６０℃，２.５ｔ／ｈ〜３.５ｔ／ｈまで変動したとすると、Ｘの最大値は５６０，最小値は５００。Ｙの最大値は３.５，最小値は２.５となる。 Step 101: Calculate the maximum value and the minimum value for each of (X, Y). For example, if the data X is the temperature and the data Y is the flow rate, the maximum value of X is 560 and the minimum value is 500, assuming that the temperature fluctuates from 500 ° C. to 560 ° C. and 2.5 t / h to 3.5 t / h, respectively. The maximum value of Y is 3.5 and the minimum value is 2.5.

Step１０２：Step１０１で求めた最大，最小値で入力データを正規化する。ここでは、Ｘ，Ｙのどちらも、最小値が０.１、最大値が０.９となるように正規化する。 Step 102: The input data is normalized with the maximum and minimum values obtained in Step 101. Here, both X and Y are normalized so that the minimum value is 0.1 and the maximum value is 0.9.

Step１０３：入力データの中から定常状態のデータを抽出する。ここで、定常状態のデータとは、操作条件を変更せずに、一定条件でしばらく運転していた場合のデータである。 Step 103: Steady state data is extracted from the input data. Here, the steady-state data is data in the case of operating for a while under a fixed condition without changing the operating condition.

Step１０４：抽出したデータの標準偏差を計算する。抽出したデータは定常状態のデータであるが、センサーの誤差，プロセスの外乱などのため、若干データにばらつきが生じるため、バラツキの程度の指標となる標準偏差をＸ，Ｙのそれぞれの正規化したデータについて計算する。 Step 104: Calculate the standard deviation of the extracted data. The extracted data is steady-state data, but the data varies slightly due to sensor errors, process disturbances, etc., so the standard deviation, which is an indicator of the degree of variation, is normalized to X and Y respectively. Calculate on the data.

Step１０５：求めた標準偏差の値を使って、バラツキが同程度になるように正規化したデータを補正する。ここで、正規化したデータを（Ｘｎ，Ｙｎ）、補正したデータを（Ｘｎ′，Ｙｎ′）、Ｘｎ，Ｙｎの標準偏差σｘ，σｙ、基準となる標準偏差をσｓとすると、補正式は、式（２），式（３）となる。 Step 105: Using the obtained standard deviation value, the normalized data is corrected so that the variations are approximately the same. Here, assuming that the normalized data is (Xn, Yn), the corrected data is (Xn ′, Yn ′), the standard deviations σx, σy of Xn, Yn, and the standard deviation as a reference is σs, the correction formula is Equations (2) and (3) are obtained.

Ｘｎ′＝(Ｘｎ−０.５)＊σｓ／σｘ＋０.５ …（２）
Ｙｎ′＝(Ｙｎ−０.５)＊σｓ／σｙ＋０.５ …（３） Xn ′ = (Xn−0.5) * σs / σx + 0.5 (2)
Yn ′ = (Yn−0.5) * σs / σy + 0.5 (3)

なお、基準となる標準偏差はσｘ，σｙのうち、小さい方とする。 The standard deviation serving as a reference is the smaller of σx and σy.

これらのStep１０１〜Step１０５の処理により、入力データの各項目のバラツキが均一化されるため、データを分類する場合に、必要以上にカテゴリー数が増加することを抑制することが可能となる。尚、各カテゴリー項目間のデータのばらつきを均一化するための前処理として、各項目の標準偏差に基づいて入力データを前処理すればよく、式（２）及び式（３）以外のやり方でも良い。例えば、標準偏差の小さい方の正規化したデータに対して、予め定めた所定値を掛けるなどしてバラツキを均一化することによって、各項目の標準偏差に基づいて入力データを前処理しても良い。 Due to the processing of Step 101 to Step 105, the variation of each item of the input data is made uniform. Therefore, when data is classified, it is possible to suppress an increase in the number of categories more than necessary. In addition, as a pre-process for uniformizing the variation in data between each category item, the input data may be pre-processed based on the standard deviation of each item, and methods other than Equation (2) and Equation (3) can be used. good. For example, the input data can be preprocessed based on the standard deviation of each item by equalizing the variation by multiplying the normalized data with the smaller standard deviation by a predetermined value. good.

次に、データ分類手段２０の処理フローについて説明する。 Next, the processing flow of the data classification unit 20 will be described.

Step２０１：内部データであるプロトタイプと領域決定パラメータを初期化する。プロトタイプの初期化処理とは、新規のプロトタイプを作成することである。本実施例では、プロトタイプ（Ｘｊ，Ｙｊ）（ｊ＝１〜５）を作成した。初期のプロトタイプはできるだけ分散している方が望ましいため、乱数を使って、ｘ，ｙのそれぞれが、０.１から０.９に入るｘ，ｙの組み合わせを作成した。プロトタイプは、例えば、（０.４５，０.２８），（０.７８，０.３１），（０.１５，０.１７），（０.５４，０.３９），（０.７８，０.１２）のようになる。 Step 201: Initialize prototypes and area determination parameters that are internal data. The prototype initialization process is to create a new prototype. In this example, prototype (Xj, Yj) (j = 1 to 5) was created. Since it is desirable that the initial prototypes are dispersed as much as possible, a combination of x and y in which x and y each fall from 0.1 to 0.9 is created using random numbers. Prototypes are, for example, (0.45, 0.28), (0.78, 0.31), (0.15, 0.17), (0.54, 0.39), (0.78, 0.12).

領域決定パラメータＲ_jは式（４）で作成した。 The region determination parameter R _j was created by the equation (4).

Ｒ_j＝Ｋ_Ｒ×√Ｎ …（４） R _j = K_R × √N (4)

Ｋ_Ｒは定数であり、Ｎは入力データの次元数である。本実施例では、Ｋ_Ｒ＝０.１，Ｎ＝２であるため、Ｒの初期値は０.１４１４となる。 K_R is a constant and N is the number of dimensions of the input data. In this embodiment, since K_R = 0.1 and N = 2, the initial value of R is 0.1414.

Step２０２：繰り返し回数１回目で、入力データ番号ｍ＝１のデータを読込む。 Step 202: Read the data of the input data number m = 1 at the first repetition.

Step２０３：読込んだ入力データ（ｘ，ｙ）とプロトタイプ（Ｘｊ，Ｙｊ）の距離ｒ_mjを式（５）で計算する。 Step 203: The distance r _mj between the read input data (x, y) and the prototype (Xj, Yj) is calculated by the equation (5).

ｒ_mj＝√((ｘ−Ｘｊ)２＋(ｙ−Ｙｊ)２) …（５） r _mj = √ ((x−Xj) 2+ (y−Yj) 2) (5)

距離ｒ_mjが、最小となるｊをプロトタイプの候補として選択する。 Select j as the prototype candidate for which the distance r _mj is minimum.

Step２０４：プロトタイプの妥当性を評価する。具体的には、ｒ_mjを、基準となる領域決定パラメータＲ_jと比較し、ｒ_mjがＲ_jよりも小さければ、選択したプロトタイプは妥当であると評価し、Step２０６の処理を実行する。プロトタイプが妥当でなければ、Step２０５の処理を実行する。 Step 204: Evaluate the validity of the prototype. Specifically, the r _mj, compared to reference a region determining parameter R _j, r _mj is less than R _j, evaluating the prototype selected is valid, performs the process of Step 206. If the prototype is not valid, Step 205 is executed.

Step２０５：ここでは新たにプロトタイプを１つ追加する。追加するプロトタイプは式（６），式（７）で決定する。 Step 205: One new prototype is added here. The prototype to be added is determined by equations (6) and (7).

Ｘ′ｊ＝ｋｐ×ｘｍ …（６）
Ｙ′ｊ＝ｋｐ×ｙｍ …（７） X′j = kp × xm (6)
Y′j = kp × ym (7)

ここでｋｐはパラメータであり１近傍の値をとる。ｋｐ＝１の場合は、プロトタイプは入力データｘｍ，ｙｍと等しくなる。 Here, kp is a parameter and takes a value in the vicinity of 1. When kp = 1, the prototype is equal to the input data xm, ym.

Step２０６：Step２０４でプロトタイプが妥当であると評価された場合の処理で、プロトタイプを入力データに合わせて補正する。具体的には、式（８），式（９）で補正する。 Step 206: In the process when the prototype is evaluated to be valid in Step 204, the prototype is corrected according to the input data. More specifically, the correction is performed using Expressions (8) and (9).

Ｘ′ｊ＝Ｘｊ＋ｋｗ×(ｘｍ−Ｘｊ) …（８）
Ｙ′ｊ＝Ｙｊ＋ｋｗ×(ｙｍ−Ｙｊ) …（９） X′j = Xj + kw × (xm−Xj) (8)
Y′j = Yj + kw × (ym−Yj) (9)

ここで、ｋｗはパラメータであり、０から１範囲の値をとり、ｋｗの値によりどの程度入力データに合わせて補正するのかが決定される。ｋｗ＝１の場合は、プロトタイプを入力データと完全に一致するまで補正することを意味する。また、ｋｗ＝０の場合はプロトタイプは補正されない。 Here, kw is a parameter, and takes a value in the range of 0 to 1, and the degree of correction according to the input data is determined by the value of kw. When kw = 1, this means that the prototype is corrected until it completely matches the input data. When kw = 0, the prototype is not corrected.

ｋｗの値はデータ分類処理の進行に合わせて段階的に変化させることとした。具体的には、処理の初期過程ではｋｗの値を大きくして、入力データに合わせてプロトタイプが変化するようにし、処理がある程度進むとｋｗの値を小さくし、プロトタイプが変化しないようにした。 The value of kw was changed stepwise as the data classification process progressed. Specifically, in the initial process, the value of kw is increased so that the prototype changes in accordance with the input data, and when the process proceeds to some extent, the value of kw is decreased so that the prototype does not change.

Step２０７：選択されたカテゴリーの領域に含まれるデータから、領域決定パラメータＲ_jが妥当であるかを判定する。ここでは選択されたカテゴリーのプロトタイプと、このカテゴリーに含まれる入力データの距離ｒ_mjの最大値ｒ_{mj_max}を求め、Ｒ_jとの比率Ｆ_ｒを式（１０）で計算する。 Step 207: It is determined whether or not the region determination parameter R _j is valid from the data included in the region of the selected category. Here, the maximum value r _{mj_max} of the distance r _mj between the prototype of the selected category and the input data included in this category is obtained, and the ratio F_r to R _j is calculated by the equation (10).

Ｆ_ｒ＝ｒ_{mj_max}／Ｒ_j …（１０） F_r = r _{mj_max} / R _j (10)

予め設定した閾値Ｆ_ｒ_ｔｈと式（１０）で求めたＦ_ｒを比較し、式（１１）が成立すれば領域決定パラメータＲ_jは妥当と判断し、Step２０９に進む。式（１１）が成立しなければ、領域決定パラメータＲ_jは妥当でないと判断し、Step２０８に進む。 The threshold value F_r_th set in advance is compared with F_r obtained by Expression (10). If Expression (11) is satisfied, it is determined that the region determination parameter R _j is valid, and the process proceeds to Step 209. If Expression (11) does not hold, it is determined that the region determination parameter R _j is not valid, and the process proceeds to Step 208.

Ｆ_ｒ≧Ｆ_ｒ_ｔｈ …（１１） F_r ≧ F_r_th (11)

尚、ここでは距離に基づいて妥当性の評価を行っているが、選択されたカテゴリーのプロトタイプとこのカテゴリーに含まれる入力データの関係を予め定めるなどにより妥当性を評価する別の評価方法でも良い。 Although the validity is evaluated here based on the distance, another evaluation method for evaluating the validity by, for example, predetermining the relationship between the prototype of the selected category and the input data included in this category may be used. .

Step２０８：領域決定パラメータＲ_jを式（１２）にしたがって変更する。 Step 208: Change the region determination parameter R _j according to the equation (12).

Ｒ_j＝ｒ_{mj_max}／Ｆ_ｒ_ｔｈ …（１２） R _j = r _{mj_max} / F_r_th (12)

Step２０９：入力データがＭ組全て処理されたかどうかを判定し、処理されていない入力データがあれば、次の入力データをStep２０３で処理する。全ての入力データが処理されていれば、Step２１０に進む。 Step 209: It is determined whether or not all M sets of input data have been processed. If there is unprocessed input data, the next input data is processed in Step 203. If all input data has been processed, the process proceeds to Step 210.

Step２１０：繰り返し回数が、設定したＫ回実施されたか、あるいはデータ分類処理が収束した場合に処理を終了させる。データ分類の収束判定は、入力データが分類されたカテゴリーが前回（ｋ−１回目）と今回（ｋ回目）が同じか否かで判定する。分類されたカテゴリーが少なくとも１つ以上変化した場合、処理は収束していないと判定し、分類されたカテゴリーが全て同じであれば、処理が収束したと判定する。 Step 210: The process is terminated when the number of repetitions has been set K times or the data classification process has converged. The convergence determination of the data classification is performed by determining whether the category into which the input data is classified is the same as the previous time (k−1) and the current time (k time). If at least one of the classified categories has changed, it is determined that the process has not converged. If all of the classified categories are the same, it is determined that the process has converged.

実施例１では、ｋ＝１から図６に示した処理を実施したが、ｋがある値になるまでは、Step２０７〜Step２０８の処理を実施せず、ｋがある値を超えてからStep２０７〜Step２０８の処理を実施する処理フローでも良い。 In the first embodiment, the processing illustrated in FIG. 6 is performed from k = 1. However, the processing of Steps 207 to 208 is not performed until k reaches a certain value, and Steps 207 to 208 are performed after k exceeds the certain value. A processing flow for performing the above process may be used.

図７は実施例１による分類結果を示す概念図である。入力データは図１と同じであるが、データのバラツキが大きかったデータＢは、データ前処理装置１０で前処理することにより、バラツキがデータＡと同程度になっている。この場合、カテゴリーはＡ１′〜Ａ３′の３つに分かれただけであり、少ないカテゴリー数でデータを分類することができた。 FIG. 7 is a conceptual diagram illustrating a classification result according to the first embodiment. The input data is the same as in FIG. 1, but the data B, which has a large variation in data, is pre-processed by the data preprocessing device 10, so that the variation is almost the same as the data A. In this case, the categories were only divided into three categories A1 ′ to A3 ′, and the data could be classified with a small number of categories.

したがって、本実施例による分類方法を異常検知や異常診断に適用した場合、少ないカテゴリー数でデータを分類できるため、計算コストを小さくすることができる。 Therefore, when the classification method according to the present embodiment is applied to abnormality detection and abnormality diagnosis, data can be classified with a small number of categories, so that the calculation cost can be reduced.

なお、データ分類手段２０は、本実施例に記載した分類方法に限定されるものではなく、適応共鳴理論などのデータ分類手段を利用しても良い。 The data classification means 20 is not limited to the classification method described in the present embodiment, and data classification means such as adaptive resonance theory may be used.

上述したように、コンピュータが、サンプリングした入力データの各項目の標準偏差を求め、求めた各項目の標準偏差に基づいて入力データを前処理し、前処理した入力データに最も近いカテゴリーのプロトタイプを選択し、前記プロトタイプと前記入力データの関係から該選択されたプロトタイプが妥当であるかを評価し、複数項目からなる複数個の入力データを複数個のカテゴリーに分類するデータ分類方法により、従来技術と同等のデータ分類性能を維持しながら、より少ないカテゴリーでデータを分類し、計算コストを小さくすることができる。 As described above, the computer obtains the standard deviation of each item of the sampled input data, preprocesses the input data based on the obtained standard deviation of each item, and selects the prototype of the category closest to the preprocessed input data. A conventional method for selecting and evaluating whether or not the selected prototype is valid from the relationship between the prototype and the input data, and classifying a plurality of input data composed of a plurality of items into a plurality of categories. While maintaining the same data classification performance, it is possible to classify data with fewer categories and reduce the calculation cost.

また、上述した様に、評価によって該プロトタイプが妥当と判定された場合に、領域決定パラメータとプロトタイプの少なくとも一方を補正し、前記評価によって該プロトタイプが妥当と判定されなかった場合に、新しいプロトタイプを追加するデータ分類方法により、データの分類特性をより適切にすることができる。 Further, as described above, when the prototype is determined to be valid by the evaluation, at least one of the region determination parameter and the prototype is corrected, and when the prototype is not determined to be valid by the evaluation, a new prototype is selected. The classification characteristics of the data can be made more appropriate by the data classification method to be added.

次に本発明の第２の実施例について説明する。図４に第２の実施例の概要を示す。 Next, a second embodiment of the present invention will be described. FIG. 4 shows an outline of the second embodiment.

本実施例は、入力データと対応するカテゴリーを学習させるデータ分類方法である。ここで、第１の実施例での分類結果であるカテゴリーと予め学習させるカテゴリーは、必ずしも１対１に対応するものではない。そこで、第１の実施例の分類結果であるカテゴリーをカテゴリーＡ、予め学習させるカテゴリーをカテゴリーＢと定義し、第２の実施例について説明する。なお、第２の実施例の構成要素の多くは第１の実施例と同じであるため、ここでは、第１の実施例との相違点のみを説明する。 This embodiment is a data classification method for learning a category corresponding to input data. Here, the category that is the classification result in the first embodiment and the category that is learned in advance do not necessarily correspond one-to-one. Therefore, the category which is the classification result of the first embodiment is defined as category A, and the category to be learned in advance is defined as category B, and the second embodiment will be described. Since many of the components of the second embodiment are the same as those of the first embodiment, only the differences from the first embodiment will be described here.

第２の実施例のデータ分類手段４０で新規に追加された構成要素は、カテゴリーＢ読込み手段４１，カテゴリー変換手段４２及びカテゴリー対応マトリックス４３の３つである。また、第１の実施例の内部データ補正手段２７は、内部データ及びマトリックス補正手段２８に置き換わっている。 There are three components newly added by the data classification means 40 of the second embodiment: a category B reading means 41, a category conversion means 42, and a category correspondence matrix 43. Further, the internal data correction means 27 of the first embodiment is replaced with internal data and matrix correction means 28.

カテゴリーＢ読込み手段４１は、予め学習させておくデータ（教師データ）であるカテゴリーＢを読込む手段である。データ読込み手段２１と同様に処理フロー制御手段３０により、データを読込むタイミングを制御されている。 The category B reading means 41 is means for reading category B, which is data (teacher data) to be learned in advance. Similar to the data reading means 21, the processing flow control means 30 controls the timing for reading data.

カテゴリー変換手段４２はカテゴリーＡをカテゴリーＢに変換する手段である。ここで、カテゴリーＡは、第１の実施例でデータを分類したカテゴリーと同じであり、いくつかの類似データを１つのグループにしたものである。カテゴリーＡにはカテゴリー番号が付与されるが、その番号自体には意味がない。一方、カテゴリーＢは、人間が意味づけしたカテゴリーである。本データ分類方法を異常診断に利用した場合、カテゴリーＢは、例えば、「正常」「異常１」「異常２」などの事象が割り当てられる。 The category conversion means 42 is means for converting category A into category B. Here, the category A is the same as the category in which the data is classified in the first embodiment, and some similar data are grouped into one group. Category A is assigned a category number, but the number itself has no meaning. On the other hand, category B is a category that humans have given meaning. When this data classification method is used for abnormality diagnosis, for example, events such as “normal”, “abnormal 1”, and “abnormal 2” are assigned to category B.

カテゴリーＡをカテゴリーＢに変換するには、カテゴリーＡとカテゴリーＢの対応関係を表した、カテゴリー対応マトリックスを利用する。 In order to convert category A into category B, a category correspondence matrix that represents the correspondence between category A and category B is used.

カテゴリー対応マトリックス格納部４３にはカテゴリー対応マトリックスが格納されている。カテゴリー対応マトリックスの例を図９に示す。図９の例では、カテゴリーＢは「正常」「異常１」「異常２」の３種類である。カテゴリーＡはカテゴリー１〜１０の１０種類である。複数のカテゴリーＡが１つのカテゴリーＢに対応している。また、各カテゴリーは必ず１つのカテゴリーＢにのみ対応しており、２つ以上のカテゴリーＢに対応していることはない。このカテゴリー対応マトリックスを利用することで、カテゴリーＡを与えるとカテゴリーＢに変換することが可能となる。 The category correspondence matrix storage unit 43 stores a category correspondence matrix. An example of the category correspondence matrix is shown in FIG. In the example of FIG. 9, there are three types of categories B: “normal”, “abnormal 1”, and “abnormal 2”. Category A has 10 types of categories 1-10. A plurality of categories A correspond to one category B. Each category always corresponds to only one category B, and does not correspond to two or more categories B. By using this category correspondence matrix, if category A is given, it can be converted to category B.

内部データ及びマトリックス補正手段２８は、内部データ格納部２３に格納されている内部データとカテゴリー対応マトリックス格納部４３に格納されているカテゴリー対応マトリックスを補正する。 The internal data and matrix correction means 28 corrects the internal data stored in the internal data storage unit 23 and the category correspondence matrix stored in the category correspondence matrix storage unit 43.

補正する内部データは、プロトタイプと領域決定パラメータであり、第１の実施例の内部データ補正手段２７と同じである。ただし、補正方法は内部データ補正手段２７とは異なる。内部データ及びマトリックス補正手段２８では、図９で示したように、カテゴリーＡに対応するとカテゴリーＢが２つ以上にならないように内部データを補正する。具体的な補正アルゴリズムについては、後述する。 The internal data to be corrected is a prototype and a region determination parameter, and is the same as the internal data correction means 27 of the first embodiment. However, the correction method is different from the internal data correction means 27. As shown in FIG. 9, the internal data and matrix correction means 28 corrects the internal data so that the category B does not become two or more when it corresponds to the category A. A specific correction algorithm will be described later.

カテゴリー対応マトリックスの補正は、カテゴリーＡとカテゴリーＢの関係が常に最新の入力データの分類結果と一致するように補正する。各入力データに対応するカテゴリーＢは教師データとして与えられるため、変更はない。しかし、プロトタイプまたは領域決定パラメータが変化した場合、同じ入力データであっても、カテゴリーＡは変化する可能性がある。したがって、カテゴリーＡが変更となった場合に、逐次、カテゴリーＢとの対応関係を補正する。 The category correspondence matrix is corrected so that the relationship between category A and category B always matches the classification result of the latest input data. Since category B corresponding to each input data is given as teacher data, there is no change. However, if the prototype or region determination parameter changes, category A may change even with the same input data. Therefore, when category A is changed, the correspondence with category B is sequentially corrected.

次に、実施例２の動作について説明する。実施例２では、学習モードと判定モードの２種類の動作モードがある。学習モードでは、入力データとカテゴリーＢ（教師データ）を与え、入力データがカテゴリーＢに分類されるように学習する動作モードである。判定モードは入力データのみを与え、入力データが分類されるカテゴリーＢを判定する動作モードである。なお、入力データは、実施例１と同様に２次元データ（Ｘ，Ｙ）である。 Next, the operation of the second embodiment will be described. In the second embodiment, there are two types of operation modes, a learning mode and a determination mode. The learning mode is an operation mode in which input data and category B (teacher data) are given and learning is performed so that the input data is classified into category B. The determination mode is an operation mode in which only input data is given and a category B into which the input data is classified is determined. The input data is two-dimensional data (X, Y) as in the first embodiment.

入力データ前処理手段の処理フローは実施例と全く同じであるため、説明は割愛する。 Since the processing flow of the input data preprocessing means is exactly the same as in the embodiment, the description is omitted.

図１０にデータ分類手段４０の学習モードの処理フローを示す。図中の破線で囲んだ部分が、実施例１の処理フローと異なる部分である。異なる部分を中心に、処理フローを説明する。 FIG. 10 shows a processing flow of the learning mode of the data classification means 40. A portion surrounded by a broken line in the drawing is a portion different from the processing flow of the first embodiment. The processing flow will be described focusing on the different parts.

Step２０１〜２０５：入力データを正規化し、プロトタイプなどの内部データを初期化し、入力データが分類されるカテゴリーＡのプロトタイプを決定する。 Steps 201 to 205: Normalize input data, initialize internal data such as prototypes, and determine a category A prototype into which the input data is classified.

Step２０６−１：入力データが分類されたカテゴリーＡと、教師データとして与えられるカテゴリーＢの関係をカテゴリー対応マトリックスに追加する。ただし、追加するカテゴリーＡとカテゴリーＢの組み合わせにおいて、選択されたカテゴリーＡが既に別のカテゴリーＢとの対応関係が記録されている場合には、追加が拒否される。また、カテゴリーＡとカテゴリーＢの対応関係が既にマトリックスに記録されている場合は、修正しない。 Step 206-1: The relationship between the category A into which the input data is classified and the category B given as teacher data is added to the category correspondence matrix. However, in the combination of category A and category B to be added, if the selected category A has already recorded a corresponding relationship with another category B, the addition is rejected. If the correspondence between category A and category B is already recorded in the matrix, no correction is made.

Step２０６−２：プロトタイプの補正は、Step２０６−１によって追加拒否されたか否かで処理が異なる。Step２０６−１で追加拒否されなかった場合には、式（８），式（９）でプロトタイプを修正する。追加拒否された場合には、入力データが分類されたカテゴリーＡがふさわしくないことを意味する。したがって、対象の入力データが分類されたカテゴリーＡに分類されないようにプロトタイプを補正する必要がある。具体的には式（１３），式（１４）によって、プロトタイプを補正する。 Step 206-2: The process of the correction of the prototype differs depending on whether or not addition is rejected in Step 206-1. If the addition is not rejected in Step 206-1, the prototype is corrected by Expression (8) and Expression (9). When the addition is rejected, it means that the category A into which the input data is classified is not suitable. Therefore, it is necessary to correct the prototype so that the target input data is not classified into the classified category A. Specifically, the prototype is corrected by the equations (13) and (14).

Ｘ′ｊ＝Ｘｊ−ｋｗ×(ｘｍ−Ｘｊ) …（１３）
Ｙ′ｊ＝Ｙｊ−ｋｗ×(ｙｍ−Ｙｊ) …（１４） X′j = Xj−kw × (xm−Xj) (13)
Y′j = Yj−kw × (ym−Yj) (14)

式（１３），式（１４）で、ｋｗ＞０の場合、プロトタイプは入力データから離れる方向に補正される。ｋｗの値が大きいほど、その度合いは大きくなる。なお、ｋｗ＝０の場合、プロトタイプは変更されない。 In equations (13) and (14), if kw> 0, the prototype is corrected away from the input data. The greater the value of kw, the greater the degree. When kw = 0, the prototype is not changed.

Step２０７：Ｒ_jの妥当性判断は、式（１１）の成立の有無及び追加拒否の有無によって異なる。式（１１）が成立し、かつ追加拒否がない場合は、妥当であると判定し、Step２０９に進む。式（１１）を満足しないか追加拒否があった場合は妥当でないと判定し、Step２０８に進む。 Step 207: The validity determination of R _j differs depending on whether or not the expression (11) is established and whether or not additional refusal is present. If Expression (11) is satisfied and there is no additional refusal, it is determined to be valid and the process proceeds to Step 209. If the expression (11) is not satisfied or there is an additional refusal, it is determined that it is not valid, and the process proceeds to Step 208.

Step２０８：追加拒否があった場合は、入力データが分類されたカテゴリーＡがふさわしくないことを意味する。したがって、対象の入力データが分類されたカテゴリーＡに分類されないように領域決定パラメータを補正する必要がある。具体的には式（１５）で補正する。 Step 208: If there is an additional refusal, it means that the category A into which the input data is classified is not suitable. Therefore, it is necessary to correct the region determination parameter so that the target input data is not classified into the classified category A. Specifically, the correction is made by the equation (15).

Ｒ_j＝ｒ_mj …（１５） R _j = r _mj (15)

また、式（１１）を満足しない場合は、式（１２）でＲ_jを補正する。 Further, when Expression (11) is not satisfied, R _j is corrected by Expression (12).

Step２０９〜２１０：処理を繰り返すか否かを判定し、繰り返しが必要なければ、Step２１１に進む。Step２１０までの処理が全て終了すると、Ｍ個全ての入力データが、あるカテゴリーＡに分類され、各カテゴリーＡとカテゴリーＢの対応関係がカテゴリー対応マトリックスに記録される。 Steps 209 to 210: It is determined whether or not the process is repeated. If it is not necessary, the process proceeds to Step 211. When all processing up to Step 210 is completed, all M input data are classified into a certain category A, and the correspondence between each category A and category B is recorded in the category correspondence matrix.

Step２１１：カテゴリー対応マトリックスを利用してカテゴリーＡをカテゴリーＢに変換する。変換するデータはＭ個の入力データである。マトリックスとして図９の例を利用した場合、ある入力データのカテゴリーＡがカテゴリー３とすると、対応するカテゴリーＢは異常１であり、カテゴリー３は異常１に変換される。 Step 211: Convert category A to category B using the category correspondence matrix. The data to be converted is M pieces of input data. When the example of FIG. 9 is used as a matrix, if category A of certain input data is category 3, the corresponding category B is abnormal 1, and category 3 is converted to abnormal 1.

以上の処理により、Ｍ個の入力データを、教師データとして与えたカテゴリーＢに分類することができる。 Through the above processing, M pieces of input data can be classified into category B given as teacher data.

なお、Step２０６−１〜Step２０８までの処理の中で、カテゴリー対応マトリックスに関連する処理を除くと実施例１のStep２０６〜Step２０８の処理と同じになる。実施例２では、ｋ＝１から図に示した処理を実施したが、ｋがある値になるまでは、カテゴリー対応マトリックスに関連する処理を実施せず、実施例１のStep２０６〜Step２０８の処理のみを実施し、ｋがある値を超えてからカテゴリー対応マトリックスに関連する処理を追加する処理フローとしてもよい。 Note that the processing from Step 206-1 to Step 208 is the same as the processing from Step 206 to Step 208 in the first embodiment except for the processing related to the category correspondence matrix. In the second embodiment, the processing shown in the figure is performed from k = 1, but the processing related to the category correspondence matrix is not performed until k reaches a certain value, and only the processing from Step 206 to Step 208 of the first embodiment is performed. It is good also as a processing flow which implements and adds the process relevant to a category correspondence matrix, after k exceeds a certain value.

次に判定モードについて図１１を用いて説明する。１つの入力データを判定する場合について説明する。 Next, the determination mode will be described with reference to FIG. A case where one input data is determined will be described.

Step２２１：内部データを学習モードが終了した状態にする。 Step 221: The internal data is brought into a state where the learning mode is finished.

Step２２２：入力データを読込む。 Step 222: Read input data.

Step２２３：学習モードのStep２０３と同じアルゴリズムでプロトタイプを選択する。 Step 223: A prototype is selected by the same algorithm as Step 203 in the learning mode.

Step２２４：学習モードのStep２０４と同じアルゴリズムでプロトタイプの妥当性を判定する。妥当であれば、入力データのカテゴリーＡは選択されたプロトタイプを持つカテゴリーＡであると判定しStep２２５に進む。妥当でなければ、入力データのカテゴリーＡは新規カテゴリーと判定しStep２２５に進む。 Step 224: The validity of the prototype is determined by the same algorithm as Step 204 in the learning mode. If it is valid, it is determined that the category A of the input data is the category A having the selected prototype, and the process proceeds to Step 225. If not valid, the category A of the input data is determined as a new category, and the process proceeds to Step 225.

Step２２５：学習モードのStep２１１と同様のアルゴリズムで、カテゴリーＡをカテゴリーＢに変換する。ただし、入力データのカテゴリーＡが新規カテゴリーであった場合には、カテゴリー対応マトリックスに対応関係が記録されていないため、カテゴリーＢは不定となる。 Step 225: The category A is converted to the category B by the same algorithm as in the learning mode Step 211. However, when the category A of the input data is a new category, the correspondence relationship is not recorded in the category correspondence matrix, so the category B is undefined.

最後に、実施例２の効果について述べる。実施例２においても、データ前処理手段１０でデータの各項目のデータのバラツキが均一になるため、従来技術に比べて少ないカテゴリー数でカテゴリーＡを分類することができ、計算コストを小さくできる。 Finally, effects of the second embodiment will be described. Also in the second embodiment, the data pre-processing means 10 makes the data variation of each item uniform, so that the category A can be classified with a smaller number of categories compared to the prior art, and the calculation cost can be reduced.

次に本発明の第３の実施例について説明する。本発明の第３の実施例は、本発明の第２の実施例であるデータ分類装置を利用した発電設備の遠隔診断システムである。第２の実施例で説明するが、第１の実施例のデータ分類装置でも実施することができる。 Next, a third embodiment of the present invention will be described. The third embodiment of the present invention is a remote diagnosis system for a power generation facility using the data classification device according to the second embodiment of the present invention. As will be described in the second embodiment, the data classification apparatus of the first embodiment can also be implemented.

本実施例の構成を図１２に示す。本実施例は、発電設備８ａ〜８ｎと監視センタからなり、発電設備８ａ〜８ｎと監視センタはインターネット７により接続されている。 The configuration of this embodiment is shown in FIG. In this embodiment, the power generation facilities 8 a to 8 n and the monitoring center are connected, and the power generation facilities 8 a to 8 n and the monitoring center are connected by the Internet 7.

発電設備８ａ〜８ｎはコンバインドサイクル発電システム８１，制御装置８２及びデータ送信装置８３からなる。コンバインドサイクル発電システム８１は圧縮機，燃焼器，ガスタービン，排熱回収ボイラ，蒸気タービン，復水器および発電機からなる。 The power generation facilities 8 a to 8 n include a combined cycle power generation system 81, a control device 82, and a data transmission device 83. The combined cycle power generation system 81 includes a compressor, a combustor, a gas turbine, an exhaust heat recovery boiler, a steam turbine, a condenser, and a generator.

まず、コンバインドサイクル発電システム８１について説明する。圧縮機による圧縮空気は燃焼器に送られ、燃料と混合されて燃焼器で燃焼する。燃焼により発生した高圧ガスによりガスタービンが回転し発電する。ガスタービンの排ガスは排熱回収ボイラに入り、タービンの排ガスで蒸気を発生する。発生した蒸気で蒸気タービンを回転し発電する。 First, the combined cycle power generation system 81 will be described. Compressed air from the compressor is sent to a combustor, mixed with fuel, and burned in the combustor. The gas turbine rotates by the high-pressure gas generated by the combustion to generate electricity. The gas turbine exhaust gas enters the exhaust heat recovery boiler and generates steam from the turbine exhaust gas. The generated steam rotates the steam turbine to generate electricity.

制御装置８２では電力需要に応じてコンバインドサイクル発電システム８１の出力を制御する。また、コンバインドサイクル発電システム８１に設置されたセンサーを用いて運転データを計測する。本実施例では、運転データとして吸気温度，燃料投入量，ガスタービン排ガス温度，ガスタービン回転数，排熱回収ボイラ蒸気発生量，蒸気タービン出力等の状態量を１秒間隔で計測した。データ送信装置では、制御装置８２で計測した運転データをインターネット７経由で監視センターに送信する。 The control device 82 controls the output of the combined cycle power generation system 81 according to the power demand. In addition, the operation data is measured using a sensor installed in the combined cycle power generation system 81. In the present embodiment, intake air temperature, fuel input amount, gas turbine exhaust gas temperature, gas turbine rotation speed, exhaust heat recovery boiler steam generation amount, steam turbine output, and other state quantities were measured at 1 second intervals as operation data. In the data transmission device, the operation data measured by the control device 82 is transmitted to the monitoring center via the Internet 7.

監視センターでは、インターネット７経由で受信した発電設備８ａ〜８ｎの運転データを受信し、発電設備８ａ〜８ｎの状態を監視する。監視センターは、データ受信装置２，運転データベース３，保守データベース４，データ制御装置５，データ分類装置１ａ〜１ｎ及び操作端末６からなる。これらの構成要素について以下に示す。 The monitoring center receives the operation data of the power generation facilities 8a to 8n received via the Internet 7, and monitors the state of the power generation facilities 8a to 8n. The monitoring center includes a data receiving device 2, an operation database 3, a maintenance database 4, a data control device 5, data classification devices 1 a to 1 n and an operation terminal 6. These components are shown below.

データ受信装置２は受信したデータを各データベース及びデータ制御装置５に送る。発電設備８ａ〜８ｎから送信された運転データはデータ制御装置５に送ると同時に運転データベース３に送る。また、保守員が携帯端末から入力した保守情報は保守データベース４に送る。 The data receiving device 2 sends the received data to each database and the data control device 5. The operation data transmitted from the power generation facilities 8 a to 8 n is sent to the operation database 3 at the same time as being sent to the data control device 5. The maintenance information input from the portable terminal by the maintenance staff is sent to the maintenance database 4.

運転データベース３はデータ受信装置２から送られた運転データを発電設備８ａ〜８ｎ毎に格納する。運転データは吸気温度，燃料投入量，ガスタービン排ガス温度等の状態量であり、１秒間隔で記録されている。また、これらの計測データの他に運転モードが記録されている。運転モードは「起動運転」「定格運転」などに対応する数値で記録されている。さらに、発電設備から送信されるデータの他に、データ分類装置１ａ〜１ｎによる分類結果も格納されている。分類結果は２種類ある。ひとつはデータ分類装置内部のカテゴリーＡの番号が記録されている。もうひとつは「正常」「異常Ａ」などのような機器の状態（カテゴリーＢ）であり、同じく状態に対応する番号で記録されている。 The operation database 3 stores the operation data sent from the data receiving device 2 for each of the power generation facilities 8a to 8n. The operation data includes state quantities such as intake air temperature, fuel input amount, gas turbine exhaust gas temperature, and the like, and is recorded at intervals of 1 second. In addition to these measurement data, an operation mode is recorded. The operation mode is recorded with numerical values corresponding to “start-up operation”, “rated operation”, and the like. Further, in addition to data transmitted from the power generation facility, classification results by the data classification devices 1a to 1n are also stored. There are two types of classification results. One is a category A number in the data classification device. The other is the state (category B) of the device such as “normal” and “abnormal A”, which are also recorded with numbers corresponding to the state.

保守データベース４はデータ受信装置２から送られた保守情報を格納する。保守データベース４には保守員が行った保守情報が記録されている。蓄積されている保守情報は、例えば図１３のように記録されている。図１３の例では実際の保守作業は次のようになる。９：５０にモニタリングによりアラームが発生し、異常が確認された。１０：３０に現地に保守員が到着し保守作業を開始した。１０：４５に部品を交換したのち、ガスタービン発電機の試運転を実施した。１１：３０に異常が復旧したことを確認し、保守作業を終了した。 The maintenance database 4 stores maintenance information sent from the data receiving device 2. Maintenance information performed by maintenance personnel is recorded in the maintenance database 4. The accumulated maintenance information is recorded as shown in FIG. 13, for example. In the example of FIG. 13, the actual maintenance work is as follows. At 9:50, an alarm was generated by monitoring and an abnormality was confirmed. Maintenance personnel arrived at the site at 10:30 and maintenance work started. After exchanging the parts at 10:45, the gas turbine generator was commissioned. After confirming that the abnormality was recovered at 11:30, the maintenance work was completed.

データ制御装置５では、運転データ及び保守データをサイト，運転モードで分類し、データ分類装置１ａ〜１ｎに入力可能な形式に加工する。このデータの分類・加工方法の詳細については、後ほど説明する。また、データ分類装置には推定モード、学習モード及び準備モードがある。データ制御装置５ではモードに合わせた入力データの制御を行う。 In the data control device 5, the operation data and the maintenance data are classified by site and operation mode, and processed into a format that can be input to the data classification devices 1a to 1n. Details of this data classification and processing method will be described later. The data classification device has an estimation mode, a learning mode, and a preparation mode. The data control device 5 controls input data according to the mode.

操作端末６では、データ制御装置５のモード切替え指令を入力する。また、運転データベース３や保守データベース４に記録された情報、あるいはデータ分類装置１ａ〜１ｎにより分類された結果をモニタ画面に出力することができる。 The operation terminal 6 inputs a mode switching command for the data control device 5. In addition, the information recorded in the operation database 3 and the maintenance database 4 or the results classified by the data classification devices 1a to 1n can be output to the monitor screen.

データ分類装置１ａ〜１ｎには、推定モード，学習モード及び準備モードがある。推定モードではデータ受信装置２０から送られた運転データに基づき発電設備８ａ〜８ｎの状態を診断する。診断結果は運転データベース３に送られる。学習モードでは運転データベース３及び保守データベースに格納された情報を用いて運転データと発電設備の状態の関係を学習する。また、準備モードは学習も推定も行わない。このモードは新規サイトが設置された場合などに学習に必要なデータがない場合のモードである。これらのモードは診断装置１ａ〜１ｎ毎に個別に設定可能である。モードの設定は操作端末６にて行う。 The data classification devices 1a to 1n have an estimation mode, a learning mode, and a preparation mode. In the estimation mode, the states of the power generation facilities 8a to 8n are diagnosed based on the operation data sent from the data receiving device 20. The diagnosis result is sent to the operation database 3. In the learning mode, the relation between the operation data and the state of the power generation facility is learned using information stored in the operation database 3 and the maintenance database. In addition, the preparation mode does not perform learning or estimation. This mode is a mode when there is no data necessary for learning, such as when a new site is established. These modes can be set individually for each of the diagnostic apparatuses 1a to 1n. The mode is set on the operation terminal 6.

次に、監視センターの動作について説明する。データ分類装置１ａ〜１ｎが実際に動作するモードは学習モード及び推定モードの２つであるため、学習モード，推定モードの順で説明する。なお、前述したように、学習モード，推定モードはサイト毎に個別に設定可能であるが、ここでは、複数のサイトが全て学習モードの場合及び全て推定モードの場合を例に説明する。 Next, the operation of the monitoring center will be described. Since there are two modes in which the data classification devices 1a to 1n actually operate, the learning mode and the estimation mode will be described in the order of the learning mode and the estimation mode. As described above, the learning mode and the estimation mode can be individually set for each site, but here, a case where a plurality of sites are all in the learning mode and a case where all the sites are in the estimation mode will be described as an example.

学習モードでは運転データベース３及び保守データベース４に格納された情報を用いて運転データと発電設備の状態の関係を学習する。ただし、両データベースに格納された情報をそのままデータ分類装置１ａ〜１ｎに入力することはできない。そこで、データ制御装置５において、運転データ及び保守情報をデータ分類装置に入力するデータ形式に変換する。そのアルゴリズムを図１４に示す。なお、以下の説明では、計測項目は前述した項目の中から３項目（DATA１〜DATA３）を選択した例で説明する。 In the learning mode, information stored in the operation database 3 and the maintenance database 4 is used to learn the relationship between the operation data and the state of the power generation equipment. However, the information stored in both databases cannot be directly input to the data classification devices 1a to 1n. Therefore, the data control device 5 converts the operation data and maintenance information into a data format that is input to the data classification device. The algorithm is shown in FIG. In the following description, measurement items will be described using an example in which three items (DATA1 to DATA3) are selected from the items described above.

Step３０１：全運転データをサイトと運転モードでグループ分けし、各グループのデータのサンプリング間隔を決定する。グループ分けの例を次に示す。例えば、「サイト１」〜「サイト３」の３つのサイトで運転モードが「起動運転」「定格運転」「停止運転」の３モードであったとすると、運転データは合計９つのグループにグループ分けられる。データのサンプリング間隔は運転モードによって異なる。本実施例では運転データの変化が少ない「定格運転」では１分間隔とし、それ以外のモードでは１秒間隔とした。 Step 301: All operation data is grouped by site and operation mode, and the sampling interval of data of each group is determined. An example of grouping is shown below. For example, if the operation mode is “start operation”, “rated operation”, and “stop operation” at the three sites “site 1” to “site 3”, the operation data is grouped into nine groups in total. . The data sampling interval varies depending on the operation mode. In this example, the “rated operation” with little change in operation data was set at 1 minute intervals, and in other modes, the interval was set at 1 second.

Step３０２：Step３０１で分類したグループのうち１つのグループを選択する。 Step 302: Select one group from the groups classified in Step 301.

Step３０３：Step３０２で選択したグループに該当するデータを用いてDATA１〜DATA３のデータについて、図５に示したデータ前処理装置２０処理のStep１０１〜Step１０５の処理を実施する。 Step 303: Steps 101 to 105 of the data preprocessing device 20 shown in FIG. 5 are performed on the data DATA1 to DATA3 using the data corresponding to the group selected in Step 302.

まず、DATA１〜DATA５の計測項目毎に最大値及び最小値を計算する。計算した最大値及び最小値はサイト毎にデータ制御装置５内に記憶される。 First, the maximum value and the minimum value are calculated for each measurement item of DATA1 to DATA5. The calculated maximum and minimum values are stored in the data control device 5 for each site.

次に、計算した最大値及び最小値を用いてデータを正規化する。その方法についてDATA１を例に説明する。DATA１のデータ数がＭ個でｍ番目の計測値をdata１（ｍ）とする。また、Ｍ個のデータの中の最大値及び最小値をそれぞれMax_１，Min_１すると、正規化したデータNdata_１（ｍ）は式（１６）で計算する。 Next, the data is normalized using the calculated maximum and minimum values. The method will be described taking DATA1 as an example. The data number of DATA1 is M and the mth measurement value is data1 (m). Further, if the maximum value and the minimum value in the M data are respectively Max_1 and Min_1, normalized data Ndata_1 (m) is calculated by Expression (16).

Ndata_１(ｍ)＝α＋(１−α)×(data_１(ｍ)−Min_１)／(Max_１−Min_１)
…（１６） Ndata_1 (m) = α + (1-α) × (data_1 (m) −Min_1) / (Max_1−Min_1)
... (16)

ここで、α（０≦α＜０.５）の定数であり、式（１）によりデータは［α，１−α］の範囲に正規化される。本実施例ではα＝０.１とした。 Here, it is a constant of α (0 ≦ α <0.5), and the data is normalized to the range of [α, 1−α] by the equation (1). In this embodiment, α = 0.1.

次に、図５のStep１０３，Step１０４の処理を実施する。ここでは、運転データに定格運転のデータを２００点抽出し、DATA１〜DATA３の標準偏差σｉ（ｉ＝１〜３）を計算した。最後にDATA１〜DATA３の正規化データNdata_ｉ（ｍ）（ｉ＝１〜３）を式（１７）で補正する。 Next, the processing of Step 103 and Step 104 in FIG. 5 is performed. Here, 200 points of rated operation data were extracted from the operation data, and the standard deviation σi (i = 1 to 3) of DATA1 to DATA3 was calculated. Finally, the normalized data Ndata_i (m) (i = 1 to 3) of DATA1 to DATA3 is corrected by Expression (17).

Ndata_ｉ′(ｍ)＝(Ndata_ｉ(ｍ)−０.５)＊σｓ／σｉ＋０.５ …（１７） Ndata_i ′ (m) = (Ndata_i (m) −0.5) * σs / σi + 0.5 (17)

ここで、σｓはσｉ（ｉ＝１〜３）の中の最小の値である。 Here, σs is the minimum value in σi (i = 1 to 3).

Step３０４：機器状態データを変換してカテゴリーＢのデータを作成する。保守データベース４には、機器の状態が記録されている。例えば、状態として「正常」「異常Ａ」「異常Ｂ」の３種類があったとすると、それぞれを「１」「２」「３」のような番号データに変換する。ただし、保守データベース４には決まった時間間隔で設備の状態が記録されているわけではない。したがって、入力データａの時間間隔に合わせてデータを加工する。このように時間間隔を修正することで、同一時刻の運転データとそのときの設備の状態が対応したデータが得られる。 Step 304: Convert the device status data to create category B data. The maintenance database 4 records the state of the device. For example, if there are three types of states, “normal”, “abnormal A”, and “abnormal B”, each is converted into number data such as “1”, “2”, and “3”. However, the equipment status is not recorded in the maintenance database 4 at fixed time intervals. Therefore, the data is processed according to the time interval of the input data a. By correcting the time interval in this way, data corresponding to the operation data at the same time and the state of the equipment at that time can be obtained.

Step３０５：Step３０１で分類した全てのグループに対してStep３０３〜Step３０５の処理を実行したかを判定する。残っているグループがあればそれらのグループに対応するデータについてStep３０３〜Step３０５を実行する。全てのグループに対して処理が完了した場合は終了する。 Step 305: It is determined whether the processing of Step 303 to Step 305 has been executed for all the groups classified in Step 301. If there are any remaining groups, Steps 303 to 305 are executed for the data corresponding to those groups. If the processing is completed for all groups, the process ends.

以上の処理により、Step３０１で分類した個数だけ入力データとそのときのカテゴリーＢのペアが作成される。これらはそれぞれ異なるデータ分類装置１ａ〜１ｎに入力される。 As a result of the above processing, pairs of input data and category B at that time are created for the number classified in Step 301. These are respectively input to different data classification apparatuses 1a to 1n.

次にデータ分類装置１ａ〜１ｎの動作について説明する。データ分類装置１ａ〜１ｎでは、入力データとカテゴリーＢの関係を学習する。入力データ，カテゴリーＢは共にＭ個ある。本実施例では、Ｍ組のデータを順にデータ分類装置１ａ〜１ｎに入力した。Ｍ組の入力データとカテゴリーＢの関係を学習するアルゴリズムについては本発明の第２の実施例において詳細に説明したため、本実施例では説明は割愛する。 Next, the operation of the data classification devices 1a to 1n will be described. The data classification devices 1a to 1n learn the relationship between input data and category B. There are M input data and category B. In this embodiment, M sets of data are sequentially input to the data classification devices 1a to 1n. Since the algorithm for learning the relationship between the M sets of input data and category B has been described in detail in the second embodiment of the present invention, description thereof will be omitted in this embodiment.

学習が終了すると、学習の結果得られたプロトタイプをデータ分類装置１ａ〜１ｎ内の記憶領域に保存する。 When the learning is completed, the prototype obtained as a result of the learning is stored in the storage areas in the data classification devices 1a to 1n.

次に推定モードの動作について説明する。推定モードでは、データ受信装置２から１分毎に送られる運転データに基づき発電設備８ａ〜８ｎの状態を診断する。データ受信装置２から送られたデータはデータ制御装置５により加工され、データ分類装置１ａ〜１ｎに送られる。 Next, the operation in the estimation mode will be described. In the estimation mode, the state of the power generation facilities 8a to 8n is diagnosed based on the operation data sent from the data receiving device 2 every minute. Data sent from the data receiving device 2 is processed by the data control device 5 and sent to the data classification devices 1a to 1n.

データ制御装置５にはサイト毎にデータを正規化するための最大値及び最小値が記憶されている。従って、データ受信装置２から送られたデータは、学習モードのステップ３０４の処理により瞬時にデータ変換され、データ分類装置１ａ〜１ｎに入力される。データ分類装置１ａ〜１ｎには学習モードで得られたプロトタイプが記憶されている。従って、第２の実施例の推定モードで説明した手順でカテゴリーＡ及び発電設備の状態（カテゴリーＢ）が推定される。推定されたカテゴリーＡ及び発電設備の状態は操作端末６のモニタ画面に出力することができる。また、運転データベース３に保存することもできる。 The data control device 5 stores a maximum value and a minimum value for normalizing data for each site. Accordingly, the data sent from the data receiving device 2 is instantaneously converted by the processing in step 304 in the learning mode and input to the data classification devices 1a to 1n. The data classification devices 1a to 1n store prototypes obtained in the learning mode. Therefore, the category A and the state of the power generation facility (category B) are estimated by the procedure described in the estimation mode of the second embodiment. The estimated category A and the state of the power generation equipment can be output to the monitor screen of the operation terminal 6. It can also be stored in the operation database 3.

なお、実施例３では、実施例２のデータ分類装置を利用したが、実施例１のデータ分類装置を利用することも可能である。 In the third embodiment, the data classification device according to the second embodiment is used. However, the data classification device according to the first embodiment can be used.

上述した各実施例の装置は、記憶手段やＣＰＵを備えたコンピュータなどで実施することができ、また装置の有する機能としての各処理手段などはプログラムモジュールであり、モジュールを読込んでコンピュータに実行させることで各機能を実施することができる。また、プログラムモジュールを記録した記録媒体をコンピュータに読込ませることにより各機能を実施可能である。 The apparatus of each embodiment described above can be implemented by a computer or the like having storage means and a CPU, and each processing means as a function of the apparatus is a program module, and the module is read and executed by the computer. Each function can be implemented. Each function can be implemented by causing a computer to read a recording medium on which a program module is recorded.

尚、上述した実施例では、データ分類装置の中に、入力データ前処理手段１０，データ分類手段２０，処理フロー制御手段３０などの各手段を設けて、処理を実行することとしたが、それぞれの処理を別々の装置で実行し、相互の装置間で通信することでも実行することができる。 In the above-described embodiment, each unit such as the input data preprocessing unit 10, the data classification unit 20, and the processing flow control unit 30 is provided in the data classification device, and the processing is executed. These processes can be executed by separate apparatuses and communicated with each other.

１データ分類装置
２データ受信装置
３運転データベース
４保守データベース
５データ制御装置
６操作端末
７インターネット
８発電設備
１０入力データ前処理手段
２０，４０データ分類手段
２１データ読込み手段
２２内部データ初期化手段
２３内部データ格納部
２４プロトタイプ選択手段
２５プロトタイプ評価手段
２６プロトタイプ追加手段
２７内部データ補正手段
２８内部データ及びマトリックス補正手段
３０処理フロー制御手段
４１カテゴリーＢ読込み手段
４２カテゴリー変換手段
４３カテゴリー対応マトリックス格納部 1 Data classification device 2 Data reception device 3 Operation database 4 Maintenance database 5 Data control device 6 Operation terminal 7 Internet 8 Power generation facility 10 Input data preprocessing means 20, 40 Data classification means 21 Data reading means 22 Internal data initialization means 23 Inside Data storage unit 24 Prototype selection unit 25 Prototype evaluation unit 26 Prototype addition unit 27 Internal data correction unit 28 Internal data and matrix correction unit 30 Processing flow control unit 41 Category B reading unit 42 Category conversion unit 43 Category correspondence matrix storage unit

Claims

Computer
Calculate the standard deviation of each item of sampled input data, pre-process the input data based on the standard deviation of each item found,
Select the prototype of the category closest to the preprocessed input data,
Evaluating whether the selected prototype is valid from the relationship between the prototype and the input data,
A data classification method characterized by classifying a plurality of input data composed of a plurality of items into a plurality of categories.

The data classification method according to claim 1,
If the evaluation determines that the prototype is valid, correct at least one of the region determination parameter and the prototype,
A data classification method comprising: adding a new prototype when the prototype is not determined to be valid by the evaluation.

A data classification method in which a computer classifies a plurality of natural input data into a plurality of preset categories B,
Storing a prototype of category A related to category B, a region determination parameter that defines the size of the region of category A, and a category correspondence matrix for converting category A to category B;
Calculate the standard deviation of each item of sampled input data, pre-process the input data based on the standard deviation of each item found,
A category A prototype closest to the preprocessed input data is selected, whether the prototype is valid is determined from the relationship between the selected prototype and the preprocessed input data, and the prototype is determined to be valid If not, add a new prototype and repeat selection of the prototype, and if the prototype is determined to be valid, correct the region determination parameter or the prototype and the category correspondence matrix;
A data classification method, wherein the selected category A is converted to category B using a corrected category correspondence matrix.

A data classification method according to claim 3, wherein
If the selected category A already corresponds to another category B when adding a relationship between the selected category A and the preset category B to the category correspondence matrix,
The addition of the correspondence relationship is rejected, the category A prototype is corrected so as to be away from the input data, and the region determination parameter is equal to the distance between the category A prototype and the input data. The data classification method characterized by correcting as follows.

The computer inputs the operation data of the equipment, executes the data classification method according to claim 1,
A facility diagnosis method, wherein the facility state is diagnosed based on a category classified by the data classification method.

The computer inputs the operation data of the equipment, executes the data classification method according to claim 3,
Giving the state of the equipment as a teacher pattern, learning the relationship between the state of the equipment and the operation data of the equipment by the data classification device,
A diagnosis method characterized by inputting unlearned operation data to the data classification device after learning, and diagnosing the state of the facility based on a category B output by the data classification device.

A data classification device for classifying a plurality of input data composed of a plurality of items into a plurality of categories,
A data preprocessing means for obtaining a standard deviation of each item of the sampled input data and preprocessing the input data based on the obtained standard deviation of each item;
Prototype selection means for selecting a prototype of the category closest to the input data processed by the data preprocessing means;
A data classification apparatus comprising: prototype evaluation means for evaluating whether the prototype is valid from the relationship between the prototype and the input data.

In the data classification device according to claim 7,
The prototype of the category, and an internal data storage unit that stores a region determination parameter that defines the size of the category region;
Prototype addition means to create a new prototype,
Internal data correction means capable of correcting the region determination parameter and the prototype;
In the prototype evaluation unit, when the prototype is determined to be valid, the internal data correction unit corrects at least one of the region determination parameter and the prototype, and if the prototype is not determined to be valid, the prototype evaluation unit A data classification device characterized in that a new prototype is created by an additional means.

A data classification device for classifying a plurality of input data composed of a plurality of items into a plurality of preset categories B,
A prototype of category A related to category B, and an internal data storage for storing area determination parameters that define the size of the area of category A;
A category correspondence matrix storage for storing a category correspondence matrix for converting category A to category B;
A data preprocessing means for obtaining a standard deviation of each item of the sampled input data and preprocessing the input data based on the obtained standard deviation of each item;
Prototype selection means for selecting a category A prototype closest to the input data processed by the data preprocessing means;
Prototype evaluation means for evaluating whether the prototype is valid from the relationship between the prototype and the input data;
Prototype adding means for creating a new category A prototype and repeating selection of the prototype when the prototype evaluation means does not determine that the prototype is valid;
Internal data and matrix correction means for correcting the region determination parameter or the prototype, and the category correspondence matrix when the prototype is determined to be valid;
A data classification apparatus comprising: category conversion means for converting the selected category A into a category B using the category correspondence matrix when the prototype is determined to be valid.

The data classification device according to claim 9, wherein
In the internal data and matrix correction means, when adding the relationship between the selected category A and the preset category B to the category correspondence matrix, if the selected category A already corresponds to another category B, Refuse to add correspondence,
The category A prototype is corrected away from the input data, and the area determination parameter is corrected so that the area determination parameter is equal to the distance between the category A prototype and the input data. Data classification device.

In the equipment diagnostic apparatus, the data classification apparatus according to claim 6 is provided,
A facility diagnosis apparatus, wherein operation data of the facility is input to the data classification device, and the state of the facility is diagnosed based on the classified category.

In equipment diagnostic equipment,
A data classification device according to claim 8,
Giving the state of the equipment as a teacher pattern, learning the relationship between the state of the equipment and the operation data of the equipment by the data classification device,
An equipment diagnosis apparatus characterized by inputting unlearned operation data to the data classification apparatus after learning, and diagnosing the state of the equipment based on the category B output by the data classification apparatus.