JP5517973B2

JP5517973B2 - Pattern recognition apparatus and pattern recognition method

Info

Publication number: JP5517973B2
Application number: JP2011047991A
Authority: JP
Inventors: 明三鶴田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2011-03-04
Filing date: 2011-03-04
Publication date: 2014-06-11
Anticipated expiration: 2031-03-04
Also published as: JP2012185657A

Description

本発明は、多次元情報からパターン認識を行なうパターン認識装置およびパターン認識方法に関する。本発明は特に、生産ラインの異常等の「診断・監視」、製品特性値の「予測・検査」、官能特性値の「識別」といった、多次元情報からパターン認識を行なう分野に関する。 The present invention relates to a pattern recognition apparatus and a pattern recognition method for performing pattern recognition from multidimensional information. The present invention particularly relates to the field of pattern recognition from multidimensional information such as “diagnosis / monitoring” of production line abnormality, “prediction / inspection” of product characteristic values, and “identification” of sensory characteristic values.

多次元情報から予測および診断を行なうパターン認識の方法の一つとして、田口玄一博士から提案されたＭＴシステムがある。ＭＴシステムにおいては、ＭＴ法（マハラノビス距離を利用する方法）、ＭＴＡ法（ＭＴ法での相関係数行列を分散共分散行列で代替する方法）、ＴＳ法（特徴項目による直交展開を利用する方法）、Ｔ法（特に、単位空間が集団の中央の場合であるＴ法（１）を指す）、標準ＳＮ比を使ったＴ法（たとえば文字の認識のように、真値がなく、かつ多数の種類の単位空間がありうる場合に用いられる方法であり、ＲＳ法、ＲＴ法、Ｔ法（３）とも呼ばれる）などが提案されている（たとえば非特許文献１から３を参照）。また、上記のＭＴシステムを品質管理、検査などに応用した装置もこれまでに提案されている（たとえば特許文献１，２を参照）。 One of the pattern recognition methods for predicting and diagnosing from multidimensional information is the MT system proposed by Dr. Genichi Taguchi. In the MT system, the MT method (a method using the Mahalanobis distance), the MTA method (a method for replacing the correlation coefficient matrix in the MT method with a variance-covariance matrix), and the TS method (a method using orthogonal expansion by feature items) ), The T method (in particular, the T method (1) in which the unit space is the center of the group), and the T method using a standard signal-to-noise ratio (for example, there is no true value as in character recognition and many (Referred to as non-patent documents 1 to 3), and the like. The RS method, the RT method, and the T method (3) are proposed. In addition, devices that apply the above-described MT system to quality control, inspection, and the like have been proposed (see, for example, Patent Documents 1 and 2).

単位空間とは、目的に対して均質な集団に属するデータセットを意味する。たとえば、健康診断（病気の発見）という目的に対して均質な集団とは、健康人の集団である。健康人の集団は、健康状態に関する特徴項目の値の分散が比較的小さいために、ある特定のパターンを形成していると考えられている。一方、後で説明する信号空間とは、予測精度を評価するときの基準になるものであり、上記の例では、健康状態がさまざまな（健康から重篤な不健康状態まで）集団に属するデータセットである。 The unit space means a data set belonging to a homogeneous group for the purpose. For example, a group that is homogeneous for the purpose of health examination (discovery of a disease) is a group of healthy people. A group of healthy persons is considered to form a specific pattern because of a relatively small distribution of characteristic item values related to health. On the other hand, the signal space described later is a standard for evaluating the prediction accuracy. In the above example, a data set belonging to a group having various health states (from health to severe unhealthy states). It is.

Ｔ法においては、単位空間は、その平均値のデータが用いられるだけである。一方、信号空間のデータは、データベースを作るためのデータになることも一般的である（たとえば非特許文献４を参照）。 In the T method, only the average value data is used for the unit space. On the other hand, signal space data is generally data for creating a database (see, for example, Non-Patent Document 4).

ＭＴシステムにおけるこれらの手法群は、（１）項目間の相関を重視するＭＴ法、ＭＴＡ法のグループと、（２）項目の主効果を重視するＴＳ法、Ｔ法、ＲＴ法のグループとに分けられる。単位空間の性質、単位空間として用意することのできるデータセット数、多重共線性の有無などといった条件によって、上記（１），（２）のいずれかのグループが選択される。一般には予測精度の点で（１）のグループが有利であり、データ制約の少なさの点で（２）のグループが有利と考えられる。 These methods in the MT system are divided into (1) MT method and MTA method groups that emphasize correlation between items, and (2) TS method, T method, and RT method groups that emphasize the main effects of items. Divided. One of the groups (1) and (2) is selected according to conditions such as the nature of the unit space, the number of data sets that can be prepared as the unit space, and the presence or absence of multicollinearity. In general, the group (1) is advantageous in terms of prediction accuracy, and the group (2) is considered advantageous in terms of fewer data constraints.

以下、（１）のグループとしてＭＴ法、（２）のグループとしてＴ法を代表的に取り上げて比較説明する。ＭＴ法は、良否判定などの「診断」に用いられることが多い。ＭＴ法では、単位空間が集団の端にある。ただし、ＭＴ法では、単位空間データにおける相関係数行列の逆行列を演算する過程で、（ａ）相関係数が１となるような２つの特徴項目の組み合わせがある場合、あるいは（ｂ）ある特徴項目の値がデータセット間で同一（その項目のσ＝０）となるといった場合には、逆行列を計算できないという課題がある。 Hereinafter, the MT method will be representatively taken as a group of (1), and the T method will be taken as a representative of a group of (2) for comparison. The MT method is often used for “diagnosis” such as pass / fail determination. In the MT method, the unit space is at the end of the group. However, in the MT method, in the process of calculating the inverse matrix of the correlation coefficient matrix in the unit space data, (a) there is a combination of two feature items such that the correlation coefficient is 1, or (b) When the value of the feature item is the same between the data sets (σ = 0 of the item), there is a problem that the inverse matrix cannot be calculated.

このため、一般に（ａ）の場合には、２つの特徴項目は同一の内容と考えて、組み合わせの一方が解析から省かれる。また、一般に（ｂ）の場合には、その項目の値は将来にわたり変化しないから不要と考えて解析から省かれる。しかしながら（ａ）または（ｂ）のような状況が発生しているのは、結果が均一な単位空間の中だけである可能性が高い。本来検知したい異常状態が含まれる信号空間データにおいては、２つの特徴項目間の相関係数が１にはならず、特徴項目の分散（標準偏差）も０にはならない。このため、検知すべき異常の特徴を有する項目を解析から事前に省くことは、パターン認識精度の低下につながるという問題をもたらす。 For this reason, generally in the case of (a), two feature items are considered to have the same content, and one of the combinations is omitted from the analysis. In general, in the case of (b), since the value of the item does not change in the future, it is considered unnecessary and is omitted from the analysis. However, it is highly possible that the situation such as (a) or (b) occurs only in a unit space where the result is uniform. In signal space data including an abnormal state that is originally desired to be detected, the correlation coefficient between two feature items does not become 1, and the variance (standard deviation) of feature items does not become 0. For this reason, omitting items having abnormal characteristics to be detected in advance from the analysis causes a problem that the pattern recognition accuracy is lowered.

一方、Ｔ法は、たとえば符号をもった出力の定量的な予測および推定に用いられる。しかしながらＭＴ法とは異なり、Ｔ法では、２つの特徴項目間の相関関係を考慮して予測および推定を行なうことはできない。そのようなＴ法での課題を解決するために、Ｔ法において、２つの特徴項目間の相関関係を考慮して予測および推定を行なう方法が提案されている（非特許文献５を参照）。 On the other hand, the T method is used, for example, for quantitative prediction and estimation of an output having a sign. However, unlike the MT method, the T method cannot perform prediction and estimation in consideration of the correlation between two feature items. In order to solve such a problem in the T method, a method of performing prediction and estimation in consideration of the correlation between two feature items has been proposed in the T method (see Non-Patent Document 5).

特開２００９−２１０４４５号公報JP 2009-210445A 特開２００９−２８８１００号公報JP 2009-288100 A

田口玄一、品質工学便覧、日刊工業新聞社、２００７年Genichi Taguchi, Handbook of Quality Engineering, Nikkan Kogyo Shimbun, 2007 田口玄一、ＭＴシステムによる予測と推定、標準化と品質管理、Ｖｏｌ．５８，Ｎｏ．８，ｐｐ．６８−７６，２００５年Genichi Taguchi, prediction and estimation by MT system, standardization and quality control, Vol. 58, no. 8, pp. 68-76, 2005 田口玄一、画像認識、標準ＳＮ比を用いるT法、標準化と品質管理、Ｖｏｌ．５８，Ｎｏ．１１，ｐｐ．９４−１０１，２００５年Taichi Genichi, image recognition, T method using standard S / N ratio, standardization and quality control, Vol. 58, no. 11, pp. 94-101, 2005 吉野荘平、ＭＴシステムによる不動産価格の予測（３）、品質工学、品質工学会，Ｖｏｌ．１４，Ｎｏ．１，２００６年Shohei Yoshino, Real Estate Price Prediction with MT System (3), Quality Engineering, Quality Engineering Society, Vol. 14, no. 1,2006 鐡見太郎、T法において相関を考慮する方法、第１５回品質工学研究発表大会論文集、品質工学会、ｐｐ．４３４−４３７，２００７年Taro Tadami, Method of Considering Correlation in T Method, 15th Quality Engineering Research Conference Proceedings, Quality Engineering Society, pp. 434-437, 2007

非特許文献５に記載の方法は、単位空間内のすべての２項目間の相関を考慮した項目を新たに作成して、その項目をＴ法での項目に加えて解析を行なうというものである。しかしながら非特許文献５に記載の方法には、以下のような課題がある。 The method described in Non-Patent Document 5 is to newly create an item considering the correlation between all two items in the unit space, and add the item to the item in the T method for analysis. . However, the method described in Non-Patent Document 5 has the following problems.

図１１は、非特許文献５に記載の方法の課題点を説明するための図である。図１１を参照して、２項目ｘ_p，ｘ_q（ｐ，ｑは項目番号を表わす）の間の相関を示す項目Ｘ_pqは、基準化された項目Ｘ_pとＸ_qとによって表わされる２次元平面において、単位空間データ１に対する単回帰直線２とデータ３０との間の距離ｄ、あるいは、距離ｄに比例する量として表わされる。単回帰直線２は、データ中心（原点）を通る直線である。この定義によれば、単回帰直線２上にあるデータのＸ_pqは、すべて同じ値（Ｘ_pq＝０）である。ここではデータを示す平面上の点を単に「データ」と呼んでいる。 FIG. 11 is a diagram for explaining the problems of the method described in Non-Patent Document 5. Referring to FIG. 11, item X _pq indicating the correlation between two items x _p and x _q (p and q represent item numbers) is represented by _scaled items X _p and X _q 2 In the dimension plane, the distance d between the single regression line 2 and the data 30 with respect to the unit space data 1 is expressed as an amount proportional to the distance d. The single regression line 2 is a straight line passing through the data center (origin). According to this definition, X _pq of data on the single regression line 2 is all the same value (X _pq = 0). Here, the point on the plane showing the data is simply referred to as “data”.

実際には、複数のデータが単回帰直線２上にあったとしても、それらのデータが異なる特徴や結果を有していることがある。たとえば２つのデータ３１ａ，３１ｂは単回帰直線２上にあるので、データ３１ａ，３１ｂのＸ_pqは０（すなわち距離ｄ＝０）となる。データ３１ａは、データ中心（原点）付近に位置する一方、データ３１ｂはデータ中心から大きく離れた信号空間に属する。すなわちデータ３１ｂは異常のデータである。 Actually, even if a plurality of data is on the single regression line 2, the data may have different characteristics and results. For example, since the two data 31a and 31b are on the single regression line 2, X _pq of the data 31a and 31b is 0 (that is, the distance d = 0). The data 31a is located near the data center (origin), while the data 31b belongs to a signal space far away from the data center. That is, the data 31b is abnormal data.

同様の事例は、単回帰直線２と平行な直線１１上にあるデータ３２ａ〜３２ｃによっても示される。直線１１と単回帰直線２との間の距離はｄ’である。したがってデータ３２ａ〜３２ｃに対するＸ_pqは、すべて同じ値（Ｘ_pq＝ｄ’)となる。しかしながらデータ３２ａ〜３２ｃのうち、データ３２ａはデータ中心（原点）に最も近くに位置する。データ３２ｃは原点から最も遠くに位置し、データ３２ｂは、データ３２ａとデータ３２ｃとの間に位置する。 A similar case is also indicated by data 32a to 32c on a straight line 11 parallel to the single regression line 2. The distance between the straight line 11 and the single regression line 2 is d ′. Accordingly, X _pq for the data 32a to 32c all have the same value (X _pq = d ′). However, of the data 32a to 32c, the data 32a is located closest to the data center (origin). Data 32c is located farthest from the origin, and data 32b is located between data 32a and data 32c.

相関をもった２項目の特徴によって表現されるデータにおいて、同等の異常度（距離とも呼ばれる）をもつ集団は、図１１では同心楕円２０上の点で表わされるデータの集団であって単回帰直線から等距離の集団ではない。このため非特許文献５に記載の方法によれば、十分な推定精度が得られない場合が生じ得る。 In the data expressed by the features of the two items having the correlation, a group having the same degree of abnormality (also called distance) is a group of data represented by points on the concentric ellipse 20 in FIG. It is not an equidistant group from. For this reason, according to the method described in Non-Patent Document 5, there may occur a case where sufficient estimation accuracy cannot be obtained.

本発明の目的は、多次元情報に基づいてパターン認識を行なう分野において、データ形式の制約が少なく、かつ、より高い識別精度を得ることが可能な方法および装置を提供することである。 An object of the present invention is to provide a method and apparatus capable of obtaining higher identification accuracy with less restrictions on data formats in the field of pattern recognition based on multidimensional information.

本発明のある局面に係るパターン認識装置は、複数の原特徴項目を各々が有する複数のデータに基づいてパターン認識を行なうパターン認識装置であって、データごとに、複数の原特徴項目の中から２つの原特徴項目を選択するすべての組み合わせに対して評価距離を算出する評価距離算出部と、データごとに、すべての組み合わせに対応して算出された評価距離のうちの少なくとも一部を複数の原特徴項目に加えることによって、複数の新たな特徴項目を生成して、複数の新たな特徴項目の各々の値と当該新たな特徴項目の真値との相関を表わす重み係数を算出する重み係数算出部と、新たな特徴項目ごとの重み係数を用いて、出力の予測値を算出する予測値算出部と、予測値と、予め定められたしきい値とを比較して、目的に対する判断を行なう判断部とを備える。評価距離算出部は、評価距離としてマハラノビス距離を算出する。 A pattern recognition device according to an aspect of the present invention is a pattern recognition device that performs pattern recognition based on a plurality of pieces of data each having a plurality of original feature items. An evaluation distance calculation unit that calculates an evaluation distance for all combinations that select two original feature items, and for each data, at least some of the evaluation distances that are calculated corresponding to all combinations are a plurality of A weighting factor that generates a plurality of new feature items by adding to the original feature item and calculates a weighting factor that represents the correlation between the value of each of the plurality of new feature items and the true value of the new feature item A calculation unit, a prediction value calculation unit that calculates a predicted output value using a weighting factor for each new feature item, a comparison between the prediction value and a predetermined threshold value, and a determination for the purpose And a determination unit that performs. The evaluation distance calculation unit calculates the Mahalanobis distance as the evaluation distance.

本発明の他の局面に係るパターン認識装置は、複数の原特徴項目を各々が有する複数のデータに基づいてパターン認識を行なうパターン認識装置であって、データごとに、複数の原特徴項目の中から２つの原特徴項目を選択するすべての組み合わせに対して評価距離を算出する評価距離算出部と、データごとに、すべての組み合わせに対応して算出された評価距離のうちの少なくとも一部を複数の原特徴項目に加えることによって、複数の新たな特徴項目を生成して、複数の新たな特徴項目の各々の値と当該新たな特徴項目の真値との相関を表わす重み係数を算出する重み係数算出部と、新たな特徴項目ごとの重み係数を用いて、出力の予測値を算出する予測値算出部と、予測値と、予め定められたしきい値とを比較して、目的に対する判断を行なう判断部とを備える。評価距離算出部は、評価距離として、ＭＴＡ（マハラノビス・タグチ・アジョイント法）によって用いられる評価距離を算出する。 A pattern recognition apparatus according to another aspect of the present invention is a pattern recognition apparatus that performs pattern recognition based on a plurality of pieces of data each having a plurality of original feature items. An evaluation distance calculation unit that calculates evaluation distances for all combinations that select two original feature items from, and a plurality of at least some of the evaluation distances calculated for all combinations for each data A weight for generating a plurality of new feature items by adding to the original feature item and calculating a weighting coefficient representing a correlation between each value of the plurality of new feature items and a true value of the new feature item A coefficient calculation unit, a prediction value calculation unit that calculates a predicted output value using a weighting factor for each new feature item, a prediction value and a predetermined threshold value are compared, and a determination for the purpose is made And a determination unit that performs. The evaluation distance calculation unit calculates an evaluation distance used by MTA (Mahalanobis Taguchi Adjoint method) as the evaluation distance.

本発明のさらに他の局面に係るパターン認識装置は、複数の原特徴項目を各々が有する複数のデータに基づいてパターン認識を行なうパターン認識装置であって、データごとに、複数の原特徴項目の中から２つの原特徴項目を選択するすべての組み合わせに対して評価距離を算出する評価距離算出部と、データごとに、すべての組み合わせに対応して算出された評価距離のうちの少なくとも一部を複数の原特徴項目に加えることによって、複数の新たな特徴項目を生成して、複数の新たな特徴項目の各々の値と当該新たな特徴項目の真値との相関を表わす重み係数を算出する重み係数算出部と、新たな特徴項目ごとの重み係数を用いて、出力の予測値を算出する予測値算出部と、予測値と、予め定められたしきい値とを比較して、目的に対する判断を行なう判断部とを備える。評価距離算出部は、評価距離として、第１の特徴項目の基準化値と第２の特徴項目の基準化値との積を算出する。 A pattern recognition apparatus according to still another aspect of the present invention is a pattern recognition apparatus that performs pattern recognition based on a plurality of data each having a plurality of original feature items, and each of the data includes a plurality of original feature items. An evaluation distance calculation unit that calculates evaluation distances for all combinations that select two original feature items from the inside, and for each data, at least a part of the evaluation distances that are calculated for all combinations A plurality of new feature items are generated by adding to the plurality of original feature items, and a weighting coefficient representing a correlation between each value of the plurality of new feature items and the true value of the new feature item is calculated. The weight coefficient calculation unit, the prediction value calculation unit that calculates the predicted value of the output using the weight coefficient for each new feature item, the predicted value and a predetermined threshold value are compared, and Against And a determination section that performs determination. The evaluation distance calculation unit calculates the product of the normalized value of the first feature item and the normalized value of the second feature item as the evaluation distance.

本発明のさらに他の局面に係るパターン認識方法は、複数の原特徴項目を各々が有する複数のデータに基づいてパターン認識を行なうパターン認識方法であって、データごとに、複数の原特徴項目の中から２つの原特徴項目を選択するすべての組み合わせに対して評価距離を算出するステップと、データごとに、すべての組み合わせに対応して算出された評価距離のうちの少なくとも一部を複数の原特徴項目に加えることによって、複数の新たな特徴項目を生成するステップと、複数の新たな特徴項目の各々の値と当該新たな特徴項目の真値との相関を表わす重み係数を算出するステップと、新たな特徴項目ごとの重み係数を用いて、出力の予測値を算出するステップと、予測値と、予め定められたしきい値とを比較して、目的に対する判断を行なうステップとを備える。評価距離を算出するステップにおいて、評価距離としてマハラノビス距離を算出する。 A pattern recognition method according to yet another aspect of the present invention is a pattern recognition method for performing pattern recognition based on a plurality of pieces of data each having a plurality of original feature items. A step of calculating evaluation distances for all combinations of selecting two original feature items from the inside, and for each data, at least some of the evaluation distances calculated corresponding to all combinations are converted into a plurality of original distances. Adding to the feature item, generating a plurality of new feature items, and calculating a weighting factor representing a correlation between the value of each of the plurality of new feature items and the true value of the new feature item; , Using a weighting factor for each new feature item, calculating a predicted output value, comparing the predicted value with a predetermined threshold value, and determining the purpose And a step of performing. In the step of calculating the evaluation distance, a Mahalanobis distance is calculated as the evaluation distance.

本発明のさらに他の局面に係るパターン認識方法は、複数の原特徴項目を各々が有する複数のデータに基づいてパターン認識を行なうパターン認識方法であって、データごとに、複数の原特徴項目の中から２つの原特徴項目を選択するすべての組み合わせに対して評価距離を算出するステップと、データごとに、すべての組み合わせに対応して算出された評価距離のうちの少なくとも一部を複数の原特徴項目に加えることによって、複数の新たな特徴項目を生成するステップと、複数の新たな特徴項目の各々の値と当該新たな特徴項目の真値との相関を表わす重み係数を算出するステップと、新たな特徴項目ごとの重み係数を用いて、出力の予測値を算出するステップと、予測値と、予め定められたしきい値とを比較して、目的に対する判断を行なうステップとを備える。評価距離を算出するステップにおいて、評価距離として、ＭＴＡ（マハラノビス・タグチ・アジョイント法）によって用いられる評価距離を算出する。 A pattern recognition method according to yet another aspect of the present invention is a pattern recognition method for performing pattern recognition based on a plurality of pieces of data each having a plurality of original feature items. A step of calculating evaluation distances for all combinations of selecting two original feature items from the inside, and for each data, at least some of the evaluation distances calculated corresponding to all combinations are converted into a plurality of original distances. Adding to the feature item, generating a plurality of new feature items, and calculating a weighting factor representing a correlation between the value of each of the plurality of new feature items and the true value of the new feature item; , Using a weighting factor for each new feature item, calculating a predicted output value, comparing the predicted value with a predetermined threshold value, and determining the purpose And a step of performing. In the step of calculating the evaluation distance, the evaluation distance used by MTA (Mahalanobis Taguchi Adjoint method) is calculated as the evaluation distance.

本発明のさらに他の局面に係るパターン認識方法は、複数の原特徴項目を各々が有する複数のデータに基づいてパターン認識を行なうパターン認識方法であって、データごとに、複数の原特徴項目の中から２つの原特徴項目を選択するすべての組み合わせに対して評価距離を算出するステップと、データごとに、すべての組み合わせに対応して算出された評価距離のうちの少なくとも一部を複数の原特徴項目に加えることによって、複数の新たな特徴項目を生成するステップと、複数の新たな特徴項目の各々の値と当該新たな特徴項目の真値との相関を表わす重み係数を算出するステップと、新たな特徴項目ごとの重み係数を用いて、出力の予測値を算出するステップと、予測値と、予め定められたしきい値とを比較して、目的に対する判断を行なうステップとを備える。評価距離を算出するステップにおいて、評価距離として、第１の特徴項目の基準化値と第２の特徴項目の基準化値との積を算出する。 A pattern recognition method according to yet another aspect of the present invention is a pattern recognition method for performing pattern recognition based on a plurality of pieces of data each having a plurality of original feature items. A step of calculating evaluation distances for all combinations of selecting two original feature items from the inside, and for each data, at least some of the evaluation distances calculated corresponding to all combinations are converted into a plurality of original distances. Adding to the feature item, generating a plurality of new feature items, and calculating a weighting factor representing a correlation between the value of each of the plurality of new feature items and the true value of the new feature item; , Using a weighting factor for each new feature item, calculating a predicted output value, comparing the predicted value with a predetermined threshold value, and determining the purpose And a step of performing. In the step of calculating the evaluation distance, a product of the normalized value of the first feature item and the normalized value of the second feature item is calculated as the evaluation distance.

本発明によれば、多次元情報に基づいてパターン認識を行なう分野において、データ形式の制約が少なく、かつ、より高い識別精度を得ることができる。 According to the present invention, in the field of pattern recognition based on multidimensional information, there are few restrictions on the data format, and higher identification accuracy can be obtained.

本発明の実施の形態に係るパターン認識装置の概略構成を示した図である。It is the figure which showed schematic structure of the pattern recognition apparatus which concerns on embodiment of this invention. 図１に示したパターン認識装置の機能ブロック図である。It is a functional block diagram of the pattern recognition apparatus shown in FIG. 実施の形態１に係るパターン認識装置による処理の流れを説明するためのフローチャートである。5 is a flowchart for explaining a flow of processing by the pattern recognition apparatus according to the first embodiment. ステップＳＡの処理のために準備されたデータを表形式で説明した図である。It is the figure explaining the data prepared for the process of step SA in tabular form. 図３に示した処理の変形例を説明するためのフローチャートである。4 is a flowchart for explaining a modification of the process shown in FIG. 3. 実施の形態２に係るパターン認識装置による処理の流れを説明するためのフローチャートである。10 is a flowchart for explaining a flow of processing by the pattern recognition apparatus according to the second embodiment. Ｓ_βj−Ｖe_j＜０の場合のデータの例を示した図である。Is a diagram showing an example of data when the _S βj -Ve _j <0. 特徴項目と出力との関係をあらわすデータと、そのときの単相関係数、実施の形態４〜６のη_jとを比較して説明する図である。It is a figure explaining the data which show the relationship between a feature item and an output, and a simple correlation coefficient at that time, (eta) _j of Embodiments 4-6 compared. センサデバイスの常温での特性値および高温での予測値を模式的に示した図である。It is the figure which showed typically the characteristic value in the normal temperature of a sensor device, and the predicted value in high temperature. 田口のＴ法、非特許文献５の方法および本発明の実施の形態１〜３の各々に係る方法の比較結果を示した図である。It is the figure which showed the comparison result of the T method of Taguchi, the method of nonpatent literature 5, and the method which concerns on each of Embodiment 1-3 of this invention. 非特許文献５に記載の方法の課題点を説明するための図である。It is a figure for demonstrating the subject of the method of a nonpatent literature 5. FIG.

以下、この発明の実施の形態について図面を参照して詳しく説明する。なお、同一または相当する部分には同一の参照符号を付して、その説明を繰返さない。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. The same or corresponding parts are denoted by the same reference numerals, and description thereof will not be repeated.

図１は、本発明の実施の形態に係るパターン認識装置の概略構成を示した図である。図１を参照して、パターン認識装置５０は、コンピュータシステムによって実現可能である。パターン認識装置５０は、ＣＰＵ（中央演算処理装置）５１と、ＲＡＭ（Random Access Memory）などの主記憶装置５２と、ＨＤＤ（Hard Disk Drive）などの補助記憶装置５３と、キーボードやマウスなどの入力装置５４と、モニタやプリンタなどの出力装置５５と、外部の機器と情報の授受を行なう通信装置５６とを備える。 FIG. 1 is a diagram showing a schematic configuration of a pattern recognition apparatus according to an embodiment of the present invention. Referring to FIG. 1, the pattern recognition device 50 can be realized by a computer system. The pattern recognition device 50 includes a CPU (Central Processing Unit) 51, a main storage device 52 such as a RAM (Random Access Memory), an auxiliary storage device 53 such as an HDD (Hard Disk Drive), and an input such as a keyboard and a mouse. A device 54, an output device 55 such as a monitor or a printer, and a communication device 56 for exchanging information with external devices are provided.

補助記憶装置５３は、後述するパターン認識方法をコンピュータシステムに実行させるためのプログラムを格納する。ＣＰＵ５１が補助記憶装置５３から当該プログラムを読み出し、主記憶装置５２にプログラムをロードする。そしてＣＰＵ５１が主記憶装置５２にロードされたプログラムを実行することによってパターン認識方法が実行される。 The auxiliary storage device 53 stores a program for causing a computer system to execute a pattern recognition method described later. The CPU 51 reads the program from the auxiliary storage device 53 and loads the program into the main storage device 52. The pattern recognition method is executed when the CPU 51 executes the program loaded in the main storage device 52.

パターン認識方法をコンピュータシステムに実行させるためのプログラムを提供するための手段は特に限定されるものではない。たとえばＣＰＵ５１がＣＤ−ＲＯＭ等の記憶媒体に記録されたプログラムを読み出して、そのプログラムを補助記憶装置５３に格納してもよい。また、ＣＰＵ５１が通信回線を通じて提供されたプログラムを、通信装置５６を介して受信し、その受信したプログラムを補助記憶装置５３に格納してもよい。 Means for providing a program for causing a computer system to execute the pattern recognition method is not particularly limited. For example, the CPU 51 may read a program recorded on a storage medium such as a CD-ROM and store the program in the auxiliary storage device 53. Alternatively, the CPU 51 may receive a program provided via a communication line via the communication device 56 and store the received program in the auxiliary storage device 53.

また、プログラムが記録された記録媒体は、コンピュータが読み取り可能な記録媒体であればよく、ＣＤ−ＲＯＭに限定されるものではない。 The recording medium on which the program is recorded is not limited to a CD-ROM as long as it is a computer-readable recording medium.

図２は、図１に示したパターン認識装置の機能ブロック図である。図２を参照して、パターン認識装置５０は、データ入力部６１と、評価距離算出部６２と、重み係数算出部６３と、予測値算出部６４と、判断部６５と、出力部６６とを備える。 FIG. 2 is a functional block diagram of the pattern recognition apparatus shown in FIG. With reference to FIG. 2, the pattern recognition device 50 includes a data input unit 61, an evaluation distance calculation unit 62, a weight coefficient calculation unit 63, a predicted value calculation unit 64, a determination unit 65, and an output unit 66. Prepare.

データ入力部６１は、必要なデータを装置の外部から受付ける。データ入力部６１にデータを入力するための方法および手段は特に限定されるものではない。これにより、複数の特徴項目を変量として有するデータセットが準備される。 The data input unit 61 receives necessary data from the outside of the apparatus. The method and means for inputting data to the data input unit 61 are not particularly limited. As a result, a data set having a plurality of feature items as variables is prepared.

評価距離算出部６２は、データごとに、複数の原特徴項目の中から２個を選択するすべての組み合わせの各々に対して評価距離を算出する。これにより、データごとに複数の評価距離が算出される。 The evaluation distance calculation unit 62 calculates the evaluation distance for each of all combinations for selecting two of the plurality of original feature items for each data. Thereby, a plurality of evaluation distances are calculated for each data.

重み係数算出部６３は、データごとに、複数の評価距離の少なくとも一部を複数の原特徴項目に加えて複数の新たな特徴項目を生成する。この新たな特徴項目が、そのデータの全特徴項目となる。重み係数算出部６３は、複数のデータセットの同一の特徴項目について、その特徴項目に対応する真値との相関を表わす重み係数を算出する。 The weight coefficient calculation unit 63 generates a plurality of new feature items by adding at least a part of the plurality of evaluation distances to the plurality of original feature items for each data. This new feature item becomes all the feature items of the data. The weighting coefficient calculation unit 63 calculates a weighting coefficient that represents a correlation with the true value corresponding to the feature item for the same feature item of a plurality of data sets.

予測値算出部６４は、データごとに、複数の新たな特徴項目と複数の重み係数とを統合して、評価値としての出力の予測値を算出する。 The predicted value calculation unit 64 integrates a plurality of new feature items and a plurality of weighting factors for each data, and calculates an output predicted value as an evaluation value.

判断部６５は、評価値と予め定められたしきい値とを比較して、目的に対する判断を行なう。出力部６６は、判断部６５が判断した結果を装置の外部に出力する。 The determination unit 65 compares the evaluation value with a predetermined threshold value and makes a determination on the purpose. The output unit 66 outputs the result determined by the determination unit 65 to the outside of the apparatus.

次に各実施の形態に係るパターン認識方法を詳細に説明する。
［実施の形態１］
図３は、実施の形態１に係るパターン認識装置による処理の流れを説明するためのフローチャートである。図２および図３を参照して、ステップＳ０において、データ入力部６１は、複数のデータ（データセット）を受付ける。次にステップＳＡにおいて、評価距離算出部６２は、データごとに、複数の原特徴項目の中から選択された２個を組み合わせるすべての組み合わせの各々に対して評価距離を算出する。ステップＳＡの処理の詳細は以下の通りである。 Next, the pattern recognition method according to each embodiment will be described in detail.
[Embodiment 1]
FIG. 3 is a flowchart for explaining the flow of processing by the pattern recognition apparatus according to the first embodiment. 2 and 3, in step S0, data input unit 61 accepts a plurality of data (data sets). Next, in step SA, the evaluation distance calculation unit 62 calculates an evaluation distance for each combination of two combinations selected from the plurality of original feature items for each data. Details of the processing in step SA are as follows.

（ステップＳＡ）
図４は、ステップＳＡの処理のために準備されたデータを表形式で説明した図である。図４を参照して、単位空間に属するデータセットｉ＝１，２，・・・，ｎ１（データセット数：ｎ１個）と、単位空間に属さないデータセットｉ＝ｎ１＋１，ｎ１＋２，・・・，ｎ１＋ｎ２（データセット数：ｎ２個）とをあわせた信号空間のデータセットを準備する（ステップＳ１）。なお、このようにして準備された信号空間を以後は「新しい信号空間」と呼び、「単位空間に属さないデータセット」を「信号空間に属するデータセット」と呼ぶことで、両者を区別する。 (Step SA)
FIG. 4 is a diagram illustrating the data prepared for the process of step SA in a table format. Referring to FIG. 4, data sets i = 1, 2,..., N1 (number of data sets: n1) belonging to the unit space and data sets i = n1 + 1, n1 + 2,. , N1 + n2 (number of data sets: n2) is prepared as a data set in the signal space (step S1). The signal space prepared in this way is hereinafter referred to as “new signal space”, and “data set not belonging to unit space” is referred to as “data set belonging to signal space” to distinguish them.

新しい信号空間における出力の真値ｙ₁，ｙ₂，・・・，ｙ_nは、予め判明しているものとする。また、あるデータセットｉ（ｉ＝１，２，・・・，ｎ）に対して原特徴項目ｘ_i1，ｘ_i2，・・・，ｘ_ik（項目数ｋ）が与えられる。小文字「ｘ」は、基準化前の原特徴項目の値を表す。 Assume that the true values y ₁ , y ₂ ,..., Y _n of the outputs in the new signal space are known in advance. Further, original feature items x _i1 , x _i2 ,..., X _ik (number of items k) are given to a certain data set i (i = 1, 2,..., N). The lowercase letter “x” represents the value of the original feature item before normalization.

重要なことは、解析に用いる新しい信号空間が、さまざまな出力結果に対応した、単位空間に属さないデータセット（信号空間のデータセット）を含んでいるということである。単位空間に属さないデータセット（信号空間のデータセット）において２項目間の相関係数が１となる場合、その２項目は全く同じ内容を示している。また、両者の項目は将来にわたり同じ意味を有する。このため、それら２項目のうちの一方を削除してもパターン認識精度は低下しない。 What is important is that the new signal space used for the analysis includes a data set (signal space data set) corresponding to various output results and not belonging to the unit space. When the correlation coefficient between two items is 1 in a data set that does not belong to the unit space (signal space data set), the two items show exactly the same content. Both items have the same meaning in the future. For this reason, even if one of these two items is deleted, the pattern recognition accuracy does not decrease.

同様に、σ＝０となる項目は、その項目の値がどのような状態でも変化しないことを意味する。σ＝０となる項目は認識精度には全く寄与しないので、その項目を削除してもパターン認識精度への影響はない。したがって、図４に示すデータセットにおいては、２項目間の相関係数が１であったり、ある項目の標準偏差σが０であったりすることはない。 Similarly, an item with σ = 0 means that the value of the item does not change in any state. An item for which σ = 0 does not contribute to the recognition accuracy at all. Therefore, even if the item is deleted, the pattern recognition accuracy is not affected. Therefore, in the data set shown in FIG. 4, the correlation coefficient between two items is not 1, and the standard deviation σ of a certain item is not 0.

次に、新しい信号空間のデータセットｉの原特徴項目ｘ_i1，ｘ_i2，・・・，ｘ_ikの中から選択された２項目を組み合わせるすべての組み合わせに対して評価距離を算出する（ステップＳ３，Ｓ４）。実施の形態１では、信号空間を含めたマハラノビス距離が評価距離として用いられる。以下、評価距離の計算方法について詳細に説明する。 Next, evaluation distances are calculated for all combinations that combine two items selected from the original feature items x _i1 , x _i2 ,..., X _ik of the data set i of the new signal space (step S3). , S4). In the first embodiment, the Mahalanobis distance including the signal space is used as the evaluation distance. Hereinafter, a method for calculating the evaluation distance will be described in detail.

まず、原特徴項目のデータ値ｘ_ijの平均値μ_j（ｊ＝１，２，・・・，ｋ）および標準偏差σ_j（ｊ＝１，２，・・・，ｋ）が求められる（ステップＳ３）。平均値μ_jは式（１）に従って表わされる。標準偏差σ_jは式（２）に従って表わされる。 First, the average value μ _j (j = 1, 2,..., K) and the standard deviation σ _j (j = 1, 2,..., K) of the data values x _ij of the original feature items are obtained ( Step S3). The average value μ _j is expressed according to the equation (1). The standard deviation σ _j is expressed according to equation (2).

図３に戻り、ステップＳ３では、原特徴項目の値ｘ_ijを基準化することによってＸ_ijが求められる。基準化された値Ｘ_ijは以下の式（３）に従って表わされる。 Returning to FIG. 3, in step S3, X _ij is obtained by normalizing the value x _ij of the original feature item. The normalized value X _ij is expressed according to the following equation (3).

続いて、ステップＳ４において、ｋ個の原特徴項目のうちの２項目ｐ，ｑの組み合わせ（ｐ，ｑ）に対する評価距離Ｘ_pqが求められる。Ｘ_pqは２つの項目ｐ，ｑの二次元空間におけるマハラノビス距離で与えられる。項目ｐ，ｑの単相関係数をｒ_pqとすると、相関係数行列Ｒ_pqは、以下の式（４）に従って表わされる。 Subsequently, in step S4, an evaluation distance X _pq for a combination (p, q) of two items p, q out of k original feature items is obtained. X _pq is given by the Mahalanobis distance in the two-dimensional space of the two items p and q. When the single correlation coefficient of items p and q is r _pq , the correlation coefficient matrix R _pq is expressed according to the following equation (4).

評価距離Ｘ_pq（実施の形態１ではマハラノビス距離）は、相関係数行列Ｒ_pqの逆行列と、２項目ベクトル（Ｔは転置を表わす）を用いて、以下の式（５）に従って表わされる。 Evaluation distance X _pq (Mahalanobis distance in the first embodiment) is expressed according to the following equation (5) using an inverse matrix of correlation coefficient matrix R _pq and a two-item vector (T represents transposition).

なお、式（５）の右辺を項目数の２で除するかどうかはパターン認識精度には関係しない。また、評価距離Ｘ_pqの値として、式（５）の右辺の平方根をとってもよい。さらに、式（５）の右辺を項目数の２で除した値の平方根を評価距離Ｘ_pqの値として用いてもよい。 Note that whether the right side of equation (5) is divided by the number of items of 2 is not related to the pattern recognition accuracy. Further, the square root of the right side of Expression (5) may be taken as the value of the evaluation distance X _pq . Furthermore, the square root of the value obtained by dividing the right side of Equation (5) by 2 of the number of items may be used as the value of the evaluation distance X _pq .

複数（ｋ個）の原特徴項目から選択された２項目の組み合わせの全てに対して上記の方式に従って評価距離が算出される。これにより、ｋ個の原特徴項目に加えて、_kＣ₂＝｛ｋ・（ｋ−１）／２｝個の項目の新しい項目（以下では「相関特徴項目」と呼ぶ）が得られる。 Evaluation distances are calculated according to the above method for all combinations of two items selected from a plurality (k) of original feature items. As a result, in addition to the _k original feature items, new items of _k C ₂ = {k · (k−1) / 2} items (hereinafter referred to as “correlation feature items”) are obtained.

（ステップＳＢ）
ステップＳＢにおいて、重み係数算出部６３（図２参照）は、データごとに、複数の評価距離の少なくとも一部を複数の原特徴項目に加えて複数の新たな特徴項目を生成する。ステップＳＢの処理の詳細は以下の通りである。 (Step SB)
In step SB, the weight coefficient calculation unit 63 (see FIG. 2) generates a plurality of new feature items by adding at least a part of the plurality of evaluation distances to the plurality of original feature items for each data. Details of the processing in step SB are as follows.

まず、重み係数算出部６３は、ｋ個の原特徴項目と｛ｋ・（ｋ−１）／２｝個の相関特徴項目とを足し合わせて新たな特徴項目（全特徴項目）を生成する（ステップＳ５）。この場合の特徴項目の総数はｋ＋｛ｋ・（ｋ−１）／２｝＝ｋ・（ｋ＋１）／２となる。重み係数算出部６３は、ｋ・（ｋ＋１）／２＝Ｋと置き、Ｋ個の特徴項目に改めて連番１，２，・・・，Ｋを付与する。 First, the weight coefficient calculation unit 63 adds the k original feature items and {k · (k−1) / 2} correlation feature items to generate a new feature item (all feature items) ( Step S5). The total number of feature items in this case is k + {k · (k−1) / 2} = k · (k + 1) / 2. The weight coefficient calculation unit 63 sets k · (k + 1) / 2 = K, and assigns serial numbers 1, 2,..., K to the K feature items.

上記の例では、全特徴項目を生成するために全ての相関特徴項目が用いられている。ただし、複数の相関特徴項目の一部とｋ個の原特徴項目とによって全特徴項目が生成されてもよい。複数の相関特徴項目の全てまたは一部のいずれを用いるかは、解析後の項目の重要度の診断を行なう公知方法に依存して選択されるものであり、本発明の実施の形態を限定するものではない。 In the above example, all correlation feature items are used to generate all feature items. However, all the feature items may be generated by some of the plurality of correlation feature items and k original feature items. Whether to use all or some of the plurality of correlation feature items is selected depending on a known method for diagnosing the importance of the item after analysis, and limits the embodiment of the present invention. It is not a thing.

続いて、重み係数算出部６３は、ステップＳＡの処理によって準備された特徴項目ｊ(ｊ＝１，２，・・・，Ｋ)ごとに、特徴項目ｊの値（基準化された値）Ｘ_1j，Ｘ_2j，・・・，Ｘ_njと出力の真値ｙ₁，ｙ₂，・・・，ｙ_nとの間の相関の大きさに相当する重み係数ｗ₁，ｗ₂，・・・，ｗ_Kを算出する（ステップＳ６）。 Subsequently, the weighting factor calculation unit 63 calculates the value (standardized value) X of the feature item j for each feature item j (j = 1, 2,..., K) prepared by the process of step SA. _1j, X _2j, ..., the true value y _1, y ₂ of X _nj and output, ..., weighting factors w _1, w ₂ which corresponds to the magnitude of the correlation between y _n, ... , W _K are calculated (step S6).

（ステップＳＣ）
ステップＳＣにおいて、予測値算出部６４は、特徴項目ごとに算出された重み係数ｗ₁，ｗ₂，・・・，ｗ_Kを用いて、出力の予測値＾Ｙを求める（煩雑さを避けるためデータセット番号を省略するものとする）。予測値＾Ｙは、たとえば、重み係数ｗ₁，ｗ₂，・・・，ｗ_Kの按分によって以下の式（６）に従って算出される。 (Step SC)
In step SC, the predicted value calculation unit 64 obtains an output predicted value ^ Y using the weighting factors w ₁ , w ₂ ,..., W _K calculated for each feature item (to avoid complexity). The data set number shall be omitted). Predicted value ^ Y is, for example, the weighting factor w _1, w _2, · · ·, is calculated according to equation (6) below the apportioning of w _K.

ここで上記の重み係数を計算する際に特徴項目の値および出力の値を基準化した（たとえば平均値を引く）場合には、式（６）における特徴項目の値および出力の値の各々にもその基準化された値を用いるものとする。式（３）に用いられる基準化された特徴項目と区別するために、式（６）では、基準化された特徴項目をＺ₁，Ｚ₂，・・・，Ｚ_Kと表記する。 If the feature item value and the output value are standardized (for example, the average value is subtracted) when calculating the weighting factor, the feature item value and the output value in Equation (6) are respectively calculated. Also, the normalized value shall be used. In order to distinguish from the standardized feature items used in Equation (3), the standardized feature items are expressed as Z ₁ , Z ₂ ,..., Z _K in Equation (6).

β_jは基準化後の特徴項目Ｚ_jの単位を補正するための係数である。また、予測値＾Ｙは基準化された値であるので、予測値＾Ｙを基準化する前の値に戻す操作（たとえば平均値を加える）が必要であることはいうまでもない。 β _j is a coefficient for correcting the unit of the feature item Z _j after normalization. Further, since the predicted value ^ Y is a standardized value, it is needless to say that an operation (for example, adding an average value) to return the predicted value ^ Y to a value before standardization is necessary.

（ステップＳＤ）
ステップＳＣにおける予測値を算出する処理は、真値が判明している信号空間のデータセットに対して行なうこともできるし、真値が判明していない未知のデータセットに対しても行なうことができる。信号空間のデータセットに対して予測値＾Ｙを求めた場合は、予測値＾Ｙとそれに対応する真値ｙとの相関関係（相関係数、田口の動特性のＳＮ比など）を評価して、予測精度の評価を行なう。また、使用する特徴項目の組み合わせを最適化することもできる。この操作は公知のためここでの詳細な説明は繰り返さない。 (Step SD)
The process of calculating the predicted value in step SC can be performed on a signal space data set whose true value is known, or can be performed on an unknown data set whose true value is not known. it can. When the predicted value ^ Y is obtained for the signal space data set, the correlation between the predicted value ^ Y and the corresponding true value y (correlation coefficient, SN ratio of Taguchi's dynamic characteristics, etc.) is evaluated. To evaluate the prediction accuracy. It is also possible to optimize the combination of feature items to be used. Since this operation is publicly known, detailed description thereof will not be repeated here.

一方、未知のデータセットに対して予測値＾Ｙを求めた場合は、その予測値＾Ｙ(＾Ｙが基準化された値の場合は元の値に戻しておく)を、予め定められたしきい値ｙ_thと比較する。これにより、たとえば良否、合否、正常または異常、健康または不健康、などといった判定および判断が実行される。そして、ステップＳ１０においてその判断結果が出力される。 On the other hand, when the predicted value ^ Y is obtained for an unknown data set, the predicted value ^ Y (return to the original value if ^ Y is a normalized value) is determined in advance. Compare with threshold y _th . Thereby, for example, determination and determination, such as pass / fail, pass / fail, normal or abnormal, health or unhealthy, are executed. In step S10, the determination result is output.

実施の形態１に係る方法によれば、ステップＳＡの処理によって、評価距離としてマハラノビス距離を算出する。これにより、非特許文献５による方法での不具合（単位空間でσ＝０の場合に計算ができない、あるいは、項目Ｘ_pqが２項目間の相関を正しく示していない）といった不具合を回避できる。 According to the method according to the first embodiment, the Mahalanobis distance is calculated as the evaluation distance by the process of step SA. As a result, it is possible to avoid a problem such as a problem in the method according to Non-Patent Document 5 (calculation is not possible when σ = 0 in the unit space, or the item X _pq does not correctly indicate the correlation between the two items).

実施の形態１によれば、多重共線性（項目数ｋ＞データセット数ｎの場合、２つの特徴項目間の相関が１の場合、単位空間の項目の標準偏差が０の場合など）があるデータでも項目間の相関を考慮できるので、特徴項目間の相関が１の場合や項目の標準偏差が０の場合などに起因するデータ形式の制約を取り除くことができるとともに、従来の方法よりも的確な相関特徴項目を導入することができる。したがって実施の形態１によれば、パターン認識の精度を向上させることができる。 According to the first embodiment, there is multicollinearity (when the number of items k> the number of data sets n, the correlation between two feature items is 1, the standard deviation of unit space items is 0, etc.) Since the correlation between items can be taken into consideration even in the data, restrictions on the data format caused by the correlation between feature items being 1 or the standard deviation of the item being 0 can be removed, and more accurate than the conventional method. Correlation feature items can be introduced. Therefore, according to the first embodiment, the accuracy of pattern recognition can be improved.

（変形例）
上記の実施の形態においては、単位空間のデータセットおよび信号空間のデータセットの両方を用いて相関項目が計算される。ただし場合によっては、単位空間のデータセットのみを用いて相関項目を計算する方法も採用することも可能である。 (Modification)
In the above embodiment, the correlation item is calculated using both the unit space data set and the signal space data set. However, in some cases, it is also possible to employ a method of calculating a correlation item using only a unit space data set.

一般に、単位空間のデータのみから相関項目を作成する場合には、Ｒ_pq＝１（項目pとqとの相関係数が１である）の場合、σ_p＝０（項目pの分散が０、すなわち単位空間内で値が変化しない）の場合といったような特殊状況が発生するために、計算ができない、あるいは項目の一部を不用意に削除できないと考えられる。しかしながら単位空間でＲ_pq≠１、あるいはσ_p≠0となるような場合であれば、単位空間データだけから相関項目を作成したほうがパターン認識精度がよいケースもある。このような場合には、ステップＳ１において、単位空間のデータセットのみを用いて相関項目を計算する方法を採用することも可能である。 In general, when a correlation item is created only from unit space data, when R _pq = 1 (the correlation coefficient between items p and q is 1), σ _p = 0 (the variance of item p is 0) In other words, a special situation occurs such as the case where the value does not change in the unit space), so that it is considered that the calculation cannot be performed or a part of the item cannot be deleted carelessly. However, if R _pq ≠ 1 or σ _p ≠ 0 in the unit space, there are cases where the pattern recognition accuracy is better when the correlation item is created from the unit space data alone. In such a case, it is possible to employ a method of calculating correlation items using only a unit space data set in step S1.

図５は、図３に示した処理の変形例を説明するためのフローチャートである。図５を参照して、ステップＳ１において、単位空間のデータセットのみが用いられる。他のステップの処理は、図３の対応するステップの処理と同様であるので以後の説明は繰り返さない。 FIG. 5 is a flowchart for explaining a modification of the process shown in FIG. Referring to FIG. 5, in step S1, only a unit space data set is used. Since the processing of the other steps is the same as the processing of the corresponding steps in FIG. 3, the following description will not be repeated.

また、単位空間内でＲ_pq＝１、あるいはσ_p＝０といったことが発生する場合には、原特徴項目の値Ｘ_p、Ｘ_qに、平均が０となる非常に小さい乱数を加えればよい。これにより、項目を減らすことなくデータの制約（Ｒ_pq＝１、あるいはσ_p＝０）を取り除くことができる。「非常に小さい」とは、例えば、乱数の変動幅が、対応する項目の信号空間データの平均値に対してたとえば１／１００程度であることを意味する。 Further, when R _pq = 1 or σ _p = 0 occurs in the unit space, a very small random number with an average of 0 may be added to the values X _p and X _q of the original feature item. . Thereby, it is possible to remove the data restriction (R _pq = 1 or σ _p = 0) without reducing the number of items. “Very small” means, for example, that the fluctuation range of the random number is, for example, about 1/100 of the average value of the signal space data of the corresponding item.

［実施の形態２］
実施の形態２では、評価距離として、ＭＴＡ法（マハラノビス・タグチ・アジョイント法）における距離が用いられる。この点において実施の形態２は実施の形態１と異なる。 [Embodiment 2]
In the second embodiment, the distance in the MTA method (Mahalanobis Taguchi Adjoint method) is used as the evaluation distance. In this respect, the second embodiment is different from the first embodiment.

図６は、実施の形態２に係るパターン認識装置による処理の流れを説明するためのフローチャートである。図３および図６を参照して、実施の形態２では、ステップＳ２，Ｓ３の処理が省略されている点において図３のフローチャートと異なる。さらに、実施の形態２は、ステップＳ４における評価距離の算出方法の点で実施の形態１と異なる。以下、評価距離の算出方法について詳細に説明する。 FIG. 6 is a flowchart for explaining the flow of processing by the pattern recognition apparatus according to the second embodiment. 3 and 6, the second embodiment is different from the flowchart of FIG. 3 in that the processes of steps S2 and S3 are omitted. Further, the second embodiment differs from the first embodiment in the point of the evaluation distance calculation method in step S4. Hereinafter, a method for calculating the evaluation distance will be described in detail.

まず、図２に示した評価距離算出部６２は、データセットの原特徴項目pの分散Ｖ_p、現特徴項目qの分散Ｖ_q、項目ｐ，ｑの共分散Ｖ_pq＝Ｖ_qpを式（７）〜（９）に従って算出し、式（１０）に従って分散共分散行列Ｖを作成する。ｐ＝１，２，・・・，ｋであり、ｑ＝１，２，・・・，ｋであり、ｐ≠ｑである。 First, evaluation distance calculation unit 62 shown in FIG. 2, the dispersion V _p of the original characteristic item p dataset variance V _q of the current characteristic item q, items p, the covariance V _pq = V _qp of q formula ( 7) to (9), and a variance-covariance matrix V is created according to equation (10). p = 1, 2,..., k, q = 1, 2,..., k, and p ≠ q.

次に、評価距離算出部６２は、式（１１）に従って、分散共分散行列Ｖの余因子行列Ａを求める。 Next, the evaluation distance calculation unit 62 obtains a cofactor matrix A of the variance-covariance matrix V according to Expression (11).

続いて評価距離算出部６２は、余因子行列Ａを用いて、２つの特徴項目ｐ，ｑ間の相関特徴項目Ｘ_pqを以下の式（１２）に従って算出する。 Subsequently, the evaluation distance calculation unit 62 uses the cofactor matrix A to calculate the correlation feature item X _pq between the two feature items p and q according to the following equation (12).

なお、式（１２）の右辺を項目数２で除してもよい。また、Ｘ_pqの値として、式（１２）の右辺の平方根をとってもよい。さらに、式（１２）の右辺を項目数２で除して、その値の平方根をとってもよい。いずれの場合もパターン認識精度には影響しない。 The right side of equation (12) may be divided by the number of items 2. Further, the square root of the right side of Expression (12) may be taken as the value of X _pq . Furthermore, the right side of Expression (12) may be divided by the number of items 2 to obtain the square root of the value. In either case, the pattern recognition accuracy is not affected.

実施の形態２によれば、実施の形態１と同様に、多重共線性があるデータでも項目間の相関を考慮できるので、特徴項目間の相関が１の場合や項目の標準偏差が０の場合などに起因するデータ形式の制約を取り除くことができるとともに、従来の方法よりも的確な相関特徴項目を導入することができる。したがって実施の形態２によれば、パターン認識の精度を向上させることができる。 According to the second embodiment, as in the first embodiment, since the correlation between items can be taken into account even in data having multicollinearity, when the correlation between feature items is 1 or when the standard deviation of the items is 0 In addition to removing restrictions on the data format due to the above, it is possible to introduce more accurate correlation feature items than in the conventional method. Therefore, according to the second embodiment, the accuracy of pattern recognition can be improved.

さらに、上記の方法によって算出された評価距離は、特徴項目の基準化や相関係数行列の逆行列を経由せずに求めることができる。したがって実施の形態２では、図３に示されたステップＳ２，Ｓ３の処理を不要とすることができる。 Furthermore, the evaluation distance calculated by the above method can be obtained without going through the standardization of feature items and the inverse matrix of the correlation coefficient matrix. Therefore, in the second embodiment, the processing of steps S2 and S3 shown in FIG. 3 can be made unnecessary.

さらに、実施の形態２によれば、特徴項目ｐ，ｑの分散が０に非常に近い場合、あるいは、特徴項目ｐ，ｑの相関係数が１に非常に近い場合において、計算の桁落ちによるパターン認識精度の低下を防ぐことができる。 Furthermore, according to the second embodiment, when the variance of the feature items p and q is very close to 0, or when the correlation coefficient of the feature items p and q is very close to 1, the calculation items are lost. A decrease in pattern recognition accuracy can be prevented.

なお、新しい信号空間のデータセットを用いることを前提として上記の方法を説明した。しかし、単位空間でＲ_pq≠１、あるいはσ_p≠0となるような場合であれば、単位空間データだけから相関項目を作成したほうがパターン認識精度がよいケースもある。したがってこのような場合には、実施の形態１と同じく、図６に示したステップＳ１の処理において単位空間のデータセットのみを用い、単位空間のデータセットから相関項目を計算してもよい。 The above method has been described on the assumption that a new signal space data set is used. However, if R _pq ≠ 1 or σ _p ≠ 0 in the unit space, there are cases where the pattern recognition accuracy is better if the correlation item is created from the unit space data alone. Therefore, in such a case, as in the first embodiment, the correlation item may be calculated from the unit space data set by using only the unit space data set in the process of step S1 shown in FIG.

［実施の形態３］
実施の形態３では、評価距離として、項目Ｘ_pの基準化値と項目Ｘ_qとの基準化値との積（＝Ｘ_ｐ・Ｘ_q）が用いられる。この点において実施の形態３は実施の形態１，２と異なる。 [Embodiment 3]
In the third embodiment, the product (= X _p · X _q ) of the standardized value of the item X _{p and} the standardized value of the item X _q is used as the evaluation distance. In this respect, the third embodiment is different from the first and second embodiments.

なお、実施の形態３に係るパターン認識装置による処理の流れを説明するためのフローチャートは図３に示したフローチャートと基本的に同じであり、評価距離の具体的な算出方法の点において異なる。 Note that the flowchart for explaining the flow of processing by the pattern recognition apparatus according to the third embodiment is basically the same as the flowchart shown in FIG. 3, and differs in a specific method for calculating the evaluation distance.

また、実施の形態１，２と同様に、実施の形態３に係る方法では、新しい信号空間のデータセットを用いることもできるし、単位空間に属するデータセットのみを用いることもできる。 Similarly to the first and second embodiments, in the method according to the third embodiment, a data set of a new signal space can be used, or only a data set belonging to a unit space can be used.

実施の形態１において説明されるように、特徴項目の基準化値を求める場合には、原特徴項目の値から平均値を減算する。単位空間のデータセットのみを用いる場合には、単位空間のデータセットについて特徴項目の平均値が求められる。一方、単位空間のデータおよび単位空間に属さないデータを用いる場合は、単位空間のデータおよび単位空間に属さないデータを含めることで特徴項目の平均値が求められる。 As described in the first embodiment, when obtaining the standardized value of the feature item, the average value is subtracted from the value of the original feature item. When only the unit space data set is used, the average value of the feature items is obtained for the unit space data set. On the other hand, when using data in the unit space and data that does not belong to the unit space, the average value of the feature items is obtained by including the data in the unit space and the data that does not belong to the unit space.

基準化値の場合、項目ｐ，ｑのデータ中心は（０，０）の原点付近となる。このため項目ｐと項目ｑとの相関係数が正の場合は、基準価値積が正では回帰直線方向、負では回帰直線に垂直な方向への乖離を示すことになる（相関係数が負の場合は逆になるが結果的に係数βの符号によって修正される）。基準化値積はマハラノビス距離やＭＴＡ法の距離と似た性質をもっており、また計算が簡単になる。したがって事例によっては、実施の形態３による方法を用いた場合に、最もパターン認識精度が高くなることが期待できる。 In the case of the standardized value, the data center of the items p and q is near the origin of (0, 0). For this reason, when the correlation coefficient between the item p and the item q is positive, a deviation in the direction of the regression line is indicated when the reference value product is positive, and a deviation perpendicular to the regression line is indicated when the reference value product is negative (the correlation coefficient is negative). In the case of, the result is corrected by the sign of the coefficient β). The normalized product has properties similar to the Mahalanobis distance and the MTA method distance, and is easy to calculate. Therefore, in some cases, it can be expected that the pattern recognition accuracy is highest when the method according to the third embodiment is used.

［実施の形態４］
実施の形態４では、実施の形態１〜３に係るパターン認識方法のステップＳＢの処理おいて、Ｔ法（タグチ法）を用いて重み係数を算出する。 [Embodiment 4]
In the fourth embodiment, the weighting coefficient is calculated using the T method (Taguchi method) in the process of step SB of the pattern recognition method according to the first to third embodiments.

なお実施の形態４に係るパターン認識装置による処理の流れを説明するためのフローチャートは図３に示したフローチャートと基本的に同じである。以下、重み係数の算出方法を詳細に説明する。 The flowchart for explaining the flow of processing by the pattern recognition apparatus according to the fourth embodiment is basically the same as the flowchart shown in FIG. Hereinafter, the calculation method of the weighting coefficient will be described in detail.

まず、図２に示した重み係数算出部６３は、特徴項目Ｘ₁，Ｘ₂，・・・，Ｘ_K（原特徴項目＋相関特徴項目）と出力値ｙとに対して基準化を行なう。項目値の基準化の場合には、項目値からその項目の平均値が減算される。一方、出力値の基準化の場合には、出力値から、その出力値の平均値が減算される。なお、実施の形態１との区別のため、実施の形態４では、基準化後の項目値をＺ_ijと表記する。また、基準化後の出力値をＹ_iと表記する。 First, the weight coefficient calculation unit 63 shown in FIG. 2 performs normalization on the feature items X ₁ , X ₂ ,..., X _K (original feature item + correlated feature item) and the output value y. In the case of standardization of an item value, the average value of the item is subtracted from the item value. On the other hand, when the output value is normalized, the average value of the output values is subtracted from the output value. For distinction from the first embodiment, the item value after normalization is expressed as Z _{ij in the} fourth embodiment. Also, the normalized output value is denoted as Y _i .

基準化後の項目値Ｚ_ijおよび基準化後の出力値Ｙ_iは、以下の式（１３），（１４）に従ってそれぞれ表わされる。 The normalized item value Z _ij and the normalized output value Y _i are expressed according to the following equations (13) and (14), respectively.

項目ｊ（ｊ＝１，２，・・・，Ｋ)における特徴項目Ｚ_ijと出力Ｙ_iとの間の相関関係を示す尺度として、下記の式（１５）および式（１６）にそれぞれ示す田口の動特性のＳＮ比η_j、およびＺ_ijとＹ_iとの比例定数β_jを計算する。 Taguchi shown in the following equations (15) and (16) as scales indicating the correlation between the feature item Z _ij and the output Y _{i in the} item j (j = 1, 2,..., K) calculating a proportionality constant beta _j of the SN ratio eta _j, and Z _ij and Y _i of the dynamic characteristics.

ここに、 here,

である。
たとえば、単相関係数では±１に漸近するため、±１付近ではほとんど重みづけ係数に差がつかない。しかしながら、η_jは原理上いくらでも大きな値をとることができるため、重み付けの感度が高まり、より精度の高い予測関係式を作成できる。 It is.
For example, since the single correlation coefficient asymptotically approaches ± 1, there is almost no difference in the weighting coefficient in the vicinity of ± 1. However, since η _j can take as large a value as possible in principle, the sensitivity of weighting is increased, and a more accurate prediction relational expression can be created.

また、非特許文献５の方法ではηの値が用いられていない。これにより、Ｓ_β＜Ｖ_eの場合にη＝０となるので使用可能項目が少なくなる。実施の形態４による方法ではηの値を用いているので、このような問題を防ぐことができる。したがって実施の形態４によれば、より感度の高い重み付けを行なうことができるので、パターン認識の精度を向上させることができる。 In the method of Non-Patent Document 5, the value of η is not used. As a result, η = 0 in the case of S _β <V _e , so that usable items are reduced. Since the method according to Embodiment 4 uses the value of η, such a problem can be prevented. Therefore, according to the fourth embodiment, weighting with higher sensitivity can be performed, so that the accuracy of pattern recognition can be improved.

［実施の形態５］
実施の形態５においては、実施の形態４のパターン認識方法で用いられるＴ法での評価式のη_jは、式（２０）で示した平均の変動Ｓ_βjと、式（２１）で示した残差Ｓ_ejとの比である。η_jは、以下の式（２３）に従って表わされる。 [Embodiment 5]
In the fifth embodiment, η _j of the evaluation formula in the T method used in the pattern recognition method of the fourth embodiment is expressed by the average variation S _βj shown in the formula (20) and the formula (21). It is the ratio to the residual S _ej . η _j is expressed according to the following equation (23).

式（１５）において、Ｓ_βj−Ｖ_ej＜０の場合、η_j≡０と定義されている。これは出力の推定値を求めるための重み係数が負になるのは不適当であるためである。 In the equation (15), when S _βj −V _ej <0, it is defined as η _j ≡0. This is because it is inappropriate that the weight coefficient for obtaining the estimated value of the output becomes negative.

η_j≡０の場合、重み係数ｗ_jはｗ_j＝０となる。このため式（６）より明らかなように、特徴項目ｊは解析に使用されないことになる。 When η _j ≡0, the weight coefficient w _j is w _j = 0. Therefore, as is clear from the equation (6), the feature item j is not used for the analysis.

図７は、Ｓ_βj−Ｖ_ej＜０の場合のデータの例を示した図である。図７に示されるようなデータの場合、式（１５）における計算では、Ｓβ₁＝０．２４５、Ｖｅ₁＝０．２５１となり、ＳＮ比η₁の分子は負になる。しかし、このデータで単相関係数の有意性を検定すると、危険率５％で有意になる。 FIG. 7 is a diagram illustrating an example of data in the case of S _βj −V _ej <0. In the case of data as shown in FIG. 7, in the calculation in equation (15), Sβ ₁ = 0.245, Ve ₁ = 0.251 and the numerator of the SN ratio η ₁ becomes negative. However, when the significance of the single correlation coefficient is tested with this data, it becomes significant at a risk rate of 5%.

このように、η_j＜０であっても相関係数が有意になる場合があるが、このような特徴項目を当初から予測に加えないのは、予測精度向上の可能性を捨てていることになる。 In this way, even if η _j <0, the correlation coefficient may be significant. However, the reason for not adding such feature items to the prediction from the beginning is that the possibility of improving the prediction accuracy is discarded. become.

本実施の形態では、式（２３）によって定義されるη_jを用いることで、出力と特徴項目の相関係数との単調性（単相関係数が大きいほどＳＮ比も大きい）を保ちながら、負にならないη_j（ＳＮ比）を定義することができる。 In the present embodiment, by using η _j defined by Expression (23), while maintaining the monotonicity between the output and the correlation coefficient of the feature item (the larger the single correlation coefficient, the larger the SN ratio), Η _j (S / N ratio) that does not become negative can be defined.

図８は、特徴項目と出力との関係をあらわすデータと、そのときの単相関係数、実施の形態４〜６のη_jとを比較して説明する図である。図８に示されるように、実施の形態４ではＳＮ比すなわちη_jが負になる場合があるのに対し、実施の形態５ではη_jが常に正であることが示される。なお、実施の形態６によるη_jについては後述する。 FIG. 8 is a diagram for explaining the comparison between the data representing the relationship between the feature item and the output, the single correlation coefficient at that time, and η _{j in the} fourth to sixth embodiments. As shown in FIG. 8, the SN ratio, that is, η _j may be negative in the fourth embodiment, whereas η _j is always positive in the fifth embodiment. Note that η _j according to the sixth embodiment will be described later.

また、図８に示した単相関係数とη_jとの関係から、実施の形態５では出力と特徴項目の相関係数との単調性が確保されていることがわかる。 Further, from the relationship between the single correlation coefficient and η _j shown in FIG. 8, it is understood that the monotonicity between the output and the correlation coefficient of the feature item is secured in the fifth embodiment.

したがって実施の形態５によれば、すべての特徴項目を推定式（式（６））に用いることができるため、予測精度を向上させることができる。 Therefore, according to Embodiment 5, since all the feature items can be used for the estimation formula (Formula (6)), the prediction accuracy can be improved.

［実施の形態６］
実施の形態６では、実施の形態４に係るパターン認識方法において、上記Ｔ法における評価式のη_jを以下の式（２４）に従って与える。ここにｒ_jは項目ｘ_jと出力との単相関係数である。 [Embodiment 6]
In the sixth embodiment, in the pattern recognition method according to the fourth embodiment, the evaluation formula η _j in the T method is given according to the following formula (24). Here, r _j is a single correlation coefficient between the item x _j and the output.

単相関係数ｒ_jの２乗であるρ（＝ｒ_j ²）は、２変数（項目ｘ_jと出力ｙ）の寄与率を表す尺度である。したがって寄与率ρとその残差１−ρとの比は、上限のない正の値をとる。すなわち、上記の比は、実施の形態５におけるη_jと同じ性質を有する。 Ρ (= r _j ² ), which is the square of the single correlation coefficient r _j , is a scale representing the contribution ratio of two variables (item x _j and output y). Therefore, the ratio of the contribution ratio ρ and its residual 1−ρ takes a positive value without an upper limit. That is, the above ratio has the same property as η _{j in the} fifth embodiment.

図７に示されるように、実施の形態６ではη_jが常に正である。さらに単相関係数とη_jとの関係から、実施の形態６では出力と特徴項目の相関係数との単調性が確保されていることがわかる。 As shown in FIG. 7, in the sixth embodiment, η _j is always positive. Furthermore, it can be seen from the relationship between the simple correlation coefficient and η _j that the monotonicity between the output and the correlation coefficient of the feature item is secured in the sixth embodiment.

実施の形態６では、式（２４）を用いてη_jを算出することによって、実施の形態５と同じく、出力と特徴項目の相関係数との単調性を保ちながら、負にならないη_jを定義することができる。したがって実施の形態６によれば、実施の形態５と同様に、すべての特徴項目を推定式（式（６））に用いることができるため、予測精度を向上させることができる。 In the sixth embodiment, by calculating η _j using Expression (24), as in the fifth embodiment, η _j that is not negative is maintained while maintaining the monotonicity between the output and the correlation coefficient of the feature item. Can be defined. Therefore, according to the sixth embodiment, as in the fifth embodiment, since all feature items can be used in the estimation formula (formula (6)), the prediction accuracy can be improved.

［実施の形態７］
実施の形態７は、パターン認識装置および方法の具体的な適用である。なお以下の説明は、本発明による効果を具体的に説明できる一例を示すものであり、本発明の範囲を限定するものではない。 [Embodiment 7]
The seventh embodiment is a specific application of the pattern recognition apparatus and method. In addition, the following description shows an example which can demonstrate the effect by this invention concretely, and does not limit the scope of the present invention.

この実施の形態では、センサデバイスにおいて、常温の特性値を現特徴項目として用いて、高温の特性値が予測される。この場合、信号データの高温の特性値は、あらかじめ計測によって判明されており、これが解析の真値として用いられる。本実施の形態に係るパターン認識装置および方法を予測に用いる場合には、高温の特性値が未知のサンプルにおいて、常温の特性値から高温の特性値の予測が行なわれる。このような予測は、製造工程におけるオンラインの高温時特性値の計測を省略することができる。したがって調整や検査などの工数削減、および生産性向上に寄与するものである。 In this embodiment, in the sensor device, the characteristic value at high temperature is predicted using the characteristic value at room temperature as the current feature item. In this case, the high-temperature characteristic value of the signal data is previously determined by measurement, and this is used as the true value of the analysis. When the pattern recognition apparatus and method according to the present embodiment are used for prediction, a high-temperature characteristic value is predicted from a normal-temperature characteristic value in a sample whose high-temperature characteristic value is unknown. Such prediction can omit the online high-temperature characteristic value measurement in the manufacturing process. Therefore, it contributes to man-hour reduction such as adjustment and inspection, and to productivity improvement.

ここで、特性値とは調整工程や検査工程においてスペック（規格値）があり良否管理されるような主要な管理項目をいう。たとえば圧力センサであれば、入力圧力に対応する出力電圧値が、主要特性値となる。また、別の例では、光学系製品では照射位置に必要な光量分布が得られているかどうかが重要となるので、照射位置ごとの光量の大きさ、あるいいは光量の均一性などが主要特性値となる。 Here, the characteristic value is a main management item that has specifications (standard values) in the adjustment process and the inspection process and is managed in good or bad. For example, in the case of a pressure sensor, the output voltage value corresponding to the input pressure is the main characteristic value. In another example, since it is important for optical products to obtain the required light intensity distribution at the irradiation position, the main characteristics are the amount of light at each irradiation position, or the uniformity of the light intensity. Value.

なお、予測したい高温時の特性値および低温時の主要特性値の数は限定されず、いくつあってもよい。たとえば光学系製品における特性値である光量分布の場合には、図８に示されるように、光の照射位置（座標）ごとに特性値を管理する必要がある。 Note that the number of characteristic values at high temperatures and the number of main characteristic values at low temperatures to be predicted is not limited, and may be any number. For example, in the case of a light amount distribution which is a characteristic value in an optical system product, it is necessary to manage the characteristic value for each light irradiation position (coordinate) as shown in FIG.

ここでは、その複数ありうる特性値の１つずつを予測する予測関係式を作成するものとする。その予測される特性値の１つを、ここでは出力ｙとして説明する。 Here, it is assumed that a prediction relational expression for predicting each of the plurality of possible characteristic values is created. One of the predicted characteristic values is described here as output y.

図９を参照して、原特徴項目として、特性値カーブの各入力座標（ｐ₁，ｐ₂，・・・，ｐ_l）における常温時の実測値、特性値カーブのピークでの値およびその入力座標、特性値カーブのボトムでの値およびその入力座標、ピークとボトムとの値の差、特性値カーブのピークでの入力座標と特性値カーブのボトムでの入力座標との差などの２４の特徴項目を選んだ。上記以外の特性として、たとえば、カーブに複数の水平線を引いたときの交点の数あるいは交点間の距離の和などを採用するといった手法を用いることができる。 Referring to FIG. 9, as an original feature item, the measured value at the normal temperature at each input coordinate (p ₁ , p ₂ ,..., P _l ) of the characteristic value curve, the value at the peak of the characteristic value curve, and its Input coordinates, values at the bottom of the characteristic value curve and their input coordinates, the difference between the peak and bottom values, the difference between the input coordinates at the peak of the characteristic value curve and the input coordinates at the bottom of the characteristic value curve, etc. Selected feature items. As a characteristic other than the above, for example, a method of adopting the number of intersections when a plurality of horizontal lines are drawn on a curve or the sum of distances between the intersections can be used.

なお、原特徴項目に何を採用すべきかということは、個々の技術に類する問題であるので、個別に研究および判断すべき問題である。すなわち原特徴項目の種類によって本発明が限定されるものではない。 In addition, what should be adopted as the original feature item is a problem similar to each technology, and therefore is a problem to be studied and judged individually. That is, the present invention is not limited by the type of original feature item.

次に、選択した２４項目を固定して、複数の方法を比較した。具体的には、信号空間データとして、真値（高温での特性値）が判明している、さまざまな出力特性をもつ２０種類のデータセットを用意した。データセット数が原項目数よりも少ないため、公知の重回帰分析手法、あるいはＭＴシステムにおけるＭＴ法、ＭＴＡ法などを用いることができない。このため、（ａ）田口のＴ法を用いた場合、（ｂ）非特許文献５の方法を用いた場合、（ｃ）本発明の実施の形態１〜３の各々に係る方法を用いた場合について比較した。 Next, the selected 24 items were fixed, and a plurality of methods were compared. Specifically, 20 types of data sets having various output characteristics whose true values (characteristic values at high temperature) are known are prepared as signal space data. Since the number of data sets is smaller than the number of original items, it is not possible to use a known multiple regression analysis method, or the MT method or MTA method in the MT system. For this reason, (a) when using Taguchi's T method, (b) when using the method of Non-Patent Document 5, (c) when using the method according to each of Embodiments 1 to 3 of the present invention Compared.

なお、（ｂ），（ｃ）の方法では、信号空間のデータを用いて、２４個の原特徴項目（ｋ＝２４）から２７６（＝_kＣ₂＝｛ｋ・（ｋ−１）／２｝＝２４×２３／２）の相関特徴項目を作成し、原特徴項目に加えた（合計で３００個の特徴項目）。（ｃ）においてはさらに、重み係数として、実施の形態５における重み係数を用いた。 In the methods (b) and (c), 24 original feature items (k = 24) to 276 (= _k C ₂ = {k · (k−1) / 2) are used using signal space data. } = 24 × 23/2) correlation feature items were created and added to the original feature items (a total of 300 feature items). In (c), the weighting factor in the fifth embodiment is used as the weighting factor.

比較評価の結果、重み係数ηを計算する過程で、（ａ）の方法では２４項目中２項目、（ｂ）の方法では３００項目中３４項目がη＜０となり、解析にこれらの項目を使用することができなかった。一方、（ｃ）の方法すなわち本発明の実施の形態による方法ではη＜０となる項目は存在しなかった。 As a result of the comparative evaluation, in the process of calculating the weighting coefficient η, 2 out of 24 items in the method (a) and 34 out of 300 items in the method (b) are η <0, and these items are used for the analysis. I couldn't. On the other hand, in the method (c), that is, the method according to the embodiment of the present invention, there was no item satisfying η <0.

η＞０となる項目（（ａ）の方法では２２項目、（ｂ）の方法では２６６項目、（ｃ）の方法では３００項目）を使用して、信号空間データにおける高温時特性値の予測値と真値の相関係数とを複数の入力ケースの総合で比較した。 Using items for which η> 0 (22 items in the method (a), 266 items in the method (b), 300 items in the method (c)), the predicted value of the characteristic value at high temperature in the signal space data And the correlation coefficient of true values were compared in the total of multiple input cases.

図１０は、田口のＴ法、非特許文献５の方法および本発明の実施の形態１〜３の各々に係る方法の比較結果を示した図である。図１０に示されるように、（ｃ）による方法によって得られた相関係数が最も高くなった。これにより、本発明の実施の形態によれば、真値をより正確に予測できることが示される。 FIG. 10 is a diagram showing a comparison result of the Taguchi T method, the method of Non-Patent Document 5, and the method according to each of Embodiments 1 to 3 of the present invention. As shown in FIG. 10, the correlation coefficient obtained by the method according to (c) was the highest. Thus, according to the embodiment of the present invention, it is shown that the true value can be predicted more accurately.

今回開示された実施の形態はすべての点で例示であって制限的なものでないと考えられるべきである。本発明の範囲は上記した説明ではなくて特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内でのすべての変更が含まれることが意図される。 The embodiment disclosed this time must be considered as illustrative in all points and not restrictive. The scope of the present invention is defined by the terms of the claims, rather than the description above, and is intended to include any modifications within the scope and meaning equivalent to the terms of the claims.

１単位空間データ、２単回帰直線、１１直線、２０同心楕円、５０パターン認識装置、５１ＣＰＵ、５２主記憶装置、５３補助記憶装置、５４入力装置、５５出力装置、５６通信装置、６１データ入力部、６２評価距離算出部、６３重み係数算出部、６４予測値算出部、６５判断部、６６出力部。 1 unit space data, 2 single regression line, 11 line, 20 concentric ellipse, 50 pattern recognition device, 51 CPU, 52 main storage device, 53 auxiliary storage device, 54 input device, 55 output device, 56 communication device, 61 data input Unit, 62 evaluation distance calculation unit, 63 weight coefficient calculation unit, 64 predicted value calculation unit, 65 determination unit, 66 output unit.

Claims

A pattern recognition device for performing pattern recognition based on a plurality of data each having a plurality of original feature items,
For each of the data, an evaluation distance calculation unit that calculates an evaluation distance for all combinations of selecting two original feature items from the plurality of original feature items;
For each of the data, a plurality of new feature items are generated by adding at least some of the evaluation distances calculated corresponding to all the combinations to the plurality of original feature items, A weighting factor calculating unit that calculates a weighting factor representing a correlation between each value of the new feature item and the true value of the new feature item;
A predicted value calculation unit that calculates a predicted value of output using the weighting factor for each new feature item;
A judgment unit that compares the predicted value with a predetermined threshold value and makes a judgment on a purpose;
The evaluation distance calculation unit is a pattern recognition device that calculates a Mahalanobis distance as the evaluation distance.

A pattern recognition device for performing pattern recognition based on a plurality of data each having a plurality of original feature items,
For each of the data, an evaluation distance calculation unit that calculates an evaluation distance for all combinations of selecting two original feature items from the plurality of original feature items;
For each of the data, a plurality of new feature items are generated by adding at least a part of the evaluation distance calculated corresponding to all the combinations to the plurality of original feature items, and A weighting factor calculating unit that calculates a weighting factor representing a correlation between each value of the new feature item and the true value of the new feature item;
A predicted value calculation unit that calculates a predicted value of output using the weighting factor for each new feature item;
A judgment unit that compares the predicted value with a predetermined threshold value and makes a judgment on a purpose;
The said evaluation distance calculation part is a pattern recognition apparatus which calculates the evaluation distance used by MTA (Maharanobis Taguchi Adjoint) method as said evaluation distance.

A pattern recognition device for performing pattern recognition based on a plurality of data each having a plurality of original feature items,
For each of the data, an evaluation distance calculation unit that calculates an evaluation distance for all combinations of selecting two original feature items from the plurality of original feature items;
For each of the data, a plurality of new feature items are generated by adding at least a part of the evaluation distance calculated corresponding to all the combinations to the plurality of original feature items, and A weighting factor calculating unit that calculates a weighting factor representing a correlation between each value of the new feature item and the true value of the new feature item;
A predicted value calculation unit that calculates a predicted value of output using the weighting factor for each new feature item;
A judgment unit that compares the predicted value with a predetermined threshold value and makes a judgment on a purpose;
The said evaluation distance calculation part is a pattern recognition apparatus which calculates the product of the normalized value of a 1st feature item, and the normalized value of a 2nd feature item as said evaluation distance.

4. The pattern recognition apparatus according to claim 1, wherein the plurality of data includes both unit space data and signal space data. 5.

The pattern recognition apparatus according to claim 1, wherein the plurality of data includes only unit space data.

The pattern recognition apparatus according to claim 1, wherein the weighting factor calculation unit calculates an SN ratio in a T method (Taguchi method) as the weighting factor.

Wherein the SN ratio as eta, the average of the variation and S _beta, when the residual and S _e, the SN ratio is defined by η = S β _/ S _e, the pattern recognition apparatus according to claim 6.

When the SN ratio is η and the single correlation coefficient between the value of the new feature item and the true value of the new feature item is r, the SN ratio is η = r ² / (1−r ^2). The pattern recognition device according to claim 6, defined by:

A pattern recognition method for performing pattern recognition based on a plurality of data each having a plurality of original feature items,
Calculating an evaluation distance for all combinations of selecting two original feature items from the plurality of original feature items for each of the data;
Generating a plurality of new feature items by adding at least a part of the evaluation distance calculated corresponding to all the combinations to the plurality of original feature items for each of the data;
Calculating a weighting coefficient representing the correlation between the value of each of the plurality of new feature items and the true value of the new feature item;
Using the weighting factor for each new feature item to calculate a predicted output value;
Comparing the predicted value with a predetermined threshold value and making a determination on the purpose,
The pattern recognition method of calculating a Mahalanobis distance as the evaluation distance in the step of calculating the evaluation distance.

A pattern recognition method for performing pattern recognition based on a plurality of data each having a plurality of original feature items,
Calculating an evaluation distance for all combinations of selecting two original feature items from the plurality of original feature items for each of the data;
Generating a plurality of new feature items by adding at least a part of the evaluation distance calculated corresponding to all the combinations to the plurality of original feature items for each of the data;
Calculating a weighting coefficient representing the correlation between the value of each of the plurality of new feature items and the true value of the new feature item;
Using the weighting factor for each new feature item to calculate a predicted output value;
Comparing the predicted value with a predetermined threshold value and making a determination on the purpose,
A pattern recognition method, wherein, in the step of calculating the evaluation distance, an evaluation distance used by an MTA (Mahalanobis Taguchi Adjoint) method is calculated as the evaluation distance.

A pattern recognition method for performing pattern recognition based on a plurality of data each having a plurality of original feature items,
Calculating an evaluation distance for all combinations of selecting two original feature items from the plurality of original feature items for each of the data;
Generating a plurality of new feature items by adding at least a part of the evaluation distance calculated corresponding to all the combinations to the plurality of original feature items for each of the data;
Calculating a weighting coefficient representing the correlation between the value of each of the plurality of new feature items and the true value of the new feature item;
Using the weighting factor for each new feature item to calculate a predicted output value;
Comparing the predicted value with a predetermined threshold value and making a determination on the purpose,
The pattern recognition method, wherein, in the step of calculating the evaluation distance, a product of a normalized value of the first feature item and a normalized value of the second feature item is calculated as the evaluation distance.

The pattern recognition method according to claim 9, wherein the plurality of data includes both unit space data and signal space data.

The pattern recognition method according to claim 9, wherein the plurality of data includes only unit space data.

The pattern recognition method according to claim 9, wherein, in the step of calculating the weighting factor, an SN ratio in a T method (Taguchi method) is calculated as the weighting factor.

Wherein the SN ratio and eta, the variation of the average and S _beta, when the residual and S _e, the SN ratio is defined by η = S β _/ S _e, the pattern recognition method according to claim 14.

When the SN ratio is η and the single correlation coefficient between the value of the new feature item and the true value of the new feature item is r, the SN ratio is η = r ² / (1−r ^2). The pattern recognition method according to claim 14, defined by: