JPH1115969A

JPH1115969A - Method and device for recognition and recording medium recording computer program realizing the device

Info

Publication number: JPH1115969A
Application number: JP9185845A
Authority: JP
Inventors: Hisataka Yamagishi; 久高山岸; Yasuhiro Tsutsumi; 康弘堤; Seikou Rou; 世紅労
Original assignee: Omron Corp; Omron Tateisi Electronics Co
Current assignee: Omron Corp
Priority date: 1997-06-26
Filing date: 1997-06-26
Publication date: 1999-01-22
Anticipated expiration: 2017-06-26
Also published as: JP3570604B2

Abstract

PROBLEM TO BE SOLVED: To improve recognition precision by introducing the concept of goodness-of-fit that is defined by a coordinate point in a cluster to a recognition system that uses the cluster. SOLUTION: A cluster having goodness-of-fit generating part (c) describes graphics (e.g. ellipse) including all of them based on cluster information (b) that is information of data which constitute a cluster, adds goodness-of-fit to axis end points and the center of the ellipse and further changes (learn) the goodness-of-fit. A generated cluster (g) to which goodness-of-fit is already added is transferred to a recognition system (h). The system (h) decides which category an unknown input belongs to by using the cluster (g) that is transferred from the part (c). A degree of belonging operating part (1) calculates the degree of belonging when unknown input data is included in a certain cluster. The larger calculated degree of belonging is, the higher the degree of belonging to a category to which the cluster belongs is.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、例えば、手書き文
字のパターン認識等に好適な認識精度の高い認識方法及
び装置、並びに、同装置を実現するためのコンピュータ
プログラムを記録させた記録媒体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and an apparatus having a high recognition accuracy suitable for, for example, pattern recognition of handwritten characters, and a recording medium storing a computer program for realizing the apparatus.

【０００２】[0002]

【従来の技術】従来、未知の入力されたパターンと特定
カテゴリに属する幾つかの標準パターンとを比較するこ
とで認識を行うようにしたパターン認識システムが知ら
れている。ところで、このようなシステムにあっては、
同一カテゴリに属するとは言え、それら複数の標準パタ
ーンにはある程度のばらつきがあるのが通例である。そ
の理由は、各パターンがｎ個の特徴により構成される場
合、ｎ次元空間にその点は存在するのであるが、あるカ
テゴリの標準パターンを数種用意したとき、そのｎ次元
の座標は必ずしも完全に一致するわけではなく、座標値
のほとんどが異なるのが一般的だからである。2. Description of the Related Art Conventionally, there has been known a pattern recognition system which performs recognition by comparing an unknown input pattern with some standard patterns belonging to a specific category. By the way, in such a system,
Although they belong to the same category, the plurality of standard patterns usually vary to some extent. The reason is that when each pattern is composed of n features, the point exists in the n-dimensional space, but when several types of standard patterns of a certain category are prepared, the n-dimensional coordinates are not always perfect. This is because most of the coordinate values generally differ from each other.

【０００３】そのため、このようなパターン認識システ
ムにあっては、そのような標準パターンのばらつきをカ
バーするようなクラスタ（集合体）を生成することによ
り、認識精度の向上が図られている。例えば、ｍ個のカ
テゴリがあり、各々に１つのクラスタが生成されると想
定すると、全体ではｍ個のクラスタが生成されることに
なる。Therefore, in such a pattern recognition system, recognition accuracy is improved by generating a cluster (aggregate) that covers such a variation in the standard pattern. For example, assuming that there are m categories and one cluster is generated for each category, a total of m clusters will be generated.

【０００４】そして、未知の入力パターンに対しては、
ｍ個のクラスタのどれに所属するかが調べられ、いずれ
かのクラスタに属することが判定されれば、その未知の
入力パターンはその所属したクラスタのカテゴリに属す
るものと無条件に決定されるようになっていた。Then, for an unknown input pattern,
It is checked which of the m clusters belongs to, and if it is determined that the input pattern belongs to any of the clusters, the unknown input pattern is unconditionally determined to belong to the category of the cluster to which it belongs. Had become.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、このよ
うな従来のパターン認識システムにあっては、未知の入
力パターンがいずれかのクラスタに属するものと判定さ
れると、その入力パターンは、判定されたクラスタに対
応するカテゴリに属するものと無条件に決定されてしま
うため、あるカテゴリ（Ａ）に属する入力パターンが、
たまたま他のカテゴリ（Ｂ）にも類似する結果、そのカ
テゴリ（Ｂ）のクラスタに属すると誤判定されると、そ
のカテゴリ（Ａ）に属する入力パターンは、カテゴリ
（Ｂ）に属するものと誤って決定されてしまうおそれが
あった。However, in such a conventional pattern recognition system, when an unknown input pattern is determined to belong to any one of the clusters, the input pattern is determined. Since the input pattern belonging to a certain category (A) is unconditionally determined as belonging to the category corresponding to the cluster,
As a result of being similar to the other category (B), if it is erroneously determined that the input pattern belongs to the cluster of the category (B), the input pattern belonging to the category (A) is erroneously determined to belong to the category (B). There was a risk of being decided.

【０００６】また、上述のパターン認識システムにあっ
ては、ｍ個のクラスタには重複した領域が全く存在しな
いとは言い難く、ある未知の入力パターンが複数のクラ
スタが跨った領域に存在する場合、どのクラスタ（カテ
ゴリ）に属するかの判定が非常に困難であった。Further, in the above-described pattern recognition system, it is difficult to say that there are no overlapping regions in the m clusters, and a certain unknown input pattern exists in a region where a plurality of clusters straddle. It is very difficult to determine which cluster (category) belongs.

【０００７】この発明は、このような従来の問題点を解
決するためになされたものであり、その目的とするとこ
ろは、この種のパターン認識システムにおける認識精度
を向上させることにある。The present invention has been made to solve such a conventional problem, and an object of the present invention is to improve recognition accuracy in this type of pattern recognition system.

【０００８】[0008]

【課題を解決するための手段】未知の入力データがある
クラスタ内部に含まれた場合、その所属度を演算するた
め、クラスタ内部の適当な場所に適合度を保持する。こ
の適合度は、誤認識データにより学習がおこなわれる。
この学習により、クラスタに含まれる誤認識データの存
在する領域の適合度が下げられることになる。ある領域
の適合度が下げられると次のような効果が得られる。When unknown input data is included in a certain cluster, the degree of conformity is held at an appropriate place inside the cluster in order to calculate the degree of belonging. The degree of matching is learned based on the misrecognition data.
By this learning, the degree of conformity of the area where the erroneously recognized data included in the cluster exists is reduced. When the degree of conformity of a certain area is reduced, the following effects can be obtained.

【０００９】・Ａというカテゴリのクラスタが生成され
ていたとする。このとき、Ｂというカテゴリの入力デー
タがＡのカテゴリのデータに類似していることが多く、
誤ってＡのクラスタに入ってしまう状況を考える。しか
し、あらかじめＢをＡに誤認識したデータを準備し、Ａ
のクラスタの適合度を学習させておくと、ＢをＡに誤認
識したデータが入る領域の適合度が低く変更される。よ
って、未知の入力データが、ＢをＡに誤認識したデータ
が存在するＡのクラスタの領域に入った場合、所属度
（適合度の合成）は低く算出され、Ａのクラスタに入っ
たがＡの可能性は低いと判定することができる。つまり
誤判定を減少することができる。Suppose that a cluster of category A has been generated. At this time, the input data of the category B is often similar to the data of the category A,
Consider a situation in which the user accidentally enters the cluster A. However, data in which B is erroneously recognized as A is prepared in advance, and A
If the fitness of the cluster is learned, the fitness of the region in which the data in which B is erroneously recognized as A is changed to be low. Therefore, when the unknown input data enters the area of the cluster of A in which the data in which B is erroneously recognized as A exists, the degree of belonging (combination of the degrees of matching) is calculated to be low, and the input of the unknown data enters the cluster of A. Can be determined to be low. That is, erroneous determination can be reduced.

【００１０】・Ｃ、Ｄというカテゴリのクラスタが生成
され、この２つのクラスタの一部の領域が重複している
状況を考える。ＣをＤに誤認識したデータが多く存在す
ることが分かっていたため、学習時に利用した誤認識デ
ータは、ＣをＤに誤認識したデータだけであり、このデ
ータを学習に使用することで、ＤのクラスタのＣと重な
る段階の適合度が下げられることになったと仮定する。
このとき未知の入力データが重複領域に入った場合にＣ
のカテゴリで算出される適合度が高く、Ｄのカテゴリで
算出される適合度が低くなる。これにより、「Ｃ、Ｄの
カテゴリに同時に含まれるから不明」とすることが無く
なり、誤判定をせずに認識率向上が図れる。[0010] Consider a situation in which clusters of categories C and D are generated, and some areas of these two clusters overlap. Since it was known that there were many data in which C was erroneously recognized as D, the only erroneously recognized data used in learning was data in which C was erroneously recognized as D. By using this data for learning, D Suppose that the fitness of the stage that overlaps C of the cluster is reduced.
At this time, if unknown input data enters the overlapping area, C
Of the category D is high, and the degree of fitness calculated in the category D is low. As a result, "unknown because they are simultaneously included in the categories C and D" is eliminated, and the recognition rate can be improved without erroneous determination.

【００１１】すなわち、この出願の請求項１に記載の発
明は、入力データと各カテゴリに対応するクラスタとを
特徴量空間で照合することにより、当該入力データがど
のクラスタに内包されるかを判定するステップと、前記
入力データを内包すると判定されるクラスタに関し、当
該クラスタ内の座標位置と関連付けてあらかじめ記憶さ
れた適合度情報を参照することにより、当該クラスタに
関する前記入力データの所属度を求めるステップと、前
記求められたクラスタに関する所属度に基づいて、当該
入力データの属するカテゴリを認識するステップと、を
具備することを特徴とする認識方法にある。That is, the invention according to claim 1 of the present application determines which cluster includes the input data by comparing the input data with the cluster corresponding to each category in the feature amount space. Determining the degree of affiliation of the input data with respect to the cluster by referring to the fitness degree information stored in advance in association with the coordinate position in the cluster with respect to the cluster determined to include the input data. And a step of recognizing a category to which the input data belongs based on the obtained degree of belonging to the cluster.

【００１２】また、この出願の請求項２に記載の発明
は、カテゴリ認識結果が既知の入力データを学習するこ
とにより、前記適合度情報を変化させるステップを有す
ることを特徴とする請求項１に記載の認識方法にある。Further, the invention according to claim 2 of the present application has a step of changing the fitness information by learning input data whose category recognition result is known. It is in the recognition method described.

【００１３】また、この出願の請求項３に記載の発明
は、前記カテゴリを認識するステップは、入力データを
内包すると判定されるクラスタが複数存在する場合に
は、それらのクラスタの中で最大所属度を有する唯一の
クラスタを当該入力データの属するクラスタとして認識
することを特徴とする請求項１に記載の認識方法にあ
る。[0013] In the invention according to claim 3 of the present application, the step of recognizing the category includes, when there are a plurality of clusters determined to include the input data, the largest belonging cluster among the clusters. 2. The method according to claim 1, wherein the only cluster having a degree is recognized as a cluster to which the input data belongs.

【００１４】また、この出願の請求項４に記載の発明
は、前記カテゴリを認識するステップは、当該入力デー
タの属するクラスタを認識するに際して、所属度の大き
さを基準値と比較することにより、「不明」、「××カ
テゴリに含まれるが確信は持てない」、「確信を持って
××クラスタに含まれるといえる」の３通りの認識を行
うことを特徴とする請求項１に記載の認識方法にある。Further, in the invention according to claim 4 of the present application, the step of recognizing the category includes, when recognizing a cluster to which the input data belongs, comparing the degree of belonging to a reference value. 3. The recognition according to claim 1, wherein three types of recognition are performed: "unknown", "included in the xx category but cannot be convinced", and "can be said to be confidently included in the xx cluster". In the recognition method.

【００１５】また、この出願の請求項５に記載の発明
は、前記適合度情報は、クラスタ内部の特定座標位置に
おける適合度と、該適合度を基準としてクラスタ内部の
それ以外の座標位置における適合度を表現するメンバシ
ップ関数とからなることを特徴とする請求項１に記載の
認識方法にある。In the invention described in claim 5 of the present application, the fitness information includes a fitness at a specific coordinate position inside the cluster and a fitness at other coordinate positions inside the cluster based on the fitness. 2. The recognition method according to claim 1, comprising a membership function expressing a degree.

【００１６】また、この出願の請求項６に記載の発明
は、前記適合度情報を構成するクラスタ内部の特定座標
位置とは、クラスタをある図形で囲んだ場合における、
その図形を構成する軸上の位置及びその図形の中心位置
であることを特徴とする請求項５に記載の認識方法にに
ある。Further, according to the invention described in claim 6 of the present application, the specific coordinate position inside the cluster constituting the fitness information is defined as:
6. The recognition method according to claim 5, wherein a position on an axis constituting the figure and a center position of the figure are provided.

【００１７】また、この出願の請求項７に記載の発明
は、クラスタが作成される空間が２次元の場合、クラス
タを囲む図形としては、円、楕円、四角形のいずれかが
用いられ、かつ、前記クラスタ内部の特定座標位置と
は、それらの図形の中心および軸の端点の計５点とされ
ることを特徴とする請求項６に記載の認識方法にある。Further, according to the invention described in claim 7 of the present application, when a space where a cluster is created is two-dimensional, any of a circle, an ellipse, and a rectangle is used as a figure surrounding the cluster, and 7. The recognition method according to claim 6, wherein the specific coordinate positions inside the cluster are a total of five points, that is, a center of the figures and end points of axes.

【００１８】また、この出願の請求項８に記載の発明
は、クラスタが作成される空間が３次元の場合、クラス
タを囲む図形としては、球、楕円体、直方体のいずれか
が用いられ、かつ前記クラスタ内部の特定座標位置と
は、それらの図形の中心および軸の端点の計７点とされ
ることを特徴とする請求項６に記載の認識方法にある。According to the invention described in claim 8 of the present application, when the space where the cluster is created is three-dimensional, any of a sphere, an ellipsoid, and a rectangular parallelepiped is used as a figure surrounding the cluster, and 7. The recognition method according to claim 6, wherein the specific coordinate position inside the cluster is a total of seven points including a center of each figure and an end point of an axis.

【００１９】また、この出願の請求項９に記載の発明
は、クラスタが作成される空間がｎ次元の場合、クラス
タを囲む図形としては、超円、超楕円体、超平面体のい
ずれかが用いられ、かつ前記クラスタ内部の特定座標位
置とは、それらの図形の中心および軸の端点の計（２ｎ
＋１）点とされることを特徴とする請求項６に記載の認
識方法にある。Further, according to the invention as set forth in claim 9 of the present application, when the space in which the cluster is created is n-dimensional, any one of a hypercircle, a hyperellipsoid, and a hyperplane is used as a figure surrounding the cluster. The specific coordinate position used in the cluster is defined as the total (2n
The recognition method according to claim 6, wherein a point is set to +1).

【００２０】また、この出願の請求項１０に記載の発明
は、前記入力データの学習により適合度情報を変化させ
るステップは、学習対象のクラスタに正しく入ることが
わかっているデータを学習データとして採用し、かつ前
記特定座標位置の適合度初期値を０（あるいは適合度と
しての最小値）とし、学習データを与えることで、その
データが含まれる領域を囲む適合度を上昇させていくも
のであることを特徴とする請求項２に記載の認識方法に
ある。Further, in the invention according to claim 10 of the present application, the step of changing the fitness information by learning the input data employs, as learning data, data that is known to correctly enter a learning target cluster. In addition, by setting the initial value of the fitness of the specific coordinate position to 0 (or the minimum value as the fitness) and providing learning data, the fitness of surrounding the area including the data is increased. 3. The recognition method according to claim 2, wherein:

【００２１】また、この出願の請求項１１に記載の発明
は、前記入力データの学習により適合度情報を変化させ
るステップは、学習対象のクラスタに誤って（異なるカ
テゴリのデータなのに入ってしまう）入ることがわかっ
ているデータを学習データとして採用し、かつ前記特定
座標位置の適合度初期値を１（あるいは適合度としての
最高値）とし、学習データを与えることで、そのデータ
が含まれる領域を囲む適合度を下降させていくものであ
ることを特徴とする請求項２に記載の認識方法。Further, in the invention according to claim 11 of the present application, the step of changing the fitness information by learning the input data erroneously enters the cluster to be learned (enters data of a different category). By adopting data that is known to be the learning data, setting the initial value of the fitness of the specific coordinate position to 1 (or the highest value as the fitness), and providing the learning data, the area including the data is reduced. 3. The recognition method according to claim 2, wherein the degree of surrounding suitability is decreased.

【００２２】また、この出願の請求項１２に記載の発明
は、入力データと各カテゴリに対応するクラスタとを特
徴量空間で照合することにより、当該入力データがどの
クラスタに内包されるかを判定する手段と、前記入力デ
ータを内包すると判定されるクラスタに関し、当該クラ
スタ内の座標位置と関連付けてあらかじめ記憶された適
合度情報を参照することにより、当該クラスタに関する
前記入力データの所属度を求める手段と、前記求められ
たクラスタに関する所属度に基づいて、当該入力データ
の属するカテゴリを認識する手段と、を具備することを
特徴とする認識装置にある。In the invention according to claim 12 of the present application, by comparing input data with clusters corresponding to respective categories in a feature amount space, it is determined which cluster the input data is included in. Means for determining the degree of affiliation of the input data with respect to the cluster by referring to the fitness information stored in advance in association with the coordinate position in the cluster with respect to the cluster determined to include the input data And means for recognizing a category to which the input data belongs based on the degree of affiliation with respect to the obtained cluster.

【００２３】また、この出願の請求項１３に記載の発明
は、カテゴリ認識結果が既知の入力データを学習するこ
とにより、前記適合度情報を変化させる手段を有するこ
とを特徴とする請求項１２に記載の認識装置にある。According to a thirteenth aspect of the present invention, there is provided the invention according to the twelfth aspect, further comprising means for changing the fitness information by learning input data whose category recognition result is known. In the recognition device described.

【００２４】また、この出願の請求項１４に記載の発明
は、前記カテゴリを認識する手段は、入力データを内包
すると判定されるクラスタが複数存在する場合には、そ
れらのクラスタの中で最大所属度を有する唯一のクラス
タを当該入力データの属するクラスタとして認識するこ
とを特徴とする請求項１２に記載の認識装置にある。[0024] In the invention according to claim 14 of the present application, the means for recognizing the category includes, when there are a plurality of clusters determined to include the input data, the largest belonging cluster among those clusters. 13. The apparatus according to claim 12, wherein the only cluster having a degree is recognized as a cluster to which the input data belongs.

【００２５】また、この出願の請求項１５に記載の発明
は、前記カテゴリを認識する手段は、当該入力データの
属するクラスタを認識するに際して、所属度の大きさを
基準値と比較することにより、「不明」、「××カテゴ
リに含まれるが確信は持てない」、「確信を持って××
クラスタに含まれるといえる」の３通りの認識を行うこ
とを特徴とする請求項１２に記載の認識装置にある。In the invention according to claim 15 of the present application, the means for recognizing the category compares the magnitude of the degree of affiliation with a reference value when recognizing the cluster to which the input data belongs. "Unknown", "Included in xx category but uncertain", "XX with certainty
13. The recognition apparatus according to claim 12, wherein three kinds of recognition of "can be said to be included in a cluster" are performed.

【００２６】また、この出願の請求項１６に記載の発明
は、前記適合度情報は、クラスタ内部の特定座標位置に
おける適合度と、該適合度を基準としてクラスタ内部の
それ以外の座標位置における適合度を表現するメンバシ
ップ関数とからなることを特徴とする請求項１２に記載
の認識装置にある。In the invention according to claim 16 of the present application, the fitness information includes a fitness at a specific coordinate position inside the cluster and a fitness at other coordinate positions inside the cluster based on the fitness. 13. The recognition apparatus according to claim 12, comprising a membership function expressing a degree.

【００２７】また、この出願の請求項１７に記載の発明
は、前記適合度情報を構成するクラスタ内部の特定座標
位置とは、クラスタをある図形で囲んだ場合における、
その図形を構成する軸上の位置及びその図形の中心位置
であることを特徴とする請求項１６に記載の認識装置に
ある。Further, in the invention according to claim 17 of the present application, the specific coordinate position inside the cluster constituting the fitness information is defined as:
17. The recognition apparatus according to claim 16, wherein the position is on an axis constituting the figure and the center position of the figure.

【００２８】また、この出願の請求項１８に記載の発明
は、クラスタが作成される空間が２次元の場合、クラス
タを囲む図形としては、円、楕円、四角形のいずれかが
用いられ、かつ、前記クラスタ内部の特定座標位置と
は、それらの図形の中心および軸の端点の計５点とされ
ることを特徴とする請求項１７に記載の認識装置にあ
る。Further, according to the invention described in claim 18 of the present application, when the space in which the cluster is created is two-dimensional, any of a circle, an ellipse, and a rectangle is used as a figure surrounding the cluster, and 18. The recognition apparatus according to claim 17, wherein the specific coordinate positions inside the cluster are a total of five points, that is, the center of the figures and the end points of the axes.

【００２９】また、この出願の請求項１９に記載の発明
は、クラスタが作成される空間が３次元の場合、クラス
タを囲む図形としては、球、楕円体、直方体のいずれか
が用いられ、かつ前記クラスタ内部の特定座標位置と
は、それらの図形の中心および軸の端点の計７点とされ
ることを特徴とする請求項１７に記載の認識装置にあ
る。Further, according to the invention described in claim 19 of the present application, when the space where the cluster is created is three-dimensional, any of a sphere, an ellipsoid, and a rectangular parallelepiped is used as a figure surrounding the cluster, and 18. The recognition apparatus according to claim 17, wherein the specific coordinate position inside the cluster is a total of seven points including the center of the figure and the end point of the axis.

【００３０】この出願の請求項２０に記載の発明は、ク
ラスタが作成される空間がｎ次元の場合、クラスタを囲
む図形としては、超円、超楕円体、超平面体のいずれか
が用いられ、かつ前記クラスタ内部の特定座標位置と
は、それらの図形の中心および軸の端点の計（２ｎ＋
１）点とされることを特徴とする請求項１７に記載の認
識装置にある。According to the invention described in claim 20 of the present application, when the space in which the cluster is created is n-dimensional, any of a hypercircle, hyperellipsoid, and hyperplane is used as a figure surrounding the cluster. And the specific coordinate position inside the cluster is the sum of the center of the figures and the end points of the axes (2n +
1) The recognition device according to claim 17, wherein the recognition device is a point.

【００３１】また、この出願の請求項２１に記載の発明
は、前記入力データの学習により適合度情報を変化させ
る手段は、学習対象のクラスタに正しく入ることがわか
っているデータを学習データとして採用し、かつ前記特
定座標位置の適合度初期値を０（あるいは適合度として
の最小値）とし、学習データを与えることで、そのデー
タが含まれる領域を囲む適合度を上昇させていくもので
あることを特徴とする請求項１３に記載の認識装置にあ
る。Further, in the invention according to claim 21 of the present application, the means for changing the fitness information by learning the input data employs, as learning data, data which is known to correctly enter a cluster to be learned. In addition, by setting the initial value of the fitness of the specific coordinate position to 0 (or the minimum value as the fitness) and providing learning data, the fitness of surrounding the area including the data is increased. 14. The recognition device according to claim 13, wherein:

【００３２】また、この出願の請求項２２に記載の発明
は、前記入力データの学習により適合度情報を変化させ
る手段は、学習対象のクラスタに誤って（異なるカテゴ
リのデータなのに入ってしまう）入ることがわかってい
るデータを学習データとして採用し、かつ前記特定座標
位置の適合度初期値を１（あるいは適合度としての最高
値）とし、学習データを与えることで、そのデータが含
まれる領域を囲む適合度を下降させていくものであるこ
とを特徴とする請求項１３に記載の認識装置にある。In the invention according to claim 22 of the present application, the means for changing the fitness information by learning the input data erroneously enters the learning target cluster (enters data of a different category). By adopting data that is known to be the learning data, setting the initial value of the fitness of the specific coordinate position to 1 (or the highest value as the fitness), and providing the learning data, the area including the data is reduced. 14. The recognition apparatus according to claim 13, wherein the degree of surrounding suitability is reduced.

【００３３】さらに、この出願の請求項２３に記載の発
明は、前記請求項１２〜請求項２２のいずれかに記載さ
れた装置を実現するためのコンピュータプログラムを記
録させた記録媒体にある。Further, the invention according to claim 23 of the present application resides in a recording medium in which a computer program for realizing the device according to any one of claims 12 to 22 is recorded.

【００３４】[0034]

【発明の実施の形態】以下、この発明の好ましい実施の
形態につき、添付図面を参照して詳細に説明する。Preferred embodiments of the present invention will be described below in detail with reference to the accompanying drawings.

【００３５】本発明に係るパターン認識システム（方法
及び装置を含む）並びに適合度付きクラスタ生成部の実
施の一形態が図１に示されている。なお、言うまでもな
いが、同システムはコンピュータプログラムにて実現す
ることができる。FIG. 1 shows an embodiment of a pattern recognition system (including a method and an apparatus) according to the present invention and a cluster generator with a fitness. Needless to say, the system can be realized by a computer program.

【００３６】同図において、従来手法によるクラスタ生
成部ａは、２次元の特徴量を用いてクラスタリングを実
施するものである。ここで、従来手法としては、例え
ば、距離の近いデータ同士を１つのクラスタにまとめて
いく方法や、ある規範量を最小にするようデータをまと
めていく方法等を挙げることができる。クラスタリング
により得られたクラスタ情報ｂは、本発明の要部の一つ
である適合度付きクラスタ生成部ｃへと転送される。な
お、ここで、クラスタ情報ｂとは、１個／カテゴリ、あ
るいは、複数個／カテゴリにクラスタリングされた情
報、つまりクラスタを構成するデータのまとまりの情報
のことを意味している。In the figure, a cluster generation unit a according to a conventional method performs clustering using a two-dimensional feature amount. Here, as a conventional method, for example, a method of collecting data having short distances into one cluster, a method of collecting data so as to minimize a certain reference amount, and the like can be cited. The cluster information “b” obtained by the clustering is transferred to a cluster generation unit with a degree of fitness, which is one of the main parts of the present invention. Here, the cluster information b means information clustered into one / category or a plurality / category, that is, information of a group of data constituting a cluster.

【００３７】適合度付きクラスタ生成部ｃは、クラスタ
を構成するデータの情報であるクラスタ情報ｂを元に、
それら全てを内包する図形（例えば、楕円）を記述し、
その楕円の軸端点、中心に適合度を付加させると共に、
さらに、その適合度を変化（学習）させる機能を有する
部分である。そして、この適合度付きクラスタ生成部ｃ
には、後に詳細に説明するように、クラスタ形状の決定
部ｄ、適合度の付加部ｅ、及び、適合度の学習部ｆが含
まれている。ここで、クラスタ形状の決定部ｄでは、ク
ラスタを構成するデータを内包する楕円の記述が行われ
る。また、適合度の付加部ｅでは、クラスタ形状の決定
部ｄにてで決定された楕円の軸端点、中心に初期値１の
適合度が保持される。さらに、適合度の学習部ｆでは、
生成されているクラスタ（楕円）に誤認識データを与
え、誤認識データの含まれる領域の適合度を低下させる
処理が行われる。The cluster generation unit c with a degree of conformity uses the cluster information b, which is information of data constituting a cluster, based on the cluster information b.
Describe a figure (for example, an ellipse) that contains all of them,
While adding the degree of fit to the axis end point and center of the ellipse,
Further, it is a part having a function of changing (learning) the degree of matching. Then, the cluster generation unit c with the degree of conformity
Includes a cluster shape determination unit d, a fitness addition unit e, and a fitness learning unit f, as described in detail later. Here, in the cluster shape determining unit d, an ellipse including data constituting the cluster is described. In addition, the fitness addition unit e holds the fitness of the initial value 1 at the axis end point and center of the ellipse determined by the cluster shape determination unit d. Further, in the learning unit f of the fitness,
Misrecognition data is given to the generated cluster (ellipse), and a process of reducing the degree of conformity of a region including the misrecognition data is performed.

【００３８】こうして生成された適合度付加済みのクラ
スタｇは、本発明の他の要部である認識システムｈへと
転送される。ここで、適合度付加済みのクラスタｇと
は、クラスタを記述した楕円式、及び、適合度を軸端点
並びに中心に保持したクラスタ情報のことを意味してい
る。The cluster g to which the degree of fitness has been added thus generated is transferred to a recognition system h, which is another essential part of the present invention. Here, the cluster g to which the degree of fitness has been added means an elliptic expression describing the cluster and cluster information in which the degree of fitness is held at the axial end point and the center.

【００３９】認識システムｈは、適合度付きクラスタ生
成部ｃから転送されてくる適合度付加済みクラスタｇを
用いて、未知の入力がどのカテゴリに所属するかを判定
する部分である。この認識システムｈには、後に詳細に
説明するように、内包判定部ｉと、所属度算出部ｊと、
認識結果決定部ｍとが含まれている。The recognition system h is a part for determining which category the unknown input belongs to, using the cluster g with the added fitness which is transferred from the cluster generator c with the fitness. As will be described in detail later, this recognition system h includes an inclusion determination unit i, a membership calculation unit j,
And a recognition result determination unit m.

【００４０】内包判定部ｉは、適合度付加済みクラスタ
ｇを保持し、未知の入力データが、その保持するクラス
タｇに内包されるか否かを判定する機能を有するもので
ある。ここで、内包されると判定されたならば、ｋの矢
印、そうでなければｊの矢印に処理は流れる。ここで、
矢印ｊは、未知のデータがどのクラスタにも内包されな
い場合の処理の流れを示しており、この処理に分岐した
場合は適合度演算がされない。また、矢印ｋは、未知の
クラスタが１つ以上のクラスタに内包される場合の処理
の流れを示しており、この処理に分岐した場合は、各ク
ラスタでの所属度（適合度の合成）が演算される。The inclusion judging section i has a function of holding the cluster g to which the matching degree has been added, and judging whether or not the unknown input data is included in the held cluster g. Here, the process flows to the arrow of k if it is determined to be included, and otherwise to the arrow of j. here,
The arrow j indicates the flow of processing when unknown data is not included in any cluster, and when the processing branches to this processing, the fitness calculation is not performed. Arrow k indicates the flow of processing when an unknown cluster is included in one or more clusters. If the processing branches off, the degree of affiliation (combination of fitness) in each cluster is determined. Is calculated.

【００４１】所属度演算部ｌは、未知の入力データがあ
るクラスタに含まれる場合、その所属度を算出する部分
である。ここで算出される所属度が大きいほど、そのク
ラスタが所属するカテゴリへ属する度合いが高いことを
示している。When the unknown input data is included in a certain cluster, the affiliation degree calculation unit 1 is a part for calculating the affiliation degree. The greater the degree of belonging calculated here, the higher the degree of belonging to the category to which the cluster belongs.

【００４２】認識結果決定部ｍは、入力された未知のデ
ータを、例えば、最も大きい所属度が算出されたクラス
タの属するカテゴリに属すると判定する部分である。た
だし、所属度の大きさにより、『どこにも所属しな
い』、『所属するがあやしい』、『自信を持って所属す
ると言える』と判定を下すこともできる。『どこにも所
属しない』は、矢印ｊの処理の流れを通った場合（所属
度が算出されない）に出される結果と考えて良い。The recognition result determination section m is a section for determining that the input unknown data belongs to, for example, the category to which the cluster having the highest belonging degree belongs. However, depending on the degree of affiliation, it is possible to make a judgment such as "I do not belong anywhere", "I am likely to belong", or "I can say that I belong with confidence". “No affiliation” can be considered as a result that is output when the affiliation is processed (the affiliation degree is not calculated).

【００４３】なお、この実施形態にて認識されるべき各
パターンは二次元の特徴量を有するものであり、２つの
特徴量にて特徴付けられている。そのため、クラスタが
出来上がった後、それを囲む図形を楕円とした。もし
も、３次元の特徴量を有するようならば、クラスタは楕
円体で囲まれることになり、ｎ次元の特徴を持つならば
ｎ次元の超楕円体で囲まれることになる。つまり、以上
説明した本発明の技術思想は、ｎ次元まで拡張可能であ
ることに注意されたい。Each pattern to be recognized in this embodiment has two-dimensional features, and is characterized by two features. Therefore, after the cluster is completed, the figure surrounding it is made elliptical. If the cluster has three-dimensional features, the cluster will be surrounded by an ellipsoid, and if it has n-dimensional features, it will be surrounded by an n-dimensional hyperellipsoid. That is, it should be noted that the technical idea of the present invention described above can be extended to n dimensions.

【００４４】次に、適合度の付加部ｅにてクラスタに適
合度を付加するに際して、適合度が付される位置につい
て図２を参照しながら説明する。認知のデータの集合で
クラスタを作成し、そのクラスタに適合度を保持させ
る。各クラスタは楕円で表せるものとする。使用するデ
ータを２次元として作成したクラスタを例に取ると、適
合度が付加されるのは、図２の黒丸の位置である。つま
り、４個の軸端点と１個のクラスタ中心に適合度が付加
される。Next, a description will be given, with reference to FIG. 2, of a position where the degree of conformity is added when the degree of conformity is added to the cluster by the degree of conformity adding unit e. A cluster is created from a set of cognitive data, and the cluster retains fitness. Each cluster can be represented by an ellipse. Taking a cluster in which data to be used is created as two-dimensional as an example, the degree of conformity is added at the position of the black circle in FIG. That is, the fitness is added to the four shaft end points and the center of one cluster.

【００４５】次に、適合度の学習部ｆにおける学習方法
について、図３を参照しながら説明する。あるクラスタ
に誤認識データを入れたときの適合度学習の仕方を以下
に示す。あるクラスタに誤認識データが含まれている状
況が図３に示されている。図中の中心付近に位置する網
掛け部は、クラスタを構成するデータで共分散行列を作
成し、その行列の固有値、固有ベクトルから生成される
楕円である。しかし、この固有値、固有ベクトルから生
成される楕円ではクラスタを構成するデータ全てを内包
する楕円を表すことができないため、定数ｃをかけて、
固有ベクトル方向に楕円の大きさを広げている（網掛け
部の外側の楕円）。Next, a learning method in the learning section f for the degree of matching will be described with reference to FIG. The following describes how to learn the degree of fitness when misrecognition data is put into a certain cluster. FIG. 3 shows a situation where a certain cluster contains erroneously recognized data. The hatched part located near the center in the figure is an ellipse generated from a covariance matrix created with data constituting a cluster and eigenvalues and eigenvectors of the matrix. However, since the ellipse generated from the eigenvalues and eigenvectors cannot represent an ellipse containing all the data constituting the cluster, multiplying by a constant c,
The size of the ellipse is expanded in the direction of the eigenvector (the ellipse outside the shaded portion).

【００４６】なお、図３で使用されている記号の意味は
次の通りである。・クラスタ構成データの共分散行列の固有ベクトルは、
軸Ａ方向、軸Ｂ方向を向いている。・軸Ａ方向の固有値はλ１、軸Ｂ方向の固有値はλ２で
ある。・クラスタ構成データを全て含むための各軸方向の長さ
は、固有値の平方根にｃをかけたものである。・軸Ａ上の端点における適合度はＡ１、Ａ２である。・軸Ｂ上の端点における適合度はＢ１、Ｂ２である。・クラスタ中心における適合度はＯである。・クラスタに含まれた誤認識データの軸Ａ、Ｂ上の座標
は（ａ、ｂ）であり、中心からの正規化距離はｒであ
る。ここで、軸長さで正規化したあとの座標を（ａａ、
ｂｂ）とする。・クラスタ中心と誤認識データ座標までのベクトルと軸
Ａがなす角をΘとする。・共分散行列の固有値の平方根を径に持つ楕円（網掛け
部の楕円）の楕円体式右辺＝Ｒ１とする。・クラスタ構成全データのカバリング距離を考慮した楕
円（大きい楕円）の楕円体式右辺＝Ｒ２とする。The meanings of the symbols used in FIG. 3 are as follows. The eigenvector of the covariance matrix of the cluster configuration data is
It faces the direction of the axis A and the direction of the axis B. The eigenvalue in the axis A direction is λ1, and the eigenvalue in the axis B direction is λ2. The length in each axis direction for including all the cluster configuration data is obtained by multiplying the square root of the eigenvalue by c. The fitness at the end points on the axis A is A1 and A2. The fitness at the end points on the axis B is B1 and B2. The fitness at the cluster center is O. The coordinates on the axes A and B of the misrecognition data included in the cluster are (a, b), and the normalized distance from the center is r. Here, coordinates after normalization by the axis length are (aa,
bb). The angle between the axis A and the vector from the cluster center to the misrecognition data coordinates is represented by Θ. -The right side of the ellipsoidal expression of the ellipse (the shaded ellipse) having the square root of the eigenvalue of the covariance matrix as the diameter is R1. -The right side of the ellipsoidal expression of the ellipse (large ellipse) in consideration of the covering distance of all the cluster configuration data is R2.

【００４７】２次元のクラスタを対象に行った１回の学
習処理の流れが図４のフローチャートに示されている。
同図に示されるように、この学習処理では、誤認識デー
タ数をカウンタｎにプリセットした後（ステップも４０
１）、ポインタｉを＋１づつインクリメントしては（ス
テップ４０６）、該当する誤認識データを読み出し、そ
の読み出されたデータに基づいて適合度を調整する処理
（ステップ４０３〜４０５）が、ポインタｉが誤認識デ
ータ数ｎに達するまで（ステップ４０２ＮＯ、ステップ
４０７）、繰り返される。ここで、適合度調整処理で
は、先ず、その誤認識データがクラスタの内部に含まれ
るものであるか否かの判定が行われ（ステップ４０
３）、含まれないと判定される場合には（ステップ４０
３ＮＯ）、適合度の調整は行われないのに対して、含ま
れると判定される場合には（ステップ４０３ＹＥＳ）、
条件式≦１．０の成立可否に応じて、異なる態様によ
り適合度の調整（ステップ４０４又は４０５）が行われ
る。ここで、図４のステップ４０４で使用される条件式
≦１．０（式は楕円式である）は、クラスタを構成
するデータの共分散行列の固有値、固有ベクトルで形成
される楕円内部に入っているか否かを判定している。こ
の例では、条件を満たす（内部に入る）場合は（ステッ
プ４０４ＹＥＳ）、中心の適合度のみを下げる処理とし
（ステップ４０４）、そうでなければ（ステップ４０４
ＮＯ）、端点と中心を下げる処理とした（ステップ４０
５）。これは、中心に近い領域に存在する誤認識データ
の影響を軸端点の適合度に反映させないためである。す
なわち、１）フローチャートの条件式≦１．０がＹｅｓの場合中心の適合度は、次式（数１）に従って低下される。The flow of one learning process performed on a two-dimensional cluster is shown in the flowchart of FIG.
As shown in the drawing, in this learning process, after the number of misrecognized data is preset in a counter n (step is also 40
1) When the pointer i is incremented by +1 (step 406), the corresponding erroneously recognized data is read, and the process of adjusting the degree of conformity based on the read data (steps 403 to 405) is performed by the pointer i Is repeated until the number n of erroneously recognized data reaches (step 402 NO, step 407). Here, in the matching degree adjustment processing, first, it is determined whether or not the erroneously recognized data is included in the cluster (step 40).
3) If it is determined that they are not included (step 40)
3NO), while the adjustment of the degree of conformity is not performed, but it is determined to be included (YES in step 403),
Adjustment of the degree of conformity is performed in a different manner (step 404 or 405) depending on whether the conditional expression ≦ 1.0 is satisfied. Here, the conditional expression ≦ 1.0 (the expression is an elliptic expression) used in step 404 in FIG. 4 is obtained by entering the inside of the ellipse formed by the eigenvalues and the eigenvectors of the covariance matrix of the data constituting the cluster. Is determined. In this example, if the condition is satisfied (entering the inside) (step 404 YES), only the center fitness is reduced (step 404), otherwise (step 404).
(NO), the end point and the center are lowered (step 40).
5). This is because the influence of the erroneous recognition data existing in the area near the center is not reflected on the degree of conformity of the axis end point. That is, 1) When conditional expression ≦ 1.0 in the flowchart is Yes The fitness of the center is reduced according to the following expression (Equation 1).

【００４８】[0048]

【数１】２）フローチャートの条件式≦１．０がＮｏの場合軸端点と中心の適合度は、次式（数２）に従って低下さ
れる。(Equation 1) 2) When Conditional Expression ≦ 1.0 in the Flow Chart is No The degree of conformity between the shaft end point and the center is reduced according to the following expression (Equation 2).

【００４９】[0049]

【数２】注）・ｋ１、ｋ２、ｋ３は１回の学習で適合度をどれだ
け下げるかということに依存して決定されるパラメータ・中心、端点の適合度の初期値はすべて１である。つま
り、誤認識データの含まれない領域は適合度１のままで
ある。(Equation 2) Note)-k1, k2, and k3 are parameters determined depending on how much the fitness is reduced by one learning.-The initial values of the fitness of the center and the end point are all 1. That is, the area that does not include the misrecognition data remains at the fitness level of 1.

【００５０】学習処理の結果として、楕円で保持される
適合度およびメンバシップ関数のイメージが図５に示さ
れている。FIG. 5 shows an image of the fitness and the membership function held by the ellipse as a result of the learning process.

【００５１】なお、図５において使用されている記号の
意味は次の通りである。The meanings of the symbols used in FIG. 5 are as follows.

【００５２】・Ｉ、II、III、IV：軸Ａ、Ｂで区切られる楕円の領域・Ｏ：楕円中心で保持される適合度・Ａ１、Ａ２：軸Ａの端点で保持される適合度・Ｂ１、Ｂ２：軸Ｂの端点で保持される適合度・Ａ１−Ｂ１：領域Ｉで利用するメンバシップ関数・Ｂ１−Ａ２：領域IIで利用するメンバシップ関数・Ａ２−Ｂ２：領域IIIで利用するメンバシップ関数・Ｂ２−Ａ１：領域IVで利用する軸Ｂ方向のメンバシッ
プ関数・ｄａｔａ：未知の入力データと仮定・Θ：未知の入力データと楕円中心を結んだベクトルの
軸Ａとなす角・ｒ：未知の入力データの楕円中心からの正規化距離・ＭＦ値：領域Ｉに未知の入力データが入ったとき角度
Θにより決まるメンバシップ値（適合度）図５に示すように、未知のデータが存在する場合は、角
度Θにより決定される端点の適合度と楕円中心からの距
離で重み付けされた中心の適合度が加算され、これが所
属度となる。なお、所属度の算出処理については、後
に、より詳細に説明する。I, II, III, IV: Elliptical area delimited by axes A and B O: Fitness maintained at the center of the ellipse A1, A2: Fitness maintained at the end point of axis A B1 , B2: fitness held at the end point of axis B. A1-B1: membership function used in area I. B1-A2: membership function used in area II. A2-B2: member used in area III. Ship function B2-A1: Membership function in the direction of axis B used in region IV. Data: Assume unknown input data. Θ: Angle formed by axis A of vector connecting unknown input data and ellipse center. R : Normalized distance of the unknown input data from the center of the ellipse. MF value: Membership value (fitness) determined by angle とき when unknown input data enters region I. As shown in FIG. If present, by angle Θ It is subject to fitness of weighted centers at a distance from the degree of fitness elliptical center of the end points to be constant, so this is appertaining. The calculation process of the affiliation degree will be described later in more detail.

【００５３】次に、こうして得られた適合度付加済みの
クラスタに基づいて、個々の認識対象データについてそ
の属するカテゴリを決定する処理の詳細を図６のフロー
チャートを参照しながら説明する。同図に示されるよう
に、処理が開始されると、先ず、未知入力データの特徴
抽出処理（ステップ６０１）並びにクラスタ数のプリセ
ット処理（ステップ６０２）を実行した後、クラスタポ
インタｉを＋１づつインクリメントしつつ（ステップ６
０７）、該ポインタｉで指定されるクラスタについて、
それに入力データが含まれるか否かの判定が、ポインタ
ｉの値がｎに達するまで（ステップ６０３）、繰り返し
行われる（ステップ６０４）。ここで、含まれないと判
定される場合には（ステップ６０４ＮＯ）、当該ポイン
タｉにて指定されるクラスタの所属度（所属度ｉ）は０
と記憶されるのに対して（ステップ６０６）、含まれる
と判定される場合には（ステップ６０４ＹＥＳ）、後述
する演算式に従って当該クラスタに対する所属度（所属
度ｉ）の算出が行われる（ステップ６０５）。Next, the details of the process for determining the category to which each piece of recognition target data belongs based on the obtained clusters with the added degree of matching will be described with reference to the flowchart of FIG. As shown in the figure, when the process is started, first, a feature extraction process of unknown input data (step 601) and a preset process of the number of clusters (step 602) are performed, and then the cluster pointer i is incremented by +1. (Step 6
07), for the cluster specified by the pointer i,
The determination whether or not the input data is included therein is repeatedly performed until the value of the pointer i reaches n (step 603) (step 604). If it is determined that the cluster is not included (NO in step 604), the affiliation degree (affiliation degree i) of the cluster designated by the pointer i is 0.
Is stored (step 606), and if it is determined that it is included (step 604 YES), the degree of affiliation (degree of affiliation i) with respect to the cluster is calculated according to an arithmetic expression described later (step 605). ).

【００５４】このようにして、全てのクラスタについて
の所属度が求められたならば、続いて、それらの所属度
に基づいて、当該入力データの属するカテゴリの決定処
理が行われる（ステップ６０８〜６１１）。すなわち、
まず最初に、当該入力データがいずれかのクラスタに含
まれるものであるかの判定が行われ（ステップ６０
８）、入力データを含むクラスタが存在しない場合には
（ステップ６０８ＮＯ）、直ちに、処理は終了するのに
対して、入力データを含むクラスタが存在する場合には
（ステップ６０８ＹＥＳ）、さらに、入力データを含む
クラスタが複数存在するか否かの判定が行われる（ステ
ップ６０９）。ここで、入力データを含むクラスタが単
数である場合には（ステップ６０９ＮＯ）、直ちに、処
理は終了するのに対して、入力データを含むクラスタが
複数存在する場合には（ステップ６０９ＹＥＳ）、最大
所属度算出クラスタを抽出して確定する処理が行われる
（ステップ６１０）。すなわち、この最大所属度算出ク
ラスタの確定処理（ステップ６１０）では、先に説明し
た所属度算出処理（ステップ６０５）にて求められた所
属度（所属度ｉ：ｉ＝１〜ｎ）の中で最大のものに対応
するクラスタが求められ、これが最大所属度算出クラス
タとして確定される。その後、上述の処理で求められた
最大所属度算出クラスタの所属するカテゴリが抽出され
て、当該入力データの属するカテゴリとして確定される
（ステップ６１１）。なお、この確定に際しては、後に
詳細に説明するように、所属度を基準値と比較すること
により、「不明」、「××カテゴリに含まれるが確信は
持てない」、「確信を持って××クラスタに含まれると
いえる」の３通りの認識を行うことができる。After the degree of affiliation for all clusters has been obtained in this way, subsequently, the category to which the input data belongs is determined based on the degree of affiliation (steps 608 to 611). ). That is,
First, it is determined whether the input data is included in any of the clusters (step 60).
8) If there is no cluster including the input data (step 608 NO), the process is immediately terminated, while if there is a cluster including the input data (step 608 YES), the input data is further processed. It is determined whether or not there are a plurality of clusters including (step 609). Here, if the number of clusters including the input data is singular (step 609 NO), the process is immediately terminated, whereas if there is a plurality of clusters including the input data (step 609 YES), the maximum affiliation is determined. Processing for extracting and determining the degree calculation cluster is performed (step 610). In other words, in the process of determining the maximum affiliation degree calculation cluster (step 610), the affiliation degree (affiliation degree i: i = 1 to n) obtained in the affiliation degree calculation processing (step 605) described above is used. The cluster corresponding to the largest one is obtained, and this is determined as the maximum belonging degree calculation cluster. Then, the category to which the maximum belonging degree calculation cluster obtained by the above-described processing belongs is extracted and determined as the category to which the input data belongs (step 611). At the time of this determination, as will be described in detail later, by comparing the degree of affiliation with the reference value, "unknown", "included in the XX category but cannot be convinced", " X can be said to be included in the cluster ".

【００５５】所属度算出処理（ステップ６０５）の一例
を図７を参照しながら説明する。この例では、未知の入
力データｄａｔａは図７に示される楕円の第１象限に存
在するものと想定する。なお、図７において使用されて
いる記号の意味は次の通りである。An example of the membership calculation process (step 605) will be described with reference to FIG. In this example, it is assumed that the unknown input data data exists in the first quadrant of the ellipse shown in FIG. The meanings of the symbols used in FIG. 7 are as follows.

【００５６】・Ｉ、II、III、IV：軸Ａ、Ｂで区切られる楕円の領域・Ｏ：楕円中心で保持される適合度・Ａ１、Ａ２：軸Ａの端点で保持される適合度・Ｂ１、Ｂ２：軸Ｂの端点で保持される適合度・Ａ１−Ｂ１：領域Ｉで利用するメンバシップ関数・Ｂ１−Ａ２：領域IIで利用するメンバシップ関数・Ａ２−Ｂ２：領域IIIで利用するメンバシップ関数・Ｂ２−Ａ１：領域IVで利用する軸Ｂ方向のメンバシッ
プ関数・ｄａｔａ：未知の入力データと仮定・Θ：未知の入力データと楕円中心を結んだベクトルの
軸Ａとなす角・ｒ：未知の入力データの楕円中心からの正規化距離・ＭＦ値：領域Iに未知の入力データが入ったとき角度
Θにより決まるメンバシップ値（適合度）また、この場合におけるクラスタの内包チェック式は、
次式（数３）により表される。I, II, III, IV: Elliptical area delimited by axes A and B O: Fitness maintained at the center of the ellipse A1, A2: Fitness maintained at the end point of axis A B1 , B2: fitness held at the end point of axis B. A1-B1: membership function used in area I. B1-A2: membership function used in area II. A2-B2: member used in area III. Ship function B2-A1: Membership function in the direction of axis B used in region IV. Data: Assume unknown input data. Θ: Angle formed by axis A of vector connecting unknown input data and ellipse center. R : Normalized distance of the unknown input data from the center of the ellipse • MF value: Membership value (fitness) determined by angle Θ when unknown input data enters region I. In this case, the inclusion check formula of the cluster is ,
It is expressed by the following equation (Equation 3).

【００５７】[0057]

【数３】ここで、 λａ：軸Ａ方向の固有値 λｂ：軸Ｂ方向の固有値ｒ：正規化距離ｃ：クラスタを構成するデータをすべて含むため、各軸
方向に広がりを持たせる係数Ｒがｄ以下なので、ｄａｔａはこのクラスタに含まれて
いることになる。(Equation 3) Here, λa: eigenvalue in the direction of the axis A λb: eigenvalue in the direction of the axis B r: normalized distance c: since all the data constituting the cluster are included, the coefficient R for expanding in each axis direction is d or less, so Is included in this cluster.

【００５８】また、この場合における所属度の演算式は
次式（数４）により表される。In this case, the arithmetic expression of the degree of belonging is represented by the following expression (Equation 4).

【００５９】[0059]

【数４】内包チェック式をｎ次元に拡張した場合には、次式（数
５）により表される。このｎ次元に拡張された式の場合
における所属度計算式については、別途、後に詳細に説
明する。(Equation 4) When the inclusion check formula is extended to n dimensions, it is expressed by the following formula (Equation 5). The membership calculation formula in the case of the expression expanded to n dimensions will be described later in detail separately.

【００６０】[0060]

【数５】なお、各記号の意味は次の通りである。(Equation 5) The meaning of each symbol is as follows.

【００６１】γ：クラスタ中心座標を固有ベクトル方向
の座標に変換したもの λ：固有値ｃ：クラスタ構成全データを含むための係数次に、ステップ６１０で求められた最大所属度に基づく
所属カテゴリの抽出決定処理（ステップ６１１）の詳細
について説明する。Γ: Cluster center coordinates converted to coordinates in the eigenvector direction λ: Eigenvalue c: Coefficient for including all data of the cluster configuration Next, the determination of the belonging category extraction based on the maximum belonging degree obtained in step 610 The details of the process (step 611) will be described.

【００６２】この処理では、算出した最大所属度を元
に、未知の入力データを次の３通りの結果に分類する。In this process, unknown input data is classified into the following three results based on the calculated maximum affiliation.

【００６３】最大所属度＝０未知のデータはどこのクラスタにも所属しなかったた
め、『認識結果は不明』最大所属度＜ユーザ設定値未知のデータはあるクラスタに所属したが、所属度低い
ため『××カテゴリのデータとして決定できない』所属度≧ユーザ設定値未知のデータはあるクラスタに所属し、所属度高いため
『××カテゴリに所属すると決定』ユーザ設定値（基準
値）とは、所属クラスタを決定付けられる所属度の大き
さであり、ユーザが任意に設定できる。ただし、０≦ユ
ーザ設定値≦１の範囲である。Maximum belonging degree = 0 Unknown data did not belong to any cluster, so "recognition result is unknown" Maximum belonging degree <user set value Unknown data belonged to a certain cluster, but belonging degree is low. “Cannot be determined as XX category data” Affiliation ≧ user set value Unknown data belongs to a certain cluster and belongs to a high degree, so “determined to belong to XX category” user set value (reference value) This is the degree of affiliation that can determine the cluster, and can be set arbitrarily by the user. However, the range is 0 ≦ user set value ≦ 1.

【００６４】次に、メンバシップ関数の作成方法につい
て説明する。まず、最初に、クラスタを構成する全ての
データを円、楕円、四角形（２次元の場合）などの図形
で囲み、その図形の軸端点および中心に適合度を保持さ
せ、端点と中心間を直線式で補完し、それをメンバシッ
プ関数として定義する場合の例を、図８を参照しながら
説明する。Next, a method of creating a membership function will be described. First, all data constituting the cluster are surrounded by a figure such as a circle, an ellipse, and a rectangle (in the case of two dimensions), and the degree of conformity is held at the end point and the center of the figure, and a straight line is formed between the end point and the center. An example of complementing with an expression and defining it as a membership function will be described with reference to FIG.

【００６５】図８に示されるように、中心の適合度を
Ａ、軸（ｘ）の正方向の端点の適合度をＢ（座標値ｘ＝
ｒ≦１）とした場合、その間の適合度は図８中に記され
た直線ＡＢ相当のメンバシップ関数ＭＦで補完されるこ
とになる。尚、図８においては、Ｘ軸の正方向のメンバ
シップ関数のみが図示されているが、４個の軸端点のそ
れぞれに適合度を持つから、中心と軸端点の適合度間を
補完すると計４つのメンバシップ関数（ＭＦ）が作成さ
れることになる。As shown in FIG. 8, the fitness of the center is A, and the fitness of the positive end point of the axis (x) is B (coordinate value x =
When r ≦ 1), the degree of conformity is complemented by the membership function MF corresponding to the straight line AB shown in FIG. Note that FIG. 8 shows only the membership function in the positive direction of the X-axis. However, since each of the four shaft endpoints has a degree of fitness, it is possible to supplement the fitness between the center and the axis endpoints. Four membership functions (MF) will be created.

【００６６】また、図８には、クラスタが楕円で囲まれ
た例が示されているが、円、四角形あるいはｎ次元空間
に拡張された図形（例：超円、超楕円、超平面体）で囲
まれた場合も同様にメンバシップ関数を作成することが
できる。ちなみに、ｎ次元のとき作成されるメンバシッ
プ関数は（２×ｎ）個となる。FIG. 8 shows an example in which a cluster is surrounded by an ellipse. However, a figure expanded into a circle, a rectangle, or an n-dimensional space (eg, a hypercircle, a hyperellipse, a hyperplane) A membership function can be created in the same manner when enclosed by. Incidentally, the number of membership functions created in the case of n dimensions is (2 × n).

【００６７】さらに、図形の軸端点および中心および軸
端点と中心の間の１つ以上の点に適合度を保持させ、そ
れぞれの間を補完し、それをメンバシップ関数として持
つことを特徴とするクラスタとして拡張することもでき
る。すなわち、図８のＡ、Ｂという適合度の間にさらに
１つ以上の適合度を保持するポイント（Ｃと仮定）を設
け、ＡＣ、ＣＢ間を直線式で補完するのである。Further, it is characterized in that the degree of conformity is maintained at one or more points between the axis end point and the center of the figure and between the axis end point and the center, and the respective points are complemented to have the degree of fitness as a membership function. It can be extended as a cluster. That is, a point (assumed to be C) for holding one or more fitness levels is further provided between the fitness levels A and B in FIG. 8 and the interval between AC and CB is complemented by a linear equation.

【００６８】次に、クラスタを構成する全てのデータを
円、楕円、四角形（２次元の場合）などで囲み、その図
形の軸端点および中心に適合度を保持させ、中心と端点
間を次に述べるような曲線式（指数関数）で補完し、そ
れをメンバシップ関数として定義する場合の例を、図９
を参照しながら説明する。Next, all the data constituting the cluster are surrounded by circles, ellipses, rectangles (in the case of two dimensions), etc., and the end points and the center of the figure are held in the degree of conformity. FIG. 9 shows an example of complementing with a curve expression (exponential function) as described and defining it as a membership function.
This will be described with reference to FIG.

【００６９】図９に示されるように、中心の適合度Ａ、
軸（ｘ）の正方向端点の適合度Ｂ（座標値ｘ＝ｒ＝１）
（Ａ１＞Ａ２）とすると、その間の適合度は図９中に記
された指数関数曲線相当のメンバシップ関数ＭＦで補完
されることとなる。なお、図９においては、Ｘ軸の正方
向のメンバシップ関数（ＭＦ）のみが示されているが、
４個の軸端点のそれぞれに適合度を持つから、中心と端
点の適合度間を補完すると直線式で補完した場合と同様
に計４つのメンバシップ関数が作成されることになる。As shown in FIG. 9, the fitness of the center A,
Fitness B of the positive end point of axis (x) (coordinate value x = r = 1)
Assuming that (A1> A2), the fitness between them is complemented by the membership function MF corresponding to the exponential function curve shown in FIG. FIG. 9 shows only the membership function (MF) in the positive direction of the X axis,
Since each of the four shaft endpoints has a degree of fitness, when the degree of fitness between the center and the endpoint is complemented, a total of four membership functions are created as in the case of complementing with a linear equation.

【００７０】また、図９には、クラスタが楕円で囲まれ
た例を示しているが、円、四角形あるいはｎ次元空間に
拡張された図形（例：超円、超楕円）で囲まれた場合も
同様にメンバシップ関数を作成することができる。ちな
みに、ｎ次元のとき作成されるメンバシップ関数は直線
式で補完した場合と同様に（２×ｎ）個となる。FIG. 9 shows an example in which a cluster is surrounded by an ellipse. When the cluster is surrounded by a circle, a square, or a figure extended into an n-dimensional space (eg, a hypercircle, a hyperellipse) Can create a membership function as well. By the way, the number of membership functions created in the case of n dimensions is (2 × n) as in the case of complementing with a linear equation.

【００７１】さらに、クラスタを構成する全てのデータ
を円、楕円、四角形（２次元の場合）などの図形で囲
み、その図形の軸端点および中心（あるいは重心）およ
び軸端点と中心の間の１つ以上の点に適合度を保持さ
せ、それぞれの間を補完し、それをメンバシップ関数と
して持つことを特徴とするクラスタに拡張することもで
きる。すなわち、図９のＡ、Ｂという適合度の間に１つ
以上の適合度を保持するポイント（Ｃと仮定）を設け、
ＡＣ、ＣＢ間を曲線式（指数関数など）で補完するので
ある。Further, all the data constituting the cluster are surrounded by a figure such as a circle, an ellipse, and a rectangle (in the case of two dimensions), and the axis end point and the center (or the center of gravity) of the figure and the distance between the axis end point and the center are set. It is also possible to extend the cluster to a feature characterized in that one or more points have a goodness of fit, complement each other, and have it as a membership function. That is, a point (assumed to be C) for holding one or more fitness levels between fitness levels A and B in FIG.
The curve between AC and CB is complemented by a curve expression (exponential function or the like).

【００７２】次に、クラスタを構成する全てのデータを
円、楕円（２次元の場合）で囲み、その図形の軸端点お
よび中心に適合度を保持させ、端点間を角度Θで補完
し、それをメンバシップ関数として定義する場合の例
を、図１０を参照しながら説明する。Next, all the data constituting the cluster are surrounded by circles and ellipses (in the case of two dimensions), the degree of conformity is maintained at the axis end points and the center of the figure, and the end points are complemented by an angle Θ. An example in which is defined as a membership function will be described with reference to FIG.

【００７３】図１０に示されるように、中心の適合度を
ｏ、横軸の正方向端点の適合度をＡ、縦軸の正方向端点
の適合度をＢとすると共に、中心と端点の適合度を足し
合わせる比率をｒを用いて表すと、図１０中の点線部に
おけるメンバシップ関数（ＭＦ）は次式（数６）にて表
される。As shown in FIG. 10, the fitness of the center is o, the fitness of the positive end point on the horizontal axis is A, and the fitness of the positive end point on the vertical axis is B. When the ratio of adding degrees is expressed by using r, the membership function (MF) in the dotted line part in FIG. 10 is expressed by the following equation (Equation 6).

【００７４】[0074]

【数６】なお、図１０では、第１象限のメンバシップ関数のみ図
示しているが、軸により領域は４つに分かれているた
め、楕円領域の全体としては、計４つのメンバシップ関
数が作成されることになる。(Equation 6) Although FIG. 10 shows only the membership function in the first quadrant, since the region is divided into four by the axis, a total of four membership functions are created for the entire elliptic region. become.

【００７５】これをｎ次元に拡張すると、次式（数７）
のようになる。なお、ｎ次元空間において作成されるメ
ンバシップ関数（ＭＦ）は、軸により領域が２ⁿに分割
されることを考えると計２ⁿ個となる。When this is extended to n dimensions, the following equation (Equation 7) is obtained.
become that way. Incidentally, the membership function created in the n-dimensional space (MF) is a considered the count 2 ⁿ pieces that region by a shaft is divided into 2 ^n.

【００７６】[0076]

【数７】なお、上式にて使用される記号の意味は次の通りであ
る。(Equation 7) The meanings of the symbols used in the above equation are as follows.

【００７７】ｒ：クラスタ中心からの正規化距離 γ：固有ベクトル方向に座標変換された新座標ｃ：固有値で決まる楕円体径を大きくするための係数 λ：固有ベクトルに対する固有値（λ＝１のとき円）ｏ：クラスタ中心で保持する適合度ＤＤｉ：軸端点で保持する適合度注）端点の適合度に掛けているものが、固有ベクトル方
向に座標軸を取ったときの座標値の２乗である。R: normalized distance from the center of the cluster γ: new coordinate transformed in the direction of the eigenvector c: coefficient for increasing the ellipsoid diameter determined by the eigenvalue λ: eigenvalue for the eigenvector (circle when λ = 1) o: Fitness held at the center of the cluster DDi: Fitness held at the end point of the axis Note) Multiplied by the fitness of the end point is the square of the coordinate value when the coordinate axis is taken in the direction of the eigenvector.

【００７８】次に、適合度の学習方法について、図１１
並びに図１２を参照しながら説明する。まず最初に、ク
ラスタが円あるいは楕円で囲まれた場合において、学習
データ１個につき、そのデータが含まれる領域を囲む適
合度を下げる割合をあらかじめ決めておき、その適合度
低下分をｒ、Θを用いて中心並びに端点に割り振る方法
について、図１１を参照しながら説明する。Next, a method of learning the fitness will be described with reference to FIG.
Description will be made with reference to FIG. First, in the case where a cluster is surrounded by a circle or an ellipse, for each piece of learning data, a rate of decreasing the fitness that surrounds an area including the data is determined in advance, and the reduced fitness is determined by r, Θ The method of assigning to the center and the end point by using is described with reference to FIG.

【００７９】今、適合度学習データ１個につき、そのデ
ータが含まれる領域を囲む適合度をｋだけ下げると仮定
する。また、学習データが図１１に示すような楕円の第
１象限領域に存在するものと仮定する。なお、図１１に
おいて使用されている記号の意味は次の通りである。Now, it is assumed that, for each piece of fitness learning data, the fitness surrounding a region including the data is reduced by k. It is also assumed that the learning data exists in the elliptical first quadrant area as shown in FIG. The meanings of the symbols used in FIG. 11 are as follows.

【００８０】Ａ：Ｘ軸方向の端点適合度Ｂ：Ｙ軸方向の端点適合度ｏ：中心の適合度ｒ：学習データ存在位置までの中心からの正規化距離
（≦１） Θ：Ｘ軸となす角まず、次式（数８）に従って、ｋをｒの大きさにより、
中心、端点で下げる大きさに分解する。A: End point conformance in the X-axis direction B: End point conformity in the Y-axis direction o: Center conformity r: Normalized distance from the center to the learning data existence position (≦ 1) Θ: X-axis First, according to the following equation (Equation 8), k is determined by the magnitude of r.
Decompose into lower size at the center and end points.

【００８１】[0081]

【数８】次に、次式（数９）に従って、残りのｋｒをΘの大きさ
によりＡ、Ｂに振り分ける。(Equation 8) Next, according to the following equation (Equation 9), the remaining kr is divided into A and B according to the size of Θ.

【００８２】[0082]

【数９】なお、この方法は、学習対象のクラスタに誤って（異な
るカテゴリのデータなのに入ってしまう）ことがわかっ
ているデータを学習データとして採用し、適合度を保持
する点の適合度初期値を１（あるいは適合度としての最
高値）とし、学習データを与えることで、そのデータが
含まれる領域を囲む適合度を下降させていく方法（前
者）、あるいは、学習対象のクラスタに正しく入ること
がわかっているデータと誤ってはいることがわかってい
るデータとの双方を学習データとして採用し、適合度を
保持する点の適合度初期値を中間的な値として、学習デ
ータを与えることで、正解データが含まれる領域を囲む
適合度については上昇させ、逆に、誤ったデータが含ま
れる領域を囲む適合度については下降させていく方法
（後者）にも、同様に応用できる。つまり、前者の方法
であるならば、学習データ１つにつきｋだけ適合度をあ
げるとおきかえれば良いし、後者の方法であるならば、
上げ下げを両方考慮すれば実現できる。(Equation 9) In this method, data that is known to be erroneous (enters data of a different category) in the learning target cluster is adopted as learning data, and the fitness value initial value of a point that retains fitness is 1 ( Or the highest value as the degree of fitness), and by giving the learning data, a method of lowering the degree of fitness surrounding the area including the data (the former), or if it is found that the learning target cluster is correctly entered By using both the data that is found and the data that is known to be wrong as learning data, and giving the learning data with the fitness value initial value of the point that maintains the fitness value as an intermediate value, the correct answer data Similarly, a method of increasing the fitness for surrounding a region including, and decreasing the fitness for a region surrounding erroneous data (the latter), similarly. You can use. In other words, if the former method is used, the fitness may be increased by k per learning data, and if the latter method is used,
This can be achieved by considering both raising and lowering.

【００８３】ｎ次元に拡張する場合は、端点の下げ度合
いｋｒを、学習データが存在する領域を囲む平面の数で
割り、各平面に学習データを射影したときにある軸とな
す角Θで平面を構成する軸の端点に割り振れば良い。In the case of extension to n dimensions, the degree of lowering kr of the end point is divided by the number of planes surrounding the area where the learning data exists, and the plane Θ is formed by the angle す between the plane and the axis when the learning data is projected onto each plane. May be assigned to the end points of the axes that constitute.

【００８４】次に、クラスタが四角形で囲まれた場合に
おいて、学習データ１個につき、そのデータが含まれる
領域を囲む適合度を下げる割合をあらかじめ決めてお
き、適合度低下分をｒ、ｘ、ｙを用いて中心、端点に割
り振る方法について、図１２を参照しながら説明する。Next, in the case where the cluster is surrounded by a rectangle, the rate of decreasing the fitness for surrounding the area including the learning data is determined in advance for each learning data, and the reduced fitness is defined as r, x, A method of assigning the center and the end point using y will be described with reference to FIG.

【００８５】今、適合度データ学習データ１個につき、
そのデータが含まれる領域を囲む適合度をｋだけ下げる
と仮定する。また、学習データが図１２に示すような四
角形の第１象限領域に存在するものと仮定する。なお、
図１２で使用されている記号の意味は次の通りである。Now, for one piece of fitness data learning data,
It is assumed that the fitness surrounding the region including the data is reduced by k. It is also assumed that the learning data exists in a quadrangular first quadrant area as shown in FIG. In addition,
The meanings of the symbols used in FIG. 12 are as follows.

【００８６】Ａ：Ｘ軸方向の端点適合度Ｂ：Ｙ軸方向の端点適合度ｏ：中心の適合度ｒ：学習データ存在位置までの中心からの正規化距離
（≦１） Θ：Ｘ軸となす角まず、次式（数１０）に従って、ｋをｒの大きさによ
り、中心、端点で下げる大きさに分解する。A: End point fitness in the X-axis direction B: Endpoint fitness in the Y-axis direction o: Center fitness r: Normalized distance from the center to the learning data existence position (≦ 1) Θ: X-axis First, according to the following equation (Equation 10), k is decomposed into a size to be lowered at the center and the end point by the size of r.

【００８７】[0087]

【数１０】次に、次式（数１１）に従って、残りのｋｒをΘの大き
さでＡ、Ｂに振り分ける。(Equation 10) Next, according to the following equation (Equation 11), the remaining kr is divided into A and B in the size of Θ.

【００８８】[0088]

【数１１】なお、この方法は、円、楕円のときと同様の方法で、先
ほど図１１を参照しつつ説明した前者並びに後者の学
習、更には、ｎ次元空間での学習にも応用することがで
きる。[Equation 11] Note that this method can be applied to the learning of the former and the latter described with reference to FIG. 11 and the learning in the n-dimensional space in the same manner as that of the circle and the ellipse.

【００８９】次に、所属度の算出方法について説明す
る。適合度間を直線あるいは曲線で補完したメンバシッ
プ関数をクラスタで保持する場合、次式（数１２）に従
って所属度を算出することができる。Next, a method of calculating the membership will be described. When a membership function in which the degree of matching is complemented by a straight line or a curve is stored as a cluster, the degree of affiliation can be calculated according to the following equation (Equation 12).

【００９０】[0090]

【数１２】なお、上式で使用される記号の意味は次の通りである。(Equation 12) The meanings of the symbols used in the above formula are as follows.

【００９１】ｎ：次元数Ｄｉ：各軸の座標値が各軸のメンバシップ関数に与えら
れて算出された適合度また、適合度間をクラスタ中心の適合度と中心からの角
度で軸端点間を補完した適合度を中心からの距離ｒに基
づき加算するメンバシップ関数をｎ次元のクラスタで保
持する場合、次式（数１３）に従って所属度を算出する
ことができる。N: the number of dimensions Di: the fitness calculated by giving the coordinate value of each axis to the membership function of each axis The fitness between the fitness of the cluster center and the angle from the center is defined as the fitness between the cluster endpoints. When the membership function for adding the degree of conformity complementing the above based on the distance r from the center is stored in an n-dimensional cluster, the degree of belonging can be calculated according to the following equation (Equation 13).

【００９２】[0092]

【数１３】なお、上式で使用される記号の意味は次の通りである。(Equation 13) The meanings of the symbols used in the above formula are as follows.

【００９３】ｒ：クラスタ中心からの正規化距離 γ：固有ベクトル方向に座標変換された新座標 λ：固有ベクトルに対する固有値ｉ：特徴量の次元数ｃ：固有値で決まる楕円体径を大きくするための係数ｏ：クラスタ中心で保持する適合度ＤＤｉ：軸端点で保持する適合度次に、ｎ次元に拡張した場合の所属度計算方法を３次元
を例に取り、図１３を参照しながら説明する。なお、図
１３で使用されている記号の意味は次の通りである。R: normalized distance from the center of the cluster γ: new coordinate converted in the direction of the eigenvector λ: eigenvalue for the eigenvector i: number of dimensions of the feature c: coefficient for increasing the ellipsoidal diameter determined by the eigenvalue o : Fitness held at the center of the cluster DDi: Fitness held at the end point of the axis Next, a method of calculating the degree of membership when n-dimensionally extended will be described with reference to FIG. 13 taking a three-dimensional example. The meanings of the symbols used in FIG. 13 are as follows.

【００９４】ｘ、ｙ、ｚ：楕円体を構成している軸の名
称Ｘｐ、Ｙｐ、Ｚｐ：各軸の正方向端点で保持する適合度ｏ：楕円体中心で保持する適合度（ｘ１、ｙ１、ｚ１）：未知の入力データの直交座標表
現（ｒ、Θ、Φ）：未知の入力データの極座標表現いま、図１３に示されるように、未知の入力データが、
各軸の正方向で囲まれる領域に存在するときを考える。
このとき所属度は、極座標を用いて次式（数１４）に従
って求めることができる。ただし、最終式は直交座標を
用いた式に近似されている。X, y, z: names of axes forming the ellipsoid Xp, Yp, Zp: fitness held at the positive end point of each axis o: fitness held at the center of the ellipsoid (x1, y1 , Z1): rectangular coordinate expression of unknown input data (r, Θ, Φ): polar coordinate expression of unknown input data Now, as shown in FIG.
Let us consider a case where it exists in a region surrounded by the positive direction of each axis.
At this time, the degree of affiliation can be obtained according to the following equation (Equation 14) using polar coordinates. However, the final expression is approximated by an expression using rectangular coordinates.

【００９５】[0095]

【数１４】この考え方をｎ次元空間に拡張すると、先にも述べたよ
うに、次式（数１５）のようになる。[Equation 14] When this idea is extended to an n-dimensional space, as described above, the following equation (Equation 15) is obtained.

【００９６】[0096]

【数１５】 (Equation 15)

【発明の効果】以上の説明から明らかなように、本発明
によれば、この種のクラスタを使用した認識システムに
クラスタ内の座標位置により定まる適合度の概念を導入
したことにより、この種の認識システムにおける認識精
度を格段に向上させることができる。As is apparent from the above description, according to the present invention, by introducing the concept of the degree of fitness determined by the coordinate position in the cluster to the recognition system using such a cluster, The recognition accuracy in the recognition system can be significantly improved.

[Brief description of the drawings]

【図１】本発明にかかる認識システム並びに適合度付き
クラスタ生成部の構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a recognition system and a cluster generation unit with a fitness according to the present invention.

【図２】２軸使用時の適合度付加位置を示す図である。FIG. 2 is a diagram showing a fitness addition position when using two axes.

【図３】誤認識データとクラスタとの関係を示す図であ
る。FIG. 3 is a diagram showing a relationship between erroneously recognized data and clusters.

【図４】学習処理の流れを示すフローチャートである。FIG. 4 is a flowchart illustrating a flow of a learning process.

【図５】クラスタに保持される適合度とメンバシップ関
数との関係を示す図である。FIG. 5 is a diagram showing a relationship between a fitness held in a cluster and a membership function.

【図６】クラスタを使用した判定部の処理の流れを示す
フローチャートである。FIG. 6 is a flowchart illustrating a flow of a process of a determination unit using a cluster.

【図７】未知の入力データに対する所属度演算処理の内
容を示す図である。FIG. 7 is a diagram showing the contents of a membership calculation process for unknown input data.

【図８】適合度を直線式で補完する概念をイメージ化し
て示す図である。FIG. 8 is a diagram illustrating the concept of complementing the degree of conformity with a linear expression.

【図９】適合度を指数関数で補完する概念をイメージ化
して示す図である。FIG. 9 is a diagram illustrating the concept of complementing the degree of fitness with an exponential function.

【図１０】角度Θによる補完の概念をイメージ化して示
す図である。FIG. 10 is a diagram showing an image of a concept of complementation based on an angle Θ.

【図１１】楕円で囲まれた領域に学習データの存在例を
示す図である。FIG. 11 is a diagram showing an example of the existence of learning data in a region surrounded by an ellipse.

【図１２】四角形で囲まれた領域に学習データの存在例
を示す図である。FIG. 12 is a diagram showing an example of the existence of learning data in a region surrounded by a rectangle.

【図１３】３次元空間における所属度演算の概念を示す
図である。FIG. 13 is a diagram showing a concept of a membership calculation in a three-dimensional space.

[Explanation of symbols]

ａ従来手法によるクラスタの生成部ｂクラスタ情報ｃ適合度付きクラスタ生成部ｄクラスタ形状の決定部ｅ適合度の付加部ｆ適合度の学習部ｇ適合度付加済みクラスタｈ認識システムｉ内包判定部ｊ内包されない場合の処理の流れを示す矢印ｋ内包される場合の処理の流れを示す矢印ｌ所属度算出部ｍ認識結果決定部 a Cluster generation unit according to the conventional method b Cluster information c Cluster generation unit with fitness level d Cluster shape determination unit e Fitness degree addition unit f Fitness degree learning unit g Fitness level added cluster h Recognition system i Inclusion determination unit j Arrow indicating the flow of processing when not included k Arrow indicating the flow of processing when included l l Membership calculation unit m Recognition result determination unit

Claims

[Claims]

A step of comparing input data with clusters corresponding to respective categories in a feature amount space to determine which cluster includes the input data; and determining that the input data is included. Determining the degree of affiliation of the input data with respect to the cluster by referring to the degree of fit stored in advance in association with the coordinate position in the cluster, based on the degree of affiliation with respect to the cluster. And recognizing a category to which the input data belongs.

2. The recognition method according to claim 1, further comprising the step of changing the fitness information by learning input data whose category recognition result is known.

3. The step of recognizing the category includes, when there are a plurality of clusters determined to include the input data, among the clusters, only the cluster having the highest degree of membership belongs to the input data. The recognition method according to claim 1, wherein the recognition is performed as a cluster.

4. The step of recognizing the category includes, when recognizing a cluster to which the input data belongs, comparing the magnitude of the degree of belonging with a reference value to determine whether the cluster belongs to the “unknown” or “XX category”. I'm not convinced. "
3 of "I can say that it is included in the XX cluster with certainty"
The recognition method according to claim 1, wherein the recognition is performed as follows.

5. The fitness information includes fitness at a specific coordinate position inside a cluster and a membership function expressing fitness at other coordinate positions inside the cluster based on the fitness. The recognition method according to claim 1, wherein:

6. The specific coordinate position inside a cluster constituting the fitness information is a position on an axis constituting the figure and a center position of the figure when the cluster is surrounded by a figure. The recognition method according to claim 5, characterized in that:

7. When a space in which a cluster is created is two-dimensional, any one of a circle, an ellipse, and a rectangle is used as a figure surrounding the cluster, and the specific coordinate position inside the cluster is the Total 5 of the center of the figure and the end points of the axes
The recognition method according to claim 6, wherein the recognition method is a point.

8. When a space in which a cluster is created is three-dimensional, any of a sphere, an ellipsoid, and a rectangular parallelepiped is used as a figure surrounding the cluster, and the specific coordinate position inside the cluster is defined as Total of the center of the figure and the end point of the axis 7
The recognition method according to claim 6, wherein the recognition method is a point.

9. When the space in which a cluster is created is n-dimensional, any one of a hypercircle, a hyperellipsoid, and a hyperplane is used as a figure surrounding the cluster, and a specific coordinate position inside the cluster is determined. Is a total of (2n + 1) points of the center of the figures and the end points of the axes.
Recognition method.

10. The step of changing the fitness information by learning the input data includes adopting, as learning data, data that is known to correctly enter a learning target cluster, and a fitness initial value of the specific coordinate position. 3 is set to 0 (or a minimum value as the degree of fitness), and by providing learning data, the degree of fitness surrounding an area including the data is increased. Method.

11. The step of changing the fitness information by learning the input data uses, as learning data, data that is known to be erroneously (entered into data of a different category) in a learning target cluster. ,
In addition, by setting the initial value of the fitness of the specific coordinate position to 1 (or the highest value as the fitness) and providing learning data, the fitness of surrounding the area including the data is reduced. 3. The recognition method according to claim 2, wherein:

12. A means for comparing input data with clusters corresponding to respective categories in a feature amount space to determine which cluster includes the input data, and determines that the input data is included. Means for determining the degree of affiliation of the input data with respect to the cluster by referring to the degree of fit stored in advance in association with the coordinate position in the cluster, based on the degree of affiliation with respect to the cluster. Means for recognizing a category to which the input data belongs.

13. The recognition apparatus according to claim 12, further comprising means for changing the fitness information by learning input data whose category recognition result is known.

14. When there are a plurality of clusters that are determined to include input data, the means for recognizing the category determines that only one of the clusters having the highest degree of membership belongs to the input data. 13. The recognition device according to claim 12, wherein the recognition is performed as a cluster.

15. The category recognizing means compares the magnitude of the degree of belonging with a reference value when recognizing a cluster to which the input data belongs.
13. The recognition apparatus according to claim 12, wherein three types of recognition are performed: "included in the xx category but cannot be convinced", and "can be said to be confidently included in the xx cluster".

16. The fitness information includes a fitness at a specific coordinate position inside a cluster and a membership function expressing fitness at other coordinate positions inside the cluster based on the fitness. The recognition device according to claim 12, characterized in that:

17. The specific coordinate position inside a cluster constituting the fitness information is a position on an axis constituting the figure and a center position of the figure when the cluster is surrounded by a figure. The recognition device according to claim 16, characterized in that:

18. When a space in which a cluster is created is two-dimensional, any of a circle, an ellipse, and a rectangle is used as a figure surrounding the cluster, and the specific coordinate position inside the cluster is 18. The recognition apparatus according to claim 17, wherein a total of five points, that is, a center of the figure and an end point of the axis.

19. When a space in which a cluster is created is three-dimensional, any of a sphere, an ellipsoid, and a cuboid is used as a figure surrounding the cluster, and the specific coordinate position inside the cluster is defined as 18. The recognition apparatus according to claim 17, wherein a total of seven points, that is, a center of the figure and an end point of the axis are provided.

20. When the space in which the cluster is created is n-dimensional, the figure surrounding the cluster is a hypercircle, hyperellipsoid,
18. The method according to claim 17, wherein any one of hyperplanes is used, and the specific coordinate position inside the cluster is a total of (2n + 1) points of the centers of the figures and the end points of the axes. Recognition device.

21. The means for changing the fitness information by learning the input data, employs data known to be correctly included in a learning target cluster as learning data, and includes a fitness initial value of the specific coordinate position. 14. The recognition according to claim 13, wherein by setting learning data to 0 (or a minimum value as the degree of conformity) and providing learning data, the degree of conformity surrounding a region including the data is increased. apparatus.

22. The means for changing the fitness information by learning the input data employs, as learning data, data known to be erroneously (entered into data of a different category) in a learning target cluster. And setting the initial value of the fitness of the specific coordinate position to 1 (or the highest value as the fitness) and providing learning data,
14. The recognition apparatus according to claim 13, wherein the degree of suitability surrounding the area including the data is reduced.

23. A recording medium on which a computer program for realizing the device according to claim 12 is recorded.