JPH04295893A

JPH04295893A - Pattern recognition system

Info

Publication number: JPH04295893A
Application number: JP3061842A
Authority: JP
Inventors: Akio Ando; 彰男安藤
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 1991-03-26
Filing date: 1991-03-26
Publication date: 1992-10-20
Anticipated expiration: 2015-11-20
Also published as: JP3110478B2

Abstract

PURPOSE:To offer the pattern recognition system having high discriminating ability. CONSTITUTION:Voice data is sampled by a sampling means 1, an LPC cepstrum coefficient is calculated by an LPC cepstrum analyzing means 2, vowel data is segmented by a vowel data segmenting means 3, this segmented data is set as learning data, this learning data is recognized by using standard patterns prepared by plural pieces each at every category by a standard pattern learning means 4, the degree of an erroneous recognition at that time is constituted as a function of this standard pattern, an optimal pattern for minimizing this function is derived and stored in a storage means 5, the derived optimal standard pattern and input data are collated by a collating means 6, and based on its result, a category corresponding to the standard pattern being the nearest to the input data is derived as a result of recognition by a deciding means 7.

Description

[Detailed description of the invention]

【０００１】0001

【産業上の利用分野】本発明は、標準パターンを用いて
データの認識を行なうパターン認識方式に係り、特に、
各カテゴリごとに複数個の標準パターンを用意してデー
タの識別を行なうパターン認識方式に関する。[Field of Industrial Application] The present invention relates to a pattern recognition method for recognizing data using standard patterns, and in particular,
This invention relates to a pattern recognition method that identifies data by preparing a plurality of standard patterns for each category.

【０００２】0002

【発明の概要】本発明は、各カテゴリごとに複数個ずつ
用意した標準パターンを用いてデータの識別を行なうパ
ターン認識のための標準パターン構成法に関するもので
、与えられた学習データ全体を標準パターンを用いて認
識した時の誤認識の程度を当該標準パターンの関数とし
て構成した上、この関数を最小化する最適標準パターン
を求めることにより、従来から用いられている標準パタ
ーンの学習法を用いて標準パターンを構成して認識を行
なう場合よりも高い識別性能を提供するパターン認識方
式である。SUMMARY OF THE INVENTION The present invention relates to a standard pattern construction method for pattern recognition in which data is identified using a plurality of standard patterns prepared for each category. By configuring the degree of misrecognition when recognizing using the standard pattern as a function of the standard pattern, and then finding the optimal standard pattern that minimizes this function, we can use the conventional standard pattern learning method. This is a pattern recognition method that provides higher identification performance than when recognition is performed by configuring standard patterns.

【０００３】0003

【従来の技術】各カテゴリごとに複数個の標準パターン
を用意してデータの識別を行なうパターン認識方式のた
めの標準パターン構成法として、例えば、Ｔ．Ｋｏｈｏ
ｎｅｎ“Ｓｅｌｆ　　Ｏｒｇａｎｉｚａｔｉｏｎ　　ａ
ｎｄ　　Ａｓｓｏｃｉａｔｉｖｅ　　Ｍｅｍｏｒｙ”に
示されているＬＶＱ（Ｌｅａｒｎｉｎｇ　　Ｖｅｃｔｏ
ｒ　　Ｑｕａｎｔｉｚａｔｉｏｎ）があり、その改良版
であるＬＶＱ２は広く用いられているる。ＬＶＱ２は、
学習データを用いて繰返し標準パターンを修正していく
方法で、ｔ回目の学習ステップについて考えると、先ず
１つの学習データ（ｘで表わす）を選んだ上、ｘに最も
近い標準パターン（ｍｉ　（ｔ）で表わす）がｘとは異
なるカテゴリに属し、かつｘに２番目に近い標準パター
ン（ｍｊ　（ｔ）で表わす）がｘと同じカテゴリに属す
る場合において、ｘ，ｍｉ　（ｔ），ｍｊ　（ｔ）の位
置関係がある条件（ｘがｍｉ　（ｔ）とｍｊ　（ｔ）に
よって決まる「窓」の中に入る）を満たす場合に次のよ
うな「学習」を行なう方法である：（１）ｍｉ　をｘから遠ざけるように修正する；ｍｉ　
（ｔ＋１）＝ｍｉ　（ｔ）−α（ｔ）（ｘ−ｍｉ（ｔ）
）（２）ｍｊ　をｘに近づけるよう修正する；ｍｊ　（ｔ
＋１）＝ｍｊ　（ｔ）＋α（ｔ）（ｘ−ｍｊ（ｔ））（３）ｍｉ　，ｍｊ　以外の標準パターンは修正しない
；ｍｋ　（ｔ＋１）＝ｍｋ　（ｔ），　　　　ｋ≠ｉ，
ｊ上式において、α（ｔ）は、初期値が０≦α（０）≦
１を満たすｔの単調減少関数で、繰返しの最終段階にお
いて０となるように設定される。ＬＶＱ２ではこのよう
な学習の繰返し数と、α（ｔ）を適当に設計してやれば
、学習データに対して誤認識数を少なくするような標準
パターンの組が得られることが知られている。2. Description of the Related Art As a standard pattern construction method for a pattern recognition system in which a plurality of standard patterns are prepared for each category and data is identified, for example, T. Koho
nen“Self Organization a
LVQ (Learning Vecto
r Quantization), and its improved version, LVQ2, is widely used. LVQ2 is
Considering the t-th learning step in a method of repeatedly modifying the standard pattern using learning data, first select one learning data (represented by x), and then select the standard pattern (mi (t ) belongs to a different category from x, and the second closest standard pattern to x (denoted by mj (t)) belongs to the same category as x, then x, mi (t), mj (t ) satisfies the condition that x falls within the "window" determined by mi (t) and mj (t): (1) mi Modify to move away from x;mi
(t+1)=mi(t)−α(t)(x−mi(t)
) (2) Modify mj to bring it closer to x; mj (t
+1)=mj (t)+α(t)(x-mj(t)) (3) Standard patterns other than mi and mj are not modified; mk (t+1)=mk (t), k≠i,
jIn the above formula, α(t) has an initial value of 0≦α(0)≦
It is a monotonically decreasing function of t that satisfies 1 and is set to 0 at the final stage of the iteration. It is known that in LVQ2, if the number of repetitions of such learning and α(t) are appropriately designed, a set of standard patterns can be obtained that reduces the number of false recognitions for the learning data.

【０００４】0004

【発明が解決しようとする課題】ＬＶＱ２が標準パター
ンの構成に有効であることは広く知られているが、ＬＶ
Ｑ２には「学習データに対する誤認識数の減少のみを目
的としているため、特に学習データ数が少ない場合には
、未知データに対し高い識別能力を得ることが難しい。」という問題点がある。[Problems to be Solved by the Invention] It is widely known that LVQ2 is effective in configuring standard patterns, but LVQ2
Q2 has the problem that ``Since the purpose is only to reduce the number of false recognitions for learning data, it is difficult to obtain high discrimination ability for unknown data, especially when the amount of learning data is small.''

【０００５】そこで本発明の目的は、誤認識数の概念を
拡張して誤認識の程度を表わす関数を定義した上、この
関数を学習データに対し最小化することによって、少な
い学習データからでも精度良く最適標準パターンを構成
し、この最適標準パターンを用いてデータの識別を行な
うという枠組みの中で、従来法に比べ未知データに対し
て高い識別能力を持つパターン認識方式を提供すること
にある。[0005] Therefore, the purpose of the present invention is to expand the concept of the number of misrecognitions, define a function that expresses the degree of misrecognition, and then minimize this function with respect to learning data, thereby increasing accuracy even from a small amount of learning data. The object of the present invention is to provide a pattern recognition method that has a higher ability to identify unknown data than conventional methods, within the framework of composing an optimal standard pattern and identifying data using this optimal standard pattern.

【０００６】[0006]

【課題を解決するための手段】上記目的を達成するため
本発明は各カテゴリごとに複数個ずつ用意された標準パ
ターンを用いてデータの認識を行なうパターン認識方式
において、与えられた学習データを標準パターンを用い
て認識した時の誤認識の程度を当該標準パターンの関数
として構成し、当該関数を最小化する最適標準パターン
を求め、当該最適標準パターンを用いてデータの認識を
行なうことを特徴とする。[Means for Solving the Problems] In order to achieve the above object, the present invention provides a pattern recognition method in which data is recognized using a plurality of standard patterns prepared for each category. The method is characterized in that the degree of misrecognition when recognizing using a pattern is configured as a function of the standard pattern, an optimal standard pattern that minimizes the function is found, and data is recognized using the optimal standard pattern. do.

【０００７】[0007]

【作用】本発明によれば与えられた学習データ全体を標
準パターンを用いて認識した時の誤認識の程度を当該標
準パターンの関数として構成し、この関数を最小化する
最適標準パターンを求め、この最適標準パターンを用い
てデータ認識を行なうことによって、高い識別性能が得
られる。[Operation] According to the present invention, the degree of misrecognition when the entire given learning data is recognized using a standard pattern is constructed as a function of the standard pattern, and an optimal standard pattern that minimizes this function is determined. By performing data recognition using this optimal standard pattern, high identification performance can be obtained.

【０００８】[0008]

【実施例】以下、実施例に基づいて本発明を詳細に説明
する。EXAMPLES The present invention will be explained in detail below based on examples.

【０００９】先ず、次のように記法を定める。First, the notation is defined as follows.

【００１０】Ｒｐ　：ｐ次元ユークリッド空間Ｋ：認識
すべきパターンのカテゴリ数Ｎｉ　：カテゴリｉに対する学習データの数（ｉ＝１，
…，Ｋ）Ｍｉ　：カテゴリｉに対する標準パターンの数（ｉ＝１
，…，Ｋ）ｘｉｊ：学習データ（ｘｉｊ　　∈Ｒｐ　）（ｉ＝１，
…，Ｋ、ｊ＝１，…，Ｎｉ　）ｔｉｒ：カテゴリｉに対
するｒ番目の標準パターン（ｔｉｒ　　∈Ｒｐ　）（ｉ＝１，…，Ｋ、ｒ＝１，…，Ｍｉ　）Ｅ：標準パタ
ーンの組｛ｔｉｒ｝を用いて全学習データ｛ｘＩＪ｝を
識別した時の誤認識の程度を表わす関数Ｌ，ｍ：繰り返
しの回数本発明は、上記のＥを最小にする標準パターンの組｛ｔ
ｉｒ｝を構成することを特徴とする方式である。Ｅを最
小化する方法として、組合せ最適化問題を解く方法の１
つであるアニーリング法（ｓｉｍｕｌａｔｅｄ　　ａｎ
ｎｅａｌｉｎｇ）を用いた場合の、本発明の実施例のフ
ローチャートを、図１から図３までに示した。図示した
各ブロックにおける実行内容は次の通りである。Rp: p-dimensional Euclidean space K: number of categories of patterns to be recognized Ni: number of learning data for category i (i=1,
..., K) Mi: number of standard patterns for category i (i=1
,...,K) xij: learning data (xij ∈Rp) (i=1,
..., K, j=1, ..., Ni) tir: r-th standard pattern for category i (tir ∈Rp) (i=1, ..., K, r=1, ..., Mi) E: set of standard patterns Functions L, m that express the degree of misrecognition when all the learning data {xIJ} are identified using {tir}: Number of repetitions The present invention creates a set of standard patterns {t
ir}. One of the methods for solving combinatorial optimization problems as a method to minimize E
A simulated annealing method
Flowcharts of an embodiment of the present invention in which the processing method (nealing) is used are shown in FIGS. 1 to 3. The execution contents of each block shown in the figure are as follows.

【００１１】Ｂ２：クラスタリング等の方法を用いて、
学習データの集合｛ｘｉｊ｝から、各カテゴリごとに予
め決められた数の標準パターンを作成した後、得られた
標準パターンの組｛ｔｉｒ｝を出力する。本実施例では
、ＬＢＧクラスタリングアルゴリズムを用いて初期標準
パターンを作成した。B2: Using methods such as clustering,
After creating a predetermined number of standard patterns for each category from the learning data set {xij}, the resulting set of standard patterns {tir} is output. In this example, an initial standard pattern was created using the LBG clustering algorithm.

【００１２】Ｂ４：集合｛ｔｉｒ｝を標準パターンとし
て、学習データの集合｛ｘｉｊ｝を認識した時の誤認識
の程度を表わす関数Ｅを計算する。誤認識された学習デ
ータの個数でＥを定義しても良いが、本実施例では、次
のようにしてＥを計算した。先ず、ある学習データｘｉ
ｊに対して次の２つの距離ｄ１　、ｄ２　を定義する。B4: Using the set {tir} as a standard pattern, calculate a function E representing the degree of misrecognition when the learning data set {xij} is recognized. Although E may be defined by the number of pieces of learning data that are erroneously recognized, in this example, E was calculated as follows. First, some learning data xi
The following two distances d1 and d2 are defined for j.

【００１３】[0013]

【数１】[Math 1]

【００１４】[0014]

【数２】　　　　　　ここにｄは２つのパターン間のユークリッ
ド距離を表わす。この時、λを[Equation 2] Here, d represents the Euclidean distance between two patterns. At this time, λ

【００１５】[0015]

【数３】　　　　　　で定義する。明らかに、０≦λ≦１。この
λと、パラメータκ（０＜κ＜０．５）に対して関数μ
（ｘｉｊ，｛ｔｉｒ｝）を定義する。具体的には、λ＜
０．５−κの場合にはμ＝０、０．５−κ≦λ＜０．５
＋κに対して μ＝（λ＋κ−０．５）／２κ ０．５＋κ≦λに対してμ＝１とする。標準パターンを
用いる識別の場合には、λ＜０．５なら正しく識別され
、λ≧０．５ならば誤って識別される。従って、λ≧０
．５に対して１、それ以外のλに対して０となる関数μ
′を考えると、μ′は誤認識かどうかを０と１で表現す
る。上式のμは、このμ′にパラメータκを導入したも
のであり、誤認識の程度を表わす関数となっている。関数ＥをDefined by [Equation 3]. Obviously, 0≦λ≦1. For this λ and the parameter κ (0<κ<0.5), the function μ
Define (xij, {tir}). Specifically, λ<
If 0.5-κ, μ=0, 0.5-κ≦λ<0.5
μ=(λ+κ−0.5)/2κ for +κ. Let μ=1 for 0.5+κ≦λ. In the case of identification using a standard pattern, if λ<0.5, it is correctly identified, and if λ≧0.5, it is incorrectly identified. Therefore, λ≧0
．． A function μ that is 1 for 5 and 0 for other λ
′, μ′ expresses whether or not it is a misrecognition using 0 and 1. μ in the above equation is obtained by introducing the parameter κ into μ′, and is a function representing the degree of misrecognition. function E

【００１６】[0016]

【数４】　　　　　　で定義する。明らかにＥは標準パターンの
組｛ｔｉｒ｝の関数であり、標準パターンとして｛ｔｉ
ｒ｝を選んで学習データ全体を識別したときの誤認識の
程度を表わす。本実施例では、κ＝０．１と設定した。Defined by [Equation 4]. Obviously E is a function of the set of standard patterns {tir}, and as standard patterns {ti
r} represents the degree of misrecognition when the entire learning data is identified. In this example, κ was set to 0.1.

【００１７】Ｂ６：変数ｎに０を代入する。B6: Assign 0 to variable n.

【００１８】Ｂ８：変数ｉに０を代入する。B8: Assign 0 to variable i.

【００１９】Ｂ１０：２つの誤認識の程度ＥとＥ＊　を
比較する。B10: Compare the two degrees of misrecognition E and E*.

【００２０】Ｂ１２：Ｂ１０でＥ＜Ｅ＊　のときに実行
するブロックであって、標準パターンの集合｛ｔｉｒ｝
の全要素を集合｛ｔｉｒ＊　｝の要素とし、Ｅの値をＥ
＊　に代入する操作を行なう。B12: A block executed when E<E* in B10, which is a set of standard patterns {tir}
Let all elements of set {tir*} be elements, and let the value of E be E
Perform the operation of assigning to *.

【００２１】Ｂ１４：Ｂ１０でＥ＜Ｅ＊　でないときに
およびＢ１２の次に実行するブロックであって、アニー
リング法におけるパラメータの１つである「温度」Ｔｎ
　を設定する。B14: "Temperature" Tn, which is a block to be executed when E<E* is not satisfied in B10 and after B12, and is one of the parameters in the annealing method.
Set.

【００２２】Ｂ１６：学習データの集合｛ｘｉｊ｝から
、なんらかの方法で１つの要素ｘｉｊを選択する。本実
施例では、集合｛ｘｉｊ｝からランダムに１つの学習デ
ータｘｉｊを選択した。B16: Select one element xij from the learning data set {xij} by some method. In this example, one learning data xij is randomly selected from the set {xij}.

【００２３】Ｂ１８：標準パターンの集合｛ｔｉｊ｝か
ら、なんらかの方法で１つの要素ｔｕｒを選択する。本
実施例では、集合｛ｔｉｊ｝からランダムに１つの標準
パターンｔｕｒを選択した。B18: Select one element tur from the standard pattern set {tij} by some method. In this example, one standard pattern tur is randomly selected from the set {tij}.

【００２４】Ｂ２０：標準パターンの集合｛ｔｉｒ｝に
摂動を与え、新たな標準パターン集合｛ｔｉｒ′｝を出
力する。摂動は、次式によって与える。B20: Perturb the standard pattern set {tir} and output a new standard pattern set {tir'}. The perturbation is given by the following equation.

【００２５】ｔｕｒ′＝ｔｕｒ＋ν・θ・（ｘｉｊ−ｔｕｒ），ｔｉ
ｋ′＝ｔｉｋ；　　　　　　　　ｉ≠μ，ｋ≠ｒ．ここ
に、θは区間［−１．０，１．０］上の一様乱数、νは
０≦ν≦１を満たすパラメータで、本実施例では０．０
２と設定した。[0025] tur'=tur+ν・θ・(xij−tur), ti
k'=tik; i≠μ, k≠r. Here, θ is a uniform random number on the interval [-1.0, 1.0], and ν is a parameter satisfying 0≦ν≦1, which is 0.0 in this example.
It was set to 2.

【００２６】Ｂ４：前記同様に、集合｛ｔｉｒ′｝を標
準パターンとして、学習データの集合｛ｘｉｊ｝を認識
した時の誤認識の程度を表わす関数Ｅ′を計算する。B4: In the same manner as described above, a function E' representing the degree of misrecognition when the learning data set {xij} is recognized is calculated using the set {tir'} as a standard pattern.

【００２７】Ｂ２２：区間［０，１）の間で一様分布す
る乱数ωを発生する。B22: Generate a random number ω uniformly distributed between the interval [0, 1).

【００２８】Ｂ２４：Ｅ′−Ｅが負であるか、あるいは
Ｂ２２で発生されたωがｅｘｐ（−（Ｅ′−Ｅ）／Ｔｎ
　）未満であるかどうかを判定する。B24: E'-E is negative or ω generated in B22 is exp(-(E'-E)/Tn
).

【００２９】Ｂ２６：Ｂ２４の判定がＹｅｓのときに実
行するブロックであって、標準パターンの集合｛ｔｉｒ
′｝の全要素を集合｛ｔｉｒ｝の要素とし、Ｅ′の値を
Ｅに代入する操作を行なう。B26: This block is executed when the determination in B24 is Yes, and is a block that is executed when the determination in B24 is Yes, and is a block that executes when the determination in B24 is Yes.
'} are set as elements of the set {tir}, and the value of E' is assigned to E.

【００３０】Ｂ２８：Ｂ２４の判定がＮｏのとき、およ
びＢ２６の次に実行するブロックであって、ｉの値を１
だけ増加させる。B28: When the determination in B24 is No, this block is executed next after B26, and the value of i is set to 1.
increase only.

【００３１】Ｂ３０：ｉが予め判定された値ｍより大き
いかどうかを判定する。判定がＮｏのときはＢ１０にす
すむ。B30: Determine whether i is greater than a predetermined value m. If the determination is No, proceed to B10.

【００３２】Ｂ３２：Ｂ３０で判定がＹｅｓのときに実
行するブロックであって、ｎの値を１だけ増加させる。B32: This block is executed when the determination in B30 is Yes, and increases the value of n by 1.

【００３３】Ｂ３４：ｎが予め設定された値Ｌより大き
いかどうかを判定する。判定がＮｏのときはＢ８にすす
む。B34: Determine whether n is larger than a preset value L. If the determination is No, proceed to B8.

【００３４】Ｂ３６：Ｂ３４で判定がＹｅｓのときに実
行するブロックであって、Ｂ１０と同様。B36: This block is executed when the determination in B34 is Yes, and is the same as B10.

【００３５】Ｂ１２：前記Ｂ１２と同様Ｂ３８：集合｛
ｔｉｒ＊　｝を最終的な標準パターンの集合とする。B12: Same as B12 above B38: Set {
tir*} is the final set of standard patterns.

【００３６】以上に示した実施例では、アニーリング法
を用いた場合について述べてきたが、例えば、Ｂ２４の
判断条件をＥ′−Ｅ＜０のみにすると最急降下法を用い
る場合に対応する。従って、本発明は、アニーリング法
を用いる場合に限らず、他の最適化手法を用いる場合に
も常に適応し得ることは勿論である。In the embodiment shown above, the case where the annealing method is used has been described, but for example, if the judgment condition for B24 is set only to E'-E<0, this corresponds to the case where the steepest descent method is used. Therefore, it goes without saying that the present invention is applicable not only to cases where the annealing method is used, but also to cases where other optimization methods are used.

【００３７】以上に述べた通り、少ない学習データから
認識精度の高い標準パターンを構成することができるの
で、標準パターンを用いるパターン認識の問題であれば
、認識能力の向上を図るために本方法を適応することが
できる。As mentioned above, standard patterns with high recognition accuracy can be constructed from a small amount of training data, so if the problem is pattern recognition using standard patterns, this method can be used to improve recognition ability. Able to adapt.

【００３８】ついで音声認識において、標準パターンを
用いて母音を認識する装置に本発明を適応した場合につ
いて述べる図４は、本発明を実施したパターン認識装置
の機能構成を示すブロック図であり、その詳細は以下の
通りである。Next, we will discuss the case where the present invention is applied to a device that recognizes vowels using standard patterns in speech recognition. FIG. 4 is a block diagram showing the functional configuration of a pattern recognition device that implements the present invention. Details are as follows.

【００３９】１：入力された音声情報を標本化する標本
化手段。1: Sampling means for sampling input voice information.

【００４０】２：標本化された音声データに基づいて短
い時間区間ごとにＬＰＣケプストラム係数を計算するＬ
ＰＣケプストラム分析手段。2: LPC cepstral coefficients are calculated for each short time interval based on sampled audio data.
PC cepstrum analysis tool.

【００４１】３：手段２からの音声データの中から母音
中心部分のＬＰＣケプストラム係数を切り出す母音デー
タ切り出し手段。3: Vowel data extraction means for extracting LPC cepstral coefficients at the center of vowels from the voice data from means 2.

【００４２】４：手段３からのデータを学習データとし
、標準パターンを構成する標準パターン学習手段であっ
て、ＣＰＵと、前記のＢ２〜Ｂ３８に示す各ブロックを
ＣＰＵに実行させるためのプログラムを格納したＲＯＭ
と、ワークエリアを有するＲＡＭとを具えている。4: A standard pattern learning means that uses the data from the means 3 as learning data and constitutes a standard pattern, which stores a CPU and a program for causing the CPU to execute each block shown in B2 to B38 above. ROM
and a RAM having a work area.

【００４３】５：手段４により得られた標準パターンを
記憶した記憶手段。5: Storage means storing the standard pattern obtained by means 4.

【００４４】６：手段５の標準パターンの集合と手段３
からの入力音声母音データとを照合し、入力データと標
準パターンとの距離を計算する照合手段。6: Set of standard patterns of means 5 and means 3
Collation means that collates the input speech vowel data from and calculates the distance between the input data and the standard pattern.

【００４５】７：手段６からの計算結果に基づいて入力
と最も距離が近かった標準パターンに対応する母音カテ
ゴリを判定し、認識結果として出力する判定手段。7: Determining means for determining the vowel category corresponding to the standard pattern closest to the input based on the calculation result from the means 6, and outputting the result as a recognition result.

【００４６】上記装置を用いて母音を認識した結果につ
いて述べる。母音として撥音を含む６母音／ａ／，／ｉ
／，／ｕ／，／ｅ／，／ｏ／，／Ｎ／を想定し、これら
６つの各カテゴリごとに複数個の標準パターンを用意し
て母音認識を行なう場合について検討した。但し各カテ
ゴリごとの標準パターン数は同数とした。検討に当って
はＡＴＲデータベースの中の話者ＭＡＵが文節単位に発
声した音声データを、標本化周波数１５ｋＨｚで標本化
し（手段１）、１８次のＬＰＣケプストラム分析を行な
った後（手段２）、母音中心３フレームを平均して母音
データを作成した。母音中心部は同データベースに付与
された母音中心のラベルに基づいて定めた。このような
データを、各カテゴリごとに３５０個ずつ作成し、これ
らを５０個ずつに分割（７分割）して、データセットＡ
〜Ｇを作成した。また、各カテゴリごとに１７５個ずつ
に分割（２分割）して、データセット１，２を作成した
（手段３）。The results of vowel recognition using the above device will be described. 6 vowels /a/, /i including pellicles as vowels
/, /u/, /e/, /o/, /N/ were assumed, and a case where a plurality of standard patterns were prepared for each of these six categories and vowel recognition was performed was studied. However, the number of standard patterns for each category was the same. In the study, the speech data uttered by the speaker MAU in the ATR database in units of phrases was sampled at a sampling frequency of 15 kHz (Means 1), and after performing 18th-order LPC cepstral analysis (Means 2), Vowel data was created by averaging three frames centered on the vowel. The vowel center was determined based on the vowel center label given in the same database. Create 350 pieces of data for each category, divide them into 50 pieces (7 parts), and create Dataset A.
~Created G. Further, data sets 1 and 2 were created by dividing each category into 175 pieces (divided into 2) (means 3).

【００４７】学習データ数の少ない場合として、データ
セットＡ〜Ｇを用いて手段４，５，６，７によって実験
を行なった。先ず、データセットＡを用いて標準パター
ンを構成し、データセットＡ以外のデータ（未知データ
）の認識を行なった。さらに、データセットＡ以外のデ
ータを用いて学習を行なう場合についても同様の認識を
行なった。これらの７通りの結果を平均して最終的な認
識結果とした。また、学習データが比較的多い場合とし
て、データセット１，２を用いて上記と同様な実験を行
なった。なお、本発明を用いるに当っては、繰返し回数
をＬ＝５０００、ｍ＝１０と設定した。As a case in which the number of learning data is small, experiments were conducted by means 4, 5, 6, and 7 using data sets A to G. First, a standard pattern was constructed using data set A, and data other than data set A (unknown data) was recognized. Furthermore, similar recognition was made when learning was performed using data other than data set A. These seven results were averaged to obtain the final recognition result. In addition, an experiment similar to the above was conducted using Data Sets 1 and 2 in a case where there was a relatively large amount of learning data. In addition, when using the present invention, the number of repetitions was set as L=5000 and m=10.

【００４８】従来の代表的な標準パターン学習アルゴリ
ズムであるＬＶＱ２アルゴリズムを用いる場合について
も同様の認識実験を行なった。先ず、全学習データをラ
ンダムに並べかえたものを１０通り作成し、これらの中
から順に学習データを選択してＬＶＱ２により標準パタ
ーンの修正を行なった。その際、α（０）＝０．０１５
と設定し、標準パターン修正の条件を決める窓には、２
０％窓を用いた。これらはＬＶＱ２の提案者Ｔ．Ｋｏｈ
ｏｎｅｎが推奨する値である。初期標準パターンは、本
発明と同じくＬＢＧクラスタリングアルゴリズムで与え
た。A similar recognition experiment was also conducted using the LVQ2 algorithm, which is a typical conventional standard pattern learning algorithm. First, 10 patterns were created by randomly rearranging all the learning data, learning data were selected in order from among these, and the standard pattern was corrected using LVQ2. In that case, α(0)=0.015
2 in the window that determines the conditions for standard pattern correction.
A 0% window was used. These are LVQ2 proposer T. Koh
This is the value recommended by onen. The initial standard pattern was given by the LBG clustering algorithm as in the present invention.

【００４９】実験の結果を図５，図６に表で示す。図５
は学習データ数が比較的多い場合、図６は学習データ数
の少ない場合である。図５，図６の縦軸は誤認識率、横
軸は各カテゴリごとの標準パターン数を示す。また、参
考のため、図５，図６にＬＢＧアルゴリズムで得られた
標準パターン（すなわち、本発明およびＬＶＱ２の初期
標準パターン）を用いた場合についての結果も示した。図５に示されているように学習データ数が比較的多い場
合には、本発明、ＬＶＱ２とも初期状態を与えたＬＢＧ
アルゴリズムより誤認識率が少なくなっており、学習が
良好に行なわれいることが分る。しかも本発明の方が良
好な結果となっている。学習データ数が少ない場合（図
６）には、ＬＶＱ２の結果はＬＢＧアルゴリズムと余り
変わらず、従ってＬＶＱ２学習の効果がほとんど表われ
ていない。一方、本発明を用いた場合には概ね良好な結
果が得られている。The results of the experiment are shown in tables in FIGS. 5 and 6. Figure 5
Fig. 6 shows a case where the number of learning data is relatively large, and Fig. 6 shows a case where the number of learning data is small. The vertical axes in FIGS. 5 and 6 show the false recognition rate, and the horizontal axes show the number of standard patterns for each category. For reference, FIGS. 5 and 6 also show the results obtained when the standard pattern obtained by the LBG algorithm (that is, the initial standard pattern of the present invention and LVQ2) was used. As shown in FIG. 5, when the number of learning data is relatively large, both the present invention and LVQ2 use LBG
The false recognition rate is lower than that of the algorithm, indicating that learning is performed well. Moreover, the present invention has better results. When the number of learning data is small (FIG. 6), the results of LVQ2 are not much different from the LBG algorithm, and therefore the effect of LVQ2 learning is hardly visible. On the other hand, when the present invention was used, generally good results were obtained.

【００５０】これらの実験により、本発明が、従来法に
見られた学習データ数が少ない場合に学習効果が表われ
ないという欠点を改善した方法であり、しかも、学習デ
ータをある程度増やした場合でも従来法を上回る方法で
あることが確認された。These experiments have shown that the present invention is a method that improves the drawback of the conventional method that learning effects are not obtained when the number of learning data is small, and moreover, even when the number of training data is increased to a certain extent, It was confirmed that this method is superior to conventional methods.

[Brief explanation of drawings]

【図１】本発明の実施例の動作を示すフローチャートの
一部である。FIG. 1 is a part of a flowchart showing the operation of an embodiment of the present invention.

【図２】同フローチャートの他の一部である。FIG. 2 is another part of the same flowchart.

【図３】同フローチャートの残りの一部である。FIG. 3 is the remaining part of the same flowchart.

【図４】本発明実施例にかかるパターン認識装置の機能
構成を示すブロック図である。FIG. 4 is a block diagram showing the functional configuration of a pattern recognition device according to an embodiment of the present invention.

【図５】本発明の実施例における実験結果の一例を示す
図である。FIG. 5 is a diagram showing an example of experimental results in an example of the present invention.

【図６】同実験結果の他の例を示す図である。FIG. 6 is a diagram showing another example of the same experimental results.

[Explanation of symbols]

１　　標本化手段２　　ＬＰＣケプストラム分析手段３　　母音データ切り出し手段４　　標準パターン学習手段５　　記憶手段６　　照合手段７　　判定手段 1 Sampling means 2 LPC cepstral analysis means 3 Vowel data extraction means 4 Standard pattern learning means 5. Storage means 6. Verification means 7 Judgment means

Claims

[Claims]

Claim 1: In a pattern recognition method that recognizes data using a plurality of standard patterns prepared for each category, the degree of misrecognition when given training data is recognized using standard patterns. A pattern recognition method characterized in that the optimal standard pattern is configured as a function of the standard pattern, the optimal standard pattern that minimizes the function is found, and data is recognized using the optimal standard pattern.