JP2973805B2

JP2973805B2 - Standard pattern creation device

Info

Publication number: JP2973805B2
Application number: JP5310518A
Authority: JP
Inventors: 栄子山田; 浩明服部
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1993-12-10
Filing date: 1993-12-10
Publication date: 1999-11-08
Anticipated expiration: 2014-11-08
Also published as: JPH07160287A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、音声認識装置内で用い
られる標準パターンを作成するための、標準パターン作
成装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a standard pattern creating device for creating a standard pattern used in a speech recognition device.

【０００２】[0002]

【従来の技術】音声認識においては、予め認識対象とな
る音素、単語等の標準パターンを用意しておき入力音声
と標準パターンの比較を行ない、最も類似している標準
パターンの属するカテゴリの音素、あるいは単語が発声
されたものと判定を行なうことが多い。このような方式
においては、一般に、標準パターン数が多いほど音声の
種々の変動を表現できるため、良い認識率が得られる。
しかし、その反面、多くのメモリー量と計算量とを必要
とする。2. Description of the Related Art In speech recognition, standard patterns such as phonemes and words to be recognized are prepared in advance, and the input speech is compared with the standard patterns. Alternatively, it is often determined that a word has been uttered. In such a method, in general, the more the number of standard patterns, the more various fluctuations in voice can be expressed, so that a good recognition rate is obtained.
However, on the other hand, it requires a large amount of memory and computation.

【０００３】クラスタリング（Ａ．Ｇｅｒｓｈｏａｎ
ｄＶ．Ｃｕｐｅｒｍａｎ，ＩＥＥＥＣｏｍｍｕｎ，
Ｍｅｇ．２１，９，ｐｐ．１５−２１，１９８３、以下
これを文献１とする）の手法は、認識性能を保ちつつ計
算時間、メモリー量を削減するために、標準パターンを
削減する方法として知られている。その中でも効率良く
標準パターンを削減できる方法として、学習パターンの
特徴ベクトルを分割し、分割された特徴ベクトルごとに
クラスタリングを行なうセパレートクラスタリング（日
本音響学会誌４４巻８号、１９８８、ｐ５９５〜６０２
「セパレートベクトル量子化を用いたスペクトログラム
の正規化」、以下これを文献２とする）が挙げられる。
文献２では、特徴ベクトルは、パワー及びＬＰＣパラメ
ーターによって構成されている。以下、文献２を例にと
って従来の標準パターン作成装置を説明する。[0003] Clustering (A. Gersho an
dV. Cuperman, IEEE Commun,
Meg. 21,9, pp. 15-21, 1983, hereinafter referred to as Document 1) is known as a method for reducing the standard pattern in order to reduce the calculation time and the memory amount while maintaining the recognition performance. Among them, as a method for efficiently reducing the standard pattern, a separate clustering method in which a feature vector of a learning pattern is divided and clustering is performed for each of the divided feature vectors (Journal of the Acoustical Society of Japan, Vol. 44, No. 8, 1988, pp. 595-602).
"Normalization of spectrogram using separate vector quantization", hereinafter referred to as Document 2).
In Literature 2, the feature vector is composed of power and LPC parameters. Hereinafter, a conventional standard pattern creating apparatus will be described with reference to Document 2 as an example.

【０００４】図２は従来の標準パターン作成装置の１例
を示す構成図である。音声入力部２００に音声が入力さ
れ、分析部２１０に送られる。送られた音声波形は、分
析部２１０において分析され、パワーとＬＰＣパラメー
ターの特徴ベクトルが抽出される。抽出された特徴ベク
トルを用い学習された第１の標準パターンは、学習パタ
ーン記憶部２２０に保持される。パワーは、学習パター
ン記憶部２２０からパワークラスタリング部２３０に送
られクラスタリングされる。また、ＬＰＣパラメーター
は、学習パターン記憶部２２０からＬＰＣパラメーター
クラスタリング部２４０に送られクラスタリングされ
る。パワークラスタリング部２３０とＬＰＣパラメータ
ークラスタリング部２４０とでクラスタリングされた情
報を用い、学習パターン記憶部２２０より送られた学習
パターンからパターン作成部２５０において標準パター
ンが作成される。パターン作成部２５０で作成された標
準パターンは、標準パターン出力部２６０に送られ出力
される。FIG. 2 is a block diagram showing an example of a conventional standard pattern creating apparatus. A voice is input to the voice input unit 200 and sent to the analysis unit 210. The transmitted voice waveform is analyzed in the analysis unit 210, and the power and LPC parameter feature vectors are extracted. The first standard pattern learned using the extracted feature vector is stored in the learning pattern storage unit 220. The power is sent from the learning pattern storage unit 220 to the power clustering unit 230 and clustered. The LPC parameters are sent from the learning pattern storage unit 220 to the LPC parameter clustering unit 240 and are clustered. Using the information clustered by the power clustering unit 230 and the LPC parameter clustering unit 240, a standard pattern is created in the pattern creation unit 250 from the learning pattern sent from the learning pattern storage unit 220. The standard pattern created by the pattern creation unit 250 is sent to the standard pattern output unit 260 and output.

【０００５】以上のように、パワーとＬＰＣパラメータ
ーのクラスタリングを行なうことによって、特徴ベクト
ルを一括してクラスタリングを行なうより、よりメモリ
ー量が少なく、かつ、量子化歪みの少ない標準パターン
が得られたと述べられている。[0005] As described above, by performing clustering of power and LPC parameters, a standard pattern with a smaller amount of memory and less quantization distortion was obtained than by clustering feature vectors collectively. Have been.

【０００６】[0006]

【発明が解決しようとする課題】文献２では、パワーと
ＬＰＣパラメータの各特徴量ごとに別々にクラスタリン
グを行なっている。この方法では、相関の低いパラメー
ター同士がまとめられる場合があり、その結果、量子化
歪みが増しクラスタリングの効率が低下するために多く
のクラスタを必要とする。本発明の目的はこの問題点を
解決した標準パターン作成装置を提供することにある。In Reference 2, clustering is separately performed for each feature amount of power and LPC parameters. In this method, parameters having low correlation may be put together, and as a result, many clusters are required because quantization distortion increases and clustering efficiency decreases. An object of the present invention is to provide a standard pattern creating apparatus which solves this problem.

【０００７】[0007]

【課題を解決するための手段】本発明による標準パター
ン作成装置は、音声を入力する音声入力部と、入力され
た音声データを分析し特徴ベクトルを抽出する分析部
と、抽出された第１の特徴ベクトルから標準パターンを
学習する学習部と、学習された第１の標準パターンを記
憶する学習パターン記憶部と、前記特徴ベクトル要素間
の相関ど度合いを計算する相関度計算部と、前記相関度
から特徴ベクトル要素間の相関の強さを計算し、特徴ベ
クトルの分割を行なう特徴ベクトル分割部と、前記特徴
ベクトルからパターン間距離を計算する距離計算部と、
前記ベクトル分割情報、パターン間距離をもとに分割特
徴ベクトルごとに学習パターンをクラスタリングするク
ラスタリング部と、前記クラスタリングの結果得られる
クラスタ中心を記憶するクラスタ中心記憶部と、各クラ
スタを構成するパターンを記憶するクラスタメンバ記憶
部と、前記クラスタリングの結果をもとに標準パターン
を作成する標準パターン作成部とを有して構成される。According to the present invention, there is provided a standard pattern creating apparatus comprising: a voice input unit for inputting voice; an analysis unit for analyzing input voice data and extracting a feature vector; A learning unit that learns a standard pattern from a feature vector, a learning pattern storage unit that stores a learned first standard pattern, a correlation calculation unit that calculates a degree of correlation between the feature vector elements, A feature vector dividing unit that calculates the strength of correlation between feature vector elements from and a feature vector, and a distance calculating unit that calculates an inter-pattern distance from the feature vector.
A clustering unit that clusters a learning pattern for each divided feature vector based on the vector division information and the distance between patterns; a cluster center storage unit that stores a cluster center obtained as a result of the clustering; and a pattern that forms each cluster. It comprises a cluster member storage unit for storing and a standard pattern creation unit for creating a standard pattern based on the result of the clustering.

【０００８】[0008]

【作用】本発明の標準パターン作成装置は、特徴ベクト
ル要素間の相関の強さを計算し、特徴ベクトルを分割
し、分割した分割特徴ベクトルごとにクラスタリングを
行なうことにより、クラスタ数を削減した標準パターン
を作成する。The standard pattern creating apparatus according to the present invention calculates the strength of correlation between feature vector elements, divides the feature vector, and performs clustering for each of the divided feature vectors to reduce the number of clusters. Create a pattern.

【０００９】図３、図４において、概念を簡単に説明す
る。図中のＸ１，Ｘ２，Ｙ１，Ｙ２は、特徴量軸、軸上
の分布は各軸を基準とした分布、Ｒ１〜Ｒ５は、クラス
タ中心番号、点線で囲まれた部分は各クラスタ中心によ
って被覆される特徴空間、実線で囲まれた部分は被覆さ
れるべき特徴空間である。The concept will be briefly described with reference to FIGS. In the figure, X1, X2, Y1, and Y2 are feature amount axes, distributions on the axes are distributions based on each axis, R1 to R5 are cluster center numbers, and portions surrounded by dotted lines are covered by cluster centers. The feature space to be covered, and the portion surrounded by the solid line is the feature space to be covered.

【００１０】図３、図４を見ると、各軸上での分布は等
しいものとなっている。しかし、図３の場合、特徴空間
はパラメーター間の相関が低いため、空間全体を覆うに
は多くの標準パターンを必要とする。それに対し、図４
に示すようにパラメーター間の相関が高い場合には、空
間全体を少ない標準パターンで被覆することができる。
このように、パラメーター間の相関が高いと、より少な
いパラメーターで空間全体を表現することができるた
め、効率よくパターン数を削減した標準パターンを得ら
れるのである。Referring to FIGS. 3 and 4, the distributions on each axis are equal. However, in the case of FIG. 3, since the feature space has a low correlation between parameters, many standard patterns are required to cover the entire space. In contrast, FIG.
When the correlation between the parameters is high as shown in (1), the entire space can be covered with a small number of standard patterns.
As described above, when the correlation between the parameters is high, the entire space can be expressed with fewer parameters, so that a standard pattern with a reduced number of patterns can be efficiently obtained.

【００１１】簡単な例において説明する。A simple example will be described.

【００１２】[0012]

【数１】 (Equation 1)

【００１３】の３つの要素を持つパラメーター、ｘ，
ｙ，ｚを仮定する。また、この３つのパラメーターの中
で、ｘとｙの２つのパラメーターは強い相関を持ち相関
関数が１であるが、ｘとｙ、ｙとｚは無相関であり相関
関数が０であるものとする。この条件において、ｘ，
ｙ，ｚの３パラメーターを２つの組みに分割する場合を
考える。A parameter having three elements x, x,
Suppose y, z. Among these three parameters, two parameters x and y have a strong correlation and a correlation function of 1, but x and y, and y and z have no correlation and a correlation function of 0. I do. Under these conditions, x,
Consider a case where three parameters of y and z are divided into two sets.

【００１４】最初にｘとｙをまとめたものと、ｚとの２
組に分割した場合を考える。ｘ，ｙは常に等しい値をと
るため、取り得る値は、［−１，−１］、［０，０］、
［１，１］の３通りである。ｚについても取り得る値
は、−１，０，１の３通りである。よって、ｘ，ｙとｚ
に分割した場合、記憶すべきパラメーター数は２×３＋
３＝９である。次に、ｘと、ｙ，ｚをまとめたものとの
２組に分割した場合を考える。ｘの取り得る値は、−
１，０，１の３通りである。ｙ，ｚをまとめた方は、
［−１，−１］、［−１，０］、［−１，１］、［０，
−１］、［０，０］、［０，１］、［１，−１］、
［１，０］、［１，１］の９通りの値を取る。よって、
ｘとｙ，ｚに分割した場合、記憶すべきパラメーター数
は３＋２×９＝２１である。この場合、相関の高いパラ
メーターをまとめることによって、９／２１のパラメー
ター数で空間全体を被覆できる。First, the sum of x and y, and 2 of z
Consider the case of dividing into sets. Since x and y always take the same value, possible values are [-1, -1], [0, 0],
[1, 1]. There are three possible values of z, -1, 0 and 1. Therefore, x, y and z
, The number of parameters to be stored is 2 × 3 +
3 = 9. Next, let us consider a case where the image is divided into two sets of x and a set of y and z. The possible value of x is-
1, 0, and 1. If you put together y and z,
[-1, -1,], [-1,0], [-1,1,], [0,
-1], [0,0], [0,1], [1, -1],
It takes nine values of [1,0] and [1,1]. Therefore,
When divided into x, y, and z, the number of parameters to be stored is 3 + 2 × 9 = 21. In this case, by gathering parameters having high correlation, the entire space can be covered with 9/21 parameter numbers.

【００１５】ここでは簡単な例について説明したが、パ
ラメーター数などが増加した場合も同様である。Although a simple example has been described here, the same applies when the number of parameters is increased.

【００１６】以上のように、パラメーター間の相関の強
さを考慮することによって、少ないパターン数でよりよ
い認識性能が得られる標準パターンを提供できる。As described above, by taking into account the strength of the correlation between parameters, it is possible to provide a standard pattern that can obtain better recognition performance with a small number of patterns.

【００１７】[0017]

【実施例】次に本発明による標準パターン作成装置につ
いて図面を用いて説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, a standard pattern forming apparatus according to the present invention will be described with reference to the drawings.

【００１８】図１は本発明の一実施例を示す構成図であ
る。音声入力部１０に音声が入力され、分析部２０に送
られる。送られた音声波形は、分析部２０において分析
され特徴ベクトルが抽出される。分析後の特徴ベクトル
の例としては、ＬＰＣメルケプストラム、Δメルケプス
トラム（”Ｓｐｅａｋｅｒ−ｉｎｄｅｐｅｎｄｅｎｔｉ
ｓｏｌａｔｅｄｗｏｒｄｒｅｃｏｇｎｉｔｉｏｎ
ｕｓｉｎｇｄｙｎａｍｉｃｆｅａｔｕｒｅｓｏｆ
ｓｐｅｅｃｈｓｐｅｃｔｒｕｍ，”ＩＥＥＥＴｒ
ａｎｓ．Ａｃｏｕｓｔ．，ＳｐｅｅｃｈＳｉｇｎａｌ
Ｐｒｏｃｅｓｓｉｎｇ，ｖｏｌ．ＡＳＳＰ−３４，ｐ
ｐ．５２−５９，１９８６．以下これを文献３とす
る）、Δ²メルケプストラム（”ＩｍｐｒｏｖｅｄＡ
ｃｏｕｓｔｉｃＭｏｄｅｌｉｎｇｗｉｔｈｔｈｅ
ＳＰＨＩＮＸＳｐｅｅｃｈＲｅｃｏｇｎｉｔｉｏ
ｎＳｙｓｔｅｍ，Ｘ．Ｄ．Ｈｕａｎｇ，Ｋ．Ｆ．Ｌｅ
ｅ，Ｈ．Ｗ．Ｈｏｎ，ａｎｄＭ．Ｙ．Ｈｗａｎｇ，Ｉ
ＣＡＳＳＰ９１，ｐｐ．３４５−３４８，１９９１、
以下これを文献４とする）などが挙げられる。FIG. 1 is a block diagram showing an embodiment of the present invention. A voice is input to the voice input unit 10 and sent to the analysis unit 20. The sent voice waveform is analyzed in the analysis unit 20 and a feature vector is extracted. Examples of the feature vector after the analysis include LPC mel-cepstrum and Δ-mel-cepstrum (“Speaker-independent”.
isolated word recognition
using dynamic features of
speech spectrum, "IEEE Tr
ans. Acoustic. , Speech Signal
Processing, vol. ASSP-34, p
p. 52-59, 1986. Hereinafter, this is referred to as Document 3), delta ² Mel cepstrum ( "Improved A
cosmetic Modeling with the
SPHINX Speech Recognition
n System, X. D. Huang, K .; F. Le
e, H .; W. Hon, and M.S. Y. Hwang, I
CASSP 91 pp. 345-348, 1991,
Hereinafter, this will be referred to as Reference 4.).

【００１９】抽出された特徴ベクトル列は、学習部３０
において標準パターンの学習に用いられる。学習方法は
認識手法に依存するが、例えば、パスコストＤＰ（渡
辺、木村、音響学会講演論文集、２−５−９、昭６２−
１０、以下これを文献５とする）ならば、文献５に述べ
られているように、標準パターンの各フレームでの平均
ベクトル及び統計的パスコストが計算される。The extracted feature vector sequence is sent to the learning unit 30
Is used for learning standard patterns. The learning method depends on the recognition method. For example, the path cost DP (Watanabe, Kimura, Proceedings of the Acoustical Society of Japan, 2-5-9, 1962)
10, hereinafter referred to as Reference 5), as described in Reference 5, the average vector and the statistical path cost in each frame of the standard pattern are calculated.

【００２０】以下、パスコストＤＰを例として説明す
る。Hereinafter, the path cost DP will be described as an example.

【００２１】学習されたパターンは、学習パターン記憶
部４０に入力される。次に、平均ベクトルThe learned pattern is input to a learning pattern storage unit 40. Next, the average vector

【００２２】[0022]

【数２】 (Equation 2)

【００２３】（ｊ＝１〜Ｊ：カテゴリー番号、ｎ＝１〜
Ｎ_j：カテゴリーｊの特徴ベクトル数、ｐ＝１〜Ｐ：特
徴ベクトルの次元数）が、相関度計算部５０に送られ
る。ここで、平均ベクトルの要素を(J = 1 to J: category number, n = 1 to
N _j : the number of feature vectors of the category j, p = 1 to P: the number of dimensions of the feature vector) are sent to the correlation degree calculation unit 50. Where the elements of the mean vector

【００２４】[0024]

【数３】 (Equation 3)

【００２５】（カテゴリーｊのｎ番目の特徴ベクトルの
ｐ次元目の要素）とする。(P-dimensional element of the n-th feature vector of category j).

【００２６】この相関度計算部５０について一実施例を
説明する。An embodiment of the correlation calculating section 50 will be described.

【００２７】最初に全学習パターンFirst, all learning patterns

【００２８】[0028]

【数４】 (Equation 4)

【００２９】にわたる特徴ベクトルの各パラメーターご
との平均値μ（ｐ）を求める。平均μ（ｐ）は、The average value μ (p) of each parameter of the feature vector over the range is determined. The average μ (p) is

【００３０】[0030]

【数５】 (Equation 5)

【００３１】で表される。## EQU1 ##

【００３２】次に、計算されたパラメーター平均値を用
い、各パラメーターごとの共分散行列σ（ｐ１，ｐ
２）、１＜ｐ１，ｐ２＜Ｐ（ｐ１，ｐ２は特徴ベクトル
のパラメーター番号）が計算される。Next, using the calculated parameter average value, the covariance matrix σ (p1, p
2) 1 <p1, p2 <P (p1, p2 are parameter numbers of the feature vector) are calculated.

【００３３】[0033]

【数６】 (Equation 6)

【００３４】次に、計算された共分散行列σ（ｐ１，ｐ
２）を用い、各パラメーター間の相関係数ρ（ｐ１，ｐ
２）が計算される。Next, the calculated covariance matrix σ (p1, p
2) and the correlation coefficient ρ (p1, p
2) is calculated.

【００３５】[0035]

【数７】 (Equation 7)

【００３６】相関度計算部５０で計算された相関係数
は、相関度記憶部６０に保持される。次に、特徴ベクト
ル分割部７０において、５０で計算された相関係数をも
とに、各相関係数間の行列式を計算し、パラメーターを
まとめていく。The correlation coefficient calculated by the correlation degree calculation section 50 is stored in the correlation degree storage section 60. Next, in the feature vector dividing unit 70, a determinant between the correlation coefficients is calculated based on the correlation coefficients calculated in 50, and parameters are collected.

【００３７】以下、特徴ベクトル分割部７０について説
明する。（１）最初に、各パラメーターが独立であるものと
し、各パラメーターが部分ベクトルであるようＰ個に分
割する。Hereinafter, the feature vector dividing section 70 will be described. (1) First, each parameter is assumed to be independent, and is divided into P pieces so that each parameter is a partial vector.

【００３８】ｒ＝ＰＴ（ｋ），（１≦ｋ≦Ｐ）（Ｔ（ｋ）はｋ番目の部分ベクトルの次元数）（ｋは、部分ベクトル番号）（２）次に、１≦ｋ，ｌ≦ｒ、ｋ≠ｌである部分ベク
トルｋ，ｌに属するパラメーターｐ１，ｐ２、（１≦ｐ
１，ｐ２≦（Ｔ（ｋ）＋Ｔ（ｌ）））の相関関数ρ（ｐ
１，ｐ２）を相関度記憶部６０から読みだし、（Ｔ
（ｋ）＋Ｔ（ｌ））×（Ｔ（ｋ）＋Ｔ（ｌ））の相関行
列Ｃを作成し行列式Ｄ（ｋ，ｌ）を求める。R = PT (k), (1≤k≤P) (T (k) is the number of dimensions of the k-th partial vector) (k is the partial vector number) (2) Next, 1≤k , L ≦ r, parameters p1, p2, (1 ≦ p
1, p2 ≦ (T (k) + T (l)) correlation function ρ (p
1, p2) is read from the correlation degree storage unit 60, and (T
A correlation matrix C of (k) + T (l)) × (T (k) + T (l)) is created, and a determinant D (k, l) is obtained.

【００３９】Ｄ（ｋ，ｌ）＝ｄｅｔ｜Ｃ｜（３）次に、最小のＤ（ｋ，ｌ）を与える部分ベクト
ルｋ，ｌを１つの部分ベクトルにまとめる。D (k, l) = det | C | (3) Next, the partial vectors k and l giving the minimum D (k, l) are combined into one partial vector.

【００４０】Ｔ（ｋ）＝Ｔ（Ｋ）＋Ｔ（ｌ）（ｋ＜ｌ）この時、新たな部分ベクトルの番号は、まとめられた２
つのうちの小さい方の番号とする。T (k) = T (K) + T (l) (k <l) At this time, the numbers of the new partial vectors are
The smaller of the two.

【００４１】次に、前記分割情報をもとに部分ベクトル
番号の付け直しが行なわれる。この段階で分割数は１減
少することになる。（４）次に、ｒ＝ｒ−１とし、ｒが予め定められるい
き値Ｋよりも大きければ（２）へ戻る。ｒ＝Ｋとなるま
でこの計算を行なう。Next, the partial vector numbers are renumbered based on the division information. At this stage, the number of divisions is reduced by one. (4) Next, r = r−1, and if r is larger than a predetermined threshold value K, the process returns to (2). This calculation is performed until r = K.

【００４２】最終的には、ｐ次元目の要素が属する部分
ベクトル番号ｐｖ（ｐ）とｋ番目の部分ベクトルの次元
数Ｔ（ｋ）が求められる。Finally, the partial vector number pv (p) to which the p-th element belongs and the number of dimensions T (k) of the k-th partial vector are obtained.

【００４３】以上の手続きは、相関の度合いとして特徴
ベクトルの共分散行列から計算された相関系列を例とし
たが、その他の計算方法も可能である。In the above procedure, the correlation sequence calculated from the covariance matrix of the feature vector as the degree of correlation has been described as an example, but other calculation methods are also possible.

【００４４】次に、各部分ベクトルごとに特徴ベクトル
のクラスタリングをクラスタリング部８０で行なう。Next, the clustering of the feature vectors is performed by the clustering section 80 for each partial vector.

【００４５】クラスタリングについては、ＬＢＧアルゴ
リズムを用いた方法（ＩＥＥＥＴｒａｎｓ．Ｃｏｍｍ
ｕｎ．，ＣＯＭ−２８，１ＰＰ．８４−９５，Ｊａｎ．
１９８０、以下これを文献６とする）などが知られてい
る。For clustering, a method using the LBG algorithm (IEEE Trans.
un. , COM-28, 1PP. 84-95, Jan.
1980, hereinafter referred to as Document 6).

【００４６】以下、クラスタリング部の一実施例を述べ
る。Hereinafter, an embodiment of the clustering unit will be described.

【００４７】制御部１２０より部分ベクトル番号ｋ（ｋ
＝１〜Ｋ）と、部分ベクトル番号ｋのクラスタ中心数Ｍ
_kが、クラスタリング部８０に順次送られる。クラスタ
リング部８０は、学習記憶部４０に蓄えられた平均ベク
トルThe controller 120 sends a partial vector number k (k
= 1 to K) and the cluster center number M of the partial vector number k
_k are sequentially sent to the clustering unit 80. The clustering unit 80 calculates the average vector stored in the learning storage unit 40.

【００４８】[0048]

【数８】 (Equation 8)

【００４９】の中からｐｖ（ｐ）＝ｋである要素ｐを抽
出し、Ｔ（ｋ）次元のベクトルとする。抽出されたＴ
（ｋ）次元のベクトルAn element p for which pv (p) = k is extracted from among them, and is set as a T (k) -dimensional vector. The extracted T
(K) dimensional vector

【００５０】[0050]

【数９】 (Equation 9)

【００５１】とする。次に、Assume that next,

【００５２】[0052]

【数１０】 (Equation 10)

【００５３】からＭ_k個のベクトルをクラスタ中心とし
て選択する。この選択方法としては、番号順にＭ_k個と
ってもよいし、ランダムに選んでもよい。選択されたＭ
_k個のクラスタ中心の値Then, M _k vectors are selected as the cluster centers. As this selection method, M _k numbers may be selected in numerical order or may be selected at random. M selected
_k cluster center values

【００５４】[0054]

【数１１】 [Equation 11]

【００５５】は距離計算分１１０に送られる。Is sent to the distance calculation 110.

【００５６】距離計算部１１０は、学習パターン記憶部
４０に蓄えられた各平均ベクトルThe distance calculation unit 110 calculates each average vector stored in the learning pattern storage unit 40.

【００５７】[0057]

【数１２】 (Equation 12)

【００５８】とクラスタリング部８０から送られたＭ_k
個の各クラスタ中心との距離And M _k sent from the clustering unit 80
Distance from each cluster center

【００５９】[0059]

【数１３】 (Equation 13)

【００６０】を計算しクラスタリング部８０に送る。Is calculated and sent to the clustering unit 80.

【００６１】距離については、パスコストＤＰではユー
クリッド距離が利用可能である。As for the distance, the Euclidean distance can be used in the path cost DP.

【００６２】クラスタリング部８０は、クラスタ中心の
値The clustering unit 80 calculates the value of the cluster center.

【００６３】[0063]

【数１４】 [Equation 14]

【００６４】をクラスタ中心記憶部１００に送り、クラ
スタ中心記憶部１００はこれを保持する。また、クラス
タリング部８０は、距離計算部１１０で計算されたＤｃ
ｌ（ｊ，ｎ，ｋ，ｈ）の中で最小値をとるクラスタ番号
ｈをｍｅｍｂｅｒ（ｊ，ｎ，ｋ）＝ｈ（１≦ｍｅｍｂｅ
ｒ（ｊ，ｎ，ｋ）≦Ｍ_k）とし、クラスタメンバ記憶部
９０に送る。ｍｅｍｂｅｒ（ｊ，ｎ，ｋ）はIs sent to the cluster center storage unit 100, and the cluster center storage unit 100 holds this. The clustering unit 80 calculates the Dc calculated by the distance calculation unit 110.
The cluster number h that takes the minimum value among l (j, n, k, h) is defined as member (j, n, k) = h (1 ≦ membe
r (j, n, k) ≦ M _k ) and sends it to the cluster member storage unit 90. member (j, n, k) is

【００６５】[0065]

【数１５】 (Equation 15)

【００６６】の各ベクトルが属するクラスタの番号を示
す。クラスタメンバ記憶部９０はこれを保持する。Indicates the number of the cluster to which each vector belongs. The cluster member storage unit 90 holds this.

【００６７】次に、クラスタリング部８０は、ｍｅｍｂ
ｅｒ（ｊ，ｎ，ｋ）をクラスタメンバ記憶部９０から読
みだし、クラスタ中心Next, the clustering unit 80
er (j, n, k) is read from the cluster member storage unit 90, and the cluster center

【００６８】[0068]

【数１６】 (Equation 16)

【００６９】を番号ｈのクラスタに属するBelongs to the cluster of number h

【００７０】[0070]

【数１７】 [Equation 17]

【００７１】の平均値を用いて更新する。以下、距離計
算部１１０で計算されるＤｃｌ（ｊ，ｎ，ｋ，ｈ）が収
束するまで上記手順を繰り返し、最終的なクラスタ中心
値Update using the average value of Hereinafter, the above procedure is repeated until Dcl (j, n, k, h) calculated by the distance calculation unit 110 converges, and the final cluster center value is obtained.

【００７２】[0072]

【数１８】 (Equation 18)

【００７３】をクラスタ中心記憶部１００に保持し、最
終的な各平均ベクトルの属するクラスタ番号ｍｅｍｂｅ
ｒ（ｊ，ｎ，ｋ）をクラスタメンバ記憶部９０に保持す
る。Is stored in the cluster center storage unit 100, and the cluster number membe to which each final average vector belongs
r (j, n, k) is stored in the cluster member storage unit 90.

【００７４】以上、部分ベクトル番号ｋにおけるクラス
タリングについて説明したが、この作業をｋ＝１〜Ｋに
ついて行なう。Although the clustering at the partial vector number k has been described above, this operation is performed for k = 1 to K.

【００７５】次に、クラスタメンバ記憶部９０に保持さ
れた各パターンの属するクラスタ番号ｍｅｍｂｅｒ
（ｊ，ｎ，ｋ）とクラスタ中心記憶部１００に保持され
たクラスタ中心値Next, the cluster number member to which each pattern belongs held in the cluster member storage unit 90
(J, n, k) and the cluster center value held in the cluster center storage unit 100

【００７６】[0076]

【数１９】 [Equation 19]

【００７７】の情報をもとに、学習パターン記憶部４０
に保持されている学習パターンを用い、パターン作成部
１３０において標準パターンを作成する。Based on the information of the learning pattern storage unit 40,
The standard pattern is created in the pattern creating unit 130 using the learning pattern stored in the standard pattern.

【００７８】まず、クラスタ中心記憶部１００に蓄えら
れているクラスタ中心値First, the cluster center value stored in the cluster center storage unit 100

【００７９】[0079]

【数２０】 (Equation 20)

【００８０】を読みだし、これを保持する。次に、クラ
スタメンバ記憶部９０からｍｅｍｂｅｒ（ｊ，ｎ，ｋ）
の値を読みだし、これを保持する。平均ベクトル作成の
ために記憶すべきものは、Is read out and held. Next, the member (j, n, k) is read from the cluster member storage unit 90.
Read out the value of and keep it. What should be remembered for creating the average vector is

【００８１】[0081]

【数２１】 (Equation 21)

【００８２】個のクラスタ中心の値とＮ×Ｋ個のｍｅｍ
ｂｅｒ（ｊ，ｎ，ｋ）の値となり、よりメモリー量の少
ない標準パターンを作成することができる。パスコストCluster center value and N × K mem
ber (j, n, k), and a standard pattern with a smaller amount of memory can be created. Path cost

【００８３】[0083]

【数２２】 (Equation 22)

【００８４】については、学習パターン記憶部４０に蓄
えられた値をそのまま使用し、上記で計算された平均ベ
クトルトと併せて１つのパターンとする。上記の例で
は、平均ベクトルのみをクラスタリングの対象とした
が、パスコストについてもクラスタリングの対象とする
こともできる。As for the value, the value stored in the learning pattern storage section 40 is used as it is, and one pattern is combined with the average vector calculated above. In the above example, only the average vector is targeted for clustering, but the path cost can also be targeted for clustering.

【００８５】ここで作成された標準パターンは、標準パ
ターン出力部１４０に送られ出力される。本手法の適用
はパスコストＤＰに限らない。例えば、連続ＨＭＭ（Ｂ
−Ｈ．Ｊｕａｎｇ，ＩＥＥＥＴｒａｎｓ．Ａｃｏｕｓ
ｔ．，Ｓｐｅｅｃｈ＆ＳｉｇｎａｌＰｒｏｃｅｓ
ｓ．，ＡＳＳＰ−３３，６，ｐｐ．１４０４−１４１
３，１９８５、以下これを文献４とする）の場合に、そ
の分布の平均ベクトル等をここで述べる方法によってク
ラスタリングすることが可能である。The standard pattern created here is sent to the standard pattern output unit 140 and output. The application of this method is not limited to the path cost DP. For example, a continuous HMM (B
-H. Juang, IEEE Trans. Acous
t. , Speech & Signal Processes
s. , ASSP-33,6, pp. 1404-141
3, 1985, hereafter referred to as Document 4), the average vector of the distribution and the like can be clustered by the method described here.

【００８６】作成された標準パターンを音声認識に用い
るには、例えば、ＳＰＬＩＴ法（菅村、古井、”擬音韻
標準パターンによる大語彙単語音声認識”、信学論、Ｊ
６５−Ｄ、８、ｐｐ１０１４−１０４８、昭５７、以下
これを文献７とする）が利用できる。上記で作成された
標準パターンを音声認識に用いた場合、メモリー量及び
計算量が少ない認識装置が実現できる。To use the created standard pattern for speech recognition, for example, the SPLIT method (Sugamura, Furui, "Large vocabulary word speech recognition using onomatopoeia standard pattern", IEICE
65-D, 8, pp 1014-1048, 1982, hereinafter referred to as Reference 7.). When the standard pattern created above is used for speech recognition, a recognition device with a small amount of memory and a small amount of calculation can be realized.

【００８７】[0087]

【発明の効果】本発明によれば、従来の標準パターン作
成装置よりもより少ないパターンで、より認識率の高い
標準パターンを作成可能な標準パターン作成装置が得ら
れる。According to the present invention, a standard pattern creating apparatus capable of creating a standard pattern having a higher recognition rate with fewer patterns than the conventional standard pattern creating apparatus can be obtained.

[Brief description of the drawings]

【図１】本発明による標準パターン作成装置の一実施例
を示すブロック図である。FIG. 1 is a block diagram showing an embodiment of a standard pattern creation device according to the present invention.

【図２】従来の標準パターン作成装置の一実施例を示す
ブロック図である。FIG. 2 is a block diagram showing one embodiment of a conventional standard pattern creation device.

【図３】特徴量間の相関の高低による被覆空間の相違を
示す図である。FIG. 3 is a diagram illustrating a difference in a covering space depending on a level of a correlation between feature amounts;

【図４】特徴量間の相関の高低による被覆空間の相違を
示す図である。FIG. 4 is a diagram illustrating a difference in a covering space depending on a level of a correlation between feature amounts;

[Explanation of symbols]

１０音声入力部２０分析部３０学習部４０学習パターン記憶部５０相関度計算部６０相関度記憶部７０特徴ベクトル分割部８０クラスタリング部９０クラスタメンバ記憶部１００クラスタ中心記憶部１１０距離計算部１２０制御部１３０パターン作成部１４０標準パターン出力部２００音声入力部２１０分析部２２０学習パターン記憶部２３０パワークラスタリング部２４０ＬＰＣパラメータークラスタリング部２５０パターン作成部２６０標準パターン出力部 Reference Signs List 10 voice input unit 20 analysis unit 30 learning unit 40 learning pattern storage unit 50 correlation degree calculation unit 60 correlation degree storage unit 70 feature vector division unit 80 clustering unit 90 cluster member storage unit 100 cluster center storage unit 110 distance calculation unit 120 control unit 130 pattern creation unit 140 standard pattern output unit 200 voice input unit 210 analysis unit 220 learning pattern storage unit 230 power clustering unit 240 LPC parameter clustering unit 250 pattern creation unit 260 standard pattern output unit

フロントページの続き (56)参考文献特開平４−363000（ＪＰ，Ａ) 特開平１−233499（ＪＰ，Ａ) 特開平５−119790（ＪＰ，Ａ) 特開平４−111189（ＪＰ，Ａ) 実開昭58−147062（ＪＰ，Ｕ) 特許2800618（ＪＰ，Ｂ２) 特公平６−7345（ＪＰ，Ｂ２) 特公平６−7344（ＪＰ，Ｂ２) ＩＥＥＥＴｒａｎｓａｃｔｉｏｎｓｏｎＣｏｍｍｕｎｉｃａｔｉｏｎｓＶｏｌ．ＣＯＭ−28，Ｎｏ．１，Ｊａｎｕａｒｙ 1980，”ＡｎＡｌｏｇｏｒｉｔｈｍｆｏｒＶｅｃｔｏｒＱｕａｎｔｉｚｅｒＤｅｓｉｇｎ”, ｐ．84−95 ＩＥＥＥＣｏｍｍｕｎｉｃａｔｉｏｎｓＭａｇａｚｉｎｅ，Ｖｏｌ．21, Ｎｏ．９，Ｄｅｃｅｍｂｅｒ 1983，" ＶｅｃｔｏｒＱｕａｎｔｉｚａｔｉｏｎ：ＡＰａｔｔｅｒｎ−ＭａｔｃｈｉｎｇＴｅｃｈｎｉｑｕｅｆｏｒＳｐｅｅｃｈＣｏｄｉｎｇ”，ｐ．15− 21 日本音響学会誌Ｖｏｌ．44，Ｎｏ. ８，1988，「セパレートベクトル量子化を用いたスペクトログラムの正規化」ｐ．595−602（昭和63年８月１日発行) 日本音響学会昭和62年度秋季研究発表会講演論文集２−５−16「ベクトル量子化を用いたスペクトログラムの正規化」ｐ．81−82（昭和62年８月10月) (58)調査した分野(Int.Cl.⁶，ＤＢ名) G10L 3/00 515 G10L 9/18 H03M 7/30 ＪＩＣＳＴファイル（ＪＯＩＳ)Continuation of the front page (56) References JP-A-4-363000 (JP, A) JP-A-1-233499 (JP, A) JP-A-5-119790 (JP, A) JP-A-4-111189 (JP) JP-A-58-147062 (JP, U) Patent 2800618 (JP, B2) JP 673345 (JP, B2) JP 6-7344 (JP, B2) IEEE Transactions on Communications Vol. COM-28, No. 1, January 1980, "An Analogism for Vector Quantifier Design", p. 84-95 IEEE Communication Magazines, Vol. 21, No. 9, December 1983, "Vector Quantitation: A Pattern-Matching Technique for Speech Coding", p. 15-21 Journal of the Acoustical Society of Japan, Vol. 44, No. 8, 1988, “Normalization of spectrogram using separate vector quantization” p. 595-602 (Published on August 1, 1988) Proceedings of the Fall Meeting of the Acoustical Society of Japan in 1987 2-5-16 "Spectrogram Normalization Using Vector Quantization" p. 81-82 (August 10, 1987) (58) Field surveyed (Int. Cl. ⁶ , DB name) G10L 3/00 515 G10L 9/18 H03M 7/30 JICST file (JOIS)

Claims

(57) [Claims]

A voice input unit for inputting voice, an analysis unit for analyzing input voice data and extracting a feature vector, a learning unit for learning a first standard pattern from the extracted feature vector, and a learning unit. A learning pattern storage unit that stores the obtained first standard pattern, a correlation degree calculation unit that calculates the degree of correlation between the feature vector elements, and calculates a correlation strength between the feature vector elements from the correlation degree. A feature vector dividing unit that divides a feature vector, a distance calculating unit that calculates a distance between patterns from the feature vector,
A clustering unit that clusters a learning pattern for each divided feature vector based on the vector division information and the inter-pattern distance; a cluster center storage unit that stores a cluster center obtained as a result of the clustering; and a pattern that forms each cluster. A standard pattern creation device, comprising: a cluster member storage unit for storing; and a standard pattern creation unit for creating a standard pattern based on the result of the clustering.