JPS61102697A

JPS61102697A - Pattern comparator

Info

Publication number: JPS61102697A
Application number: JP59224411A
Authority: JP
Inventors: 英一坪香
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1984-10-25
Filing date: 1984-10-25
Publication date: 1986-05-21
Anticipated expiration: 2009-01-26
Also published as: JPH067354B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】産業上の利用分野本発明は、特徴ベクトルの系列で表された複数種類の標
準パターンと入力パターンとの比較を行い、入力パター
ンの識別を行うパターン比較装置に関し、特に連続して
発声した単語音声の認識などに適用可能なパターン比較
装置に関する。DETAILED DESCRIPTION OF THE INVENTION Field of Industrial Application The present invention relates to a pattern comparison device that compares an input pattern with a plurality of types of standard patterns represented by a series of feature vectors and identifies the input pattern. The present invention relates to a pattern comparison device that can be applied to recognition of continuously uttered word sounds.

従来例の構成とその問題点人間にとって最も自然な情報発生手段である音声が、人
間−機械系の入力手段として使用できれば、その効果は
非常に大きい。その場合、音声認識装置としては、より
自然な発声で認識できる条件として、連続して発声した
音声の認識ができることが望ましい。Conventional configuration and its problems If voice, which is the most natural means of generating information for humans, could be used as an input means for a human-machine system, the effect would be very large. In this case, it is desirable for the speech recognition device to be able to recognize continuously uttered speech as a condition for recognizing more natural speech.

連続して発声した単語音声の認識に有効なパターン比較
装置として、動的計画法（以下ＤＰという）を２段回用
いたいわゆる２段ＤＰ法を用いたパターン比較装置が既
に実用化されている。A pattern comparison device using the so-called two-stage DP method, which uses dynamic programming (hereinafter referred to as DP) two times, has already been put into practical use as a pattern comparison device that is effective in recognizing continuously uttered word sounds. .

しかし、この２段ＤＰ法は重複した計算が多く計算量が
膨大で、従ってこの方法を用いた実時間処理が可能なパ
ターン比較装置は、極めて高速の処理が要求され、装置
も複雑となり、高価なものとならざるを得ない。However, this two-stage DP method involves many duplicate calculations and requires a huge amount of calculation. Therefore, a pattern comparison device that can perform real-time processing using this method is required to perform extremely high-speed processing, making the device complex and expensive. It has to become something.

以上の欠点を除去するために、大幅に計算量の少いパタ
ーン比較装置として、Ｏ（ｎＪ　Ｄ　Ｐ法、ＲＣＷＤＰ
法等を用いたパターン比較装置が提案されている。しか
し、これらの方法をハードウェア的に実現するためには
、高速にアクセス可能な大容量のメモリを必要とする欠
点がある。In order to eliminate the above drawbacks, O(nJ D P method, RCWDP method,
A pattern comparison device using a method has been proposed. However, implementing these methods in hardware requires a large capacity memory that can be accessed at high speed.

本発明は、これらの欠点を除去したパターンマツチング
装置を提供するものである。○（ｎ）ＤＰ法を例にこれ
を説明する。本発明を説明するに先立って次にＯ（ｎＪ
　Ｄ　Ｐ法について先ず説明する。The present invention provides a pattern matching device that eliminates these drawbacks. This will be explained using the (n)DP method as an example. Before explaining the present invention, O(nJ
First, the DP method will be explained.

本発明のパターン比較装置は、種々の入力パターンの認
識に用いることができるが、以下、連続して発声される
連続単語音声を例に説明する。The pattern comparison device of the present invention can be used to recognize various input patterns, and will be described below using continuous word speech that is continuously uttered as an example.

人間により発声される音声は人によりまた時により変化
し、基準となる標準パターンに対し時間的に非線形に伸
縮したものとなっている。この非線形に伸縮している入
力パターンと標準パターンとを比較し入力音声の認識を
行うためには、入力パターント標準パターンの各特徴ベ
クトルの対応付けを非線形に行い、入力パターンがどの
標準パターンと最も類似しているかを計算する必要かあ
る。しかしこの入力音声は非線形に伸縮するとはいって
も異常に長く伸びたり、短くなったシすることはない。The voices uttered by humans change from person to person and from time to time, and are non-linearly expanded and contracted in time with respect to a standard pattern that serves as a reference. In order to recognize the input speech by comparing this non-linearly expanded/contracted input pattern with the standard pattern, we need to non-linearly associate each feature vector of the input pattern with the standard pattern. Do you need to calculate which is the most similar? However, even though this input voice expands and contracts non-linearly, it does not become abnormally long or short.

このような入力パターンの物理的な特徴に注目し、入力
パターンと標準パターンを比較する際には無制限にすべ
ての可能性について比較するのではなぐ、入力パターン
の物理的な性質により定まる妥当と考えられる範囲につ
いて比較を行うようにする。Focusing on the physical characteristics of such input patterns, when comparing input patterns and standard patterns, rather than comparing all possibilities without limit, we consider the validity determined by the physical properties of the input patterns. Comparisons should be made within the range that can be used.

入力音声信号はパターン比較装置において、周　　　　
　゛波数分析、ＬＰＧ分析、ＰＡＲＣＯＲ分析、相関□
分析等により、いくつかの数値の組（特徴ベクトル）の
系列に変換され、この人力バター／の特徴ベクトルと比
較の対象となる標準パターンの特徴ベクトルとが各ベク
トル毎に比較される。この各ベクトル毎の比較値、すな
わちベクトル間の距離を合計した累積距離というものを
パターンの類似の尺度に用いる。この累積距離を計算す
る場合、各ベクトル毎の比較をすべての組み合わせにつ
いて行うのは計算量が膨大となり、パターン比較装置と
して実用化することができない。The input audio signal is
゛Wavenumber analysis, LPG analysis, PARCOR analysis, correlation□
Through analysis or the like, it is converted into a series of several sets of numerical values (feature vectors), and each vector is compared with the feature vector of the standard pattern to be compared. This comparison value for each vector, that is, the cumulative distance, which is the sum of distances between vectors, is used as a measure of pattern similarity. When calculating this cumulative distance, comparing each vector for all combinations requires an enormous amount of calculation, and cannot be put to practical use as a pattern comparison device.

、入力パターンを一方の軸に、標準パターンを他方の軸
とする平面（以下、ｉ　−ｉ平面という）を考えると、
入力パターンおよび標準パターンの各ベクトルの組み合
わせというのは、ｉ　−ｊ平面上の各格子点（以下、単
に点という）により示すことができる。従って前記あら
ゆる組み合わせについて各ベクトル間の距離を計算する
とは、各点におけるベクトル間の距離を計算することで
あり、累積距離を計算するとは、入力パターンの特徴ベ
クトルと、それに対応する標準パターンの特徴ベクトル
のベクトル間距離を順次計算し合計していくことである
。この累積距離を計算する過程で選択された、入力パタ
ーンと標準パターンの特徴ベクトルの対応、すなわち点
列を径路という。, considering a plane (hereinafter referred to as i-i plane) with the input pattern on one axis and the standard pattern on the other axis,
The combination of each vector of the input pattern and the standard pattern can be represented by each grid point (hereinafter simply referred to as a point) on the i-j plane. Therefore, calculating the distance between each vector for all the above combinations means calculating the distance between vectors at each point, and calculating the cumulative distance means calculating the feature vector of the input pattern and the feature of the standard pattern corresponding to it. This involves sequentially calculating and summing the distances between vectors. The correspondence between the feature vectors of the input pattern and the standard pattern, that is, the sequence of points selected in the process of calculating this cumulative distance, is called a path.

前記した入力パターンの物理的な性質を考慮して比較の
範囲を限定するということは、径路の選択に拘束条件を
設けるということである。Limiting the range of comparison in consideration of the physical properties of the input patterns described above means setting constraints on route selection.

ここで、以後の説明において用いる用語および記号につ
いて説明する。Here, terms and symbols used in the following description will be explained.

Ａ：入力パターン（Ａ＝ａ１．ａ２．・−、ａｉ、・−
・、ａ工）、ａｉは第ｉフレームの特徴ベクトル、Ｉは入力パターンのフレーム数Ｒｎ：第ｎ標準パターン（Ｒｎ＝ｂ？、］ｌＦ２．・・
・、ｂト・・・、ｂ７ｎ）ｂ９は第ｎ標準パターンの第
ｊフレームの】特徴ベクトル ■０は第ｎ標準パターンのフレーム数、Ｎを標準パター
ンの総数とするとき１≦ｎ≦Ｎ ”（’＊　ｊ）：　第ｎ標準パターンの第ｊフレーム。A: Input pattern (A=a1.a2.・-, ai,・−
・, a), ai is the feature vector of the i-th frame, I is the number of frames of the input pattern Rn: n-th standard pattern (Rn=b?, ]lF2...
・, bt..., b7n) b9 is the j-th frame of the n-th standard pattern] Feature vector ■0 is the number of frames of the n-th standard pattern, where N is the total number of standard patterns, 1≦n≦N” ('* j): j-th frame of the n-th standard pattern.

の特徴ベクトルｂｎ　と入力パターンの８ｇｉフレーム
の特徴ベクトルａ□とのベクトル間距離Ｄ（ｉ）：第１〜第ｉフレームまでの入力パターンと、
各標準パターンの最適な組み合わせの結合パターンとの
パターン間の距離（以下、終端累積距離という）Ｎ（ｉ）：第１〜第１フレームまでの入力バター、ンに
対する各標準パターンの最適な組み合わせの結合パター
／を求めたときの当該結合パターンを構成する最後尾標
準パターンを示す番号（以下、最後尾標準パターン名）
Ｂ（ｉ）：Ｎ（ｉ）の始点フレームの１つ手前のフレー
ムを示す番号（以下、バックポインタという）”　（’
　＋　］　）　：入入力バターの第１〜第１フレームま
での部分パターンとＲｎの第１〜第ｉ′　フレームまで
の部分パターンのパターン間の距離（以下、部分累積距
離Ｄ？’　（ｉ　、ｊ　）という）と、Ｄ（ｉ’−１）
との和のｉ′についての最小値（以下、中間累積距離と
いう）Ｂ″（ｉ＋ＪＬ部分累積距離Ｄ７ｚ（ｉ、ｊ）と
Ｄ（ｉ’−１）との和を最小にするｉ′をｓｌ、すなわ
ちｉ’＝ａｒｇｍｉｎ　（Ｄ（ｉ’−１）＋Ｍ／　（ｉ
　、ｊ　）〕とするとき、当該ｉｉ′レームの１つ手前
の゛フレームを示す番号（以下、中間バックポインタと
いう）ただし、ａｒｇｍｉｎ　Ｏは０内の値を×について最小
化したときの×の値を示す。Intervector distance D(i) between the feature vector bn of the input pattern and the feature vector a□ of the 8gi frame of the input pattern: the input pattern from the first to the i-th frame,
Distance between patterns with the combined pattern of the optimal combination of each standard pattern (hereinafter referred to as the terminal cumulative distance) N(i): Distance between the optimal combination of each standard pattern with respect to the input butter and Number indicating the last standard pattern that constitutes the combined pattern when the combined pattern / is calculated (hereinafter referred to as the last standard pattern name)
B(i): Number indicating the frame one frame before the starting point frame of N(i) (hereinafter referred to as back pointer)"('
+]) : Distance between the partial pattern from the 1st frame to the 1st frame of the input input butter and the partial pattern from the 1st frame to the i'th frame of Rn (hereinafter, partial cumulative distance D?' (i, j ) and D(i'-1)
The minimum value for i' of the sum of (hereinafter referred to as intermediate cumulative distance) , i.e. i'=argmin (D(i'-1)+M/ (i
. shows.

Ｄ”（ｉ）：　ｊ　＝Ｉ　ｎのときの中間累積距離であ
り、Ｄ”（ｉ）＝Ｄ”　（ｉ　、　Ｉ　”　）　’ｔ’
　６７）。D"(i): is the intermediate cumulative distance when j = I n, and D"(i) = D" (i, I") 't'
67).

Ｂ”（ｉ）：　ｎ＝１　”のときの中間バックポインタ
であり、Ｂ”（ｉ）＝Ｂ”（ｉ、Ｊ”）　である。B"(i): This is the intermediate back pointer when n=1", and B"(i)=B"(i, J").

０　（ｎ）　Ｄ　Ｐ法は、入力パターンが第１０フレー
ムで終了すると仮定した場合、最後尾パターンをＨｎと
したときの中間累積距離Ｄ　”　（ｚ　ｏ　）を求める
のに、ｉ′を始端フレームｌ　’Ｏを終端フレームとす
る入力パターンの部分パターンＡ（ｉ’−１、ｉｏ）と
標準パターンＲｎとのＤＰマツチングを始端点自由、終
端点固定として行うものであって、始端点ｉ′における
中間累積距離の初期値Ｄｎ（ｉ′、１）と中間バックポ
インタのｍ直Ｂ”（ｉ’　、ｏ）をＤ”（ｉ’　、１　
）＝Ｄ（ｉ’　−１）Ｉｄ”（ｉ　、１　）Ｂ”（ｉ’
　、０）＝Ｅ”（ｉ’　−１）とすることによって、第
１〜第１フレームまでのＤＰマツチングの続きとして部
分パターンＡ（ｉ’−１゜１０）ト標準パターンＨｎの
ＤＰマツチングを行うものである。このようにすること
によって、例えば、第１図に示すようなマツチング径路
に対する拘束条件のもとでは、取り得るマツチング径路
は第２図のＰの領域内に制限され、Ｄｎ（ｉｏ）を求め
るために必要とされるｄ″（’＋１）＋”（ｉ＋ｔ）の
計算は領域Ｐ内の各点についてそれぞれ１回行うのみで
よい。第２図において横軸は入力パターン、縦軸は最後
尾標準パターンＨｎである。領域Ｐは傾きＨの直線Ｐ１
　　と傾き２の直線２とで囲まれた領域となっている。0 (n) D P method assumes that the input pattern ends at the 10th frame, and when the last pattern is Hn, the intermediate cumulative distance D '' (z o ) is calculated by setting i' to the starting frame. DP matching is performed between a partial pattern A (i'-1, io) of an input pattern whose end frame is l'O and a standard pattern Rn with a free start point and a fixed end point. The initial value Dn (i', 1) of the intermediate cumulative distance and the m direction B''(i', o) of the intermediate back pointer are expressed as D''(i', 1).
)=D(i'-1)Id"(i,1)B"(i'
, 0)=E"(i'-1), DP matching of partial pattern A(i'-1°10) and standard pattern Hn is performed as a continuation of DP matching from the first frame to the first frame. By doing this, for example, under the constraint conditions for matching paths as shown in FIG. 1, the possible matching paths are limited to the region P in FIG. ) The calculation of d''('+1)+''(i+t) needed to calculate d''('+1)+''(i+t) only needs to be performed once for each point in the area P.In Fig. 2, the horizontal axis is the input pattern, and the vertical axis is the input pattern. is the last standard pattern Hn.A region P is a straight line P1 with a slope H
This is an area surrounded by a straight line 2 with a slope of 2.

”（’＋１）を求めるには、第１図から明らかなように
Ｄ″（ｉ−２，ｊ−１）。To find ``('+1), D''(i-2, j-1), as is clear from FIG.

Ｄ”（ｉ−１１ｊ−１）ＩＤ”（ｆ−１１ｊ−２）Ｉｄ
”（ｉ−１ｔｊ）１ｄ”（ｔ、＋）のみわかっていれば
よいから、第１フレーム上の中間累積距離Ｄ”（ｉｏ、
ｊ）（ただしり＝１．２．・・・、Ｉｎ）を求めるには
、第１０フレーム、第ｉ−２クレーム上の中間累積距離
Ｄ”（ｉ−１゜ｊ）ｔＤ”（ｉ−２１ｊ）　お！び第１
−１７ｖ−ム、第１フレーム上のベクトル間距離”（’
−’＋１）＋”（’　ｒ　３　）　（タタＬ　ｓ＝１．
２　、＋＋＋、　Ｊ”）　ヲ記憶シテおくのみでよい。D”(i-11j-1)ID”(f-11j-2)Id
Since it is only necessary to know “(i-1tj)1d” (t, +), the intermediate cumulative distance D” (io,
j) (button = 1.2..., In), the intermediate cumulative distance D''(i-1゜j)tD''(i-21j ) oh! and 1st
−17v−m, distance between vectors on the first frame”('
−'+1)+”(' r 3 ) (Tata L s=1.
2, +++, J”) Just store it in your memory.

このとき、Ｄ　”　（ｉｏ　）はＤ　ｎ（ｚ　ｏ　）＝Ｄ　ｎ（ｉｏ　、Ｊ　”　）とし
て求めることができる。At this time, D '' (io) can be obtained as D n (zo) = D n (io, J '').

以上のように、入力パターンのフレーム１が１フレーム
進む毎に、そのフレーム上の中間累積圧１ｉｆＩＤ”（
ｉ　＋　ｉ）（タタＬ、　ｊ−＝１．２．・＋１”；ｎ
＝１．２゜・・・、Ｎ）を１フレーム前と２フレーム前
の中間累積距離Ｄ”（ｉ−１＋ｉ）、Ｄ”（ｉ−２，ｊ
）、！：１７ｖ−ム前および当該フレームのベクトル間
距離ｄ”（ｎ＝１．ｊ）、ｄ″（ｉ＋ｉ）　　ＣｆｔＪ
ｒ：Ｉ、ｎ＝１．２．・・。As described above, each time frame 1 of the input pattern advances by one frame, the intermediate cumulative pressure 1ifID"(
i + i) (Tata L, j-=1.2.・+1"; n
= 1.2°..., N) is the intermediate cumulative distance D"(i-1+i), D"(i-2,j
),! :17v-distance between vectors before the frame and in the current frame d'' (n=1.j), d'' (i+i) CftJ
r:I, n=1.2. ....

Ｊ”；ｎ＝＝１．２．・ＩＮ）から求め、Ｄ（ｉ）’＝
＝ｓ＋ｍ　（Ｄ”　（ｉ　。J''; n==1.2.・IN), D(i)'=
=s+m(D”(i.

Ｊｏ）〕として第１フレームまでの終端累積距離を求め
ることができる。このようにして求められた”（’＋ｔ
）（ｆｃだしｌ＝’　＋　２＊”’＊Ｉ”＋　”＝１ｅ
　２ｒ・・’＋Ｎ）は必要がなくなるまで、すなわち次
の７レームないしその次のフレームにおけるＤ”（ｉ、
ｊ）の計算終了まで記憶される。Jo)], the final cumulative distance to the first frame can be determined. It was obtained in this way” ('+t
) (fc, l=' + 2*"'*I"+ "=1e
2r...'+N) until it is no longer needed, that is, D''(i,
It is stored until the calculation of j) is completed.

また、Ｄ　（ｉ）に対するバックポインタＣＤ（ｉ）に
対する始端点から１を差し引いた値）Ｂ（ｉ）は次のよ
うにして求まる。Further, the back pointer for D(i) (the value obtained by subtracting 1 from the starting point for CD(i)) B(i) is determined as follows.

Ｄ”（ｉ、ｊ）に対する中間バックポインタをＢ”（’
Ｉ］）とするとき、１）　Ｄｎ（ｉｔｉ）＝Ｄ”（ｉ−２＋１−１）＋ｄ”
（ｉ−１＋ｉ）＋ｄ”（ｉｔｉ）のときはＢ”（’　＋　３）＝””（’−２＋　１−１　）２）
　”（’　＋　＋　）＝”（’−’　Ｉ　］−１）”（
’　＋　］）　ノド＠は　　Ｂ”（’＋１）＝Ｂ”（’
−１１］−１）３）　Ｄ”（ｔ＋　＋）＝Ｄ”Ｄ−１，
ｊ−２）＋ｄｎ（ｉ　ｌ　Ｄ　ｏトキは　　Ｂｎ（ｉ、
ｊ）＝Ｂｎ（ｉ−１，５−２）とおくととによシとすればＢ（ｉ）＝Ｂ合（ｉ、１合）となる。従って、Ｂ”（ｔ、＋）についても、１−フレ
ーム前と２フレーム前のものを覚えておく０なお、第３
図のような径路の場合は、”（’＋＋）。Set the intermediate back pointer for D"(i, j) to B"('
I]), 1) Dn(iti)=D"(i-2+1-1)+d"
(i-1+i)+d"(iti) then B"('+3)=""('-2+1-1)2)
”(' + + )=”('-' I ]-1)”(
' + ]) Nodo @ is B"('+1) = B"('
−11]−1)3) D”(t+ +)=D”D−1,
j-2)+dn(i l Dotoki is Bn(i,
j) = Bn (i-1, 5-2), then B(i) = B combination (i, 1 combination). Therefore, for B''(t, +), remember the one frame before and the one before two frames.
In the case of a route like the one shown in the figure, "('++)".

Ｂ”（’ｌ］）は１フレーム前の値を覚えておくだけで
よい。B''('l]) only needs to remember the value of one frame before.

以上説明した原理を用いたパターン比較装置の従来例に
ついて説明する。第４図は以上の原理に基づくパターン
比較装置を連続単語認識に適用した場合の従来例を示す
ブロック図である。図において、Ｉｎは゛音声信号の入
力端子、１はフィルタバンク等で構成された特徴抽出部
であって、入力音声信号を特徴ベクトルａ、の系列Ａに
変換する。A conventional example of a pattern comparison device using the principle explained above will be explained. FIG. 4 is a block diagram showing a conventional example in which a pattern comparison device based on the above principle is applied to continuous word recognition. In the figure, In is an input terminal for an audio signal, and 1 is a feature extraction unit composed of a filter bank, etc., which converts the input audio signal into a series A of feature vectors a.

２は単語標準パターン記憶部であって、認識語粟たるＮ
個の単語がそれぞれ標準パターンＲ”＝ｂ〒。2 is a word standard pattern storage unit, which stores the recognized word N
Each word is a standard pattern R''=b〒.

・・・ｌ　ｂ’ｊ’　ｌ・・・、ｂ’ｉ’ｎ　、　（１
＜ｎ＜Ｎ）　　として特徴ベクトルの形で予め登録され
ている。３はベクトル間距離計算部であって、入力パタ
ーンの第ｉフレームにおける特徴ベクトルａ、とｎ°番
目の単語標準パターンＨＨの特徴ベクトルｂ、との距離
ｄ”（ｉ、ｉ）を、５＝１．２．・・・、Ｉｎ　　につ
いて求め、必要がなくなるまで記憶する。本例において
は中間累積距離　　　　　１を計算しているフレームの
１つ前のフレームおよ　　　　　１′び当該フレームの
ベクトル間距離を当該フレームの中間累積距離を計算す
るまで記憶する。”（ｉｌｌ）は、例えばａ、とｂｊ　
の市街距離として定義できる。すなわち、ベクトルｂ次
元を１とし、”ｉ　：（ａ　ｉｌ　　＋”ｉ２””ｉ　
　ｌ）　　ｌ　ｂ　ｊ＝（ｂ　ｊｌ　　＋”ｊ２＋・・
・、ｂ〒１）とするときとなる。...l b'j' l..., b'i'n, (1
<n<N) and is registered in advance in the form of a feature vector. 3 is an inter-vector distance calculation unit, which calculates the distance d'' (i, i) between the feature vector a in the i-th frame of the input pattern and the feature vector b of the n°-th word standard pattern HH, 5= 1.2..., In, and store it until it is no longer needed.In this example, the distance between the vectors of the frame immediately before the frame for which the intermediate cumulative distance 1 is being calculated, 1', and the frame concerned. "(ill) is stored until the intermediate cumulative distance of the frame is calculated." (ill) is, for example, a, and bj
It can be defined as the city distance of In other words, let the vector b dimension be 1, and “i:(a il +”i2””i
l) l b j=(b jl +”j2+...
・, b〒1).

４は累積距離計算部であって、第ｉフレームについて中
間累積距離”（’Ｉ））、終端累積距離Ｄ（ｉ）。4 is a cumulative distance calculation unit which calculates an intermediate cumulative distance "('I))" and a terminal cumulative distance D(i) for the i-th frame.

中間バンクポインタＢｎ（１，ｊ）、バックポインタＢ
　（ｉ）を１　＝１＋　２＋”’＋　Ｊ　ｎ：　１ｌ＝
＝１ｔ　２　、・・・＋　Ｎについて求め、最後尾の単
語を示すＮ（ｉ）を求める。第１図に示したマツチング
径路の拘束条件が採用される・・・・・・・・・０）初期条件、Ｄ”（ｉ、１）＝Ｄ（ｉ−１）＋ｄ”（ｉ、
１）ＤＣｉ）＝ｍｉｎ　（Ｄ”　（ｉ　、　Ｉ　”　）
）ｇ”（ｉ、ｊ）、Ｂ（ｔ）は次ｏ式から求するｏＢｎ
（ｉｔｉ）はＢｎ（ｉ、０）＝Ｂｎ（ｉ−１）ヲ初期条件トシテ１　
）　”（’　＋　］　）＝”（’　２＊　＋−’　）＋
ｄ”（’　ｌ　］　）＋ｄｎ（ｉ−１，０のときははＢ”（ｉｌｌ）＝Ｂ”（ｔ−１＋１−２）　　　　　−
・・−−−−−−（４）として求まり、Ｂ　（ｉ）は式
（１）を満足する単語番号をｎとすればＢ（ｉ）＝Ｂ”（ｉ　ｌ　１”）　　　　　　　−−・
・−・・・（６）となる。また、Ｎ（ｉ）＝ｎ　　であ
る０以上のようにして求められた終端累積距離Ｄ（ｉ）
＝Ｄｎ（ｉ、Ｊ”）は終端ＪＲｉ距離記憶部ｓＫ、バッ
クポインタＢ（ｉ）＝Ｂｎ（ｉ、Ｔ”）はバックポイン
タ記憶部６に、最後尾単語番号Ｎ（ｉ）＝ｎは最後尾単
語記憶部７に記憶される。Intermediate bank pointer Bn (1, j), back pointer B
(i) as 1 = 1+ 2+”'+ J n: 1l=
=1t 2 , . . . + N is determined, and N(i) indicating the last word is determined. The matching path constraint conditions shown in Fig. 1 are adopted...0) Initial conditions, D"(i, 1) = D(i-1) + d"(i,
1) DCi)=min (D” (i, I”)
)g''(i, j), B(t) is oBn obtained from the following o formula
(iti) is Bn(i, 0) = Bn(i-1), the initial condition is 1
) ”('+])=”('2*+-')+
d"('l])+dn(i-1,0, then B"(ill)=B"(t-1+1-2) -
...----- (4), and B (i) is determined as n if the word number that satisfies equation (1) is B (i) = B" (i l 1") ---
...(6). Also, the terminal cumulative distance D(i) obtained as 0 or more where N(i) = n
= Dn (i, J") is the end JRi distance storage sK, back pointer B (i) = Bn (i, T") is stored in the back pointer storage 6, last word number N (i) = n is the last It is stored in the tail word storage section 7.

なおり”（ｉ＋ｉ）、Ｂｎ（ｉ＋ｉ）（ただしｉ＝１．
２゜・・・、Ｔ”；ｎ＝１．２．・・・、Ｎ）は必要が
なくなるまで、累積距離計算部１４に一時的に記憶され
る。本実施例においては中間累積距離を計算しているフ
レームの１つ前および２つ前のフレームの中間累積距離
を当該フレームの中間累積距離を計算するまで記憶する
。Naori” (i+i), Bn(i+i) (however, i=1.
2゜..., T''; n = 1.2..., N) are temporarily stored in the cumulative distance calculation unit 14 until they are no longer needed. The intermediate cumulative distances of the frames immediately before and two frames before the current frame are stored until the intermediate cumulative distance of the frame is calculated.

また終端累積距離記憶部６に記憶される終端累積距離Ｄ
（ｉ）は、式（１）の初期条件を求めるために必要７ｚ
　モロＤＹｌ’６　リ、Ｄ　（ｉＪＫ　ツイテハＤ”　
（ｉ　＋１．１　）を求めるまで記憶されておればよい
。Also, the terminal cumulative distance D stored in the terminal cumulative distance storage unit 6
(i) is necessary to find the initial condition of equation (1)7z
Moro DYl'6 Ri, D (iJK Tsuiteha D"
It suffices if it is stored until (i +1.1) is found.

８は音声区間検出部であって、入力信号の大きさ等から
音声区間を判定するものである。音声区間検出部８が、
音声入力が開始されたことを検出するとフレーム数計数
部９はフレーム毎に計数をはじめる。前記の処理は第ｉ
フレームについての処理であったが、このフレーム数計
数部９の計数値がすなわちこのｉを設定している。従っ
て、前記と同様の処理が、フレームが１進む毎に行われ
ることになる。フレーム数計数部９は音声区間が検出さ
れると計数を始め、音声区間が終了するとリセットされ
る。最後尾単語記憶部７．バックポインタ記憶部６には
、従って、Ｎ　（ｉ）　、　Ｂ　（ｉ）がｉ；１．２．
・・・、１について記憶されることになる。Reference numeral 8 denotes a voice section detecting section, which determines a voice section from the magnitude of the input signal and the like. The voice section detection unit 8
When it is detected that audio input has started, the frame number counting section 9 starts counting each frame. The above process is
Although the processing was performed on frames, the count value of the frame number counting unit 9 sets i. Therefore, the same processing as described above is performed every time the frame advances by one. The frame number counting section 9 starts counting when a voice section is detected, and is reset when the voice section ends. Last word storage unit 7. Therefore, in the back pointer storage unit 6, N (i) and B (i) are i;1.2.
..., 1 will be stored.

セグメンテーション部１０はバックポインタ記憶部６に
対し、所定のバックポインタを読出すべき命令を発する
ものである。すなわち、セグメンテーション部１０がｉ
なる値をバックポインタ記憶部６に発すると、バックポ
インタ記憶部ｅからはバックポインタＢ　（ｉ）が読出
される。セグメンテーション部１０はバックポインタ記
憶部６からＢ　（ｉ）なる値を受は取ると、その同じ値
をバックポインタ記憶部６に発する。従って、音声区間
検出部８が音声入力の終了上検知すると、フレーム数計
数部９の最終値工がセグメンテーション部１゜に供給さ
れ、セグメンテーション部１Ｑは先ず！なる値をバック
ポインタ記憶部ｅに発する。以後、前記説明の動作に従
って、バックポインタ記憶部らから、Ｂ（Ｉ）　、　Ｂ
　（Ｂ（Ｉ）　）　、Ｂ（Ｂ（Ｂ（Ｉ）　）　）　、・
・・、０なる出力が順次得られることになる。これらの
値は最後から２番目の単語の終シのフレーム、同３番目
の終りのフレーム、同４番目の終りのフレーム、・・・
・・・というものであり、Ｎ（ｉ）はｉフレームで終る
単語であったから、この値をそのまま最後尾単語記憶部
７に与えると、最後の単語から逆の順序で認識結果が得
られる。なお認識結果が逆の順序で得られないようにす
るためには、この順序の変換をバックポインタ記憶部６
の出力に対して行うか最後尾単語記憶部７の出力に対し
て行えばよい０本従来例装置における最終フレームエまでの総計算回数
は、ｉ−ｊ空間の各点におけるベクトル間距離ｄ（’＋
１）＋中間累積距離”（’＋Ｉ）　　の計算を各点毎に
ただ１回行うのみなので平均クレーム数を１とするとい
ずれもＮＩｌとなる。従来の２段ＤＰ法を用いた装置に
おける総計算回数Ｎｌ−！１２　　と比べると計算回数
は１／（−Ｔ）になる。いま７レーム長が１０　ｍ５ｅ
ｃで単語長が平均０ｎ５ｓｅｃであったとすれば標準パ
ターンの平均フレーム数Ｉは、Ｊ＝５０であるから、計
算回数は１／３７．５になる。また幅Ｒの窓（計算制限
領域）を設けた場合の計算回数ＮＩ　ＩＲと比べても１
／Ｒとなシ、Ｒは大体２０位であるから計算回数は１／
２ｏとなる。The segmentation unit 10 issues a command to the back pointer storage unit 6 to read a predetermined back pointer. That is, the segmentation unit 10
When the value B (i) is issued to the back pointer storage section 6, the back pointer B (i) is read out from the back pointer storage section e. When the segmentation unit 10 receives the value B(i) from the back pointer storage unit 6, it issues the same value to the back pointer storage unit 6. Therefore, when the voice section detecting section 8 detects the end of the voice input, the final value of the frame number counting section 9 is supplied to the segmentation section 1°, and the segmentation section 1Q first detects the end of the voice input. A value is issued to the back pointer storage section e. Thereafter, according to the operation described above, B(I), B are stored from the back pointer storage unit, etc.
(B(I) ) , B(B(B(I) ) ) ,・
..., outputs of 0 will be obtained sequentially. These values are the last frame of the second to last word, the third last frame of the last word, the fourth last frame of the last word, etc.
..., and since N(i) is a word that ends in the i frame, if this value is given as is to the last word storage section 7, the recognition results will be obtained in the reverse order starting from the last word. Note that in order to prevent recognition results from being obtained in the reverse order, this order conversion is performed using the back pointer storage unit 6.
or the output of the last word storage section 7.0 The total number of calculations up to the final frame in this conventional example device is determined by the distance between the vectors d( at each point in the i-j space) '＋
1) + Intermediate Cumulative Distance ('+I) is calculated only once for each point, so if the average number of complaints is 1, then all of them are NIl.The total calculation in the device using the conventional two-stage DP method Compared to the number of times Nl-!12, the number of calculations becomes 1/(-T).The length of 7 rams is now 10 m5e.
If the average word length is 0n5 sec in c, the average number of frames I of the standard pattern is J=50, so the number of calculations is 1/37.5. Also, compared to the number of calculations NI IR when a window of width R (calculation restriction area) is provided, it is 1
/R and Nashi, R is approximately 20th place, so the number of calculations is 1/
It becomes 2o.

以上のように０（ロ）ＤＰ法においては、連続して発声
された音声を２段ＤＰ法よりもはるかに少い計算回数で
認識することができ、認識速度の速い連続音声認識装置
を実現することができ、入力フレーム毎に処理が可能で
あるから、実時間で動作する認識装置が可能となる。As described above, the 0(b)DP method can recognize continuously uttered speech with far fewer calculations than the two-stage DP method, realizing a continuous speech recognition device with high recognition speed. Since processing can be performed for each input frame, a recognition device that operates in real time is possible.

しかし、前記ベクトル間距離ｄ”（ｔ、ｊ）、中間累積
距離”’（ｉｓ　ｉ　）を７７−　Ａ　ｉ毎に２１＝１
．２゜・・・、Ｎ；ｊ＝１．２．・・・、Ｉｎについて
求めるためには、標準パターン記憶用メモリとして、が
なり高速なアクセスが要求されることになる。例えば、
フレーム周期を６ｍｓ　、標準パターｙ数Ｎ　ｍ３ｅｃ
、標準パターンの平均７レーム数Ｊ　＝ｓ。However, the distance between the vectors d"(t, j) and the intermediate cumulative distance"'(is i ) are 21=1 for every 77-A i
．． 2°..., N; j=1.2. ..., In, a standard pattern storage memory is required to have very high speed access. for example,
Frame period 6ms, standard putter y number N m3ec
, the average number of 7 frames of the standard pattern J = s.

とすれば、入力パターンのフレーム当９の計算すべき平
均の格子点数はＮＴ＝９０００であり、特微ベクトルの
次元数を１６とすれば、入力パターンのフレーム当りの
標準パターン記憶用メモリに対するアクセス回数は９０
００Ｘ１５＝１３５ｏｏｏ　　回“となる。これを５ｍ
５ｅＣ内に行うためには、アクセスタイム３７ｎｓ以下
が要求される。また、標準パターンの記憶に必要とされ
るメモリ量は、特徴ベクトルの１要素を１バイトで表現
するものとすれば、１３６ＫＢとなり、３７ｆｉＳｅＣ
以下でアクセスできる高速メモリが、１３５ＫＢも必要
ということになり、装置として大変高価なものになる。Then, the average number of grid points to be calculated per frame of the input pattern is NT = 9000, and if the number of dimensions of the feature vector is 16, then the access to the standard pattern storage memory per frame of the input pattern is NT = 9000. The number of times is 90
00X15=135ooo times".This is 5m
To perform this within 5eC, an access time of 37 ns or less is required. In addition, the amount of memory required to store the standard pattern is 136KB, assuming that one element of the feature vector is expressed in one byte, which is 37fiSeC.
This means that 135 KB of high-speed memory, which can be accessed below, is required, making the device extremely expensive.

発明の目的本発明は、前記方法による実時間向きの連続的パターン
の比較装置において、高速アクセスが要求されるメモリ
の数を大幅に減少させることを目的とする。OBJECTS OF THE INVENTION It is an object of the present invention to significantly reduce the number of memories that require high-speed access in a real-time sequential pattern comparison device according to the method described above.

発明の構成本発明は、入力信号を特徴ベクトル８１ｒ　ａ　２　ｅ
・・・＋”ｉ＋・・・、ａｌの系列に変換する特徴抽出
手段と、特徴ベクトルの系列ｂｎ　、　ｂｎ、・・・＋
　ｂ”ｊ　ｌ・・・、ｂ”ｉｎカら成る標準パターンＲ
ｎ（ただしｎ＝１．２．・・・。Structure of the Invention The present invention converts an input signal into a feature vector 81r a 2 e
...+"i+..., a feature extraction means for converting into a series of features vectors bn, bn,...+
Standard pattern R consisting of b"j l..., b"in
n (however, n=1.2...

Ｎ）を記憶する標準パターン記憶手段と、入力フレーム
ｉを横軸に、標準パターンフレーム）を縦軸とする格子
グラフにおいて、標準パターンＲｎとマツチングすると
きは傾きｋで、入カノシターンｎのフレームに対してＷ（≦（−Ｔ−）　＋１　）フレー
ム隔ッた直線で挾まれる格子点に対して、標準パターン
の各フレームｊの特徴ベクトルｂ？に対してｊ＝１．２
．・・・、Ｉｎの順にｉ軸方向にマツチング径路の最大
傾斜がｋの傾斜制限をもつＤＰマツチングを行い、１つ
の領域の計算が完了すると次の相隣る前記と同様な領域
の計算を同様に行うというように、入力パターンの全範
囲にわたって前記マツチング計算を行うＤＰマツチング
手段とを含むパターン比較装置である。In a lattice graph with the input frame i as the horizontal axis and the standard pattern frame) as the vertical axis, when matching with the standard pattern Rn, the slope k is used to match the frame of the input pattern n. On the other hand, for the grid points sandwiched by straight lines separated by W(≦(-T-) +1) frames, the feature vector b of each frame j of the standard pattern? for j=1.2
．． ..., DP matching is performed in the i-axis direction in the order of In, with a slope restriction where the maximum slope of the matching path is k, and when the calculation of one area is completed, the calculation of the next adjacent area similar to the above is performed in the same way. and DP matching means for performing the matching calculation over the entire range of the input pattern.

実施例の説明第５図は本発明の原理を示す図である。径路の制限条件
を、第１図または第３図のように選えは、２４．２５を
傾き２の直線、２０．２１を、　＝　７　ｎと直線２４
．２６との交点、２２．２３をｊ＝１と直線２４．２５
との交点とし、ＷをＪ”／２とすれば、２０〜２１に含
まれる格子点の終端累積距離は２２〜２３に含まれる各
格子点までの終端累積距離と、直線２４上の中間累積距
離が求まっておれば、計算できる０また、途中の計算も
矢印２６に示すように横方向に順次下から上へ計算を進
めていくことができる。このようにして、フレーム周期
をＴとするとき、時間ＷＴの間に斜線部Ａの計算を以上
のように行い、次のＷＴの間に斜線部Ｂの計算を以上の
ように行うというように、以上に述べた計算を時間ＷＴ
毎に順次進めてゆくことができる。DESCRIPTION OF EMBODIMENTS FIG. 5 is a diagram showing the principle of the present invention. Choose the path restriction conditions as shown in Figure 1 or Figure 3: 24.25 is a straight line with slope 2, 20.21 is = 7 n and straight line 24
．． Intersection with 26, 22.23, j=1 and straight line 24.25
, and if W is J”/2, then the terminal cumulative distance of the grid points included in 20 to 21 is the terminal cumulative distance to each grid point included in 22 to 23, and the intermediate cumulative distance on straight line 24. If the distance is known, it can be calculated as 0.In addition, intermediate calculations can be performed horizontally from bottom to top as shown by arrow 26.In this way, the frame period is set to T. Then, during time WT, the calculation for the shaded area A is performed as described above, and during the next WT, the calculation for the shaded area B is performed as described above, and so on.
You can proceed step by step.

このようにすると、標準ノ（ターンの第ｊフレームは、
Ｗフレームの入カッくターンと照合する間一定で良いか
ら、標準）くターン用メモリに対して要求されるアクセ
スタイムを遅くすることができる。In this way, the jth frame of the standard turn is
Since it can be kept constant while checking the input turn of the W frame, the access time required for the standard turn memory can be slowed down.

すなわち、各斜線部の格子点の数はＷＴであって、この
各格子点に対する計算をＴＷの間に行う必要があるが（
Ｔはフレームの周期とする）、このとき、標準パターン
の１フレームに対して許容される計算時間はＴＷ／ＩＮ
となり、これは、標準ノ々ターンの１フン一ム分のデー
タを読み出す時間にμ豊であり、特徴ベクトルの次数を
以前と同じく１５次とすれば、標準パターンの１つのベ
クトルの要素を読み出すために許されるアクセスタイム
は８．３μｓｅｃ／１５　＝　５５３　ｎ５ｅｃとなり
、従来例で要求された値に比べると約１６倍のアクセス
タイムで良いことになる。ただし、今度は入力パターン
を記憶するためのメモリに高速性が要求されるが（標準
パターンの第ｊフレームに対して、ＮＷ回の読Ｗみ出しを時間下の間に行わなければならない。In other words, the number of grid points in each shaded area is WT, and calculations for each grid point need to be performed during TW (
T is the period of the frame), then the calculation time allowed for one frame of the standard pattern is TW/IN
This takes a lot of time to read out data for one hour of the standard number turn, and if the order of the feature vector is set to 15 as before, then the elements of one vector of the standard pattern can be read out. Therefore, the access time allowed is 8.3 μsec/15 = 553 n5ec, which is about 16 times the value required in the conventional example. However, this time, high speed is required of the memory for storing the input pattern (for the j-th frame of the standard pattern, reading must be performed NW times in a short period of time).

ＴＷ前記の例に対しては、　　／ＭＷ＝Ｔ／ＮＪ＝５ｍＳＥ
／］− ３００Ｘ３０＝＝０．５８μ気の間に１５個の特徴ベク
トルを読み出す必要がある。従って、ベクトルの１つの
要素を読み出すのに許されるアクセスタイムは３７　ｎ
５ｅｃとなる）、その必要とされる要素は、前記の例に
対しては平均１６・Ｗ＝１５Ｘ３０＝４５０バイトで良
いことになり、大幅なコストダウンに・つながる。TW For the above example, /MW=T/NJ=5mSE
/]-300X30==0.58μ It is necessary to read out 15 feature vectors. Therefore, the access time allowed to read one element of the vector is 37 n
5ec), the required elements are on average 16·W=15×30=450 bytes for the above example, leading to a significant cost reduction.

以上が本発明の原理であるが、Ｗを固定とする単純な考
え方では、Ｗ≦（ｍｉｎ　（Ｉ　ｎ〕／２）　＋１　　
とする必要があり、標準パターンの中の最も短いパター
ンでＷの値が決定されてしまい、あまり効果的でない場
合がある。この欠点を補うためには、Ｉｎが変る毎にそ
れに応じてＷを可変とす石ことが考えられる（Ｗ”−（
Ｊ”／２）　＋１　）。本発明によるこのさらに進んだ
原理による比較装置の原理を次に述べる。The above is the principle of the present invention, but in a simple concept where W is fixed, W≦(min (I n)/2) +1
Therefore, the value of W is determined by the shortest pattern among the standard patterns, which may not be very effective. In order to compensate for this drawback, it is conceivable to vary W accordingly each time In changes (W''-(
J"/2) +1). The principle of this further principle comparison device according to the invention will now be described.

第６図は本発明の詳細な説明する詳細図である。FIG. 6 is a detailed diagram illustrating the invention in detail.

いま、入力パターンが第ｉフレームのときの処理を説明
する。前記斜線Ａ、Ｂ等のそれぞれをブロックと呼ぶこ
とにする。このとき、Ｄｎ（ｉ−（’））が求まってい
なければ、同図におけるＯで示す格・はこのブロックの
１つ前のブロックのｊ＝１゜２、・・・、Ｉｎにおける
格子点でこれらの点における中間累積距離は、本ブロッ
クにおける中間累積距離の初期値となるものである。こ
の場合、マツチング径路の拘束条件は、第３図に示すも
のである。第１図に示す径路の拘束条件を用いるときは
、さらに０における中間累積距離を記憶しておく必要が
ある。Ｄｎ（ｉ−〔こ〕）が求まっているときは、この
ｎに対応する標準パターンに対する計算はスキップする
ことになる。Now, processing when the input pattern is the i-th frame will be explained. Each of the diagonal lines A, B, etc. will be called a block. At this time, if Dn(i-(')) is not found, the case indicated by O in the figure is a lattice point at j=1゜2,...,In of the block immediately before this block. The intermediate cumulative distances at these points are the initial values of the intermediate cumulative distances in this block. In this case, the constraint conditions for the matching path are as shown in FIG. When using the route constraint conditions shown in FIG. 1, it is also necessary to store the intermediate cumulative distance at 0. When Dn(i-[ko]) has been determined, the calculation for the standard pattern corresponding to n is skipped.

以上のようにすれば、各標準パターンＨｎに対し、ｎ毎
にＷｎ＝〔工ｎ／２〕＋１として計算したことを決定す
るには、最初にＤ・（ｉ−−２〕）を０、あるいは■に
初期化しておけば、求まっているときは、０以外の有限
の値になっているはずであることから判断が可能である
。As described above, in order to determine that each standard pattern Hn is calculated by setting Wn = [work n/2] + 1 for each n, first set D・(i--2) to 0, Alternatively, if it is initialized to ■, it is possible to determine since it should be a finite value other than 0 when it has been determined.

第７図は、以上の原理に基づく本発明の実施例を示すブ
ロック図である。１０１は第４図で１と同じ動作を示す
特徴抽出部、１０２は第４図２と同様な単語標準パター
ン記憶部、１０３は第４図　　　　　　。FIG. 7 is a block diagram showing an embodiment of the present invention based on the above principle. Reference numeral 101 denotes a feature extractor which performs the same operation as 1 in FIG. 4, 102 a word standard pattern storage unit similar to that shown in FIG.

３と同様なベクトル間距離計算部であるが、本実１３例
ｆＵｉ算０”向７゛”１”向７６銭・１°標′準パター
ンＲｎの第ｊフレームのベクトルｂ、にｍ＝　−［−！
−（１”　−ｊ＋１　）　］　＋　ｉの範囲の入力パタ
ーンに対しｄｎ（ｍ１１）を求める。The inter-vector distance calculation unit is similar to that in 3, but in the actual 13th example fUi calculation, m = - for the vector b of the j-th frame of the 1° standard pattern Rn of [-!
−(1″−j+1) ] + dn(m11) is determined for the input pattern in the range of i.

１０４は累積距離計算部であって、前記ｄ”（ｍ、ｊ）
と同様の範囲で中間累積距離、終端累積距離を求める。104 is a cumulative distance calculation unit, which calculates the above-mentioned d''(m, j)
Find the intermediate cumulative distance and final cumulative distance in the same range as .

１１２は前記ベクトル間距離の計算と、累積距離の計算
の実行範囲を各ｎ＋］について決定いないとき、前記ｍ
の範囲でｄ”（ｍ、ｊ）、Ｄｎ（ｍ、ｊ）の計算をそれ
ぞれベクトル間距離計算部１０３．累積距離計算部１０
４に指示する。１０５〜１０７は第４図５〜７と同様な
動作をする。１１１は認識結果の出力端子であって、第
４図１１で得られるものと全く等しい。112 is the m
The intervector distance calculation unit 103 and the cumulative distance calculation unit 10 calculate d” (m, j) and Dn (m, j) within the range of
4. 105 to 107 perform the same operations as those in FIG. 4, 5 to 7. Reference numeral 111 is an output terminal for the recognition result, which is exactly the same as that obtained in FIG. 4, 11.

第８図は、第７図における実施例をより明確に説明する
ために、ソフトウェア的にその動作を説明するものであ
り、ソフトウェアにより本発明を実現する場合も本例に
従えば良い。In order to more clearly explain the embodiment shown in FIG. 7, FIG. 8 explains its operation in terms of software, and this example may also be followed when the present invention is implemented by software.

本実施例においては、径路の拘束条件としては第１図の
ものを用いている。In this embodiment, the conditions shown in FIG. 1 are used as the path constraint conditions.

２０１はマツチング径路の開始に先立って、中間累積距
離、終端累積距離等に閃を代入する等の初期化を行うス
テップである。２０２は音声区間終了後の処理のために
ｆｌａｇを′０”にセントしておくステップである。２
０３は音声区間の開始を検出するステップであって、第
７図１０８が行う動作に対応する。音声区間が検出され
るまで、ステップ２０１，２０２が実行される。音声区
間が検出されるとステップ２０４〜ステツプ２３０が入
力の１フレーム毎に実行される。ステップ２０５〜ステ
ツプ２２４は、フレームｌにおいて各標準パターンに対
して実行される。２０６はフレームｉにおいて、標準パ
ターンｎに対して、マツチング計算を行うべきかどうか
を判定するステップである。次の３つの条件が満たされ
たときマツチング計算を実行すべきであると判定する。201 is a step in which, prior to starting the matching path, initialization is performed such as substituting flashes into intermediate cumulative distances, final cumulative distances, and the like. 202 is a step in which the flag is set to '0' for processing after the voice section ends.2
03 is a step of detecting the start of a voice section, and corresponds to the operation shown in FIG. 7 108. Steps 201 and 202 are executed until a voice section is detected. When a voice section is detected, steps 204 to 230 are executed for each frame of input. Steps 205 to 224 are performed for each standard pattern in frame l. 206 is a step of determining whether matching calculation should be performed for standard pattern n in frame i. It is determined that matching calculation should be performed when the following three conditions are met.

Ｄ・（ｉ−〔ピ〕）−一＋ｎ１−（−）＜、Ｉ＋ｎ１−２（−）≧１これは第７図１１２のマツチング計算実行範囲決定部の
動作に相当する。ステップ２０７〜ステツ７”２１０は
ｉｔフレームにおいてマツチング計算の実行に先立って
、マツチング計算の漸化式の初期値を標準パターンｎに
ついて設定する。すなり”（ｍ−１、Ｏ）＝Ｄ（ｍ−１
）、Ｂ”（ｍ−１、Ｏ）＝ｍ−１ツブ２１１〜ステップ
２１７はｊ；１．・・、１０１間バックポインタを標準
パターンｎについて計算する。２１３はその漸化式であ
って、この式は、式（１）におけるｉがｍに変った点を除けば
全く同じ形であり、計算の順序が式（１）ではｉ毎にｊ
を１〜Ｉｎと変化させて求めていたのに対し、この式で
は、各ｊに対し、 −Ｃ’（１”−１＋１））＋ｔ °と変化させて求めている点が異なる。ステップ２１８
〜ステツプ２２２は、ステップ２１３で計距離Ｄｎ（ｍ
、Ｉｎ）が、それ以前に計算された標準パターン　ｎ／
に対する中間累積距離Ｄ”（ｍ、Ｊ”）の最小値よりも
小さいときにステップ２２０で示される置き替を行うも
のである。以上のように、ステップ２０６〜ステツプ２
２４の動作が完了すると、ｌ以前の各フレームについて
、計算されたＨｎに対する最小値がＤ（ｉ）に入ってい
ることになる。以上の動作は主として、第７図１０３〜
１０７で行われるものである。２２５は音声区間の終了
を検出するステップである。音声区間の終了が検出され
るまで、以上のステップ２０６〜ステツプ２２５の動作
が行われ、終端累積距離Ｄ（ｉ）、終端バックポインタ
Ｂ（ｉ）、最後尾単語Ｎ（１）が順次計算され、所定の
場所に記憶されてゆく。ステップ２２６〜ステツプ２２
９は、以上の計算を行って　　　　　　１音声区間を終
了したことが検出されても、前記ブロック毎に処理して
いくものであるから、前記Ｄ　（ｉ）　、　Ｂ　（ｉ）
　、　Ｎ　（ｉ）等は一般には音声区間の終了フレーム
まで求まっていないので、これを完全に求まで進める部
分である。ただし、’ｍａニーｍａ　ｘ　（Ｔ町であり
、Ｉは音声区間の終るフレームである。ステップ２３１
は、以上のようにして求めた終端バックポインタＢ（ｉ
）、最後尾単語Ｎ（ｉ）から入力単語例を前記従来例と
同様にして求める部分である。D·(i-[pi])-1+n 1-(-)<, I+n 1-2(-)≧1 This corresponds to the operation of the matching calculation execution range determining section 112 in FIG. Steps 207 to 7 210 set the initial value of the recurrence formula for the matching calculation for the standard pattern n before executing the matching calculation in the IT frame. -1
), B''(m-1, O)=m-1 The steps 211 to 217 calculate the back pointer between j; 1..., 101 for the standard pattern n. 213 is its recurrence formula, This formula has exactly the same form except that i in formula (1) has been changed to m, and the order of calculation is j for each i in formula (1).
The difference is that this formula is calculated by changing -C'(1''-1+1)) + t ° for each j, whereas it was calculated by changing it from 1 to In.Step 218
~Step 222 calculates the measured distance Dn (m
, In) is the previously calculated standard pattern n/
The replacement shown in step 220 is performed when the intermediate cumulative distance D'' (m, J'') is smaller than the minimum value. As described above, steps 206 to 2
When 24 operations are completed, the minimum value for Hn calculated for each frame before l will be in D(i). The above operations are mainly performed from 103 to 103 in FIG.
107. 225 is a step of detecting the end of the voice section. The operations from step 206 to step 225 described above are performed until the end of the voice section is detected, and the end cumulative distance D(i), end back pointer B(i), and last word N(1) are calculated in sequence. , are stored in a predetermined location. Step 226 - Step 22
9, even if it is detected that one voice section has ended after performing the above calculation, processing is continued for each block, so the D (i), B (i)
. However, 'many max (T town, I is the frame where the voice section ends. Step 231
is the terminal back pointer B(i
), the input word example is obtained from the last word N(i) in the same manner as in the conventional example.

ステップ２３２は結果を出力する部分であり、文字とし
てプリントアウトされるか、他の機器を動かすコマンド
の出力となる。Step 232 is a part that outputs the results, which are printed out as characters or output as commands to operate other devices.

なお、本実施例においては、説明の便宜のために、点（
ｉｒ　＋　）におけるベクトル間距離、中間累積距離、
中間バックポインタ等はそれぞれｄ”（ｉｌｊ）、Ｄ”
（ｉｌｊ）ＩＢ”（ｔ、ｊ）　等と表記したが、これら
の記憶領域はｉｒ１の全範囲にわたって準備する必要は
なく、第１段の格子点におけるＤ”（ｉ、ｊ）、Ｂ”（
ｉｌｊ）＋７）計ｇＫは、第８図２１３に示す漸化式か
らも明らかなように、第１図、第３図の何れの径路に対
しても第】−１１段ｊ−２段Ｏ中間累ｍ距ｍＤ”（ｌ＋
１−１）　、”（ｉｒ１−２Ｌ中間ハック；’ｆ：イ７
りＢｎ（ｉ　、　ｉ　−１）、Ｂｎ（ｉ　、　ｊ−２）
のみ記憶していれば良く、この計算が終了すると、Ｄｎ
（ｉｒ１−２）、Ｂ”（ｉｒ１−２）　の記憶内容は捨
−ＣてＬ４い、”（ｉｌｊ　　ｔ）、Ｂｎ（ｉ、１　　
＊）の内容とそれぞｎＤ”（ｉ、ｊ−２）、Ｂ”（ｉ、
ｊ−２）（Ｄ記憶領域Ｋ、”（’　＋　＋）　ｌＢ”（
１１］）　Ｏ内容ｆＤｎ（ｉｒ　１−１）、Ｂｎ（ｉｒ
１−１）Ｏ記憶領Ｗ２Ｆ１ｍサセ、Ｄｎ（ｉｒ　＋　）
　ＩＢ”（１１１）　ノ記憶領域ニ次段の計算結果を記
憶させて行けば良い。また、各ブロックの幅は、たかだ
かＶｖｍａ　Ｘ可ｍａｗ（Ｔ”）／２）　＋Ｉであれば
良いから、前記漸化式の計算の為のバッフ１メモリとし
ては、入力パターン用としてＷ　　フレームのバックア
メモリ１段分と、中間ｍ＆Ｘ累積距離と中間バックポインタ用のバックァメモリとし
てＷｍａ　ｘフレームのものが、それぞれ３段分と、ベ
クトル間距離記憶用としては、第１図の径路の制限条件
の場合はｄｎ（ｉ−１，ｊ）のみ記憶していれば良いか
ら１フレ一ム分のみ準備しておけば良いことになる。第
３図の径路の制限条件の場合はベクトル間距離は計算さ
れる都度使用されるから特に記憶する必要はない。In addition, in this example, for convenience of explanation, point (
ir + ), intermediate cumulative distance,
Intermediate back pointers etc. are d” (ilj) and D” respectively.
(ilj) IB"(t, j) etc., but these storage areas do not need to be prepared over the entire range of ir1, and D"(i, j), B"(
ilj)+7) As is clear from the recurrence formula shown in FIG. Cumulative m distance mD”(l+
1-1) ,”(ir1-2L intermediate hack;'f:i7
Bn(i, i-1), Bn(i, j-2)
All you need to remember is Dn.
(ir1-2), B" (ir1-2) are discarded, "(ilj t), Bn(i, 1
*) and the contents of nD"(i, j-2) and B"(i,
j-2) (D storage area K, "(' + +) lB" (
11]) O contents fDn(ir 1-1), Bn(ir
1-1) O memory area W2F1m Sase, Dn (ir + )
The calculation result of the next stage may be stored in the storage area of IB" (111). Also, the width of each block may be at most Vvma x maw(T")/2) +I, so The buffer 1 memory for calculating the recurrence formula consists of one stage of W-frame backer memory for the input pattern, and one Wmax-frame backer memory for the intermediate m&X cumulative distance and intermediate back pointer. For storing three stages and the distance between vectors, in the case of the path restriction conditions shown in Figure 1, it is sufficient to store only dn(i-1,j), so prepare only one frame. That's a good thing. In the case of the route restriction conditions shown in FIG. 3, the distance between vectors is used each time it is calculated, so there is no need to specifically store it.

また、本実施例においては、第１図または第３図に示す
ような径路の拘束条件を用いたため径路の最大の傾斜ば
２となるので、ブロックの幅は〔工ｎ／２〕＋１となっ
たが、径路の拘束条件としては、これ以外に種々前えら
れ、例えば、第９図のような場合は、最大の傾斜が３と
なるからこの揚殻に径路の最大の傾斜がｋのときは、ブ
ロックのとはＪｎが整数の場合は等しい）０さらに、本実施例においては、中間累積距離めに、これ
を〜等に初期化している場合について説明したが、入力
のフレーム１についてｉがＷｎで割り切れるか否かを判
定して、割り切れるならマツチング計算を行い、割り切
れないならマツチング計算をスキップするようにしても
勿論良い。In addition, in this example, since the constraint conditions for the path as shown in FIG. 1 or 3 are used, the maximum slope of the path is 2, so the width of the block is [engine n/2] + 1. However, there are various other constraint conditions for the path.For example, in the case shown in Figure 9, the maximum inclination is 3, so when the maximum inclination of the path in this shell is k, is the same as that of the block if Jn is an integer)0 Furthermore, in this embodiment, the intermediate cumulative distance is initialized to ~, etc., but for input frame 1, i Of course, it is also possible to determine whether or not is divisible by Wn, and if it is divisible, perform the matching calculation, and if not, skip the matching calculation.

発明の効果以上のように、本発明によれば、ＣＷＤＰ法や０　（ｎ
Ｊ　Ｄ　Ｐ法のように、ＤＰマツチングを高速に連続し
て行う方法においては、高速、大容量のメモリが必要で
あったのが、高速性を要求されるメモリ数を大幅に減ら
すことができ、安価な装置の実現が可能となったもので
ある。Effects of the Invention As described above, according to the present invention, the CWDP method and 0 (n
Methods such as the JDP method that perform DP matching continuously at high speed required high-speed, large-capacity memory, but now the number of memories that require high speed can be significantly reduced. This makes it possible to realize an inexpensive device.

[Brief explanation of the drawing]

第１図はマツチング計算径路の拘束条件を示す図、第２
図はマツチング計算を行う領域を示す図、第３図はマツ
チング計算径路の別の拘束条件を示す図、第４図は音声
認識装置の従来例を示すブロック図、第６図は本発明の
原理の概略を説明する図、第６図は本発明の原理の詳細
を説明する図、第７図は本発明における一実施例の音声
認識装置のブロック図、第８図は同実施例装置の機能を
ンフトウエアで実現したときのフローチャート、第９図
はマツチング計算径路の拘束条件の他の例を示す図であ
る。　　　　　　　　　　　　　　　　　　　　　　１
：′１０１・・・・・・特徴抽出部、１０２・・・・・
・単語標準パターン記憶部、１０３・・・・・・ベクト
ル間距離計算部、１０４・・・・・・累積距離計算部、
１０６・・・・・・終端累積距離記憶部、１０６・・・
・・・バックポインタ記憶部、１０７・・・・・・最後
尾単語記憶部、１０８・・・・・・音声区間検出部、１
０９・・・・・・フレーム数計数部、１１０・・・・・
・セグメンテーション部、１１２・・・・・・マツチン
グ計算実行範囲決定部。代理人の氏名　弁理士　中　尾　敏　男　ほか１名第１
図第２図 λ力へ〇ターン第３図第９図（１−２゜Figure 1 shows the constraint conditions of the matching calculation path, Figure 2
The figure shows the area in which matching calculations are performed, Fig. 3 shows another constraint condition for the matching calculation path, Fig. 4 is a block diagram showing a conventional example of a speech recognition device, and Fig. 6 shows the principle of the present invention. 6 is a diagram explaining the details of the principle of the present invention, FIG. 7 is a block diagram of a speech recognition device according to an embodiment of the present invention, and FIG. 8 is a diagram showing the functions of the device according to the embodiment. FIG. 9 is a flowchart when this is realized by software, and is a diagram showing another example of constraint conditions for the matching calculation path. 1
:'101...Feature extraction section, 102...
・Word standard pattern storage unit, 103... Inter-vector distance calculation unit, 104... Cumulative distance calculation unit,
106...Terminal cumulative distance storage unit, 106...
... Back pointer storage section, 107 ... Last word storage section, 108 ... Voice section detection section, 1
09... Frame number counting section, 110...
- Segmentation unit, 112...Matching calculation execution range determination unit. Name of agent: Patent attorney Toshio Nakao and 1 other person No. 1
Figure 2 Turn to λ force Figure 3 Figure 9 (1-2°

Claims

[Claims] The input signal is converted into feature vectors a_1, a_2, ..., a_i,
..., a_I, and a feature vector sequence b^n_1, b^n_2, ..., b^n_j,
...standard pattern R^n consisting of b^n_Jn (however,
standard pattern storage means for storing n=1, 2,...,N), and standard pattern frame j with input frame i on the horizontal axis.
In a lattice graph whose vertical axis is the standard pattern R^n
When matching with , the slope is k, and W(≦2-[(1-J^n-1)/k]) for the frame of the input pattern.
For the grid points sandwiched by straight lines separated by frames, the matching radius is calculated in the i-axis direction in the order of j = 1, 2, ..., J^n for the feature vector b^n_j of each frame j of the standard pattern. The maximum slope of is calculated over the entire range of the input pattern by performing DP matching with a slope limit of k, and when the calculation of one region is completed, the calculation of the next adjacent region similar to the above is performed in the same way. D performing the matching calculation
A pattern comparison device comprising: P matching means.