JPS6073698A

JPS6073698A - Pattern comparator

Info

Publication number: JPS6073698A
Application number: JP58183361A
Authority: JP
Inventors: 中川　聖一; 英一坪香
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1983-09-30
Filing date: 1983-09-30
Publication date: 1985-04-25
Also published as: JPH0449954B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】産業上の利用分野本発明は連続する音声等のパターンを一連のパターンと
して自動的に認識するパターン比較装置に関する。DETAILED DESCRIPTION OF THE INVENTION Field of the Invention The present invention relates to a pattern comparison device that automatically recognizes continuous speech patterns or the like as a series of patterns.

従来例の構成とその問題点パターンマツチングによる音声認識装置の一般的な構成
は次のようなものである。Conventional configuration and its problems A general configuration of a speech recognition device using pattern matching is as follows.

入力音声信号を、フィルタバンク、周波数分析ＬＰＣ分
析等によって特徴ベクトルの系列に変換する特徴抽出手
段と、予め発声され、この特徴抽出手段により抽出され
た特徴ベクトルの系列を認識単語全部について標準パタ
ーンとして登録しておく標準パターン記憶手段と、認識
させるべく発声され、前記特徴抽出手段にょシ抽出され
た入力パターンと前記標準パターン記憶手段に記憶され
ている標準パターンの全てと特徴ベクトルの系列として
の類似度あるいは距離を計算するパターン比較手段と、
パターン比較の結果、最も類似度の高かった（距離の小
さかった）標準パターンに対応する単語を認識結果とし
て判定出力する判定手段からなる。A feature extraction means converts an input speech signal into a series of feature vectors by using a filter bank, frequency analysis LPC analysis, etc., and a series of feature vectors uttered in advance and extracted by the feature extraction means are used as a standard pattern for all recognized words. A standard pattern storage means to be registered, an input pattern uttered to be recognized and extracted by the feature extraction means, and similarity as a series of feature vectors to all standard patterns stored in the standard pattern storage means. a pattern comparison means for calculating degree or distance;
It consists of a determining means for determining and outputting a word corresponding to the standard pattern with the highest degree of similarity (smallest distance) as a recognition result as a result of pattern comparison.

このとき、同一話者が同一の単語を発声しても発声の都
度、その発声時間長が異るので、前記パターン比較手段
で標準パターンと入カバターンの比較を行う際には、両
者の時間軸を伸縮させ、両者のパターン長を揃えて比較
する必要がある。その際、発声時間長の変化は、発声単
語の各部で一様に生じているわけではないので、各部を
不均一に伸縮する必要がある。その伸縮は、比較すべき
両者のパターンの類似度が最大になる（距離が最小にな
る以下距離で説明するように行われるのが最も良い結果
が得られている。このようなマツチングを効率的に行う
のに動的計画法を用いる装置が一般的である（以下この
マツチングをＤＰマツチングと称する。）ＤＰマツチングの方法は格子グラフによって説明できる
。第１図は格子グラフであって、横軸は入カバターンＴ
　＝ａ１　ａ２・・・ａＸに対応するｉ座標、５ベージるｉ座標を表す。入カバターンＴと標準パターンを時間
軸を非線形に伸縮してマツチングすることはこの格子グ
ラフ上において、両パターンの各特徴ベクトルの対応関
係を示す径路（１）を何らかの棟側基準によって決定し
、この径路に関して両パターンの距離を評価することで
ある。この径路を決定する際には音声の性質を考慮して
制限条件を設ける。第２図（ａ）は径路選択の制限条件
の一例である。At this time, even if the same speaker utters the same word, the duration of the utterance differs each time, so when comparing the standard pattern and the input pattern using the pattern comparison means, the time axis of both is It is necessary to expand and contract the pattern lengths of the two to make them the same and compare them. At this time, since the change in utterance time length does not occur uniformly in each part of the uttered word, it is necessary to expand and contract each part non-uniformly. The best results have been obtained when the expansion/contraction is performed as described below using distance, which maximizes the similarity between the two patterns to be compared (minimizes the distance). (hereinafter, this matching is referred to as DP matching).The DP matching method can be explained using a lattice graph.Figure 1 is a lattice graph, in which the horizontal axis Cover turn T
=a1 a2... represents the i coordinate corresponding to aX, the i coordinate of 5 pages. To match the input cover turn T and the standard pattern by nonlinearly expanding and contracting the time axis, on this grid graph, a path (1) indicating the correspondence between the feature vectors of both patterns is determined by some kind of standard, and this The goal is to evaluate the distance between both patterns with respect to the path. When determining this route, limiting conditions are set in consideration of the nature of the voice. FIG. 2(a) is an example of restrictive conditions for route selection.

即ち、この例では点（’ｙｊ）へ至る径路は、点（ｉ−
２，１−１）から点（ｉ−１，ｊ）を通る径路（乃か、
点（ｉ　−１、ｊ　−１）か銖る径路（３）か、点（ｉ
−１，ｊ−２）から点（ｉ、ｊ−１）′ｆ：通る径路（
→かの何れかしか取り得ないということを意味している
。That is, in this example, the path to point ('yj) is point (i-
2,1-1) to the point (i-1,j) (or,
The point (i -1, j -1) or the path (3) or the point (i
-1, j-2) to point (i, j-1)'f: Path passing through (
→It means that only one of the following can be taken.

このとき、入カバターンと標準パターンの始端と終端は
必ず対応させるという条件をつければ、前記マツチング
の径路は第１図の斜線の部分に制限される。この制限は
、いかに時間軸が伸縮するといっても、同一単語に対し
てはそれ程極端に伸縮するはずはないという事実からあ
捷り極端な対応６ベージづけが生しないようにするためである。At this time, if the condition is that the starting and ending ends of the input cover pattern and the standard pattern must correspond, the matching path is limited to the shaded area in FIG. This restriction is to prevent extreme correspondence six page assignments from occurring due to the fact that no matter how much the time axis expands or contracts, it is unlikely that the same word will expand or contract to that extent.

ａｉとｂｆｌのベクトル間距離をｄｎ（ｉ、ｊ）とすれ
ば、入カバターンＴと標準パターンＲｎのパターン間の
前記径路に沿う距離は、その径路に沿うｄｎ（ｉ、ｊ）
の荷重平均として定義される。第２図の径路上のａ、ｂ
、ｃ、ｄはそれに対応する径路が選ばれたときの荷重で
あるＤＰマツチングが適用できるためにはこの荷重の決
め方は、格子グラフ上で前記制限条件の下でいかなる径
路が選ばれようともその径路に沿う荷重の和が一定にな
るように決めれば良い。’ａ＝ｃ＝ｅ＝２．１）＝ｄ’
＝１とすれば、この荷重の和はＩ　＋　１　”　、　ａ
　＝　ｂ　＝　ｃ　＝　１　。If the distance between the vectors ai and bfl is dn (i, j), then the distance along the path between the input cover turn T and the standard pattern Rn is dn (i, j) along that path.
is defined as the weighted average of a, b on the route in Figure 2
, c, and d are the loads when the corresponding paths are selected. In order for DP matching to be applicable, the method for determining these loads is such that no matter what path is selected on the grid graph under the above-mentioned limiting conditions, It may be determined so that the sum of the loads along the path is constant. 'a=c=e=2.1)=d'
= 1, the sum of these loads is I + 1'', a
= b = c = 1.

ｄ　＝　ｅ　＝０．５とすれば、この荷重の和ばＩ、ａ
＝＝ｂ＝ｏ、　５．ｃ＝ｄ＝ｅ＝１　とすれば、この荷
重の和は■０　となり径路の選ばれ方によらず一定とな
る。If d = e = 0.5, the sum of this load I, a
==b=o, 5. If c=d=e=1, the sum of these loads becomes 0, which is constant regardless of how the path is selected.

これらは共によく用いられる。また、前記荷重の和一定
という条件の下でこの荷重をｊに関する関数とすること
により、より重視してマツチングしたい径路上の部分の
荷重を重くする等の操作も可能である。Both of these are commonly used. Furthermore, by making this load a function of j under the condition that the sum of the loads is constant, it is also possible to perform operations such as increasing the load on a portion of the path that is to be matched with greater emphasis.

７ページ人カバターンＴと標準パターンＲｎの距離は、前記制限
条件の下で、前記荷重平均の最小値として定義される。The distance between the page 7 human cover turn T and the standard pattern Rn is defined as the minimum value of the weighted average under the limiting conditions.

即ち、次の漸化式を解くことによって前記荷重平均の最
小値とその最小値を与える径路が決定され得る。That is, by solving the following recurrence formula, the minimum value of the weighted average and the path that provides the minimum value can be determined.

初期条件　ｑｎ（１，１）＝ｄｎ（１，１）ここにＤ（
Ｔ、Ｒ”）は入カバターンＴと標準パターンＲｎの距離
である。Initial condition qn (1, 1) = dn (1, 1) where D (
T, R'') is the distance between the input cover turn T and the standard pattern Rn.

径路選択の条件としては他にも種々考えられる第２図（
ｂｌ〜（ｉ）等は他の例である。この他にもさらに種々
の変形が考えられ得る。これら径路の選択条件に伴って
前記漸化式は対応するものに書き換えられるのは勿論で
ある。There are various other possible route selection conditions as shown in Figure 2 (
bl~(i) etc. are other examples. In addition to this, various other modifications can be considered. Of course, the recurrence formula can be rewritten into a corresponding one depending on the selection conditions of these routes.

孤立して発声された単語を認識する場合は勿論連続して
発声された（単語と単語の間に切れ目なく発声された）
音声を認識する場合もＤＰマツチングは良好な成積をお
さめている。When recognizing words that were uttered in isolation, of course they were uttered consecutively (uttered without any breaks between words).
DP matching has also achieved good results when recognizing speech.

連続単語音声認識の問題は次のように定式化される。The problem of continuous word speech recognition is formulated as follows.

入力バター／のフレーム数を１１第ｉフレームの特徴ベ
クトルをａｉｓ単語ｎの標準ノ（ターンのフレーム数を
把、第ｊフレームの特徴ベクトルをｂη　とするとき、
単語ｎの標準パターンＲｎは次のように表わされるＯＲ”　＝　ｂ秒”−’−ｂ”ｊ　−”　ｂ’ｉ’ｎそこ
でＸ個の単語列に対応する標準パターンの結合、 □＝Ｒｑ（１）Ｒイ２）１．、Ｈｑｌ→＝ｂｑ（１）ｂ
ｑε１）・・・ｂ、ｑ（、を入）ｂ（］呪２）ｂｑ！、
２）、、・ｂ？（皺のと入カバターンＴ　＝ａ　１　ａ
　２・・・ａｉ・・・１１とのベクトル系列間の距離が
最小になる単語列ｑ（１）　ｑ（２）・・・ｑ（ｘ）９
ページをめる。The number of frames of the input butter is 11. The feature vector of the i-th frame is ais the standard number of word n.
The standard pattern Rn of word n is expressed as follows: OR" = b seconds"-'-b"j -"b'i'nThen, the combination of standard patterns corresponding to X word strings, □=Rq (1)R2)1. , Hql→=bq(1)b
qε1)...b, q(, enter) b(] Curse 2) bq! ,
2),,・b? (Wrinkled cover turn T = a 1 a
2...ai...11 Word string q(1) where the distance between the vector series is minimum q(2)...q(x)9
Turn the page.

以上の計算を前記孤立単語の場合と同様にしてそのまま
ＤＰマツチングで解こうとすれば、例えば１ｏ数字の単
語を標準パターンとしてもっているとき、３数字の連続
発声された音声を認識するには１６３−１ｏｏＯ種類の
標準パターンとマツチングしなければならない。標準パ
ターンの数が増せばたちまちその組合せの数は禁止的な
量になる。If we try to solve the above calculation using DP matching in the same way as in the case of isolated words, for example, when we have a word with 1o digits as a standard pattern, it takes 163 digits to recognize the sound of 3 digits uttered consecutively. -1ooO types of standard patterns must be matched. As the number of standard patterns increases, the number of combinations quickly becomes prohibitive.

そこで、連続単語の認識にもＤＰマツチングを適用する
ために、マツチングの累積距離の正規化係数（前記荷重
の和のこと）は入力のフレーム数にのみ依存するように
径路の選択の条件を設定すれば、以下に示すように標準
パターンの単語の組合せにも動的計画法が適用でき計算
量を大幅に減らし得る。Therefore, in order to apply DP matching to continuous word recognition, we set the conditions for path selection so that the normalization coefficient of the cumulative distance of matching (the sum of the loads) depends only on the number of input frames. Then, as shown below, dynamic programming can be applied to word combinations of standard patterns, and the amount of calculation can be significantly reduced.

径路の選択条件としては一般に第３図（ａ）〜（ｅ）に
示すものがある。径路上に示した数値はその径路が選ば
れたときの荷重係数である。There are generally conditions for selecting a route as shown in FIGS. 3(a) to 3(e). The numerical value shown on the route is the load factor when that route is selected.

入カバターンＴの第ｉフレーム（の特徴ベントル、以後
フレームとのみ称する）　ａｉとＸ個の標準１０ページパターン連結からなる連続標準パターンＨの第ｊフレー
ムｂ、のフレーム間距離（ベクトル距離）をｄＲ（’＊
ｉ）とし、入カバターンと連続標準パターンとの対応づ
けをする時間関数（前記マツチングの径路）を（ｉ）と
して、この時間関数に沿ってめられる次の累積距離（フ
レーム間距離の荷重の和）Ｄ（ＴｔＲ）を最小化するＲ
（介と記す）がめるものであるとする。即ち、Ｄ（Ｔ、Ｒ）＝ｍｉｎ［！　ｄＲ（ｉｔｕ（ｉ））：ｌ
　・・・・・・（３１ｕ（ｉ）　ｘ＝１ｆ’ｔ＝ａｒｇｍｔｎ　（Ｄ　（Ｔ　、　Ｒ）　〕ここ
で、第３図（−）の径路のときは、０＜ｕ（ｉ）　−ｕ
（ｉ　−１）≦２　ｊ　ｕ（１）＝１　？　ｕ（Ｉ）　
＝　ＪＲである。また、ｍ１ｎ（ｆ（ｚ））は２に関し
て最小化されたｆ　（ｚ）　、　ａｒｇｍｉｎ（ｆ（ｚ
））はｆ（ｚ）を最小にする２の値を意味する。Let dR be the interframe distance (vector distance) between the i-th frame (characteristic ventle of the incoming cover turn T, hereinafter referred to simply as frame) ai and the j-th frame b of the continuous standard pattern H consisting of X standard 10-page pattern concatenations. ('*
i), and the time function that matches the input cover pattern with the continuous standard pattern (the matching path) is (i), and the next cumulative distance measured along this time function (the sum of the weights of the interframe distances) is ) D(TtR)
(denoted as ``intermediate''). That is, D(T,R)=min[! dR(itu(i)):l
・・・・・・(31u(i)
(i −1)≦2 j u(1)=1? u(I)
= JR. Also, m1n(f(z)) is minimized with respect to 2 f(z), argmin(f(z
)) means the value of 2 that minimizes f(z).

式（４は単語数既知の場合、未知の場合、あるいはオー
トマトン制御を組み込んだ形で解くことができ、その方
法については既に種々提案されてお１１ベ−ジク、製品化されている例もある。Equation (4) can be solved when the number of words is known, when it is unknown, or when automaton control is incorporated. Various methods have already been proposed, and some examples have been commercialized. .

本願はこのうち、オートマトン制御を組み込んだ形で式
（鴫を解くことによって、連続して発声された音声を認
識する装置に関するものである。Among these, the present application relates to a device that recognizes continuously uttered speech by solving equations incorporating automaton control.

次に、オートマトン制御を組み込んだ形で式（３）を解
〈従来の方法について説明する。Next, we will explain the conventional method of solving equation (3) by incorporating automatic control.

我々が実際に単語を連続して発声する場合は、それらの
順序が決っている場合が多い。従って、入カバターン（
入力文）は、有限状態オートマトンαと等価な正規文法
によって生成された文であるとし、オートマトンａで受
理されるあらゆる単語列のうち、式（鴫を最小にする単
語列（ｑ（１）　。When we actually utter words in succession, their order is often fixed. Therefore, the input cover turn (
The input sentence) is a sentence generated by a regular grammar equivalent to the finite state automaton α, and among all the word sequences accepted by the automaton a, the word sequence (q(1)) that minimizes the expression (Q).

ｑ（Ｊ、・・・ｔｑ（→）をめるというようにすること
によって、認識率を向上させることができる。ここで、
単語列（ｑ（１）　、　ｑ（２！ｌ　、・・・、ｑ（ｘ
））が、オートマトンａで受理されるとはΔ（ｑｏ、ｑ
（１））＝ｑ、Ｃ８゜Δ（ｑｉ、ｑ（坤＝ｑｊＣ８，・
・・、Δ（ｑｋｙｑ（”−’））＝Ｓは状態ｑの有限集
合［ｑ６ｑ□ｙｑ１　？・・・、ｑ　）。The recognition rate can be improved by increasing q(J,...tq(→).Here,
Word string (q(1) , q(2!l ,..., q(x
)) is accepted by automaton a if Δ(qo, q
(1))=q,C8゜Δ(qi,q(gon=qjC8,・
..., Δ(qkyq(''-'))=S is a finite set of states q [q6q□yq1 ?..., q ).

−１−１ Σは入力単語ｎの有限集合（ｎｌ−１，２，・・・、Ｎ
）。-1-1 Σ is a finite set of input words n (nl-1, 2,..., N
).

Δは状態遷移関数で、Ｓ×Σ→Ｓ、（Δ（ｑｉ　ｔ”）
−ｑ’　ｌ　ｙ】ｑｏは初期状態でｑ。４Ｓ、Ｆは最終状態の集合ＦＣ８
であるｏ　ココ−’ｒｑ（１）　ｐ−ｑｃｘｉ　ｅ　ｉ
　１ｔ２ｐ−Ｎｌである。Δ is the state transition function, S×Σ→S, (Δ(qi t”)
-q' l y] qo is q in the initial state. 4S, F is the final state set FC8
is o coco-'rq(1) p-qcxi e i
1t2p-Nl.

通常のオートマトンの認識問題と異る点は、時間を表わ
すフレーム番号も変数として入っている点であり、しか
も単に受理、拒否の出力でなく、受理可能な度合（累積
距離）が出力される点であるＯＤｑ　、　（ｉ）を状態ｑ５　で入力のｉフレームで終
端す】ると仮定したあらゆる単語列のうちの最小累積距離、Ｎ
ｑ、（ｉ）をＤｑ　、　（１）に対応する単語列の最後
尾単］　】語基・Ｂｑ　、　０）をＮｑ　、　（ｔ）の始点位置′
イナ７”（Ｎ１（ｉ）の一つ前の単語の最終フレーム、
バックポインタと称する）、Ｑｑ、（ｉ）をｑ５　への
状態遷移】によってＤｑ　、　（１）を満たした状態名即ちΔ（Ｑ
ｑｊ（ｉ）、Ｎｑｊ（ｉ））−ｑｊとするとき、次の漸
化式を解くことで、オートマトン制御による式（′４の
解が得られる。即ち、１３ページ初期条件　Ｄｇｏ（ｏ）−０１Ｂ９許）＝０としてｑ＝
Δ（ｑｋｙｎ）　・・・・・・（４Ｎｑ（ｉ）＝ｎ、Ｂ
９（ｉ）−ｍ、Ｑ９（ｉ）＝ｑｋとする。ｉ　＝　Ｉま
でこの計算を行えば、次のようにして最後尾の単語から
逆順に単語がまる０即ち ■　ｉ　”　Ｉ　ｔ　（１＝ａ９ｍｉｎｑｆ　Ｄｑｆ（
ｉ）、ｑ１６Ｆとして■　ｎ＝Ｎｑ（ｉ） ■　Ｂ　ｑ　（１）＼○なら、’　＝　Ｂｑ（’）　ｔ
　ｑ　＝％（ｉ）として■へ、Ｂｑ（ｉ）＝０なら終了
する。The difference from normal automaton recognition problems is that the frame number representing time is also included as a variable, and the degree of acceptability (cumulative distance) is output instead of simply accepting or rejecting. The minimum cumulative distance among all word strings, N
q, (i) as Dq, the last unit of the word string corresponding to (1)]] Base・Bq, 0) as Nq, starting position of (t)'
Ina7” (the last frame of the word before N1(i),
(referred to as back pointer), Qq, (i) state transition to q5], the state name that satisfies Dq, (1), that is, Δ(Q
When qj (i), Nqj (i)) - qj, by solving the following recurrence formula, the solution to the equation ('4) by automaton control can be obtained. In other words, Initial condition on page 13 Dgo (o) - 01B9allow)=0 and q=
Δ(qkyn) ・・・・・・(4Nq(i)=n, B
9(i)-m, Q9(i)=qk. If this calculation is performed until i = I, the words will be rounded to 0 in reverse order from the last word as follows:
i), q16F, ■ n=Nq(i) ■ B q (1)\○, then ' = Bq(') t
Set q=%(i) and go to ■. If Bq(i)=0, end.

第６図はフローチャートである。FIG. 6 is a flowchart.

なお、Ｄ”（ｍ＋１　：　ｉ　）　は次式で定義され、
前記の孤立単語のＤＰマツチングと同じ方法でめられ・
る。In addition, D"(m+1: i) is defined by the following formula,
In the same way as the DP matching of isolated words mentioned above,
Ru.

Ｄ”（ｍ＋１　：　ｔ　）　’　（′４：ｄ”（ｋ、ｕ
（ｋ））：）−（ａ）冒ｉ）”　ｋ＝ｍ＋１ここで、第３図（ａ）の径路のときはｏ＜：ｕ（ｋ）−ｕ　（ｋ−１）≦：２．ｕ（ｍ＋１　
）＝１　、ｕ（ｉ）＝　１”１４ページである。D"(m+1:t)'('4:d"(k, u
(k)):)-(a)expi)"k=m+1 Here, in the case of the route shown in Fig. 3(a), o<:u(k)-u (k-1)≦:2.u (m+1
)=1, u(i)=1”14 pages.

式（′４は、予め定められたｍの範囲１１〜１２につい
てＤ”（ｎ汁１：ｉ）をめ、各ｍについて既にめ満足す
る）とｎをめ、これをＤｑ（ｉ）とするのであるが、こ
の計算量を減する方法として、ｉ＝ｍ＋１以後は単語ｎ
で、次の状態がｑであるとしたときの、（ｔｔｔ）＝（
１ｙ１）から（’　ｔ　）　）＝（”ｙ　］リ　までの
累積距離ヲＤ”（”、ｉ’）　トｔ　ルトキ、Ｄｑ（’
ｔＴ”）の前記ｎ　、　ｑｋについての最小値としてＤ
ｑ（ｚ）をめる方法が提案されている。Formula ('4 is D'' (n juice 1:i) for a predetermined range of m from 11 to 12, and satisfies the condition for each m) and n, and let this be Dq(i). However, as a way to reduce the amount of calculation, after i=m+1, word n
Then, when the next state is q, (ttt) = (
Cumulative distance from 1y1) to (' t ) = ("y ]ri woD"(",i')
D as the minimum value for n, qk of tT'')
A method of calculating q(z) has been proposed.

これらの方法の問題点は、オートマトンの構造によって
は、さらに計算量を減じ得るものであるが、従来はその
点が考慮されていなかった点にある０発明の目的オートマトン制御による連続音声認識における計算量を
大幅に削減したパターン比較装置を提案することを目的
とする。The problem with these methods is that although the amount of calculation can be further reduced depending on the structure of the automaton, this point has not been taken into consideration in the past. The purpose of this paper is to propose a pattern comparison device that significantly reduces the amount of data required.

１５ページ発明の構成本発明はオートマトン制御による連続パターンのＤＰマ
ツチングにおいて、オートマトンの制御規則をｑ＝Δ（
ｑｋｔ”）　とするとき、パターンｎに対して、状態ｑ
ｋが複数存在するとき、それぞれの状態に対してその状
態１での累積距離が、最小であるものを／４にとすれば
、状態ｑまでの最後尾△ パターンｎに対する累積距離は直前の状態がｑｋのみで
あるとして計算することにより、従来ｑｋのすべてに対
して行っていた累積距離の計算を減らすものである。Page 15 Structure of the Invention The present invention is based on DP matching of continuous patterns by automaton control.
qkt''), then for pattern n, state q
When there are multiple k, if the minimum cumulative distance in state 1 for each state is /4, then the cumulative distance to the last △ pattern up to state q is the previous state By calculating as if only qk exists, the calculation of cumulative distance, which was conventionally performed for all qk, is reduced.

実施例の説明以下に本発明の原理及び実施例を説明する。Description of examples The principle and embodiments of the present invention will be explained below.

第６図は本発明の詳細な説明するための図で、第６図（
ａ）は、（ｂｌに書かれた単語に対する有限オートマト
ン表現である。ここでの説明は簡単のために前記単語の
代りに単音節としている。上記従来例における計算では
、すべての状態遷嵌第７図においてその数は２２）に対
して計算を行なわなければならなかった０ところが、第
７図の例では、状態Ｓ１２　には状態Ｓ６．Ｓ７．Ｓ８
がら同じ”ン′”が入ってきている。このような場合、
状態Ｓ６．Ｓ７゜Ｓ８を一時的に縮退化してやれば、状
態ｓ１２のす″　に関しては一回のみ計算で済む０この
ことは、状態Ｓ　のＹＡ″′につぃても言える。本発明
４は、この原理を利用して計算量の削減をはかったパター
ン比較装置である。FIG. 6 is a diagram for explaining the present invention in detail.
a) is a finite automaton representation for the word written in (bl).In order to simplify the explanation here, a single syllable is used instead of the word.In the calculation in the conventional example above, all state transitions are In FIG. 7, the number is 0, which had to be calculated for 22), but in the example of FIG. 7, state S12 has state S6. S7. S8
However, the same “n′” is included. In such a case,
State S6. If S7 and S8 are temporarily degenerated, only one calculation is required for S'' in state s12. This also applies to YA''' in state S. The present invention 4 is a pattern comparison device that utilizes this principle to reduce the amount of calculation.

第７図の例において、ｑ＝＄１２．ｎ０＝ｎ０（ｎｏは
、音節゛ケバに付された番号）とすれば、ｑ＝８１２に
対して式（′４の意味するところはＤＢ　（ｉ）：＝ｍｉｎ　［：ＤｎＯ（ｆＷ）＋Ｄ”Ｏ
（ｍ＋１：ｉ）〕１２　ｑｋ、ｍ　（Ｉｋとなる。換言すれば、第５図において、＄６．ｓ７゜Ｓ
８から１ケ″を発して”１２に遷移するとき、ＤＢ１２
（ｉ）を最小にするためには、ＤＢ６（ｍ）、ＤＢ７（
ｍ）。In the example of FIG. 7, q=$12. If n0=n0 (no is the number attached to the end of the syllable), then for q=812, the formula ('4 means DB (i):=min [:DnO(fW)+D”) O
(m+1:i)]12 qk, m (Ik. In other words, in Fig. 5, $6.s7゜S
When emitting 1 digit from 8 and transitioning to 12, DB12
In order to minimize (i), DB6(m), DB7(
m).

Ｄｓ、（ｍ）のうちの最小の状態から遷移することに１
７ベー〕！なるということである。これを一般的に書けば、ｑｏ　
を初期状態として、Ｄｑｏ（Ｏ）＝０．Ｂｑｏ（Ｏ） △ Ｎｑ（ｉ）＝ｎ、Ｂｑ（ｉ）＝ｍ、Ｑｑ（ｉ）＝ａｒｇ
ｍｉｎ（Ｄ　（Ｑ）：］ｋとなる。即ち、フレームｉにおいて状態ｑとなる直前の
状態のうち、同じ音節（あるいは単語）ｎで連がるもの
があるときは、それら状態に到るまでの累積距離が最小
である状態から連がるということである。1 to transition from the smallest state of Ds, (m)
7 b]! It means that it will become. If we write this in general, qo
As the initial state, Dqo(O)=0. Bqo(O) △ Nq(i)=n, Bq(i)=m, Qq(i)=arg
min(D (Q):]k.In other words, if there are states immediately before state q in frame i that have the same syllable (or word) n, the process continues until those states are reached. This means that the state continues from the state where the cumulative distance of is the minimum.

このことを利用すれば、第６図の例の場合は゛ひ”′と
ＹＡ”について以上のことが言えるから式（場の計算は
１９回となり、３回減る。Ｄ”　（ｍ−）１　：　ｉ　
）をめて式（場を直接解く場合は、この量は殆んど無視
できるが、前記高速計算法を用いる場合には大きな差と
なる。また、タスクによっては大きな計算量の削減が期
待される。Using this fact, in the case of the example in Figure 6, the above can be said about ``hi'' and YA'', so the formula (field calculations will be 19 times, reduced by 3 times.D'' (m-)1: i
) and solve the equation (when directly solving the field, this amount can be almost ignored, but when using the high-speed calculation method mentioned above, it makes a big difference. Also, depending on the task, a large reduction in the amount of calculation can be expected. Ru.

上記高速計算法の一つに対し、本発明を適用した一実施
例について説明する。An embodiment in which the present invention is applied to one of the above-mentioned high-speed calculation methods will be described.

１８ページ第３図（ｂ）の径路制限条件によれば、Ｄ”（ｍ−１−
１：　ｉ　）は第６図の斜線の内部における径路に沿う
（’　ｐ　Ｊ）−（ｍ、１）から（’ｔｊ）−（’ｙ”
）までの累積距離である。ここで（＠は傾き％、（ηは
傾き２の直線である。いま、径路（＠が（ｉ、１）＝（
１，１）から（ｉ？ｉ）””（’？Ｉ”）までの最小の
累積距離を与えるものとすれば、動的計画法の原理に従
って、（ｉｇ）＝（１ｔ１）から径路（→上の点（’　
９　］　）＝（”ｐ　ｊ’）までの累積距離を最小にす
る径路は、径路（時の（’ｙＮ＝（”、ｊ’）までの径
路と全く一致する。従って、Ｄ”（ｍ−１−１：　ｉ　
）は特にからめ、Ｄｑ　（ｉ）　＝Ｄｇ（Ｉ　Ｐ　Ｊジ
としてめることができる。ただし、漸化式（９）の初期
値は”（−１ｔｊ）−”　（ｊ＝　０，１ｓ”・ｙ”）
Ｄｑ　（０、Ｏ）　＝０１９ベージＤ”（０，ｊ　）＝ｏｏ　（ｊ＝−１，１，２、、、、
、Ｊ”Ｄ：（ｉ、−１）＝ｏＯ（ｎ＝１．２．・・・、
Ｉ）Ｄａ（ｉ、０）−Ｄｑｋ（ｉ−１）（ｑｋはｑ＝Ｕ
（ｑｋ、ｎ）を満たすｑｋ）テアリ、Ｄｑ（’、ｌ）　に対応するバックポインタ門
（ｌ、Ｊ）は式（＠においてＤｑ　（’　ｔ　］　）竺■　のとき　ＢＨ（ｉｔｔ）
−Ｂ、：（ｉ−２ｔｊ−１）Ｄ訂ｉ？＋）＝＠　のとき
　リ（ｉｔｊ）−Ｂｑ（’−’？］−１）Ｄｑ（’、ｊ
）唖　のとき　Ｂη（’ｔＪ）＝Ｂｑ（’−１ｙ＋−２
）となる。捷た、Ｂｑ（’ｔｌ）の初期値はＢ：（ｉ、
１）＝ｔ　−１である。According to the path restriction conditions shown in Figure 3(b) on page 18, D”(m-1-
1: i) is along the path inside the diagonal line in Fig. 6 from ('p J) - (m, 1) to ('tj) - ('y'
) is the cumulative distance. Here, (@ is the slope %, (η is a straight line with slope 2.) Now, the path (@ is (i, 1) = (
1,1) to (i?i)""('?I"), then according to the principles of dynamic programming, from (ig) = (1t1) to the path (→ The upper point ('
9] )=("p j') is exactly the same as the path to ('yN=(",j') at ('yN=(", j'). Therefore, D"(m- 1-1: i
) can be determined as Dq (i) = Dg (I P J di. However, the initial value of recurrence formula (9) is "(-1tj)-" (j = 0,1s" y”)
Dq (0, O) = 0 19 pages D" (0, j ) = oo (j = -1, 1, 2, ,,
, J"D: (i, -1) = oO (n = 1.2...,
I) Da(i, 0)-Dqk(i-1) (qk is q=U
(qk, which satisfies n) The back pointer gate (l, J) corresponding to Dq(', l) is BH(itt) when
-B, :(i-2tj-1) D revision i? +)=@ When li(itj)-Bq('-'?]-1)Dq(', j
) When mute Bη('tJ)=Bq('-1y+-2
). The initial value of Bq('tl) is B:(i,
1)=t −1 .

第６図はｉ　＝　ｉ上における’ｑ　（ｉ　、　ｉ　）
がどのような意味をもっているかを説明している。（８
）　、　（１０）。Figure 6 shows 'q (i, i) on i = i.
explains what it means. (8
), (10).

（１２）なる直線は傾き％、（９）　、（１１）、（１
３）なる直線は傾き（→であって、（ｉｔＴ）−（ｉｔ
Ｔ”）（点２６）へ至る径路は直線（Ｅ）と（ｃ４で挾
まれた領域に含まれ、点０６）を通るマツチング径路は
直線（１０）と（１１）で挾まれた領域に含まれ、点（
１７）を通るマツチング径路は直線（１２）と（１３）
で挾まれる領域に含まれる〇言い換えれば、点１６を通
る径路は単語ｎに対して、始端は１８〜１９０間にあり
、終端は２０〜２１の間にあり、点１７を通る径路は始
端は２２〜２３０間にあり、終端は２４〜２６の間にあ
るということになる。結局、ｉｔｊ上のＤｑ（ｉｔｊ）
ａ直線２８の傾きを％とするとき、フレームｉｘｉ’に
おけるｉ“に対する累積距離Ｄｎ　（ｔ　／／　）　−
Ｄｎ　（ｔ“ｔ　”）に対する途中の累積距離を表して
いることになる。The straight line (12) has a slope of %, (9), (11), (1
3) The straight line has a slope (→, (itT)−(it
The path leading to T'') (point 26) is included in the area between straight lines (E) and (c4), and the matching path passing through point 06 is included in the area between straight lines (10) and (11). , point (
The matching path passing through 17) is a straight line (12) and (13)
In other words, the path passing through point 16 is between 18 and 190, the end is between 20 and 21, and the path passing through point 17 is between 18 and 190, and the path through point 17 is between 20 and 21. is between 22 and 230, and the end is between 24 and 26. After all, Dq on itj (itj)
When the slope of the a straight line 28 is expressed as %, the cumulative distance Dn (t // ) −
This represents the cumulative distance along the way with respect to Dn (t“t”).

（以後”ａ　（’　ｔ　］　）を中間累積距離と呼ぶこ
とにする）また、漸化式（ｃ４から明らかなようにＤｑ
　（’　ｔ　）　）をめるには、フレームｉ−２とｉ−
１における中間累積距離と、フレームｉ−１とｉにおけ
るフレーム間距離のみ既知であればよく、それ以前の値
は忘れてしまっても良い。(Hereinafter, "a (' t ]) will be referred to as the intermediate cumulative distance.) Also, as is clear from the recurrence formula (c4), Dq
(' t )), frame i-2 and i-
Only the intermediate cumulative distance at frame 1 and the interframe distance at frames i-1 and i need to be known, and the values before that may be forgotten.

以上のことをまとめて言えば、Ｄｑ（ｉ）をめるには、
フレームｉ毎にｎ＝１．２．・・・２把。To summarize the above, to calculate Dq(i),
For each frame i, n=1.2. ...2 pieces.

ｎ＝１，２．°’ｔＮｔｑ＝ｑ１ｓｑ２ｔ”’ｙｑｊ、
ｊ−１に対してＤ３　（’　ｔ　］　）をめ、とするとき、２１　ページＱｑ（ｉ）＝ｑｋＮｑ（ｉ　）　＝　ｎとすることができる。このようにすれば、各格子点（’
ｙ５）におけるｄ”（ｉ、ｊ）の計算は単語ｎ毎に１回
で済み、”　（’　ｙ　］　）の計算はｑ、ｎ、Δ（ｑ
ｋｔ　ｎ　）＝ｑなるｑｋについて１回で済み、毎フレ
ーム第６図に示す斜線内部の格子点についてｄｎ（ｉ’
ｔ５Ｌ”　（ｍ＋１　：　ｉ　）を計算しなければなら
力い式（場を直接解く方法に比べて大幅に計算量が減少
する。n=1, 2. °'tNtq=q1sq2t"'yqj,
When D3 (' t ]) is calculated for j-1 and 21 pages Qq(i)=qk Nq(i)=n. In this way, each grid point ('
The calculation of d''(i, j) in y5) only needs to be done once for each word n, and the calculation of ``(' y ]) requires q, n, Δ(q
It only needs to be done once for qk, where kt n )=q, and dn(i'
t5L'' (m+1: i) is required to calculate the equation (the amount of calculation is greatly reduced compared to the method of directly solving the field).

壕だ、この方法は入カバターンのフレームｉ毎に処理を
進めてゆくものであるが、標準パターンのフレームｊ毎
に処理してゆくこともできる。即ち、漸化式（＠から明
らかなように、ｊ方向についでも、Ｄ：（ｉ、ｊ）をめ
るには、フレームｉ−１とフレームｊのフレーム間距離
”（’ｙｊ）とフレームｊ−２とフレームｊ−１の中間
累積距離がわかっていればよいから、フレームｊ毎にｉ
ｔｊ上のＤｑ（’ｓ＋）をめてゆくこともできる。In this method, processing is performed for each frame i of the input pattern, but it is also possible to process each frame j of the standard pattern. That is, as is clear from the recurrence formula (@, in order to calculate D: (i, j) even in the j direction, the inter-frame distance between frame i-1 and frame j "('yj)" and frame j Since it is sufficient to know the intermediate cumulative distance between -2 and frame j-1, i
It is also possible to calculate Dq('s+) on tj.

２２ページ以上の方法は、オートマトン制御クロック同期伝播形Ｄ
Ｐ法（Ｆ３ＡＣＷＤＰ）と呼ばれるものである〇このとき、式（→の考え方を導入するには、漸化式（ｃ
４の計算において、ＤＨ（ｉ、ｊ）の初期値を次のよう
にすればよい。即ち、ｑ＝Δ（ｑｋ、ｎ）を満足するき
、とすればよい。The method on page 22 and above is based on the automaton control clock synchronous propagation type D.
This is called the P method (F3ACWDP). At this time, to introduce the idea of formula (→), we need to use the recurrence formula (c
In the calculation of 4, the initial value of DH (i, j) may be set as follows. That is, when q=Δ(qk, n) is satisfied, the following may be used.

第８図は本発明の一実施例を示すブロック図である。１
ｏＯは音声信号の入力端子である。１０１は特徴抽出部
であって入力音声信号を特徴ベクトルの系列に変換する
。１０２は標準パターン記憶部であって、認識すべき単
語（音節）のそれぞれが同様な特徴ベクトルの系列とし
て記憶されている０１０３はフレーム間距離計算部であって、特徴抽出部１
０１の出力の特徴ベクトルと標準パターン記憶部１０２
の特徴ベクトルとの差を計算する。FIG. 8 is a block diagram showing one embodiment of the present invention. 1
oO is an input terminal for audio signals. A feature extraction unit 101 converts an input audio signal into a series of feature vectors. 102 is a standard pattern storage unit in which each word (syllable) to be recognized is stored as a series of similar feature vectors; 103 is an interframe distance calculation unit;
01 output feature vector and standard pattern storage unit 102
Calculate the difference between the feature vector and the feature vector.

２３ベーフ１０４は計算されたフレーム間距離を一時的に記憶する
フレーム間距離記憶部である。１ｏ５はオートマトン制
御規則記憶部である。１０６は累積距離記憶部であって
、累積距離Ｄｑ　（１）が各フレーム毎に記憶されてい
る。１０７は初期値計算部であって、漸化式（噂の計算
における初期値を式（１０）に従って計算する部分であ
る。１０８は漸化式計算部であって、１０７で計算され
た初期値をもとに、漸化式（１０）を計算する部分であ
る。１０９は最終値決定部であって、漸化式計算部１０
８の結果から各フレームにおける最終値としてを計算す
る部分である。１１３，１１４，１１５はそれぞれ最後
尾単語記憶部、直前状態記憶部、バックポインタ記憶部
であって、それぞれ、最終値決定部１０９でめられた、
最後尾単語Ｎｑ（ｉ）、ｑの直前の状態Ｑ９（ｉ）、バ
ックポインタＢｑ　（ｉ）が各フレームについて記憶さ
れる。１１２は入力音声の最終フレームにおける状態を
決定する部分である。23BEF 104 is an interframe distance storage unit that temporarily stores the calculated interframe distance. 1o5 is an automaton control rule storage section. 106 is a cumulative distance storage unit in which cumulative distance Dq (1) is stored for each frame. Reference numeral 107 denotes an initial value calculation unit, which calculates the initial value in the calculation of rumors according to formula (10). This is a part that calculates the recurrence formula (10) based on
This part calculates the final value for each frame from the results of step 8. Reference numerals 113, 114, and 115 are a last word storage unit, a previous state storage unit, and a back pointer storage unit, respectively, which are determined by the final value determination unit 109.
The last word Nq(i), the state Q9(i) immediately before q, and the back pointer Bq(i) are stored for each frame. 112 is a part that determines the state of the input audio in the final frame.

即ち、ｑｆ　Ｆとするとき最終フレームＩの状態ｑはと決定される。１１ｏは音声区間検出部、１１１はフレ
ーム数計数部である。１１６はバックトレース制御部で
あって、最後尾単語記憶部１１３と、直前状態記憶部１
１４と、バックポインタ記憶部１１６の内容から、第４
図に示すフローチャートに従って逆の順序で、最後尾単
語記憶部１１３から認識結果を出力せしめる。That is, when qf F, the state q of the final frame I is determined as. 11o is a voice section detection section, and 111 is a frame number counting section. 116 is a backtrace control unit, which includes the last word storage unit 113 and the immediately preceding state storage unit 1.
14 and the contents of the back pointer storage unit 116, the fourth
The recognition results are output from the last word storage unit 113 in the reverse order according to the flowchart shown in the figure.

第９図は、以上の実施例の動作をソフトウェアで実現す
る場合の一例を示すチャートである。FIG. 9 is a chart showing an example of the case where the operation of the above embodiment is realized by software.

ステップ２００〜ステツプ２０４は、全体としての初期
化を行う部分、ステップ２０５〜ステツプ２１４はフレ
ームｉにおける処理を行う部分であって、ステップ２０
６〜ステツプ２１２は、各標準パターンｎ　（ｎ＝１　
、２　、・・・、Ｎ）に対して２６ページＤｎ（ｉ）、請求メル部分、ステップ２１３〜ス゛ｑテップ２１４は、入力音声がフレームｉで終端すると仮
定したときの最後尾単語（音節）名、それに対応する累
積距離、バックポインタ、直前の状態を各状態について
める部分である。ステップ２０８はフレーム間距離計算
部１ｏ３、フレーム間距離記憶部１０４の動作に対応す
る。ステップ２１０は初期値計算部１０７の動作に対応
する。Steps 200 to 204 are the parts that initialize the entire process, steps 205 to 214 are the parts that perform processing in frame i, and steps 200 to 214 are the parts that perform processing for frame i.
6 to step 212, each standard pattern n (n=1
, 2 , ..., N), page 26 Dn(i), request part, steps 213 to 214 are the last word (syllable) assuming that the input speech ends at frame i. This is the part that stores the name, the corresponding cumulative distance, the back pointer, and the previous state for each state. Step 208 corresponds to the operations of the interframe distance calculation section 1o3 and the interframe distance storage section 104. Step 210 corresponds to the operation of the initial value calculation section 107.

ステップ２１１〜ステツプ２１２は漸化式計算部１０８
の動作に対応する。ステップ２１３〜ステツプ２１４は
最終値決定部１０９の動作に対応する。ステップ２１５
〜ステツプ２１７は入力音声の最終フレームから逆の順
序で認識結果を決定していく処理であって、最終フレー
ム状態決定部１１８、最後尾単語記憶部１１３、直前状
態記憶部１１４バックポインタ記憶部１１５、バックト
レース制御部１１６の間で行われる動作に対応している
。Steps 211 and 212 are performed by the recurrence formula calculation unit 108.
corresponds to the behavior of Steps 213 and 214 correspond to the operations of the final value determining section 109. Step 215
- Step 217 is a process of determining the recognition result in the reverse order starting from the last frame of the input audio, and includes the final frame state determining section 118, the last word storage section 113, the immediately preceding state storage section 114, and the back pointer storage section 115. , and the backtrace control unit 116.

”ｉ　（ｉｔ　］　）を直接求めて解く方法としては他
にオートマトン制御Ｌｅｖｅ１．　Ｂｕｉｌｄｉｎｇ　
灰（ＦＳＡＬＢ）が２６ページ知られている。これにも、本発明の考え方を導入するこ
とができ、計算量を大幅に減らし得る。Another way to directly obtain and solve ``i (it]) is Automaton Control Level 1. Building
26 pages of Ash (FSALB) are known. The idea of the present invention can also be introduced to this, and the amount of calculation can be significantly reduced.

第１０図は、６桁の数字の棒読み音声例えば、七万三千
二百へ子穴という具合にＯ万○千○百０十○と読む場合
のオ−トマトン制御である。これに対し、従来のＦＳＡ
ＣＷＤＰ　、　Ｆ、５ＡＬＢ　を用いた装置と、本発明
による縮退化ＦＳＡＣＷＤＰと縮退化ＦＳＡＬＢを用い
た装置に関して、計算量、ワークメモリの記憶量を比較
したのが第１表である。この表によれば、第１０図に示
すタスクの場合は、計算量が大幅に減り、記憶量も同等
以下となっており、実用的に効果の大きいものである。FIG. 10 shows the automaton control when reading a six-digit number as 0,000,000, for example, 73,200 to koana. In contrast, traditional FSA
Table 1 compares the calculation amount and work memory storage amount for a device using CWDP, F, 5ALB and a device using degenerate FSACWDP and degenerate FSALB according to the present invention. According to this table, in the case of the task shown in FIG. 10, the amount of calculation is significantly reduced and the amount of memory is also less than the same, so it is practically effective.

第１表２７ベーコナ発明の効果以上述べたように、オートマトンの制御規則をｑ＝Δ（
ｑｋ、ｎ）とするとき、パターンｎに対して、状態ｑｋ
が複数存在するとき、従来は最後尾パターンｎに対する
累積距離を直前の状態ｑｋに対してすべて行っていたの
を、状態ｑとなる直前の状態のうち、同じパターンｎで
連がるものがあるときは、それら直前の状態に至るまで
の累積距離が最小である状態から連がるという事実を利
用することにより、タスクによっては計算量を大幅に減
らすことが可能となったものである。Table 1 27 Effects of Bacona's Invention As mentioned above, the control rule for the automaton can be expressed as q=Δ(
qk, n), then for pattern n, state qk
When there are multiple patterns, conventionally all the cumulative distances for the last pattern n were calculated for the immediately preceding state qk, but if some of the states immediately before becoming state q have the same pattern n, then By taking advantage of the fact that times are continuous from a state with the minimum cumulative distance to the previous state, it has become possible to significantly reduce the amount of calculation for some tasks.

[Brief explanation of the drawing]

第１図〜第３図はＤＰマツチングの基本原理を説明する
図、第４図はオートマトン制御による連続単語音声認識
におけるセグメンテーションおよび認識単語の決定手順
を示すフローチャート、第５図は本発明の詳細な説明す
るオートマトン表現の一例を示す図、第６図、第７図は
本発明の一実施例の原理を説明する図、第８図は本発明
の一実施例を示すブロック図、第９図は同実施例装置の
機能をソフトウェアで実現したときのチャート、第１０
図は同実施例の効果を示す図である。１０１・・・・・・特徴抽出部、１ｏ２・・・・・・標
準パターン記憶部、１ｏ３・・・・・・フレーム間距離
計算部、１０４・・・・・・フレーム間距離記憶部、１
０５・・・・・・オートマトン制御規則記憶部、１０６
・・・・・・累積距離記憶部、１０７・・・・・・初期
値計算部、１０８・・・・・・漸化式計算部、１０９・
・・・・・最終値決定部、１１０・・・・・・音声区間
検出部、１１１・・・・・・フレーム数計数部、１１２
・・・・・・最終フレーム状態決定部、１１３・・・・
・・最後尾単語記憶部、１１４・・・・・・直前状態記
憶部、１１５・・・・・・バックポインタ記憶部、１１
６・・・・・・バックトレース制御部。代理人の氏名　弁理士　中　尾　敏　男　ほか１名第１
図第２図　、α、（ｂ。（Ｃ）　（ｄｌ　（Ｃ）　（ｆ）（７）（ｈ）（２）（ｊ）第３図、ω）　ｔｂ）（Ｃ）　（ｃｂ第４図開ｄシ＝ＩＤ瞥す（シ］ｎ＝Ｎ番（わ８ｇ（ｉｌ−０飼１ｚ８４ｒｉ＋第５図（ｂ）〜ＡｅＯＨＩＪＫｕＸＮＡｃｒＯＹＡ　ＨＵＫＩＪＳＩＭＡＮＡＧＡＮＯＨＩＪＫＬＪＹＡＭハＮＡｔ３ＡＯ’ＫＡ　ＯγハＭＡＳｒ；ＩＬＩＯＫＡ　ＯＫＡＹＡＭＡｓｕＫｕＯＫＡ　ｗＡＫＡＹＡＭハ第６図第９図勢−イ、２．−−−．ｌδｌ−１ｎ−４，２，−−−、ｔＪ　２０’ ｊ−イ、２．−−−．Ｊｌ＋２０２Ｄｇ（Ｏ，ｊｌ−”　、　’Ｊ’−イ５ハ　−”　２０
３Ｂ仁０．夛）〜の　、Ｂε＜−ｒｊ）−ωＤｏ（ｏ）
−ｏ　２０４ご婁４．２．−−−．ｒｎ−イ、２．−−−．〜２θ６１−イ、２．−−−．ＪＱ２０７ｄ’〔２，ｊｌ　芝十めろ　２θ８ φ−イ、２．−−−．ｌ５ｌ−ｔ２０’？’；ｉ、ｍ　
、Ｌｒ＃、、１ＬｒｌＣＤ６Ｃｉ　−１＋１　ｋｒＦＬ
、ｂｌｌ）−ｄ（ｋ、ｎ＋ｋｒｔ’ｐＤε（ε−＋、ｏ
）−Ｄε−（２−１１，１）？ＬＬ−２．Ｄノ＝ＯＯＢ
ｊ！ｔ−ヅ０】！ンーイ　＋　ｖｊ　（＝−ｔ、−１＝
Φｐイ、２．−−−、：Ｉｎ” Ｄｉ　ｒ＆、〆＋＝ｍｔｎ　Ｄｊｒ；−ｆ、７−ＩＭ’
ｆ＃、、）ｌ−Ｄｌ（ｂ−１，ｔ−２）＋ｄ”（ｉ、１
）　−−−−−Ｂ＠＋ｉ−ｔ、ｉ−ｎ　Ｄｉ［Ａ、ｊ）
−■４ｈ！２１２材（し、メ）冨　＋１Ｊｉｔ−イナデ
」　Ｄε（器、ｊ）：■のとき８εｔＢ−ｔ、多−ｉ’
ｌ　Ｄε（シ、３）〜■のとさ６＝１．２、−、　ＩＪ
Ｉイ　２１３ π峠ｒ炉シｒ）　［１）ｊ（ｉ、Ｊ’ｌ］〜シ（１＋目
６特開昭ＧＯ−７３６９８（１０）第１０図Ａ／ｄｓ＝（−ｙｓ、二重７−　、九万Ｌ　ｄｑ＝（＋、二
今ｒ　−−−１九千）ｄ３＝（白、二百、−、九舌）、
ｄ２＝（−ｒ、＝士、−−−、九ｔ）ｄず＝（−１二、
−−−１り１４）Figures 1 to 3 are diagrams explaining the basic principle of DP matching, Figure 4 is a flowchart showing the procedure for segmentation and recognition word determination in continuous word speech recognition by automaton control, and Figure 5 is a detailed explanation of the present invention. A diagram showing an example of an automaton expression to be explained, FIGS. 6 and 7 are diagrams explaining the principle of an embodiment of the present invention, FIG. 8 is a block diagram showing an embodiment of the present invention, and FIG. Chart when the functions of the same embodiment device are realized by software, No. 10
The figure is a diagram showing the effect of the same embodiment. 101...Feature extraction unit, 1o2...Standard pattern storage unit, 1o3...Interframe distance calculation unit, 104...Interframe distance storage unit, 1
05...Automaton control rule storage section, 106
... Cumulative distance storage section, 107 ... Initial value calculation section, 108 ... Recurrence formula calculation section, 109.
...Final value determining section, 110... Voice section detecting section, 111... Frame number counting section, 112
...Final frame state determination section, 113...
... Last word storage section, 114 ... Immediate state storage section, 115 ... Back pointer storage section, 11
6...Backtrace control section. Name of agent: Patent attorney Toshio Nakao and 1 other person No. 1
Figure 2, α, (b. (C) (dl (C) (f) (7) (h) (2) (j) Figure 3, ω) tb) (C) (cb Figure 4 open d Shi=I D glance (shi) n=Nth (wa8g(il-0 1z84ri+ Figure 5(b) ~AeOHIJKuX NAcrOYA HUKIJSIMA NAGANOHIJKLJYAMhaNAt3AO'KA OγHAMA Sr;ILIOKA OKAYAMA suKuOKA wAKAYAM Ha 6th Figure 9: Figure 9 - A, 2.---.lδl-1 n-4,2,---, tJ 20' j-i, 2.---.Jl+202 Dg(O, jl-", 'J'-i5ha-” 20
3B Jin0.夛)～の ,Bε＜−rj)−ωDo(o)
-o 204 Goro 4.2. ---. r n-i, 2. ---. ~2θ6 1-i, 2. ---. JQ207 d' [2, jl Shibajumero 2θ8 φ-i, 2. ---. l5l-t20'? ';i,m
,Lr#,,1LrlCD6Ci -1+1 krFL
, bll)-d(k, n+krt'pDε(ε-+, o
)−Dε−(2−11,1)? LL-2. D no = OOB
j! t-zu0]! nii + vj (=-t, -1=
Φp i, 2. ---, :In" Di r&, 〆+=mtn Djr;-f, 7-IM'
f#,,)l−Dl(b−1,t−2)+d”(i,1
) −−−−−B@+i−t, i−n Di[A, j)
-■4h! 212 material (shi, me) wealth +1 Jit-inade" Dε (vessel, j): 8εtB-t when ■, multi-i'
l Dε (shi, 3) ~■ Nosa 6 = 1.2, -, IJ
I 213 π pass r furnace shi r) [1) j (i, J'l] ~ shi (1 + 6 JP-A-Sho GO-73698 (10) Figure 10 A/ ds = (-ys, double 7 −, 90,000 L dq=(+, 2 ima r ---19,000) d3=(white, 200, −, 9,000),
d2=(-r,=shi,---,9t)dzu=(-12,
---1ri14)

Claims

[Claims] The input signal is defined as a feature vector a 1t &2 /...a l
4. Feature extraction means for converting into a series of feature vectors; bn,...bn...bfn
Standard pattern Rn (where n=1.2...
·, N), and the distance between the feature vectors a1 and b5 in the i-th frame of the input pattern ``(', 1)woj = 1t 2t''' t
"j" = 122 g..., an inter-frame distance calculation means that calculates N, and in the i-th frame of the input pattern %] = 1, 2, -, J"; n = 1.2
,..., Tweet N, assume that the state is q, then calculate the intermediate cumulative distance Dq (it5) from the state q in the i-1st frame of the state qk immediately preceding q in pattern n.
The cumulative distance D9k (i-1) along the path leading to k is determined as the continuation of the minimum value, and the intermediate back pointer Bq (' t j) along the path leading to it is calculated as the cumulative 2
Page distance calculation means and the last pattern n when assuming that it ends in state q in the first frame are calculated by intermediate cumulative distance 1
)n(i, )n) is determined as n that minimizes (i))
When we choose qk that minimizes 'Qq(i)=q
k, final value determining means, last pattern storing means for storing these values for each frame, back pointer storing means, immediately preceding state storing means, and when the input is completed at the last frame I of the input pattern, Letting the cumulative distance Dqf(I) to the state qf allowed as the state be the state q, and from the contents of the back pointer storage means, the previous state storage means, and the last word storage means, Bq(1)=o. up to n=Nq(i). 1=Bq(i), q=09(1) and outputs the recognition results in the reverse order.
A pattern comparison device characterized by the following features: