JPS6313099A

JPS6313099A - Continuous voice recognition equipment

Info

Publication number: JPS6313099A
Application number: JP61157257A
Authority: JP
Inventors: 誠夫亘理
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1986-07-03
Filing date: 1986-07-03
Publication date: 1988-01-20

Abstract

(57) [Abstract] This publication contains application data filed before electronic filing was introduced, so no abstract data is recorded.

Description

DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a continuous speech recognition device, and in particular to an improvement of a continuous speech recognition device that recognizes sentence speech uttered continuously in accordance with a grammar.

(Prior Art) Among speech recognition devices, those that recognize sentence speech uttered in accordance with a grammar can recognize computer programs, restricted business text, and commands for air traffic control or for controlling various kinds of equipment, and therefore have a wide range of applications. It is known in principle that, when grammatical constraints are given, misrecognition can be prevented by exploiting the grammar rules. In continuous digit recognition in particular, when the number of digits in the input speech is restricted, the recognition rate can be improved by expressing that restriction as a rule.

A method for recognizing sentence speech uttered continuously in accordance with such a grammar is described in the specification of Japanese Patent Application No. Sho 56-199098. This method is called the CWDP method, and its principle is roughly as follows. The grammar is expressed as an automaton α, which is defined as follows.

$\alpha = (K, \Sigma, \Delta, p_0, F)$ ... (1)

where
K: the set of states p, $\{p \mid p = 1, 2, \ldots, \pi\}$
Σ: the set of input words n, $\{n \mid n = 1, 2, \ldots, N\}$
Δ: the set of state transition rules $\{(p, q, n)\}$, where (p, q, n) denotes the state transition $p \xrightarrow{n} q$.

$p_0$: the initial state, written p = 0 from here on.

F: the set of final states, $F \subset K$.

Next, the speech pattern A obtained by continuously uttering words $n \in \Sigma$ in accordance with the automaton α is written

$A = a_1, a_2, \ldots, a_i, \ldots, a_I$ ... (2)

and is called the (unknown) input pattern.
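The automaton of formula (1) can be sketched as a small data structure. This is only an illustration: the class name `Automaton`, the method `accepts`, the zero-based state numbering with state 0 as the initial state, and the two-digit example grammar are all assumptions, not part of the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Automaton:
    """Hypothetical container for alpha = (K, Sigma, Delta, p0, F)."""
    num_states: int          # K: states 0 .. num_states - 1, state 0 is the initial state
    num_words: int           # Sigma: words 1 .. num_words
    transitions: frozenset   # Delta: set of (p, q, n), word n moves state p to state q
    final_states: frozenset  # F: subset of the states

    def accepts(self, words):
        """Return True if the word sequence leads from state 0 to a final state."""
        current = {0}
        for n in words:
            current = {q for (p, q, m) in self.transitions
                       if p in current and m == n}
            if not current:
                return False
        return bool(current & self.final_states)


# Assumed example grammar: exactly two digits, each being word 1 or word 2.
two_digits = Automaton(
    num_states=3,
    num_words=2,
    transitions=frozenset({(0, 1, 1), (0, 1, 2), (1, 2, 1), (1, 2, 2)}),
    final_states=frozenset({2}),
)
```

With this grammar, a digit-count restriction of the kind mentioned in the prior art section is expressed purely through the transition set: sequences of any other length never reach a final state.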

For each word $n \in \Sigma$, a standard pattern

$B^n = b_1^n, b_2^n, \ldots, b_j^n, \ldots, b_{J^n}^n$ ... (3)

is prepared and is called a word standard pattern. DP matching is performed between the input pattern A and each continuous speech standard pattern obtained by concatenating word standard patterns in accordance with the automaton α. A quantity expressing how much the two patterns differ (hereinafter called the dissimilarity) is calculated, and the word sequence that gives the minimum dissimilarity is taken as the recognition result.

Here the minimum dissimilarity is obtained by the following dynamic programming procedure. The initial conditions are

$T(0, 0) = 0$
$T(i, q) = \infty, \quad i \neq 0,\ q \neq 0$
$G(p, n, j) = \infty$ ... (4)

For i = 1 to I in turn, the recurrence of formula (6) is calculated, starting from the boundary conditions of formula (5), for every pair (p, n) such that $(p, q, n) \in \Delta$. That is, for the pair (p, n), the boundary conditions are

$G(p, n, 0) = T(i-1, p)$
$H(p, n, 0) = i - 1$ ... (5)

and the recurrence formulas (not reproduced in this text), in which $\hat{j}$ is the value of j' that gives the minimum $G(p, n, j')$ on the right-hand side of formula (6), are calculated from j = 1 to $J^n$ (from the start to the end of the n-th standard pattern). The results g(j) and h(j) are stored in $G(p, n, j)$ and $H(p, n, j)$ respectively. Here d(j) is the distance between the feature vector $a_i$ at input pattern time i and the feature vector $b_j^n$ at time j of the n-th standard pattern; it can be obtained, for example, as a Chebyshev distance.
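The frame distance d(j) can be sketched as follows. The function name `chebyshev_distance` is hypothetical, and the sketch assumes the usual mathematical definition of the Chebyshev distance (the largest absolute coordinate difference); the patent does not spell out the formula.

```python
def chebyshev_distance(a, b):
    """Largest absolute difference between corresponding vector components.

    a: feature vector of the input pattern at time i.
    b: feature vector of a standard pattern at time j.
    """
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same dimension")
    return max(abs(x - y) for x, y in zip(a, b))
```

Any other frame distance with the same signature could be substituted without changing the recurrences that use d(j).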

Next, as the minimization at the word boundary,

if $T(i, q) > G(p, n, J^n)$ ... (9)
then $T(i, q) = G(p, n, J^n)$, $N(i, q) = n$, $P(i, q) = p$, $L(i, q) = H(p, n, J^n)$

is calculated.
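The word-boundary minimization of formula (9) amounts to a compare-and-store step. A minimal sketch follows; the function name and the representation of T, N, P and L as lists of lists indexed `[i][q]` are assumptions made for illustration.

```python
def update_word_boundary(T, N, P, L, i, q, p, n, g_end, h_end):
    """Apply formula (9) for one transition (p, q, n) at input time i.

    g_end: G(p, n, J^n), the dissimilarity at the end of word n.
    h_end: H(p, n, J^n), the input time at which word n started.
    Returns True if the stored boundary value was improved.
    """
    if T[i][q] > g_end:
        T[i][q] = g_end   # best dissimilarity reaching state q at time i
        N[i][q] = n       # word that produced it
        P[i][q] = p       # state the word started from
        L[i][q] = h_end   # input time at which the word started
        return True
    return False
```

Storing N, P and L alongside T is what later allows the word sequence to be traced back from the final state.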

The recognition result for the input pattern is obtained by the following procedure as the decision processing. (Formulas (10) to (12) are not reproduced in this text.) The procedure ends when p = 0.
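Because the decision formulas are not legible in the source, the following is only a plausible sketch of such a traceback, not the patent's own formulas. It assumes tables T, N, P and L indexed `[i][q]`, starts from the final state with the smallest dissimilarity at the last input time I, and follows the stored pointers back to input time 0, which is where the path returns to the initial state p = 0. The function name `backtrace` is hypothetical.

```python
def backtrace(T, N, P, L, final_states, I):
    """Recover the recognised word sequence from the boundary tables."""
    # Pick the final state with the smallest dissimilarity at the end of the input.
    q = min(final_states, key=lambda s: T[I][s])
    i = I
    words = []
    while i > 0:
        words.append(N[i][q])        # word ending at time i in state q
        i, q = L[i][q], P[i][q]      # jump to the start of that word
    return list(reversed(words))
```

The loop relies on every stored start time L(i, q) being strictly smaller than i, which holds when the start time is taken from the boundary condition H(p, n, 0) = i - 1.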

In the method described above, as shown in FIG. 2(a) and formula (6), the word dissimilarity g(j) is a value proportional to the input pattern time length i. FIG. 2 is an explanatory diagram showing DP matching paths. In other words, exactly one feature vector distance d(j) is added for each input pattern time.

In continuous speech recognition, the reason for making the word dissimilarity proportional to the time length of the input pattern is as follows. If, for example, a DP path such as that in FIG. 2(b) is used, the word dissimilarity g(i, j) can be obtained with the recurrence of formula (13) (not reproduced in this text), in which two of the three path weights W are 1 and the third is 2. This word dissimilarity is proportional to the sum of the input pattern time length and the standard pattern time length, that is, to (i + j). In this case the dissimilarity becomes smaller as the standard pattern time length becomes shorter. The shorter the concatenated standard pattern, the more it is favored, so word deletions become likely.
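The length bias described above can be checked with a small experiment. The sketch below assumes the common symmetric form of the recurrence (weight 1 for the horizontal and vertical steps, weight 2 for the diagonal step); the exact formula is not legible in the source, and the function name `symmetric_dp` is hypothetical.

```python
def symmetric_dp(dist):
    """Accumulated dissimilarity of the best symmetric DP path.

    dist[i][j] is the frame distance between input frame i and standard frame j.
    Assumed weights: horizontal 1, diagonal 2, vertical 1.
    """
    I, J = len(dist), len(dist[0])
    inf = float("inf")
    g = [[inf] * (J + 1) for _ in range(I + 1)]
    g[0][0] = 0.0
    for i in range(1, I + 1):
        for j in range(1, J + 1):
            d = dist[i - 1][j - 1]
            g[i][j] = min(g[i - 1][j] + d,          # horizontal step
                          g[i - 1][j - 1] + 2 * d,  # diagonal step
                          g[i][j - 1] + d)          # vertical step
    return g[I][J]


# With the same frame distance of 1 everywhere, the total depends only on I + J.
short = symmetric_dp([[1.0] * 8 for _ in range(10)])   # 10 input frames, 8 standard frames
long_ = symmetric_dp([[1.0] * 12 for _ in range(10)])  # 10 input frames, 12 standard frames
```

Under these assumptions `short` is 18 and `long_` is 22: for an equally poor frame-by-frame match, the shorter concatenation of standard patterns receives the smaller dissimilarity, which is the bias that leads to word deletions.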

For this reason, continuous speech recognition requires a recurrence such as formula (6), which does not depend on the time length of the standard pattern.

(Problems to Be Solved by the Invention) However, a recurrence that is proportional to the input pattern time length cannot take a step perpendicular to the input pattern time axis, as shown in FIG. 2(b). The degree to which the input pattern can be stretched or compressed is therefore limited.

That is, with the recurrence of formula (6), the input pattern can be compressed only to 1/2 of the standard pattern.

Consequently, for patterns in which strong stretching or compression occurs locally, the time axes of the input pattern and the standard pattern cannot be aligned accurately, and this causes misrecognition. To increase the degree of stretching and compression while still using a recurrence proportional to the input pattern time length, DP paths such as those in FIG. 2(c) and (d) must be used, which are obtained with the recurrence of formula (14) or formula (15) (neither is reproduced in this text). However, formulas (14) and (15) have the drawback of requiring more computation than formula (6).

An object of the present invention is to eliminate the drawbacks described above and to provide a continuous speech recognition device with a high recognition rate and a small amount of computation.

(Means for Solving the Problems) The continuous speech recognition device according to the present invention comprises: a DP matching unit that, in each state p of a finite state automaton, takes as the word boundary value a dissimilarity T(i, p) proportional to the time length of the input pattern, and obtains the word dissimilarity between the input pattern and a standard pattern in a form that is not proportional to the time length of the input pattern; a dissimilarity conversion unit that converts the word dissimilarity obtained by the DP matching unit into a dissimilarity proportional to the input pattern time length; and a minimum dissimilarity calculation unit that, from the converted dissimilarity obtained by the dissimilarity conversion unit, obtains the minimum dissimilarity in state q and sets it as the word boundary value T(i, q).

(Operation) Next, the operation of the present invention will be described. In the present invention, the recurrence used to calculate the DP matching between the input pattern and the standard pattern is of a form that is not proportional to the input pattern time length.

For example, formula (13) is used in place of formula (6) used in the conventional method. It has the DP path shown in FIG. 2(b). Rewriting the recurrence of formula (13) in the form of the CWDP method, in which the calculation is carried out for each frame of the input pattern, gives formula (16) (not reproduced in this text) and

$h(j) = H(p, n, j)$ or $H(p, n, j-1)$ or $h(j-1)$ ... (17)

where the term on the right-hand side of formula (17) is selected to correspond to the minimum selected in formula (16).
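One frame of this word-internal calculation might look as follows. Formula (16) is not legible in the source, so the sketch assumes the symmetric weights that match the three candidates listed in formula (17): weight 1 for the step from G(p, n, j), weight 2 for the step from G(p, n, j-1), and weight 1 for the step from g(j-1). The function name `frame_update` and the list layout (index 0 holds the boundary condition) are also assumptions.

```python
INF = float("inf")


def frame_update(G_prev, H_prev, d, T_prev, i):
    """Process one input frame i for one pair (p, n).

    G_prev[j], H_prev[j]: values stored after the previous input frame, j = 1..J.
    d[j]: distance between input frame i and standard frame j (d[0] is unused).
    T_prev: word boundary value T(i-1, p).
    Returns the new lists g and h; g[J] is the word-end dissimilarity.
    """
    J = len(d) - 1
    G, H = list(G_prev), list(H_prev)
    G[0], H[0] = T_prev, i - 1                     # boundary condition (5)
    g, h = [INF] * (J + 1), [0] * (J + 1)
    for j in range(1, J + 1):
        candidates = [
            (G[j] + d[j], H[j]),                   # horizontal: same j, previous frame
            (G[j - 1] + 2 * d[j], H[j - 1]),       # diagonal: j-1, previous frame
            (g[j - 1] + d[j], h[j - 1]),           # vertical: j-1, current frame
        ]
        g[j], h[j] = min(candidates, key=lambda c: c[0])
    return g, h
```

The vertical candidate uses a value computed earlier in the same frame, which is why j must run upward from 1; it is also the step that a recurrence proportional to the input time length cannot take.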

The DP matching calculation using formulas (16) and (17) proceeds as follows.

With formula (4) as the initial conditions, the recurrence of formula (16) is calculated for i = 1 to I in turn, starting from the boundary conditions of formula (5), for every pair (p, n) such that $(p, q, n) \in \Delta$.

For each pair (p, n) and each i, formulas (16) and (17) are calculated from j = 1 to $J^n$ under the boundary conditions of formula (5), and the results g(j) and h(j) are stored in $G(p, n, j)$ and $H(p, n, j)$ respectively.

Next, the dissimilarity at the word boundary is converted, in accordance with formula (18), into a dissimilarity proportional to the input pattern time length.

$G' = G(p, n, J^n) \times \dfrac{i}{i + J^n}$ ... (18)

Then, as the minimization at the word boundary,

if $T(i, q) > G'$ ... (19)
then $T(i, q) = G'$, $N(i, q) = n$, $P(i, q) = p$, $L(i, q) = H(p, n, J^n)$

is calculated.
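The conversion and the subsequent minimization can be sketched together. The sketch assumes that the length term in the denominator of the conversion is the frame count of the standard pattern of word n (passed as `ref_len`), and that T, N, P and L are lists of lists indexed `[i][q]`; the function name is hypothetical.

```python
def convert_and_minimise(T, N, P, L, i, q, p, n, g_end, h_end, ref_len):
    """Convert the word-end dissimilarity, then update the word boundary.

    g_end: word-end value G(p, n, J^n), roughly proportional to i + ref_len.
    h_end: H(p, n, J^n), the input time at which the word started.
    Returns the converted value G'.
    """
    g_conv = g_end * i / (i + ref_len)   # rescale to be proportional to i
    if T[i][q] > g_conv:                 # keep the smallest converted value
        T[i][q] = g_conv
        N[i][q] = n
        P[i][q] = p
        L[i][q] = h_end
    return g_conv
```

For instance, a word-end value of 18 reached at input time 10 with an 8-frame standard pattern is converted to 18 × 10 / 18 = 10, that is, the same average cost spread over the 10 input frames only.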

The recognition result for the input pattern is obtained by the decision processing of formulas (10), (11) and (12).

In the method above, the word dissimilarity is not proportional to the input pattern time length, but at each word boundary it is converted into a value proportional to the input pattern time length before the overall dissimilarity is obtained. The dissimilarity between the input pattern and the concatenated standard patterns can therefore be obtained just as in the conventional method. In addition, since a recurrence such as formula (16) can be used for the dissimilarity within a word, the degree of stretching and compression in the time-axis alignment between the input pattern and the standard pattern can be increased.

(Embodiment) Next, the present invention will be described in detail with reference to the drawings. FIG. 1 is a block diagram showing one embodiment of the present invention, FIG. 3 is a time chart showing the timing of operations in the embodiment of FIG. 1, and FIGS. 4(a) to 4(d) are flowcharts showing the flow of operations in the embodiment of FIG. 1.

The standard pattern memory 130 stores the standard pattern $B^n$ of each word n in the word set Σ, and the automaton memory 230 stores the state transition rules (p, q, n) and information designating the final states F.

When unknown input speech is input from the microphone 100, the input unit 110 performs frequency analysis, converts the speech into feature vectors $a_i$, and sends them one after another to the input pattern memory 120. The input unit 110 also has the function of determining the speech interval by detecting the speech level, and generates a speech interval signal SP that is "1" during the speech interval and "0" otherwise. At the rising edge of the speech interval signal SP, the control unit 240 generates an initialization pulse SET1 (FIG. 3). This initializes the memory 200 as specified by formula (4) and block 10 of FIG. 4(a).
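The speech interval signal SP and the moment at which the initialization pulse fires can be illustrated in software. This is a simplified sketch: the fixed threshold, the per-frame level values and both function names are assumptions, since the patent only states that the interval is determined from the speech level.

```python
def speech_interval_signal(levels, threshold):
    """Return SP per frame: 1 while the level is at or above the threshold, else 0."""
    return [1 if level >= threshold else 0 for level in levels]


def rising_edges(sp):
    """Frame indices where SP changes from 0 to 1 (where SET1 would be issued)."""
    return [t for t in range(len(sp))
            if sp[t] == 1 and (t == 0 or sp[t - 1] == 0)]


sp = speech_interval_signal([0.1, 0.2, 0.9, 0.8, 0.1], threshold=0.5)
starts = rising_edges(sp)
```

For the example levels, `sp` is `[0, 0, 1, 1, 0]` and `starts` is `[2]`, so the tables would be initialized once, at the third frame.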

The dissimilarity is calculated along the input pattern time i.

At each time i, a state transition rule r(p, n, q) is read from the automaton memory 230.

Next, in response to the signal SET2 from the control unit 240, the boundary conditions corresponding to block 11 of FIG. 4(a) and formula (5) are set.

The standard pattern time signal j then changes from 1 to $J^n$, and the DP matching unit 310 calculates the recurrence corresponding to block 12 of FIG. 4(b) and formulas (16) and (17).

When the standard pattern time reaches $J^n$, the work memory is updated as in block 13 of FIG. 4(b). Next, the dissimilarity conversion unit 350 performs the dissimilarity conversion corresponding to block 14 of FIG. 4(b) and formula (18). Finally, the word boundary minimization corresponding to block 15 of FIG. 4(b) and formula (19) is performed using the comparison circuit 170.

The above processing is carried out for all state transition rules r, and further for input pattern times i from 1 to I. After that, the end determination unit 400 obtains the minimum dissimilarity at the end point, corresponding to block 16 of FIG. 4(c) and formula (10). Then the result determination unit 220 performs the decision processing for the recognition result, corresponding to block 17 of FIG. 4(d) and formulas (11) and (12).
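The whole flow of the embodiment can be put together in a compact software sketch. It is not the patent's circuit: the symmetric weights (1, 2, 1) in the word-internal recurrence, the use of the standard pattern length in the conversion, the traceback loop, and all names (`recognise`, `frames`, `refs`, `dist`) are assumptions made to obtain something runnable.

```python
import math

INF = math.inf


def recognise(frames, refs, transitions, final_states, num_states, dist):
    """Return the word sequence with the smallest converted dissimilarity.

    frames: input feature vectors; refs: word -> list of standard pattern frames;
    transitions: list of (p, q, n); state 0 is the initial state.
    """
    I = len(frames)
    T = [[INF] * num_states for _ in range(I + 1)]
    N = [[None] * num_states for _ in range(I + 1)]
    P = [[None] * num_states for _ in range(I + 1)]
    L = [[None] * num_states for _ in range(I + 1)]
    T[0][0] = 0.0                                    # initial condition
    G = {(p, n): [INF] * (len(refs[n]) + 1) for (p, q, n) in transitions}
    H = {(p, n): [0] * (len(refs[n]) + 1) for (p, q, n) in transitions}
    for i in range(1, I + 1):
        done = {}                                    # (p, n) -> (converted value, word start)
        for (p, q, n) in transitions:
            if (p, n) not in done:
                B, J = refs[n], len(refs[n])
                Gp, Hp = G[(p, n)], H[(p, n)]
                Gp[0], Hp[0] = T[i - 1][p], i - 1    # boundary condition
                g, h = [INF] * (J + 1), [0] * (J + 1)
                for j in range(1, J + 1):            # word-internal recurrence
                    d = dist(frames[i - 1], B[j - 1])
                    g[j], h[j] = min((Gp[j] + d, Hp[j]),
                                     (Gp[j - 1] + 2 * d, Hp[j - 1]),
                                     (g[j - 1] + d, h[j - 1]),
                                     key=lambda c: c[0])
                G[(p, n)], H[(p, n)] = g, h          # work memory update
                done[(p, n)] = (g[J] * i / (i + J), h[J])   # dissimilarity conversion
            g_conv, start = done[(p, n)]
            if T[i][q] > g_conv:                     # word boundary minimization
                T[i][q], N[i][q], P[i][q], L[i][q] = g_conv, n, p, start
    q = min(final_states, key=lambda s: T[I][s])     # best final state at the end point
    if T[I][q] == INF:
        return []
    words, i = [], I
    while i > 0:                                     # trace the word boundaries back
        words.append(N[i][q])
        i, q = L[i][q], P[i][q]
    return words[::-1]
```

A toy call with scalar "feature vectors" shows the intended use: with `refs = {1: [0, 0], 2: [5, 5]}`, the two-word grammar `[(0, 1, 1), (0, 1, 2), (1, 2, 1), (1, 2, 2)]`, final state 2 and `dist = lambda a, b: abs(a - b)`, the input `[0, 0, 5, 5]` is recognised as `[1, 2]`.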

The present invention has been described above on the basis of an embodiment, but this description does not limit the scope of the invention. Various recurrences other than formula (16) are conceivable for use in the present invention.

For example, the weights may be set to $W_1 = W_2 = W_3 = 1$.

In this case the diagonal direction is somewhat favored, but the calculation becomes simpler.

This embodiment is based on the CWDP method described in Japanese Patent Application No. Sho 56-199098, but the principle of the present invention can also be applied to various similar automaton-controlled DP matching methods (for example, Japanese Patent Application Nos. Sho 54-104669 and Sho 61-031179).

(Effects of the Invention) As described above, according to the present invention, a continuous speech recognition device can increase the degree of stretching and compression in the time-axis alignment between the input pattern and the standard pattern without complicating the DP matching recurrence. This has the effect of markedly raising the recognition rate while also greatly reducing the amount of computation.

[Brief explanation of drawings]

FIG. 1 is a block diagram showing one embodiment of the present invention, FIG. 2 is an explanatory diagram showing DP matching paths, FIG. 3 is a time chart showing the timing of operations in the embodiment of FIG. 1, and FIGS. 4(a) to 4(d) are flowcharts showing the flow of operations in the embodiment of FIG. 1.

110: input unit
120: input pattern memory
130: standard pattern memory
170: comparison circuit
180 to 210: memories
220: result determination unit
230: automaton memory
240: control unit
310: DP matching unit
320, 330: work memories
350: dissimilarity conversion unit
400: end determination unit

Claims

[Claims] A continuous speech recognition device that performs DP (dynamic programming) matching between continuously uttered speech and concatenations of standard patterns specified by a finite state automaton, and recognizes the sequence of standard patterns that gives the minimum dissimilarity, the device comprising: a DP matching unit that, in each state p of the finite state automaton, takes as the word boundary value a dissimilarity T(i, p) proportional to the time length of the input pattern, and obtains the word dissimilarity between the input pattern and a standard pattern in a form that is not proportional to the time length of the input pattern; a dissimilarity conversion unit that converts the word dissimilarity obtained by the DP matching unit into a dissimilarity proportional to the input pattern time length; and a minimum dissimilarity calculation unit that, from the converted dissimilarity obtained by the dissimilarity conversion unit, obtains the minimum dissimilarity in state q and sets it as the word boundary value T(i, q).