JPH0355836B2

JPH0355836B2 -

Info

Publication number: JPH0355836B2
Application number: JP61002940A
Authority: JP
Priority date: 1986-01-10
Filing date: 1986-01-10
Publication date: 1991-08-26
Also published as: JPS62161200A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明の連続音声認識装置に関し、特に文法に
従つて連続発声された文音声を高速認識する高認
識率の連続音声認識装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Industrial Field of Application] The present invention relates to a continuous speech recognition device, and particularly to a continuous speech recognition device with a high recognition rate that rapidly recognizes sentence speech continuously uttered according to grammar.

[Conventional technology]

音声認識装置の中でも文法に従つて発声された
文音声を認識する装置は、計算機プログラムや限
定業務用文章あるいは航空管制や各種機器の制御
用指令などの認識ができ広範囲な応用分野を有し
ている。文法の拘束が与えられている場合には、
その文法規則を利用することによつて誤認識を防
止できることが原理的に知られている。特に連続
数字認識において入力音声に桁数の制約がある場
合、その制約を規則化することにより認識率を改
善することができる。 Among voice recognition devices, devices that recognize sentence sounds uttered according to grammar can recognize computer programs, limited business texts, and commands for air traffic control and control of various equipment, and have a wide range of applications. There is. Given grammatical constraints,
It is known in principle that misrecognition can be prevented by using the grammar rules. In particular, when there is a restriction on the number of digits in input speech in continuous number recognition, the recognition rate can be improved by regularizing the restriction.

このような文法に従つて連続に発声された文音
声を認識する手法が同一出願人による特願昭58−
239303号明細書「連続音声認識装置」に記載され
ている。この原理であるブロツクワイズDPマツ
チング法は大略次のようである。文法をオートマ
ンαで表現し、そのオートマンαを次のように定
義する。 A method for recognizing sentence sounds continuously uttered according to such a grammar was proposed in a patent application filed in 1983 by the same applicant.
It is described in the specification 239303 "Continuous speech recognition device". The principle of this blockwise DP matching method is roughly as follows. The grammar is expressed by an automan α, and the automan α is defined as follows.

α＝＜Ｋ、Σ、Δ，P₀、Ｆ＞ ……(1) ここで、Ｋ：状態ｐの集合｛ｐ｜ｐ＝１、２、
…π｝ Σ：入力単語ｎの集合｛ｎ｜ｎ＝１、２、…Ｎ｝ Δ：状態遷移規則｛（ｐ、ｑ、ｎ）｝ここで、（ｐ、ｑ、ｎ）はｐｎ ―→ ｑなる状態
遷移を意味する。 α=<K, Σ, Δ, P ₀ , F> ...(1) Here, K: set of states p {p|p=1, 2,
...π} Σ: Set of input words n {n|n=1, 2, ...N} Δ: State transition rule {(p, q, n)} Here, (p, q, n) is pn -→ It means q state transition.

P₀：初期状態、以後はｐ＝０で示す。P ₀ : Initial state, hereinafter indicated as p=0.

Ｆ：最終状態集合Ｆｎ ―→ Ｋ次に前記オートマトンαに従つて単語ｎｎ ―→ Σ
を連続して発声して得られる音声パタンＡをＡ＝a₁、a₂、…a_i、a_I …(2) で示し、これを（未知）入力パタンと呼ぶ。各単
語ｎｎ ―→ Σに対して標準的なパタン Bⁿ＝b₁ ⁿ、b₂ ⁿ、…b_j ⁿ、…bⁿ _Jo …(3) を用意し、これを単語標準パタンと呼ぶ。この単
語標準パタンBⁿをオートマンαに従つて接続す
ることによつて得られる連続音声標準パタンＣ＝
Bⁿ¹、Bⁿ²、…B^nxと入力パタンＡとのDPマツチン
グを行い、２つのパタンの相互に異なる度合を表
わす量（以下相異度と称する）を算出し、最小の
相異度を与える単語系列を認識結果とする。F: Final state set Fn -→ K Next, according to the automaton α, word nn -→ Σ
A voice pattern A obtained by continuously uttering is expressed as A=a ₁ , a ₂ , ...a _i , a _I ...(2), and is called an (unknown) input pattern. A standard pattern B ⁿ =b ₁ ⁿ , b ₂ ⁿ , ... b _j ⁿ , ... b ⁿ _Jo (3) is prepared for each word nn -→ Σ, and this is called a word standard pattern. Continuous speech standard pattern C= obtained by connecting this word standard pattern B ⁿ according to Automan α
Perform DP matching between B ⁿ¹ , B ⁿ² , ... B ^nx and the input pattern A, calculate the amount representing the degree of mutual difference between the two patterns (hereinafter referred to as the degree of dissimilarity), and give the minimum degree of dissimilarity. The word sequence is the recognition result.

ここで最小の相異度を次のような動的計画の手
法で求める。初期条件をＴ(0、0)＝０Ｔ(i、q)＝∞、ｉ≠０、ｑ＝０ G₁(p、n、j)＝∞、G₂(p、n、j)＝∞ …(4) とし、ｂ＝１よりＩ／BL（ここでＩ／BLは説明
の簡単のため割り切れるものとする）まで順次
(5)、(6)式の境界条件を基に(7)、(8)式の漸化式を
（ｐ、ｑ、ｎ）ｎ ―→ Δなるすべての対（ｐ、ｎ）
について計算する。すなわち状態ｐとの境界条件
をｉ＝：_s、…i_eについてｇ(i-1、0)＝Ｔ(i-1、p) ｈ(i-1、0)＝ｉ−１ …(5) ただしi_s＝(b-1)・BL＋１、i_e＝ｂ・BL とし、ブロツクｂ−１との境界条件をｉ＝１、
…、Jⁿなる各標準パタン時刻ｊについてｇ（i_s−１、ｊ）＝G₁(p、n、j) ｈ（i_s−１、ｊ）＝H₁(p、n、j) ｇ（i_s−２、ｊ）＝G₂(p、n、j) ｈ（i_s−２、ｊ）＝H₂(p、n、j) …(6) とし、漸化式ｇ（ｉ、ｊ）＝ｄ（ｉ、ｊ）＋minｄ（ｉ−
１、ｊ）＋ｇ（ｉ−２、ｊ−１）ｇ（ｉ−１、ｊ−１）ｇ（ｉ−１、ｊ−２） …(7) ｈ（ｉ、ｊ）＝ｈ（i^、j^） …(8) ただし（（i^、j^）は(7)式の右辺における最小の
ｇ（ｉ、ｊ）を与える（ｉ、ｊ）である。 Here, the minimum degree of dissimilarity is found using the following dynamic programming method. Initial conditions are T(0, 0)=0 T(i, q)=∞, i≠0, q=0 G ₁ (p, n, j)=∞, G ₂ (p, n, j)=∞ ...(4), and sequentially from b=1 to I/BL (Here, I/BL is assumed to be divisible for ease of explanation)
Based on the boundary conditions of equations (5) and (6), the recurrence equations of equations (7) and (8) are expressed as (p, q, n) for all pairs (p, n) such that n -→ Δ
Calculate about. In other words, the boundary condition with state p is i =: _s ,...i For _e , g(i-1, 0) = T(i-1, p) h(i-1, 0) = i-1...(5) However, i _s = (b-1)・BL+1, i _e =b・BL, and the boundary conditions with block b-1 are i=1,
..., J ⁿ for each standard pattern time j g(i _s -1, j) = G ₁ (p, n, j) h (i _s -1, j) = H ₁ (p, n, j) g (i _s −2, j)=G ₂ (p, n, j) h(i _s −2, j)=H ₂ (p, n, j) …(6), and the recurrence formula g(i, j)=d(i,j)+mind(i-
1, j) + g (i-2, j-1) g (i-1, j-1) g (i-1, j-2) ...(7) h (i, j) = h (i^, j^) ...(8) However, ((i^, j^) is (i, j) that gives the minimum g(i, j) on the right side of equation (7).

を入力パタン時刻ｉ＝i_sよりi_eまで計算する。こ
こでｄ（ｉ、ｊ）は入力パタンの時刻ｉにおける
特徴ベクトルa_iと標準パタンの時刻ｊにおける特
徴ベクトルbⁿ _jとの間の距離である。is calculated from input pattern time i=i _s to i _e . Here, d(i, j) is the distance between the feature vector a _i of the input pattern at time i and the feature vector b ⁿ _j of the standard pattern at time j.

ｄ（ｉ、ｊ）＝Dis（a_i、bⁿ _j） …(9) 次のブロツクの計算のため前記計算結果の境界
値を格納する。 d(i, j)=Dis(a _i , b ⁿ _j ) (9) Store the boundary value of the calculation result for the calculation of the next block.

G₁(p、n、j)＝ｇ（i_e、ｊ） H₁(p、n、j)＝Ｈ（i_e、ｊ） G₂(p、n、j)＝ｇ（i_e、１、ｊ） H₂(p、n、j)＝ｈ（i_e、１、ｊ） …(10) (6)、(7)、(8)、(10)式の計算が標準パタン時刻ｉ＝
Jⁿまで終了した後、単語境界における最小化とし
てｉ＝i_s、…、i_eについて if Ｔ(i、q)＞ｇ（ｉ、jⁿ） then Ｔ(i、q)＝ｇ（ｉ、Jⁿ）Ｎ(i、q)＝ｎＰ(i、q)＝ｐＬ(I、q)＝ｈ（ｉ、Jⁿ …(11) を計算する。 G ₁ (p, n, j) = g (i _e , j) H ₁ (p, n, j) = H (i _e , j) G ₂ (p, n, j) = g (i _e , 1 , j) H ₂ (p, n, j) = h (i _e , 1, j) ...(10) (6), (7), (8), (10) are calculated at standard pattern time i =
After finishing up to J ⁿ , if T( _i , q)>g(i, j _n ⁾ then T(i, q)=g(i, J ⁿ ) N(i, q)=n P(i, q)=p L(I, q)=h(i, J ⁿ ...(11) Calculate.

以上述べた(7)、(8)式は漸化式計算は、第２図
（説明図）に示すように入力パタンのBLフレーム
分をブロツ化してブロツクごとに実行している。 The recurrence formula calculations for equations (7) and (8) described above are performed for each block by converting the BL frame portion of the input pattern into blocks, as shown in FIG. 2 (explanatory diagram).

最後に、入力パタンの認識結果は判定処理とし
て次のような手続きにより求められる。 Finally, the recognition result of the input pattern is determined by the following procedure as a determination process.

初期条件 q^＝argmin〔Ｔ（Ｉ、ｑ）〕…（12）ｑｎ ―→ Ｆｑ＝q^、ｍ＝１ …（13）認識単語 n^＝Ｎ(m、q) 単語始点 l^＝Ｌ(m、q) 状態遷移 q^＝Ｐ(m、q) …(14) を求める。Initial condition q^=argmin [T(I, q)]…(12) qn ―→ F q=q^, m=1...(13) Recognized word n^=N(m, q) Word starting point l^=L(m, q) State transition q^=P(m, q) …(14) seek.

もしl^＞０ならばｑ＝q^、ｍ＝l^として（14）式
を繰り返す。l^＝０ならば終了。 If l^ > 0, repeat equation (14) with q = q^ and m = l^. If l^=0, end.

以上説明した方法の漸化式(7)は第３図ａに示す
径路に沿つて計算される。このため、入力パタン
と標準パタンの時間軸の対応は1/2から２倍まで
の伸縮に制限されている。このことを「傾斜制
限」と呼んでおり、局所的に不自然な時間軸の対
応を排除している。この傾斜制限を用いることに
より認識率を高くすることが可能である。 The recurrence formula (7) of the method described above is calculated along the path shown in FIG. 3a. For this reason, the correspondence between the input pattern and the standard pattern on the time axis is limited to expansion/contraction from 1/2 to 2 times. This is called "tilt restriction" and eliminates locally unnatural time axis correspondences. By using this slope restriction, it is possible to increase the recognition rate.

しかしながら漸化式(7)を使用する場合はワーク
メモリG₁（ｐ、ｎ、ｊ）、H₁（ｐ、ｎ、ｊ）、G₂
（ｐ、ｎ、ｊ）、H₂（ｐ、ｎ、ｊ）が必要である。
さらに演算部とワークメモリ間のデータ転送
（(6)、(10)式に相当）が必要である。すなわち、漸
化式(7)を使用する場合はワークメモリが大きくな
り、かつデータ転送が多くなる欠点がある。 However, when using recurrence formula (7), the work memories G ₁ (p, n, j), H ₁ (p, n, j), G ₂
(p, n, j), H ₂ (p, n, j) are required.
Furthermore, data transfer (corresponding to equations (6) and (10)) between the calculation unit and the work memory is required. That is, when using the recurrence formula (7), there is a drawback that the work memory becomes large and data transfer increases.

一方、ブロツクワイズDPマチツング法におい
て傾斜制限のない漸化式を使用することが可能で
ある。この場合第３図ｂに示す径路に沿つて計算
される。このDPマツチングの計算の手順は前記
(4)、〜（14）式と同様に行われるが、初期条件と
して(4)式の代わりに（15）式を用い、Ｔ(0、0)＝０Ｔ(i、q)＝∞、ｉ≠０、ｑ≠０Ｇ(p、n、j)＝∞ …(15) ブロツクｂ−１との境界条件として(6)式の代わ
りに（16）式を用い、ｇ（i_s−１、ｊ）＝Ｇ(p、n、j) ｈ（i_s−１、ｊ）＝Ｈ(p、n、j) …(16) 漸化式として(7)式の代わりに（17）式を用い、ｇ(i、j)＝ｄ(i、j)＋minｇ(i-1、j) ｇ(i-1、i-1) ｇ(i-1、j-2) …(17) (10)式の代わりに（18）式を用いる。 On the other hand, it is possible to use a recurrence formula without slope restrictions in the blockwise DP matching method. In this case, the calculation is performed along the path shown in FIG. 3b. The calculation procedure for this DP matching is described above.
It is performed in the same way as equations (4) to (14), but using equation (15) instead of equation (4) as the initial condition, T(0, 0) = 0 T(i, q) = ∞, i≠0, q≠0 G(p, n, j)=∞ …(15) Using equation (16) instead of equation (6) as the boundary condition with block b-1, g(i _s −1 , j)=G(p, n, j) h(i _s −1, j)=H(p, n, j) …(16) As the recurrence equation, use equation (17) instead of equation (7). Use, g(i, j) = d(i, j) + ming(i-1, j) g(i-1, i-1) g(i-1, j-2) …(17) (10) Use equation (18) instead of Eq.

Ｇ(p、n、j)＝ｇ（i_e、ｊ）Ｈ(p、n、j)＝ｈ（i_e、ｊ） …(18) 以上説明した傾斜制限なしの漸化式（17）を用
いる方法では傾斜制御ありの漸化式(7)を用いる方
法に比較して、ワークメモリが1/2に減少でき、
さらにデータ転送も1/2に減少できる。 G(p, n, j) = g(i _e , j) H(p, n, j) = h(i _e , j) ...(18) The recurrence formula (17) without slope restriction explained above is The method used can reduce the work memory by half compared to the method using recurrence formula (7) with slope control.
Furthermore, data transfer can be reduced to 1/2.

しかしながら、傾斜制限がないため入力パタン
と標準パタンの時間軸の対応が不自然になること
が許されており、誤認識の原因となつている。 However, since there is no slope limit, the correspondence between the time axes of the input pattern and the standard pattern is allowed to become unnatural, causing erroneous recognition.

[Problem that the invention seeks to solve]

上述した従来の連続音声認識装置では、傾斜制
限ありの漸化式を用いる場合はワークメモリが大
きくかつデータ転送が多いという欠点がある。一
方、傾斜制限なしの漸化式を用いる場合は誤認識
が多いという欠点がある。 The above-mentioned conventional continuous speech recognition apparatus has the disadvantage that when using a recurrence formula with slope restriction, the work memory is large and data transfer is frequent. On the other hand, when using a recurrence formula without slope restrictions, there is a drawback that there are many erroneous recognitions.

本発明の目的は前記欠点を解消し、ワークメモ
リが小さくかつデータ転送が少なく、また認識率
の高い連続音声認識装置を提供することにある。 SUMMARY OF THE INVENTION It is an object of the present invention to provide a continuous speech recognition device which eliminates the above-mentioned drawbacks, has a small work memory, requires little data transfer, and has a high recognition rate.

[Means for solving problems]

本発明の連続音声認識装置は入力パタンを格納
する入力パタンメモリ部と、標準パタンを格納す
る標準パタンメモリ部と、単語ｎの入力によつて
状態遷移ｐ→ｑが生じることを意味する規則
（ｐ、ｑ、ｎ）群である状態遷移テーブルΔと最
終状態群Ｆとを記憶するオートマトン記憶部と、
状態ｐと単語ｎにより指定された状態遷移におい
て入力パタン時刻ｉと標準パタン時刻ｊで定めら
れる領域内で入力パタンBLフレームの幅を持つ
ブロツクｂ内の各点（ｉ、ｊ）の入力パタンの特
徴ベクトルと標準パタンの特徴ベクトル間の距離
ｄ（ｉ、ｊ）を求める距離計算部と、前記ブロツ
クｂと１つ前のブロツクｂ−１との境界部分の
DPマツチング計算を第１の漸化式を用いて求め
る境界DP計算部と、前記ブロツクｂの内部のDP
マツチング計算を第２の漸化式を用いて求める本
体DP計算部と、前記境界DP計算部と本体DP計
算部で求められた最小累積距離が得られる単語の
組合せを定める判定部とを備えている。 The continuous speech recognition device of the present invention includes an input pattern memory section that stores input patterns, a standard pattern memory section that stores standard patterns, and a rule (which means that state transition p→q occurs when word n is input). an automaton storage unit that stores a state transition table Δ and a final state group F, which are a group of p, q, n);
In the state transition specified by state p and word n, the input pattern at each point (i, j) in block b having the width of the input pattern BL frame within the area defined by input pattern time i and standard pattern time j. A distance calculation unit that calculates the distance d(i, j) between the feature vector and the feature vector of the standard pattern, and a distance calculation unit that calculates the distance d(i, j) between the feature vector and the feature vector of the standard pattern;
A boundary DP calculation unit that calculates the DP matching calculation using the first recurrence formula, and a DP inside the block b.
A main body DP calculation unit that calculates a matching calculation using a second recurrence formula, and a determination unit that determines a combination of words that yields the minimum cumulative distance calculated by the boundary DP calculation unit and the main body DP calculation unit. There is.

[Effect]

次に本発明の作用について第４図を参照しなが
ら説明する。 Next, the operation of the present invention will be explained with reference to FIG.

第４図は本発明の原理を説明するための図であ
る。 FIG. 4 is a diagram for explaining the principle of the present invention.

本発明の原理のDPマツチングの計算は、ブロ
ツクの境界部分では傾斜制限のない漸化式を使用
し、ブロツク内部では傾斜制限ありの漸化式を使
用して進められる。 The calculation of DP matching according to the principles of the present invention proceeds by using a recurrence formula without a slope restriction at the boundary of a block, and using a recurrence formula with a slope restriction inside the block.

初期条件をＴ(0、0)＝０Ｔ(i、q)＝∞、ｉ≠０、ｑ＝≠ Ｇ(p、n、j)＝∞ …(19) とし、ブロツクｂ＝１よりＩ／BLまで順次次の
（20）、（21）式の境界条件を基に（22）、〜（25）
式の漸化式を（ｐ、ｑ、ｎ）ｎ ―→ Δなる対（ｐ、
ｎ）について計算する。すなわち状態ｐとの境界
条件をｉ＝i_s、…、i_eについてｇ(i-1、0)＝Ｔ(i-1、p) ｈ(i-1、0)＝ｉ−１ …(20) ただしi_s＝（ｂ−１）・BL＋１、i_e＝ｂ・BL とし、ブロツクｂ−１との境界条件ををｊ＝１、
…、Jⁿなる標準パタン時刻ｊについてｇ（i_s−１、ｊ）＝Ｇ(p、n、j) ｈ（i_s−１、ｊ）＝Ｈ(p、n、j) …(21) とし、ブロツクの境界部分を求める第１の漸化式ｇ（i_s、ｊ）＝ｄ（i_s、ｊ）＋minｇ（i
_s−１、ｊ）ｇ（i_s−１、ｊ−１）ｇ（i_s−１、ｊ−２） …(22) ｈ（h_s、ｊ）＝ｈ（i^、j^） …（23）ただし（i^、j^）は(22)式の右辺における最小を
与える(i、j)である。 The initial conditions are T(0, 0)=0, T(i, q)=∞, i≠0, q=≠ G(p, n, j)=∞...(19), and from block b=1, I/ (22) to (25) based on the boundary conditions of equations (20) and (21) sequentially up to BL.
The recurrence formula of (p, q, n)n -→ Δ is the pair (p, q, n)
n). In other words, the boundary conditions with state p are i=i _s , ..., i _e g(i-1, 0)=T(i-1, p) h(i-1, 0)=i-1 ...(20 ) However, i _s = (b-1)・BL+1, i _e =b・BL, and the boundary conditions with block b-1 are j=1,
..., J ⁿ for standard pattern time j g(i _s -1, j) = G(p, n, j) h (i _s -1, j) = H(p, n, j) ...(21) The first recurrence formula g(i _s , j) = d(i _s , j) + ming(i
_s −1, j) g(i _s −1, j−1) g(i _s −1, j−2) …(22) h(h _s , j)=h(i^, j^) …( 23) However, (i^, j^) is (i, j) that gives the minimum on the right side of equation (22).

を計算し、続いてブロツク本体を求める第２の漸
化式ｇ(i、j)＝ｄ(i、j)＋minｄ(i-1、j)＋
ｇ(i-2、j-1) ｇ(i-1、j-1) ｇ(i-1、j-2) …(24) ｈ（ｉ、ｊ）＝ｈ（i^、j^） …(25) ただし（i^、j^）は（24）式の右辺における最小
を与える（ｉ、ｊ）である。Then, the second recurrence formula g(i, j) = d(i, j) + mind(i-1, j) + calculates the block body.
g(i-2, j-1) g(i-1, j-1) g(i-1, j-2) …(24) h(i, j)=h(i^, j^)… (25) However, (i^, j^) is (i, j) that gives the minimum on the right side of equation (24).

をｉ＝i_s＋１よりi_eまで計算する。Calculate from i=i _s +1 to i _e .

次のブロツクの計算のため前記計算結果の境界
値を格納する。 The boundary value of the calculation result is stored for calculation of the next block.

Ｇ(p、n、j)＝ｇ（i_e、ｊ）Ｈ(p、n、j)＝Ｈ（i_e、ｊ） …(26) (21)、〜(25)式の計算が標準パタン時刻ｊ＝Jⁿ
まで終了した後、単語境界における最小化として
ｉ＝i_s、…、i_eについて if Ｔ(i、q)＞ｇ（ｉ、Jⁿ then Ｔ(i、q)＝ｇ（ｉ、Jⁿ）Ｎ(i、q)＝ｎＰ(i、q)＝ｐＬ(i、q)＝ｈ（ｉ、Jⁿ …(27) を計算する。最後に、入力パタンの認識結果は判
定処理として従来方法と同様の手続き、すなわち
（12）、（13）、（14）式より求められる。 G (p, n, j) = g (i _e , j) H (p, n, j) = H (i _e , j) ...(26) The calculations of equations (21) and (25) are standard patterns. Time j=J ⁿ
If T( _i , q)>g(i, J ⁿ then T(i, q ₎ =g(i, J ⁿ ) N(i, q)=n P(i, q)=p L(i, q)=h(i, J ⁿ ...(27) It is obtained using the same procedure as the method, ie, equations (12), (13), and (14).

〔Example〕

次に、本発明について第１図、第５図、〜第１
０図を用いて詳細に説明する。 Next, regarding the present invention, FIGS. 1, 5, - 1
This will be explained in detail using FIG.

第１図は本発明の連続音声認識装置の一実施例
を示すブロツク図、第５図、第６図、第７図、第
８図はそれぞれ第１図の実施例の一部詳細構成を
示す部分ブロツク図、第９図は第１図における動
作の時間関係を示すタイムチヤート、第１０図
ａ，〜ｄは第１図における動作の流れを示すフロ
ーチヤートである。 FIG. 1 is a block diagram showing an embodiment of the continuous speech recognition device of the present invention, and FIGS. 5, 6, 7, and 8 each show a partially detailed configuration of the embodiment of FIG. 1. FIG. 9 is a time chart showing the time relationship of the operations in FIG. 1, and FIGS. 10a to 10d are flow charts showing the flow of the operations in FIG. 1.

標準パタンメモリ部１３０には単語セツトΣに
含まれる単語ｎの標準パタンBⁿが記憶されてお
り、オートマトン記憶部２３０には状態遷移規則
（ｐ、ｑ、ｎ）と最終状態Ｆの指定情報が記憶さ
れている。 The standard pattern memory unit 130 stores a standard pattern B ⁿ of word n included in the word set Σ, and the automaton storage unit 230 stores state transition rules (p, q, n) and designation information for the final state F. remembered.

マイクロホン１００より未知入力音声が入力さ
れると入力部１１０によつて周波数分析がなされ
特徴を示すベクトルa₁に変換され順次入力パタン
メモリ部１２０に送られる。また、入力部１１０
には音声レベルを検知することによつて音声区間
を決定する機能が与えられており、音声区間中で
は「１」その他では「０」なる音声区間信号Ｓを
発生する。制御部２４０は、この音声区間信号Ｓ
の立上りの時刻において初期化パルスSET₁を発
生する。これによつて第１０図ａのブロツク１０
に対応する初期化がＧメモリ１５０とＴメモリ２
００に対してなされる。 When an unknown input voice is input from the microphone 100, the input section 110 performs frequency analysis, converts it into a vector _a1 representing the characteristic, and sequentially sends it to the input pattern memory section 120. In addition, the input section 110
is provided with a function of determining a voice section by detecting the voice level, and generates a voice section signal S that is "1" during the voice section and "0" otherwise. The control unit 240 controls the voice section signal S
Initialization pulse SET ₁ is generated at the rising edge of . This results in block 10 in Figure 10a.
The initialization corresponding to G memory 150 and T memory 2
00.

以上の初期化が終了すると、以後のBLフレー
ム分の入力特徴a_iに入力に同期して入力パタンブ
ロツク信号ｂが１、２…と計数される。この入力
パタンブロツクｂにおいて制御部２４０よりの単
語指定信号ｎは１からＮまで変化する。各単語ｎ
においてオートマトン記憶部２３０中の状態テー
ブルが参照され、またその単語ｎと状態ｐにて規
定される状態指定信号ｑも出力される。 When the above initialization is completed, the input pattern block signal b is counted as 1, 2, etc. in synchronization with the input of the input feature a _i for the subsequent BL frames. In this input pattern block b, the word designation signal n from the control section 240 changes from 1 to N. each word n
The state table in the automaton storage section 230 is referred to, and a state designation signal q defined by the word n and state p is also output.

次に単語ｎ、状態ｐなる１サイクル内の動作を
説明する。この１サイクルによつて第２図に図示
した斜線部分の計算が実行される。すなわち
（20）、（21）式の境界条件のもとで（22）、（25）
式を計算する。初めに制御部２４０よりの信号
SET₂によつて、第１０図ａのブロツク１１，１
２に対応する値のセツトがDPマツチング用ワー
クメモリのｇメモリ３３０とｈメモリ３４０に対
して行われる。続いて制御部２４０よりの標準パ
タン時刻信号ｊは１からJⁿまで変化する。各標準
パタン時刻ｊにおいて、入力パタン時刻信号i₁は
i_s（i_s＝（ｂ−１）・BL＋１）よりi_e（i_e＝ｂ・BL）
まで変化する。 Next, the operation within one cycle of word n and state p will be explained. Through this one cycle, calculations shown in the shaded areas in FIG. 2 are executed. That is, under the boundary conditions of equations (20) and (21), (22) and (25)
Calculate the formula. First, a signal from the control unit 240
By SET ₂ , block 11,1 in Figure 10a
2 is set in the g memory 330 and h memory 340 of the DP matching work memory. Subsequently, the standard pattern time signal j from the control section 240 changes from 1 to J ⁿ . At each standard pattern time j, the input pattern time signal i ₁ is
From i _s (i _s = (b-1)・BL+1), i _e (i _e =b・BL)
changes up to.

i_sフレームにおいて第１０図ｂのブロツク１３
に示した計算が境界DP計算部３１０にて行われ
る。 Block 13 of FIG. 10b in the i _s frame.
The calculation shown in is performed by the boundary DP calculation unit 310.

第５図は境界DP計算部の一例を示すブロツク
図である。初めに入力パタンのi_sフレームと第ｎ
単語の標準パタンのｊフレームが読み出されて、
前記(9)式に示すベクトル間距離ｄ（ｉ、ｊ）が距
離計算部３００にて求められる。 FIG. 5 is a block diagram showing an example of a boundary DP calculating section. First, the i _s frame of the input pattern and the nth
J-frames of the standard pattern of words are read out,
The inter-vector distance d(i, j) shown in equation (9) above is determined by the distance calculation unit 300.

一方、境界DP計算部３１０では、距離計算と
並列して３つの相異度の最小値が求められる。す
なわち、ｇメモリ３３０よりｇ（i_s−１、ｊ）、ｇ
（i_s−１、ｊ−１、ｇ（i_s−１、ｊ−２）とｈメモ
リ３４０よりｈ（i_s−１、ｊ）、ｈ（i_s−１、ｊ−
１）、ｈ（i_s−１、ｊ−２）が読み出され、レジス
タＧ１，Ｇ２，Ｇ３とＨ１，Ｈ２，Ｈ３にそれぞ
れ格納される。比較回路３１２は３つのレジスタ
Ｇ１，Ｇ２，Ｇ３から最小値を検出し、その最小
値が得られたレジスタGn^（n^は１，２，３のどれ
か）に対応したレジスタHn^を選択するゲート信
号n^を発する。前記ゲート信号n^により選択され
たレジスタHn^の内容がｈメモリ３４０ｈ（i_s、
ｊ）へ書き込まれる。また、比較回路３１２より
出力された最小値ｇ（i_s−１、j^）は前記距離計算
部３００で求められた距離ｄ（i_s、ｊ）と加算器
３１１によつて加算され、ｇメモリ３３０のｇ
（i_s、ｊ）へ書き込まれる。 On the other hand, the boundary DP calculation unit 310 calculates the minimum value of the three dissimilarities in parallel with the distance calculation. That is, from the g memory 330, g(i _s −1, j), g
(i _s -1, j-1, g (i _s -1, j-2) and h (i _s -1, j), h (i _s -1, j-
1), h(i _s -1, j-2) are read out and stored in registers G1, G2, G3 and H1, H2, H3, respectively. The comparison circuit 312 detects the minimum value from the three registers G1, G2, and G3, and selects the register Hn^ corresponding to the register Gn^ (where n^ is 1, 2, or 3) from which the minimum value was obtained. It emits a gate signal n^ to The contents of the register Hn^ selected by the gate signal n^ are stored in the h memory 340h (i _s ,
j). Further, the minimum value g (i _s −1, j^) outputted from the comparison circuit 312 is added to the distance d (i _s , j) obtained by the distance calculating section 300 by the adder 311, and g memory 330g
(i _s , j).

続いて各入力ペタン時刻ｉ（ｉはi_s＋１よりi_eま
で変化する）において、第１０図ｂのブロツク１
４に示した計算が本体DP計算部３２０で行われ
る。 Then, at each input point time i (i changes from i _s +1 to i _e ), block 1 of FIG.
The calculation shown in 4 is performed by the main body DP calculation section 320.

第６図は本体DP計算部の一例を示すブロツク
図である。初めに入力パタンのｉフレームが読み
出されて、(9)式に示すベクトル間の距離を距離計
算部３００で求め、ｄ（ｉ、ｊ）が得られる。 FIG. 6 is a block diagram showing an example of the main body DP calculation section. First, the i-frame of the input pattern is read out, and the distance calculation unit 300 calculates the distance between the vectors shown in equation (9) to obtain d(i, j).

一方、本体DP計算部３２０では距離計算と並
列して（24）式の第２の漸化式の右辺の最小値が
求められる。すなわち、ｇメモリ３３０より読み
出されたｇ（ｉ−２、ｉ−１）と、遅延回路３２
４にて１時刻遅延された距離ｄ（ｉ−１、ｊ）が
加算器３２３にて加算されレジスタＧ１に格納さ
れる。また、ｇメモリ３３３よりｇ（ｉ−１、ｊ
−１）とｇ（ｉ−１、ｊ−２）が読み出されレジ
スタＧ２，Ｇ３にそれぞれ格納される。さらに、
ｈメモリ３４０よりｈ（ｉ−２、ｊ−１）、ｈ（ｉ
−１、ｊ−１）、ｈ（ｉ−１、ｊ−２）が読み出さ
れレジスタＨ１，Ｈ２，Ｈ３にそれぞれ格納され
る。 On the other hand, the main body DP calculation unit 320 calculates the minimum value on the right side of the second recurrence formula of equation (24) in parallel with the distance calculation. That is, g(i-2, i-1) read from the g memory 330 and the delay circuit 32
The distance d(i-1,j) delayed by one time in step 4 is added by adder 323 and stored in register G1. Also, from the g memory 333, g(i-1, j
-1) and g(i-1, j-2) are read out and stored in registers G2 and G3, respectively. moreover,
From the h memory 340, h(i-2, j-1), h(i
-1, j-1) and h(i-1, j-2) are read out and stored in registers H1, H2, and H3, respectively.

比較回路３２２は３つのレジスタＧ１，Ｇ２，
Ｇ３より最小値を検出し、その最小値が得られた
レジスタGn^（n^は１、２、３のどれか）に対応し
たレジスタHn^を選択するゲート信号n^を発する。
前記ゲート信号n^により選択されたレジスタHn^の
内容がｈメモリ３４０のｈ（ｉ、ｊ）へ書き込ま
れる。また、比較回路３２２より出力された最小
値は前記距離計算部３００で求められた距離ｄ
（ｉ、ｊ）と加算器３２１によつて加算され、ｇ
メモリ３３０のｇ（ｉ、ｊ）へ書き込まれる。入
力パタン時刻ｉがi_s＋１からi_eまで変化させるこ
とによつて標準パタン時刻ｊに対する処理が終了
する。さらに標準パタン時刻ｊが終端Jⁿとなつた
後、第１０図ｂのブロツク１５に示すように信号
SET３に従つて漸化式（24）の結果ｇ（i_e、ｉ）、
ｈ（i_e、ｊ）をテーブルメモリＧ（ｐ、ｎ、ｊ），
Ｈ（ｐ、ｎ、ｊ）へ格納する。つづいて制御部２
４０より発せされた信号ｉ２（i_sからi_eまで変化
する）に従つて第１０図ｃのブロツク１６に示し
た比較が行われる。すなわち第７図に示すよう
に、信号ｉ２とｑに従つてテーブルメモリ２００
よりＴ（ｉ、ｑ）とｇメモリ３３０よりｇ（ｉ、
Jⁿ）が読み出され、比較回路１７０により比較し
Ｔ（Ｉ、ｑ）＞ｇ（ｉ、Jⁿ）の場合wp信号が発せら
れ、ｇ（ｉ、J^N）、ｎ、ｐ、ｈ（ｉ、Jⁿ）がテーブ
ルメモリＴ（ｉ、ｑ），Ｎ（ｉ，ｑ），Ｐ（ｉ、ｑ），
Ｌ（ｉ、ｑ）へ書き込まれる。 The comparison circuit 322 has three registers G1, G2,
The minimum value is detected from G3, and a gate signal n^ is generated to select the register Hn^ corresponding to the register Gn^ (n^ is any one of 1, 2, or 3) from which the minimum value was obtained.
The contents of the register Hn^ selected by the gate signal n^ are written to h(i, j) of the h memory 340. Further, the minimum value output from the comparison circuit 322 is the distance d calculated by the distance calculation section 300.
(i, j) by the adder 321, and g
It is written to g(i,j) in memory 330. By changing the input pattern time i from i _s +1 to _ie , the processing for the standard pattern time j is completed. Further, after the standard pattern time j reaches the terminal J ⁿ , the signal is changed as shown in block 15 of FIG.
According to SET3, the result of recurrence formula (24) g(i _e , i),
h(i _e , j) in table memory G(p, n, j),
Store in H(p, n, j). Next, control section 2
According to the signal i2 (varying from i _s to i _e ) issued by 40, the comparison shown in block 16 of FIG. 10c is carried out. That is, as shown in FIG. 7, the table memory 200
From T(i, q) and g(i, q) from g memory 330.
J ⁿ ) is read out and compared by the comparator circuit 170. If T(I, q)>g(i, J ⁿ ), a wp signal is generated, and g(i, J ^N ), n, p, h( i, J ⁿ ) are table memories T(i, q), N(i, q), P(i, q),
written to L(i,q).

以上の動作によつて状態ｐ、単語ｎ、入力パタ
ンブロツクｂの処理が終了する。さらに状態指定
信号ｐがp₁からπと変化されることにより単語指
定信号ｎに対する処理が終了する。さらに単語指
定信号ｎが１からＮまで変化することにより入力
パタンブロツクｂに対する処理が終了する。入力
パタンブロツク信号がＩ／BLとなつた後に、
（12）、（13）、（14）式に示した判定処理が開始さ
れる。 With the above operations, the processing of state p, word n, and input pattern block b is completed. Further, the state designation signal p is changed from p ₁ to π, thereby completing the processing for the word designation signal n. Furthermore, when the word designation signal n changes from 1 to N, the processing for input pattern block b ends. After the input pattern block signal becomes I/BL,
The determination process shown in equations (12), (13), and (14) is started.

判定部２２０は第８図に示すように構成され、
初めに第１０図ｄのブロツク１７の処理として、
オートマトン記憶部２３０から最終状態集合Ｆに
含まれる状態ｑを信号ｑ３に従つて順次指定さ
れ、Ｔメモリ２００よりＴ（Ｉ、ｑ）を読み出し、
比較回路２２１、最小値レジスタ２２２、状態レ
ジスタ２２３を用いて最小のＴ（Ｉ、ｑ）が与え
られる状態q^を得る。続いて第１０図ｄのブロツ
ク１８の処理として判定制御部２２７はｑ＝q^、
ｍ＝Ｉとしてアドレス信号ｍ３、ｑをＮメモリ１
９０、Ｌメモリ２１０、Ｐメモリ１８０へ発しＮ
（ｍ、ｑ），Ｌ（ｍ、ｑ），Ｐ（ｍ、ｑ）を読み出す。
このＬ（ｍ、ｑ）とＰ（ｍ、ｑ）は次の時刻ｍと状
態ｑとなり、Ｎ（ｍ、ｑ）は認識結果として出力
される。この処理をl^が零になるまで繰り返すこ
とにより順次認識結果が得られる。 The determination unit 220 is configured as shown in FIG.
First, as the processing of block 17 in FIG. 10d,
The states q included in the final state set F are sequentially specified from the automaton storage unit 230 according to the signal q3, and T(I, q) is read from the T memory 200,
The comparison circuit 221, minimum value register 222, and state register 223 are used to obtain the state q^ that provides the minimum T(I, q). Next, as the process of block 18 in FIG. 10d, the determination control unit 227 determines that q=q^,
Assuming m=I, address signals m3 and q are transferred to N memory 1.
90, send to L memory 210, P memory 180 N
Read out (m, q), L (m, q), and P (m, q).
These L(m, q) and P(m, q) become the next time m and state q, and N(m, q) is output as the recognition result. By repeating this process until l^ becomes zero, recognition results can be obtained sequentially.

以上、本発明の構成を実施例にもとづいて説明
したが、これらの記載は本発明の権利範囲を限定
するものではない。 Although the configuration of the present invention has been described above based on examples, these descriptions do not limit the scope of the present invention.

本実施例では特願昭58−239303明細書に記載し
ているようなブロツクワイズDPマツチング法を
もとに説明しているが、入力パタンの複数フレー
ムをまとめてブロツク化してDPマツチングを実
行する方法であるならば、本発明の原理であるブ
ロツク境界部分とブロツク本体部分を異なる漸化
式で実行することが可能である。すなわち、特願
昭59−067116明細書に記載されている修正DPマ
ツチング計算部をもつブロツクワイズDP法特願
昭59−068015明細書に記載されているブロツクを
標準パタン軸に対して斜めに傾けた斜めブロツク
ワイズDP法、特願昭59−267830明細書に記載さ
れている標準パタン長によつてブロツク幅を変化
させる可変斜めブロツクワイズDP法などにも本
発明の原理であるブロツク境界部分は第１の漸化
式を使用し、ブロツク本体部分は第２の漸化式を
使用することができる。 This example is explained based on the blockwise DP matching method as described in the specification of Japanese Patent Application No. 58-239303, but multiple frames of the input pattern are collectively made into blocks and DP matching is executed. If it is a method, it is possible to implement the principle of the present invention, that is, the block boundary part and the block body part, using different recurrence formulas. That is, the Blockwise DP method having the modified DP matching calculation section described in the specification of Japanese Patent Application No. 59-067116 The block described in the specification of Japanese Patent Application No. 59-068015 is tilted diagonally with respect to the standard pattern axis. The principle of the present invention, the block boundary portion, can also be applied to the diagonal block DP method and the variable diagonal block DP method in which the block width is changed according to the standard pattern length described in the specification of Japanese Patent Application No. 59-267830. The first recurrence formula can be used, and the block body portion can use the second recurrence formula.

さらにブロツク境界部分を求める第１の漸化式
として（22）式を用いているが、入力パタン時刻
ｉとｉ−１のみを使用する漸化式であるならばど
のような形でもよい。例えばｇ(i、j)＝ｄ(i、j)＋minｇ(i-1、j) ｇ(i-1、j-1) …(28) でもよいし、またｇ(i、j)＝ｄ(i、j)＋min ｇ(i-1、j) ｇ(i-1、j-1) ｇ(i-1、j-2) ｇ(i-1、j-3) …(29) でもよい。 Furthermore, although equation (22) is used as the first recurrence equation for determining the block boundary portion, any recurrence equation may be used as long as it uses only input pattern times i and i-1. For example, g(i, j) = d(i, j) + ming(i-1, j) g(i-1, j-1) ...(28) or g(i, j) = d( i, j) + min g(i-1, j) g(i-1, j-1) g(i-1, j-2) g(i-1, j-3) ...(29) may also be used.

またブロツク本体部を求める第２の漸化式とし
て（24）式を使用しているが、例えばｇ(i、j)＝minｄ(i、j)＋ｄ(i-1、j)＋ｇ(i-
2、j-1) ｄ(i、j)＋ｇ(i-1、j-1) ｄ(i、j)＋ｄ(i、j-1)／２＋ｇ(i-1、j-2) …（30）ｇ(i、j)＝ｄ(i、j)＋minｄ(i-1、j)＋ｄ(i-
2、j)＋ｇ(i-3、j-1) (i-1、j)＋ｇ(i-2、j-1) ｇ(i-1、j-1) ｇ(i-1、j-2) ｇ(i-1、j-3) …(31) などを用いてもよいことは明白である。 Also, formula (24) is used as the second recurrence formula for calculating the block body, for example, g(i, j) = mind(i, j) + d(i-1, j) + g(i-
2, j-1) d(i, j)+g(i-1, j-1) d(i, j)+d(i, j-1)/2+g(i-1, j-2)...(30 ) g(i, j) = d(i, j) + mind(i-1, j) + d(i-
2, j) + g(i-3, j-1) (i-1, j) + g(i-2, j-1) g(i-1, j-1) g(i-1, j-2 ) g(i-1, j-3) ...(31) etc. can obviously be used.

〔Effect of the invention〕

以上説明したように本発明は、DPマツチング
の各ブロツクの計算において境界部分の漸化式を
入力パタン時刻ｉとｉ−１のみを使用することに
よつてワークメモリＧ，Ｈを小さくすることがで
き、またデータ転送量も小さくすることができ
る。さらに各ブロツクの本体の漸化式は傾斜制御
のある形式をとり、これにより誤認識率も小さく
できる効果がある。 As explained above, the present invention makes it possible to reduce the work memories G and H by using only the input pattern times i and i-1 for the boundary part recurrence formula in the calculation of each block of DP matching. It is also possible to reduce the amount of data transferred. Furthermore, the recurrence formula for the main body of each block takes a form with tilt control, which has the effect of reducing the rate of misrecognition.

[Brief explanation of drawings]

第１図は本発明の連続音声認識装置の一実施例
を示すブロツク図、第２図、第３図、第４図は本
発明の原理を説明するための図、第５図、第６
図、第７図、第８図はそれぞれ第１図の実施例の
一部詳細構成を示す部分ブロツク図、第９図は第
１図における動作の時間関係を示すタイムチヤー
ト、第１０図ａ〜ｄは第１図における動作の一連
の流れを示すフローチヤートである。１００…マイクロホン、１１０…入力部、１２
０…入力パタンメモリ部、１３０…標準パタンメ
モリ部、１５０…Ｇメモリ、１６０…Ｈメモリ、
１７０…比較回路、１８０…Ｐメモリ、１９０…
Ｎメモリ、２００…Ｔメモリ、２１０…Ｌメモ
リ、２２０…判定部、２３０…オートマトン記憶
部、２４０…制御部、３００…距離計算部、３１
０…境界DP計算部、３２０…本体DP計算部、３
３０…ｇメモリ、３４０…ｈメモリ。 FIG. 1 is a block diagram showing an embodiment of the continuous speech recognition device of the present invention, FIGS. 2, 3, and 4 are diagrams for explaining the principle of the present invention, and FIGS.
7 and 8 are partial block diagrams showing a part of the detailed configuration of the embodiment shown in FIG. 1, respectively. FIG. 9 is a time chart showing the time relationship of the operations in FIG. 1, and FIGS. d is a flowchart showing a series of operations in FIG. 100...Microphone, 110...Input section, 12
0...Input pattern memory section, 130...Standard pattern memory section, 150...G memory, 160...H memory,
170... Comparison circuit, 180... P memory, 190...
N memory, 200...T memory, 210...L memory, 220...judgment unit, 230...automaton storage unit, 240...control unit, 300...distance calculation unit, 31
0... Boundary DP calculation section, 320... Main body DP calculation section, 3
30...g memory, 340...h memory.

Claims

[Claims]

1. In a continuous speech recognition device that recognizes speech in which a string of words specified by a finite state automaton is continuously uttered by performing DP matching with a standard pattern, there is an input pattern memory section that stores input patterns, and a standard memory section that stores the standard patterns. A pattern memory unit, an automaton storage unit that stores a state transition table Δ, which is a group of rules (p, q, n) that means that state transition p → q occurs when word n is input, and a final state group F. Then, in the state transition specified by the state p and word n, the input pattern within the area defined by the input pattern time i and the standard pattern time j.
Each point (i,
a distance calculation unit that calculates the distance d(i, j) between the feature vector of the input pattern of j) and the feature vector of the standard pattern;
a boundary DP calculation unit that calculates DP matching calculation for the boundary portion with block b using the first recurrence formula;
A main body DP calculation unit that calculates using a second recurrence formula different from the recurrence formula, and a body DP calculation unit that calculates the boundary DP calculation unit and the main body DP.
A continuous speech recognition device comprising: a determination unit that determines a combination of words that yields the minimum cumulative distance determined by the calculation unit.