JPS5898796A

JPS5898796A - Continuous voice recognition equipment

Info

Publication number: JPS5898796A
Application number: JP56197841A
Authority: JP
Inventors: 誠夫亘理; 迫江　博昭
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1981-12-09
Filing date: 1981-12-09
Publication date: 1983-06-11
Also published as: JPH0134399B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】本発明は１個以上の単語を連続して発声した連続音声を
自動的に認識する連続音声認識装置に関する。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a continuous speech recognition device that automatically recognizes continuous speech in which one or more words are successively uttered.

音声認識の手段としては従来から楢々の方法が試みられ
ている。それらの中で最も簡単、かつ有効な方法として
パタンマツチング法があげられる。As a means of voice recognition, the Narasa method has been tried in the past. Among them, the pattern matching method is the simplest and most effective method.

この方法は、認識すべきＷ！索の各単語に標準的なパタ
ン（以下単語標準パタンと称する）を用意しておき、入
力された未知の音声パタン（以下入力パタンと称する）
との間で比較操作（すなわちバタンマツチング）を行っ
て相互で異なる度合を表わす量（以下相異度と称する）
を算出し、最も相異の少ないすなわち相異度が最小にな
る単語標準パタンと同じ単語に属すると判定する方法で
ある。This method is a must-recognize W! A standard pattern (hereinafter referred to as the word standard pattern) is prepared for each word in the search, and the input unknown voice pattern (hereinafter referred to as the input pattern) is prepared in advance.
(hereinafter referred to as the degree of dissimilarity)
This is a method for determining that a word belongs to the same word as the word standard pattern with the least difference, that is, the minimum degree of difference.

特開昭５１−１０４２０４号公報には上記パタンマツチ
ング法を基礎として動作する連続音声認識装置の動作原
理が記載されている。この原理は大路次のようである。JP-A-51-104204 describes the operating principle of a continuous speech recognition device that operates based on the pattern matching method described above. The principle behind this is as follows.

すなわち、何個かの単語標準バタンをあらゆる順列で接
続することによって得られるパタンを連続音声の標準パ
タン（以下連続音声標準パタンと称す）と考えて、入力
パタン全体とのマツチングを行う。全体としての相異度
が最小となるように単語標準パタンの個数と単語標準パ
タンの順列を定めることによって認識を行なう。That is, a pattern obtained by connecting several word standard patterns in any permutation is considered as a continuous speech standard pattern (hereinafter referred to as a continuous speech standard pattern), and is matched with the entire input pattern. Recognition is performed by determining the number of word standard patterns and the permutation of word standard patterns so that the overall degree of difference is minimized.

実際には上記最小化を単語単位での最小化と全体として
の最小化の２段階に分割し、それぞれの最小化を動的計
画法を利用して実行する（以下動的計画法を用いたマツ
チングをＤＰマツチングと称する）。In reality, the above minimization is divided into two stages: word-by-word minimization and overall minimization, and each minimization is performed using dynamic programming (hereinafter, the minimization is performed using dynamic programming). This matching is called DP matching).

上記公開公報記載の装置で紘単語単位での最小化におい
て、入力パタンを単語単位にあらゆる可能な分割をし、
そのすべてに対して単語標準パタンとのＤＰマツチング
を行っている。すなわち入カパタン長をＮとし、単語標
準パタン数をＶとすればＭ・Ｖ回のＤＰマツチングを必
要とする。In the minimization of each word using the device described in the above publication, the input pattern is divided into every possible word,
DP matching with word standard patterns is performed for all of them. That is, if the input pattern length is N and the number of word standard patterns is V, then M.V times of DP matching are required.

ところで、上記のＤＰマツチングの回数をＬｍａｘ・Ｖ
回（入力パタンの最大可能桁数をＬｍａｘとする）にす
る方法がＩＥＥＥＴＲＡＮＳＡＣＴｌＯＮＳＯＮＡＣ
ＯＵＳＴｌＣＳ，ＳＰＥＥＣＨ、ＡＮＤＳＩＧＮＡＬ
ＰＲＯＣＥＳＳＩＮＧ、ＶＯＬＡＳＳＰ−２９ＮＯ．
２ＡＰＲＩＬ１９８１第２８４頁から第２９７頁に記
載されている。次にこの方法（以下ＨＬＢ法と称する）
の大略を述べる。入力パタンＡと連続音声標準パタンＣ
＝Ｂｖ１、Ｂｖ２、・・・、Ｂｖｌ、・・・、ＢｖＬｍ
ａｘとの相異度は次のようにして求める。入力パタンの
時間点ｍと連続音声標準パタンの時間点ｎを第１図に示
したような最適な単調増加で非線形関数ｎ＝ｎｍ（以下
時間正規化関数という）にて対応づけを行い、その対応
づけられた時間点における特徴ベクトル間の距離ｄ（ｍ
、ｎ）を時間正規化関数に沿って加算したものを相異度
８（Ａ、Ｃ）と定義する。By the way, the number of times of the above DP matching is Lmax・V
(the maximum possible number of digits of the input pattern is Lmax) is based on IEEE
OUSTlCS, SPEECH, AND SIGNAL
PROCESSING, VOL ASSP-29NO.
2APRIL 1981, pages 284 to 297. Next, this method (hereinafter referred to as HLB method)
I will give an outline. Input pattern A and continuous voice standard pattern C
=Bv1, Bv2,..., Bvl,..., BvLm
The degree of difference with ax is determined as follows. The time point m of the input pattern and the time point n of the continuous speech standard pattern are correlated using an optimal monotonically increasing nonlinear function n=nm (hereinafter referred to as time normalization function) as shown in Figure 1, and The distance d(m
, n) along the time normalization function is defined as the degree of dissimilarity 8 (A, C).

ζこで距離ｄ（ｍ、ｎ）は例えば（８）式にて求めるこ
とができる。ζ Here, the distance d(m, n) can be determined, for example, using equation (8).

（１）式の最小化を次のような動的計画の手法で行う。Equation (1) is minimized using the following dynamic programming method.

す表わち、初期条件Ｄ（０，０）＝Ｏ・・・・・・（４）Ｄ（ｍ、Ｑ）＝∞ ｍ＝０〜Ｍ・・・・・・（５）Ｆ（
ｍ、Ｏ）＝ｍｍ＝０〜Ｍ・・・・・・（６）のもとに
漸化式ただしをｍ＝Ｌ（ｍ）〜Ｕ（ｍ）、ｍ＝１〜Ｍについてすなわ
ち第１図の斜線部分について求める。In other words, the initial condition D (0, 0) = O (4) D (m, Q) = ∞ m = 0 ~ M (5) F (
m, O) = m m = 0 ~ M ...... Based on (6), the recurrence formula is written for m = L (m) ~ U (m), m = 1 ~ M, that is, the first Find the shaded area in the figure.

Ｕ（ｍ）＝２ｍ−１・・・・・・（１０）Ｌ（ｍ）＝（
ｍ＋１）／２・・・・・・（１１）ここでａｒｇｍｉｎ
ｙはｘＥＸの条件の下でｙを最小とするｘを意味してい
る。すなわち（９）式はｎ−２≦ｎ’≦ｎのもとでＤ（
ｍ−１，ｎ’）を最小とするｎ′をｎとしている。また
、（７）式は第２図に示す３つの経路より最小を選択す
ることを示しておシ、許される経路を３つに制限したの
は時間正規化関数による対応づけが必要以上に歪むこと
を防ぐためである。ここで相異度を求める時用いた最小
値を選択した経路をマツチング経路と呼び、（８）式の
Ｆ（ｍ。U(m)=2m-1...(10)L(m)=(
m+1)/2...(11) Here argmin
y means x that minimizes y under the condition of xEX. In other words, equation (9) is D(
m-1, n') is set to be n'. In addition, equation (7) indicates that the minimum is selected from the three routes shown in Figure 2, and limiting the number of allowed routes to three causes the correspondence by the time normalization function to become unnecessarily distorted. This is to prevent this. Here, the path that selects the minimum value used when calculating the degree of dissimilarity is called the matching path, and F(m) in equation (8).

ｎ）を経路情報と呼ぶ。前記漸化式（７）、ｔ８）、（
９）を入力パタンの終端Ｍ、連続音声標準パタンの終端
Ｎまで計算して得られるＤ（Ｎ、Ｎ）が前記（１）式の
相異度Ｓ（Ａ、Ｃ）である。n) is called route information. The recurrence formula (7), t8), (
D(N, N) obtained by calculating 9) from the end M of the input pattern to the end N of the continuous speech standard pattern is the degree of dissimilarity S(A, C) in equation (1).

ところで、全体の最小相異度を求めた時得られたマッチ
ング経路（ｍ、ｎ（ｉ）上のある点（ｍｌ、ｎｌ）にお
いて、始端よりそのマツチング経路に沿ってその点（ｍ
ｌ、ｎｌ）まで得られた部分相異度は、その点（ｍｌ、
ｎｌ）を通るすべてマツチング経路に沿りて得られる部
分相異度の最小値である。すなわちある点（ｍｌ、ｎｌ
）を通るすべてのマツチング経路に沿って得られる全体
相異度の最小値は始端よりその点（ｍｌ、ｎｌ）までの
部分相異度とその点（ｍｌ．ｎｌ）より終端までの部分
相異度のそれぞれの最小値の和で与えられる。By the way, at a certain point (ml, nl) on the matching path (m, n(i)) obtained when calculating the overall minimum dissimilarity, the point (m
The partial dissimilarity obtained up to the point (ml, nl) is
nl) is the minimum value of partial dissimilarity obtained along all matching paths passing through nl). That is, a certain point (ml, nl
) The minimum value of the overall dissimilarity obtained along all matching paths passing through It is given by the sum of the minimum values of each degree.

すなわち、ただし（ｍｌ、ｎｌ）はＳ（Ａ、Ｃ）が得られたマツチ
ング経路上の点である。今、連続音声標準パタンの各桁
ごとの区切れ目の点を考え、それぞれの桁で最小の部分
相異度を求め、その和として最小の全体相異度を得るこ
とができる。従って、入力バタンＡと連続音声標準パタ
ンＣ＝ＢＶｓ、Ｂｖ２、・・・、ＢＶｌ、・・・、Ｂと
の最小相異度は次のようにして求めることができる。初
めに、連続音声標準パタンの第１桁目の各単語標準パタ
ンと入力パタンとのマツチングを行い、相異度の最小値
を求め、その結果を第２桁目のマツチングの初期値とし
て第２桁目の各単語標準パタンと入力パタンとのマツチ
ングを行う。第Ｌｍａｘ桁までマツチングを行った後、
入力パタンの終端Ｍにおける各桁ごとの相異度の最小値
を求め、最適な桁ａＬを得る。第Ｌ桁の相異度が得られ
九マツチング経路を逆にたどって順次各桁での認識カテ
ゴリを得る。That is, where (ml, nl) is the point on the matching path where S(A, C) was obtained. Now, considering the break points for each digit of the continuous speech standard pattern, the minimum partial dissimilarity can be found for each digit, and the minimum overall dissimilarity can be obtained as the sum of the results. Therefore, the minimum degree of difference between the input button A and the continuous voice standard pattern C=BVs, Bv2, . . . , BVl, . . ., B can be determined as follows. First, each word standard pattern in the first digit of the continuous speech standard pattern is matched with the input pattern, the minimum value of the degree of dissimilarity is found, and the result is used as the initial value for matching in the second digit. The standard pattern of each word in the digit is matched with the input pattern. After matching up to the Lmax digit,
The minimum value of the degree of difference for each digit at the terminal end M of the input pattern is determined to obtain the optimum digit aL. The degree of difference for the L-th digit is obtained, and the nine matching paths are traced in reverse to obtain the recognition category for each digit in sequence.

次にＨＬＢ法の計算手順を第３図〜第６図を用いて説明
する。第３図は相異度計算の進行順序を示す図、第４図
は第１桁目の相異度計算を示す図、第５図は桁経路情報
ＦＢ（ｌ、ｍ）、桁認繊カテゴリＷ（Ｊ、ｍ）よシ判定
計算順序を示す図、Ｎ６図は前記引用文献の第２８９頁
から第２９０頁に記載されているアルゴリズム５をフロ
ーチャートで表わしたものである。ここで、ｍは入力パ
タンの時間点、ｎは標準パタンの時間点、Ｖは単語、ｌ
は桁、Ｍは入力パタンの終端、Ｎｖは第Ｖ番目の単語標
準パタンの終端、■は単語標準パタン数、Ｌｍｉｎは入
力パタンの最小桁数、Ｌｍａｘは最大桁数である。相異
度計算は、初期条件（４）、（５）、（６）式のもとて
漸化式（γ）、（８）、（９）を第３図に示す領域ｌよ
）領域Ｌｍａｘまですなわち連続音声標準パタンの各桁
ごとに順に求めることでるる。初期条件の設定は第６図
のブロック１で行われる。次に第ｌ桁目における相異度
計算は以下のように行われる。Next, the calculation procedure of the HLB method will be explained using FIGS. 3 to 6. Figure 3 is a diagram showing the progression order of difference degree calculation, Figure 4 is a diagram showing the difference degree calculation of the first digit, Figure 5 is digit route information FB (l, m), digit recognition fiber category Figure N6, which is a diagram showing the W(J, m)-wise determination calculation order, is a flowchart representing Algorithm 5 described on pages 289 to 290 of the cited document. Here, m is the time point of the input pattern, n is the time point of the standard pattern, V is the word, and l
is a digit, M is the end of the input pattern, Nv is the end of the Vth word standard pattern, ■ is the number of word standard patterns, Lmin is the minimum number of digits of the input pattern, and Lmax is the maximum number of digits. The dissimilarity calculation is performed using the initial conditions (4), (5), and (6) as the recurrence formula (γ), (8), and (9) in the region L shown in Fig. 3). In other words, each digit of the continuous speech standard pattern can be found in order. Setting of initial conditions is performed in block 1 of FIG. Next, the degree of difference calculation at the l-th digit is performed as follows.

相異度Ｄ（ｍ、０）の初期値として前の桁の結果である
桁相異度ＤＢ（ｌ−１，ｍ）をセットしく第６図のブロ
ック２で行われる）、（７）、（８）、（９）式に示す
漸化式を第４図に示すように上限Ｕ（ｍ）と下限Ｌ（ｍ
）でかこまれた部分について計算する（第６図のブロッ
ク３．４で行われる）。ここで上限Ｕ（ｍ）は第４図の
左側の線分ＡＢ（（１０）式に示されている）および上
側の線分ＢＥを意味し、下限Ｌ（ｍ）は下側の線分ＡＣ
および右側の線分ＣＥ（（１１）式に示されている）を
意味する。単語標準パタンの終端ＮＹまで計算を行い、
その終端ＮＹでの相異度Ｄ（ｍ、ＮＹ）を単語相異度Ｄ
（ｖ、ｍ）とする（第６図のブロック５で行われる。１
個の単語標準パタンと計算した後単語相異度り、（ｖ、
ｍ）７）最小値を求め、その最小値を桁相異度ＤＢ（）
１ｍ）とし、その最小値が得られた単語標準パタンの属
するカテゴリｖを桁認識カテゴリＷ（ｌ、ｍ）とし、そ
の最小値が得られたマツチング経路情報Ｆ（ｖ、ｍ）を
桁経路情報ＦＢ（ｌ、ｍ）とする（第６図のブロック６
で行われる）。このようにして第１桁目より第Ｌｍａｘ
桁目まで相異度計算を行った後、得られた桁経路情報Ｆ
Ｂ（ｌ、ｍ）と桁認識カテゴリＷ（ｌ、ｍ）より人力パ
タンの判定を行う。まず、入力パタンの終端Ｍにおける
各桁の桁相異度ＤＢ（ｌ、Ｎ）より、許された桁すなわ
ちＬｍｉｎ桁よりＬｍａｘ桁の間で最小値を求め、（第
６図のブロック７で行われる）、最小値の得られた桁り
が入力パタンの最終的に決定され九桁数である。つづい
て、第５図に示すように第Ｌ桁目の認識結果Ｒ（Ｌ）を
Ｗ（Ｌ、Ｎ）より得、また入力パタンの終端Ｎでの桁経
路情報ＦＢ（Ｌ。The digit dissimilarity DB(l-1, m), which is the result of the previous digit, is set as the initial value of the dissimilarity D(m, 0) (this is done in block 2 of FIG. 6), (7), The recurrence formulas shown in equations (8) and (9) are converted into upper limit U(m) and lower limit L(m
) (carried out in block 3.4 of Figure 6). Here, the upper limit U(m) means the left line segment AB (shown in equation (10)) and the upper line segment BE in FIG. 4, and the lower limit L(m) means the lower line segment AC
and the line segment CE on the right (shown in equation (11)). Calculate up to the end NY of the word standard pattern,
The dissimilarity degree D (m, NY) at the terminal NY is the word dissimilarity degree D
(v, m) (performed in block 5 of Fig. 6.1
After calculating the word standard pattern, the word dissimilarity is (v,
m) 7) Find the minimum value and use the minimum value as digit dissimilarity DB ()
1m), the category v to which the word standard pattern whose minimum value was obtained belongs is defined as the digit recognition category W(l, m), and the matching path information F(v, m) from which the minimum value was obtained is the digit path information. FB(l,m) (block 6 in Fig. 6)
). In this way, the Lmax is calculated from the first digit.
After calculating the degree of difference up to the digit, the obtained digit path information F
The manual pattern is determined from B(l, m) and digit recognition category W(l, m). First, from the digit dissimilarity DB (l, N) of each digit at the end M of the input pattern, find the minimum value between the allowed digits, that is, between Lmin digit and Lmax digit. ), the obtained digit of the minimum value is finally determined as the nine-digit number of the input pattern. Subsequently, as shown in FIG. 5, the L-th digit recognition result R(L) is obtained from W(L, N), and the digit path information FB(L) at the terminal end N of the input pattern is obtained.

Ｍ）より第Ｌ−１桁目の終端を得る（第６図のブロック
８で行われる）。前記操作を順にくり返すことによって
、各桁での認識結果Ｒ（ｌ）が得られる。M) to obtain the end of the L-1st digit (performed in block 8 of FIG. 6). By repeating the above operations in order, a recognition result R(l) for each digit can be obtained.

以上説明したように、ＨＬＢ法では各桁で７回のＤＰマ
ツチングを行えばよいので、全体でＬｍａｘ・Ｖ回のＤ
Ｐマッチングを必要としている。As explained above, in the HLB method, it is only necessary to perform DP matching seven times for each digit, so there are a total of Lmax V times of DP matching.
P matching is required.

一方、特開昭５１−１０４２０４号公報の方法ではＭ・
Ｖ回のＤＰマッチングが必要である。通常、入力パタン
の最大桁数Ｌｍａｘが５程度である場合には、フレーム
周期を２０ｍｇと想定すると、入カパタン長Ｍは１００
程度となり、ＨＬＢ法の計算量は大幅に少ないことにな
る。On the other hand, the method disclosed in Japanese Patent Application Laid-Open No. 51-104204
V times of DP matching are required. Normally, when the maximum number of digits Lmax of the input pattern is about 5, and assuming the frame period is 20 mg, the input pattern length M is 100.
This means that the amount of calculation for the HLB method is significantly smaller.

音声認識装置において認識応答時間は、音声の終端が検
出されてから認識結果を出力するまでの時間である。と
ころで、ＨＬＢ法においては、第１桁目のマツチングに
必要な入力パタンが得られた後、第１行目のマツチング
が開始され、順次第Ｌｍａｘ桁目までマツチングを行い
、認識結果が得られる。その途中の第ｌ桁目に関しては
、第４図に示した右上隅のＥ点（ｍ、、ｎ、）まで入力
パタンが得られた時、すなわち連続音声標準パタンの第
ｌ桁目までの最大パタン長をｎｏ、単語標準パタンの最
大パタン長をＮｍａｘとすれば、Ｅ点の座標の関係よりｎ、＝Ｌ（ｍ、）・・・・・・（１４）でありであるのでｍ、＝２・Ｎｍａｘ・ｌ−１・・・・・・（１６）とな
り、２・Ｎｍａｘ・ｌ−１点まで入力パタンが得られた
時第ｌ桁目のマツチングを行うことができる。今、入力
パタンはＬ桁で各桁の平均単語長をＮとし、Ｌ＝Ｌｍａ
ｘと仮定すれば、（１６）式へｍ、＝Ｌ・Ｎを代入する
ととなり、入力音声の終端が検出された時点では’Ｌｍ
ａｘ桁までの計算しか進めることはできず、残りのＬｍ
ａｘ桁に関してはその後で計算することになり、このＬ
ｍａｘ桁分の計算時間が認識応答時間となり、大きな遅
れを持つ仁とになる。一方、この認識応答時間を短くす
るためには、短くするためには、並列処理やパイプライ
ン処理ができる複雑な高速演算器を必要とする。In a speech recognition device, the recognition response time is the time from when the end of speech is detected to when the recognition result is output. By the way, in the HLB method, after the input pattern necessary for matching the first digit is obtained, matching of the first line is started, and matching is performed sequentially up to the Lmax digit to obtain a recognition result. Regarding the lth digit in the middle, when the input pattern is obtained up to point E (m,, n,) in the upper right corner shown in Figure 4, that is, the maximum value up to the lth digit of the continuous speech standard pattern. If the pattern length is no and the maximum pattern length of the word standard pattern is Nmax, then from the relationship of the coordinates of point E, n, = L(m,) (14), so m, = 2.Nmax.l-1 (16), and when the input pattern is obtained up to 2.Nmax.l-1 points, matching of the l-th digit can be performed. Now, the input pattern has L digits, and the average word length of each digit is N, and L=Lma
If it is assumed that
The calculation can only proceed up to ax digits, and the remaining Lm
The ax digit will be calculated later, and this L
The calculation time for max digits becomes the recognition response time, resulting in a response with a large delay. On the other hand, in order to shorten this recognition response time, a complex high-speed arithmetic unit capable of parallel processing or pipeline processing is required.

本発明の目的は、上記ＨＬＢ法を改良するととにより、
認識応答時間を短縮させ、さらに全体の計算量を少なく
し、これにより経済的な連続音声認識装置を提供するこ
とにある。The purpose of the present invention is to improve the above HLB method, and to
It is an object of the present invention to provide an economical continuous speech recognition device by shortening the recognition response time and reducing the overall amount of calculation.

このためＨＬＢ法の計算順序を入れ換えて、本発明の原
理であるＶＬＢ法と呼ぶ新規な計算原理を導出する。Ｈ
ＬＢ法においては第６図のフローチャートに示すように
、各単語標準パタンと入力パタンとの相異度の針算紘、
初めに各単語標準バタンと入力パタンと計算を行い、次
に桁を１つ上げ同様の計算を行っている。すなわち第６
図のブロック３と４に示す計算のルーズの順序は一番内
側より、単語標準パタンの時間点ｎ、入カパタンの時間
点ｍ、単語標準パタンの番号ｖ、桁の番号ｌである。こ
こで前記計算のループの順序を入れ換え、第７図のフロ
ーチャートに示すように、一番外側を入力パタンの時間
点ｍにすることが可能であることを示す。ＤＰマツチン
グの時間正規化関数ｎ−は単調増加関数であるので、第
ｌ桁目の初期値ＤＨ（ｌ−１、ｍ−１）は、入力パタン
の時間点ｍ−１以前のデータによって決定されている。Therefore, the calculation order of the HLB method is changed to derive a new calculation principle called the VLB method, which is the principle of the present invention. H
In the LB method, as shown in the flowchart of Figure 6, the degree of dissimilarity between each word standard pattern and the input pattern is calculated,
First, calculations are performed using the standard button and input pattern for each word, and then the same calculation is performed by increasing the digit by one. That is, the sixth
The loose order of calculations shown in blocks 3 and 4 of the figure is, from the innermost side, the time point n of the word standard pattern, the time point m of the input pattern, the number v of the word standard pattern, and the digit number l. Here, it is shown that it is possible to change the order of the calculation loop and set the outermost point to time point m of the input pattern, as shown in the flowchart of FIG. Since the time normalization function n- of DP matching is a monotonically increasing function, the initial value DH (l-1, m-1) of the lth digit is determined by the data before time point m-1 of the input pattern. ing.

すなわち入力パタンの時間点ｍ−１以前のすべての点に
おいて相異度計算が終了しているならば、入力パタンの
時間点ｍにおける相異度計算を行うことができる。すな
わち、第８図の斜線部分に示すようにｎ軸に平行で各桁
を含む縦１列の相異度計算を行うことができる。仁の各
桁を含む縦１列の相異度計算には各桁での初期値ＤＢ（
ｌ−１，ｍ−１）と各桁のｍ−１点における相異度Ｄ（
ｍ−１，ｎ）が必要であり、これらはｍ−１点での計算
にて求められている。ただし、ｍ−１点における相異度
Ｄ（ｍｔ、ｎ）および経路情報Ｆ（ｍ−１、ｎ）を各桁
ｌ各単語Ｖについて記憶しておく必要がある。このため
桁ｌ、単語Ｖにおける相異度Ｄ（ｍ−１，ｎ）およびＦ
（ｍ−ｌｅ”）をそれぞれＤ（ｌ、ｖ、ｎ）およびＦ（
ｌ、ｖ、ｎ）で示す。このＤ（ｌ、ｖ、ｎ）とＦ（ｌ、
ｖ、ｎ）の構成を第１５図に示す。That is, if the degree of dissimilarity calculation has been completed at all points before the time point m-1 of the input pattern, the degree of dissimilarity calculation at the time point m of the input pattern can be performed. That is, as shown in the shaded area in FIG. 8, it is possible to calculate the degree of dissimilarity in one vertical column that is parallel to the n-axis and includes each digit. To calculate the dissimilarity of one vertical column including each digit of jin, use the initial value DB (
l-1, m-1) and the degree of dissimilarity D(
m-1, n) are required, and these are obtained by calculation at the m-1 point. However, it is necessary to store the degree of dissimilarity D (mt, n) at point m-1 and the route information F (m-1, n) for each digit V and each word V. Therefore, the degree of dissimilarity D (m-1, n) and F at digit l and word V
(m-le”) respectively D(l, v, n) and F(
l, v, n). This D(l, v, n) and F(l,
v, n) is shown in FIG. 15.

このように計算順序を入れ換えたＶＬＢ法の計算手順を
第７図と第８図を用いて説明する。相異度計算は、動的
計画の漸化式を第８図に示すように上限Ｕ（ホ）と下限
Ｌ−の間の領域内で入力パタンの時間軸ｍの順に求める
ことである。ＶＬＢ法における相異度計算の初期条件はＤ（ｌ、ｖ、ｎ）∞・・・・・・（１７）ｌ＝１〜Ｌｍ
ａｘ、ｖ＝１〜Ｖ、ｎ＝１〜ＮｖＤＢ（ｌ、ｍ）＝∞・
・・・・・（１８）ｌ＝０〜Ｌｍａｘ、ｍ＝０〜ＭＤＢ（０，０）＝０・・・・・・（１８）であり、第７
図のブロックｌで行われる。次に入力パタンの時間点ｍ
におけるｎ軸に平行な縦１列の相異度計算は以下のよう
に行われる。初めに入力パタンの時間点ｍの特徴ベクト
ルａｍ、と第Ｖ番目の単語標準パタンｂｙとの間のベク
トル距離を（８）式により求める（第７図のブロック２
で行われる）。つづいて各桁において縦１列の相異度計
算を行う。この縦１列の相異度計算は、初期値をＤ（ｌ
、ｖ、０）＝ＤＢ（ｌ−１，ｍ−１）・・・・・・（２
０）Ｆ（ｌ、ｖ、０）＝ｍ−１・・・・・・（２１）と
して（第７図のブロック３で行われる）、漸化式ただしをＵ（ｍ）とＬ（ｍ）の間でｎを減少させる方向で計算
する（第７図のブロック４で行われる）。第２図に示す
ように（ｍ、ｎ）点の計算は（ｍ−１，ｎ）、（ｍ−１
，ｎ−１）、（ｍ−ｔ、ｎ−２）の３点の相異度より求
められる。次の（ｍ、ｎ−１）点の計算は（ｍ−ｔ、ｎ
−１）、（ｍ−１、ｎ−２）、（ｍ−１、ｎ−３）の３
点の相異度より求められ（ｍ−１、ｎ）点の相異度は使
用しないので（ｍ、ｎ）点の計算結果を（ｍ−１、ｎ）
点へ記憶しても（ｍ、ｎ−１）点の計算に影譬を与えな
い。The calculation procedure of the VLB method in which the calculation order is changed in this way will be explained using FIGS. 7 and 8. The dissimilarity calculation is to obtain the recurrence formula of the dynamic programming in the order of the time axis m of the input pattern within the region between the upper limit U (e) and the lower limit L-, as shown in FIG. The initial condition for dissimilarity calculation in the VLB method is D(l, v, n)∞ (17) l = 1 ~ Lm
ax, v=1~V, n=1~NvDB(l, m)=∞・
...(18) l = 0 to Lmax, m = 0 to M DB (0,0) = 0 (18), and the seventh
This is done in block l of the figure. Next, the time point m of the input pattern
The dissimilarity calculation for one vertical column parallel to the n-axis is performed as follows. First, the vector distance between the feature vector am at time point m of the input pattern and the Vth word standard pattern by is calculated using equation (8) (block 2 in Figure 7).
). Next, a dissimilarity calculation is performed in one vertical column for each digit. In this vertical column dissimilarity calculation, the initial value is D(l
,v,0)=DB(l-1,m-1)...(2
0) F(l, v, 0) = m-1... (21) (performed in block 3 of Figure 7), the recurrence formula is expressed as U(m) and L(m) (This is done in block 4 of FIG. 7.) As shown in Figure 2, the calculations for (m, n) points are (m-1, n), (m-1
, n-1) and (m-t, n-2). The calculation for the next (m, n-1) point is (m-t, n
-1), (m-1, n-2), (m-1, n-3) 3
It is calculated from the dissimilarity of the points (m-1, n).Since the dissimilarity of the point is not used, the calculation result of the (m, n) point is (m-1, n)
Even if it is stored in the point (m, n-1), it does not affect the calculation of the point (m, n-1).

ゆえにｎを減少させる方向で計算を進めれば、ｍ−１点
の相異度とｍ点の相異度の記憶エリアを共有することが
できる。上記晰化式計算を縦１列実行した後、単語標準
パタンの終端Ｎｖにおける相異度Ｄ（ｌ、ｖ、Ｎｖ）と
それまで計算された最小単語相異度である桁相異度ＤＢ
（ｌ、ｍ）と比較し、得られた相異度Ｄ（ｌ、ｖ、Ｎｖ
）の方が小さい場合は、その相異度Ｄ（ｌ、ｖ、ＮＹ）
を桁相異度ＤＢ（ｌ、ｍ）とし、その単語標準パタンの
属するカテゴリＶを桁認識カテゴリＷ（ｌ、ｍ）とし、
その相異度Ｄ（ｌ、ｖ、ＮＹ）が得られたマッチング経
路情報Ｆ（ｌ、ｖ、Ｎｖ）を桁経路情報ＦＢ（ｌ，ｍ）
とする（第７図のブロック５で行われる）。このように
して行われる縦１列の相異度計算（第７図のブロック２
，３．４．５の計算）をＶ個の単語標準パタンについて
実行する。次に入力パタンの時間点ｍを１つ増加して同
様の縦１列の相異度計算を７個の単語標準パタンについ
て実行し、入力パタンの終端Ｍまで求める。最後に桁経
路情報ＦＢ（ｌ、ｍ）と桁認識カテゴリＷ（ｌ、ｍ）よ
り入カパタンの判定を行う。Therefore, if the calculation proceeds in the direction of decreasing n, the storage area for the dissimilarity of the m-1 point and the dissimilarity of the m point can be shared. After executing the above clarification formula calculation in one column, the dissimilarity D (l, v, Nv) at the end Nv of the word standard pattern and the digit dissimilarity DB which is the minimum word dissimilarity calculated so far
(l, m), and obtained dissimilarity D(l, v, Nv
) is smaller, its dissimilarity D(l, v, NY)
Let be the digit dissimilarity degree DB (l, m), let the category V to which the word standard pattern belongs be the digit recognition category W (l, m),
The matching route information F(l, v, Nv) from which the degree of dissimilarity D(l, v, NY) was obtained is converted into the digit route information FB(l, m)
(performed in block 5 of FIG. 7). Dissimilarity calculation for one vertical column performed in this way (block 2 in Figure 7)
, 3.4.5) are performed for V word standard patterns. Next, the time point m of the input pattern is increased by one, and the same dissimilarity calculation in one vertical column is performed for seven word standard patterns to obtain the end point M of the input pattern. Finally, the input pattern is determined based on the digit path information FB (l, m) and the digit recognition category W (l, m).

この判定の方法はＨＬＢ法の判定方法と同様である。ま
ず、入力パタンの終端Ｍにおける各桁の桁相異度ＤＢ（
ｌ、Ｎ）より許された桁すなわちＬｍｉｎ桁よりＬｍａ
ｘ桁の間で最小値を求め（第７図のブロック６で行われ
る）、最小値の得られた桁Ｌが入力パタンの桁数である
。さらに第Ｌ桁目の認識結果Ｒ（Ｌ）をＷ（Ｌ、Ｍ）よ
り得、また桁経路情報ＦＢ（Ｌ、Ｍ）より第Ｌ−１桁目
の終端を得る（第７図のブロック７で行われる）。前記
操作を順にくり返すことによって各桁での認識結果Ｒ（
ｌ）が得られる。This determination method is similar to the determination method of the HLB method. First, digit difference degree DB (
l, N), that is, Lmin digit, Lma
The minimum value is found among x digits (this is done in block 6 of FIG. 7), and the digit L for which the minimum value is obtained is the number of digits of the input pattern. Furthermore, the recognition result R(L) of the L-th digit is obtained from W(L, M), and the termination of the L-1st digit is obtained from the digit path information FB(L, M) (block 7 in FIG. ). By repeating the above operations in order, the recognition result R(
l) is obtained.

本発明の連続音声認識装置は前記のＶＬＢ法を実行する
装置であるから次のような各部を必要とする。すなわち
、入カパタンＡと連続音声標準パタンＣ＝Ｂｖｌ、Ｂｖ
２、・・・、Ｂｖｌ、・・・、ＢｖＬｍａｘと、以下の
各部に対して入力パタンの時間点を不す信号ｍを１から
Ｍまで変化させ、各ｍに関して単語を示す信号Ｖを１か
ら■まで変化させ、さらに各Ｖに関して桁を示す信号ｌ
を１からＬｍａｘまでおよび標準パタンの時間点を示す
信号ｎを１からＮｖｔで変化させて与える制御部と、上
記制御部の信号ｌ、ｖ、ｎによって番地指定される相異
度メモリ部Ｄ（ｌ、ｖ、ｎ）と、経路情報メモリ部Ｆ（
ｌ、ｖ、ｎ）とを有し、各時間点ｍにおいて前記制御部
より順次指定される単語Ｖの単語標準パタンｂ、ｖ、ｎ
＝１〜Ｎｖと入力パタンａｍとのベクトル間距離ｄ（ａ
ｍ、ｂｎｖ）ｎ＝ｌ〜Ｎ７を求める距離計算部と；この
距離を記憶する距離メモリ部ｄ（ｎ）と、各時間点ｍに
おいて、各桁ｌ、および各単語ｖに関して最初に初期条
件を時間点ｍ−１の結果である桁相異度ＤＢ（ｌー１．
ｍ−１）と桁経路情報ＦＢ（ｌ−１、ｍ−１）により与
え、前記距離ｄ（ｎ）と時間点ｍ−１における相異度Ｄ
（ｌ、ｖ、ｎ）と経路情報Ｆ（ｌ、ｖ、ｎ）とを参照し
て動的計画の漸化式を計算し時間点ｍにおける相異度Ｄ
（ｌ，ｖ、ｎ）と経路情報Ｆ（ｌ、ｖ、ｎ）を順次求め
、単語相異度Ｄ（ｌ、Ｖ、ＮＹ）と単語経路情報Ｆ（ｌ
，ｖ、Ｎｖ）を求める漸化式計算部と；各時間点ｍにお
いて、各桁ｌに関して前記漸化式計算部で求められた各
単語相異度Ｄ（ｌ、ｖ、ＮＹ）の中より最小を求め、こ
れを桁相異度ＤＢ（ｌ，ｍ）とし、これに対応した単語
経路情報Ｆ（ｌ、ｖ、ＮＹ）を桁経路情報ＦＢ（ｌ，ｍ
）とし、最小値が得られた単語各ｖを桁認織カテゴリＷ
（ｌ、ｍ）とする桁相異度計算部と；これらを記憶する
ための桁相異度メモリ部ＤＢ（ｌ、ｍ）と桁経路情報メ
モリ部ＦＢ（ｌ、ｍ）と、桁認識カテゴリメモリ部Ｗ（
ｌ、ｍ）と、桁経路情報ＦＢ（ｌ，ｍ）と桁ｇ、ｍカテ
ゴリＷ（ｌ、ｍ）に基づいて逆順に入力パタンの各桁の
カテゴリを判定し出力する判定部とを有している。Since the continuous speech recognition device of the present invention is a device that executes the above-mentioned VLB method, it requires the following parts. That is, input pattern A and continuous voice standard pattern C = Bvl, Bv
2, ..., Bvl, ..., BvLmax, and the signal m that misses the time point of the input pattern for each part below is varied from 1 to M, and the signal V indicating a word for each m is varied from 1 to M. ■ A signal l indicating the digit for each V
from 1 to Lmax and a signal n indicating the time point of the standard pattern by varying it from 1 to Nvt, and a dissimilarity memory section D (addressed by the signals l, v, n of the control section). l, v, n) and the route information memory section F(
l, v, n), and word standard patterns b, v, n of the word V that are sequentially specified by the control unit at each time point m.
= 1~Nv and the input pattern am vector distance d(a
m, bnv) n=l~N7; a distance memory unit d(n) that stores this distance; and at each time point m, initial conditions are first set for each digit l and each word v. Digit dissimilarity DB (l-1.
m-1) and the digit path information FB (l-1, m-1), and the distance d(n) and the degree of dissimilarity D at the time point m-1
(l, v, n) and the route information F(l, v, n) to calculate the recurrence formula of the dynamic program and calculate the dissimilarity degree D at time point m.
(l, v, n) and route information F(l, v, n) are sequentially obtained, word dissimilarity degree D(l, V, NY) and word route information F(l
, v, Nv); at each time point m, from each word dissimilarity degree D(l, v, NY) calculated by the recurrence formula calculation unit for each digit l; Find the minimum, set this as digit dissimilarity DB(l,m), and set the corresponding word path information F(l, v, NY) as digit path information FB(l,m
), and each word v for which the minimum value was obtained is defined as a digit recognition category W
(l, m); a digit difference memory unit DB (l, m) for storing these, a digit path information memory unit FB (l, m), and a digit recognition category. Memory part W (
l, m), and a determination unit that determines and outputs the category of each digit of the input pattern in reverse order based on the digit path information FB (l, m) and the digit g, m categories W (l, m). ing.

このように本発明の原理であるＶＬＢ法を用いれば相異
度計算を入力パタンの時間軸方向に進めることができる
。とれによって音声の入力が検出されるとすぐ計算を開
始し、音声の入力に同期して順次計算することができる
ので音声の終了と同時に第７図のブロック６、７の判定
処理を始めることができる。したがって従来技術である
ＨＬＢ法に比較し、認識応答時間が短縮できることにな
る。また、距離計算は、ＨＬＢ法では第６図のプロック
３に示すようにｎ、ｍ、ｖ、ｌのループで囲まれている
が、ＶＬＢ法で線絡７図のプロック３で示すようにｎ、
ｖ、ｍのループで囲まれている。すなわちＨＬＢ法にお
ける距離計算の回数はＮｖ。In this way, by using the VLB method, which is the principle of the present invention, the dissimilarity calculation can proceed in the time axis direction of the input pattern. As soon as a voice input is detected due to a break, calculations can be started, and the calculations can be performed sequentially in synchronization with the voice input, so it is possible to start the determination process of blocks 6 and 7 in FIG. 7 at the same time as the voice ends. can. Therefore, compared to the conventional HLB method, the recognition response time can be shortened. In addition, distance calculation is surrounded by a loop of n, m, v, l as shown in block 3 in Figure 6 in the HLB method, but n, as shown in block 3 in diagram 7 in the VLB method. ,
It is surrounded by a loop of v and m. That is, the number of distance calculations in the HLB method is Nv.

Ｍ・■・Ｌｍａｘであり、ＴＬＢ法における距離計算の
回数はＮ’−Ｖ、Ｍである。したがって従来技術である
ＨＬＢ法に比較し、距離計算の計算量が１／Ｌｍａｘに
減少できることになる。M.■.Lmax, and the number of distance calculations in the TLB method is N'-V,M. Therefore, compared to the conventional HLB method, the amount of distance calculation can be reduced to 1/Lmax.

次に本発明の装置の具体的構成を図面を参照しながら説
明する。第９図は、本発明の一構成例を示すブロック図
であり、第１０図は制御指令信号のタイムチャートであ
る。制御部１０は、ｍ１。Next, the specific configuration of the apparatus of the present invention will be explained with reference to the drawings. FIG. 9 is a block diagram showing a configuration example of the present invention, and FIG. 10 is a time chart of control command signals. The control unit 10 is m1.

ｎ１、ｖ１などの制御指令信号を第１０図に示すように
発することによって、他の各部を制御する機能を持つが
、その詳細は他の各部の動作に関連してその都度説明す
る。人力部１１は、信号Ｓｐｅｅ−ｃｈｉｎで与えられ
る入力音声を分析し一定時間ごとに特徴ベクトルを出力
する。この音声分析は例えば、多チャンネルのフィルタ
より構成されるフィルタバンクによる周波数分析などが
ある。また入力部１１には入力音声のレベルを監視し、
音声の始端、終端を検出する機能を持ち、その検出した
時点を制御部１０へ信号ＳＰにより伝える。It has a function of controlling other parts by issuing control command signals such as n1 and v1 as shown in FIG. 10, but the details will be explained each time in relation to the operation of the other parts. The human power section 11 analyzes the input speech given by the signal Spee-chin and outputs a feature vector at regular intervals. This audio analysis includes, for example, frequency analysis using a filter bank composed of multi-channel filters. The input unit 11 also monitors the level of input audio,
It has a function of detecting the start and end of audio, and transmits the detected time to the control unit 10 by a signal SP.

入力パタンバッファ１２は、音声の始端が検出された後
、信号ｍ３に従って入力部１１より与えられる特徴ベク
トルａｍを記憶する。信号ｍ３は入力パタンの時間点ｍ
に対応した信号である。標準パタンメモリ部１３は、７
個の単語標準パタンＢ１、Ｂ２．・・・Ｂｖを記憶し、
糠準パタン長メモリ部１４は単語標準パタンＢｖの長さ
Ｎｖを記憶している。The input pattern buffer 12 stores the feature vector am given from the input unit 11 in accordance with the signal m3 after the start of the voice is detected. Signal m3 is time point m of the input pattern
This is a signal corresponding to The standard pattern memory section 13 has 7
word standard patterns B1, B2. ...Remember Bv,
The bran standard pattern length memory section 14 stores the length Nv of the word standard pattern Bv.

信号ｖｌは連続音声標準パタンの単語ｖに対応する信号
であり、制御部１０は、信号ｖ１に従って、標準パタン
長メモリ部１４より単語標準パタンＢｖの長さＮｖを読
み出し、単語標準パタンの時間点ｎに対応する信号ｎ１
を発生する。信号ｎ１に従りて入力パタンバッファ１２
より入カパタンの特徴ベクトルａｍが読み出され、標準
パタンメモリ部よりｂ１ｖ、ｂ２ｖ、・・・、ｂｖＮｖ
が順次読み出され距離計算部１５において（８）式が計
算され、距離ｄ（ｎ）、ｎ＝１．２・・・Ｎｖが距離メ
モリ部１６へ記憶される。The signal vl is a signal corresponding to the word v of the continuous speech standard pattern, and the control unit 10 reads the length Nv of the word standard pattern Bv from the standard pattern length memory unit 14 according to the signal v1, and calculates the time point of the word standard pattern. signal n1 corresponding to n
occurs. Input pattern buffer 12 according to signal n1
The feature vector am of the input pattern is read out from the standard pattern memory section, and b1v, b2v, ..., bvNv are read out from the standard pattern memory section.
are sequentially read out, the distance calculation section 15 calculates the equation (8), and the distance d(n), n=1.2...Nv, is stored in the distance memory section 16.

距離計算部１５において第１１図に示すように初めに信
号ｎ１にてアキュムレータ１５３がクリヤされ、入力パ
タンバッファ１２と標準パタンメモリ部１３よりｒ個の
データが読み込まれ、絶対値回路１５１にて絶対値を求
め、加算器１５２にて加算され、（８）式の距離Ｄｉａ
（ａｍ、ｂｎｖ）がアキュレータ１５３にて求まり、こ
の距離が距離メモリ部１６へ出力される。In the distance calculation section 15, as shown in FIG. The values are calculated and added by the adder 152, and the distance Dia of equation (8) is obtained.
(am, bnv) is determined by the accurator 153, and this distance is output to the distance memory section 16.

漸化式計算の初期値のセットは音声の入力される前に制
御部１０の信号ＣＬにより行われ、相異度メモリ部１８
、桁相異度メモリ部２１へ（１７）、（１８）、（１９
）式で示した値がセットされる。The initial value for the recurrence formula calculation is set by the signal CL of the control unit 10 before the voice is input, and the initial value is set by the signal CL of the control unit 10.
, to the digit difference memory unit 21 (17), (18), (19
) is set.

漸化式計算部１７は、第７図のプロック４を行う部分で
あり、漸化式（２２）、（２３）（２４）を実行する。The recurrence formula calculation unit 17 is a part that performs block 4 in FIG. 7, and executes recurrence formulas (22), (23), and (24).

すなわち、漸化式計算部１７は、第１２図に示すように
３つの相異度レジスタＤ１、Ｄ２．Ｄ３と、その３つの
レジスタＤ１、Ｄ２．Ｄ３の最小値を計算する比較回路
１７１と、加算器１７２と、３つの経路レジスタＦ１、
Ｆ２、Ｆ３より構成される。制御部１０より発せられた
信号ｎ２によって相異度メモリ部１８と経路メモリ部１
９より３つの相異度Ｄ（ｌ、ｖ、ｎ）、Ｄ（ｌ、ｖ、ｎ
−１）、Ｄ（ｌ、ｖ、ｎ−２）と３つの経路情報Ｆ（ｌ
、ｖ、ｎ）、Ｆ（ｌ、ｖ、ｎ−１）、Ｆ（ｌ、ｖ、ｎ−
２）を読み出しそれぞれ相異度レジスタＤ１．Ｄ２．Ｄ
３と経路レジスタＦ１、Ｆ２、Ｆ３へ格納する。比較回
路１７１は３つの相異度レジスタＤＩ、Ｄ２．Ｄ３より
最小値を検出し、その最小値が得られた相異度レジスタ
ーＤｎ（ｎは１、２、３のどれか）に対応した経路レジ
スタＦｎを選択するゲート信号ｎを発する。前記ゲート
信号ｎにより選択された経路レジスターＦｎの内容が経
路メモリ部１９のＦ（ｌ、ｖ、ｍ）へ格納される。また
、比較回路１７１より出力された相異度の最小値Ｄ（ｌ
、ｖ、ｎ）は、距離メモリ部１６より読み出された距離
ｄ（ｎ）と加算器１７２によって加算され、相異度メモ
リ部１８ヘ格納される。That is, the recurrence formula calculation unit 17 has three dissimilarity registers D1, D2 . D3 and its three registers D1, D2 . A comparison circuit 171 that calculates the minimum value of D3, an adder 172, three path registers F1,
Consists of F2 and F3. The dissimilarity memory section 18 and the route memory section 1 are controlled by the signal n2 issued from the control section 10.
9, the three dissimilarities D(l, v, n), D(l, v, n
-1), D(l, v, n-2) and three route information F(l
, v, n), F(l, v, n-1), F(l, v, n-
2) are read out and set in the respective difference registers D1. D2. D
3 and stored in route registers F1, F2, and F3. The comparison circuit 171 includes three dissimilarity registers DI, D2 . The minimum value is detected from D3, and a gate signal n is generated to select the path register Fn corresponding to the dissimilarity register Dn (n is any one of 1, 2, or 3) from which the minimum value was obtained. The contents of the route register Fn selected by the gate signal n are stored in F(l, v, m) of the route memory section 19. Further, the minimum value D(l
, v, n) are added to the distance d(n) read from the distance memory unit 16 by an adder 172, and stored in the dissimilarity memory unit 18.

この漸化式計算がｎ＝Ｕ（ｍ）よりＬ（ｍ）まで算出さ
れ、この結果である単語相異度Ｄ（ｌ、ｖ、Ｎｖ）が各
ｖおよび各ｌに対して算出される。This recurrence formula calculation is calculated from n=U(m) to L(m), and the resulting word dissimilarity D(l, v, Nv) is calculated for each v and each l.

桁相異度計算部２０は、第７図のプロック５を行う部分
であり、Ｖ個の単語相異度Ｄ（ｌ、ｖ、Ｎｖ）の最小値
を逐次求める。すなわち、桁相異度計算部２０は第１３
図に示すように、比較回路２０１と、単語相異度Ｄ（ｌ
、ｖ、Ｎｖ）を保持するレジスタ２０２と、単語標準パ
タンの属するカテゴリｖを保持するレジスタ２０３と、
経路情報Ｆ（ｌ，ｖ、Ｎｖ）を保持するレジスタ２０４
より構成される。The digit dissimilarity calculation unit 20 is a part that performs block 5 in FIG. 7, and sequentially calculates the minimum value of V word dissimilarities D(l, v, Nv). That is, the digit difference calculation unit 20
As shown in the figure, the comparison circuit 201 and the word dissimilarity degree D(l
, v, Nv), and a register 203 that holds the category v to which the standard word pattern belongs.
Register 204 that holds route information F (l, v, Nv)
It consists of

信号ｌ１は信号ｖｌ１つの区間にＬｍａｘ個発生される
。この信号ｌ１は、連続音声標準パタンの桁ｌに対応す
る信号である。制御部１０より発せられた信号ｌ１に従
い、相異度メモリ部１８と経路メモシ部１９より単語相
異度Ｄ（ｌ，ｖ、Ｎｖ）と単語経路情報Ｆ（ｌ、ｖ、Ｎ
ｖ）が読み出され、それぞれレジスタ２０３と２０５へ
格納され、単語標準パタンの属するカテゴリｖをレジス
タ２０４へ格納される。一方、比較回路２０１は前記単
語相異度Ｄ（ｌ、ｖ、Ｎｖ）と桁相異度メモリ部２１よ
り読み出された桁相異度ＤＢ（ｌ，ｍ）と比較し、単語
相異度Ｄ（ｌ、ｖ、Ｎｖ）がより小さいと判定するとゲ
一ト信号ｖを発生する。ゲート信号ｖに従ってレジスタ
２０２，２０３．２０４に保持されていた単語相異度Ｄ
（ｌ，ｖ、Ｎｖ）、カテゴリｖ単語経路情報Ｆ（ｌ、ｖ
、Ｎｖ）がそれぞれ桁相異度メモリ部２１のＤＢ（ｌ、
ｍ）、桁認識カテゴリメモリ部２２のＷ（ｌ、ｍ）、桁
経路メモリ部２３のＦＢ（ｌ、ｍ）へ格納される。さら
に制御部１０より信号ｌ１につづいて発せられる信号ｌ
１２によりて第７図のブロック３にて行われる部分であ
る縦１列の相異度計算の（２０）、（２１）式に示した
初期セットが行われる。すなわち桁相異度メモリ部２１
よりＤＢ（ｌ、ｍ−１）が読み出され、相異度メモリ部
１８のＤ（ｌ、ｖ、０）へ格納され、経路メモリ部のＦ
（ｌ、ｖ、０）へｍ−１が格納される。判定部２４は、
第７図のブロック６．７を行う部分であり、桁経路情報
ＦＢ（ｌ，ｍ）と桁認識カテゴリＷ（ｌ、ｍ）より入力
パタンの各桁の認識結果Ｒ（ｌ）を出力する。すなわち
判定部２４は第１４図に示すように、比較回路２４１と
、最小桁相異度を保持するレジスタ２４２と、桁数を保
持するレジスタ２４３と、桁経路情報Ｆ（ｌ、ｍ）を保
持するレジスタ２４４と認識結果を保持するレジスタ２
４５より構成される。音声の終端が検出されると入力部
１１より信号ＳＰによって制御部１０に通知され、つづ
いて制御部ｌＯは判定部２４へ信号ｍ１を発し、判定部
２４は判定処理を開始する。判定制御部２４６は信号ｍ
１を受けた彼、信号ｌ３を桁相異度メモリ部２１へ発す
る。The signal l1 is generated Lmax times in one section of the signal vl. This signal l1 is a signal corresponding to digit l of the continuous voice standard pattern. According to the signal l1 issued from the control unit 10, the word dissimilarity degree D (l, v, Nv) and the word route information F (l, v, N
v) are read out and stored in registers 203 and 205, respectively, and the category v to which the word standard pattern belongs is stored in register 204. On the other hand, the comparison circuit 201 compares the word dissimilarity degree D(l, v, Nv) with the digit dissimilarity degree DB(l, m) read out from the digit dissimilarity memory unit 21, and determines the word dissimilarity degree. If it is determined that D(l, v, Nv) is smaller, a gate signal v is generated. Word dissimilarity degree D held in registers 202, 203, and 204 according to gate signal v
(l, v, Nv), category v word path information F(l, v
, Nv) are respectively DB(l, Nv) of the digit difference memory unit 21.
m), are stored in W(l,m) of the digit recognition category memory section 22, and FB(l,m) of the digit path memory section 23. Furthermore, a signal l is issued from the control unit 10 following the signal l1.
12, the initial setting shown in equations (20) and (21) of the dissimilarity calculation for one vertical column, which is the part performed in block 3 of FIG. 7, is performed. In other words, the digit difference memory section 21
DB(l, m-1) is read out, stored in D(l, v, 0) of the dissimilarity memory section 18, and stored in F of the path memory section.
m-1 is stored in (l, v, 0). The determination unit 24
This is the part that performs block 6.7 in FIG. 7, and outputs the recognition result R(l) of each digit of the input pattern from the digit path information FB(l,m) and the digit recognition category W(l,m). That is, as shown in FIG. 14, the determination unit 24 includes a comparison circuit 241, a register 242 that holds the minimum digit difference, a register 243 that holds the number of digits, and digit path information F (l, m). register 244 to hold the recognition result and register 2 to hold the recognition result.
It consists of 45 pieces. When the end of the voice is detected, the input section 11 notifies the control section 10 by a signal SP, and the control section 10 then issues a signal m1 to the determination section 24, and the determination section 24 starts determination processing. The determination control unit 246 receives the signal m
1, he issues a signal l3 to the digit difference memory section 21.

信号ｌ３に従って、桁相異度メモリ部２１より入力パタ
ンの終端Ｍでの桁相異度ＤＢ（ｌ、Ｎ）が順次読み出さ
れ、比較回路２４１によって逐次最小値を求めレジスタ
Ｄへ格納され、その時の桁数ｌがレジスタＬへ格納され
る。信号ｌ３に従って、Ｌｍａｘ個の桁相異度が読み出
された後、レジスタＬの内容が入力パタンの桁数を示し
ている。判定制御部２４６はｌ＝Ｌ、ｍ＝Ｍとしてアド
レス信号ｍ２を桁経路メモリ部２３と桁認識カテゴリメ
モリ部２２へ発し、ＦＢ（Ｌ、Ｍ）とＷ（Ｌ、Ｍ）が読
み出され、レジスタＦとレジスタＲへ格納される。According to the signal l3, the digit disparity DB(l, N) at the end M of the input pattern is sequentially read out from the digit disparity memory unit 21, and the minimum value is sequentially determined by the comparator circuit 241 and stored in the register D. The number of digits l at that time is stored in register L. After Lmax digit differences are read out according to signal l3, the contents of register L indicate the number of digits of the input pattern. The determination control unit 246 issues an address signal m2 to the digit path memory unit 23 and digit recognition category memory unit 22 with l=L and m=M, and FB (L, M) and W (L, M) are read out. Stored in register F and register R.

レジスタＲの内容が認識結果として出力される。The contents of register R are output as the recognition result.

さらに判定制御部２４６はｌ＝ｌ−、ｍ＝Ｆとしてアド
レス信号ｍ２を桁経路メモリ部２３と桁認識力テゴリメ
モリ部２２へ発し、ＦＢ（ｌ，ｍ）とＷ（ｌ、ｍ）が読
み出されレジスタＦとレジスタに格納される。この処理
を順次Ｌより１まで操り返すことによりＬ桁の認識結果
がレジスタＲより出力される。Furthermore, the determination control unit 246 issues an address signal m2 to the digit path memory unit 23 and the digit recognition ability category memory unit 22 with l=l− and m=F, and FB(l, m) and W(l, m) are read out. and stored in register F and register. By repeating this process sequentially from L to 1, the recognition result of L digits is output from register R.

以上、本発明の原理とその一構成例を説明したが、これ
らの記載は本発明の範囲を限定するものではない。特に
本発明の原理であるＶＬＢ法の説明において計算のルー
プの順序を一番内側よりｎ・ｌ、ｖ、ｍとしたが、ｌ、
ｎ、ｖ、ｍすることもＶＬＢ法を導出した同様な理由に
より可能である。Although the principle of the present invention and one configuration example thereof have been explained above, these descriptions do not limit the scope of the present invention. In particular, in the explanation of the VLB method, which is the principle of the present invention, the order of the calculation loops is n・l, v, m from the innermost, but l,
It is also possible to use n, v, and m for the same reason as the reason for deriving the VLB method.

また、桁相異度ＤＢ（ｌ、ｍ）、桁経路情報ＦＢ（ｌ、
ｍ）、桁認識カテゴリＷ（ｌ、ｍ）より入力パタンの判
定を行う部分の説明において、ＤＢ（ｌ、Ｍ）の最小値
を求め入力パタンの桁数を判定しているが、ＩＥＥＥ
ＴＲＡＮＳＡＣＴＩＯＮＳＯＮＡＣＯＵＳＴＩＣＳ、
ＳＰＥＥＣＨ、ＡＮＤＳＩＧＮＡＬＰＲＯＣＥＳＳＩ
ＮＧ。In addition, digit difference degree DB (l, m), digit route information FB (l,
m), in the explanation of the part in which the input pattern is determined from the digit recognition category W (l, m), the minimum value of DB (l, M) is determined and the number of digits of the input pattern is determined.
TRANSACTIONS ON ACUSTICS,
SPEECH, AND SIGNAL PROCESSI
NG.

ＶＯＬＡＳＳＰ−２７，ＤＥＣＥＭＢＥＲ１９７９第
５８８頁より第５９５頁に記載されているような制約条
件のもとて入力パタンの桁数を判定する方法も可能であ
る。It is also possible to determine the number of digits of an input pattern under constraints such as those described in VOL ASSP-27, DECEMBER 1979, pages 588 to 595.

さらに、入力パタンａｍと標準パタンｂｎｖとの距離を
（８）式のような距離尺度を用いて説明したが、このか
わりに（２５）式のようなユークリッド距離、（２６）
式のような内積等を用いてよい。Furthermore, although the distance between the input pattern am and the standard pattern bnv was explained using a distance measure such as equation (8), instead of this, the Euclidean distance as shown in equation (25),
You may use an inner product, etc. as shown in Eq.

また、相異度を計算するための漸化式は（２２）、（２
３）、（２４）式の形の他にも種々前えられ、この（２
２）、（２３）、（２４）式の代わりに特公告５６−２
８２７８号に記載されている形も使用できることは明白
である。Also, the recurrence formula for calculating the degree of dissimilarity is (22), (2
In addition to the forms of equations 3) and (24), various forms have been prepared, and this (2)
2), (23), and (24) instead of Special Publication 56-2
It is clear that the forms described in No. 8278 can also be used.

[Brief explanation of drawings]

第１図は相異度計算を行う範囲および、マツチング経路
の例を示した図であり、第２図は漸化式において許され
ているマツチング経路を示した図であり、第３図はＨＬ
Ｂ法の計算順辱を示した図であり、第４図はＨＬＢ法に
おける第ｌ桁目の計算順序を示した図であり、第５図は
、判定処理の計算順序を示した図であり、第６図（１）
および（２）はＨＬＢ法の計算手順を示すフローチャー
トであり、第７図（１）および（２）は本発明の原理で
あるＶＬＢ法の計算手順を示すフローチャートであり、
第８図はＶＬＢ法の計算順序を示した図であり、第９図
は本発明の一実施例の構成図であり、第１０図は本発明
の実施例の動作を説明するためのタイムチャートであい
、第１１図は本発明の一構成要素の一つである距離計算
部の構成図であり、第１２図は漸化式計算部の構成図で
あり、第１３図は桁相異度計算部の構成図であり、第１
４図は判定部の構成図であり、第１５図は相異度メモリ
部、経路情報メモリ部の構成図であり、第１６図は桁相
異度メモリ部、桁経路情報メモリ部、桁紹臓カテゴリメ
モリ部の構成図である。第９図、第１１図、第１２図、第１３図、第１４図にお
いて、１０・・・・・・制御部、１１・・・・・・入力
部、１２・・・・・・入力パタンバッファ、１３・・・
・・・標準パタンメモリ部、１４・・・・・・標準パタ
ン長メモリ部、１５・・・・・・距離計算部、１６・・
・・・・距離メモリ部、１７・・・・・・漸化式計算部
、１８・・・・・・相異度メモリ部、１９・・・・・・
経路情報メモリ部、２０・・・・・・桁相異度計算部、
２１・・・・・・桁相異度メモリ部、２２・・・・・・
桁認識カテゴリメモリ部、２３・・・・・・桁経路情報
メモリ部、２４・・・・・・判定部、１５１・・・・・
・絶対値回路、１５２・・・・・・加算器、１５３・・
・・・・アキュムレータ、１７１・・・・・・比較回路
、１７２・・・・・・加算器、Ｄ１、Ｄ２．Ｄ３・・・
・・・相異度を保持するレジスタ、Ｆ１、Ｆ２、Ｆ３・
・・・・・経路を保持するレジスタ、２０１・・・・・
・比較回路、２０２・・・・・・単語相異度を保持する
レジスタ、２０３・・・・・・カテゴリを保持するレジ
スタ、２０４・・・・・・経路情報を保持するレジスタ
、２４１・・・・・・比較回路、２４２・・・・・・最
小桁相異度を保持するレジスタ、２４３・・・・・・桁
数を保持するレジスタ、２４４・・・・・・桁経路情報
を保持するレジスタ、２４５・・・・・・認識結果を保
持し出力するレジスタ、２４６・・・・・・判定制御部
である。第１図第２図第３図入力パタン第４図５５６− 拾５図入力パタン第１０図ｙ＋４’ＩＪ（２１％）−−・Ｌ（η 第１１図第１２図第１３図第１４図り一−−−−−−−−−−−−−−−−−−−−」第１
５図１□Ｖ−Ｖ第１６図０−一−−−−−◆７ＦＬＭ手続補正書（自発）５８．２゜２日昭和年月日特許庁長官殿１、事件の表示昭和ｓ６年特許願第１９７１１４１号２
、発明の名称連続音声認識装置３、補正をする者事件との関係出願人東京都港区芝五丁目３３番１号４、代理人〒１０８東京都港区芝五丁目３７番８号住友三田ビル日
本電気株式会社内（６５９１）弁理士内原晋電話東京（０３）４５６−３１１１（大代表）５、補正
の対象明細書の「特許請求の範囲」、「発明の詳細な説明」の
欄および「図面」。６、補正の内容囚「特許請求の範囲」の欄別紙のとおり＠「発明の詳細な説明」の欄（２）第１５頁第５行目に「短くするためには。短くするためには、」とあるのを「短くするためには、
」と補正する。（３）第１７頁第１１行目にｒｎ（４Ｙ、ｎ）”Ｊとあ
るのをｒＤ（１，Ｙ、ｎ）＝ａ■」と補正する。（４）第２２頁第１４行目にｒＷ（４ｍ）と、」とある
のをｒＷ（４ｍ）と；」と補正する。（５）第２５頁第７行目に「初めに信号ｎｌにて」とあ
るのを「初めに信号Ｃｔ１５３にて」と補正する。（６）第２５頁第９行目に「１３よりｒ個の」とあるの
を「１３より信号ｒｌに従って１個の」と補正する。（７）第２５頁第１θ行目に「絶対値を求め」とあるの
を「差の絶対値を求め」と補正する。（８）第２６頁第５行目に「信号ｎ２によって」とある
のを「信号ｎ２．ｎ２１．ｎ２２によって」と補正する
。（９）ｌｉ２６頁第１７行目にｒＦ（４ｖ、ｍ）Ｊとあ
るのを「Ｆ（４マ、ｎ）」と補正する。（１０）第２６頁第２０行目に［メモリ部１８Ｊとある
のを［メモリ部１８のＤ（ｔ、マ、ｎ）」と補正する。（１１）第２７頁第１９行目に「２０３と２０５」とあ
るのを「２０２と２０４」と補正する。０２１）第２６頁第２０行目に［２０４Ｊとあるのを「
２０３Ｊと補正する。（１３）第２８頁第１１行目に「信号ｔ１につづいて発
せられる信号ｔ２Ｊを「信号Ｃｌ２Ｊと補正す石。 α◇第２８頁第１５行目に「ＤＢ（４ｍ−１）Ｊとある
のを「ＤＢ（ｔ−１，ｍ−１）Ｊと補正する。（ｌ）第２９頁第１５行目に「レジスタＤ」とあるのを
「レジスタ２４２」と補正する。 αの第２９頁第１６行目に「レジスタＬ」とあるのを「
レジスタ２４３」と補正する、αつ第２９頁第１８行目
に四」とあるのを［２４３Ｊと補正する。 αの第２９頁第２０行目に「ｍ＝」とあるのビｔ４゜■
２」と補正する。（１９）第３０頁第２行目に「レジスタＦとレジスタ几
」とあるのを「レジスタ２４４とレジスタ２４５」と補
正する。（１）第３０頁第３行目に「レジスタＢ」とあるのを「
レジスタ２４５」と補正する。（２１）１１１Ｅ３０頁第４行目にｒｔ−１＋、ｔｎ＝
ＦＪとあるのを［Ｌ＝１−１．ｍ＝（レジスタ２４４の
内容）」と補正する。（２＠第３０頁第７行目に「レジスタＦとレジスタ」と
あるのを「レジスタ２４４とレジスタ２４５」と補正す
る。（ｑｒ図面」・本願添付図面の第７−１図、第９図、第１０図、第１
１図、第１２図、第１３図および第１４図を別紙図画と
差し替える。別紙２、特許請求の範囲特徴ベクトルの時系列である１個以上の単語よりなる入
カパタン人＝−□、Ｊ８．・・・＋”ｍｓ・・”＋７Ｍ
とあらかじめ記憶されている７個の単語標準パタンＢＹ
＝１）？、ｂｒ、・・・、Ｔｏｌ、・・・、ＴｏＸｖ（
マ＝１，２．・・・、■）全組合せて得られる最大ＬＨ
ａｘ桁の連続音声標準パタ：／Ｃ−Ｂｖｌ、Ｂｖｌ、−
、ＢＶｌ、・、ＢＹＬａａｘとの間で入力パタンの時間
軸ｍと連続音声標準パタンの時間軸ｎとを対応させる時
間関数ｎ−の上の入力パタンＪ１１ｍと連続音声標準パ
タンＴｏ、とのベクトル間距離’（ａｍ、ｂｎ）の和と
して定義される相異度の最小値を求めるために、連続音
声標準パタンＣ＝＝Ｂｖｌ。Ｂ１．・・・、ＢＡ、・・・Ｉ３１ｘａ＊ｘを各桁ごと
に分割し。第を桁目における最適な時間関数ｎ−によって与えられ
るベクトル間距離の最小累積量を示す桁相異度ＤＢ（４
ｔｏ）と、この時間関数の先頭時間点を示す桁経路情報
ＦＢ（４ｍ）と、この時間関数上において最小累積距離
を与えた単語名Ｖである桁認識カテゴリＷ（ｊ、ｍ）と
を１桁ｔおよび入力パタンの時間点ｍに対して順次求め
、最後に入力パタンの桁数および各桁の認識結果を判定
する連続音声認識装置において、入力パタンの時間点を
示す信号ｍを１からＭまで変化させ、各１１に関して単
語を示す信号マを１からＶまで変化させ、さらに各マに
関して桁を示す信号ｔを１からＬ１！Ｉｎｘまでおよび
標準パタンの時間点を示す信号ｉｔｌからＮＹまで変化
させて与える制御部と；前記制御部の信号ｔ、マ、ｎに
よって番地指定される相異度メモリ部Ｄ（４ｖ、ｎ）と
経路情報メモリ部Ｆ（ｚｔ’ｔ”）と；各時間点ｍにお
いて前記制御部より順次指定される単語マの単語標準パ
タンＮ、Ｈ＝ｌ−Ｊｉｖと入力パタン−１とのベクトル
間距離’（ａ、、ｂｎｖ）ｎ＝１％ＮＹを求める距離計
算部と；この距離を記憶する距離メモリ部ｄ（ｎ）と；
各時間点ｍにおいて。各桁ｔ、および各単語マに関して最初に初期条件を時間
点ｍ−１の結果である桁相異度ＤＢ（ｔ−１゜ｍ−１）
と桁経路情報ＦＢ（Ｌ−１，ｖａ−１）Ｋより与え。前記距離ｄＩｎ）と時間点ｍ−１における相異度Ｄ（４
゜ｖ、ｎ）と経路情報Ｆ（４ｖ、ｎ）とを参照して動的
計画の漸化式を計算し時間点ｍにおける相異度Ｄ（４ｖ
、ｎ）と経路情報Ｆ（’ｔ’ｍ”）を順次求め、単語相
異度Ｄ（４マＮＶ）と単語経路情報Ｆ（４マＮＶ）を求
める漸化式計算部と；各時間点ｍにおいて、各桁ｔに関
して前記漸化式計算部で求められた各単語相異度Ｄ（４
ｖ、ＮＹ）の中より最小を求め、これを桁相異度ＤＢ（
４ｍ）とし、これに対応した単語経路情報Ｆ（４ｖ、Ｎ
Ｙ）を桁経路情報ＦＢ（４ｍ）とし、最小値が得られた
単語名Ｖを桁認識カテゴ！ＪＷ（４ｍ）とする桁相異度
計算部と；これらを記憶するための桁相異度メモリ部Ｄ
Ｂ（４ｍ）と桁経路情報メモリ部ＦＢ（４ｍ）と、桁認
識カテゴリメモリ部Ｗ（４ｍ）とｊ桁経路情報ＦＢ（４
ｍ）と桁認識カテゴリＷ（４ｍ）に基づいて逆順に入力
パタンの各桁のカテゴリを判定し出力する判定部とを有
することを特徴とする連続音声認識装置。第１Ｚ回ｎｚ、ｎｚｔ、ηＺ２Figure 1 is a diagram showing the range for dissimilarity calculation and examples of matching paths, Figure 2 is a diagram showing matching paths allowed in recurrence formulas, and Figure 3 is a diagram showing HL
FIG. 4 is a diagram showing the calculation order of the first digit in the HLB method, and FIG. 5 is a diagram showing the calculation order of the determination process. , Figure 6 (1)
and (2) are flowcharts showing the calculation procedure of the HLB method, and FIGS. 7(1) and (2) are flowcharts showing the calculation procedure of the VLB method which is the principle of the present invention,
FIG. 8 is a diagram showing the calculation order of the VLB method, FIG. 9 is a block diagram of an embodiment of the present invention, and FIG. 10 is a time chart for explaining the operation of the embodiment of the present invention. Fig. 11 is a block diagram of the distance calculation section which is one of the components of the present invention, Fig. 12 is a block diagram of the recurrence formula calculation section, and Fig. 13 is a block diagram of the digit dissimilarity calculation section. This is a configuration diagram of the first part.
Fig. 4 is a block diagram of the determination section, Fig. 15 is a block diagram of the dissimilarity degree memory section and route information memory section, and Fig. 16 is a block diagram of the digit dissimilarity degree memory section, digit route information memory section, and digit introduction section. FIG. 2 is a configuration diagram of an organ category memory section. In FIG. 9, FIG. 11, FIG. 12, FIG. 13, and FIG. 14, 10... control section, 11... input section, 12... input pattern Buffer, 13...
...Standard pattern memory section, 14...Standard pattern length memory section, 15...Distance calculation section, 16...
... Distance memory section, 17 ... Recurrence formula calculation section, 18 ... Dissimilarity memory section, 19 ...
route information memory section, 20... digit difference calculation section,
21... Digit difference memory section, 22...
Digit recognition category memory section, 23... Digit route information memory section, 24... Judgment section, 151...
・Absolute value circuit, 152... Adder, 153...
. . . Accumulator, 171 . . . Comparison circuit, 172 . . . Adder, D1, D2. D3...
...Registers that hold the degree of difference, F1, F2, F3・
...Register that holds the route, 201...
Comparison circuit, 202...Register for holding word similarity, 203...Register for holding category, 204...Register for holding route information, 241... ... Comparison circuit, 242 ... Register that holds the minimum digit difference, 243 ... Register that holds the number of digits, 244 ... Holds digit route information 245...Register for holding and outputting recognition results, 246...Determination control unit. Fig. 1 Fig. 2 Fig. 3 Input pattern Fig. 4 556- Fig. 5 Input pattern Fig. 10 y+4'IJ (21%) -- L −−−−−−−−−−−−−−−−−−−−" 1st
5 Figure 1 □V-V Figure 16 0-1---◆7FLM Procedural amendment (spontaneous) 58.2゜2 Date Showa Date Mr. Commissioner of the Japan Patent Office 1, Indication of the case Patent application filed in Showa s6 No. 1971141 2
, Name of the invention Continuous speech recognition device 3, Person making the amendment Relationship to the case Applicant: 5-33-1-4, Shiba 5-chome, Minato-ku, Tokyo, Agent: Sumitomo Mita, 5-37-8 Shiba, Minato-ku, Tokyo 108 Building NEC Co., Ltd. (6591) Patent Attorney Susumu Uchihara Telephone Tokyo (03) 456-3111 (Main Representative) 5, "Claims" and "Detailed Description of the Invention" columns of the specification to be amended and "drawing". 6. Contents of the amendment: "Claims" column As shown in the attached sheet @ "Detailed Description of the Invention" column (2) Page 15, line 5, "To make it shorter. To make it shorter. ,” is changed to “In order to shorten it,
” he corrected. (3) In the 11th line of page 17, rn(4Y,n)"J is corrected to rD(1,Y,n)=a■". (4) In the 14th line of page 22, "rW(4m)" is corrected to "rW(4m);". (5) In the 7th line of page 25, the phrase "first with signal nl" is corrected to "first with signal Ct153." (6) On the 9th line of page 25, the phrase "r from 13" is corrected to "from 13, from 1 according to signal rl." (7) In the 1st θ line of page 25, the phrase "calculate the absolute value" is corrected to "calculate the absolute value of the difference." (8) In the fifth line of page 26, the phrase "by signal n2" is corrected to "by signal n2.n21.n22." (9) On page 26, line 17 of li, correct rF(4v, m)J to "F(4ma, n)". (10) In the 20th line of page 26, correct "memory section 18J" to "D(t,ma,n) of memory section 18". (11) Correct "203 and 205" in the 19th line of page 27 to "202 and 204." 021) On page 26, line 20, replace [204J with “
Corrected to 203J. (13) In the 11th line of page 28, it says, ``A stone that corrects the signal t2J that is emitted following the signal t1 as the signal Cl2J. (1) Correct "register D" on page 29, line 15 to "register 242.""RegisterL" on page 29, line 16 of α is replaced with "
The text "4" on the 18th line of page 29 is corrected to [243J]. Bit 4゜■ “m=” is written on page 29, line 20 of α
2”. (19) In the second line of page 30, "register F and register 几" is corrected to "register 244 and register 245." (1) On the third line of page 30, replace “Register B” with “
register 245”. (21) rt-1+, tn= on the 4th line of page 30 of 111E
The one that says FJ is [L=1-1. m=(contents of register 244)". (2@Page 30, line 7, "register F and register" is corrected to "register 244 and register 245". (qr drawings) - Figures 7-1 and 9 of the drawings attached to this application , Fig. 10, 1st
Replace Figures 1, 12, 13, and 14 with the attached drawings. Attachment 2, time series of claim feature vectors consisting of one or more words = -□, J8. ...+"ms..."+7M
7 standard word patterns BY that are pre-memorized as
=1)? ,br,...,Tol,...,ToXv(
Ma = 1, 2. ..., ■) Maximum LH obtained by all combinations
Continuous voice standard pattern of ax digits: /C-Bvl, Bvl, -
, BVl, ., BYLaax, the vector between the input pattern J11m on the time function n- that makes the time axis m of the input pattern correspond to the time axis n of the continuous voice standard pattern and the continuous voice standard pattern To. In order to find the minimum value of the degree of dissimilarity defined as the sum of distances' (am, bn), continuous speech standard pattern C==Bvl. B1. ..., BA, ...I31xa*x is divided into each digit. Digit dissimilarity DB (4
to), the digit path information FB (4m) indicating the first time point of this time function, and the digit recognition category W (j, m) which is the word name V that gave the minimum cumulative distance on this time function. In a continuous speech recognition device that sequentially calculates the digit t and the time point m of the input pattern, and finally determines the number of digits of the input pattern and the recognition result of each digit, the signal m indicating the time point of the input pattern is calculated from 1 to M. For each 11, the signal ma indicating a word is changed from 1 to V, and the signal t indicating a digit for each ma is changed from 1 to L1! a control unit that varies signals up to Inx and from itl to NY indicating time points of the standard pattern; and a dissimilarity memory unit D (4v, n) whose address is designated by the signals t, ma, and n of the control unit; Path information memory unit F(zt't''); inter-vector distance between the word standard pattern N, H=l-Jiv of the word matrix sequentially specified by the control unit at each time point m and the input pattern -1'(a,,bnv); a distance calculation unit that calculates n=1%NY; a distance memory unit d(n) that stores this distance;
At each time point m. For each digit t and each word mark, first set the initial condition to digit dissimilarity DB (t-1゜m-1) which is the result of time point m-1.
is given from the digit route information FB (L-1, va-1)K. The distance dIn) and the degree of dissimilarity D(4) at the time point m-1
゜v, n) and route information F(4v, n) to calculate the recurrence formula of the dynamic program and calculate the dissimilarity degree D(4v, n) at time point m.
. m, each word dissimilarity degree D(4
v, NY), and calculate this as the digit dissimilarity DB (
4m), and the corresponding word path information F(4v, N
Y) is the digit route information FB (4m), and the word name V for which the minimum value is obtained is the digit recognition category! A digit dissimilarity calculating section for calculating JW (4m); and a digit dissimilarity memory section D for storing these.
B (4m), digit route information memory section FB (4m), digit recognition category memory section W (4m), and j digit route information FB (4m).
m) and a determination unit that determines and outputs the category of each digit of an input pattern in reverse order based on the digit recognition category W (4m). 1st Z nz, nzt, ηZ2

Claims

[Claims]

Input pattern A of one or more words that is a time series of feature vectors A = a1, a2, ..., am, ..., aM
V word standard patterns Bv stored in advance as
:=bv,'*b2v...,bH,...,bto(v
= 1, 2, ..., V) maximum Lma obtained by combining
Continuous voice standard pattern of x digits C=Bvl, Hvl,...
, Bvl, as+, BvLmax, the vector distance between the input pattern am and the continuous speech standard pattern bm on the time function n- that makes the time axis m of the input pattern correspond to the time axis n of the continuous speech standard pattern. In order to find the minimum value of the degree of dissimilarity defined as the sum of d(am, bn), continuous speech standard pattern C=BV1°By2..., Bvl,...
, BVLmax is divided into each digit, and the digit dissimilarity DB(l, m) indicating the minimum cumulative amount of inter-vector distance given by the optimal time function n(m) at the l-th digit and this time function are calculated. Digit path information FB(l, m
) and the digit recognition category W(l, m) which is the word name V that gave the minimum cumulative distance on this time function.
In a continuous speech recognition device that sequentially calculates the time point m of the input pattern and finally determines the number of digits of the input pattern and the recognition result of each digit, the signal m indicating the time point of the input pattern is changed from 1 to M. For each m, the signal V indicating the word is varied from 1 to V, and for each V, the signal l indicating the digit is varied from 1 to Lmax, and the signal n indicating the time point of the standard pattern is varied from 1 to Nv. a dissimilarity memory section D (l, v, n) and a route information memory section F (l, v, n) whose addresses are specified by the signals l, v, n of the control section; a word standard pattern b of the word V sequentially designated by the control unit at each time point m;
a distance calculation section that calculates the intervector distance d(am, bl, v) n=1 to NV between v, n=1 to Nv and the input pattern am; a distance memory section d(n) that stores this distance; At each time point m, for each digit l and each word V, first set the initial condition to the digit dissimilarity DB (
l-1゜m-1) and digit route information FB (l-1, m-1)
given the distance d(n), the degree of dissimilarity D(l, v, n) at the time point m-1, and the route information F(l, v, n). The equation is calculated and the degree of dissimilarity D (l, v, n) and route information F (l, v, n) at time point m are sequentially obtained, and the degree of dissimilarity D (l, v, NY) and the word route are calculated. a recurrence formula calculation unit that calculates information F (l, v, Nv); at each time point m, round can word dissimilarity degree D (l, v, Nv) calculated by the recurrence formula calculation unit for each digit l; ), set this as the digit dissimilarity DB (J, m), and set the corresponding word path information F (l, v, Nv) as the digit path information FB.
(J, m), and the word name V for which the minimum value was obtained is the digit recognition category W(l, m); a digit dissimilarity calculation unit; a digit dissimilarity memory unit DB for storing these; l,m), digit route information memory section FB(l,m), digit recognition category memory section W(l,m), digit route information FB(l,m) and digit recognition category W(l,m ), and a determination unit that determines and outputs the category of each digit of an input pattern in reverse order based on the following: