JPS60130799A

JPS60130799A - Continuous voice recognition equipment

Info

Publication number: JPS60130799A
Application number: JP58239303A
Authority: JP
Inventors: 誠夫亘理
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1983-12-19
Filing date: 1983-12-19
Publication date: 1985-07-12

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】本発明は連続音声認識装置に関し、特に文法に従って連
続発声された文音声を認識する装置の改良に関する。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a continuous speech recognition device, and more particularly to an improvement in a device that recognizes sentence speech continuously uttered according to grammar.

音声認識装置の中でも文法に従って発声された文音声を
認識する装置は、計算機プログラムや限定業務用文章あ
るいは航空管制や各種機器の制御用指令などの認識がで
き広範囲な応用分野を有している。文法の拘束が与えら
れている場合には、その文法規則を利用することによっ
て誤認識を防止できることが原理的に知られている。特
に連続数字認識において、入力音声に桁数の制約がある
場合、その制約を規則化することにより認識率を改善す
ることができる。Among voice recognition devices, devices that recognize sentence sounds uttered according to grammar can recognize computer programs, texts for limited business use, commands for air traffic control and control of various devices, and have a wide range of applications. It is known in principle that when grammatical constraints are given, misrecognition can be prevented by using the grammatical rules. Particularly in continuous number recognition, if there is a restriction on the number of digits in the input speech, the recognition rate can be improved by regularizing the restriction.

このような文法に従って連続に発声された文音声を認識
する手法が特願昭５６−１９９０９８号明細書に記載さ
れている。この原理は大路次のとおりである。文法をオ
ートマトンαて表現し、そのオートマトンαを次のよう
に定義する。A method of recognizing sentence sounds continuously uttered according to such a grammar is described in Japanese Patent Application No. 199098/1983. The principle behind this is as follows. We express the grammar as an automaton α, and define the automaton α as follows.

α＝（Ｋ、Σ、Δ＋　ｐａ　ｌ　Ｆ）　・・　・（１）
ここで、Ｋ：状態ｐの集合（ｐｌｐ＝１．２．・・・、
π）Σ：入力単語ｎの集合（ｎ１ｎ＝１．２．、、、、
Ｎ）Δ：状態遷移規則（（ｐ、ｑ、ｎ））ここで、（ｐ、ｑ、ｎ）はｐ　−＞　ｑなる状態遷移を
意味する。α=(K, Σ, Δ+ pal F) ・・(1)
Here, K: set of states p (plp=1.2...,
π) Σ: Set of input words n (n1n=1.2., , ,
N) Δ: State transition rule ((p, q, n)) Here, (p, q, n) means a state transition of p −> q.

ｐｏ：初期状態。以後はｐニＯで示す。po: initial state. Hereinafter, it will be indicated as pniO.

Ｆ：最終状態集合　ＦｃＫ次に前記オートマトンαに従って単語ｎ６Σを連続して
発声して得られる音声バタンＡをＡ＝は１　＋　ａ＋２
＋・・・、ａ、・・・、ａ■　・・・・・・・・・（２
）で示し、これを（未知）入力バタンと呼ぶ。各単語ｎ
ＥΣに対して標準的なバタンＢ’＝＝＝　１）？　、　Ｉｂ；　、　、、、　、　１
ｂ７　、　、、、　、１ｂシ１．−−−−−１Ｉ：３）
を用意し、これを単語標準バタンと呼ぶ。この単語標準
バタンをオートマトンαに従って接続することによって
得られる連続音声標準パタンＣニＢ０゛。F: Final state set FcK Next, the sound bang A obtained by continuously uttering the word n6Σ according to the automaton α is A=1 + a+2
＋・・・、a、・・・、a■ ・・・・・・・・・(2
), and this is called an (unknown) input button. each word n
Standard slam B' === 1) for EΣ? , Ib; , , , , 1
b7 , , , , 1bsi1. ------1I:3)
This is called a word standard slam. A continuous speech standard pattern C2B0゛ is obtained by connecting this word standard batan according to the automaton α.

Ｂ１．・・・　Ｂｎｘと入力バタンＡとのＤＰマッチン
クを行い、２つのバタンの相互に異なる度合を表わす量
（以下相異度と称する）を算出し、最小の相異度を与え
る単語系列を認識結果とする。B1. ...Performs DP matching between Bnx and the input button A, calculates the amount representing the degree of mutual difference between the two buttons (hereinafter referred to as the degree of dissimilarity), and recognizes the word sequence that gives the minimum degree of dissimilarity. shall be.

ここで最小の相異度を次のような動的計画の手法でめる
。初期条件Ｔ（０，０）二〇 ’Ｊ）　（ｍ　、　ｑ　）＝”　、　ｍ４０　、　ｑ”
ｒ。Here, the minimum degree of dissimilarity is determined using the following dynamic programming method. Initial condition T(0,0)20'J) (m, q)=”, m40, q”
r.

（）（ｐ　、　ｎ　、　ｊ　）＝−・・川・（４）とし
、ｒｒｌ　＝　ｌよりＩまで順次（５）式の境界条件を
基に（６）式の漸化式を（ｐ、ｑ、ｎ）ｅΔなるすべて
の対（ｐ、ｎ）について計算する。すなわち、対（ｐ、
ｎ）について境界条件Ｇ（ｐ　、　ｎ　、　ｏ　）＝’ｌ’（ｍ−１、１）　
）Ｈ（ｐ、口、ｏ）＝ｍ−１・・・・・・（５）とし、
漸化式％式％）（６）ただし、ｄ　（ｊｌ：＝　Ｄｉｓ　（ａ−、ｌｂ’；　）＝Σｌ
　ａ＋ｎｒ　６’ｊｒ　ｌ　・・・（ｂ　］、　）ｒｒ
ｌｊ　＝　ａｒｇｍｉｎ　Ｇ（ｐ　、　ｎ　、　ｊ’　）
」−２≦３′≦ｊをｊ＝ｘよりＪ″まで計算し、ｇ（１，ｈ（ｊ）をそれ
ぞれＧ（ｐ、ｎ、ｊ）、Ｈ（ｐ、ｎ、ｊ）へ格納すを最
小とするＸを意味している。次に単語の境界における最
小化としてｉｆｆ　Ｔ（ｍ、ｑ）＞Ｇ（ｐ、ｎ、Ｊｎ）　・−−−
（力ｔｈｅｎ　Ｔ（ｍ、　ｑ　）＝ＧＣｐ　、　ｎ　、
Ｊ”）Ｎ（ｍ、ｑ）＝ｎＰ（ｍ、ｑ）＝ｐＬ（ｍ、　ｑ　）＝）（（ｐ　、　ｎ　、　Ｊ”）を計
算する。() (p, n, j) = -... river (4), and sequentially from rrl = l to I, based on the boundary condition of equation (5), the recursion equation of equation (6) is (p, q , n) eΔ for all pairs (p, n). That is, the pair (p,
For n), the boundary condition G(p, n, o) = 'l'(m-1, 1)
) H (p, mouth, o) = m-1 (5),
Recurrence formula % formula %) (6) However, d (jl:= Dis (a-, lb'; )=Σl
a+nr 6'jr l...(b], )rr
l j = argmin G(p, n, j')
''-2≦3'≦j from j=x to J'' and store g(1, h(j) in G(p, n, j) and H(p, n, j), respectively. It means the minimum X. Next, as a minimization at the word boundary, if T (m, q) > G (p, n, Jn) ・---
(force then T(m, q) = GCp, n,
J”) N(m, q)=n P(m, q)=p L(m, q)=)((p, n, J”).

すなわち、（６）式の漸化式では第１図に示すように標
準バタン軸に沿った縦１列の計算を各対（ｐ。That is, in the recurrence formula of equation (6), as shown in FIG. 1, calculations in one vertical column along the standard batten axis are performed for each pair (p.

ｎ）について算出し、この縦１列の計算を入力バタン軸
ｌこ沿って＃動させ、終端ｍ　＝　Ｉまでめる。n), and move this vertical column of calculations # along the input button axis l until it reaches the terminal m = I.

入力バタンの認識結果は、判定処理として次のような手
続きによりめられる。The recognition result of the input button is determined by the following procedure as a determination process.

ｑ　＝　ａｒｇｍｉｎ　（Ｔ（Ｉ　、　ｑ　）　）　・
・・・・（８）ｑεＦ初期条件　ｑニｑ、ｍ−１・・・・・・（９）認識単Ｈ
ｎ＝Ｎ（ｍ、ｑ）　−・・−ａｏ）単語始点　ｌニＬ（
ｍ、ｑ）状態遷移　ｐニＰ（ｍ、ｑ）もしＸ＞Ｏならばｑ＝ｐ、ｒ１１＝４としてＵ（２）を
繰り返す。ｎ＝０ならば終了。　・・・・・（１１）以
上述べた方法では縦１列の５漸化式計算においてすべて
の標準バタンを１回読み出さなＩすればならない。また
％　Ｇ（ｐ、ｎ、ｊ）さ１４（ｐ、ｎ、Ｊ）を読み、計
算した結果を格納しなぜればならない。q = argmin (T(I, q)) ・
...(8) qεF Initial condition qniq, m-1...(9) Recognition single H
n=N(m, q) −・・−ao) word starting point lniL(
m, q) State transition pniP(m, q) If X>O, repeat U(2) with q=p and r11=4. If n=0, end. (11) In the method described above, all the standard buttons must be read out once in the calculation of the five recurrence formulas in one vertical column. It is also necessary to read %G(p, n, j) and store the calculated results.

ｐ、ｎの数が大きくなると標準パタンやＯ（ｐ。When the numbers of p and n become large, the standard pattern or O(p.

ｎ、ｊ）、）ｉ（ｐ、ｎ、ｊ）は大晴となり、全メモリ
アクセス時間（読み出し１店−き込み時間）は大きくな
る。一方、ＤＰマツチングの繰返し計算。n, j), )i(p, n, j) becomes clear, and the total memory access time (one read time minus one read time) becomes long. On the other hand, iterative calculation of DP matching.

は複数の演算器で実行可能であり１．演算時間は全メモ
リアクセス時間より小さくすることができる。can be executed by multiple arithmetic units, and 1. The computation time can be less than the total memory access time.

さらにメモリの読み出し％書き込みと演Ｂを並列処理す
ることも可能である。この場合、ｌ）Ｐマツチングの処
理時間は全メモ’Ｊ　７クセス時間となり、単語数や状
態数が大きくなるとＤＰマツチング処理時間が大きくな
る欠点がありた。言いカンえれば一定時間内に処理でき
る単語数や状態数が少ないという欠点があった。Furthermore, it is also possible to process memory read/write and performance B in parallel. In this case, l) the processing time for P matching is 7 times for all memos, and there is a drawback that as the number of words and states increases, the processing time for DP matching becomes longer. Simply put, the drawback was that the number of words and states that could be processed within a certain amount of time was small.

本発明の目的は、縦１列の漸化式計算を数フレームまと
めてブロック化することにより、標準パタンやＧ（ｐ、
ｎ、ｊ）、Ｈ（ｐ、、ｎ、ｊ）のアクセス回数を減少ざ
ぜ、ＤＰマツチングの処理時間が小さく処理単語数や状
態数の大きい連続音声認識装置を提供することである。The purpose of the present invention is to create a standard pattern, G(p,
It is an object of the present invention to provide a continuous speech recognition device that can reduce the number of accesses to H(p, , n, j), shorten the processing time for DP matching, and process a large number of words and states.

本発明による連続音声認識装置は、入力音声を特徴の時
系列ＡニａＩ１．ａｂ、・・・、ａｌｌ、・・・、ａ）
Ｉ　に変換する入力部と、単語セットΣ＝（ｎ）の甲の
各車４！　ｎに対して標準バタ７　Ｂ″＝　ｌｂ’；　
、ｌｂ’；　、　−・−、＋ｂ７、−。The continuous speech recognition device according to the present invention uses input speech as a time series of characteristics A2aI1. ab,..., all,..., a)
I and each car 4 in the instep of the word set Σ=(n)! Standard butter 7 B″=lb′ for n;
, lb'; , -・-, +b7, -.

ｌｂ５’、を記憶する標準バタン記憶部と、単語ｎの入
力によって状態遷移ｐ　−＋　ｑが生じることを意味す
る規則（ｐｅ　ｑ＋　ｎ）の群である状態遷移テーブル
Δと雇終状悪の群Ｆとを記憶するオートマトン記憶部と
、入カバクン時刻τｎと状態ｑとによって番地指定され
るテーブル記憶’Ｉ’　（ｍ　、　ｑ　）　、　Ｐ（ｍ
、ｑ）、Ｎ（ｍ、ｑ）およびＬ（ｍ、ｑ）と、状態ｐさ
即語指定ｎおよび標準バタン時刻ｊとｌこよって番地指
定されるテーブル記憶（）（ｐ、ｎ、ｊ）およびＨ（ｐ
、ｎ、ｊ）と、人力バタンの第ｉブロックの各時刻ｍ　
＝　（ｉ−１）・ＩＬ＋１．・・・、凰・ＩＬにおいて
順次指定される東語指定信号ｎに対応した標準パタンの
各特徴ベクトル１ｂ７と入力バタンの特徴ベクトルａ−
との距離ｄ（ｒｎ、ｊ）を算出する距離計算部と、単語
指定ｎに対して生起し得る状態対（ｐ、ｑ）に対して、
状態ｐと単語指定ｎと標準バタン時刻Ｊによって指定さ
れる前記テーブル記憶Ｇ（ｐ、ｎ、ｊ）およびＨ（ｐ、
ｎ、ｊ）を参照し、前記テーブル記憶Ｔ（ｍ−１，ｑ）
の値を漸化式境界条件とし、Ｌ（ｍ−＝１．ｑ）を経路
境界条件としてＤＰマツチング漸化式値ｇ　（ｍ。lb5', a state transition table Δ which is a group of rules (pe q+ n) which means that a state transition p −+ q occurs due to the input of word n, and a group of employment termination states. an automaton storage section storing F, and a table memory 'I' (m, q), P(m
, q), N (m, q) and L (m, q), state p, immediate word designation n, and standard slam time j and l, thereby table memory () (p, n, j) and H(p
, n, j) and each time m of the i-th block of the manual slam.
= (i-1)・IL+1. ..., each feature vector 1b7 of the standard pattern and the feature vector a- of the input button corresponding to the Togo designation signal n sequentially specified in 凰/IL
A distance calculation unit that calculates the distance d(rn, j) from
The table memories G(p, n, j) and H(p,
n, j) and the table storage T(m-1, q)
DP matching recurrence formula value g (m.

ｊ）と経路値ｈ（ｍ、ｊ　）をｍ　＝　（ｉ−１）　−
ＩＬ−ｔ−１よりｉ・ＩＬまで、かつＪ−１よりＪｎ　
までめて、前記テーブル記憶Ｇ（ｐ、ｎ、ｊ）とＨ（ｐ
、ｎ。j) and the path value h(m, j) as m = (i-1) −
From IL-t-1 to i・IL, and from J-1 to Jn
In summary, the table memories G(p, n, j) and H(p
, n.

ｊ）とに格納するＤＰマツチング部と、前記１）Ｐマツ
チング部で算出された漸化式値ｇ’（ｍ、Ｊ”）を状態
ｑに関して最小値をめＴ（ｍ、ｑ）Ｉこ書込み、この最
小を与えた単語名ｎ、状態指定ｐ。j) and the DP matching unit stored in 1) above, and write the recurrence formula value g'(m, J'') calculated by the P matching unit to T(m, q)I to find the minimum value with respect to state q. , word name n that gave this minimum, state specification p.

経路値ｈ（ｍ、Ｊ”）をＮ（ｍ、ｑ　）、Ｐ（ｍ、ｑ）
Ｌ（ｍ、ｑ）Ｇこそれぞれ書き込む比較回路と、前記入
カバタン時刻ｍ、入カバタンブロック番号凰。Route value h(m, J”) is converted to N(m, q), P(m, q)
A comparison circuit that writes L(m, q) and G respectively, the input time m, and the input block number.

単語指定ｎ、状態指定ｐ　、　ｑｉ６よび標準バタン時
刻ｊ等の信号を発生するための制御部と、最終時刻ｍ＝
Ｉにおいて、前記最終状態群Ｆに含まれる状態ｑで前記
テーブル記憶Ｔ（ｍ、ｑ）が最小なるものを定め、この
最小を与えたｑおよび時刻ｍをもとにして前記テーブル
記憶Ｐ　（ｍ　、　ｑ　）　、　Ｎ（ｍ、　ｑ　）およ
びＬ（ｍ、ｑ　）を参照して認識結果を定める判定部と
を有している。A control unit for generating signals such as a word designation n, a state designation p, qi6, and a standard slam time j, and a final time m=
In I, the table memory T (m, q) is determined to be the minimum state q included in the final state group F, and the table memory P (m, q) is determined based on the minimum q and time m. , q ), N(m, q ), and L(m, q ) to determine the recognition result.

次に本発明の原理について説明する。本発明による方法
では、第２図に示すように（６）式の漸化式の計算を入
力バタンのＩＬフレーム分をブロック化して算出する。Next, the principle of the present invention will be explained. In the method according to the present invention, as shown in FIG. 2, the recurrence formula (6) is calculated by dividing the IL frame of the input button into blocks.

すなわち、次に示す動的計画の手法でめる。初期条件Ｔ（０，０）＝Ｏ・・・・・・・・・＠Ｔ（ｍ、ｑ）＝
閃１ｍ〜ｏ、ｑ〜０Ｇ（ｐ、ｎ、ｊ）＝■ とし、ｉ　＝　ｌよりＩ／ＩＬ（ここでＩ／Ｉｆ、は説
明の簡単のため割り切れるとする。）まで順次ｕ　、　
ｕｉ式の境界条件を基に１喝式の漸化式を（ｐ、ｑ、ｎ
）εΔなるすべての対（ｐ、ｎ）について計算する。′
すなわち、境界条件ｇ（ｍ−１ｉＱ）＝Ｔ（ｍ−１、ｐ）　ｍ：ｍ、、−・
−、ｍ６−刊りｈ　（ｍ−１、Ｑ　）　＝ｍ−１ｍ＝ｍ
、　、−・−、ｍ。In other words, it can be done using the following dynamic programming method. Initial condition T (0, 0) = O... @T (m, q) =
Flash 1m~o, q~0 G (p, n, j) = ■, and sequentially u from i = l to I/IL (Here, I/If is divisible for simplicity of explanation).
Based on the boundary conditions of the ui equation, the recurrence equation of the 1-step equation is (p, q, n
)εΔ for all pairs (p, n). ′
That is, the boundary condition g(m-1iQ)=T(m-1,p) m:m, , -・
-, m6-print h (m-1, Q) = m-1m=m
, , −・−, m.

ただし、ｍ、＝　（ｊ−１）　＊　ＩＬ−１、ｍｅ＝ｍ
ｅ　ＩＬｇ（ｍ、−１，ｊ）＝Ｇ（ｐ、ｎ、ｊ）　Ｊ＝
１．−、Ｊ”−１１４ｈ（ｍ、−１，ｊ）＝Ｈ（ｐ、ｎ
、ｊ）　ｊ＝１．−−−、Ｊ’とし、漸化式％式％）ただしｄ（ｍ、　ｊ）＝Ｄｉｓ（ａｙ、ｂ７）ｊ　＝　ａｒｇ
ｍｉｎ　ｇ　（ｍ−１、ｊ’　）１−２≦ｊ’＜ｊをｔｎ　＝　ｍ、よりｍ６まで計算し、その後Ｊ＝１よ
りＪｎまで計算する。１ブロツクの計算終了後ｇ、（ｍ
ａ。However, m, = (j-1) * IL-1, me=m
e ILg (m, -1, j) = G (p, n, j) J =
1. −, J”−114h(m, −1, j)=H(p, n
, j) j=1. ---, J', recurrence formula % formula % ) where d (m, j) = Dis (ay, b7) j = arg
min g (m-1, j')1-2≦j'<j from tn = m, calculate up to m6, and then calculate from J=1 to Jn. After completing one block of calculation, g, (m
a.

ｊ）、ｈ（町、ｊ）をそれぞれＧ（ｐ　、　ｎ　、　ｊ
　）　。j) and h(town, j) respectively as G(p, n, j
).

Ｈ（ｐ、ｎ、ｊ）へ格納する。Store in H(p, n, j).

次に単語の境界における最小化としてｉｆｆ　Ｔ（ｍ、ｑ）＞ｇ（ｍ、Ｊ”）　−−−−−−
ｕ＊ｔｌｌｅｎ　Ｔ（ｍ、ｑ）＝ｇ（ｍ、Ｊ”）Ｎ（ｍ
、ｑ）二ｎＰ（ｒｎ、ｑ）＝ｐＬ（ｍ、ｑ）＝ｈ（ｍ、Ｊ’）ただしｍ＝ｍ、　、　・−、ｍｌＩを計算する、入力バタンの認識結果は、（８）　、　（９）　、　（
ＩＧ　、　（ｌυにより前記従来方式と同様にしてめる
。Next, as a minimization at the word boundary, if T(m, q)>g(m, J”) --------
u*tllen T(m, q)=g(m, J”)N(m
, q) 2n P(rn, q)=p L(m, q)=h(m, J') However, m=m, , ・−, mlI is calculated. The recognition result of the input button is (8 ) , (9) , (
IG, (lυ) is obtained in the same manner as in the conventional method.

以上、説明したように本発明における方法の計算量は従
来方法と同一であるが、Ｇ（ｐ、ｎ、ｊ）。As explained above, the amount of calculation of the method according to the present invention is the same as that of the conventional method, but G(p, n, j).

Ｈ（ｐ、ｎ、ｊ）のアクセス回数はｌ／ＩＬとなり、ア
クセス時間が減少される。また、入力バタンのｍ１フレ
ームより町フレームまでのデータを演算部のレジスタま
たは高速メモリへ格納してあれば、標準バタン１フレー
ムの読み出しに対してｄ　（ｍ　。The number of accesses to H(p, n, j) becomes l/IL, and the access time is reduced. Furthermore, if the data from the m1 frame of the input button to the town frame is stored in the register or high-speed memory of the arithmetic unit, d (m ) for reading the standard button 1 frame.

ｊ　）　ｍ　＝　ｍ、　’、・・・１ｍｅまでを計算す
ることができる。j) m = m, ', . . . up to 1me can be calculated.

このため、標準バタンの読み出しもｌブロックの計算に
対して１回でよく、従来方法と比較しアクセス時間は１
／ＩＬとなる。ＤＰマツチングの演算は繰り返し計算が
多いため並列処理が可能であり、高速の演算部を構成す
ることは容易であるが、バタン記憶など大容量メモリ部
は高速にすることが困難である。従って演算時間は小さ
く、メモリアクセス時間が大きくなっている。本発明に
よる方法ではメモリアクセス時間を従来方法に比較し１
／ＩＬに減少させる利点を持っており、ＤＰマツチング
の処理時間も１／ＩＬに近くなり、一定時間内に処理で
きる単語数や状態数を大きくできる利点を持っている。Therefore, the standard button only needs to be read once for every l block calculation, and compared to the conventional method, the access time is 1
/IL becomes. Since the calculation of DP matching involves many repeated calculations, parallel processing is possible and it is easy to construct a high-speed calculation section, but it is difficult to make a large-capacity memory section such as a slam memory high-speed. Therefore, the calculation time is short and the memory access time is long. The method according to the present invention has a memory access time of 1 compared to the conventional method.
/IL, the processing time for DP matching becomes close to 1/IL, and the number of words and states that can be processed within a certain amount of time can be increased.

次に本発明の第１実施例について説明する、第３図は本
発明の第１実施例を示すブロック図であり−トである。Next, a first embodiment of the present invention will be described. FIG. 3 is a block diagram showing the first embodiment of the present invention.

標準バタン記憶部１３０には単語セットΣに含まれる単
語ｎの標準パタンＢ”が記憶されている。オ−トマトン
記憶部２３０には状態遷移規則と最終状態の指定情報と
が記憶されている。本実施例では状態遷移規則は第９図
（ａ）および（ｂ）のように記憶されている。第９図（
ａ）は状態指定テーブルと呼ばれ単語ｎによる状態遷移
ｐ　−＋　ｑとして許されるｐの集合を指定している。The standard pattern B'' of the word n included in the word set Σ is stored in the standard pattern storage unit 130. The automaton storage unit 230 stores state transition rules and final state designation information. In this embodiment, state transition rules are stored as shown in FIGS. 9(a) and (b).
A) is called a state specification table and specifies a set of p allowed as state transition p −+ q by word n.

図の例では単語ｎに対応するｐとしてｐｌ、・・・、ｐ
、・・・、ｐ６が生じ得ることが例示されている。第９
図ｔｂ）は遷移テーブルと呼ばれ単語ｎによって状態ｐ
から遷移し得る状態ｑの集合を記憶している。図の例で
はｑとしてｑｚ　ｅ　ｑｚ　＊ｑ３　＊　Ｇ４が生じ得
ることを示している。In the example shown in the figure, p corresponding to word n is pl, ..., p
,..., it is exemplified that p6 can occur. 9th
Figure tb) is called a transition table, and the state p is defined by the word n.
A set of states q that can be transitioned from is stored. The example shown in the figure shows that qz can be expressed as qz e qz *q3 *G4.

マイクロホン１００より未知入力音声が入力されると人
力部１１０によって周波数分析がなされ特徴を示すベク
トルａｉに変換され順次入力バタンバッファ１２０ｉこ
送られる。また、人力部１１０には音声レベルを検知す
ることによって音声区間を決定する機能が与えられてお
り、音声区間中では「１」その他では「０」なる音声区
間信号Ｓを発生する。When an unknown input voice is input from the microphone 100, the human power section 110 performs frequency analysis, converts it into a vector ai representing characteristics, and sequentially sends it to the input bang buffer 120i. Furthermore, the human power unit 110 is provided with a function of determining a voice section by detecting the voice level, and generates a voice section signal S that is "1" during the voice section and "0" otherwise.

制御部２４０は、この音声区間信号Ｓの立上りの時刻ｌ
こおいて初期化パルスＩＴ　１　を発生する。これによ
って第８図のブロック１０に対応する初期化がＧメモリ
ー１５０とＴメモリ２００に対してなされる。The control unit 240 controls the rise time l of this voice section signal S.
At this point, an initialization pulse IT 1 is generated. As a result, initialization corresponding to block 10 in FIG. 8 is performed on the G memory 150 and the T memory 200.

以上の初期化が終了すると、以後の工しフレーム分の入
力特徴ａ１　の入力に同期して入力バタンブロック信号
ｉが１，２．・・・・・・と計数される。この人力バタ
ンブロックｉにおいて制御部２４０よりの単語指定信号
ｎは１よりへまで変化する。各ｎにおいてオートマトン
記憶部２３０中の状態テーブルが参照され、状態指定信
号ｐがｐｌからｐ８　まで変化される。またそのｎとｐ
にて規定される状態指定信号ｑも出力される。When the above initialization is completed, the input slam block signal i is changed to 1, 2, etc. in synchronization with the input of the input feature a1 for the subsequent processing frames. It is counted as... In this manual slam block i, the word designation signal n from the control section 240 changes from 1 to down. At each n, the state table in the automaton storage section 230 is referred to, and the state designation signal p is changed from pl to p8. Also, that n and p
A state designation signal q defined by is also output.

次に単語ｎ１状態ｐなる１サイクル内の動作を以下に説
明する。この１サイクルによって第２図の斜線部分の計
算が実行される。すなわち、Ｕ。Next, the operation within one cycle of word n1 state p will be described below. The calculations shown in the shaded areas in FIG. 2 are executed through this one cycle. That is, U.

Ｉ式の境界条件のもとて（１鴎、ＱＱ式を計算する。初
めに制御部よりの信号５ＥＴ２　によって、第８図のブ
ロック１１　、１２に対応する値のセットがＤＰマツチ
ング部１４０内のｇメモリ１４３とｈメモリ１４４に対
して行われる。つづいて制御部２４０よりの標準バタン
時刻信号ｊは１よりＪ”まで変化する。各ｊにおいて、
入力バタン時刻信号ｍ　ｉｔ　ｍ、　（ｍ、　＝（ｉ−
１）−１Ｌ＋１）よりｍ、（ｍｅ＝ｉ　−ＩＬ）まで変
化する。Based on the boundary conditions of the I equation (1), calculate the QQ equation. First, the set of values corresponding to blocks 11 and 12 in FIG. This is performed for the g memory 143 and the h memory 144.Next, the standard slam time signal j from the control unit 240 changes from 1 to J''.For each j,
Input slam time signal m it m, (m, = (i-
1) changes from −1L+1) to m, (me=i −IL).

ング部Ｊ４０では、初めに入力バタンのｍフレームと第
ｎ番目の単語の標準バタンのｊフレームが読み出されて
、（６−１）式に示すＲ次元べりトル間の距離を距離計
算部１４１でめる。すなわち人力バタンバッファ１２０
と標塘パタンメモリ部１３０より信号に従ってｌも個の
データを順次読み出し。In the processing section J40, m frames of the input button and j frames of the standard button of the n-th word are first read out, and the distance between the R-dimensional beams shown in equation (6-1) is calculated by the distance calculation section 141. Demeru. In other words, the human-powered slam buffer 120
Then, l pieces of data are sequentially read out from the pattern memory unit 130 according to the signal.

絶対値回路１４１１にて差の絶対値をめ、加算器１４１
２とアキュムレータ１４１３によってそのオロがめられ
、Ｄｉｓ　（ａ、ｂ；）が得られる。一方、゛距離計算
と並夕１ルで３つの相異度の最小値がめられる。すなわ
ら、ｇメモリ１４３よりｇ（ｍ−ｔ＊ｊ）ｅｇ（Ｉｎ−
１、ｊ−１）　、　ｇ（ｔｎ−１、ｊ　−２）とｈメモ
リ１４４よりｈ（ｍ−１，ｊ）、ｈ（ｍ−ｘｎｊ−Ｂ。The absolute value of the difference is determined by the absolute value circuit 1411, and the adder 141
2 and the accumulator 1413, Dis (a, b;) is obtained. On the other hand, the three minimum values of dissimilarity can be found by distance calculation and parallelism. In other words, g(m-t*j)eg(In-
1, j-1), g(tn-1, j-2) and h(m-1, j), h(m-xnj-B) from the h memory 144.

ｈ（ｘｎ−１，ｊ　２）を読み出し、レジスタ山。Read h(xn-1,j　2) and register pile.

０２、Ｇ３．Ｈｌ、１−１２．８３　へそれぞれ格納す
る。比較回路１４２２は３つのレジスタｏｉ　、　Ｇ２
　、０３より最小値を検出し、その最小値が得られたレ
ジスタモリ１６４のｈ（ｍ、ｊ）へ誓き込まれる。また
、比較回路１４２２より出力された最小値ｇ（ｍ−１゜
Ｊ）は前記距離計算部１４１でめられた距離ｄ（ｔｎ、
ｊ）と加算器１４２１によって加算され、ｇメモＩＪ　
１４３のｇ（ｍ、ｊ　）へ書き込まれる。入力バタン時
刻ｍがｍ、からｎｌ、まで変ｆヒされることによって標
準バタン時刻ｊＩこ対する処理が終了する。02, G3. Hl, 1-12.83 respectively. Comparison circuit 1422 has three registers oi, G2
, 03, and the minimum value is stored in h(m, j) of the register memory 164 from which the minimum value was obtained. Furthermore, the minimum value g (m-1°J) output from the comparison circuit 1422 is the distance d (tn,
j) and the adder 1421, and the g memo IJ
143 g(m,j). By changing the input slam time m from m to nl, the processing for the standard slam time jI is completed.

さらに標準バタン時刻」が終端Ｊｎとなった後、第８図
のブロック１４に示すように信号５ＥＴ３　に従って順
化式（１暖の結果ｇ（Ｉｎ＋＋　、　ｊ　）”、　１１
　Ｃｒｎｅ　、　ｊ　）をテーブルメモリＧ（ｐ　、　
ｎ　、　ｊ　）　、Ｈ（ｐ、ｎ、ｊ　）へ格納する。つ
づいて制御部より発せられた信号ｍ２（ｍ、よりｍ８ま
で変化する）に従って第８図のブロック１５に示した比
較が行われる。すなわち第５図に示すように、信号ｍ２
とｑに従ってテーブルメモリ２００よりＴ（ｍ、ｑ）と
ｇメモリ１４３よりｇ　（ｍ　、　Ｊ’　）が読み出さ
れ、比較回路１７０により比較しＴ（ｍ、ｑ）＞ｇ（ｍ
、Ｊ”）の場合ｗｐ倍信号発せられ、ｇ（ｍ、Ｊ’）、
ｎ、ｐ、ｈ（ｍ、Ｊ”）がテーブルメモリＴ（ｍ、ｑ）
、Ｎ（ｍ、ｑ）。Furthermore, after the "standard slam time" becomes the terminal Jn, the acclimatization formula (1 warm result g(In++, j)", 11 according to the signal 5ET3 as shown in block 14 of FIG.
Crne, j) as table memory G(p,
n, j), and stored in H(p, n, j). Subsequently, the comparison shown in block 15 of FIG. 8 is performed in accordance with the signal m2 (m, which changes from m to m8) issued by the control section. That is, as shown in FIG.
T(m, q) is read from the table memory 200 and g(m, J') is read from the g memory 143 according to
, J”), then wp times the signal is emitted, and g(m, J'),
n, p, h (m, J”) is table memory T (m, q)
, N(m, q).

Ｐ（ｍ、ｑ）、Ｌ（ｍ、ｑ）へ書き込まれる。Written to P(m, q) and L(m, q).

以上の動作によって状態ｐ、単単語ｎ大入カバタンブロ
ックの処理が終了する。さらに状態指定信号ｐがｐｌか
らｐ６と変化されることにより単語指定信号ｎに対する
処理が終了する。さらに単語指定信号ｎが１からＮまで
変化されることにより入力バタンブロックｉに対する処
理が終了する。With the above operations, the processing of the state p and single word n large entry kabatan block is completed. Furthermore, the state designation signal p is changed from pl to p6, thereby completing the processing for the word designation signal n. Furthermore, the word designation signal n is changed from 1 to N, thereby completing the processing for the input slam block i.

入力バタンブロック信号がＩ／ＩＬとなった後に、（８
）ｔ９）（１０）（１１）式に示した判定処理が開始さ
れる。判定部２２０は第６図に示すように構成され、始
めに第８図のブロック１６の処理として、オートマトン
記憶部２３０から最終状態集合ＦＩこ含まれる状態ｑを
信号９３に従って順次指定され、テーブルメモリ２００
よりＴ（１，ｑ）を読み出し比較回路２２１゜最小値レ
ジスタ２２２．状態レジスタ２２３を用いて最小のＴ（
１，ｑ）が与えられる状態ｑを得る。After the input slam block signal becomes I/IL, (8
)t9) The determination process shown in equations (10) and (11) is started. The determination unit 220 is configured as shown in FIG. 6, and first, as the process of block 16 in FIG. 200
T(1,q) is read from the comparison circuit 221° and the minimum value register 222. Using the status register 223, the minimum T(
1, q) is given.

続いて第８図のプロ、り１７の処理上して判定制例部２
２７はｑ：ｑ、ｍ：工としてアドレス信号ｍ３゜ｑをテ
ーブルメモリ１９０，２１０．１８０　ヘ発しＮ（ｍ。Next, in the process of FIG. 8, the judgment restriction part 2
27 issues an address signal m3゜q to the table memory 190, 210.180 as q:q, m:engine, N(m.

ｑ）、Ｌ（ｍ、ｑ）、Ｐ（ｍ、ｑ）を読み出す。q), L(m, q), and P(m, q).

このＬ（ｍ、ｑ）とＰ（ｍ、ｑ）は次（７）　ｍとｑと
なり、Ｎ（ｍ、ｑ、）は認識結果として出力される。These L(m, q) and P(m, q) become the following (7) m and q, and N(m, q,) is output as the recognition result.

この処理をｌが七〇になるまで繰返丁ことにより順次認
識結果が得られる。By repeating this process until l reaches 70, recognition results can be obtained in sequence.

以上説明した第１の実施例では、状態ｐ、単飴ｎの入力
バタン時刻Ｉｎｓよりｍ。まで（入力バタンブロックｉ
）の部分の計算結果ｇ（ｍｓ、Ｊｎ）〜ｇ（ｍｅ、Ｊ“
）がまとめて得られる。この結果を次の状態ｐ′の同じ
人力バタンフロック１の初期値の計算（第５図のブロッ
ク１４）で使用するため、オートマトンの初期状態より
最終状態へ順序よ＜　ｉｔ算を進める必要がある。一方
、オートマトンの状態遷移にループが含まれる場合（ま
、状態ｐｉこおけＡ計算結果を自分自身の状態ｐの初期
匝の計算に使用するという不都合が生じ、正しい計算結
果が得られない。In the first embodiment described above, in the state p, from the input button time Ins of the single candy n, m. up to (input slam block i
) part calculation result g(ms, Jn) ~ g(me, J“
) are obtained all at once. In order to use this result in the calculation of the initial value of the same human-powered slam block 1 in the next state p' (block 14 in Figure 5), it is necessary to proceed with the < it calculation from the initial state of the automaton to the final state. . On the other hand, if the state transition of the automaton includes a loop (well, the inconvenience arises that the calculation result of state pi box A is used to calculate the initial box of its own state p, and the correct calculation result cannot be obtained.

第２の実施例では、前記の欠点を取り除き、オートマト
ンの状態遷移にループが含まれる場合でも正しい結果を
得ることができる。すなわち、状態ｐ、単語ｎの入カバ
クンブロックｉの部分の計算結果のｇ　（ｍｅ−Ｊ’　
）のみを用いて入力バタンプロ、りｉ＋１の状態ｑの初
期値の計算を行う。In the second embodiment, the above-mentioned drawbacks are eliminated, and correct results can be obtained even when a loop is included in the state transition of the automaton. In other words, g (me-J'
) is used to calculate the initial value of the state q of the input button i+1.

第２の実施例における計算手順は本発明の詳細な説明し
た方法とほとんど同じであるが、（［，１式の境界条件
はｇ　（ｍ、１　、０　）＝：”Ｉ’（１７１，１−ｐ）
　３１．１．−Ｑ７）ｈ　（ｍ、１　、０　）＝＝ｍ＝
　１ｇ　（ｍｔ　０　）　＝ｏｏ　ｍ＝ｍ、　、・・・、塵
と変わり、さらにｕ６）式の単語の境界における最小化
はｉｆｆ　Ｔ（ｍ、、ｑ）＞ｇ（ｍｅ、Ｊ’）　−−ｕｓ
ｔｈｅｎ　Ｔ（ｍｓ、ｑ）＝ｇ（ｍｅＪ”）Ｎ　（ｍ、
　Ｉ　ｑ　）二〇Ｐ（ｍ、、ｑ）＝ｐＬ（ｍ、、ｑ）＝ｈ（ｍｅ、Ｊｎ）と変わる。The calculation procedure in the second embodiment is almost the same as the method described in detail of the present invention, but the boundary condition of the equation ([, 1 -p)
31.1. -Q7)h (m, 1, 0)==m=
1 g (mt 0 ) =oo m=m, ,..., change to dust, and furthermore, the minimization of equation u6) at the word boundary is iff T(m,, q)>g(me, J') − -us
then T(ms, q)=g(meJ”)N (m,
I q )20 P(m,, q)=p L(m,, q)=h(me, Jn).

第２の実施側番ごおける装置ｔの構成は第１の実施例と
同じであるが、境界条件吋）式と車飴の境界における最
小化ＵＳ式となるため、計算の手１１ｆｆｉが異なる。The configuration of the device t in the second embodiment is the same as the first embodiment, but the calculation method 11ffi is different because the boundary condition (2) equation and the minimization US equation at the boundary of car candy are used.

第２の実施例の動作時間ｒ＠係を示ｒタイムチ異なる部
分のみ説明゛づ−る。境界榮件は（１階式より（１７）
式へ変わっており、制御信号ＳＬ、Ｔ２に従って第１１
図のブロック１１　、１２に示した櫃がｇメモＩＪ　１
４３　。The operation time r of the second embodiment will be shown below, and only the different parts will be explained. The boundary condition is (from the first floor formula (17)
According to the control signals SL and T2, the 11th
The boxes shown in blocks 11 and 12 of the figure are g memo IJ 1
43.

ｈメモリ１４４へセットされる。また、単語の境界にお
ける最小化はｔｔｔｐ式より０樽式へ変わっており、制
御信号ｍ２に従って第１１図のブロック１５に示す比較
処理が行われる。h memory 144. Furthermore, the minimization at word boundaries has been changed from the tttp method to the 0-barrel method, and the comparison process shown in block 15 of FIG. 11 is performed in accordance with the control signal m2.

以上説明した第２の実施例では、オートマトンの状態遷
移にループが存在してもよいが、単語の境界の最小化を
ｍウニｉ・ＩＬ　の点のみでしか行っていないため、単
語」暴り持朋崖度が若干悪くなっている。一方、テーブ
ル記憶Ｔ（ｍ、ｑ）。In the second embodiment described above, a loop may exist in the state transition of the automaton, but since the word boundaries are minimized only at the point munii・IL, the word The degree of resilience has deteriorated slightly. On the other hand, table storage T(m, q).

Ｎ（ｍ、ｑ）、Ｌ（ｍ、ｑ）、Ｐ（ｍ、ｑ）のｍに関し
て使用している部分はｍｅ＝ｉ・ＩＬ　のみであるので
、第１の実施例のテーブル記憶の］／ｉＬの小さな記憶
容量とすることが可能である。Since the part used for m in N(m, q), L(m, q), and P(m, q) is only me=i・IL, the table storage in the first embodiment is It is possible to have a small storage capacity of iL.

以上、本発明の構成を実施例にもとづいて説明したが、
これらの記載は本発明の権利範囲を限定するものではな
い。特に本発明ではＤＰマツチングの漸化式として最も
簡単な（１７）式を用いたが、他のより高性能なものを
用いる方法も考えられる。The configuration of the present invention has been described above based on examples, but
These descriptions do not limit the scope of the present invention. In particular, in the present invention, the simplest equation (17) is used as the recurrence equation for DP matching, but it is also possible to use other higher-performance equations.

例えばなるものを用いる方法が考えられる。ただし、この場合
には漸化式値は時刻（ｍ−２）まで、距離も時刻（ｍ−
１）まで保存しなくてはならないのでテーブル記憶Ｇ（
ｐ　、　ｎ　、　ｊ　）と同様なものを増設する必要が
ある。また、本説明では距離を基本として認識を行なっ
たが、相関を基本吉して認識する方法も考えられる。こ
の場合（とは、大小関係が逆になるので不説明の最大値
検出機能を、すべて最小値検出機能に置換する必要があ
る。また。For example, one possible method is to use Naru. However, in this case, the recurrence formula value is up to time (m-2), and the distance is also up to time (m-2).
Since it is necessary to save up to 1), table memory G (
It is necessary to add something similar to p, n, j). Further, in this explanation, recognition was performed based on distance, but a method of recognition based on correlation may also be considered. In this case, the magnitude relationship is reversed, so it is necessary to replace all unexplained maximum value detection functions with minimum value detection functions.Also.

本説明ではテーブル記憶０（ｐ　、　ｎ　、　ｊ　）お
よびＨ（ｐ、ｎ、ｊうは、フォートランで用いられる三
次元配列としＣあるが、ｐ、ｎ、ｊによりＣ指定される
メモリとしてよい。すなわら、チーフル記憶はｐ＋　”
　＋　Ｊのすべての範囲（テーブルサイズ”　ｐｍ−ｘ
　Ｘ　ｒｌｍａｘ　Ｘ　Ｊｍａｘ　）を持つのではなく
、使用される範囲（チーフルサイズ＝Σ　Ｊ’）のみを
持（ｐ、ｎ％Δ っているものとしてもよい。これらの自明な変更は本発
明の権利範囲に楓するものである。In this description, table storage 0 (p, n, j) and H (p, n, j) are three-dimensional arrays used in Fortran, but C may be a memory designated by p, n, j. In other words, Chiful memory is p+”
+ All ranges of J (table size” pm-x
Instead of having the range (X rlmax This covers the scope of rights.

[Brief explanation of drawings]

第１図は従来技術を説明するための図であり、第２図は
本発明の詳細な説明するための図であり、９〜第３図、第４図、第５図、第６図は本発明の実施−トマ
トン記憶部のメモリ構成図であり、第１０図である。図面において、１００・・・マイクロホン、１１０・・・入力部、１２
０・・・入力バタンバッファ、１３０・・・標準バタン
記憶部、１４０・・・ＤＰマツチング部、１４１・・・
距離計算部、１５０・・・ｑメモリ、１６０・・・Ｈメ
モリ、１７０・・・比較回路、　１８０・・・Ｐメモリ
、１９０・・・１Ｎメモリ、２００・・・Ｔメモリ。２１０・・・Ｌメモ、す、　２２０・・・判定部、２３
０・・・オートマトン記憶部、２４０・・・制御部、で
ある。入力へ〇タン、１１　ｍ第２図ジ＼カバ６９ンＡ　ｍ、ｉ第５図第６７第７図ｒ　−」土ｌ二ニョｊ− 第　８図（イの１）第９図（ｂ）第ｔｏ図・　−口ｍ百− ￥１１図　（子の１）第　１１０（イの２）Fig. 1 is a diagram for explaining the prior art, Fig. 2 is a diagram for explaining the present invention in detail, and Figs. FIG. 10 is a memory configuration diagram of an implementation of the present invention--a tomato storage unit. In the drawings, 100...microphone, 110...input section, 12
0... Input button buffer, 130... Standard button storage section, 140... DP matching section, 141...
Distance calculation section, 150...Q memory, 160...H memory, 170...comparison circuit, 180...P memory, 190...1N memory, 200...T memory. 210... L memo, 220... Judgment section, 23
0: automaton storage unit, 240: control unit. To input 〇tan, 11 m Fig. 2 Cover 69 A m, i Fig. 5 Fig. 67 Fig. 7 r - "Sat l Niyo j -" Fig. 8 (A-1) Fig. 9 (b) Figure to: -M100 Figure 11 (Child 1) No. 110 (A 2)

Claims

[Claims] Time series A=all, a! featuring input speech. ,...,
ai. ...e "I" input section and word set Σ=
For each word n in (n), the standard button B”=l)i
, lb; . ..., 呵..., +b:;・, and a state that is a group of rules (p, q, n) that means that state transition p-+q occurs due to the input of word n. An automaton storage unit that stores a transition table Δ and a group F of final states, and a table storage T(m, q) whose address is specified by an input slam time m and a state q. P(m, q), N(m, q) and L(m, q),
A table memory G(p. 1-1)・IL+1
．． .... 1-Distance d (m, j) between each feature vector 7 of the standard pattern corresponding to the word designation signal n sequentially specified in LL (IL is a positive integer) and the feature vector 4 of the input button.
and the table memory G (p , n, j) and H(p, n, j), the value of the table storage T(m-1, q) is a recursion boundary condition, and L(m 1*q) is a path boundary condition. As DP matching recurrence formula value g(m, j) and path value h(m,
m=(1-1) ・From IL+1, m=1-IL
and from j=1 to j=J', the DP matching unit stores the table memories G(p, n, j) and H(p, n, j), and the DP matching unit calculates Write the recurrence formula value g(m, J'') into T(m, q) to find the minimum value with respect to state q, and write the word name n that gave this minimum,
State specification p, route value h(m, J”) as N(m, q), P
(m, q), L(rn. q), and signals such as input kabatan time m, input kabatan block number i, word designation n, state designation p, q, standard baton time j, etc. and a control unit for generating the table memory T(m, q
) is determined as the minimum, and based on the minimum value q and time m, the table storage P(m, q), N(
a determination unit that determines a recognition result with reference to L(m, q) and L(m, q).