JPS62144200A

JPS62144200A - Continuous speech recognition device

Info

Publication number: JPS62144200A
Application number: JP60283044A
Authority: JP
Inventors: 桜庭　孝宏
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1985-12-18
Filing date: 1985-12-18
Publication date: 1987-06-27
Also published as: JPH0336436B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】〔概　要〕連続音声認識装置において、複数個の標４ｔパターンを
前半と後半の半単語標準パターンに分け、音声の入力パ
ターンを前半単語標準パターンについて連続ＤＰ照合を
行い、後半単語標単パターンについて逆時間連Ｖｔ　Ｄ
　Ｐ照合を行い、その結果から入力パターンの各フレー
ム毎にそのフレームを前記各半単語標準パターンの境界
とする標準パターンを選択し、それによって与えられた
中心候補を、その候補が位置するフレームから始端方向
に逆時間ＤＰ照合を行い、終端方向にＤＰ照合を行い、
与えられた各候補毎の部分区間照合距離からその合計が
最小となる様にＤＰ処理を行って認識結果を求める。[Detailed Description of the Invention] [Summary] In a continuous speech recognition device, a plurality of standard 4t patterns are divided into first half and second half word standard patterns, and the speech input pattern is subjected to continuous DP matching with respect to the first half word standard patterns. , Reverse time series Vt D for the second half word mark single pattern
P matching is performed, and from the result, for each frame of the input pattern, a standard pattern is selected that makes that frame the boundary of each half-word standard pattern, and the center candidate given by this is selected from the frame in which that candidate is located. Perform reverse time DP verification in the direction of the start end, perform DP verification in the direction of the end,
A recognition result is obtained by performing DP processing to minimize the sum of partial section matching distances for each given candidate.

[Industrial application field]

本発明は２、複数の単語を連続的に発声した音声を認識
する連続音声認識装置、特に動的計画法（Ｄｙｎａｍｉ
ｃ　ｐｒｏｇｒａｍｍｉｎｇ　、以下ＤＰという）によ
り行う連続音声の認、識を、効率良く且つ高い認識率を
もって行う様に改良した連続音声認識装置に関する。2. The present invention provides a continuous speech recognition device that recognizes speech in which a plurality of words are continuously uttered, and in particular a continuous speech recognition device that recognizes speech made by continuously uttering a plurality of words.
The present invention relates to a continuous speech recognition device that has been improved to efficiently recognize and recognize continuous speech using c programming (hereinafter referred to as DP) with a high recognition rate.

[Conventional technology]

複数の単語を連続的に発声仕手形成される連続音声を認
識する方式として各種の方式が提案されているが、代表
的なものとして次の方式がある。Various methods have been proposed for recognizing continuous speech formed by uttering a plurality of words in succession, and the following methods are representative.

第１の方式は、連続音声を形成する連続単語を、各単語
の境界での音声パワーの落ち込みなどの特徴を利用して
個々の単語に区分しくこの操作はセグメンテーションと
呼ばれる）、その区分された単語毎に標準パターンと照
合して認識する方式である。The first method is to segment continuous words that form continuous speech into individual words using characteristics such as a drop in speech power at the boundaries of each word (this operation is called segmentation). This method recognizes each word by comparing it with a standard pattern.

第２の方式は、連続単語の全ての区間の組を取り出して
標準パターンと照合し、その照合結果から最適である区
間の組合せを求めて認識する方式〔発明が解決しようと
する問題点〕連続音声を認識する方式のうち、前述の第１の方式は、
連続単語のセグメンテーションを行うことが非常に難し
く、且つ高い認識率が確保しにくいという問題があった
。The second method is a method of extracting all sets of intervals of continuous words, comparing them with a standard pattern, and finding and recognizing the optimal combination of intervals from the results of the comparison. [Problem to be solved by the invention] Continuous Among the voice recognition methods, the first method mentioned above is
There have been problems in that it is very difficult to segment continuous words and it is difficult to ensure a high recognition rate.

一方、第２の方式は、高い認識率が得られるが、演算量
が第１の方式に比べて格段に大きく、早いレスポンスタ
イムをもった連続音声認識が出来ないという問題があっ
た。On the other hand, the second method has the problem that although a high recognition rate can be obtained, the amount of calculation is much larger than that of the first method, and continuous speech recognition with a fast response time is not possible.

又、連続音声中の各単語の結合部分は前後の単語の影響
を受けて変形しやすいので、その位置を正確に特定する
ことは困難である。この為、ＤＰ照合により求められた
音声の入力パターンの各フレーム毎の終端候補又は始端
候補に誤りが生じ、認識率を低下させるという問題があ
った。Furthermore, since the joined part of each word in continuous speech is easily deformed due to the influence of the words before and after it, it is difficult to specify its position accurately. For this reason, there is a problem in that an error occurs in the end candidate or the start end candidate for each frame of the audio input pattern obtained by DP matching, which lowers the recognition rate.

本発明は、連続音声認識処理時の演算量を低下させて処
理効率を大幅に向上させると共に、音声入力パターンの
各フレーム毎の候補を正しく求めることにより高認識率
を確保出来る様にした連続音声認識装置を提供すること
を目的とする。The present invention reduces the amount of calculations during continuous speech recognition processing to significantly improve processing efficiency, and also ensures a high recognition rate by correctly determining candidates for each frame of the speech input pattern. The purpose is to provide a recognition device.

[Means for solving problems]

従来の連続音声認識装置における前述の各問題点を解決
する為に本発明が講じた手段を、第１図を参照して説明
する。The means taken by the present invention to solve the above-mentioned problems in the conventional continuous speech recognition device will be explained with reference to FIG.

第１図は、本発明の構成のブロック説明図である。FIG. 1 is a block diagram illustrating the configuration of the present invention.

第１図において、１１０は標準パターン辞書部で、複数
の音声単語の標準パターンが格納されている。読み出す
ときは、各標準パターンは前半の半単語に対する前半単
語標準パターンと後半の半単語に対する後半単語標準パ
ターンに区分されて読み出される。In FIG. 1, 110 is a standard pattern dictionary section in which standard patterns of a plurality of spoken words are stored. When reading, each standard pattern is divided into a first half word standard pattern for the first half word and a second half word standard pattern for the second half word.

１２０は半単語連続ＤＰ照合部で、入力パターンを複数
個の標準パターン中の各前半単語標準パターンについて
連続動的計画法による照合（連続ＤＰ照合）を行うと共
に、入力パターンを複数個の標準パターン中の各後半単
語標準パターンについて逆時間連続動的計画法による照
合（逆時間連続ＤＰ照合）を行う。Reference numeral 120 denotes a half-word continuous DP matching unit, which performs matching (continuous DP matching) using continuous dynamic programming for each first-half word standard pattern among a plurality of standard patterns with respect to the input pattern, and also matches the input pattern with a plurality of standard patterns. Verification using inverse time continuous dynamic programming (inverse time continuous DP matching) is performed for each standard pattern of the second half of the words.

１３０は中心候補選択部で、半単語連続ＤＰ照合部１２
０の結果から入力パターンの各フレーム毎にそのフレー
ムを前半単語及び後半単語の各標準パターンの境界とす
る標準パターンを求め、その標準パターンより中心候補
を選択する。130 is a central candidate selection unit, and a half-word continuous DP matching unit 12
From the result of 0, a standard pattern is found for each frame of the input pattern, with that frame as the boundary between the standard patterns of the first half word and the second half word, and a central candidate is selected from the standard pattern.

１４０は単語始終端レベルＤＰ処理部で、前記中心候補
が位置するフレームから始端方向に逆時開動的計画法に
よる照合（逆時間ＤＰ照合）を行うと共に、終端方向に
動的計画法による照合（ＤＰ照合）を行って部分区間照
合距離を求める。Reference numeral 140 denotes a word start/end level DP processing unit that performs matching by inverse time dynamic programming (inverse time DP matching) from the frame in which the center candidate is located toward the start end, and also performs matching (inverse time DP matching) by dynamic programming in the direction toward the end. DP matching) is performed to find the partial interval matching distance.

１５０は文レベルＤＰ処理部で、単語始終端レベルＤＰ
処理部１４０より入力された部分区間照合距離に基づい
て入力パターンの各桁毎に部分区間照合距離の合計が最
小となる組合せを求め、その結果から入力パターンが回
折入力であるかを判定して認識結果を求める。150 is a sentence level DP processing unit, which processes word start and end level DP.
Based on the partial interval matching distances input from the processing unit 140, a combination that minimizes the sum of the partial interval matching distances is determined for each digit of the input pattern, and based on the result, it is determined whether the input pattern is a diffraction input. Obtain recognition results.

なお、“桁”は各単語の位置関係を表現するもので、連
続単語の各単語を先頭から１桁目、２桁目・・・・・・
Ｘ桁目で表現する。Note that "digit" expresses the positional relationship of each word, and each word in a continuous word is placed in the first digit, second digit, etc. from the beginning.
Expressed in the X digit.

[For production]

第１図の作用を、第２図及び第３図を参照して説明する
。The operation of FIG. 1 will be explained with reference to FIGS. 2 and 3.

第２図は入力パターンと前後容土単語標準パターンの連
続ＤＰ照合及び逆時間連続ＤＰ照合及びそれらの照合ル
ート例の説明図、第３図は単語中心候補の終端及び始端
方向のＤＰ照合及び逆時間ＤＰ照合の説明図である。Figure 2 is an explanatory diagram of continuous DP matching and inverse time continuous DP matching between the input pattern and standard patterns of front and rear words, and examples of their matching routes. It is an explanatory diagram of time DP verification.

連続単語音声の場合には、前述の様に各単語の結合部分
がそれに隣接する前後の単語の影響を受けて変形しやす
いので、その位置を正確に特定することは困難であり、
その為ＤＰ照合により求められた入力パターンの各フレ
ーム毎の終端又は始端候補に誤りが生じ、認識率を低下
させていた。In the case of continuous word speech, as mentioned above, the joined part of each word is easily deformed by the influence of the adjacent words before and after it, so it is difficult to specify its position accurately.
Therefore, an error occurs in the end or start end candidates for each frame of the input pattern obtained by DP matching, reducing the recognition rate.

そこで、本発明は、連続単語音声の場合は、各単語の始
終端部はそれに隣接する前後の単語の影響を受けて変形
しやすいが、その中央部分は隣接する前後の影響を受け
ることが少く安定していることに着目し、連続音声の各
単語の標準パターンの候補をその単語の安定部分におい
て抽出する様にしたものである。Therefore, in the case of continuous word speech, the beginning and end of each word are easily deformed due to the influence of the words before and after it, but the center part is less affected by the words before and after it. Focusing on stability, the standard pattern candidates for each word in continuous speech are extracted from the stable portion of that word.

標準パターン辞書部１１０には複数の標準パターンが格
納されているが、読み出されるときは、各標準パターン
を前半と後半の半単語に２分し、前半の半単語に対する
前半単語標準パターンと後半の半単語に対する後半単語
標準パターンに区分されて読み出される。A plurality of standard patterns are stored in the standard pattern dictionary section 110. When reading out each standard pattern, each standard pattern is divided into two parts, the first half word and the second half word. The second half of a half word is divided into standard patterns and read out.

各標準パターンを前半と後半の半単語に分ける方法は、
一般に中心部分が安定しているので中心部分で２分され
るが、中心部分以外の安定している部分において２分す
ることも出来る。The method of dividing each standard pattern into the first and second half words is as follows:
Generally, the center part is stable, so it is divided into two parts at the center part, but it is also possible to divide it into two parts at a stable part other than the center part.

半単語連続ＤＰ照合部１２０は、第２図に示す様に、入
力パターンを複数個の標準パターンの各前半単語標準パ
ターンについて連続ＤＰ照合を行うと共に、各後半単語
標準パターンについて逆時間連続ＤＰ照合を行う。そし
て、各照合結果である連続ＤＰ照合距離と逆時間連続Ｄ
Ｐ照合距離を出力する。As shown in FIG. 2, the half-word continuous DP matching unit 120 performs continuous DP matching on each first-half word standard pattern of a plurality of standard patterns of the input pattern, and performs inverse time continuous DP matching on each second-half word standard pattern. I do. Then, each matching result is the continuous DP matching distance and the inverse time continuous D
Output the P matching distance.

中心候補選択部１３０は、半単語連続ＤＰ照合部１２０
より入力された連続ＤＰ照合距離及び逆時間連ＨＤｐ照
合距離に基づいて、入力パターンの各フレーム毎にその
フレームを前半及ヒ後半の半車語標準パターンの境界と
する標準パターンを求め、その標準パターンより中心候
補を選択する。The central candidate selection unit 130 includes the half-word continuous DP matching unit 120
Based on the continuous DP matching distance and the inverse time continuous HDp matching distance input from Select the center candidate from the pattern.

単語始終端レベルＤＰ処理部１４０は、中心候補選択部
１３０で選択された各中心候補につき、第３図に示す様
に、その中心候補が位置するフレーム（ｋで図示されて
いる）から始端方向に逆時間ＤＰ照合を行うと共に、終
端方向にＤＰ照合を行って、それぞれ始端中心照合距離
及び中心終端照合距離を求める。次いでこの両照合距離
に基づいて部分区間照合距離を求める。As shown in FIG. 3, for each center candidate selected by the center candidate selection section 130, the word start/end level DP processing unit 140 calculates the word start/end level DP from the frame in which the center candidate is located (indicated by k) in the direction of the start end. Inverse time DP verification is performed in the direction of the end, and DP verification is performed in the direction of the end to determine the start-end center verification distance and center-end verification distance, respectively. Next, a partial section matching distance is determined based on both matching distances.

文レベルＤＰ処理部１５０は、単語始終端レベルＤＰ処
理部１４０より入力された部分区間照合距離に基づいて
入力パターンの各桁毎に部分区間照合距離の合計が最小
となる組合せを求め、その結果から入力パターンが回折
入力であるかを判定して認識結果を求める。Sentence level DP processing section 150 finds a combination that minimizes the sum of subsection matching distances for each digit of the input pattern based on the subsection matching distances input from word start/end level DP processing section 140, and calculates the combination that minimizes the sum of subsection matching distances for each digit of the input pattern. A recognition result is obtained by determining whether the input pattern is a diffraction input.

以上の様に、連続ＤＰ照合により中心候補を求めて部分
区間候補を選択する様にしたので、入力パターンから全
ての部分区間の組合せを取り出して複数の標準パターン
と照合することにより部分区間候補を求める従来方式（
前述の第２の方式）に比べて大幅に演算量を低減させる
ことが出来る。As described above, since the center candidate is found by continuous DP matching and subinterval candidates are selected, subinterval candidates are selected by extracting all subinterval combinations from the input pattern and comparing them with multiple standard patterns. The desired conventional method (
The amount of calculation can be significantly reduced compared to the above-mentioned second method).

又、連続音声の各単語の標準パターンの候補をその単語
の安定部分において抽出する様にしたので、候補が正し
く求められて高認識率を確保することが出来る。Furthermore, since the standard pattern candidates for each word of continuous speech are extracted from the stable portion of that word, the candidates can be found correctly and a high recognition rate can be ensured.

〔Example〕

本発明の一実施例を、第２図〜第９図を参照して説明す
る。An embodiment of the present invention will be described with reference to FIGS. 2 to 9.

第４図は本発明の一実施例の構成の説明図、第５図は始
終端固定のＤＰ照合の説明図、第６図は連続ＤＰ照合の
説明図、第７図は逆時間連続ＤＰ照合の説明図、第８図
は連続ＤＰの演算手法の説明図、第９図は中心候補選択
動作の説明図である。Fig. 4 is an explanatory diagram of the configuration of an embodiment of the present invention, Fig. 5 is an explanatory diagram of DP verification with fixed start and end ends, Fig. 6 is an explanatory diagram of continuous DP verification, and Fig. 7 is an explanatory diagram of reverse time continuous DP verification. FIG. 8 is an explanatory diagram of the continuous DP calculation method, and FIG. 9 is an explanatory diagram of the center candidate selection operation.

第２図及び第３図については、既に説明した通りである
。2 and 3 are as already explained.

（Ａ）実施例の構成第４図において、標準パターン辞書部１１０、半単語連
続ＤＰ照合部１２０、中心候補選択部１３０、単語始終
端レベルＤＰ処理部１４０及び文レベルＤＰ処理部１５
０については、第１図で説明した通りである。(A) Configuration of the embodiment In FIG. 4, a standard pattern dictionary section 110, a half-word continuous DP matching section 120, a central candidate selection section 130, a word start/end level DP processing section 140, and a sentence level DP processing section 15 are shown.
0 is as explained in FIG.

半単語連ｆｉＤＰ照合部１２０において、１２１は連続
ＤＰ照合部で、入力パターンを複数個の標準パターンの
各前半単語標準パターンについて連続ＤＰ照合を行って
、連続ＤＰ照合距離（Ｇｆ　　（ｋ　、　ｎ）　）を求
める。１２２は逆時間連続ＤＰ照合部で、入力パターン
を複数個の標準パターンの各後半単語標準パターンにつ
いて逆時間連続ＤＰ照合を行って、逆時間連続ＤＰ照合
距離（Ｇｂ　（ｋ　、　ｎ）　）を求める。In the half-word series fiDP matching unit 120, reference numeral 121 is a continuous DP matching unit that performs continuous DP matching on each of the first half word standard patterns of a plurality of standard patterns for the input pattern, and calculates the continuous DP matching distance (Gf (k, n) ). Reference numeral 122 is a reverse time continuous DP matching unit, which performs reverse time continuous DP matching on each second half word standard pattern of a plurality of standard patterns of the input pattern to obtain a reverse time continuous DP matching distance (Gb (k, n)). .

単語始終端レベルＤＰ処理部１４０において、１４１は
逆時間ＤＰ照合部で、中心候補選択部１３０より入力さ
れた各中心候補につき、その中心候補が位置するフレー
ムから始端方向に逆時間ＤＰ照合を行って始端中心照合
距離（Ｇｓ（ｋ　、　Ｉｔ））を求める。１４２はＤＰ
照合部で、前記各中心候補につき、その中心候補が位置
するフレームから終端方向にＤＰ照合を行って中心終端
照合距離（Ｇｅ　（ｋ　、　ｍ）　）を求める。In the word start/end level DP processing unit 140, reference numeral 141 is a reverse time DP matching unit that performs reverse time DP matching for each center candidate input from the center candidate selection unit 130 from the frame in which the center candidate is located in the direction of the start end. to find the starting end center matching distance (Gs(k, It)). 142 is DP
The matching unit performs DP matching for each center candidate in the direction of the end from the frame in which the center candidate is located to obtain a center-end matching distance (Ge (k, m)).

１４３は部分区間候補選択部で、前記始端中心照合距離
Ｇ　ｓ　　（ｋ　ｙ　Ｉｔ　）及び中心終端照合距離Ｑ
ｅ（ｋ、ｍ）に基づいて部分区間照合距離（Ｄ（Ｉｔ、
ｍ））と対応する標準パターン（Ｎｓ（ｆｆ　、　ｍ＞
　　）を求める。Reference numeral 143 denotes a partial section candidate selection unit, which selects the start end center matching distance G s (k y It ) and the center end matching distance Q
Based on e(k, m), subinterval matching distance (D(It,
m)) and the corresponding standard pattern (Ns(ff, m>
).

文レベルＤＰ処理ｍ１５ｏにおいて、１５１は文レベル
ＤＰ照合部で、単語始終端レベルＤＰ処理部１４０より
入力された部分区間照合距離Ｄ（ｌ、ｍ）と対応する標
準パターンＮｓ　　（ｅ、ｍ）を用いて、最適な単語の
結合をＤＰ照合により求める。１５２は認識結果処理部
で、文レベルＤＰ照合部１５１の結果に基づいて入力パ
ターンが回折入力であるかを判定して認識結果を求める
。In the sentence level DP processing m15o, 151 is a sentence level DP matching unit that calculates a standard pattern Ns (e, m) corresponding to the subsection matching distance D (l, m) inputted from the word start/end level DP processing unit 140. The optimal combination of words is determined by DP matching. A recognition result processing unit 152 determines whether the input pattern is a diffraction input based on the result of the sentence level DP matching unit 151 and obtains a recognition result.

１６０はマイクロホン、１７０は入力された連続音声の
特徴を抽出して照合用の入力パターンに変換する音声特
徴抽出部、１８０はこの入力パターンが格納される入力
パターンバッファである。160 is a microphone; 170 is a voice feature extraction unit that extracts the features of the input continuous voice and converts it into an input pattern for comparison; and 180 is an input pattern buffer in which this input pattern is stored.

＜Ｂ）実施例の動作マイクロホン１６０は、使用者の発声した連続単語から
なる連続音声を音声特徴抽出部１７０に入力する。音声
特徴抽出部１７０は、入力された連続音声の特徴を抽出
して照合用の入力パターンに変換し、入力パターンバッ
ファ１８０に格納する。<B) The operational microphone 160 of the embodiment inputs continuous speech consisting of continuous words uttered by the user to the speech feature extraction unit 170. The audio feature extraction unit 170 extracts the features of the input continuous audio, converts it into an input pattern for verification, and stores it in the input pattern buffer 180.

このときの音声の入力パターンＡを次の様に表わす。The audio input pattern A at this time is expressed as follows.

Ａ＝　ａ（１１、ａ（２）　、　ａ（３１・−・・ａ（
ｍ）・・・・・ａ　（１）。A= a(11, a(2), a(31...a(
m)...a (1).

ここで、ａ　（ｍ）はフレームｍにおけるパラメータで
、例えば周波数方向を１６チヤネルに区分した場合、各
区分におけるパワースペクトル値で表わされるベクトル
量である。最後のＩは入力パターンＡの終端フレームで
、且つ入力パターンＡの語長を意味する。なお、フレー
ムは音声区間（正しくは窓関数によって切り出した音声
区間）である。Here, a (m) is a parameter in frame m, and for example, when the frequency direction is divided into 16 channels, it is a vector quantity represented by a power spectrum value in each division. The last I is the end frame of input pattern A and means the word length of input pattern A. Note that a frame is a voice section (more accurately, a voice section cut out by a window function).

同様に、標準パターン辞書部１１０の標準パターンは、
単語ｎの標準パターンをＢ　（ｎｌとして、次の様に表
わす。Similarly, the standard pattern of the standard pattern dictionary section 110 is
The standard pattern of word n is expressed as B (nl) as follows.

Ｂ（ｎ）＝ｂ（１，ｎ）、ｂ（２，ｎ）・　ｂ（ｊ、ｎ
）−ｂ（Ｊ（ｎ）、ｎ）ここで、ｂ（ｊ、ｎ）は、単語
ｎのフレームｊにおけるパラメータで、入力パターンＡ
と同様に、例えば１６チヤネルのパワースペクトル値な
どで表わされるベクトル量である。Ｊ　（ｎｌは単語ｎ
の終端フレームで、且つ標準パターンＢ（ｎ）の語長を
表わす。B(n)=b(1,n), b(2,n)・b(j,n
)-b(J(n), n) where b(j, n) is the parameter of word n in frame j and input pattern A
Similarly, it is a vector quantity expressed by, for example, power spectrum values of 16 channels. J (nl is word n
and represents the word length of the standard pattern B(n).

本発明では、入力パターンを標準パターンの前半単語標
準パターンと後半単語標準パターンについて、連続ＤＰ
照合と逆時間連続ＤＰ照合により半単語の照合を行う。In the present invention, the input pattern is converted into continuous DP for the first half word standard pattern and the second half word standard pattern of the standard pattern.
Half-word matching is performed by matching and reverse time continuous DP matching.

この処理は、単語中心レベルＤＰと呼ばれる。This process is called word center level DP.

（Ｂ−１）連続ＤＰ照合部１２１の動作通常のＤＰ照合
は入力パターンの始終端を固定したもので、第５図に示
す様に、始端と終端を通る一本の最適な照合経路が与え
られ、その経路の照合距離が求められる。(B-1) Operation of continuous DP matching unit 121 In normal DP matching, the start and end of the input pattern are fixed, and as shown in FIG. 5, one optimal matching path passing through the start and end is given. Then, the matching distance of the route is determined.

これに対し、連続ＤＰ照合は、第６図に示す様に、入力
パターンの任意のフレームを終端とする始端フリーなり
Ｐ照合をいう。これにより入力パターンＡの中に存在す
る最適な対応区間が自動的に抽出されて連［Ｄｐ照合距
離が求まる。この結果に基づいて、入力パターンＡ中に
標準パターンＢ　（ｎ）が存在しそうか、又、存在しそ
うな場合に、入力パターンＡのどのフレームを終端とす
るかが容易に求められる。Continuous DP matching, on the other hand, refers to P matching where the starting end is free and the end is any frame of the input pattern, as shown in FIG. As a result, the optimal corresponding section existing in the input pattern A is automatically extracted, and the series [Dp matching distance is determined. Based on this result, it can be easily determined whether the standard pattern B (n) is likely to exist in the input pattern A, and if so, which frame of the input pattern A should be the end.

（Ｂ−２）逆時間連続ＤＰ照合部１２２の動作逆時間連
続ＤＰ照合部１２２の行う逆時間連続ＤＰ照合は、第７
図に示す様に、入力パターンＡを逆時間方向に連続ＤＰ
照合を行うものである。これにより、入力パターンＡ中
に標準パターンＢ　（ｎ）が存在しそうか、又、存在し
そうな場合に、入力パターンＡのどのフレームを始端と
するかが容易に求められる。(B-2) Operation of the reverse time continuous DP verification unit 122 The reverse time continuous DP verification performed by the reverse time continuous DP verification unit 122 is performed by the seventh
As shown in the figure, the input pattern A is continuously DP in the reverse time direction.
This is for verification. This makes it easy to determine whether the standard pattern B (n) is likely to exist in the input pattern A, and if so, which frame of the input pattern A should be the starting point.

前述の連続ＤＰ照合及び逆時間連続ＤＰ照合において行
われる連続ＤＰの演算手法はマツチングパスの取り方等
によって種々の方法が有るが、ここでは第８図に示す様
な対称型を用いるものとする。There are various methods of continuous DP calculation performed in the above-mentioned continuous DP matching and inverse time continuous DP matching depending on how to take matching paths, etc., but here, a symmetric type as shown in FIG. 8 is used.

第８図において、格子点（１、Ｊ）上の標準パターンＢ
と入力パターンＡのベクトル間距離をｄ（ｉ、ｊ）とす
ると、次の漸化式によって照合距離ｇｆ　（ｉ、ｊ）が
求められる。In Figure 8, standard pattern B on grid point (1, J)
Let d(i, j) be the distance between the vectors of input pattern A and input pattern A, then the matching distance gf (i, j) can be found using the following recurrence formula.

ｊ＝１のとき、ｇ　ｆ（ｘｙｊ）＝ｇ　ｆ（ｉ・１）　＝　ｄ　（ｉ・
１）ｉ＝１　、ｊ＞１のとき、ｇｆ（１，ｊ）＝ｄ（１・ｊ）　十ｇ　’　（１９Ｊ−
１）ｊ＞１のとき、このｇｆ（ｉｙｊ）は、（１、１）点から（ｉ、ｊ）点
に至るまでの距離和を最小としたものである。When j=1, g f (xyj) = g f (i・1) = d (i・
1) When i=1, j>1, gf(1,j)=d(1・j) 10g' (19J-
1) When j>1, this gf(iyj) is the one that minimizes the sum of distances from point (1, 1) to point (i, j).

逆時間連続ＤＰ照合は入力パターンを反転させて逆時間
方向に連続ＤＰ照合を行うもので、その演算手法は前述
の連続ＤＰ照合時の演算手法と同様であるので、説明は
省略する。Reverse time continuous DP matching is performed by reversing the input pattern and performing continuous DP matching in the reverse time direction, and its calculation method is the same as the calculation method for continuous DP matching described above, so a description thereof will be omitted.

この連続ＤＰ照合及び逆時間連続ＤＰ照合により、標準
パターンＢの前半区間と後半区間とを、第２図に示す様
にそれぞれ照合する。Through this continuous DP matching and reverse time continuous DP matching, the first half section and the second half section of the standard pattern B are respectively matched as shown in FIG.

入力パターンのにフレーム上の連’ｆｌｔ、　Ｄ　Ｐ照
合距離をＧ　ｆ　（ｋ）とし、逆時間連続ＤＰ照合距離
をＧ　ｂ　（ｋ）とする。更に、このＧ　ｆ　（ｋ）及
びＧ　ｂ　（ｋｌは、Ｎ個の標準パターンに対してはＧ
ｆ（ｋ、ｎ）及びＧｂ（ｋ、ｎ）の形で表現されて、連
続ＤＰ照合部１２１及び逆時間連続ＤＰ照合部１２２よ
りそれぞれ出力される。Let G f (k) be the continuous DP matching distance of the input pattern on the frame, and G b (k) be the inverse time continuous DP matching distance. Furthermore, G f (k) and G b (kl are G for N standard patterns.
They are expressed in the form of f(k, n) and Gb(k, n), and are output from the continuous DP matching section 121 and the inverse time continuous DP matching section 122, respectively.

（Ｂ−３）中心候補選択部１３０の動作中心候補選択部
１３０における中心候補選択処理は、次の様にして行わ
れる。(B-3) The center candidate selection process in the motion center candidate selection section 130 of the center candidate selection section 130 is performed as follows.

半単語連続ＤＰ照合部１４０より入力されたＧｆ（ｋ、
ｎ）及びＧｂ（ｋ、ｎ）は、中心候補選択部１３０にお
いて次式で示す様に累積されて、累積半単語連続ＤＰ照
合距離ＤＣ（ｋ。Gf(k,
n) and Gb(k, n) are accumulated in the center candidate selection unit 130 as shown in the following equation, and the cumulative half-word consecutive DP matching distance DC(k) is accumulated.

ｎ）を生成する。n).

Ｄ　Ｃ（ｋ　、　ｎ）　＝　Ｇ　ｒ　（ｋ　、　ｎ）　
＋　Ｇ　ｂ　（ｋ　、　ｎ）ＤＣ（ｋ、ｎ）は、このま
までは標準パター７　Ｂ　ｆｎ）の個々の語長Ｊ　（ｎ
）によって、短いもの程値が小さくなる傾向が出る為、
入力パターンＡ及び標準パターンＢ　（ｎ）の各パター
ン長の和でＤＣ（ｋ　、　ｎ）の値を割算して時間正規
化を行う。D C (k, n) = G r (k, n)
+ G b (k, n) DC (k, n) is the individual word length J (n
), the shorter the value, the smaller the value.
Time normalization is performed by dividing the value of DC (k, n) by the sum of each pattern length of input pattern A and standard pattern B (n).

この時間正規化されたＤＣ（ｋ、ｎ）をＤＲ（ｋ　、　
ｎ）で表現すると、ＤＲ（ｋ、ｎ）は下式で表わされる
。This time-normalized DC(k, n) is converted to DR(k,
n), DR(k, n) is expressed by the following formula.

ＤＲ（ｋ、ｎ）−ＤＣ（ｋ、ｎ）／２　Ｊ（ｎ）ここで
２　Ｊ　（ｎ）を用いる理由は、入力パターンＡが標準
パターン！３　ｆｎ）と正しくマツチングしたー場合に
は、対応する入力パターン長はＪ　（ｎ）に近い値をと
るものと考えられるからである。もし、何らかの手法で
ＤＣ（ｋ、ｎ）のｌ、ｍに対する始端Ｌ（ｋ、ｎ）及び
終端Ｍ　（ｋ　、　ｎ）が判明している場合には、ＤＲ
（ｋ、ｎ）は次式で求められる。DR (k, n) - DC (k, n)/2 J (n) The reason for using 2 J (n) here is that input pattern A is a standard pattern! 3fn), the corresponding input pattern length is considered to take a value close to J(n). If the starting point L(k, n) and ending point M (k, n) for l, m of DC(k, n) are known by some method, DR
(k, n) is determined by the following formula.

Ｄ　Ｒ（ｋ　、　ｎ）　＝　Ｄ　Ｃ（ｋ　、ｎ）／（Ｊ
（ｎ）＋Ｍ（ｋ、ｎ）　　Ｌ（ｋ、ｎ）＋１）このＤＲ
（ｋ、ｎ）は、第９図に示す様に、ｎ毎に異った距離パ
ターンを示す。D R(k, n) = D C(k, n)/(J
(n)+M(k,n) L(k,n)+1) This DR
As shown in FIG. 9, (k, n) shows a different distance pattern for each n.

このＮ個の距離パターンをに毎に値の最小のものを選択
して、第９図の下段に示す様にそのｎをにフレームを中
心とする候補（中心候補）としてＮ　Ｃ（ｋ）に設定す
る。即ち、Ｎ　Ｃ（ｋｌ＝　ａｒｇ　ｍａｎ　（Ｄ　Ｒ
（ｋ　、ｎ））例えば、第９図において、ｋｉフレーム
では、ｎ＝２のＤＲ（ｋ、２）が最小となるので、ｎ＝
２の標準パターンをｋｉフレームを中心とする候補（中
心候補）としてＮ　Ｃ（２）に設定する。For each of these N distance patterns, select the one with the smallest value, and as shown in the lower part of Figure 9, select n as a candidate centered on the frame (center candidate) and convert it into N C (k). Set. That is, N C(kl= arg man (D R
(k, n)) For example, in FIG. 9, in ki frames, DR(k, 2) of n=2 is the minimum, so n=
The standard pattern of No. 2 is set in NC(2) as a candidate centered on the ki frame (center candidate).

以上の処理によって、入力パターンＡの各にフレームが
中心になるＢ　ｆｎ）が−個に限定される。Through the above processing, the number of frames Bfn) in which each input pattern A is centered is limited to -.

ＤＲ値の最小のものが最も候補に適すると考えられるの
で、各にフレーム毎の中心候補をＮＣ（ト））の１個に
限定することにより、各にフレーム毎の中心候補を正し
く選択することが出来ると共に、演算量を更に減少させ
ることが出来る。Since the one with the smallest DR value is considered to be the most suitable candidate, by limiting the number of center candidates for each frame to one (NC(g)), it is possible to correctly select the center candidate for each frame. In addition, the amount of calculation can be further reduced.

（Ｂ−４）ＤＰ照合部１４２及び逆時間ＤＰ照合部１４
１の動作（単語始終端レベルＤＰ）ＮＣ（ｋ）で示される中心候補を、ＤＰ照合部１４２に
よるＤＰ照合及び逆時間ＤＰ照合部１４１による逆時間
ＤＰ照合により、始終端のＤＰ照合距離を求める。(B-4) DP matching section 142 and reverse time DP matching section 14
Operation 1 (word start/end level DP) The center candidate indicated by NC(k) is subjected to DP matching by the DP matching unit 142 and reverse time DP matching by the inverse time DP matching unit 141 to calculate the DP matching distance of the start and end. demand.

ここで用いるＤＰの演算手法もマツチングパスの取り方
等によって種々の方法があるが、ここでは、前述の連続
ＤＰと同様に対称型を用いるものとする。即ち、ｉ＝ｌ　、ｊ＝１のとき：ｇ　ｅ　（１、１）　＝　ｄ　（１、１）ｉ＝ｌ　、ｈ
＞ｌのとき：ｇ　ｅ（Ｌｊ）＝ｄ（１，ｊ）＋ｇ　ｅ（Ｌｊ−１）ｉ
＞１　、ｊ＝１のとき；ｇ（ｉ、１）＝ｄ（ｉ、１）＋ｇ　ｅ（ｉ４，１）ｓ　
＞　ｌ　ｔ　ｊ＞　１のとき：このｇｅ（ｉ、ｊ）は、格子点（１、１）から（ｉ　、
　ｊ）点に至るまでの距離和を最小としたものである。There are various methods for calculating the DP used here depending on how to take matching paths, etc., but here, a symmetric type is used like the continuous DP described above. That is, when i=l, j=1: ge (1, 1) = d (1, 1)i=l, h
>l: ge(Lj)=d(1,j)+ge(Lj-1)i
>1, when j=1; g(i,1)=d(i,1)+g e(i4,1)s
> l t j > 1: This ge(i, j) is the transition from the grid point (1, 1) to (i,
j) Minimize the sum of distances to the point.

前記の各式によってＤＰ照合距離ｇｅ（ｉ。DP matching distance ge(i.

ｊ）が求まるが、標準パターン終端のものをＧｅ　（ｉ
ｆとする。j) is found, but the one at the end of the standard pattern is Ge (i
Let it be f.

又、逆時間ＤＰ照合距離ｇｓ（ｉ、ｊ）は、華にパター
ンを反転して照合するものであり、その演算手法はＤＰ
照合距離ｇｃ（ｔ＋Ｊ）と同様であるので、その説明は
省略する。標準パターン始端のものをＧｓ（１）とする
。In addition, the inverse time DP matching distance gs (i, j) is to match by inverting the pattern, and its calculation method is DP
Since it is the same as the matching distance gc(t+J), its explanation will be omitted. Let Gs(1) be the starting end of the standard pattern.

これらＤＰ照合及び逆時間ＤＰ照合により、単語中心候
補Ｎ　Ｃ（ｋ）の終端及び始端方向の照合距離を求める
。Through these DP matching and inverse time DP matching, matching distances in the direction of the end and start end of the word center candidate NC(k) are determined.

第３図は、中心候補が位置するフレーム（ｋで示されて
いる）から始端方向に逆時間Ｄ・Ｐ照合を行い、終端方
向にＤＰ照合を行って、そのフレームに対スる始端フレ
ームｌ及び終端フレームを求める例を示したものである
。なお、Ｈ（ＮＣ（ｋｌ）は、中心候補の標準パターン
の語長である。In Figure 3, reverse time DP matching is performed from the frame where the center candidate is located (indicated by k) in the direction of the start end, DP matching is performed in the direction of the end, and the starting end frame l is compared to that frame. This shows an example of finding the final frame. Note that H(NC(kl)) is the word length of the standard pattern of the center candidate.

ＤＰ照合部１４２がＤＰ照合により求めた中心終端照合
距離を、Ｇｅ（ｋ、ｍ）とし、又、逆時間ＤＰ照合部１
４１が逆時間ＤＰ照合により求めた始端中心照合距離を
、Ｑｓ（ｋ、Ｉりとする。The center-end matching distance obtained by the DP matching unit 142 by DP matching is Ge(k, m), and the inverse time DP matching unit 1
The starting end center matching distance obtained by 41 by inverse time DP matching is defined as Qs(k, I).

（Ｂ−５）部分区間候補選択部１４３の動作部分区間候
補選択部１４３は、入力されたＧｅ（ｋ、ｍ）及びＧｓ
（ｋ、７りを累積したものを、部分パターン長（ｍ−６
＋ｌ）と標準パターン長Ｊ（ＮＣ（ｋ））の和で割算し
て時間正規化する。この時間正規化された部分区間のＤ
Ｐ照合距離をＤＩ（ｋ、ｊ２．ｍ）とすると、次式で求
められる。即ち、次に部分区間候補選択部１４３は、このＤＩ（ｋ、ｊ！
、ｍ）をｋについて最小のものを選択してＤ（Ｉｔ、ｍ
）とし、対応する標準パターンをＮＳ（ｊ２．ｍ）に設
定する。(B-5) Operation of the partial section candidate selection section 143 The partial section candidate selection section 143 selects the input Ge(k, m) and Gs
The partial pattern length (m-6
+l) and the standard pattern length J (NC(k)) for time normalization. D of this time-normalized subinterval
Letting P matching distance be DI (k, j2.m), it can be determined by the following equation. That is, next, the partial section candidate selection unit 143 selects this DI(k, j!
, m) with respect to k and D(It, m
) and set the corresponding standard pattern to NS(j2.m).

Ｄ（ｊ！、ｍ）＝ｍｉｎ（ＤＩ　　（ｋ、Ａ、ｍ）１Ｎ
Ｓ　（１、ｍ）＝ＮＣ（ａｒｇｍｉｎ　（ＤＩ（ｋ　、
　１２　、　ｍ））ここで、“ａｒｇｍｉｎ”は、（）
内の最小北条に件を満足するｋを選定する機能をもったオペレータであ
る。D(j!, m)=min(DI(k,A,m)1N
S (1, m) = NC(argmin (DI(k,
12, m)) where “argmin” is ()
This is an operator that has the function of selecting k that satisfies the minimum Hojo condition within.

もし、終端候補が無いｍの場合は、Ｎ５ＣＩ。If m has no termination candidate, N5CI.

ｍ）に“０”を設定し、Ｄ（＃、ｍ）に最大値を設定し
ておく。m) is set to "0", and D(#, m) is set to the maximum value.

このＤ　（’　＋　ｍ）及びＮ　Ｓ　（１９ｍ　）によ
って、入力パターンの部分パターン（ｎ−ｍ）の最適な
照合距離（Ｄ　（ｌ、ｍ））及び照合相手（ＮＳ　（１
、ｍ）　）が決定されて、部分区間候補選択部１４３よ
り出力される。Based on this D (' + m) and N S (19m), the optimal matching distance (D (l, m)) and matching partner (NS (1)
, m)) are determined and output from the partial section candidate selection unit 143.

（Ｂ−６）文レベルＤＰ照合部１５１の動作文レベルＤ
Ｐ照合部１５１は、単語始終端レベルＤＰ処理部１４０
より入力されたＤ（β。(B-6) Action sentence level D of sentence level DP matching unit 151
The P matching unit 151 uses the word start/end level DP processing unit 140
D(β.

ｍ）及びＮＳ（ｊ２．ｍ）を用いて、最適な単語の結合
をＤＰ照合により求める。m) and NS (j2.m), find the optimal word combination by DP matching.

ここで、連続単語中の各単語を先頭から１桁目、２桁目
・・・・・・Ｘ桁目という表現で位置関係を表現するこ
とにする。Here, the positional relationship will be expressed by expressing each word in the continuous words as the first digit, second digit, . . . X digit from the beginning.

（イ）・１桁目の設定１桁目は、入力パターンの始端フレームからｍフレーム
の部分区間の躍層をＴ（ｍ、１）に設定する。即ち、Ｔ　（ｍ、　１）　＝Ｄ　（１、ｍ）又、その標準語の始端がどこにあるかを、ＬＴ（ｍ、１
）に設定する。即ち、ＬＴ　　（ｍ、１）＝１ここで、部分区間候補が存在しないＤ（１゜ｍ）につい
ては処理は行わず、ＮＴ　（ｍ、１）に“０”を、Ｔ（
ｍ、１）に最大値を設定してお（。(a) Setting of 1st digit The 1st digit sets the cline of a subsection of m frames from the start frame of the input pattern to T(m, 1). That is, T (m, 1) = D (1, m) Also, where the beginning of the standard language is, LT (m, 1
). That is, LT (m, 1) = 1 Here, no processing is performed for D (1°m) for which there is no subsection candidate, and NT (m, 1) is set to "0", and T (
m, 1) to the maximum value (.

（ロ）Ｘ桁目の設定、　Ｘ桁目では、ｌの位置が未定の為、次のＤＰ漸化式
で決定する。即ち、＋　Ｔ（Ｉｔ　ｙ　ｘ−１）　ｌ　　＋　”　）この場
合もＤ（ｊ！、ｍ）とｍについて部分区間候補が存在し
ない場合は処理しないで、ＮＴ（ｍ、ｘ）に“Ｏ”を、
Ｔ（ｍ、ｘ）に最大値を、またＬＴ（ｍ、ｘ）には（不
定値で良いが）“ｌ”を設定しておく。(b) Setting the X digit In the X digit, the position of l is undetermined, so it is determined using the following DP recurrence formula. In other words, + T (It y of,
The maximum value is set for T(m, x), and "l" is set for LT(m, x) (although an undefined value may be used).

（Ｂ−７）認識結果処理部１５２の動作認識結果処理部
１５２は、文しベルＤＰ照合部１５１により送られたＴ
　（ｍ、Ｘ）　、ＮＴ　（ｍ、ｘ）及びＬＴ（ｍ、ｘ）
中のＴ（ｍ、ｘ）から、先ず入力単語の桁数Ｘを、次式
から求める。(B-7) The motion recognition result processing unit 152 of the recognition result processing unit 152 receives the T
(m, x), NT (m, x) and LT (m, x)
First, the number of digits X of the input word is found from T(m, x) in the following equation.

Ｘ　＝ａｒｇｍｉｎ　（Ｔ（１、ｘ）／　Ｘ）ここで、
■は入力パターンの終端フレ□−ムである。X = argmin (T(1, x)/X) where,
■ is the final frame □- of the input pattern.

次に、認識結果処理部１５２は、Ｘ桁の認識結果をＬＴ
（ｍ、ｘ）とＮＴ（ｍ、ｘ）から次の（イ）及び（ロ）
によって求める。ここで、Ｘ桁の認識結果をＮ　Ｒ（Ｘ
）とし、又Ｎ　Ｒ（Ｘｌの始端フレームをＬ　（Ｘ）と
する。Next, the recognition result processing unit 152 converts the recognition result of X digits into LT
(m, x) and NT (m, x) to the following (a) and (b)
Find it by Here, the recognition result of X digits is N R(X
), and the starting frame of N R (Xl is L (X)).

（イ）Ｘ桁目の認識結果ＮＲ（Ｘ）＝ＮＴ（１，Ｘ）ＩｊＸ１＝　ＬＴ（Ｉ　、　Ｘ）（ロ）Ｘ−Ｘ桁目の認識結果ＮＲ（Ｘ−ｙ）−ＮＴ　ＥＬ（Ｘ−ｙ＋１）　、Ｘ−ｙ
　）Ｌ（Ｘ−１）　＝ＬＴ　ＣＬ（Ｘ−ｙ＋１）　、Ｘ
−ｙ　）以上の様にして、認識結果Ｎ　Ｒ（Ｘ）が、従
来よりも大幅に少い演算量で、且つ高い認識率をもって
求められる。(b) Recognition result of X-th digit NR (X) = NT (1, X) IjX1 = LT (I, -y+1), X-y
)L(X-1) =LT CL(X-y+1),X
-y) In the above manner, the recognition result N R(X) can be obtained with a significantly smaller amount of calculation than conventional methods and with a high recognition rate.

例えば演算量の場合、従来の方式では入力パターンから
全ての部分区間の組合せを取り出して複数の標準パター
ンと照合して部分区間候補を求めていたが、その部分区
間候補を求めるまでの演算量は、次の様になる。いま、
入力パターン長を２００フレーム、標準パターン長ヲ５
０フレーム、部分区間のＤＰ照合の窓幅を２０フレーム
、標準パターン数を１００個とすると、ＤＰ漸化式の演
算回数は、２００Ｘ　１００Ｘ５０Ｘ２０＝　２０．００００，０
００回である。For example, in the case of the amount of calculation, in the conventional method, subinterval candidates are obtained by extracting all combinations of subintervals from the input pattern and comparing them with multiple standard patterns. , becomes as follows. now,
Input pattern length is 200 frames, standard pattern length is 5
Assuming that the window width for DP matching of 0 frame and subinterval is 20 frames and the number of standard patterns is 100, the number of operations of the DP recurrence formula is 200X 100X50X20 = 20.0000,0
00 times.

これに対して、本発明による中心候補選択を行う方法で
は、同じ条件で２００Ｘ５０Ｘ　１００＋　２００Ｘ５０Ｘ２０となる
。従って、約ｌ／１７の演算量で済むことになり、大幅
な演算量の減少が実現できる。On the other hand, in the method of selecting center candidates according to the present invention, the result is 200X50X 100+200X50X20 under the same conditions. Therefore, the amount of calculation is only about 1/17, and a significant reduction in the amount of calculation can be achieved.

〔Effect of the invention〕

以上説明した様に、本発明によれば次の諸効果が得られ
る。As explained above, according to the present invention, the following effects can be obtained.

（イ）連続ＤＰ照合により中心候補を求めて部分区間候
補を選択する様にしたので、従来方式よりも大幅に演算
量を低減させることが出来る。(a) Since the center candidate is determined by continuous DP matching and the partial interval candidates are selected, the amount of calculation can be significantly reduced compared to the conventional method.

（ロ）各単語の安定部分においてその単語の標準パター
ンの候補を抽出する様にしたので、候補が正しく求めら
れ高認識率を確保することが出来る。(b) Since candidates for the standard pattern of each word are extracted from the stable part of that word, candidates can be found correctly and a high recognition rate can be ensured.

[Brief explanation of drawings]

第１図・・・本発明の詳細な説明図、第２図・・・前半単語標準パターンの連続ＤＰ照合及び
後半単語の逆時間連続ＤＰ照合並びにそれらの照合ルート例の説明図、第３図・・・単語中心候補の終端及び始端方向のＤＰ照
合及び逆時間照合の説明図、第４図・・・本発明の一実施例の構成の説明図、第５図
・・・始終端固定のＤＰ照合の説明図、第６図・・・連
続ＤＰ照合の説明図、第７図・・・逆時間連続ＤＰ照合の説明図、第８図・・
・連続ＤＰの演算手法の説明図、第９図・・・中心候補
選択動作の説明図。第１図及び第４図において、１１０・・・標準パターン辞書部、１２０・・・半単語
連続ＤＰ照合部、１３０・・・中心候補選択部、１４０
・・・単語始終端レベルＤＰ処理部、１５０・・・文し
ベルＤＰ処理部。Figure 1: A detailed explanatory diagram of the present invention. Figure 2: An explanatory diagram of continuous DP matching of the first half word standard pattern, reverse time continuous DP matching of the second half word, and examples of their matching routes. ...Explanatory diagram of DP matching and reverse time matching in the direction of the end and beginning of a word center candidate, FIG. 4...Explanatory diagram of the configuration of an embodiment of the present invention, FIG. An explanatory diagram of DP verification, Fig. 6... An explanatory diagram of continuous DP verification, Fig. 7... An explanatory diagram of reverse time continuous DP verification, Fig. 8...
・Explanatory diagram of continuous DP calculation method, FIG. 9...Explanatory diagram of center candidate selection operation. In FIG. 1 and FIG. 4, 110... Standard pattern dictionary section, 120... Half word continuous DP matching section, 130... Center candidate selection section, 140
. . . Word start/end level DP processing section, 150 . . . Sentence bell DP processing section.

Claims

[Claims] Speech in which a plurality of words are continuously uttered is converted into an input pattern in a speech feature extraction section, and the input pattern is compared with a plurality of standard patterns to select candidates for each section. In a continuous speech recognition device that obtains recognition by finding the one with the minimum matching distance, (a) the input pattern is matched by continuous dynamic programming for each first-half word standard pattern among a plurality of standard patterns; Half-word continuous DP in which input patterns are matched using inverse time continuous dynamic programming for each second-half word standard pattern among multiple standard patterns.
(b) From the results of the half-word continuous DP matching unit (120), a standard pattern is determined for each frame of the input pattern, with that frame as the boundary of each standard pattern of the first half word and the second half word; a center candidate selection unit (130) that selects a center candidate from the standard pattern; (d) a word start/end level DP processing unit (140) that performs matching according to the method and calculates a subinterval matching distance; A sentence level DP processing unit (150
), a continuous speech recognition device comprising: