JPH0336439B2

JPH0336439B2 -

Info

Publication number: JPH0336439B2
Application number: JP60287798A
Authority: JP
Inventors: Takahiro Sakuraba
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1985-12-23
Filing date: 1985-12-23
Publication date: 1991-05-31
Also published as: JPS62147496A

Description

【発明の詳細な説明】〔概要〕連続音声認識装置において、複数個の標準パタ
ーンを前半と後半の半単語標準パターンに分け、
音声の入力パターンを前半単語標準パターンにつ
いて連続DP照合を行い、後半単語標準パターン
について逆時間連続DP照合を行い、その結果を
ある閾値で限定したものから入力パターンの各フ
レーム毎にそのフレームを前記各半単語標準パタ
ーンの境界とする標準パターンを選択し、それに
よつて与えられた中心候補を、その候補が位置す
るフレームから始端方向に逆時間DP照合を行い、
終端方向にDP照合を行い、与えられた各候補毎
の部分区間照合距離からその合計が最小となる様
にDP処理を行つて認識結果を求める。[Detailed Description of the Invention] [Summary] In a continuous speech recognition device, a plurality of standard patterns are divided into first half and second half word standard patterns,
Continuous DP matching is performed on the first half word standard pattern of the speech input pattern, reverse time continuous DP matching is performed on the second half word standard pattern, and the results are limited by a certain threshold, and each frame of the input pattern is compared to the above. Select a standard pattern as the boundary of each half-word standard pattern, perform reverse time DP matching on the center candidate given by the standard pattern from the frame where the candidate is located in the direction of the start end,
DP matching is performed in the terminal direction, and DP processing is performed to minimize the sum of partial section matching distances for each given candidate to obtain a recognition result.

[Industrial application field]

本発明は、複数の単語を連続的に発声した音声
を認識する連続音声認識装置、特に動的計画法
（Dynamic programming、以下DPという）によ
り行う連続音声の認識を、効率良く且つ高い認識
率をもつて行う様に改良した連続音声認識装置に
関する。 The present invention is a continuous speech recognition device that recognizes speech in which multiple words are uttered continuously, and in particular recognizes continuous speech using dynamic programming (hereinafter referred to as DP), efficiently and with a high recognition rate. This paper relates to a continuous speech recognition device that has been improved so that it can be used in a long time.

[Conventional technology]

複数の単語を連続的に発声して形成される連続
音声を認識とする方式として各種の方式が提案さ
れているが、代表的なものとして次の方式があ
る。 Various methods have been proposed for recognizing continuous speech formed by continuously uttering a plurality of words, and the following methods are typical.

第１の方式は、連続音声を形成する連続単語
を、各単語の境界での音声パワーの落ち込みなど
の特徴を利用して個々の単語に区分し（この操作
はセグメンテーシヨンと呼ばれる）、その区分さ
れた単語毎に標準パターンと照合して認識する方
式である。 The first method divides continuous words that form continuous speech into individual words using characteristics such as a drop in voice power at the boundaries of each word (this operation is called segmentation). This method recognizes each word by comparing it with a standard pattern.

第２の方式は、連続単語の全ての区間の組を取
り出して標準パターンと照合し、その照合結果か
ら最適である区間の組合せを求めて認識する方式
である。 The second method is a method in which all sets of sections of continuous words are extracted and compared with a standard pattern, and an optimal combination of sections is determined and recognized from the matching results.

[Problem that the invention seeks to solve]

連続音声を認識する方式のうち、前述の第１の
方式は、連続単語のセグメンテーシヨンを行うこ
とが非常に難しく、且つ高い認識率が確保しにく
いという問題があつた。 Among the continuous speech recognition methods, the above-mentioned first method has problems in that it is very difficult to segment continuous words and it is difficult to ensure a high recognition rate.

一方、第２の方式は、高い認識率が得られる
が、演算量が第１の方式に比べて格段に大きく、
早いレスポンスタイムをもつた連続音声認識が出
来ないという問題があつた。 On the other hand, the second method achieves a high recognition rate, but the amount of calculation is much larger than the first method.
There was a problem that continuous speech recognition with fast response time was not possible.

又、連続音声中の各単語の結合部分は前後の単
語の影響を受けて変形しやすいので、その位置を
正確に特定することは困難である。この為、DP
照合により求められた音声の入力パターンの各フ
レーム毎の終端候補又は始端候補に誤りが生じ、
認識率を低下させるという問題があつた。 Furthermore, since the joined part of each word in continuous speech is easily deformed due to the influence of the words before and after it, it is difficult to specify its position accurately. For this reason, DP
An error occurs in the end candidate or start end candidate for each frame of the audio input pattern found by matching,
There was a problem that the recognition rate decreased.

そこで、これらの問題点を解決する為に、複数
個の標準パターンを前半と後半区間の半単語標準
パターンに分け、音声の入力パターンを前半単語
標準パターンについて連続DP照合を行い、後半
単語標準パターンについて逆時間連続DP照合を
行い、その結果から入力パターンの各フレーム毎
にそのフレームを前記各半単語標準パターンの境
界とする標準パターンを１個践択して中心候補ち
し、与えられた中心候補を、その候補が位置する
フレームから始端方向に逆時間DP照合を行い、
終端方向にDP照合を行い、与えられた各候補毎
の部分区間照合距離からその合計が最小となる様
にDP処理を行つて認識結果を求める連続音声認
識装置（以下、原発明という）が、同一出願人に
よつて提案されている。 Therefore, in order to solve these problems, multiple standard patterns are divided into half-word standard patterns for the first half and second half, and continuous DP matching is performed on the voice input pattern for the first half word standard pattern, and the second half word standard pattern is Perform inverse time continuous DP matching on Perform inverse time DP matching on candidates from the frame where the candidate is located in the direction of the starting edge,
A continuous speech recognition device (hereinafter referred to as the original invention) that performs DP matching in the terminal direction and obtains a recognition result by performing DP processing such that the sum becomes the minimum from the partial interval matching distance for each given candidate. Proposed by the same applicant.

この原発明は、連続DP照合により中心候補を
選択する様にしたので大幅に演算量を低減させる
ことが出来ると共に、各単語の安定部（例えば中
心部分）においてその単語の標準パターンの候補
を抽出する様にしたので、候補が正しく求められ
て高い認識率を得ることが出来るものである。 This original invention selects the center candidate by continuous DP matching, which can significantly reduce the amount of calculation, and also extracts standard pattern candidates for each word in the stable part (for example, the center part). As a result, candidates can be found correctly and a high recognition rate can be obtained.

然しながら、各フレーム毎に必ず中心候補が存
在するので中心候補の数が多くなること、また照
合距離が大きく正当な中心候補となり得ない場合
でも各フレーム毎に必ず中心候補が選択されるの
で以降の処理に無駄が多くなること等の為に、演
算量の低減効果が損われるという問題があつた。 However, since there is always a center candidate for each frame, the number of center candidates increases, and even if the matching distance is large and it cannot be a valid center candidate, a center candidate is always selected for each frame, so the following There was a problem in that the effect of reducing the amount of calculations was lost due to increased processing waste.

なお、原発明の内容と本出願の発明の内容とは
共通する部分が多いので、後記の実施例の説明中
において、本発明の実施例と関連させて適宜説明
する。 It should be noted that since the content of the original invention and the content of the invention of the present application have many parts in common, they will be appropriately explained in connection with the embodiments of the present invention in the description of the embodiments below.

本発明は、中心候補の存在する入力パターンの
フレームを限定することにより、従来方式よりも
大幅に演算量を低減させた前記原発明よりも更に
演算量を低減して処理効率を向上させ、早いレス
ポンスタイムをもつた連続音声認識装置を提供す
ることを目的とする。 By limiting the frames of the input pattern in which the center candidate exists, the present invention further reduces the amount of calculations and improves processing efficiency than the original invention, which significantly reduces the amount of calculations compared to the conventional method. The purpose of this invention is to provide a continuous speech recognition device with response time.

[Means for solving problems]

従来の連続音声認識装置における前述の各問題
点を解決する為に本発明が講じた手段を、第１図
を参照して説明する。 The means taken by the present invention to solve the above-mentioned problems in the conventional continuous speech recognition device will be explained with reference to FIG.

第１図は、本発明の構成のブロツク説明図であ
る。 FIG. 1 is a block diagram illustrating the configuration of the present invention.

第１図において、１１０は標準パターン辞書部
で、複数の音声単語の標準パターンが格納されて
いる。読み出すときは、各標準パターンは前半の
半単語に対する前半単語標準パターンと後半の半
単語に対する後半単語標準パターンに区分されて
読み出される。 In FIG. 1, 110 is a standard pattern dictionary section in which standard patterns of a plurality of spoken words are stored. When reading, each standard pattern is divided into a first half word standard pattern for the first half word and a second half word standard pattern for the second half word.

１２０は半単語連続DP照合部で、入力パター
ンを複数個の標準パターン中の各前半単語標準パ
ターンについて連続動的計画法による照合（連続
DP照合）を行うと共に、入力パターンを複数個
の標準パターン中の各後半単語標準パターンにつ
いて逆時間連続動的計画法による照合（逆時間連
続DP照合）を行う。 120 is a half-word continuous DP matching unit that performs matching (continuous
DP matching) is performed, and the input pattern is matched using inverse time continuous dynamic programming (inverse time continuous DP matching) for each second half word standard pattern among a plurality of standard patterns.

１３０は中心候補限定選択部で、半単語連続
DP照合部１２０の結果を極小値で限定したもの
から入力パターンの各フレーム毎にそのフレーム
を前半単語及び後半単語の各標準パターンの境界
とする標準パターンを求め、その標準パターンよ
り中心候補を選択する。 130 is the central candidate limited selection part, half word consecutive
From the results of the DP matching unit 120 limited to minimum values, for each frame of the input pattern, a standard pattern is found that makes that frame the boundary of each standard pattern of the first half word and the second half word, and a central candidate is selected from the standard pattern. do.

１４０は単語始終端レベルDP処理部で、前記
中心候補が位置するフレームから始端方向に逆時
間動的計画法による照合（逆時間DP照合）を行
うと共に、終端方向に動的計画法による照合
（DP照合）を行つて部分区間照合距離を求める。 Reference numeral 140 is a word start/end level DP processing unit that performs matching by inverse time dynamic programming (inverse time DP matching) from the frame in which the center candidate is located in the direction toward the start end, and also performs matching (inverse time DP matching) by dynamic programming in the direction toward the end. DP matching) to find the subinterval matching distance.

１５０は文レベルDP処理部で、単語始終端レ
ベルDP処理部１４０より入力された部分区間照
合距離に基づいて入力パターンの各桁毎に部分区
間照合距離の合計が最小となる組合せを求め、そ
の結果から入力パターンが何桁入力であるかを判
定して認識結果を求める。 Reference numeral 150 denotes a sentence level DP processing unit, which calculates the combination that minimizes the sum of the subsection matching distances for each digit of the input pattern based on the subsection matching distances input from the word start/end level DP processing unit 140. Based on the results, it is determined how many digits the input pattern contains, and a recognition result is obtained.

なお、“桁”は各単語の位置関係を表現するも
ので、連続単語の各単語を先頭から１桁目、２桁
目……ｘ桁目で表現する。 Note that "digit" expresses the positional relationship of each word, and each word of a continuous word is expressed as the first digit, second digit, . . . xth digit from the beginning.

[Effect]

第１図の作用を、第２図及び第３図を参照して
説明する。 The operation of FIG. 1 will be explained with reference to FIGS. 2 and 3.

第２図は入力パターンと前後各半単語標準パタ
ーンの連続DP照合及び逆時間連続DP照合及びそ
れらの照合ルート例の説明図、第３図は単語中心
候補の終端及び始端方向のDP照合及び逆時間DP
照合の説明図である。 Figure 2 is an explanatory diagram of continuous DP matching and inverse time continuous DP matching between the input pattern and each half-word standard pattern before and after, and examples of their matching routes. time dp
It is an explanatory diagram of collation.

連続単語音声の場合には、前述の様に各単語の
結合部分がそれに隣接する前後の単語の影響を受
けて変形しやすいので、その位置を正確に特定す
ることは困難であり、その為DP照合により求め
られた入力パターンの各フレーム毎の終端又は始
端候補に誤りが生じ、認識率を低下させていた。 In the case of continuous word speech, as mentioned above, the joined part of each word is easily deformed due to the influence of the adjacent words before and after it, so it is difficult to specify its position accurately, so DP An error occurs in the end or start end candidates for each frame of the input pattern found through matching, reducing the recognition rate.

そこで、本発明は、連続単語音声の場合は、各
単語の始終端部はそれに隣接する前後の単語の影
響を受けて変形しやすいが、その中央部分は隣接
する前後の影響を受けることが少く安定している
ことに着目し、連続音声の各単語の標準パターン
の候補をその単語の安定部分において抽出する様
にしたものである。 Therefore, in the case of continuous word speech, the beginning and end of each word are easily deformed due to the influence of the words before and after it, but the center part is less affected by the words before and after it. Focusing on stability, the standard pattern candidates for each word in continuous speech are extracted from the stable portion of that word.

標準パターン辞書部１１０には複数の標準パタ
ーンが格納されているが、読み出されるときは、
各標準パターンを前半と後半の半単語に２分し、
前半の半単語に対する前半単語標準パターンと後
半の半単語に対する後半単語標準パターンに区分
されて読み出される。 A plurality of standard patterns are stored in the standard pattern dictionary section 110, but when read out,
Divide each standard pattern into two half words, the first half and the second half.
The first half word standard pattern for the first half word and the second half word standard pattern for the second half word are read out.

各標準パターンを前半と後半の半単語に分ける
方法は、一般に中心部分が安定しているので中心
部分で２分されるが、中心部分以外の安定してい
る部分において２分することも出来る。 In the method of dividing each standard pattern into half words, the first half and the second half, the central part is generally stable, so it is divided into two at the central part, but it is also possible to divide it into two at a stable part other than the central part.

半単語連続DP照合部１２０は、第２図に示す
様に、入力パターンを複数個の標準パターンの各
前半単語標準パターンについて連続DP照合を行
うと共に、各後半単語標準パターンについて逆時
間連続DP照合を行う。そして、各照合結果であ
る連続DP照合距離と逆時間連続DP照合距離を出
力する。 As shown in FIG. 2, the half-word continuous DP matching unit 120 performs continuous DP matching on each of the first half word standard patterns of a plurality of standard patterns of the input pattern, and performs inverse time continuous DP matching on each second half word standard pattern. I do. Then, the continuous DP matching distance and the inverse time continuous DP matching distance, which are the matching results, are output.

中心候補限定選択部１３０は、半単語連続DP
照合部１２０より入力された連続DP照合距離及
び逆時間連続DP照合距離の累積結果を極小値で
限定したものに基づいて、入力パターンの各フレ
ーム毎にそのフレームを前半及び後半の半単語標
準パターンの境界とする標準パターンを求め、そ
の標準パターンより中心候補を選択する。 The central candidate limited selection unit 130 selects half-word continuous DP
Based on the cumulative results of the continuous DP matching distance and the inverse time continuous DP matching distance inputted from the matching unit 120, which are limited to minimum values, each frame of the input pattern is compared to the first half and second half of the half-word standard pattern. A standard pattern is found as the boundary of , and a center candidate is selected from that standard pattern.

これにより、中心候補の数は限定され、且つ正
当な中心候補となり得るものが選択されるので、
以降の演算量を大きく低減することが出来る。 As a result, the number of center candidates is limited, and those that can be valid center candidates are selected.
The amount of subsequent calculations can be greatly reduced.

単語始終端レベルDP処理部１４０は、中心候
補限定選択部１３０で選択された各中心候補につ
き、第３図に示す様に、その中心候補が位置する
フレーム（ｋで図示されている）から始端方向に
逆時間DP照合を行うと共に、終端方向にDP照合
を行つて、それぞれ始端中心照合距離及び中心終
端照合距離を求める。次いでこの両照合距離に基
づいて部分区間照合距離を求める。 As shown in FIG. 3, for each center candidate selected by the center candidate limited selection section 130, the word start/end level DP processing section 140 extracts the start end from the frame in which the center candidate is located (indicated by k). Reverse time DP matching is performed in the direction, and DP matching is performed in the end direction to find the starting end center matching distance and the center ending matching distance, respectively. Next, a partial section matching distance is determined based on both matching distances.

文レベルDP処理部１５０は、単語始終端レベ
ルDP処理部１４０より入力された部分区間照合
距離に基づいて入力パターンの各桁毎に部分区間
照合距離の合計が最小となる組合せを求め、その
結果から入力パターンが何桁入力であるかを判定
して認識結果を求める。 The sentence level DP processing unit 150 finds a combination that minimizes the sum of the subsection matching distances for each digit of the input pattern based on the subsection matching distances input from the word start/end level DP processing unit 140, and calculates the combination that minimizes the sum of the subsection matching distances for each digit of the input pattern. The recognition result is determined by determining how many digits the input pattern contains.

以上の様に、連続DP照合により中心候補を求
め、且つ極小値で中心候補を限定して部分区間候
補を選択する様にしたので、入力パターンから全
ての部分区間の組合せを取り出して複数の標準パ
ターンと照合することにより部分区間候補を求め
る従来方式（前述の第２の方式）よりはもちろん
のこと、前記原発明よりも大幅に演算量を低減さ
せ、早いレスポンスタイムを得ることが出来る。
又、連続音声の各単語の標準パターンの候補をそ
の単語の安定部分において抽出する様したので、
候補が正しく求められて高認識率を確保すること
が出来る。 As described above, center candidates are obtained by continuous DP matching, and subinterval candidates are selected by limiting the center candidates to the minimum value. Therefore, all subinterval combinations are extracted from the input pattern and multiple standard It is possible to significantly reduce the amount of calculation and obtain a faster response time than the conventional method (the above-mentioned second method) in which subsection candidates are found by matching patterns, as well as the original invention.
In addition, we extracted standard pattern candidates for each word in continuous speech from stable parts of that word.
Candidates can be found correctly and a high recognition rate can be ensured.

〔Example〕

本発明の一実施例を、第２図〜第９図を参照し
て説明する。 An embodiment of the present invention will be described with reference to FIGS. 2 to 9.

第４図は本発明の一実施例の構成の説明図、第
５図は始終端固定のDP照合の説明図、第６図は
連続DP照合の説明図、第７図は逆時間連続DP照
合の説明図、第８図は連続DPの演算手法の説明
図、第９図は中心候補フレーム限定動作及び中心
候補選択動作の説明図である。第２図及び第３図
については、既に説明した通りである。 Fig. 4 is an explanatory diagram of the configuration of an embodiment of the present invention, Fig. 5 is an explanatory diagram of DP verification with fixed start and end ends, Fig. 6 is an explanatory diagram of continuous DP verification, and Fig. 7 is an explanatory diagram of reverse time continuous DP verification. FIG. 8 is an explanatory diagram of the continuous DP calculation method, and FIG. 9 is an explanatory diagram of the center candidate frame limiting operation and the center candidate selection operation. 2 and 3 are as already explained.

(A) 実施例の構成第４図において、標準パターン辞書部１１
０、半単語連続DP照合部１２０、中心候限定
補選択部１３０、単語始終端レベルDP処理部
１４０及び文レベルDP処理部１５０について
は、第１図で説明した通りである。(A) Configuration of the embodiment In FIG. 4, the standard pattern dictionary section 11
0, the half-word continuous DP matching unit 120, the central candidate selection unit 130, the word start/end level DP processing unit 140, and the sentence level DP processing unit 150 are as described in FIG.

半単語連続DP照合部１２０において、１２
１は連続DP照合部で、入力パターンを複数個
の標準パターンの各前半単語標準パターンにつ
いて連続DP照合を行つて、連続DP照合距離
（Gf（ｋ，ｎ））を求める。１２２は逆時間連続
DP照合部で、入力パターンを複数個の標準パ
ターンの各後半単語標準パターンについて逆時
間連続DP照合を行つて、逆時間連続DP照合距
離（Gb（ｋ，ｎ））を求める。 In the half-word continuous DP matching unit 120, 12
1 is a continuous DP matching unit that performs continuous DP matching on each first half word standard pattern of a plurality of standard patterns to obtain a continuous DP matching distance (Gf (k, n)). 122 is backward time continuous
The DP matching section performs reverse time continuous DP matching on each of the second half word standard patterns of the input pattern to obtain a reverse time continuous DP matching distance (Gb (k, n)).

中心候補限定選択部１３０において、１３１
は中心候補フレーム限定部で、半単語連続DP
連続DP照合部１２０より入力された連続DP照
合距離Gf（ｋ，ｎ）及び逆時間連続DP照合距
離Gb（ｋ，ｎ）の累積値を正規化し、その値
（DS（ｋ，ｎ））が極小値をとるフレームを中心
候補に選択する対象となるフレームとする。１
３２は中心候補選択部で、前記限定されたフレ
ームからその正規化された累積半単語連続DP
照合距離DS（ｋ，ｎ）が最小のフレームを求
め、そのフレームを中心とする標準パターンを
中心候補として選択する。 In the center candidate limited selection unit 130, 131
is the central candidate frame limited part, and half-word continuous DP
The cumulative values of the continuous DP matching distance Gf (k, n) and the inverse time continuous DP matching distance Gb (k, n) input from the continuous DP matching unit 120 are normalized, and the value (DS (k, n)) is The frame that takes the minimum value is the frame to be selected as the center candidate. 1
32 is a central candidate selection unit that selects the normalized cumulative half-word continuous DP from the limited frames.
A frame with the minimum matching distance DS (k, n) is found, and a standard pattern centered on that frame is selected as a center candidate.

単語始終端レベルDP処理部１４０において、
１４１は逆時間DP照合部で、中心候補限定選
択部１３０より入力された各中心候補につき、
その中心候補が位置するフレームから始端方向
に逆時間DP照合を行つて始端中心照合距離
（Gs（ｋ，ｌ））を求める。１４２はDP照合部
で、前記各中心候補につき、その中心候補が位
置するフレームから終端方向にDP照合を行つ
て中心終端照合距離（Ge（ｋ，ｍ））を求める。
１４３は部分区間候補選択部で、前記始端中心
照合距離Gs（ｋ，ｌ）及び中心終端照合距離Ge
（ｋ，ｍ）に基づいて部分区間照合距離（Ｄ
（ｌ，ｍ））と対応する標準パターン（Ns（ｌ，
ｍ））を求める。 In the word start/end level DP processing unit 140,
Reference numeral 141 denotes an inverse time DP matching unit, for each center candidate input from the center candidate limited selection unit 130,
Reverse time DP matching is performed in the direction of the starting edge from the frame where the center candidate is located to find the starting edge center matching distance (Gs (k, l)). Reference numeral 142 denotes a DP matching unit, which performs DP matching for each center candidate in the direction of the end from the frame in which the center candidate is located to obtain a center-end matching distance (Ge (k, m)).
Reference numeral 143 denotes a partial section candidate selection unit, which selects the start end center matching distance Gs (k, l) and the center end matching distance Ge.
(k, m) based on subinterval matching distance (D
(l, m)) and the corresponding standard pattern (Ns(l, m))
Find m)).

文レベルDP処理部１５０において、１５１
は文レベルDP照合部で、単語始終端レベルDP
処理部１４０より入力された部分区間照合距離
Ｄ（ｌ，ｍ）と対応する標準パターンNs（ｌ，
ｍ）を用いて、最適な単語の結合をDP照合に
より求める。１５２は認識結果処理部で、文レ
ベルDP照合部１５１の結果に基づいて入力パ
ターンが何桁入力であるかを判定して認識結果
を求める。 In the sentence level DP processing unit 150, 151
is the sentence level DP matching part, and the word start/end level DP
The standard pattern Ns(l, m) corresponding to the subsection matching distance D(l, m) input from the processing unit 140 is
m) to find the optimal combination of words by DP matching. A recognition result processing unit 152 determines how many digits the input pattern contains based on the result of the sentence level DP matching unit 151 to obtain a recognition result.

１６０はマイクロホン、１７０は入力された
連続音声の特徴を抽出して照合用の入力パター
ンに変換する音声特徴抽出部、１８０はこの入
力パターンが格納される入力パターンバツフア
である。 160 is a microphone; 170 is a voice feature extraction unit that extracts the features of the input continuous voice and converts it into an input pattern for verification; and 180 is an input pattern buffer in which this input pattern is stored.

(B) 実施例の動作マイクロホン１６０は、使用者の発声した連
続単語からなる連続音声を音声特徴抽出部１７
０に入力する。音声特徴抽出部１７０は、入力
された連続音声の特徴を抽出して照合用の入力
パターンに変換し、入力パターンバツフア１８
０に格納する。(B) Operation of the embodiment The microphone 160 extracts continuous speech consisting of continuous words uttered by the user from the speech feature extraction unit 17.
Enter 0. The audio feature extraction unit 170 extracts the features of the input continuous audio, converts it into an input pattern for matching, and stores it in the input pattern buffer 18.
Store at 0.

このときの音声の入力パターンＡを次の様に
表わす。 The audio input pattern A at this time is expressed as follows.

Ａ＝ａ(1)，ａ(2)，ａ(3)，…ａ(m)…ａ() ここで、ａ（ｍ）はフレームｍにおけるパラ
メータで、例えば周波数方向を16チヤネルに区
分した場合、各区分におけるパワースペクトル
値で表わされるベクトル量である。最後のＩは
入力パターンＡの終端フレームで、且つ入力パ
ターンＡの語長を意味する。なお、フレームは
音声区間（正しくは窓関数によつて切り出した
音声区間）である。 A=a(1),a(2),a(3),...a(m)...a() Here, a(m) is a parameter in frame m, for example, when the frequency direction is divided into 16 channels. , is a vector quantity expressed by a power spectrum value in each section. The last I is the end frame of input pattern A and means the word length of input pattern A. Note that a frame is a voice section (more accurately, a voice section cut out using a window function).

同様に、標準パターン辞書部１１０の標準パ
ターンは、単語ｎの標準パターンをＢ（ｎ）と
して、次の様に表わす。 Similarly, the standard pattern of the standard pattern dictionary section 110 is expressed as follows, where B(n) is the standard pattern of word n.

Ｂ（ｎ）＝ｂ（１，ｎ），ｂ（２，ｎ）…ｂ（ｊ，
ｎ）…ｂ（Ｊ（ｎ），ｎ）ここで、ｂ（ｊ，ｎ）は、単語ｎのフレーム
ｊにおけるパラメータで、入力パターンＡと同
様に、例えば16チヤネルのパワースペクトル値
などで表わされるベクトル量である。Ｊ（ｎ）
は単語ｎの終端フレームで、且つ標準パターン
Ｂ（ｎ）の語長を表わす。 B(n) = b(1, n), b(2, n)...b(j,
n)...b(J(n), n) Here, b(j, n) is a parameter of word n in frame j, and similarly to input pattern A, it is expressed by, for example, the power spectrum value of 16 channels. It is a vector quantity. J(n)
is the final frame of word n and represents the word length of standard pattern B(n).

本発明では、入力パターンを標準パターンの
前半単語標準パターンと後半単語標準パターン
について、連続DP照合と逆時間連続DP照合に
より半単語の照合を行う。この処理は、単語中
心レベルDPと呼ばれる。 In the present invention, half-word matching is performed using continuous DP matching and reverse time continuous DP matching for the first half word standard pattern and the second half word standard pattern of the input pattern. This process is called word center level DP.

(B‐1) 連続DP照合部１２１の動作通常のDP照合は入力パターンの始終端を
固定したもので、第５図に示す様に、始端と
終端を通る一本の最適な照合経路が与えら
れ、その経路の照合距離が求められる。(B-1) Operation of continuous DP matching unit 121 In normal DP matching, the start and end of the input pattern are fixed, and as shown in Figure 5, one optimal matching path passing through the start and end is given. Then, the matching distance of the route is determined.

これに対して、連続DP照合は、第６図に
示す様に、入力パターンの任意のフレームを
終端とする始端フリーなDP照合をいう。こ
れにより入力パターンＡの中に存在する最適
な対応区間が自動的に抽出されて連続DP照
合距離が求まる。この結果に基づいて、入力
パターンＡ中に標準パターンＢ（ｎ）が存在
しそうか、又、存在しそうな場合に、入力パ
ターンＡのどのフレームを終端とするかが容
易に求められる。 On the other hand, continuous DP matching, as shown in FIG. 6, refers to DP matching that is free from the start and ends at any frame of the input pattern. As a result, the optimal corresponding section existing in the input pattern A is automatically extracted, and the continuous DP matching distance is determined. Based on this result, it can be easily determined whether the standard pattern B(n) is likely to exist in the input pattern A, and if so, which frame of the input pattern A should be the end.

(B‐2) 逆時間連続DP照合部１２２の動作逆時間連続DP照合部１２２の行う逆時間
連続DP照合は、第７図に示す様に、入力パ
ターンＡを逆時間方向に連続DP照合を行う
ものである。これにより、入力パターンＡ中
に標準パターンＢ（ｎ）が存在しそうか、又、
存在しそうな場合に、入力パターンＡのどの
フレームを始端とするかが容易に求められ
る。(B-2) Operation of the backward time continuous DP matching section 122 The backward time continuous DP matching performed by the backward time continuous DP matching section 122 performs continuous DP matching on the input pattern A in the reverse time direction, as shown in FIG. It is something to do. This determines whether standard pattern B(n) is likely to exist in input pattern A, and
If it is likely to exist, it is easy to determine which frame of input pattern A should be the starting point.

前述の連続DP照合及び逆時間連続DP照合
において行われる連続DPの演算手法はマツ
チングパスの取り方等によつて種々の方法が
有るが、ここでは第８図に示す様な対称型を
用いるものとする。 There are various methods of continuous DP calculation performed in the above-mentioned continuous DP matching and inverse time continuous DP matching, depending on how to take the matching path, etc., but here we will use a symmetric type as shown in Figure 8. do.

第８図において、格子点（ｉ，ｊ）上の標
準パターンＢと入力パターンＡのベクトル間
距離をｄ（ｉ，ｊ）とすると、次の漸化式に
よつて照合距離gf（ｉ，ｊ）が求められる。 In FIG. 8, if the distance between the vectors of the standard pattern B and the input pattern A on the grid point (i, j) is d(i, j), then the matching distance gf(i, j ) is required.

ｊ＝１のとき、 gf（ｉ，ｊ）＝gf（ｉ，１）＝ｄ（ｉ，１）ｉ＝１，ｊ＞１のとき、 gf（１，ｊ）＝ｄ（１，ｊ）＋gf（１，ｊ−
１）ｊ＞１のとき、 gf（ｉ，ｊ）＝minｄ（ｉ，ｊ）＋gf（ｉ−１，ｊ） 2d（ｉ，ｊ）＋gf（ｉ−１，ｊ−１）ｄ（ｉ，ｊ）＋gf（ｉ，ｊ−１）このgf（ｉ，ｊ）は、（１，１）点から
（ｉ，ｊ）点に至るまでの距離和を最小とし
たものである。 When j = 1, gf (i, j) = gf (i, 1) = d (i, 1) When i = 1, j > 1, gf (1, j) = d (1, j) + gf (1,j-
1) When j > 1, gf (i, j) = mind (i, j) + gf (i-1, j) 2d (i, j) + gf (i-1, j-1) d (i, j )+gf(i,j-1) This gf(i,j) is the one that minimizes the sum of distances from point (1,1) to point (i,j).

逆時間連続DP照合は入力パターンを反転
させて逆時間方向に連続DP照合を行うもで、
その演算手法は前述の連続DP照合時の演算
手法と同様であるので、説明は省略する。 Reverse time continuous DP matching is a method in which the input pattern is reversed and continuous DP matching is performed in the reverse time direction.
The calculation method is the same as the calculation method used during continuous DP verification described above, so the explanation will be omitted.

この連続DP照合及び逆時間連続DP照合に
より、標準パターンＢの前半区間と後半区間
とを、第２図に示す様にそれぞれ照合する。 Through this continuous DP matching and reverse time continuous DP matching, the first half section and the second half section of standard pattern B are respectively matched as shown in FIG.

入力パターンのｋフレーム上の連続DP照
合距離をGf（ｋ）とし、逆時間連続DP照合
距離をGb(k)とする。更に、このGf(k)及びGb
(k)は、Ｎ個の標準パターンに対してはGf
（ｋ，ｎ）及びGb（ｋ，ｎ）の形で表現され
て、連続DP照合部１２１及び逆時間連続DP
照合部１２２よりそれぞれ出力される。 Let Gf(k) be the continuous DP matching distance on k frames of the input pattern, and let Gb(k) be the inverse time continuous DP matching distance. Furthermore, this Gf(k) and Gb
(k) is Gf for N standard patterns.
(k, n) and Gb (k, n), and the continuous DP matching unit 121 and the inverse time continuous DP
Each is output from the matching unit 122.

(B‐3) 中心候補フレーム限定部１３１の動作半単語連続DP照合部１４０より入力され
たGf（ｋ，ｎ）及びGb（ｋ，ｎ）は、中心候
補フレーム限定部１３０において次式で示す
様に累積されて、累積半単語連続DP照合距
離DC（ｋ，ｎ）を生成する。(B-3) Operation of the central candidate frame limiting unit 131 Gf (k, n) and Gb (k, n) input from the half-word continuous DP matching unit 140 are expressed by the following equation in the central candidate frame limiting unit 130. are accumulated to produce a cumulative half-word consecutive DP matching distance DC(k,n).

DC（ｋ，ｎ）＝Gf（ｋ，ｎ）＋Gb（ｋ，ｎ） DC（ｋ，ｎ）は、このままでは標準パター
ンＢ（ｎ）の個々の語長Ｊ（ｎ）によつて、短
いもの程値が小さくなる傾向が出る為、入力
パターンＡ及び標準パターンＢ（ｎ）の各パ
ターン長の和でDC（ｋ，ｎ）の値を割算して
時間正規化を行う。この時間正規化された
DC（ｋ，ｎ）をDR（ｋ，ｎ）で表現すると、
DR（ｋ，ｎ）は下式で表わされる。 DC (k, n) = Gf (k, n) + Gb (k, n) DC (k, n) is a short one depending on the individual word length J (n) of the standard pattern B (n) as it is. Since the value tends to become smaller, time normalization is performed by dividing the value of DC(k, n) by the sum of each pattern length of input pattern A and standard pattern B(n). This time normalized
When DC (k, n) is expressed as DR (k, n),
DR(k,n) is expressed by the following formula.

DR（ｋ，ｎ）＝DC（ｋ，ｎ）／2J（ｎ）ここで2J（ｎ）を用いる理由は、入力パタ
ーンＡが標準パターンＢ（ｎ）と正しくマツ
チングした場合には、対応する入力パターン
長はＪ（ｎ）に近い値をとるものと考えられ
るからである。もし、何らかの手法でDC
（ｋ，ｎ）のｌ，ｍに対する始端Ｌ（ｋ，ｎ）
及び終端Ｍ（ｋ，ｎ）が判明している場合に
は、DR（ｋ，ｎ）は次式で求められる。 DR (k, n) = DC (k, n) / 2J (n) The reason for using 2J (n) here is that if input pattern A is correctly matched with standard pattern B (n), the corresponding input This is because the pattern length is considered to take a value close to J(n). If you use some method to
Starting point L(k,n) for l,m of (k,n)
DR(k, n) can be obtained by the following equation when the terminal M(k, n) is known.

DR（ｋ，ｎ）＝DC（ｋ，ｎ）／｛Ｊ（ｎ）＋Ｍ（ｋ，ｎ）−Ｌ（ｋ，ｎ）＋１｝このDR（ｋ，ｎ）は、第９図に示す様に、
ｎ毎に異つた距離パターンを示す。 DR (k, n) = DC (k, n) / {J (n) + M (k, n) - L (k, n) + 1} This DR (k, n) is as shown in Figure 9. ,
A different distance pattern is shown for each n.

前記原発明は、半単語連続DP照合部１２
０までの処理は本発明と同じであるが、第９
図におけるＮ個の距離パターンをｋ毎にDR
（ｋ，ｎ）値の最小のものを選択し、そのｎ
をｋフレームを中心とする候補（中心候補）
に選定している。この為、前述の様に中心候
補が多くなり、また正当な中心候補となり得
ないものまで選択される結果演算量がそれだ
け多くなり、早いレスポンスタイムが得られ
ないという不都合があつた。 The original invention includes a half-word continuous DP matching unit 12.
The processing up to 0 is the same as in the present invention, but the ninth
DR the N distance patterns in the figure for each k
Select the smallest (k, n) value and
Candidate whose center is k frame (center candidate)
has been selected. For this reason, as described above, the number of center candidates increases, and even those that cannot be valid center candidates are selected, resulting in a correspondingly large amount of calculations, resulting in the inconvenience that a quick response time cannot be obtained.

これに対し、本発明では第９図に示す様に
DR（ｋ，ｎ）が極小値をとるとき、そのフ
レームはその標準パターンの中心フレームで
ある可能性がたかいとして、中心候補選択の
対象とする。極小値をとらなければそのフレ
ームはその標準パターンの中心フレームであ
る可能性は低いとして、中心候補選択の対象
に含めない様にする。これにより、以降の各
処理における演算量を大幅に低減させること
が出来る。 In contrast, in the present invention, as shown in FIG.
When DR (k, n) takes a minimum value, it is assumed that there is a high possibility that the frame is the center frame of the standard pattern, and the frame is selected as a center candidate. If the minimum value is not taken, the frame is considered unlikely to be the center frame of the standard pattern, and is not included in center candidate selection. Thereby, the amount of calculation in each subsequent process can be significantly reduced.

この様に極小値をとるものに限定されたフ
レームのDR（ｋ，ｎ）をDS（ｋ，ｎ）とし
て、中心候補選択部１３２に送る。 The DR(k,n) of frames limited to those having minimum values in this manner is sent to the center candidate selection unit 132 as DS(k,n).

(B‐4) 中心候補選択部１３２の動作中心候補選択部１３２は、中心候補フレー
ム限定部１３１より入力されたDS（ｋ，ｎ）
からｎについてその値が最小のものを求め、
そのｎをフレームｋを中心とする候補（中心
候補）として選択し、その標準パターン番号
をNC(k)に設定する。(B-4) Operation of center candidate selection unit 132 The center candidate selection unit 132 uses DS(k,n) input from the center candidate frame limiting unit 131.
Find the minimum value for n from
Select n as a candidate centered on frame k (center candidate), and set its standard pattern number to NC(k).

もし、候補が無い場合には、NC(k)に
“０”を設定して、中心候補と区別出来る様
にしておく。 If there is no candidate, NC(k) is set to "0" so that it can be distinguished from the central candidate.

以上の処理により、入力パターンＡのｋフ
レームが中心とする標準パターンＢ（ｎ）が
１個に限定されたことになる。 Through the above processing, the number of standard patterns B(n) centered on frame k of input pattern A is limited to one.

(B‐5) DP照合部１４２及び逆時間DP照合部１
４１の動作（単語始終端レベルDP） NC(k)で示される中心候補を、DP照合部１
４２によるDP照合及び逆時間DP照合部１４
１による逆時間DP照合により、始終端のDP
照合距離を求める。(B-5) DP verification unit 142 and reverse time DP verification unit 1
41 operation (word start/end level DP) The center candidate indicated by NC(k) is
DP matching by 42 and inverse time DP matching unit 14
By inverse time DP matching using 1, the starting and ending DP
Find the matching distance.

ここで用いるDPの演算手法もマツチング
パスの取り方等によつて種々の方法がある
が、ここでは、前述の連続DPと同様に対称
型を用いるものとする。即ち、ｉ＝１，ｊ＝１のとき： ge（１，１）＝ｄ（１，１）ｉ＝１，ｊ＞１のとき： ge(1,j)＝ｄ(1,j)＋ge(1,j-1) ｉ＞１，ｊ＝１のとき：ｇ(i,1)＝ｄ(i,1)＋ge(i-1,j) ｉ＞１，ｊ＞１のとき： ge（ｉ，ｊ）＝minｄ（ｉ，ｊ）＋ge（ｉ−１，ｊ） 2d（ｉ，ｊ）＋ge（ｉ−１，ｊ−１）ｄ（ｉ，ｊ）＋ge（ｉ，ｊ−１）このge（ｉ，ｊ）は、格子点（１，１）か
ら（ｉ，ｊ）点に至るまでの距離和を最小と
したものである。 There are various DP calculation methods used here depending on how to take matching paths, etc., but here, a symmetric type is used like the continuous DP described above. That is, when i=1, j=1: ge(1,1)=d(1,1) When i=1, j>1: ge(1,j)=d(1,j)+ge( 1,j-1) When i>1, j=1: g(i,1)=d(i,1)+ge(i-1,j) When i>1, j>1: ge(i , j) = mind (i, j) + ge (i-1, j) 2d (i, j) + ge (i-1, j-1) d (i, j) + ge (i, j-1) This ge (i, j) is the minimum sum of distances from the grid point (1, 1) to the (i, j) point.

前記の各式によつてDP照合距離ge（ｉ，
ｊ）が求まるが、標準パターン終端のものを
Ge(i)とする。 The DP matching distance ge(i,
j) is found, but the one at the end of the standard pattern is
Let it be Ge(i).

又、逆時間DP照合距離gs（ｉ，ｊ）は、単
にパターンを反転して照合するものであり、
その演算手法はDP照合距離ge（ｉ，ｊ）と同
様であるので、その説明は省略する。標準パ
ターン始端のものをGs(i)とする。 In addition, the inverse time DP matching distance gs(i, j) simply inverts the pattern and matches it,
The calculation method is the same as that for the DP matching distance ge(i,j), so the explanation thereof will be omitted. Let Gs(i) be the starting edge of the standard pattern.

これらDP照合及び逆時間DP照合により、
単語中心候補NC(k)の終端及び始端方向の照
合距離を求める。 Through these DP matching and reverse time DP matching,
The matching distance in the direction of the end and start end of the word center candidate NC(k) is determined.

第３図は、中心候補が位置するフレーム
（ｋで示されている）から始端方向に逆時間
DP照合を行い、終端方向にDP照合を行つ
て、そのフレームに対する始端フレームｌ及
び終端フレームを求める例と示したものであ
る。なお、Ｈ（NC(k)）は、中心候補の標準
パターンの語長である。 Figure 3 shows the reverse time from the frame where the center candidate is located (indicated by k) in the direction of the starting point.
This is an example in which DP verification is performed, DP verification is performed in the direction of the end, and the start frame l and end frame for that frame are determined. Note that H(NC(k)) is the word length of the standard pattern of the central candidate.

DP照合部１４２がDP照合により求めた中
心終端照合距離を、Ge（ｋ，ｍ）とし、又、
逆時間DP照合部１４１が逆時間DP照合によ
り求めた始端中心照合距離を、Gs（ｋ，ｌ）
とする。 Let the center-end matching distance obtained by the DP matching unit 142 by DP matching be Ge(k, m), and
The starting end center matching distance obtained by the inverse time DP matching unit 141 by the inverse time DP matching is expressed as Gs(k,l)
shall be.

(B‐6) 部分区間候補選択部１４３の動作部分区間候補選択部１４３は、入力された
Ge（ｋ，ｍ）及びGs（ｋ，ｌ）を累積したも
のを、部分パターン長（ｍ−ｌ＋１）と標準
パターン長Ｊ（NC(k)）の和で割算して時間
正規化する。この時間正規化された部分区間
のDP照合距離をDI（ｋ，ｌ，ｍ）とすると、
次式で求められる。即ち、 DI（ｋ，ｌ，ｍ）＝Ge（ｋ，ｍ）＋Gs（ｋ，ｌ）／Ｊ
（NC(K)））＋ｍ−ｌ＋１次に部分区間候補選択部１４３は、この
DI（ｋ，ｌ，ｍ）をｋについて最小のものを
選択してＤ（ｌ，ｍ）とし、対応する標準パ
ターンをNS（ｌ，ｍ）に設定する。(B-6) Operation of the partial section candidate selection section 143 The partial section candidate selection section 143 selects the input
The cumulative value of Ge (k, m) and Gs (k, l) is time-normalized by dividing it by the sum of the partial pattern length (ml+1) and the standard pattern length J (NC(k)). Letting the DP matching distance of this time-normalized subinterval be DI (k, l, m),
It is determined by the following formula. That is, DI (k, l, m) = Ge (k, m) + Gs (k, l)/J
(NC(K)))+m−l+1 Next, the partial section candidate selection unit 143 selects this
Select the minimum value of DI(k, l, m) for k and set it as D(l, m), and set the corresponding standard pattern to NS(l, m).

Ｄ（ｌ，ｍ）＝ min ｋ｛DI（ｋ，ｌ，ｍ）｝ NS（ｌ，ｍ）＝NC〔 argmin ｋ｛DI（ｋ，ｌ，ｍ）｝ここで、“ argmin ｋ ”は、｛｝内の最小化条件を満足するｋを選定す
る機能をもつたオペレータである。 D(l,m)=min k {DI(k,l,m)} NS(l,m)=NC[argmin k {DI(k,l,m)} Here, “argmin k” is { This operator has the function of selecting k that satisfies the minimization condition within }.

もし、終端候補が無いｍの場合は、NS
（ｌ，ｍ）に“０”を設定し、Ｄ（ｌ，ｍ）に
最大値を設定しておく。 If m has no terminal candidate, NS
Set "0" to (l, m) and set the maximum value to D(l, m).

このＤ（ｌ，ｍ）及びNS（ｌ，ｍ）によつ
て、入力パターンの部分パターン（ｌ−ｍ）
の最適な照合距離（Ｄ（ｌ，ｍ））及び照合相
手（NS（ｌ，ｍ））が決定されて、部分区間
候補選択部１４３より出力される。 By this D(l,m) and NS(l,m), the partial pattern (l-m) of the input pattern
The optimal matching distance (D(l, m)) and matching partner (NS(l, m)) are determined and output from the partial section candidate selection unit 143.

(B‐7) 文レベルDP照合部１５１の動作文レベルDP照合部１５１は、単語始終端
レベルDP処理部１４０より入力されたＤ
（ｌ，ｍ）及びNS（ｌ，ｍ）を用いて、最適
な単語の結合をDP照合により求める。(B-7) Operation of the sentence level DP matching unit 151 The sentence level DP matching unit 151 uses the D input from the word start/end level DP processing unit 140.
Using (l, m) and NS (l, m), find the optimal combination of words by DP matching.

ここで、連続単語中の各単語を先頭から１
桁目、２桁目…ｘ桁目という表現で位置関係
を表現することにする。 Here, each word in the continuous word is 1 from the beginning.
The positional relationship will be expressed using expressions such as digit, second digit, and so on.

(イ) １桁目の設定１桁目は、入力パターンの始端フレーム
からｍフレームの部分区間の距離をＴ（ｍ，
ｌ）に設定する。即ち、Ｔ（ｍ，１）＝Ｄ（１，ｍ）又、その標準語の始端がどこにあるか
を、LT（ｍ，１）に設定する。即ち、 LT（ｍ，ｌ）＝１ここで、部分区間候補が存在しないＤ
（１，ｍ）については処理は行わず、NT
（ｍ，１）に“０”を、Ｔ（ｍ，１）に最大
値を設定しておく。 (b) Setting the 1st digit The 1st digit is the distance of the partial section of m frames from the start frame of the input pattern, T(m,
l). That is, T (m, 1) = D (1, m) Also, set where the beginning of the standard word is in LT (m, 1). That is, LT (m, l) = 1 Here, D where there is no subinterval candidate
No processing is performed for (1, m), and NT
Set "0" to (m, 1) and the maximum value to T(m, 1).

(ロ) ｘ桁目の設定ｘ桁目では、ｌの位置が未定の為、次の
DP漸化式で決定する。即ち、Ｔ(m,x) ｌ＜ｍ＝ min ｌ〔Ｄ(l,m)＋Ｔ(l,x-1)〕 NT(m,x) ｌ＜ｍ＝NS〔 argmin ｌ｛Ｄ(l,m) ＋Ｔ(l,x-1)｝，ｍ〕 LT(m,x) ｌ＜ｍ＝ argmin ｌ〔Ｄ(l,m) ＋Ｔ(l,x-1)〕この場合もＤ（ｌ，ｍ）とｍについて部
分区間候補が存在しない場合は処理しない
で、NT（ｍ，ｘ）に“０”を、Ｔ（ｍ，
ｘ）に最大値を、またLT（ｍ，ｘ）には
（不定値で良いが）“１”を設定しておく。 (b) Setting the xth digit In the xth digit, the position of l is undetermined, so the next
Determined by the DP recurrence formula. That is, T(m,x) l<m= min l [D(l,m)+T(l,x-1)] NT(m,x) l<m=NS[ argmin l {D(l,m ) +T(l,x-1)},m] LT(m,x) l<m= argmin l [D(l,m) +T(l,x-1)] In this case too, D(l,m) If there is no subinterval candidate for and m, no processing is performed, and NT(m,
x) is set to the maximum value, and LT(m, x) is set to "1" (although an indefinite value is fine).

(B‐8) 認識結果処理部１５２の動作認識結果処理部１５２は、文レベルDP照
合部１５１により送られたＴ（ｍ，ｘ）、NT
（ｍ，ｘ）及びLT（ｍ，ｘ）中のＴ（ｍ，ｘ）
から、先ず入力単語の桁数Ｘを、次式から求
める。(B-8) Operation of recognition result processing unit 152 The recognition result processing unit 152 uses T(m,x) and NT sent by the sentence level DP matching unit 151.
T(m,x) in (m,x) and LT(m,x)
First, the number of digits X of the input word is found from the following equation.

Ｘ＝ argmin ｘ〔Ｔ（Ｉ，ｘ）／ｘ〕ここで、Ｉは入力パターンの終端フレーム
である。 X=argmin x [T(I,x)/x] where I is the terminal frame of the input pattern.

次に、認識結果処理部１５２は、Ｘ桁の認
識結果をLT（ｍ，ｘ）とNT（ｍ，ｘ）から
次の(イ)及び(ロ)によつて求める。ここで、ｘ桁
の認識結果をNR（ｘ）とし、又NR（ｘ）の
始端フレームをＬ（ｘ）とする。 Next, the recognition result processing unit 152 obtains the recognition result of X digits from LT (m, x) and NT (m, x) using the following (a) and (b). Here, let NR(x) be the recognition result of x digits, and let L(x) be the starting frame of NR(x).

(イ) Ｘ桁目の認識結果 NR（ｘ）＝NT（Ｉ，Ｘ）Ｌ（ｘ）＝LT（Ｉ，Ｘ） (ロ) Ｘ−ｙ桁目の認識結果 NR（Ｘ−ｙ）＝NT〔Ｌ（Ｘ−ｙ＋１），
Ｘ−ｙ〕Ｌ（Ｘ−１）＝LT〔Ｌ（Ｘ−ｙ＋１），Ｘ
−ｙ〕以上の様にして、認識結果NR（ｘ）が、
従来よりも大幅に少い演算量で且つて高い
認識率をもつて求められる。 (b) Recognition result of X-th digit NR (x) = NT (I, X) L (x) = LT (I, X) (b) Recognition result of X-y-th digit NR (X-y) = NT [L(X-y+1),
X-y] L(X-1)=LT[L(X-y+1),X
−y] In the above manner, the recognition result NR(x) is
This can be determined with a significantly smaller amount of calculation and a higher recognition rate than in the past.

〔発明の効果〕以上説明した様に、本発明によれば次の諸効果
が得られる。[Effects of the Invention] As explained above, according to the present invention, the following effects can be obtained.

(イ) 連続DP照合により中心候補を求めて部分区
間候補を選択する様にしたので、従来方式より
も大幅に演算量を低減させることが出来る。(b) Since the central candidate is determined by continuous DP matching and the partial interval candidates are selected, the amount of calculation can be significantly reduced compared to the conventional method.

(ロ) 各単語の安定部分においてその単語の標準パ
ターンの候補を抽出する様にしたので、候補が
正しく求められ高認識率を確保することが出来
る。(b) Since candidates for the standard pattern of each word are extracted from the stable part of that word, candidates can be found correctly and a high recognition rate can be ensured.

(ハ) 中心候補の存在するフレームを限定する様に
したので、以降の処理における演算量を低減さ
せて、早いレスポンスタイムを実現することが
出来る。(c) Since the frames in which the center candidate exists are limited, the amount of calculation in subsequent processing can be reduced and a fast response time can be achieved.

[Brief explanation of the drawing]

第１図…本発明の構成の説明図、第２図…前半
単語標準パターンの連続DP照合及び後半単語の
逆時間連続DP照合及びにそれらの照合ルート例
の説明図、第３図…単語中心候補の終端及び始端
方向のDP照合及び逆時間照合の説明図、第４図
…本発明の一実施例の構成の説明図、第５図…始
終端固定のDP照合の説明図、第６図…連続DP照
合の説明図、第７図…逆時間連続DP照合の説明
図、第８図…連続DPの演算手法の説明図、第９
図…中心候補フレーム限定動作及び中心候補選択
動作の説明図。第１図及び第４図において、１１０…標準パタ
ーン辞書部、１２０…半単語連続DP照合部、１
３０…中心候補限定選択部、１４０…単語始終端
レベルDP処理部、１５０…文レベルDP処理部。 Fig. 1: An explanatory diagram of the configuration of the present invention; Fig. 2: An explanatory diagram of continuous DP matching of the first half word standard pattern and reverse time continuous DP matching of the second half word; and an explanatory diagram of an example of their matching route; Fig. 3: Word-centered An explanatory diagram of DP matching and reverse time verification in the terminal and starting end directions of candidates, FIG. 4...An explanatory diagram of the configuration of an embodiment of the present invention, FIG. 5...An explanatory diagram of DP matching with a fixed starting and ending end, FIG. 6 ...Explanatory diagram of continuous DP matching, Fig. 7...Explanatory diagram of reverse time continuous DP matching, Fig. 8...Explanatory diagram of continuous DP calculation method, Fig. 9
Figure: An explanatory diagram of a center candidate frame limiting operation and a center candidate selection operation. 1 and 4, 110... standard pattern dictionary section, 120... half word continuous DP matching section, 1
30... Central candidate limited selection section, 140... Word start/end level DP processing section, 150... Sentence level DP processing section.

Claims

[Claims] 1. A speech feature extraction unit converts speech in which a plurality of words are continuously uttered into an input pattern, compares the input pattern with a plurality of standard patterns, selects candidates for each section, and In a continuous speech recognition device that obtains recognition by finding the one with the minimum matching distance between intervals, (a) the input pattern is matched by continuous dynamic programming for each first-half word standard pattern among multiple standard patterns; , a half-word continuous DP matching unit 120 that performs matching of the input pattern using inverse time continuous dynamic programming for each second-half word standard pattern among a plurality of standard patterns; (b) a result of the half-word continuous DP matching unit 120; A central candidate limited selection unit 130 that determines a standard pattern in which each frame of the input pattern is defined as a boundary between the standard patterns of the first half word and the second half word from those limited by the minimum value, and selects the central candidate from the standard pattern. (d) Perform matching using inverse time dynamic programming from the frame in which the center candidate is located in the direction toward the starting end, and perform matching using dynamic programming in the direction toward the end to find the subinterval matching distance at the word start/end level.
DP processing unit 140; (d) Find a combination that minimizes the sum of subinterval matching distances for each digit of the input pattern based on the subinterval matching distance, and determine how many digits the input pattern is from the result. A continuous speech recognition device comprising: a sentence level DP processing unit 150 that determines a recognition result.