JPS62105199A

JPS62105199A - Voice recognition equipment

Info

Publication number: JPS62105199A
Application number: JP60244863A
Authority: JP
Inventors: 納田　重利
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1985-10-31
Filing date: 1985-10-31
Publication date: 1987-05-15

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】Ｃ産業上の利用分野〕この発明は、例えば特定話者の音声を単語単位で認識す
るのに１ｍ用される音声認１識″Ａ置に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] This invention relates to a speech recognition device used for recognizing the speech of a specific speaker word by word, for example.

Ｃ発明の（既要〕この発明は、例えば７４９敗的な１ｍｌ彼数スペクトル
により表現される多次元ベクトルの軌跡の長さに略々比
例する適応な分割数によって時間軸の正規化処理を行っ
て特Ｉｉ＆データを抽出する手段をイ「する音声認識装
置のパターンマツチング判定器において、スイングサー
チを行って最小となるフレーム間距離を選択し、選択さ
れたフレーム間距離の（’＊　ｉ１３に応してサーチ幅
の中心値をシフトしながら入力パターンと標準パターン
との距離計算処理を行うことにより、入力パターン及び
標串パターンを構成するフレーム数がやや大きく異なる
場合や音声の時間軸変動が大きく時間軸正規化後のパタ
ーンに時間軸変動の影響が残されている場合においても
、正確にマツチングがなされるようにするものである。Invention C (Already Needed) This invention performs time axis normalization processing using an adaptive number of divisions that is approximately proportional to the length of the trajectory of a multidimensional vector expressed by, for example, a 749-dimensional 1ml Helix number spectrum. In the pattern matching judger of the speech recognition device that selects the means for extracting data, performs a swing search, selects the minimum interframe distance, and calculates the selected interframe distance ('*i13). By calculating the distance between the input pattern and the standard pattern while shifting the center value of the search width accordingly, it is possible to avoid cases where the number of frames that make up the input pattern and the standard pattern are slightly different, or where there is a change in the time axis of the audio. This is to ensure accurate matching even when the pattern after time axis normalization is largely influenced by time axis fluctuations.

[Conventional technology]

従来の音声認識ＷｉＦは、音声入力部としてのマイクロ
ホン、前処理回路、音響分析器、特徴データ抽出器、登
録パターンメモリ及びパターンマツチング判定器等によ
り構成されている。A conventional voice recognition WiF is configured with a microphone as a voice input unit, a preprocessing circuit, an acoustic analyzer, a feature data extractor, a registered pattern memory, a pattern matching determiner, and the like.

マイクロホンから入力される音声信号が前処理回路に８
いて、音声認識に必要とされる帯域に制限され、ノ＼／
Ｄ変換器によりディジタル音声信号とされる。このディ
ジクル音声信号が音響分析器に供給される。The audio signal input from the microphone is sent to the preprocessing circuit.
However, the bandwidth is limited to that required for speech recognition.
A D converter converts it into a digital audio signal. This digital audio signal is fed to an acoustic analyzer.

音響分析器において、音声信号が周波数スペクトルに変
換され、例えば対数軸上で一定間隔となるように周波数
スペクトルのレヘルが正規化され、離散的な周波数スペ
クトルデータが発生される。In the acoustic analyzer, the audio signal is converted into a frequency spectrum, and the levels of the frequency spectrum are normalized, for example, at regular intervals on a logarithmic axis, to generate discrete frequency spectrum data.

この周波数スペクトルデータ列が単位時間（フレーム周
汀月）毎に１つのフレームデータとして出力される。即
ち、フレーム１’ｑＩ　Ｋ月毎の１フレームのデータが
Ｘチャンネルの周波数スペクトルデータとされ、Ｎ次元
ベクトルにより表現されるパラメータとして切り出され
、特徴データ抽出Ｈａこ供給される。This frequency spectrum data string is output as one frame data every unit time (frame cycle). That is, data of one frame every K months (frame 1'qI) is taken as frequency spectrum data of the X channel, cut out as a parameter expressed by an N-dimensional vector, and supplied to the feature data extraction section.

特徴データ抽出器において、隣り合うフレームデータの
距離が計算される。対応するチャンヱルのスペクトルデ
ータの差の絶対値が夫々求めツれ、その総和がフレーム
間距離とされる。In the feature data extractor, the distance between adjacent frame data is calculated. The absolute value of the difference between the spectral data of the corresponding channels is determined, and the sum thereof is taken as the interframe distance.

更に、夫々のフレーム間距離の総和が求めらｎ、音声信
叶の始端フレームから終端フレームまでのＮ次元ベクト
ルの軌跡長が求められる。そして最も語数が多く長い音
声の場合に特徴を抽出するのに必要な所定の分割数でも
って軌跡長が等分割され、その分割点に対応したフレー
ムデータのみが特徴データとして抽出され、話者の音声
の発生速度変動に影響されることがないように時間軸が
正規化されて出力される。Furthermore, the sum of the distances between each frame is determined, and the trajectory length of the N-dimensional vector from the start frame to the end frame of the audio signal is determined. Then, in the case of the longest speech with the largest number of words, the trajectory length is divided equally by a predetermined number of divisions necessary to extract features, and only the frame data corresponding to the division points are extracted as feature data, and the speaker's The time axis is normalized and output so as not to be affected by variations in the rate of sound generation.

この特徴データが登録時においては、登録パターンメモ
リに登録特徴データブロツク（標準パターン）として記
憶される。認識時においては、入力音声信号が前述した
処理を経て、入力待（牧データブロック（入力パターン
）とされ、パターンマツチング判定器に供給され、入力
特徴データブロックと登録特徴データブロックとの間の
パターンマツチングが行われる。When this feature data is registered, it is stored in the registered pattern memory as a registered feature data block (standard pattern). During recognition, the input audio signal undergoes the above-mentioned processing, becomes an input data block (input pattern), and is supplied to the pattern matching determiner, which compares the input feature data block with the registered feature data block. Pattern matching is performed.

パターンマツチング判定器において、入力特徴データブ
ロックと登録特徴データブロフクの間のマツチング距離
が算出される。例えば、特徴データ抽出器において１個
のフレームデータが抽出され、０フレーム〜Ｎ−１）フ
レームにより特徴データプロ、りが構成される。登録時
１改データプロ／りを構成するフレームデータと入力特
徴データブロックを構成するフレームデータとの間で、
スイングサーチが行われながら対応するフレーム間の距
離が計算される。In the pattern matching determiner, a matching distance between the input feature data block and the registered feature data block is calculated. For example, one frame data is extracted by the feature data extractor, and the feature data pro- gram is composed of frames 0 to N-1). At the time of registration, between the frame data that makes up the 1 revised data program and the frame data that makes up the input feature data block,
While the swing search is being performed, the distance between corresponding frames is calculated.

例えば、ｉ番目のフレーム問罪＾１ｔｓＤｉをｎをチャ
ンネル番号とし、入力待１″５！ｉ、データブロックの
スペクトルデータを８１．とじ、登録特徴データプロ、
りのスペクトルデータＲ１，，とし、サーチ幅を１とし
てスイングサーチを用いて絶対値距離により算出するとで求められろ。即ち、入力待ｉクデータブロノクの１番
目のフレームと登録特徴データブロックの（ｉ−１）番
目のフレーム、入力待ｉ放データブロックのｉ番目のフ
レームとｅ　ｌＴｈ特ｉ孜データフロックの１番目のフ
レーム、入力待（１シデータブロノクのｉ番目のフレー
ムとｅ　ｆｆ＆特徴データブロックの（ｉ＋１）番目フ
レームの夫々の間の対応するチャン♀ルの差の絶対値の
総和が求められ、得られた３つの値のうちで最小となる
値がフレーム間距離ＳＤ、　　とされる。For example, for the i-th frame interrogation^1tsDi, n is the channel number, input waiting 1"5!i, data block spectrum data is 81., registered feature data pro,
The spectrum data R1, . That is, the first frame of the input waiting data block, the (i-1)th frame of the registered feature data block, the i-th frame of the input waiting data block, and the first frame of the special data block. , Input waiting (1) The sum of the absolute values of the differences of the corresponding channels between the i-th frame of the data block and the (i+1)-th frame of the e ff & feature data block is calculated, and the obtained three values are calculated. The minimum value among them is taken as the interframe distance SD.

そして、入力特徴データブロックの全てのフレ−ムに関
して、スイングサーチされなから登１．を特ｆｔデータ
ブロックのフレームとの間のフレーム間距離ＳＤ、が求
められ、このフレーム間距離ＳＤ、の（ｉ＝Ｉ−１）ま
での総和、即ち、マツチング距離が求められる。池の登
録特徴データブロックに関しても、同様にマツチング距
離が求められ、マツチング距離が最小で十分に距離が近
いものと判断される登録特徴データブロックに対応する
華語が認識結果として出力される。Then, since swing search is not performed for all frames of the input feature data block, registration 1. The interframe distance SD between the frame of the specific ft data block is determined, and the sum of this interframe distance SD up to (i=I-1), that is, the matching distance is determined. The matching distance is similarly calculated for the registered feature data block of the pond, and the Chinese language corresponding to the registered feature data block for which the matching distance is the minimum and is determined to be sufficiently close is output as a recognition result.

また、パターンマツチング判定器において、スイングサ
ーチを行いながら、サーチ幅に略々比例し、かつ入力特
徴データブロックを構成するフレーム数に略々反比例す
る重み係数を求めて、スイングサーチにより得られるフ
レーム間距離にこの重み係数を乗して距離計算処理を行
う音声認識装置として特願昭６０−１５０６９７号が本
願出願人により提案されている。In addition, while performing a swing search, the pattern matching determiner determines a weighting coefficient that is approximately proportional to the search width and approximately inversely proportional to the number of frames constituting the input feature data block. Japanese Patent Application No. 150697/1983 has been proposed by the applicant of the present invention as a voice recognition device that performs distance calculation processing by multiplying the distance between the two by this weighting factor.

[Problem that the invention seeks to solve]

従来のパターンマツチング判定器におけるスイングサー
チを用い７こフレーム間距離は、前述しｆこいずれの音
声認識装置に８いてもサーチ幅（＼ｒ−ｍ）に応して入
力待ｔ”ｓｌデータブロックの１番目のフレームに登録
特徴データブロックの（ｉ　−ｍ）フレーム〜（ｉ＋ｍ
）フレームまでのフレームを対応させて求めるものであ
る。Using a swing search in a conventional pattern matching judger, the inter-frame distance is determined by the input waiting t''sl data according to the search width (\r-m), as described above. (i - m) frame ~ (i + m) of the feature data block registered in the first frame of the block
) is determined by matching the frames up to the frame.

しかし、例えば特ｔＴｈデータ抽出器において離数的な
周波数スペクトルにより表現される多次元ヘクトルの軌
跡の長さに応して分割数が可変とされ、音声の語長に応
して特徴データが抽出さ昶、て時間軸が正規化される方
式においては、登録″＋！Ｆ敬データブロックのフレー
ム数が様々なものとされて登録される。However, for example, in a special tTh data extractor, the number of divisions is variable depending on the length of the trajectory of a multidimensional hector expressed by a discrete frequency spectrum, and feature data is extracted depending on the word length of the speech. In the method in which the time axis is normalized, the number of frames of the registered data block is registered as being various.

このため、入力待ｉ攻データブロックのフレーム数とや
や大きく異なり、サーチ幅以上にフレーム数が違う登録
特徴データブロックと入力待ｉｆｉデークプロ、りとの
間においで従来のスイングサーチを用いてパターンマツ
チングが行われると、正確なマツチングが行えすＬ’７
１４１ｊ率が低下する問題点があった。For this reason, it is possible to search for patterns using the conventional swing search between the registered feature data block, which has a slightly different number of frames than the input waiting data block, and the number of frames differs by more than the search width, and the input waiting ifi data block. When matching is performed, accurate matching can be performed.L'7
There was a problem that the 141j rate decreased.

また、音声の時間軸変動が大きく特徴データ抽出器にお
いて時間軸正規化された後においても特徴データブロッ
クに時間軸変動の影響が残されている場合には、同様に
正確にマツチングが行えず認、識率が低下する問題点が
あった。In addition, if the time axis fluctuations of the audio are large and the effects of the time axis fluctuations remain in the feature data block even after time axis normalization in the feature data extractor, matching cannot be performed accurately and recognition cannot be performed. , there was a problem that the comprehension rate decreased.

従って、この発明の目的は、大力特徴データブロックと
登録特徴データブロックのフレーム数がやや大きく異な
る場合や、特徴データブロック中に時間＠変動の影響が
残されている場合においても正確にマツチングすること
ができる手段を有した音声認、爪装置を堤供することに
ある。Therefore, an object of the present invention is to perform accurate matching even when the number of frames of a large-scale feature data block and a registered feature data block are slightly different, or when the influence of time @ fluctuation remains in the feature data block. The purpose of the present invention is to provide a voice recognition/claw device that has a means to enable this.

[Means for solving problems]

この発明は、入力音声信号をスペクトル変）Ｑ等音声認
識に必要とされる前処理を行う音響分析手段５と、音響分析手段５の出力データが供給され、適応な分割数
によって時間軸の正規化処理を行い、特徴データを抽出
する特徴データ抽出手段６と、特ＩＴｈデータが１票小
パターンとして六己憶されるメモリ８と、特徴データ抽出手段６からの入力パターンとメモリ８か
ら３売み出された標Ｙ廿パターンとが供給され、入力パ
ターン及び種化パターンの一方を構成するフレームとサ
ーチ幅に応した＋１１１　Ｆｔの入力パターン及び標情
パターンの他方を構成する複数個のフレームとを対応さ
せて夫々のフレーム間距離を求め、最小となるフレーム
間距離のｉｎ報に応してサーチ幅の中心１直をシフトし
なが５入力パターンと標嘔パターンとの距離計算処理を
行い、距離計算処理の結果に基づいてマツチング判定す
るパターンマツチング判定手段９と、からなることを特ｉ牧とする音声認識装置である。In this invention, an acoustic analysis means 5 performs preprocessing necessary for speech recognition such as spectral modification (Q) of an input speech signal, and the output data of the acoustic analysis means 5 is supplied, and the time axis is normalized by an appropriate number of divisions. a feature data extraction means 6 that performs processing to extract feature data; a memory 8 in which the special ITh data is stored as a single small pattern; The found target Y 廿 pattern is supplied, and a frame constituting one of the input pattern and the speciation pattern, and a plurality of frames constituting the other of the +111 Ft input pattern and the facial expression pattern corresponding to the search width. The distance between each frame is determined by corresponding the distance between the frames, and the center of the search width is shifted in accordance with the input information of the minimum interframe distance, and the distance between the 5 input patterns and the pattern is calculated. , a pattern matching determination means 9 that performs a matching determination based on the result of distance calculation processing;

[Effect]

パターンマツチング判定器９において、入力特徴データ
ブロックのフレームに対してサーチ幅に応じて登録特徴
データブロックのフレームが句数個対応させら丸、夫々
のフレーム間に関してフレーム間距離が求められ、最小
となるフレーム間距剤が選択されると共に、最小となる
フレーム間距離が登録特徴データブロックのどのフレー
ムにより算出さ着、たちのであるかが判断され、登録時
（攻データブロックの半方向のフレームから最小となる
フレーム間距離が算出される傾向にある場合には、中方
向にサーチ幅の中心値がシフトされ、また、一方向のフ
レームから最小となるフレーム間距離が算出される傾向
にある場合には、一方向にサーチ幅の中心値がシフトさ
れて、順次、フレーム間距離が算出されて、最小となる
フレーム間距離が累算されることによりマンアンプ距離
が算出される。In the pattern matching determiner 9, the frames of the registered feature data block are made to correspond to the frames of the input feature data block according to the search width, and the interframe distance between each frame is determined, and the At the same time, it is determined which frame of the registered feature data block is used to calculate the minimum inter-frame distance. If the minimum inter-frame distance tends to be calculated, the center value of the search width is shifted in the middle direction, and if the minimum inter-frame distance tends to be calculated from frames in one direction. In this case, the center value of the search width is shifted in one direction, inter-frame distances are calculated sequentially, and the minimum inter-frame distance is accumulated to calculate the man-amp distance.

〔Example〕

以下、この発明の一実施例を図面を参照して説明する。 An embodiment of the present invention will be described below with reference to the drawings.

第１図は、この発明の一実施例を示すもので、第１図に
おいて、１が音声入力部としてのマイクロホンを示して
いる。FIG. 1 shows an embodiment of the present invention, and in FIG. 1, 1 indicates a microphone as an audio input section.

マイクロホン１からのアナログ音声信号がフィルタ２に
供給される。フィルタ２は、例えば力。An analog audio signal from microphone 1 is supplied to filter 2 . Filter 2 is, for example, a force filter.

トオフ周波数７．．５　ＫＮ、、のローパスフィルタで
あり、音声信−ぢがフィルタ２において、音声認識に必
要とされる７、５にＨ２以下の帯域に制限され、この音
声信号がアンプ３を介してＡ／Ｄ変換器４に供給される
。To-off frequency 7. ．． This is a low-pass filter of 5KN, , and the voice signal is limited to the band below 7,5H2 required for voice recognition in the filter 2, and this voice signal is passed through the amplifier 3 to the A/D. It is supplied to converter 4.

Ａ／Ｄ変換器４は、例えば、サンプリング周波数１２．
５Ｋ）Ｉ２の８ビツトＡ／Ｄ変換器であり、音声信号が
Ａ／Ｄ変換器４において、アナログ−ディジタル変換さ
れて、８ビツトのディジタル信号とされ、音響分析器５
に供給される。For example, the A/D converter 4 has a sampling frequency of 12.
5K) I2 is an 8-bit A/D converter, and the audio signal is analog-to-digital converted into an 8-bit digital signal in the A/D converter 4, which is then converted into an 8-bit digital signal.
supplied to

音響分析器５は、音声信号を１？ｉ１波数スペクトルに
変換じて、例え；よ、Ｎチャンスルのスペクトルデータ
列を発生するものである。音響分析器５において、音声
信号が演算処理により周波数スペクトルに変換され、例
えば対数軸上で一定間隔となるＮ個の周波数を代表値と
するスペクトルデータ列が得られる。従って、音声信号
がＸチャンネルの離散的な周波数スペクトルの大きさに
よって表現される。そして、単位時間（フレーム周！ｔ
ＪＩ）毎にＮチャンネルのスペクトルデータ列が一つの
フレームデータとして出力される。即ち、フレーム周朋
毎に音声信号がＮ次元ベクトルにより表現されるパラメ
ータとして切り出され、特徴データ抽出器６に供給され
る。The acoustic analyzer 5 detects the audio signal as 1? By converting it into an i1 wave number spectrum, a spectral data string of, for example, N chances is generated. In the acoustic analyzer 5, the audio signal is converted into a frequency spectrum through arithmetic processing, and a spectral data string whose representative values are, for example, N frequencies at regular intervals on the logarithmic axis is obtained. Therefore, the audio signal is expressed by the magnitude of the discrete frequency spectrum of the X channel. And unit time (frame period!t
JI), N-channel spectral data strings are output as one frame data. That is, the audio signal is extracted for each frame period as a parameter expressed by an N-dimensional vector, and is supplied to the feature data extractor 6.

特徴データ抽出器６において、隣り合うフレームデータ
の距離が計算される。対応するチャンぶルのスペクトル
データの差の絶対値が夫々求められ、その総和がフレー
ム間距離とされる。The feature data extractor 6 calculates the distance between adjacent frame data. The absolute value of the difference between the spectral data of the corresponding chamble is determined, and the sum thereof is taken as the interframe distance.

更に、夫々のフレーム間距離の総和が求められ、音声信
号の始端フレームから終端フレームまでのＮ次元ベクト
ルの軌跡長が求められる。そしてこの軌跡長に略々比例
した分割数でもって軌跡長が分再すされ、その分割点に
対応したフレームデータのみが特徴データとして抽出さ
れる。即ち、話者の音声の発生速度変動に影響されるこ
とがないように時間軸が正規化され、強い独自の特徴を
示すフレームデータが音声の語長に応した数だけ抽出さ
れて出力される。Furthermore, the sum of the distances between each frame is determined, and the trajectory length of the N-dimensional vector from the start frame to the end frame of the audio signal is determined. Then, the trajectory length is divided by a number of divisions approximately proportional to this trajectory length, and only the frame data corresponding to the division point is extracted as feature data. In other words, the time axis is normalized so as not to be affected by fluctuations in the rate at which the speaker's voice is generated, and a number of frame data that exhibit strong unique characteristics are extracted and output in accordance with the word length of the voice. .

この特徴データが登録時においては、モード切替回路７
を介して登録パターンメモリ８に供給され、登録特徴デ
ータプロ、りとして記憶される。When this characteristic data is registered, the mode switching circuit 7
The data is supplied to the registered pattern memory 8 via the registered pattern memory 8 and stored as registered feature data.

認識時においては、入力音声信号がｉ′ｉ′ｉ述じた処
理を経て、特徴データとされ、パターンマツチング判定
器９に供給され、入力待ｉ枚データブロックとされる。At the time of recognition, the input audio signal undergoes the processing described above and is made into feature data, which is supplied to the pattern matching determiner 9 and is made into an i-sheet data block waiting for input.

パターンマツチング判定器９において、入力待ＦＪデー
タブロックのフレーム数と大きく異ならないフレーム数
の６２＋＝　＃＊　ｆｆｉデータブロックのみが比較の
対象とされ、入力待ｉ政データブロックと登録特徴デー
タブロックの間において、パターンマツチングが行われ
る。In the pattern matching judger 9, only the 62+= #* ffi data block whose number of frames is not significantly different from the number of frames of the input waiting FJ data block is compared, and the input waiting FJ data block and the registered feature data block are compared. In between, pattern matching is performed.

第２図は、パターンマツチング判定器９の一例を示し、
第２図に示すようにサーチ幅内フレーム距離計算回路１
０．最小値抽出回路１１．サーチ幅区間可変計算回路１
２．マ、チング距湘計算回路１３．加算回路１４．終端
側フレーム距離計算回路１５．終端側マツチング距鮨計
算回路１６及び最小距離判定回路１７によりパターンマ
ツチング判定器８が構成される。FIG. 2 shows an example of the pattern matching determiner 9,
As shown in Fig. 2, the search width frame distance calculation circuit 1
0. Minimum value extraction circuit 11. Search width interval variable calculation circuit 1
2. Maching distance calculation circuit 13. Addition circuit 14. End-side frame distance calculation circuit 15. A pattern matching determiner 8 is configured by the end-side matching distance calculating circuit 16 and the minimum distance determining circuit 17.

サーチ幅内フレーム距＾■計算回路１０及び終端側フレ
ーム距離計算回路１５の夫々に特徴データ抽出器６から
の特徴データが供給され、例えば、音声区間の終端に対
応するフレームを（Ｑ−１）とした場合、各々がチャン
ネルＯ〜チャンネルＮ−１のデータにより構成されるフ
レームデータが０フレーム〜Ｑ−１フレームまで供給さ
れ入力特徴データブロックとされる。また、それと共に
、登録パターンメモリ８から比較の対象とされる入力特
徴データブロックのフレーム数と大きく異ならない、例
えばＯフレーム幅即−１フレームにより構成される登録
特徴データブロックがサーチ幅内フレーム距離計算回路
１０及び終端側フレーム距離計算回路１５に供給される
。The feature data from the feature data extractor 6 is supplied to each of the frame distance within search width calculation circuit 10 and the end-side frame distance calculation circuit 15, and, for example, the frame corresponding to the end of the voice section is extracted (Q-1). In this case, frame data each composed of data from channel O to channel N-1 is supplied from frame 0 to frame Q-1 and used as an input feature data block. At the same time, a registered feature data block composed of, for example, O frame width - 1 frame, which is not significantly different from the frame number of the input feature data block to be compared from the registered pattern memory 8, has a frame distance within the search width. The signal is supplied to the calculation circuit 10 and the end-side frame distance calculation circuit 15.

サーチ幅内フレーム距離計算回路１０において、入力特
徴データブロック側から登録特徴データブロック側にサ
ーチを行う場合には、入力特徴データブロック側のｑフ
レームに登録特数データブロック側のｊフレーム及びｊ
フレームに近接した複数のフレームが対応され、その夫
々の間のフレーム間距離の計算が行われる。In the search width frame distance calculation circuit 10, when searching from the input feature data block side to the registered feature data block side, q frames on the input feature data block side are j frames and j on the registered feature data block side.
A plurality of frames in the vicinity of the frame are matched, and an interframe distance between each of them is calculated.

例えば、対応させるフレーム幅即ち、サーチ幅が１とさ
れた場合には、第３図Ｓこ示すように入力特徴データブ
ロックのｑフレームに、登録特徴データブロックの（ｊ
−１）フレーム、Ｊフレーム及び（ｊ＋１＞フレームが
対応され、夫々の間のフレーム間距離が算出される。ま
た、サーチ幅が２とされた場合には、入力特徴データブ
ロックのｑフレームに登録特徴データブロックの（ｊ　
−２）フレーム〜（ｊ＋２）フレームが対応され、その
夫々の間のフレーム間距離が算出される。更にサーチ幅
がｍとされた場合には、入力特徴データブロックのｑフ
レームに登録特徴データブロックの（ｊ−ｍ）フレーム
〜（ｊ十ｍ）フレーム力く対応され、（２ｍ＝１）個の
フレーム間距離が夫々算出される。For example, when the corresponding frame width, that is, the search width is set to 1, as shown in FIG.
-1) Frame, J frame, and (j+1> frame are matched, and the interframe distance between them is calculated. Also, when the search width is set to 2, it is registered in the q frame of the input feature data block. Feature data block (j
-2) frames to (j+2) frames are matched, and the interframe distance between each of them is calculated. Furthermore, when the search width is set to m, q frames of the input feature data block are closely matched with (j-m) frames to (j 10 m) frames of the registered feature data block, and (2m=1) frames are Inter-frame distances are calculated respectively.

計算処理により得られた（２ｍ＋１）個のフレーム間距
離データが最小値抽出回路１１に供給される。(2m+1) pieces of interframe distance data obtained through the calculation process are supplied to the minimum value extraction circuit 11.

最小値抽出回路１１において、（２ｍ＝１）個のフレー
ム間距離データの夫々が比較され、最小値となるフレー
ム間距離データが入力特徴データブロックのｑフレーム
に対するフレーム間距離とされ、選択されたフレーム間
距離データがマツチング距離計算回路１３に供給される
。また、それと共に、選択されたフレーム間距離データ
が登録特徴データブロック中のどのフレームに関して求
められたものであるかを示すフレーム番号データがサー
チ幅区間可変計算回路１２に供給される。In the minimum value extraction circuit 11, each of the (2m=1) pieces of interframe distance data is compared, and the interframe distance data having the minimum value is selected as the interframe distance for q frames of the input feature data block. The interframe distance data is supplied to a matching distance calculation circuit 13. At the same time, frame number data indicating which frame in the registered feature data block the selected interframe distance data was obtained is supplied to the search width interval variable calculation circuit 12.

サーチ幅区間可変計算回路１２において、フレーム番号
データに基づいてサーチ幅区間のシフト方向が判断され
、シフト操作データが発生される。In the search width section variable calculation circuit 12, the shift direction of the search width section is determined based on the frame number data, and shift operation data is generated.

例えば、入力特徴データブロックのｑフレームに対応す
るサーチ幅区間の中心となる登録時１攻データブロック
のｊフレームを基準として、ｊフレーム−（ｊ＋ｍ）フ
レームの間のフレームが選択すれる傾向にあると判断さ
れる場合には、十方向皿ちｊフレーム−（ｊ”ｍ）フレ
ーム（則にサーチ幅区間をシフトするシフト操作データ
が発生される。For example, based on frame j of the first attack data block at registration, which is the center of the search width section corresponding to frame q of the input feature data block, frames between frame j - (j+m) tend to be selected. If it is determined that this is the case, shift operation data for shifting the search width section in the ten-direction direction by j frames - (j''m) frames is generated.

また、（ｊ−ｍ）フレーム番号デ−タの間のフレームが
選択される傾向にあると判断される場合には、一方向、
即ち（ｊ−ｍ）フレーム〜ｊフレーム側にサーチ幅区間
をシフトするシフト操作データが発生される。In addition, if it is determined that frames between (j-m) frame number data tend to be selected, one-way,
That is, shift operation data for shifting the search width section from frame (j-m) to frame j is generated.

このシフト操作データがサーチ幅内フレーム距離計算回
路ｉｏに供給され、順次、人力特徴データブロックの各
フレームに関してのフレーム間距離の計算が、シフト操
作データにより設定される登録時ｉ攻データブロックｌ
ｊｌ、＋１のサーチ幅区間内のフレームとの間において
なされる。つまり、最小となるであろうフレーム間距離
が常に登録特徴データブロックのサーチ幅区間内のフレ
ームから算出されるように、サーチ幅区間がシフトされ
、（２ｍ＋１）個のフレーム間距離が算出され、最小値
抽出回路１１において最小値として選択されたフレーム
間距離データがマツチング距離計算回路１３に供給され
る。This shift operation data is supplied to the search width frame distance calculation circuit io, and the inter-frame distance calculation for each frame of the human-powered feature data block is sequentially performed on the registration time i attack data block l set by the shift operation data.
This is done between frames within the search width section of jl, +1. In other words, the search width section is shifted so that the minimum interframe distance is always calculated from the frames within the search width section of the registered feature data block, and (2m+1) interframe distances are calculated. The interframe distance data selected as the minimum value in the minimum value extraction circuit 11 is supplied to the matching distance calculation circuit 13.

マツチング距離計算回路１３において、順次供給される
最小値として選択されたフレーム間距離データの累算処
理がなされ、入力特徴データブロックの最大フレーム（
Ｑ−１）までフレーム間距離データが累算されると、こ
の累算イ直がマツチング距離データとされ、マツチング
距＾■データが加算回路１４に供給される。In the matching distance calculation circuit 13, the interframe distance data sequentially supplied and selected as the minimum value is accumulated, and the maximum frame (
When the inter-frame distance data is accumulated up to Q-1), this accumulated value is used as matching distance data, and the matching distance data is supplied to the adder circuit 14.

また、終端側フレーム距離計算回路１５において、入力
特徴データブロックの所定の終端側フレームと登録特徴
データブロックの所定の終端側フレームとに関して、フ
レーム間距離の計算がなされる。例えば、直接マツチン
グによりフレーム間距離が算出される場合には、Ｃを定
数フレーム数とすると、入力特徴データブロックの所定
フレーム（Ｑ−Ｃ）から最大フレーム（Ｑ−１）までの
各フレームに登録特徴データブロックの所定フレーム（
Ｊ−Ｃ”）から最大フレーム（Ｊ−１）までの夫々が１
対１に対応され、順次、フレーム間距離が求められ、フ
レーム間距離データが終端側マツチング距離計算回路１
６に供給される。Further, in the end-side frame distance calculation circuit 15, an inter-frame distance is calculated for a predetermined end-side frame of the input feature data block and a predetermined end-side frame of the registered feature data block. For example, when the interframe distance is calculated by direct matching, and C is a constant number of frames, registration is made in each frame from the predetermined frame (Q-C) to the maximum frame (Q-1) of the input feature data block. A given frame of feature data block (
JC”) to the maximum frame (J-1) are each 1
The interframe distances are sequentially determined, and the interframe distance data is sent to the terminal side matching distance calculation circuit 1.
6.

終端側マツチング距離計算回路１６において、順次供給
されるフレーム間距離データの累算処理がなされ、入力
特徴データブロックの最大フレーム（Ｑ−１）までフレ
ーム間距離データが累算されると、この累算値が終端側
フレームに関して求められたマツチング距離データとさ
れ、マツチング距離データが加算回路１４に供給される
。In the end-side matching distance calculation circuit 16, the sequentially supplied inter-frame distance data is accumulated, and when the inter-frame distance data is accumulated up to the maximum frame (Q-1) of the input feature data block, this accumulated distance data is accumulated. The calculated value is used as matching distance data obtained for the end frame, and the matching distance data is supplied to the adding circuit 14.

加算回路１４において、マツチング距離計算回路１３か
ら供給されるマツチング距離データと終端側マツチング
距離計算回路１６から供給されるマツチング距離データ
とが加算されて判定用距離データとされ、この判定用距
離データが最小距離判定回路１７に供給される。また、
同様に、フレーム数が太き（異ならない他の登録特徴デ
ータブロックとの間においても判定用距離が算出され、
判定用距離データが最小距離判定回路１７に供給される
。In the adding circuit 14, the matching distance data supplied from the matching distance calculation circuit 13 and the matching distance data supplied from the end-side matching distance calculation circuit 16 are added to form determination distance data, and this determination distance data is The signal is supplied to the minimum distance determination circuit 17. Also,
Similarly, the determination distance is calculated between other registered feature data blocks with a large number of frames (not different),
The distance data for determination is supplied to the minimum distance determination circuit 17.

最小距離判定回路１７は、判定用距離が最小で十分に距
離が近いものと判断される登録特徴データブロックに対
応する単語を認識結果として出力する。The minimum distance determination circuit 17 outputs, as a recognition result, a word corresponding to a registered feature data block that is determined to have a minimum determination distance and is sufficiently close.

第４図は、例えばサーチ幅を１として、フレーム間距離
の最小値抽出の際に二度以上続けて登録特徴データブロ
ックの＋１方向又は−１方向のフレームが選択された時
にサーチ幅区間の可変操作が行われて、判定用距離が算
出される過程の一例を示すものである。Fig. 4 shows the change in the search width section when, for example, the search width is set to 1 and frames in the +1 direction or -1 direction of the registered feature data block are selected twice or more in succession when extracting the minimum value of the interframe distance. It shows an example of a process in which an operation is performed and a determination distance is calculated.

第４図において、縦軸が登録特徴データブロックのフレ
ーム番号を示し、横軸が入力特徴チータブロックのフレ
ーム番号を示している。第４図中の細線上の各点が可変
操作されたサーチ幅区間に対応するフレーム番号を示し
、第４図中の破線上の各点がサーチ幅区間が可変操作さ
れない場合のフレーム番号を示し、第４図中の実線上の
各点が判定用距離に採用されたフレーム番号を示してい
る。In FIG. 4, the vertical axis indicates the frame number of the registered feature data block, and the horizontal axis indicates the frame number of the input feature cheater block. Each point on the thin line in Fig. 4 indicates the frame number corresponding to the search width section that has been variably manipulated, and each point on the broken line in Fig. 4 indicates the frame number when the search width section has not been variably manipulated. , each point on the solid line in FIG. 4 indicates a frame number adopted as the determination distance.

第４図に示すように、二度続けて登録特徴データブロッ
クの＋１方向又は−１方向が選択された場合においての
み、サーチ幅区間がシフトされ、フレーム間距離が算出
され、入力待ｆｌデータブロックと登録特徴データブロ
ックとに関するマツチング距離が求められる。また、入
力特徴データブロック及び登録特徴データブロックの所
定の終端側フレームに関してのみ、フレーム間距離が算
出され、Ｖ！端測フレームに関するマツチング距離が求
められる。As shown in FIG. 4, only when the +1 direction or -1 direction of the registered feature data block is selected twice in succession, the search width section is shifted, the interframe distance is calculated, and the input waiting fl data block is A matching distance between the registered feature data block and the registered feature data block is determined. Further, the interframe distance is calculated only for the predetermined terminal side frames of the input feature data block and the registered feature data block, and V! The matching distance for the end measurement frame is determined.

そして、全体にわたって求められたマツチング距離と終
端側フレームに関してのみ求められたマツチング距離と
が加算されて判定用距離とされ、この判定用距離が用い
られて単語認、識の判定がなされる。Then, the matching distance determined for the entire frame and the matching distance determined only for the end frame are added to form a determination distance, and this determination distance is used to determine word recognition and identification.

上述のこの発明の一実施例におけるパターンマツチング
判定器９のサーチ幅内フレーム距離計算回路１０．最小
値抽出回路１１．サーチ幅区間可変計算回路１２及びマ
ツチング距離計算回路１３の動作を、例えばサーチ幅を
１止して第５図に示すフローチャートを参照して説明す
る。また、第５図のフローチャートは、フレーム間距離
の最小値抽出の際に二度以上続けて登録特徴データブロ
ックのフレームの４−１方向または一１方向が選択され
た時にサーチ幅区間の可変操作が行われる場合のもので
ある。Frame distance calculation circuit 10 within the search width of the pattern matching determiner 9 in the embodiment of the present invention described above. Minimum value extraction circuit 11. The operations of the search width interval variable calculation circuit 12 and the matching distance calculation circuit 13 will be explained with reference to the flowchart shown in FIG. 5, for example, with the search width set at one. In addition, the flowchart in Fig. 5 shows that when the 4-1 direction or the 11 direction of the frame of the registered feature data block is selected twice or more consecutively when extracting the minimum value of the inter-frame distance, the search width section variable operation is performed. This applies when

サーチ幅内フレーム距離計算回路１０において、入力特
徴データブロックと登録特徴データプロソりとの間でス
テップ■及び■の処理が行われる。In the search range frame distance calculation circuit 10, the processes of steps (2) and (2) are performed between the input feature data block and the registered feature data processor.

まず、登録特徴データブロックのフレームの士１方向の
選択回数Ｋ及び登録特徴データブロックのフレームの一
１方向の選択回数Ｐが０に初期設定されると共に、登録
特徴データブロックのフレーム番号を示す変数ｊ及び入
力特徴データブロックのフレーム番号を示す変数ｑが０
に初期設定される。また、スイングサーチにより求めら
れるフレーム間距離の累積距離りがＯに初期設定される
と共に、登録特徴データブロックの最大フレーム番号り
がＪ−１に設定され、入力特徴データブロックの最大フ
レーム番号ＩがＱ−１に設定され、サーチ幅区間に対す
る制限値Δが所定の値に設定される（ステップ■）。First, the number of times K of selections in one direction between frames of the registered feature data block and the number of times P of selections in one direction of each frame of the registered feature data block are initialized to 0, and a variable indicating the frame number of the registered feature data block is initialized. j and the variable q indicating the frame number of the input feature data block are 0.
is initialized to . Additionally, the cumulative distance between frames determined by the swing search is initially set to O, the maximum frame number of the registered feature data block is set to J-1, and the maximum frame number I of the input feature data block is set to O. Q-1, and the limit value Δ for the search width section is set to a predetermined value (step ■).

そして、ステップ■において、入力特徴データブロック
のｑフレームに登録特徴データブロックの（ｊ＋１）フ
レーム、ｊフレーム及び（ｊ　−１）フレームが対応さ
れ、その夫々の間のフレーム間距離Ｄ　Ｉ＋　Ｄ　ｚ、
　Ｄ　：ｌがｎをチャンネル番号とし、Ｓ　Ｑｎを入カ
スベクトルデータ＋Ｒｊｎを登録スペクトルデータとし
てで求められる。即ち、対応するチャンネルのスペクトル
データの差の絶対値のチャンネルＯ〜チャンネル（Ｎ−
１）までの総和がフレーム間距離Ｄ＋、　Ｄ　ｚ、　Ｄ
　１とされる。また、このステップ■の処理に８いては
、入力特徴データブロックのＣｊ＝Ｏ）とされる場合に
は、（ｊ　−Ｎ　フレームが登録時ｉ攻データブロック
側に存在しないため（ｊ　−１−〇）とされる。また、
（ｊ＋１）フレームが登録特徴データブロックの最大フ
レーム番号り以上となる場合には、登録特徴データブロ
ック側に（ｊ＋１）フレームが存在しないため（ｊ十１
＝Ｌ）とされる。Then, in step (2), the q frames of the input feature data block are associated with the (j+1), j, and (j −1) frames of the registered feature data block, and the interframe distances between them are D I+ D z,
D:l is determined by using n as the channel number and SQn as the input cassette vector data+Rjn as the registered spectrum data. That is, the absolute value of the difference between the spectral data of the corresponding channels from channel O to channel (N-
The sum up to 1) is the interframe distance D+, D z, D
1. In addition, in the process of step (8), when Cj of the input feature data block is set to O), since (j −N frame does not exist in the i-attack data block side at the time of registration, (j −1− 〇).Also,
If the number of (j+1) frames is greater than or equal to the maximum frame number of the registered feature data block, there is no (j+1) frame in the registered feature data block, so
=L).

最小値抽出回路１１．マツチング距離計算回路１３及び
サーチ幅区間可変計算回路１２において、ステップ■〜
Ｏの処理がなされる。ステップ■において、入力特徴デ
ータブロックのｑフレームに（ｊ↓１）フレームを対応
させて求められたフレーム間距ＡＩＤ、、入力特徴デー
タブロックのｑフレームにｊフレームを対応させて求め
られたフレーム闇路＾ＩＤ２及び入力特徴データブロッ
クのｑフレームに（ｊ−１）フレームを対応させて求め
られたフレーム闇路Ａｉ　Ｄ　ｚの比較がなされ、最小
となるものが入力特徴データブロックのｑフレームに対
するフレーム間距離とされる。このフレーム間距離が累
算され累積距離りとされる。Minimum value extraction circuit 11. In the matching distance calculation circuit 13 and the search width section variable calculation circuit 12, steps
Processing of O is performed. In step ■, the interframe distance AID obtained by associating (j↓1) frames with q frames of the input feature data block, and the frame dark path obtained by associating j frames with q frames of the input feature data block. ^ID2 and the frame dark path Ai D z obtained by associating the (j-1) frame with the q frame of the input feature data block are compared, and the smallest one is the interframe for the q frame of the input feature data block. considered to be the distance. This inter-frame distance is accumulated to form a cumulative distance.

また、ステップ■及びステップ■において、最小として
選択されたフレーム間距離が登録時（攻データ側の（ｊ
＋１）フレーム、ｊフレーム及び（ｊ−１）フレームの
いずれかのうちから算出されたものであるかが判断され
る。In addition, in step ■ and step ■, the interframe distance selected as the minimum is registered ((j
+1) frame, j frame, and (j-1) frame is determined.

ステップ■において、フレーム間距離り、が選択された
かどうかが判断され、ＤＩが選択された場合には、十１
方向の選択回数Ｋがインクリメントされると共に、−１
方向の選択回数Ｐが０とされる（ステップ■）。また、
ＤＩが選択されない場合には、ステップ■にａいて、フ
レーム間距離Ｄ３が選択されたかどうかが判断され、Ｄ
３が選択された場合には、−１方向の選択回数Ｐがイン
クリメントされると共に、−１方向の選択回数Ｋが０と
される（ステップ０）。また、Ｄｌ及びＤ３が選択され
ない場合、即ちＤ２が選択された場合には、ステップ■
において、＋１方向の選択回数Ｋ及び−１方向の選択回
数Ｐが共に０とされる。In step ■, it is determined whether or not the interframe distance is selected, and if DI is selected,
The number of times K of direction selection is incremented, and -1
The number of times P of direction selection is set to 0 (step ■). Also,
If DI is not selected, it is determined in step (a) whether or not the interframe distance D3 is selected, and D is determined.
If 3 is selected, the number of selections P in the -1 direction is incremented, and the number of selections K in the -1 direction is set to 0 (step 0). Further, if Dl and D3 are not selected, that is, if D2 is selected, step
In this case, the number of selections K in the +1 direction and the number P of selections in the -1 direction are both set to 0.

ステップ■及びステップ■において、サーチ幅区間を可
変操作すべきかどうかが判断される。In steps (2) and (2), it is determined whether the search width section should be varied.

ステップ■において、＋１方向の選択回数にと２とが比
較されることにより、二度以上続けて＋１方向のフレー
ムが選択されたかどうかが判断され、（Ｋ≧２）の場合
には、登録特徴データブロックのフレーム番号を示す変
数ｊと入力特赦チータブロックのフレーム番号を示す変
数ｑとの差（ｊ−ｑ）が制限値Δ以下かどうかが判断さ
れる（ステップ０）。そして、１ｒｌｌ　ｌ（旧直Δ以
下の場合には、ｊがインクリメントされ１士１とされ（
ステノブ・Ｄ）、制限値Δを越える場合には、ｊはその
ままとされ、次のステップに移行する。In step ■, the number of selections in the +1 direction is compared with 2 to determine whether frames in the +1 direction have been selected twice or more in a row, and if (K≧2), the registered feature It is determined whether the difference (j-q) between the variable j indicating the frame number of the data block and the variable q indicating the frame number of the input amnesty cheater block is less than or equal to the limit value Δ (step 0). Then, 1rll l (if the old direct Δ is less than or equal to Δ, j is incremented and set to 1 minus 1 (
Stenobu D), if the limit value Δ is exceeded, j is left as is and the process moves to the next step.

また、（Ｋ＜２）の場合には、ステップ■において、−
１方向の選択口数Ｐと２とが比較されることにより、二
度以上続けて一１方向のフレームが選択されたかどうか
が判断され、（Ｐ≧２）の場合には、入力待１敢データ
ブロックのフレーム番号を示す変数ｑと登録特徴データ
ブロックのフレーム番号を示す変数ｊとの差（ｑ−ｊ）
が制限値Δ以下かどうかが判断される（ステップ＠）。Moreover, in the case of (K<2), in step (■), -
By comparing the number of selections P in one direction with 2, it is determined whether frames in one direction have been selected twice or more in a row, and if (P≧2), the input waiting data is Difference between variable q indicating the frame number of the block and variable j indicating the frame number of the registered feature data block (q-j)
It is determined whether or not is less than or equal to the limit value Δ (step @).

そして、制限値Δ以下の場合には、ｊがデクリメントさ
れｊ−１とされ（ステップＯ）、制限値Δを越える場合
には、ｊはそのままとされ次のステップに移行する。Then, if it is less than the limit value Δ, j is decremented to j-1 (step O), and if it exceeds the limit value Δ, j is left as is and the process moves to the next step.

ステップ■において、入力特徴データブロックのフレー
ム番号を示す変数ｑと入力特徴データブロックの最大フ
レーム番号■とが比較され、（ｑ＜１）の場合には、登
録特徴データブロックのフレーム番号を示す変数ｊがイ
ンクリメントされｊ↑１とされる（ステップ［相］）と
共に、ｑがインクリメントされｑ”１とされ、再びステ
ップ■及び■の処理が行われる。つまり′、＋１方向の
フレーム又は−１方間のフレームが制御＠値Δを越えな
い範囲で二度以上続けて選択される場合を除いて、入力
特徴データブロックのフレーム番号を示す変数ｑ及び登
録特徴データブロックのフレーム番号を示す変数ｊが両
者ともに士１インクリメントされ、次のフレームに関す
る計算処理に移行する。In step ■, the variable q indicating the frame number of the input feature data block is compared with the maximum frame number ■ of the input feature data block, and if (q<1), the variable q indicating the frame number of the registered feature data block is At the same time, j is incremented to j↑1 (step [phase]), q is incremented to q''1, and the processes of steps ■ and ■ are performed again. Variable q indicating the frame number of the input feature data block and variable j indicating the frame number of the registered feature data block are Both are incremented by 1, and the calculation process for the next frame is started.

また、士１方向のフレームが二度以上続けて選択される
場合にはｑが＋１インクリメントされると共に、ｊが±
２インクリメントされるため、＋１方向にサーチ幅区間
がシフトされ、次のフレームに関する計算処理に移行す
る。−１方向のフレームが二度以上続けて選択される場
合には、ｑのみが十ｌインクリメントされるため、−１
方向にサーチ幅区間がシフトされ、次のフレームに関す
る計算処理に移行する。Additionally, if frames in the +1 direction are selected twice or more consecutively, q is incremented by +1, and j is incremented by +/-.
Since it is incremented by 2, the search width section is shifted in the +1 direction, and the calculation process for the next frame is started. - If frames in one direction are selected twice or more in succession, only q is incremented by 10l, so -1
The search width section is shifted in the direction, and calculation processing for the next frame is started.

上述した処理が繰り返し行われることにより、サーチ幅
区間がシフトされながら各フレームに関してフレーム間
距離が算出され、最小とされるフレーム間距離が累算さ
れ、入力特徴データブロックのフレーム番号を示す変数
ｑが最大フレーム番号■とされ、終端フレームに関して
計算処理がなされると、一つの登録特徴データブロック
に関する計算が終了される。この時のフレーム間距離の
累積距ＡＩＤがマツチング距離とされる。そして、他の
登録時（攻データブロックとの間においても同様にマツ
チング距離が算出される。By repeating the above process, the interframe distance is calculated for each frame while the search width section is shifted, the minimum interframe distance is accumulated, and the variable q indicating the frame number of the input feature data block is calculated. is set as the maximum frame number ■, and when the calculation process is performed on the end frame, the calculation on one registered feature data block is completed. The cumulative distance AID of the inter-frame distances at this time is taken as the matching distance. Then, matching distances are similarly calculated during other registrations (between offensive data blocks).

尚、この発明の一実施例においては、入力特徴データブ
ロック側から登録特徴データブロック側にサーチを行う
場合について説明したが、登録特徴データブロック側か
ら入力特徴データブロック側にサーチを行う構成として
も良い。また、この発明の一実施例においては、終端側
のフレーム間距離を求める際に、直接マツチングにより
フレーム間距離を丁める場合について説明したが、従来
の固定サーチのマツチングにより終端側フレームに関す
るマツチング距離を算出する構成としても良い。また、
この発明の一実施例においては、二度以上続けて、登録
待機データブロックの＋１方向又は−１方向のフレーム
が選択された時にサーチ幅区間の可変操作が行われる場
合について説明したが、二度以上に限定するものでなく
、また、確率的な判断に基づいて可変操作を行う構成と
しても良い。In one embodiment of the present invention, a case has been described in which the search is performed from the input feature data block side to the registered feature data block side, but it is also possible to perform a search from the registered feature data block side to the input feature data block side. good. In addition, in one embodiment of the present invention, when determining the distance between frames on the end side, a case has been described in which the distance between frames is determined by direct matching. It may also be configured to calculate the distance. Also,
In one embodiment of the present invention, a case has been described in which the variable operation of the search width section is performed when frames in the +1 direction or -1 direction of the registration waiting data block are selected twice or more in succession. The present invention is not limited to the above, and a configuration may also be adopted in which variable operations are performed based on probabilistic judgments.

また、この発明は、ハードワイヤードの構成に限らず、
マイクロコンピュータ又はマイクロプログラム方式を用
いてソフトウェアにより処理を行うようにしても良い。Moreover, this invention is not limited to a hard-wired configuration.
The processing may be performed by software using a microcomputer or a microprogram method.

〔Effect of the invention〕

この発明では、パターンマツチング判定器において、入
力待１攻データブロックのフレームに対してサーチ幅に
応じて登録特徴データブロックのフレームが複数個対応
させらゎ１、夫々のフレーム間に関してフレーム間距離
が求められ、最小となるフレーム間距離が選択されると
共に、最小となるフレーム間距離が登録特徴データプロ
、りのどのフレームにより算出されたものであるがが判
断され、登録特徴データブロックの十方向のフレームか
ら最小となるフレーム間距離が算出される傾向にある場
合には、士方向にサーチ幅の中心値がシフトされ、また
、一方向のフレームから最小となるフレーム間距離が算
出される傾向にある場合には、一方向にサーチ幅の中心
値がシフトされて、サーチ幅内において常に最小となる
フレーム間距離が算出されるようになされ、順次、フレ
ーム間距離が算出され、最小となるフレーム間距離が累
算されることによりマツチング距離が算出される。In this invention, in the pattern matching judger, a plurality of frames of the registered feature data block are made to correspond to a frame of the input waiting data block according to the search width. is calculated, the minimum inter-frame distance is selected, it is determined which frame the minimum inter-frame distance was calculated by the registered feature data professional, and the tenth of the registered feature data blocks are determined. If the minimum inter-frame distance tends to be calculated from the frames in one direction, the center value of the search width is shifted in the direction, and the minimum inter-frame distance is calculated from the frames in one direction. If there is a tendency, the center value of the search width is shifted in one direction so that the minimum inter-frame distance is always calculated within the search width, and the inter-frame distances are sequentially calculated and the minimum distance is calculated. The matching distance is calculated by accumulating the inter-frame distances.

従って、この発明に依れば、入力特徴データブロックと
登録特徴データブロックのフレーム数がやや大きくこと
なる場合や、特徴データブロック中に時間軸変動の音響
が残されている場合においても対応することができ、正
確にマツチングが行え、認識率を向上させることができ
る。Therefore, according to the present invention, it is possible to cope with the case where the number of frames of the input feature data block and the registered feature data block are slightly different, or even when the sound with time axis fluctuation remains in the feature data block. It is possible to perform accurate matching and improve the recognition rate.

また、この発明の一実施例においては、入力特徴データ
ブロックの所定の終端側フレームと登録特徴データブロ
ックの所定の終端側フレームとの部分的なマツチング距
離が算出され、このマツチング距離が入力待１攻データ
ブロックと登録特徴データブロックとの間においてサー
チ幅をシフトさせながら求められた全体的なマツチング
距離に加算されて判定用距離とされるため、相方の語尾
の差異を強めることができ、更に認識率を向上させるこ
とができる。Further, in one embodiment of the present invention, a partial matching distance between a predetermined end frame of the input feature data block and a predetermined end frame of the registered feature data block is calculated, and this matching distance is Since it is added to the overall matching distance obtained while shifting the search width between the attack data block and the registered feature data block and used as the judgment distance, it is possible to strengthen the difference in the endings of the partner words, and The recognition rate can be improved.

[Brief explanation of drawings]

第１図はこの発明の一実施例の全体の構成のブロック図
、第２図はこの発明の一実施例におけるパターンマツチ
ング判定器のブロック図、第３図及び第４図はこの発明
の一実施例におけるパターンマツチング判定器の動作説
明に用いる路線図、第５図はこの発明の一実施例におけ
るパターンマツチング判定器の動作説明に用いるフロー
チャートである。図面における主要な符号の説明ｌ：マイクロホン、　　５：音響分析器。６：特徴データ抽出器、　　７：モード切替回路。８：登録パターンメモリ、　　９：パターンマッチング
判定器、　　１０：サーチ幅内フレーム距離計算回路、
　　１１：最小値抽出回路、　　１２：サーチ幅区間可
変計算回路、　　１３：マソチング距離計算回路、　　
１４：加算回路、　　１５：終端側フレーム距離計算回
路、　　１６：終端側マツチング距離計算回路、　　１
７：最小距離判定回路。代理人　　　弁理士　杉　浦　正　知会１体のａへ゛第１図第２図FIG. 1 is a block diagram of the overall configuration of an embodiment of the present invention, FIG. 2 is a block diagram of a pattern matching determiner in an embodiment of the invention, and FIGS. 3 and 4 are diagrams of an embodiment of the invention. FIG. 5 is a route map used to explain the operation of the pattern matching determiner in the embodiment, and FIG. 5 is a flowchart used to explain the operation of the pattern matching determiner in the embodiment of the present invention. Explanation of main symbols in the drawings 1: Microphone, 5: Acoustic analyzer. 6: Feature data extractor, 7: Mode switching circuit. 8: Registered pattern memory, 9: Pattern matching determiner, 10: Frame distance calculation circuit within search width,
11: Minimum value extraction circuit, 12: Search width section variable calculation circuit, 13: Masoching distance calculation circuit,
14: Addition circuit, 15: Termination side frame distance calculation circuit, 16: Termination side matching distance calculation circuit, 1
7: Minimum distance determination circuit. Agent Patent Attorney Tadashi Sugiura Tomokai 1A Figure 1 Figure 2

Claims

[Claims] Acoustic analysis means that performs preprocessing necessary for speech recognition such as spectrum conversion of an input audio signal; and output data of the acoustic analysis means is supplied, and the time axis is normalized by an appropriate number of divisions. a feature data extracting means for processing and extracting feature data; a memory in which the feature data is stored as a standard pattern; and an input pattern from the feature data extracting means and the standard pattern read from the memory. The distance between each frame is determined by associating the supplied frame constituting one of the input pattern and the standard pattern with a number of frames corresponding to the search width constituting the other of the input pattern and the standard pattern. is calculated, and the distance calculation process between the input pattern and the standard pattern is performed while shifting the center value of the search width according to the information on the minimum inter-frame distance, and matching is determined based on the result of the distance calculation process. A speech recognition device comprising: a pattern matching determination means;