JPH04242300A

JPH04242300A - Voice recognition device

Info

Publication number: JPH04242300A
Application number: JP3003913A
Authority: JP
Inventors: Waichiro Tsujita; 辻田　和一郎; Kenichi Hirayama; 健一平山
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-01-17
Filing date: 1991-01-17
Publication date: 1992-08-28

Abstract

PURPOSE:To obtain the voice recognition device which automatically stops a tape recorder when there is a message for voice input interruption even in the absence of an operator who confirms the message from a display part and automatically restarts the rotation of the tape recorder when the message for voice input interruption is released. CONSTITUTION:This device is equipped with an up-counter 15a which counts the feature quantity from a feature detection part 7 and outputs a data-full signal when the counted value reaches a specific value, a down-counter 15b which counts down each time the matching distance is read in from an input buffer memory and outputs a data-empty signal when reaching a specific counted value, and a tape voice input judging circuit part 15c which judges that the input buffer memory is full when an end detection signal is inputted after the data-full signal is inputted and stops the tape recorder, and judges that the input buffer memory is empty when the data empty signal is inputted from the down counter and then restarts the rotation of the tape recorder.

Description

[Detailed description of the invention]

【０００１】0001

【産業上の利用分野】本発明は音声認識装置に関し、特
にテ−プレコ−ダからの音声信号を誤認識しないための
制御に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice recognition apparatus, and more particularly to control for preventing erroneous recognition of voice signals from a tape recorder.

【０００２】0002

【従来の技術】従来の音声認識装置は、マイクロホンに
よる入力が一般的であるが、マイクロホンと併せてテ−
プレコ−ダ等からのライン入力も可能となっている場合
があり本説明ではライン入力も可能となっている音声認
識装置を用いて説明する。[Prior Art] Conventional speech recognition devices generally use microphones for input.
In some cases, line input from a precoder or the like is also possible, and in this explanation, a voice recognition device that also allows line input will be used.

【０００３】図２は従来の音声認識装置の概略構成図で
ある。図において、１はマイクロホン、２はテ−プレコ
−ダ、３はマイクロホン入力端子、４はライン入力端子
である。FIG. 2 is a schematic diagram of a conventional speech recognition device. In the figure, 1 is a microphone, 2 is a tape recorder, 3 is a microphone input terminal, and 4 is a line input terminal.

【０００４】５は切替スイッチであり、マイクロホン１
又はテ−プレコ−ダ２のいずれかからの音声信号を入力
する場合に手動で切替えられるものである。[0004] 5 is a changeover switch, and microphone 1
Alternatively, when an audio signal from either tape recorder 2 is input, the switch can be made manually.

【０００５】６は音声入力部であり、マイクロホン１又
はテ−プレコ−ダ２から入力する音声信号の増幅、帯域
制限及びＡ／Ｄ変換等がなされたデジタルの音声信号（
以下音声デ−タという）を出力するものである。Reference numeral 6 denotes an audio input section, which receives a digital audio signal (which is amplified, band-limited, A/D converted, etc.) input from the microphone 1 or tape recorder 2.
This is to output audio data (hereinafter referred to as audio data).

【０００６】７は特徴検出部であり、音声区間の検出及
びＦＥＴ、ＬＰＣ分析等の公知の分析方法によるスペク
トラル、ケプストラム等の特徴量を抽出し、その区間の
終了を知らせる終端検出信号を出力するものである。[0006] Reference numeral 7 denotes a feature detection unit, which detects a voice section, extracts feature quantities such as spectral and cepstrum by known analysis methods such as FET and LPC analysis, and outputs an end detection signal indicating the end of the section. It is something.

【０００７】８はマッチング部であり、特徴抽出部７で
得られた音声デ−タの特徴量を標準パタ−ン（音節、単
語等の認識対象となる音声の特徴量）との類似度演算を
行い距離値（市街値距離、ＷＬＲ等の公知の距離値）を
出力するものである。[0007] Reference numeral 8 denotes a matching unit, which performs a similarity calculation between the features of the speech data obtained by the feature extraction unit 7 and standard patterns (features of speech to be recognized such as syllables and words). and outputs a distance value (known distance value such as city value distance, WLR, etc.).

【０００８】９は最小値探索部であり、マッチング部８
から出力する距離値（以下マッチング距離という）をｎ
個記憶できる入力バッファメモリに一時記憶し、最小の
距離を選び出し、最小の距離値に対応する標準パタ−ン
を認識した音声デ−タ（認識音声デ−タという）として
表示部１０に出力する。9 is a minimum value search unit, and a matching unit 8
The distance value output from (hereinafter referred to as matching distance) is n
The minimum distance is selected, and the standard pattern corresponding to the minimum distance value is output to the display unit 10 as recognized voice data (referred to as recognized voice data). .

【０００９】１０は表示部であり、出力される認識音声
デ−タを表示し、次の音声入力が可能か及び入力を待つ
必要があるかの表示を行う。これは、単語、文節等の単
位で音声を連続して入力した場合は、当然入力バッファ
メモリは一杯になり、認識されない内に次の距離値が入
力することを防ぐためである。１１は各部にタイミング
信号を出力して所定のデ−タを得る制御部である。Reference numeral 10 denotes a display unit that displays the output recognized speech data and indicates whether the next speech input is possible or whether it is necessary to wait for the next input. This is to prevent the input buffer memory from becoming full if speech is continuously input in units of words, phrases, etc., and the next distance value is input before it is recognized. Reference numeral 11 denotes a control section that outputs timing signals to each section to obtain predetermined data.

【００１０】上記のように構成された従来の音声認識装
置について説明する。例えば切替スイッチ５を切替えて
テ−プレコ−ダ２からの再生音声信号を入力すると、音
声入力部６はノイズ等を除去した再生音声デ−タを特徴
抽出部７に出力する。A conventional speech recognition device configured as described above will be explained. For example, when the selector switch 5 is switched to input a reproduced audio signal from the tape recorder 2, the audio input section 6 outputs reproduced audio data from which noise and the like have been removed to the feature extraction section 7.

【００１１】特徴抽出部７は入力した再生音声デ−タか
ら公知の分析方法により、特徴量を抽出してマッチング
部８に出力する。次にマッチング部８は、特徴抽出部７
で得られた再生音声デ−タの特徴量と標準パタ−ンとの
類似度演算を行いマッチング距離を終端検出信号に基づ
いて最小値探索部９に出力する。すると、最小値探索部
９はマッチング部８から出力するマッチング距離を入力
バッファメモリに一時記憶し、最小の距離を選び出し、
最小の距離値に対応する標準パタ−ンを認識した再生音
声デ−タとして表示部１０に出力する。そして、表示部
１０は認識した再生音声デ−タを表示し、次の音声入力
が可能か及び入力を待つ必要があるかの表示を行い。The feature extraction section 7 extracts feature amounts from the input reproduced audio data using a known analysis method and outputs them to the matching section 8. Next, the matching unit 8 performs the feature extraction unit 7
A similarity calculation is performed between the feature amount of the reproduced audio data obtained in step 1 and the standard pattern, and a matching distance is output to the minimum value search section 9 based on the end detection signal. Then, the minimum value search unit 9 temporarily stores the matching distance output from the matching unit 8 in the input buffer memory, selects the minimum distance,
The standard pattern corresponding to the minimum distance value is output to the display unit 10 as recognized reproduced audio data. Then, the display unit 10 displays the recognized reproduced audio data, and displays whether the next audio input is possible and whether it is necessary to wait for the next input.

【００１２】また、切替スイッチ５を切替えてマイクロ
ホンからの音声信号を入力すると、各部は上記と同様な
動作をして表示部１０に上記の必要なメッセ−ジを表示
するのでオペレ−タはそのメッセ−ジに応じて次の音声
をマイクロホン１を介して入力していた。Furthermore, when the selector switch 5 is switched to input the audio signal from the microphone, each part operates in the same manner as described above and displays the above-mentioned necessary message on the display part 10, so that the operator can The next voice was input via the microphone 1 in response to the message.

【００１３】[0013]

【発明が解決しようとする課題】上記のような従来の音
声認識装置では、表示部が最小探索部から音声を認識で
きる状態かできない状態かが知らせられて、そのことを
メッセ−ジで表示すると、マイクロホンによりオペレ−
タが音声を入力していた場合は、メッセ−ジに応じて音
声を入力したり、中止したりすることができるが、テ−
プレコ−ダで連続して音声を入力した場合は、最小探索
部は入力バッファメモリの状態に関係なく、マッチング
距離が入力するとそのマッチング距離に入力バッファメ
モリを更新するので、表示部で音声の入力ができないこ
とをメッセ−ジとして表示しても、テ−プレコ−ドは連
続して再生音声信号を出力し、結果として入力バッファ
メモリを更新するので音声の認識ができなくなるという
問題点があった。[Problem to be Solved by the Invention] In the conventional speech recognition device as described above, when the display section is notified from the minimum search section whether the speech is able to be recognized or not, and the display section displays this in a message. , operated by microphone.
If the computer is inputting audio, you can input or stop audio depending on the message, but the
When inputting audio continuously with the precoder, the minimum search unit updates the input buffer memory to the matching distance when the matching distance is input, regardless of the state of the input buffer memory, so the input buffer memory is updated to the matching distance on the display unit. Even if a message is displayed indicating that the tape recorder cannot be used, the problem is that the tape recorder continuously outputs the reproduced audio signal and updates the input buffer memory as a result, making it impossible to recognize the audio. .

【００１４】本発明は以上の問題点を解決するためにな
されたもので、表示部からのメッセ−ジを確認するオペ
レ−タがいなくとも、音声入力中断のメッセ−ジがあっ
た場合はテ−プレコ−ダを自動的に停止し、音声入力中
断のメッセ−ジが解除されると自動的にテ−プレコ−ダ
の回転を再生させることができる音声認識装置を得るこ
とを目的とする。The present invention has been made to solve the above problems, and even if there is no operator to check the message from the display, if there is a message to interrupt voice input, the screen will be displayed. - It is an object of the present invention to provide a voice recognition device that can automatically stop a recorder and automatically reproduce the rotation of a tape recorder when a message to interrupt voice input is released.

【００１５】[0015]

【課題を解決するための手段】本発明に係る音声認識装
置は、少なくとも再生指示があるとテ−プに記録された
音声信号を出力し、停止指令があると再生を停止するテ
−プレコ−ダからの音声信号から音声区間を検出して、
音声の特徴量を出力した後に、音声区間の終端を知らせ
る終端検出信号を出力する特徴検出部を有し、特徴検出
部からの音声の特徴量のマッチング距離を終端検出信号
が出力される毎に求め、そのマッチング距離を所定数格
納可能な入力バッファメモリに一時記憶し、記憶順から
音声を認識して表示部に出力する最小探索部を有した音
声認識装置において、特徴検出部から特徴量がマッチン
グ部に出力される毎にカウントアップし、そのカウント
値が入力バッファメモリが満たされる前の所定数に到達
すると、デ−タフル信号を出力して、カウント値をクリ
アにするアップカウンタと、入力バッファメモリの格納
個数が予め設定され、最小値探索部によって、入力バッ
ファメモリからマッチング距離が読込まれる毎に、カウ
ントダウンして、そのカウント値が入力バッファメモリ
が空であると判断できるカウント値に到達すると、デ−
タ空信号を出力した後に、設定した格納個数に設定する
ダウンカウンタと、アップカウンタからのデ−タフル信
号が入力した後に、終端検出信号が入力すれば入力バッ
ファメモリが一杯であると判断してテ−プレコ−ダに停
止信号を出力し、またダウンカウンタからデ−タ空信号
が入力すると入力バッファメモリが空であると判断して
テ−プレコ−ダに再生信号を出力するテ−プ音声入力判
断回路部とを備えたものである。[Means for Solving the Problems] A voice recognition device according to the present invention is a tape recorder that outputs a voice signal recorded on a tape when a playback instruction is given, and stops playback when a stop command is given. Detect the audio section from the audio signal from the da,
It has a feature detection section that outputs an end detection signal that indicates the end of the speech section after outputting the feature amount of the voice, and calculates the matching distance of the feature amount of the voice from the feature detection section every time the end detection signal is output. In a speech recognition device that has a minimum search unit that temporarily stores a predetermined number of matching distances in an input buffer memory that can store a predetermined number of matching distances, recognizes speech in the order in which it is stored, and outputs it to a display unit, a feature value is detected from a feature detection unit. an up counter that counts up each time it is output to the matching section, and when the count value reaches a predetermined number before the input buffer memory is filled, outputs a data full signal to clear the count value; The number of stored items in the buffer memory is set in advance, and the minimum value search unit counts down each time the matching distance is read from the input buffer memory, and the count value becomes a count value that allows it to be determined that the input buffer memory is empty. When it arrives, the date
After outputting the data empty signal, inputting the data full signal from the down counter which sets the storage number to the set value, and the data full signal from the up counter, it is determined that the input buffer memory is full. A tape audio device that outputs a stop signal to the tape recorder, and when a data empty signal is input from the down counter, it determines that the input buffer memory is empty and outputs a playback signal to the tape recorder. It is equipped with an input determination circuit section.

【００１６】[0016]

【作用】本発明の音声認識装置においては、テ−プレコ
−ダからの音声が特徴検出部によって、その特徴量が求
められてマッチング部に出力されると、アップカウンタ
がマッチング部に特徴量が出力される毎にカウントアッ
プし、そのカウント値が最小値検索手段の入力バッファ
メモリが満たされる前の所定数に到達すると、デ−タフ
ル信号をテ−プ音声入力判断回路部に出力すると共に、
カウント値をクリアにする。[Operation] In the speech recognition device of the present invention, when the feature detection section calculates the feature amount of the speech from the tape recorder and outputs it to the matching section, the up counter causes the matching section to detect the feature amount. It counts up each time it is output, and when the count value reaches a predetermined number before the input buffer memory of the minimum value search means is filled, it outputs a data full signal to the tape audio input determination circuit, and
Clear the count value.

【００１７】次にテ−プ音声入力判断回路部はデ−タフ
ル信号が入力してのち、音声区間の終端を知らせる終端
検出信号が特徴部から入力すると、入力バッファメモリ
が一杯であると判断してテ−プレコ−ダに停止信号を出
力して、テ−プレコ−ダのテ−プの再生を停止させる。Next, the tape voice input judgment circuit section judges that the input buffer memory is full when an end detection signal indicating the end of the voice section is input from the characteristic section after the data full signal is input. and outputs a stop signal to the tape recorder to stop the tape recorder from playing the tape.

【００１８】次に、入力バッファメモリのマッチング距
離が最小値探索手段により読出されると、ダウンカウン
タはカウントダウンし、そのカウント値が入力バッファ
メモリが空であると判断できるカウント値に到達すると
、デ−タ空信号をテ−プ音声入力判断回路部に出力した
後に、予め設定されている格納個数に設定する。Next, when the matching distance of the input buffer memory is read by the minimum value search means, the down counter counts down, and when the count value reaches a count value at which it can be determined that the input buffer memory is empty, the down counter counts down. - After outputting the tape empty signal to the tape audio input determining circuit section, the number of stored tapes is set to a preset number.

【００１９】そして、テ−プ音声入力判断回路部はデ−
タ空信号が入力すると入力バッファメモリが空であると
判断してテ−プレコ−ダに再生信号を出力して、再び再
生を開始させる。Then, the tape audio input judgment circuit section
When a tape empty signal is input, it is determined that the input buffer memory is empty, and a playback signal is output to the tape recorder to start playback again.

【００２０】[0020]

【実施例】図１は本発明の音声認識装置の概略構成図で
ある。図において、１〜１１は上記の図２と同様なもの
である。１５は入力バッファ監視部であり、以下に説明
する回路を有するものである。DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a schematic diagram of a speech recognition apparatus according to the present invention. In the figure, 1 to 11 are the same as those in FIG. 2 above. Reference numeral 15 denotes an input buffer monitoring section, which has a circuit described below.

【００２１】１５ａはアップカウンタであり、マッチン
グ部８から出力される音声デ−タのマッチング距離を１
個ずつカウンタアップし、その値が入力バッファメモリ
の格納個数ｎより、ｎ−２に到達すると、入力バッファ
が一杯になったとするデ−タフル信号を後述するテ−プ
音声入力判断回路に出力するものである。15a is an up counter, which increases the matching distance of the audio data output from the matching section 8 by 1.
The counter is incremented one by one, and when the value reaches n-2 from the number n stored in the input buffer memory, a data full signal indicating that the input buffer is full is output to the tape audio input judgment circuit described later. It is something.

【００２２】１５ｂはダウンカウンタであり、最小値探
索部９の入力バッファメモリに格納されたｎ−２のマッ
チング距離がそれぞれ出力されると、ダウンカウントし
、その値が入力バッファメモリが空であると判定できる
値になれば、後述するテ−プ音声入力判断回路にデ−タ
空信号を出力するものである。Reference numeral 15b is a down counter, which counts down when each n-2 matching distance stored in the input buffer memory of the minimum value search unit 9 is output, and the value indicates that the input buffer memory is empty. If the value reaches a value that can be determined, a data empty signal is output to a tape audio input determining circuit, which will be described later.

【００２３】１５ｃはテ−プ音声入力判断回路であり、
アップカウンタ１５ａからデ−タフル信号が入力すると
、テ−プレコ−ダ２の再生を停止させる準備をし、特徴
検出部７から終端検出信号が入力すると、直ちにテ−プ
レコ−ダ２に停止信号を出力して、再生を停止させ、ま
たダウンカウンタ１５ｂからデ−タ空信号が入力すると
、直ちにテ−プレコ−ダ２に再生信号を出力して再生音
声デ−タを出力させるものである。15c is a tape audio input judgment circuit;
When a data full signal is input from the up counter 15a, preparations are made to stop the playback of the tape recorder 2, and when an end detection signal is input from the feature detection section 7, a stop signal is immediately sent to the tape recorder 2. When a data empty signal is input from the down counter 15b, a reproduction signal is immediately output to the tape recorder 2 to output reproduced audio data.

【００２４】なお、この場合は入力バッファメモリがｎ
に到達する間に必ず終端検出信号が出力されるようにさ
れている。Note that in this case, the input buffer memory is n
The termination detection signal is always outputted while reaching the end point.

【００２５】上記のように構成された音声認識装置につ
いて以下に動作を説明する。切替スイッチ５を切替えて
テ−プレコ−ダ２を動作させて、再生した再生音声デ−
タを音声入力部６に出力させる。The operation of the speech recognition device configured as described above will be explained below. Switch the selector switch 5 to operate the tape recorder 2 and play the reproduced audio data.
output the data to the audio input section 6.

【００２６】音声入力部６はその再生音声信号を増幅、
帯域制限及びＡ／Ｄ変換等がなされた再生音声デ−タを
特徴検出部７に出力する。The audio input section 6 amplifies the reproduced audio signal,
The reproduced audio data that has been subjected to band limitation, A/D conversion, etc. is output to the feature detection section 7.

【００２７】特徴検出部７は入力した再生音声デ−タか
ら公知の分析方法により、音声区間の検出し、再生音声
デ−タの特徴量を逐次抽出して出力すると共に、音声区
間の終端を知らせる終端検出信号を出力する。次にマッ
チング部８は、特徴検出部７で得られた再生音声デ−タ
の特徴量と標準パタ−ンとの類似度演算を行いマッチン
グ距離を最小値探索部９に出力する。The feature detection unit 7 detects a voice section from the input reproduced voice data using a known analysis method, sequentially extracts and outputs the feature quantities of the reproduced voice data, and detects the end of the voice section. Outputs a termination detection signal to notify. Next, the matching section 8 calculates the degree of similarity between the feature amount of the reproduced audio data obtained by the feature detecting section 7 and the standard pattern, and outputs the matching distance to the minimum value searching section 9.

【００２８】すると、最小値探索部９はマッチング部８
から出力するマッチング距離をｎ個記憶できるバッファ
メモリに一時記憶して読込み、最小の距離を選び出し、
最小の距離値に対応する標準パタ−ンを認識再生音声デ
−タとして表示部１０に出力する。そして、表示部１０
は認識音声デ−タを表示し、次の音声入力が可能か及び
入力を待つ必要があるかの表示を行う。Then, the minimum value search section 9 performs the matching section 8.
Temporarily store and read the matching distances output from n into a buffer memory, select the minimum distance,
The standard pattern corresponding to the minimum distance value is output to the display section 10 as recognized and reproduced audio data. And display section 10
displays the recognized voice data and indicates whether the next voice input is possible or whether it is necessary to wait for the next voice input.

【００２９】また、アップカウンタ１５ａはマッチング
部８から出力されるマッチング距離をアップカウントし
、その値が入力バッファメモリの記憶容量ｎからｎ−２
に到達した値になると、テ−プ音声入力判断回路１５ｃ
に入力バッファメモリが一杯になったとしてデ−タフル
信号を出力する。Further, the up counter 15a counts up the matching distance output from the matching section 8, and the value increases from the storage capacity n of the input buffer memory to n-2.
When the value reached is reached, the tape audio input judgment circuit 15c
When the input buffer memory is full, a data full signal is output.

【００３０】テ−プ音声入力判断回路１５ｃはデ−タフ
ル信号が入力すると、テ−プレコ−ダ２を停止させる準
備をし、特徴検出部７から終端検出信号が入力すると、
テ−プレコ−ダ２に停止信号を出力して停止させる。When the tape audio input judgment circuit 15c receives the data full signal, it prepares to stop the tape recorder 2, and when the end detection signal is input from the feature detection section 7, it prepares to stop the tape recorder 2.
A stop signal is output to the tape recorder 2 to stop it.

【００３１】このとき、上記説明のように最小値探索部
９はバッファメモリに一時記憶されたマッチング距離を
読込み、最小の距離を選び出し、最小の距離値に対応す
る標準パタ−ンを認識再生音声デ−タと出力するので、
ダウンカウンタ１５ｂは入力バッファメモリからマッチ
ング距離が出力されて読込まれる都度、ダウンカウント
しその値が例えば０に到達すると、入力バッファメモリ
が空になったことを知らせるデ−タ空信号をテ−プ音声
入力判断回路１５ｃに出力する。At this time, as explained above, the minimum value search unit 9 reads the matching distances temporarily stored in the buffer memory, selects the minimum distance, and recognizes the standard pattern corresponding to the minimum distance value to reproduce the reproduced voice. Since it is output as data,
The down counter 15b counts down each time the matching distance is output and read from the input buffer memory, and when the value reaches 0, for example, it outputs a data empty signal indicating that the input buffer memory is empty. The input signal is output to the voice input judgment circuit 15c.

【００３２】すると、テ−プ音声入力判断回路１５ｃは
テ−プレコ−ダ２に再び音声を再生して出力させる再生
信号を出力して、再生音声デ−タを出力するので各部は
上記と同様な動作をする。Then, the tape audio input determination circuit 15c outputs a reproduction signal that causes the tape recorder 2 to reproduce and output the audio again, and outputs the reproduced audio data, so each part is the same as above. make certain movements.

【００３３】[0033]

【発明の効果】以上のように本発明によれば、テ−プレ
コ−ダからの音声が特徴検出部によって、その特徴量が
求められてマッチング部に出力される毎に、その出力回
数をカウントし、その値が最小値検索部の入力バッファ
メモリが満たされる前の所定数に到達すると、テ−プレ
コ−ダを停止させ、また入力バッファメモリに格納され
たマッチング距離が読出される毎に、その読みだし回数
をカウントし、その値から入力バッファメモリが空であ
ると判断してテ−プレコ−ダに再生信号を出力して、再
び再生を開始させるようにしたことにより、表示部から
のメッセ−ジを確認するオペレ−タがいなくとも、表示
部のメッセ−ジに応じてテ−プレコ−ダを自動的に停止
させたり、テ−プレコ−ダの回転を再生させることがで
きるので誤認識を防止できるという効果が得られている
。As described above, according to the present invention, each time the feature amount of the sound from the tape recorder is determined by the feature detection section and outputted to the matching section, the number of outputs is counted. However, when the value reaches a predetermined number before the input buffer memory of the minimum value search section is filled, the tape recorder is stopped, and each time the matching distance stored in the input buffer memory is read out, By counting the number of readings, determining from that value that the input buffer memory is empty, and outputting a playback signal to the tape recorder to start playback again, the display section Even if there is no operator to check the message, the tape recorder can be stopped automatically or the tape recorder can be replayed according to the message on the display to avoid mistakes. The effect of preventing recognition has been achieved.

[Brief explanation of the drawing]

【図１】本発明の音声認識装置の概略構成図FIG. 1 A schematic configuration diagram of a speech recognition device of the present invention.

【図２】従
来の音声認識装置の概略構成図[Figure 2] Schematic configuration diagram of a conventional speech recognition device

[Explanation of symbols]

１　　マイク２　　テ−プレコ−ダ５　　切替スイッチ６　　音声入力部７　　特徴検出部８　　マッチング部９　　最小値探索部１０　　表示部１１　　制御部１５ａ　　アップカウンタ１５ｂ　　ダウンカウンタ１５ｃ　　テ−プ音声入力判断回路 1. Microphone 2 Tape recorder 5 Selector switch 6 Audio input section 7 Feature detection section 8 Matching section 9 Minimum value search unit 10 Display section 11 Control section 15a Up counter 15b Down counter 15c Tape audio input judgment circuit

Claims

[Claims]

Claim 1: Detecting an audio section from an audio signal from a tape recorder that outputs an audio signal recorded on a tape when there is at least a playback instruction, and stops playback when a stop command is given, a feature detection unit that outputs an end detection signal indicating the end of the voice section after outputting the voice feature, and the end detection signal outputs a matching distance of the voice feature from the feature detection unit; In the speech recognition device, the speech recognition device has a minimum search unit that temporarily stores the matching distance in an input buffer memory capable of storing a predetermined number of matching distances, recognizes the speech from the stored order, and outputs it to the display unit. Each time a feature quantity is output from the matching section to the matching section, the count is counted up, and when the count value reaches a predetermined number before the input buffer memory is filled, a data full signal is output and the count value is cleared. An up counter to store the matching distance in the input buffer memory is set in advance, and each time a matching distance is read from the input buffer memory by the minimum value search unit, the count value is counted down and the count value is stored in the input buffer memory. When the counter reaches a count value that can be determined to be empty, a data empty signal is output, and then a down count is set to the set storage number, and after a data full signal from the up counter is input, the data empty signal is output. When an end detection signal is input, it is determined that the input buffer memory is full, and a stop signal is output to the tape recorder, and when a data empty signal is input from the down counter, the input buffer memory is determined to be full. 1. A speech recognition device comprising: a tape audio input determining circuit section that determines that the tape is empty and outputs a reproduction signal to the tape recorder.