JP2002156997A

JP2002156997A - Voice detection controller

Info

Publication number: JP2002156997A
Application number: JP2000355072A
Authority: JP
Inventors: Toshio Akaha; 俊夫赤羽
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2000-11-21
Filing date: 2000-11-21
Publication date: 2002-05-31

Abstract

PROBLEM TO BE SOLVED: To provide a voice detection controller which can reduce the power consumption of a voice processor by performing voice detection adaptively to environment. SOLUTION: An input signal is converted by a rectifier 101 into an amplitude signal, a threshold is found by a low-pass filter 102 and an amplifier 103, and the quantity of variation in amplitude is found by a band-pass filter 104 and a rectifier 105. A comparator 106 compares the found variation quantity with the threshold and judges that the input signal possibly includes a voice when the variation quantity exceeds the threshold. Then a timer 107 outputs a control signal for a specific time and the voice processor is placed in operation.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声入力装置、音
声検出装置または音声認識装置等の音声処理装置に用い
られる音声検出制御装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice detection control device used for a voice processing device such as a voice input device, a voice detection device or a voice recognition device.

【０００２】[0002]

【従来の技術】音声認識装置に代表されるデジタル音声
処理装置は、一般に、高速処理が可能なＤＳＰやＣＰＵ
を用いており、動作中に大きな電力を必要とする。この
ため、常時入力信号を処理するために商用電源を利用す
るか、または、スイッチ操作があるまで動作を停止させ
ておく必要があった。2. Description of the Related Art Digital voice processing devices represented by voice recognition devices generally include DSPs and CPUs capable of high-speed processing.
And requires large power during operation. For this reason, it is necessary to always use a commercial power supply for processing the input signal or stop the operation until a switch operation is performed.

【０００３】これを改善するために、低消費電力の音量
検出回路を設けて、この音量検出回路のみを常時作動さ
せ、音量が所定の値よりも大きくなった場合にＡＤ（ア
ナログデジタル）変換装置および音声認識装置の動作を
開始させる提案が幾つかなされている。この音量検出回
路は、アナログ回路や単純なＣＭＯＳ回路で実現されて
いる。例えば、特開２０００−８９７９２号において
は、ＣＭＯＳ論理回路を用いて音声検出を行う音声認識
装置が提案されている。In order to improve this, a low power consumption sound volume detection circuit is provided, and only this sound volume detection circuit is constantly operated, and when the sound volume becomes larger than a predetermined value, an AD (analog-digital) conversion device is provided. Some proposals have been made to start the operation of the speech recognition device. This volume detection circuit is realized by an analog circuit or a simple CMOS circuit. For example, Japanese Patent Application Laid-Open No. 2000-89792 proposes a speech recognition apparatus that performs speech detection using a CMOS logic circuit.

【０００４】[0004]

【発明が解決しようとする課題】人間が会話をする場合
には、無意識に環境騒音に合わせて声の大きさを調整す
るため、通常、静かな環境では小さな声で話し、騒がし
い環境では大きな声を出す。このため、音量検出回路に
おいて、入力信号の振幅に対して一定の閾値を設けた場
合、静かな環境での小さな声を検出することができない
ために音声認識装置が動作せず、逆に、騒音下では常に
声が含まれているものとして音声認識装置が動作するこ
とにより、無駄な消費電力を消費する結果となる。When a human has a conversation, he or she unconsciously adjusts the volume of the voice in accordance with the environmental noise. Therefore, in a quiet environment, the user usually speaks with a small voice, and in a noisy environment, the voice is loud. Put out. For this reason, when a certain threshold value is provided for the amplitude of the input signal in the volume detection circuit, the voice recognition device does not operate because a small voice cannot be detected in a quiet environment, and conversely, Below, the operation of the speech recognition apparatus assuming that a voice is always included results in wasteful power consumption.

【０００５】また、音量検出してから認識装置を作動さ
せた場合、音声認識装置に送られる音声信号は語頭が欠
けたものとなり、望ましくない。When the recognition device is operated after detecting the sound volume, the voice signal sent to the voice recognition device lacks the beginning of a word, which is not desirable.

【０００６】さらに、背景雑音が大きい場合には、騒音
の振幅に比べて音声の振幅が小さい場合もあるため、入
力信号の振幅や短時間エネルギーのみから音声の有無を
判断するのは困難であり、語頭や語尾が欠落したり、検
出ができなくなる場合もある。Further, when the background noise is large, the amplitude of the voice may be smaller than the amplitude of the noise, so it is difficult to determine the presence or absence of the voice only from the amplitude of the input signal or the short-time energy. In some cases, the beginning or end of a word may be missing or detection may not be possible.

【０００７】本発明は、このような従来技術の課題を解
決するためになされたものであり、環境に適応した音声
検出を行って、音声処理装置の消費電力を低減すること
ができる音声検出制御装置を提供することを目的とす
る。SUMMARY OF THE INVENTION The present invention has been made to solve the above-mentioned problems of the prior art, and performs voice detection adapted to an environment to reduce power consumption of a voice processing apparatus. It is intended to provide a device.

【０００８】[0008]

【課題を解決するための手段】本発明の音声検出制御装
置は、入力信号を整流して振幅信号に変換する整流手段
と、該整流手段からの振幅信号を個別に処理して、振幅
の変化量と閾値とを各々求める特性の異なる２種類のフ
ィルタ手段と、各フィルタ手段からの出力を比較して、
振幅の変化量が閾値を超えた場合に、該入力信号中に音
声が含まれている可能性があると判断して音声検知信号
を出力する比較手段と、該比較手段から音声検知信号が
出力された場合に、所定の時間、制御信号を出力するタ
イマー手段とを備えており、そのことにより上記目的が
達成される。A voice detection control device according to the present invention comprises a rectifier for rectifying an input signal and converting it into an amplitude signal, and individually processing the amplitude signal from the rectifier to change the amplitude. By comparing the output from each filter means with two types of filter means having different characteristics for obtaining the amount and the threshold value, respectively,
A comparison unit that outputs a voice detection signal by determining that the input signal may include voice when the amplitude change amount exceeds a threshold, and outputs a voice detection signal from the comparison unit. A timer means for outputting a control signal for a predetermined time when the operation is performed, thereby achieving the above object.

【０００９】本発明の音声検出制御装置は、入力信号を
整流して振幅信号に変換する整流手段と、該整流手段に
よって振幅信号に変換された入力信号を対数変換する対
数増幅手段と、該対数増幅手段からの出力を処理する帯
域通過フィルタ手段と、該帯域通過フィルタ手段からの
出力が所定の閾値を超えた場合に、該入力信号中に音声
が含まれている可能性があると判断して音声検知信号を
出力する比較手段と、該比較手段から音声検知信号が出
力された場合に、所定の時間、制御信号を出力するタイ
マー手段とを備えており、そのことにより上記目的が達
成される。A voice detection control device according to the present invention comprises: a rectifier for rectifying an input signal and converting the input signal into an amplitude signal; a logarithmic amplifier for logarithmically converting the input signal converted to an amplitude signal by the rectifier; A band-pass filter for processing the output from the amplifying unit, and determining that there is a possibility that voice is included in the input signal when the output from the band-pass filter exceeds a predetermined threshold. And a timer means for outputting a control signal for a predetermined time when the sound detection signal is output from the comparing means, thereby achieving the above object. You.

【００１０】帯域通過フィルタ手段によって処理した振
幅信号を再度整流する第２の整流手段を備え、該帯域通
過フィルタ手段からの出力が該第２の整流手段を介して
前記比較手段に入力されてもよい。A second rectifier for re-rectifying the amplitude signal processed by the band-pass filter, wherein an output from the band-pass filter is input to the comparator via the second rectifier; Good.

【００１１】低域通過フィルタ手段によって処理した振
幅信号を増幅する増幅手段を備え、該増幅手段の増幅率
を外部から調整可能とされ、該低域通過フィルタ手段か
らの出力が該増幅手段を介して前記比較手段に入力され
てもよい。Amplifying means for amplifying the amplitude signal processed by the low-pass filter means, wherein an amplification factor of the amplifying means can be adjusted from the outside, and an output from the low-pass filter means is passed through the amplifying means. May be input to the comparing means.

【００１２】前記入力信号をデジタルデータに変換する
アナログデジタル変換器と、バッファメモリとを備え、
音声検出前の入力信号をデジタルデータとして該バッフ
ァメモリに記憶してもよい。An analog-to-digital converter for converting the input signal into digital data, and a buffer memory;
The input signal before voice detection may be stored as digital data in the buffer memory.

【００１３】前記低域通過フィルタ手段からの出力を増
幅する増幅手段と比較手段とタイマー手段とを複数ずつ
備え、各増幅手段の増幅率を異なる値に設定することに
より複数の制御信号を生成し、そのうちの１つにより前
記アナログデジタル変換器と前記バッファメモリの電源
とクロックとを制御してもよい。A plurality of amplifying means for amplifying the output from the low-pass filter means, a plurality of comparing means, and a plurality of timer means are provided, and a plurality of control signals are generated by setting the amplification factors of the respective amplifying means to different values. One of them may control a power supply and a clock of the analog-to-digital converter and the buffer memory.

【００１４】入力信号を周波数帯域毎に分離する複数の
第２の帯域通過フィルタ手段を備え、さらに、各第２の
帯域通過フィルタ手段によって分離された各帯域信号を
振幅信号に変換する複数の前記整流手段と、該整流手段
からの振幅信号を個別に処理して、振幅の変化量と閾値
とを各々求める複数組の前記特性の異なる２種類のフィ
ルタ手段と、各周波数帯域毎に各フィルタ手段からの出
力を比較して、振幅の変化量が閾値を超えた場合に、該
入力信号中に音声が含まれている可能性があると判断し
て音声検知信号を出力する複数の前記比較手段と、各比
較手段からの音声検知信号を総合する総合手段と、該総
合手段からの出力によって、所定の時間、制御信号を出
力する前記タイマー手段とを備えていてもよい。A plurality of second band-pass filter means for separating the input signal for each frequency band, and a plurality of the plurality of band signals for converting each band signal separated by each second band-pass filter means into an amplitude signal. Rectifier means, a plurality of sets of two types of filter means having different characteristics, each of which individually processes an amplitude signal from the rectifier means to obtain an amplitude change amount and a threshold value, and a filter means for each frequency band. A plurality of the comparing means for outputting a sound detection signal by judging that there is a possibility that sound is included in the input signal when the amount of change in amplitude exceeds a threshold value. And a totaling means for synthesizing the voice detection signal from each comparing means, and the timer means for outputting a control signal for a predetermined time based on an output from the generalizing means.

【００１５】入力信号を周波数帯域毎に分離する複数の
第２の帯域通過フィルタ手段を備え、さらに、各第２の
帯域通過フィルタ手段によって分離された各帯域信号を
振幅信号に変換する複数の前記整流手段と、該整流手段
からの周波数帯域毎の振幅信号を対数変換する複数の前
記対数増幅手段と、各周波数帯域毎に該対数増幅手段か
らの出力を処理する複数の前記帯域通過フィルタ手段
と、該帯域通過フィルタ手段からの出力が所定の閾値を
超えた場合に、その周波数帯域の入力信号中に音声が含
まれている可能性があると判断して音声検知信号を出力
する複数の前記比較手段と、各比較手段からの音声検知
信号を総合する総合手段と、該総合手段からの出力によ
って、所定の時間、制御信号を出力する前記タイマー手
段とを備えていてもよい。A plurality of second band-pass filter means for separating an input signal for each frequency band, and a plurality of the plurality of band signals for converting each band signal separated by each second band-pass filter means into an amplitude signal. Rectifier, a plurality of logarithmic amplifiers that logarithmically convert the amplitude signal for each frequency band from the rectifier, and a plurality of the bandpass filter units that process the output from the logarithmic amplifier for each frequency band. When the output from the band-pass filter exceeds a predetermined threshold, it is determined that there is a possibility that voice is included in the input signal of the frequency band, and a plurality of voice detection signals are output. Even if it is provided with a comparing means, a general means for synthesizing the voice detection signal from each comparing means, and the timer means for outputting a control signal for a predetermined time by an output from the general means. There.

【００１６】前記第２の帯域通過フィルタ手段および前
記整流手段によって変換された周波数帯域毎の振幅信号
を処理する第２の低域通過フィルタ手段と、該第２の低
域通過フィルタ手段からの出力をデジタルデータに変換
するマルチプレクサおよびアナログデジタル変換器と、
バッファメモリとを備え、周波数帯域毎の振幅信号をデ
ジタルデータとして該バッファメモリに記憶してもよ
い。A second low-pass filter for processing an amplitude signal for each frequency band converted by the second band-pass filter and the rectifier, and an output from the second low-pass filter; A multiplexer and an analog-to-digital converter for converting
A buffer memory may be provided, and an amplitude signal for each frequency band may be stored as digital data in the buffer memory.

【００１７】前記第２の帯域通過フィルタ手段および前
記整流手段によって変換された周波数帯域毎の振幅信号
を前記低域通過フィルタ手段によって平均したものを、
該周波数帯域毎の振幅信号から減算して雑音の影響を低
減した後で、デジタルデータに変換してもよい。An average of the amplitude signals for each frequency band converted by the second band-pass filter means and the rectifier means by the low-pass filter means,
After reducing the influence of noise by subtracting from the amplitude signal for each frequency band, the data may be converted into digital data.

【００１８】本発明の音声検出制御装置は、入力信号を
複数の異なる周波数で振幅変調する複数の振幅変調手段
と、各振幅変調手段からの出力を処理する、各々所定の
特性を有する複数の第３の低域通過フィルタ手段とを備
え、さらに、該振幅変調手段および該第３の低域通過フ
ィルタ手段によって変換された周波数帯域毎の振幅信号
を個別に処理して、振幅の変化量と閾値とを各々求める
複数組の特性の異なる２種類のフィルタ手段と、各周波
数帯域毎に各フィルタ手段からの出力を比較して、振幅
の変化量が閾値を超えた場合に、該入力信号中に音声が
含まれている可能性があると判断して音声検知信号を出
力する複数の比較手段と、各比較手段からの音声検知信
号を総合する総合手段と、該総合手段からの出力によっ
て、所定の時間、制御信号を出力するタイマー手段とを
備えており、そのことにより上記目的が達成される。A voice detection control device according to the present invention comprises a plurality of amplitude modulation means for amplitude-modulating an input signal at a plurality of different frequencies, and a plurality of amplitude modulation means for processing an output from each amplitude modulation means, each having a predetermined characteristic. 3 low-pass filter means, and further individually processes the amplitude signal for each frequency band converted by the amplitude modulation means and the third low-pass filter means, thereby obtaining an amplitude change amount and a threshold value. A plurality of sets of two types of filter means having different characteristics are respectively obtained, and outputs from the respective filter means are compared for each frequency band, and when the amount of change in amplitude exceeds a threshold value, A plurality of comparing means for determining that there is a possibility that sound is included and outputting a sound detection signal, a total means for summing the sound detection signals from the respective comparing means, and a predetermined time of, And a timer means for outputting a control signal, the object is achieved.

【００１９】本発明の音声検出制御装置は、入力信号を
複数の異なる周波数で振幅変調する複数の振幅変調手段
と、各振幅変調手段からの出力を処理する、各々所定の
特性を有する複数の第３の低域通過フィルタ手段とを備
え、さらに、該整流手段からの周波数帯域毎の振幅信号
を対数変換する複数の対数増幅手段と、該振幅変調手段
および該第３の低域通過フィルタ手段によって変換され
た周波数帯域毎の振幅信号を対数変換する複数の対数増
幅手段と、各周波数帯域毎に該対数増幅手段からの出力
を処理する複数の帯域通過フィルタ手段と、該帯域通過
フィルタ手段からの出力が所定の閾値を超えた場合に、
その周波数帯域の入力信号中に音声が含まれている可能
性があると判断して音声検知信号を出力する複数の比較
手段と、各比較手段からの音声検知信号を総合する総合
手段と、該総合手段からの出力によって、所定の時間、
制御信号を出力するタイマー手段とを備えており、その
ことにより上記目的が達成される。A voice detection control device according to the present invention comprises a plurality of amplitude modulation means for amplitude modulating an input signal at a plurality of different frequencies, and a plurality of amplitude modulation means for processing an output from each amplitude modulation means, each having a predetermined characteristic. And a plurality of logarithmic amplifiers that logarithmically convert the amplitude signal for each frequency band from the rectifier, the amplitude modulator and the third low-pass filter. A plurality of logarithmic amplifiers that logarithmically convert the converted amplitude signal for each frequency band, a plurality of bandpass filter units for processing the output from the logarithmic amplifier unit for each frequency band, If the output exceeds a predetermined threshold,
A plurality of comparing means for judging that there is a possibility that sound is included in the input signal of the frequency band and outputting a sound detection signal; a total means for summing the sound detection signals from the respective comparing means; For a predetermined time,
Timer means for outputting a control signal, whereby the above object is achieved.

【００２０】アナログデジタル変換器およびデジタルプ
ロセッサを備え、入力信号をソフトウェアによってデジ
タル処理してもよい。[0020] An analog-to-digital converter and a digital processor may be provided, and the input signal may be digitally processed by software.

【００２１】以下、本発明の作用について説明する。Hereinafter, the operation of the present invention will be described.

【００２２】本発明にあっては、整流手段により入力信
号を整流して振幅信号に変換し、この振幅信号を特性の
異なる２種類のフィルタ手段により個別に処理する。具
体的には、低域通過フィルタ手段により振幅信号の数Ｈ
ｚ以下の低域成分のみを取り出して入力信号の平均振幅
を求める。この平均振幅を増幅手段によって所定の倍率
で増幅することにより、振幅の変化量に対する閾値とし
て出力する。この増幅手段の増幅率（ゲイン）を外部か
ら調整することにより、閾値を調整することが可能であ
る。In the present invention, the input signal is rectified by the rectifier and converted into an amplitude signal, and the amplitude signal is individually processed by two types of filter having different characteristics. Specifically, the number H of the amplitude signals is determined by the low-pass filter means.
An average amplitude of an input signal is obtained by extracting only low-frequency components equal to or lower than z. The average amplitude is amplified by the amplification means at a predetermined magnification, and is output as a threshold value for the change amount of the amplitude. The threshold can be adjusted by externally adjusting the amplification factor (gain) of the amplifying means.

【００２３】また、帯域通過フィルタ手段により入力信
号の振幅信号に含まれる数Ｈｚから２０Ｈｚ程度の変化
成分を求める。この変化成分を第２の整流手段によって
再度整流することにより、振幅の変化が増加方向であっ
ても減少方向であっても、正の値に表現することができ
る。Further, a change component of several Hz to about 20 Hz contained in the amplitude signal of the input signal is obtained by the band-pass filter means. By rectifying this change component again by the second rectifier, it is possible to express a positive value whether the change in the amplitude is in the increasing direction or in the decreasing direction.

【００２４】そして、比較器によって振幅の変化量を示
す信号と上記閾値とを比較し、振幅の変化量が閾値を超
えた場合に、入力信号中に音声が含まれている可能性が
あると判断して音声検知信号を出力する。比較手段から
音声検知信号が出力されると、タイマー手段によって所
定の時間、制御信号を出力する。Then, a signal indicating the amount of change in amplitude is compared with the threshold value by a comparator, and if the amount of change in amplitude exceeds the threshold value, it is determined that there is a possibility that voice is included in the input signal. Judge and output a voice detection signal. When the sound detection signal is output from the comparing means, the control signal is output for a predetermined time by the timer means.

【００２５】他の本発明にあっては、整流手段により入
力信号を整流して振幅信号に変換し、対数増幅手段によ
って対数増幅する。この対数化された振幅信号を帯域通
過フィルタ手段によって処理して、対数化された振幅の
変化成分を得る。この場合にも、変化成分を第２の整流
手段によって再度整流することにより、振幅の変化が増
加方向であっても減少方向であっても、正の値に表現す
ることができる。According to another aspect of the present invention, the input signal is rectified by the rectifying means, converted into an amplitude signal, and logarithmically amplified by the logarithmic amplification means. The logarithmic amplitude signal is processed by band-pass filter means to obtain a logarithmic amplitude change component. Also in this case, by rectifying the change component again by the second rectifier, the change in the amplitude can be expressed as a positive value regardless of whether the change is in the increasing direction or the decreasing direction.

【００２６】そして、比較器によって対数化された振幅
の変化量を示す信号と所定の閾値とを比較し、対数化さ
れた振幅の変化量を示す信号が所定の閾値を超えた場合
に、入力信号中に音声が含まれている可能性があると判
断して音声検知信号を出力する。比較手段から音声検知
信号が出力されると、タイマー手段によって所定の時
間、制御信号を出力する。Then, a signal indicating the amount of change in amplitude logarithmized by the comparator is compared with a predetermined threshold value. When the signal indicating the amount of change in logarithmized amplitude exceeds the predetermined threshold value, It determines that there is a possibility that the signal contains sound, and outputs a sound detection signal. When the sound detection signal is output from the comparing means, the control signal is output for a predetermined time by the timer means.

【００２７】さらに、アナログデジタル変換器によって
入力信号をデジタルデータに変換し、音声検出前の入力
信号をデジタルデータとしてバッファメモリに記憶させ
てもよい。このとき、低域通過フィルタ手段からの出力
を増幅する増幅手段と比較手段とタイマー手段とを複数
ずつ設けて各増幅手段の増幅率を異なる値に設定するこ
とにより、複数の制御信号を生成して音声検出よりも速
いタイミングでアナログデジタル変換器とバッファメモ
リの電源とクロックとを制御することが可能である。Further, the input signal may be converted into digital data by an analog-to-digital converter, and the input signal before voice detection may be stored as digital data in a buffer memory. At this time, a plurality of control signals are generated by providing a plurality of amplifying means, comparing means, and timer means for amplifying the output from the low-pass filter means and setting the amplification factors of the respective amplifying means to different values. Thus, it is possible to control the power supply and clock of the analog-to-digital converter and the buffer memory at a timing earlier than the voice detection.

【００２８】さらに、複数の第２の帯域通過フィルタ手
段を設けてフィルタバンクを構成し、入力信号を周波数
帯域毎に分離して、各々の信号を帯域毎に音声検出する
ことも可能である。この場合、総合手段によって各比較
手段からの音声検知信号を総合し、総合手段からの出力
によってタイマー手段から制御信号を出力させることが
可能である。Further, it is possible to provide a filter bank by providing a plurality of second band-pass filter means, to separate the input signal for each frequency band, and to detect the sound of each signal for each band. In this case, it is possible to integrate the voice detection signals from the respective comparing means by the integrating means, and to output the control signal from the timer means by the output from the integrating means.

【００２９】さらに、周波数帯域毎の振幅信号を処理す
る第２の低域通過フィルタ手段を設けて、この出力をマ
ルチプレクサによって時分割してアナログデジタル変換
器に入力し、各周波数帯域毎の振幅信号をデジタルデー
タとしてバッファメモリに記憶させることも可能であ
る。Further, second low-pass filter means for processing an amplitude signal for each frequency band is provided. The output is time-divided by a multiplexer and input to an analog-to-digital converter. Can be stored in the buffer memory as digital data.

【００３０】さらに、周波数帯域毎の振幅信号を低域通
過フィルタ手段によって平均し、これを周波数帯域毎の
振幅信号から減算してからデジタルデータに変換するこ
とにより、雑音の影響を低減することが可能である。Furthermore, the influence of noise can be reduced by averaging the amplitude signal for each frequency band by the low-pass filter means, subtracting the average from the amplitude signal for each frequency band, and converting it to digital data. It is possible.

【００３１】または、振幅変調手段によって入力信号を
複数の異なる周波数で振幅変調した後、各々所定の特性
を有する複数の第３の低域通過フィルタ手段によって処
理することにより、周波数帯域毎の振幅信号に変換して
もよい。Alternatively, the amplitude signal is amplitude-modulated at a plurality of different frequencies by the amplitude modulation means, and then processed by a plurality of third low-pass filter means each having a predetermined characteristic, whereby the amplitude signal for each frequency band is obtained. May be converted to

【００３２】これらの処理は、アナログデジタル変換器
およびデジタルプロセッサを用いて、ソフトウェアによ
ってデジタル処理することも可能である。These processes can also be digitally processed by software using an analog-to-digital converter and a digital processor.

【００３３】[0033]

【発明の実施の形態】以下に、本発明の実施の形態につ
いて説明する。Embodiments of the present invention will be described below.

【００３４】環境騒音の振幅は、状況により１０００倍
（６０ｄＢ）以上変化するのに対して、音圧に対する人
間の感覚量は概ね対数領域で表現されるため、かなり大
きな環境騒音やかなり小さな環境騒音でない限り、発話
者は環境騒音に比べて数倍から数百倍程度大きな声を出
す。また、音声の発声区間内では、振幅の定常区間が短
く、変化が多い。従って、人が充分に聞き取れる程度の
音声の発声区間では、環境騒音の音圧を下限として、そ
の数倍以上の振幅変化が観察される。The amplitude of the environmental noise changes by a factor of 1000 (60 dB) or more depending on the situation, while the amount of human perception of the sound pressure is generally expressed in a logarithmic region, so that a considerably large environmental noise or a fairly small environmental noise is obtained. Unless otherwise, the speaker speaks out several to several hundred times as loud as the environmental noise. Further, in the voice utterance section, the steady section of the amplitude is short and changes frequently. Therefore, in a speech utterance section in which a person can sufficiently hear the sound, an amplitude change that is several times as large as the lower limit of the sound pressure of the environmental noise is observed.

【００３５】そこで、本発明にあっては、音声を検出す
るために、（１）振幅の変化量を、環境騒音の振幅に適
応した閾値と比較する方法、または、（２）対数化した
振幅の変化量を一定の閾値と比較する方法の２つの方法
を用いる。これらの方法により、様々な環境騒音レベル
に対して音声検出が可能になる。Therefore, in the present invention, in order to detect voice, (1) a method of comparing the amount of change in amplitude with a threshold value adapted to the amplitude of environmental noise, or (2) a logarithmic amplitude Are compared with a fixed threshold value. These methods allow voice detection for various environmental noise levels.

【００３６】上記（１）の方法においては、入力信号の
振幅信号を特性の異なる複数のフィルタによって処理
し、一方で閾値を求め、他方で振幅の変化量を求めて、
これらを比較することにより音声を検出する。具体的に
は、数Ｈｚ以下の低域通過フィルタを用いて、環境騒音
の振幅レベルを得、それを増幅器によって増幅または減
衰することにより、環境に適応した閾値とする。振幅レ
ベルの増幅または減衰は入力マイクの特性やフィルタの
特性に応じて行い、用途に応じて調整する。検出し易く
するためには若干減衰し、検出し難くするためには若干
増幅する。なお、本明細書において低域通過フィルタと
称しているフィルタは、直流の影響を除くために直流を
遮断して低域のみを通過させる帯域通過フィルタにより
実現することも可能である。In the above-mentioned method (1), the amplitude signal of the input signal is processed by a plurality of filters having different characteristics.
The sound is detected by comparing these. Specifically, the amplitude level of the environmental noise is obtained using a low-pass filter of several Hz or less, and the amplitude level is amplified or attenuated by an amplifier, thereby obtaining a threshold value adapted to the environment. The amplification or attenuation of the amplitude level is performed according to the characteristics of the input microphone and the characteristics of the filter, and is adjusted according to the application. The signal is slightly attenuated for easy detection, and slightly amplified for difficult detection. Note that a filter referred to as a low-pass filter in this specification can be realized by a band-pass filter that cuts off direct current and passes only low frequencies in order to eliminate the influence of direct current.

【００３７】また、帯域通過フィルタを用いて音声によ
る振幅の変化成分を捉える。この場合、音声以外の騒音
による直流成分や速い変化成分を取り除くためには、帯
域通過フィルタの低域遮断周波数を数Ｈｚ以下とし、高
域遮断周波数を数Ｈｚから２０Ｈｚ程度に設定するのが
望ましい。Further, a change component of amplitude due to voice is captured by using a band-pass filter. In this case, in order to remove a DC component or a fast changing component due to noise other than voice, it is desirable to set the low cutoff frequency of the bandpass filter to several Hz or less and set the high cutoff frequency to several Hz to about 20 Hz. .

【００３８】上記（２）の方法においては、入力信号の
振幅信号を対数増幅し、帯域通過フィルタを通すことに
よって対数化された振幅の変化量を求める。これを一定
の域値と比較することにより音声を検出する。この場合
にも、帯域通過フィルタは音声の変化成分であるところ
の数Ｈｚ〜２０Ｈｚ程度を通過させるのが好ましい。In the above method (2), the amplitude signal of the input signal is logarithmically amplified and passed through a band-pass filter to obtain a logarithmic change in amplitude. The sound is detected by comparing this with a certain threshold value. Also in this case, it is preferable that the band-pass filter passes a few Hz to 20 Hz, which is a change component of the sound.

【００３９】音声の変化分を検出した後、数秒間は音声
が継続する可能性があるので、タイマーを設けて、所定
の時間、制御信号を出力し続けるようにする。タイマー
の設定時間は、音声の定常状態が続く可能性がある充分
な時間が必要であることから、１秒から数秒程度である
のが望ましい。Since the voice may continue for several seconds after detecting the change in the voice, a timer is provided so that the control signal is continuously output for a predetermined time. The setting time of the timer is desirably about one second to several seconds, since a sufficient time that the steady state of the voice may continue is required.

【００４０】さらに、検出してからＡＤ変換を行ったの
では音声の立ち上がりに間に合わないという問題につい
ては、ＡＤ変換回路等の消費電力が許容される場合に
は、常にＡＤ変換回路を動作させておき、バッファメモ
リに過去の一定時間のデータを蓄えておくという方法を
取ることができる。しかし、消費電力を削減する必要が
ある場合には、音声検出の閾値よりも低い閾値での音声
検出を別途行って、音声検出よりも少し速いタイミング
でＡＤ変換を開始し、バッファメモリにデータを蓄える
ようにするのが望ましい。具体的には、ＡＤ変換回路内
にＡＤ変換器とバッファメモリとインターフェイスとク
ロックの他に、電源制御器を設ける。そして、音声検出
制御装置の出力に接続された音声処理装置が音声検出制
御信号によって動作を開始する前に、バッファメモリに
蓄えた過去のデータを読み出すことにより、データが失
われることを防ぐことが可能である。Further, with respect to the problem that if the AD conversion is performed after the detection, it is not possible to keep up with the rise of the voice. When the power consumption of the AD conversion circuit or the like is allowed, the AD conversion circuit is always operated. Alternatively, a method of storing data for a certain period of time in the past in a buffer memory can be adopted. However, when it is necessary to reduce power consumption, voice detection is performed separately at a threshold lower than the voice detection threshold, AD conversion is started at a timing slightly earlier than voice detection, and data is stored in the buffer memory. It is desirable to store it. Specifically, a power supply controller is provided in the AD conversion circuit in addition to the AD converter, the buffer memory, the interface, and the clock. Then, before the sound processing device connected to the output of the sound detection control device starts operating according to the sound detection control signal, the past data stored in the buffer memory is read to prevent data loss. It is possible.

【００４１】以上の説明では、入力信号の全周波数帯域
を１つの振幅信号として利用しているため、大きな騒音
が存在する場合には、相対的に小さな音声を検出するこ
とができなくなることもあり得る。一方、現実的には背
景騒音が全ての帯域に同レベルで存在する白色雑音であ
る場合は少なく、ある周波数帯域に集中していることが
多い。典型的な例としては、走行時の自動車内の騒音は
数十Ｈｚ以下の低周波域の騒音エネルギー成分が非常に
大きい。そのため、全ての帯域で観測すると音声よりも
大きな振幅の背景騒音が存在する場合でも、低域を除い
た周波数帯域では、音声の振幅を充分に観測することが
できる場合が多い。これを利用して、周波数帯域毎に帯
域通過フィルタを設けたフィルタバンクを前段に設け
て、周波数帯域毎の出力に対して各々上述したように音
声検出を行って、周波数帯域毎に音声の有無を判断す
る。そして、この結果を総合することにより、騒音の大
きな環境でも、騒音の影響が少ない周波数帯域の情報を
利用して、充分な検出精度を得ることができる。帯域通
過フィルタと整流器を設ける代わりに、入力信号を複数
の異なる周波数で振幅変調した後、各々所定の特性を有
する低域通過フィルタによって処理することにより、周
波数帯域毎の振幅信号に変換することも可能である。In the above description, since the entire frequency band of the input signal is used as one amplitude signal, it may not be possible to detect a relatively small sound when there is a loud noise. obtain. On the other hand, in reality, the background noise is rarely white noise existing at the same level in all bands, and is often concentrated in a certain frequency band. As a typical example, the noise in a vehicle during traveling has a very large noise energy component in a low frequency range of several tens Hz or less. For this reason, even when background noise having a larger amplitude than voice exists when observed in all bands, it is often possible to sufficiently observe the voice amplitude in the frequency band excluding the low frequency band. Utilizing this, a filter bank provided with a band-pass filter for each frequency band is provided at the preceding stage, and voice detection is performed on the output for each frequency band as described above, and the presence or absence of voice for each frequency band. Judge. By integrating these results, sufficient detection accuracy can be obtained even in an environment with a large amount of noise by using information in a frequency band where the influence of the noise is small. Instead of providing a band-pass filter and a rectifier, the input signal may be amplitude-modulated at a plurality of different frequencies and then processed by a low-pass filter having predetermined characteristics, thereby converting the input signal into an amplitude signal for each frequency band. It is possible.

【００４２】さらに、フィルタバンク通過後の各帯域の
振幅信号を、低域通過フィルタに通して、マルチプレク
サを介してＡＤ変換回路に入力させることにより、周波
数帯域毎の振幅信号をデジタルデータとしてバッファメ
モリに記憶させ、音声処理装置に送ることができる。Further, the amplitude signal of each band after passing through the filter bank is passed through a low-pass filter and input to an AD conversion circuit via a multiplexer, so that the amplitude signal of each frequency band is converted into digital data in a buffer memory. And send it to the audio processor.

【００４３】定常的な雑音が存在する場合には、低域通
過フィルタの出力から、より低い遮断周波数を有する低
域通過フィルタの出力を減算することにより、雑音の影
響を低減することが可能である。When stationary noise is present, the effect of the noise can be reduced by subtracting the output of the low-pass filter having a lower cutoff frequency from the output of the low-pass filter. is there.

【００４４】このような音声検出処理は、ソフトウェア
によって行うことも可能である。Such a voice detection process can be performed by software.

【００４５】以下に、本実施の形態について、図面を参
照しながら説明する。Hereinafter, the present embodiment will be described with reference to the drawings.

【００４６】（実施形態１）図１は、実施形態１の音声
検出制御装置の構成を説明するための図である。音声を
対象とする場合には、入力信号はマイクロホンからの出
力を増幅器で増幅したものが一般的であるが、オーディ
オ装置やテレビジョンセット、電話や無線機等からの信
号であってもよい。(Embodiment 1) FIG. 1 is a diagram for explaining a configuration of a voice detection control device of Embodiment 1. In the case of audio, an input signal is generally obtained by amplifying an output from a microphone with an amplifier, but may be a signal from an audio device, a television set, a telephone, a wireless device, or the like.

【００４７】整流器１０１は、全波整流回路または半波
整流回路からなり、正負の値を有する入力信号を正の値
だけの振幅信号に整流する。この整流器は、入力の自乗
を出力する回路を利用することも可能である。The rectifier 101 comprises a full-wave rectifier circuit or a half-wave rectifier circuit, and rectifies an input signal having a positive or negative value into an amplitude signal having only a positive value. The rectifier can use a circuit that outputs the square of the input.

【００４８】低域通過フィルタ１０２は、数Ｈｚ程度か
ら１Ｈｚ程度以下の遮断周波数を有し、低域成分のみを
取り出して入力信号の振幅信号の数百ミリ秒から数秒程
度の平均に相当する値（平均振幅値）を求める。The low-pass filter 102 has a cutoff frequency of about several Hz to about 1 Hz or less, and extracts only a low-frequency component to obtain a value corresponding to an average of several hundred milliseconds to several seconds of an amplitude signal of an input signal. (Average amplitude value).

【００４９】増幅器１０３は、低域通過フィルタ１０２
で求めた平均振幅値を増幅し、音声検出のための閾値と
する。この閾値は、音声の振幅が騒音に対してどの程度
の大きさ以上である場合に検出するか、ということに影
響する。本実施形態では、可変増幅器を用いて外部から
増幅率（ゲイン）を調整できるようにしているが、数倍
程度の固定増幅器と分圧抵抗とを用いてもよい。なお、
増幅器１０３の増幅率は、低域通過フィルタ１０２と帯
域通過フィルタ１０４の帯域幅に合わせて調整するのが
好ましいが、実験により、ほぼ１倍の前後で調整すれば
良いことが分かっている、なお、１倍の場合には増幅器
を省略することも可能である。The amplifier 103 includes a low-pass filter 102
The average amplitude value obtained in step (1) is amplified and set as a threshold for voice detection. This threshold value affects how much the sound amplitude is detected with respect to noise. In the present embodiment, the amplification factor (gain) can be externally adjusted by using a variable amplifier. However, a fixed amplifier and a voltage dividing resistor which are several times larger may be used. In addition,
It is preferable to adjust the amplification factor of the amplifier 103 in accordance with the bandwidth of the low-pass filter 102 and the band-pass filter 104. However, experiments have shown that it is sufficient to adjust the gain around about 1 times. In the case of 1.times., The amplifier can be omitted.

【００５０】帯域通過フィルタ１０４は、数Ｈｚ〜２０
Ｈｚ程度の信号を通過させるフィルタである。この帯域
通過フィルタ１０４により、入力信号の振幅信号中、音
声によって変動する成分を取り出す。The band pass filter 104 has a frequency of several Hz to 20 Hz.
It is a filter that passes signals of about Hz. The band-pass filter 104 extracts a component that fluctuates due to voice in the amplitude signal of the input signal.

【００５１】整流器１０５は、音声の変動成分が正であ
っても負であっても正の信号として検出するためのもの
であり、省略することが可能である。なお、整流器１０
５を省略した場合には、帯域通過フィルタ１０４により
取り出された音声の変動成分のうち、正方向の変動、つ
まり振幅の増加に対しては正の信号として検出が可能と
なる。一方、負方向の変動、つまり振幅の減少に対して
は負の信号となるので、後述する比較器で検出すること
はできないが、後述するタイマーから制御信号を出力す
る時間を推定される発声時間に対して充分長く設定する
ことにより、音声の検出が可能である。The rectifier 105 is for detecting whether the fluctuation component of the sound is positive or negative as a positive signal, and can be omitted. The rectifier 10
If the step 5 is omitted, of the fluctuation components of the voice extracted by the band-pass filter 104, the fluctuation in the positive direction, that is, the increase in the amplitude, can be detected as a positive signal. On the other hand, since the signal in the negative direction, that is, the decrease in the amplitude, becomes a negative signal, it cannot be detected by the comparator described later, but the utterance time in which the time for outputting the control signal from the timer described later is estimated. By setting the length sufficiently long, voice can be detected.

【００５２】比較器１０６は、低域通過フィルタ１０２
からの出力を増幅器１０３により増幅して求められる閾
値と、帯域通過フィルタ１０４からの出力を整流器１０
５によって再度整流して得られる振幅変化信号（整流器
１０５を省略した場合には、帯域通過フィルタ１０４か
らの振幅変化信号）との比較を行い、音声の変動（振幅
変化信号と閾値の差）がある一定のオフセット値より大
きい場合に、入力信号に音声信号が含まれている可能性
があると判断して、音声検知信号を出力する。このオフ
セット値は、ゼロまたはゼロ付近の小さな値とする。The comparator 106 includes the low-pass filter 102
The threshold value obtained by amplifying the output from the amplifier 103 by the amplifier 103 and the output from the band-pass filter 104 are
5, and a comparison with an amplitude change signal obtained when the rectifier 105 is omitted (the amplitude change signal from the band-pass filter 104 when the rectifier 105 is omitted) is used. If the input signal is larger than a certain offset value, it is determined that the input signal may include a voice signal, and a voice detection signal is output. This offset value is set to zero or a small value near zero.

【００５３】タイマー１０７は、比較器１０６からの音
声検知信号を受け取ると、音声が継続すると推定される
時間、制御信号を出力する。このタイマー１０７の設定
時間としては、１秒程度から１０秒程度が適切であると
考えられる。この制御信号によって、音声認識装置等の
音声処理装置の作動や停止を制御することができる。When the timer 107 receives the sound detection signal from the comparator 106, the timer 107 outputs a control signal for a time period in which the sound is estimated to continue. It is considered that about 1 second to about 10 seconds is appropriate as the setting time of the timer 107. With this control signal, the operation and stop of a speech processing device such as a speech recognition device can be controlled.

【００５４】図２は、図１に示した本実施形態の音声検
出制御装置における内部処理波形を示す図である。図中
の１ａ〜１ｅは入力信号に音声が含まれている場合を示
し、２ａ〜２ｅは同じ環境騒音下で音声が含まれていな
い場合を示す。縦軸は振幅を示し、横軸は時間を示す。FIG. 2 is a diagram showing an internal processing waveform in the voice detection control device of this embodiment shown in FIG. 1a to 1e in the drawing show the case where the input signal includes a voice, and 2a to 2e show the case where no voice is included under the same environmental noise. The vertical axis indicates amplitude, and the horizontal axis indicates time.

【００５５】１ａおよび２ａは入力波形を示す。入力波
形１ａの音声のＳＮ比は約１５ｄＢである。1a and 2a show input waveforms. The S / N ratio of the voice of the input waveform 1a is about 15 dB.

【００５６】１ｂおよび２ｂは増幅器１０３の出力であ
り、音声検出のための閾値を示す。この例では、増幅器
１０３のゲインを１．０としているので、低域通過フィ
ルタ１０２の出力である平均振幅と同じものを示してい
る。低域通過フィルタ１０２は、遮断周波数が１Ｈｚで
２次のバタワースフィルタとした。Reference numerals 1b and 2b denote outputs of the amplifier 103, which indicate threshold values for voice detection. In this example, since the gain of the amplifier 103 is set to 1.0, the same amplitude as the output of the low-pass filter 102 is shown. The low-pass filter 102 was a second-order Butterworth filter with a cutoff frequency of 1 Hz.

【００５７】１ｃおよび２ｃは帯域通過フィルタ１０４
の出力を示す。帯域通過フィルタ１０４は、低域遮断周
波数１Ｈｚで２次、高域遮断周波数５Ｈｚで２次の各々
バタワースフィルタとした。1c and 2c are band-pass filters 104
The output of The band-pass filter 104 was a second-order Butterworth filter with a low cutoff frequency of 1 Hz and a second order with a high cutoff frequency of 5 Hz.

【００５８】１ｄおよび２ｄは整流器１０５の出力であ
り、入力信号の振幅の変化の大きさを表していると考え
られる。1d and 2d are the outputs of the rectifier 105, and are considered to indicate the magnitude of the change in the amplitude of the input signal.

【００５９】１ｅおよび２ｅは比較器１０６への入力で
ある振幅の変化量（１ｄ、２ｄ）から閾値（１ｂ、２
ｂ）を減算した値を示す。ここでは、閾値のオフセット
値をゼロとしているので、この値が正になると、比較器
１０６は音声を検出したことを示す音声検知信号を出力
し、タイマー１０７から一定時間ＯＮの制御信号を出力
する。１ｅに示す差信号は、入力信号のレベルの大小に
比例するが、入力音声のＳＮ比が同じであれば、差信号
の符号は全く同じであることが原理的に明らかである。
従って、比較器１０６は、入力信号のレベルに関係な
く、音声のＳＮ比が同じであれば、同じ結果を出力す
る。従って、本実施形態の音声検出制御装置が、入力信
号のレベルに関係無く動作することが分かる。1e and 2e denote the threshold values (1b, 2d) from the amplitude variation (1d, 2d) input to the comparator 106.
The value obtained by subtracting b) is shown. Here, since the offset value of the threshold value is zero, when this value becomes positive, the comparator 106 outputs a voice detection signal indicating that voice has been detected, and outputs a control signal that is ON for a predetermined time from the timer 107. . The difference signal 1e is proportional to the level of the input signal, but it is in principle clear that the sign of the difference signal is exactly the same if the S / N ratio of the input voice is the same.
Therefore, the comparator 106 outputs the same result regardless of the level of the input signal if the S / N ratio of the voice is the same. Therefore, it can be seen that the voice detection control device of the present embodiment operates regardless of the level of the input signal.

【００６０】（実施形態２）図３は実施形態２の音声検
出制御装置の構成を説明するための図である。この音声
検出制御装置は、音声検出回路２１６とＡＤ変換回路２
１７とから構成されている。(Embodiment 2) FIG. 3 is a diagram for explaining the configuration of a voice detection control device according to Embodiment 2. This voice detection control device includes a voice detection circuit 216 and an AD conversion circuit 2
17.

【００６１】音声検出回路２１６の基本構成は、図１の
構成とほぼ同様であるが、増幅器２０３、２０８、比較
器２０６、２０９およびタイマー２０７、２１０を２つ
ずつ設けている。整流器２０１、２０５、低域通過フィ
ルタ２０２および帯域通過フィルタ２０４は、図１に示
した整流器１０１、１０５、低域通過フィルタ１０２お
よび帯域通過フィルタ１０４と同様の機能を有する。The basic configuration of the voice detection circuit 216 is almost the same as the configuration of FIG. 1, except that two amplifiers 203 and 208, two comparators 206 and 209, and two timers 207 and 210 are provided. The rectifiers 201 and 205, the low-pass filter 202, and the band-pass filter 204 have the same functions as the rectifiers 101 and 105, the low-pass filter 102, and the band-pass filter 104 shown in FIG.

【００６２】増幅器２０３、比較器２０６およびタイマ
ー２０７によって外部の音声処理装置用の制御信号を作
成すると共に、増幅器２０８、比較器２０９およびタイ
マー２１０によってＡＤ変換回路２１７用の制御信号を
作成している。増幅器２０３と増幅器２０８の増幅率を
異なる値に設定することにより、２つの閾値による制御
信号を得ることができる。音声処理装置に送られる音声
の始まりが欠落しないようにするためには、帯域通過フ
ィルタの高域遮断周波数を少し高くすることにより応答
を速くすると共に、ＡＤ変換回路２１７用の増幅器２０
８の閾値を少し低く設定し、音声検出よりも少し早いタ
イミングでＡＤ変換回路２１７を動作させ、後述するバ
ッファメモリ２１４にデータを格納させるようにするの
が望ましい。制御される音声処理装置の立ち上がり時間
遅れのみが問題となる場合には、増幅器２０８、比較器
２０９およびタイマー２１０を省略して制御信号を１つ
だけ求め、外部の音声処理装置用の制御信号とＡＤ変換
回路２１７用の制御信号とを兼ねてもよい。また、制御
信号を外部の音声処理装置用の１つだけ求めて、ＡＤ変
換回路２１７を常に動作させることにより、消費電力は
増加するものの、音声の欠落は完全に防ぐことができ
る。A control signal for an external audio processing device is created by the amplifier 203, the comparator 206 and the timer 207, and a control signal for the AD conversion circuit 217 is created by the amplifier 208, the comparator 209 and the timer 210. . By setting the amplification factors of the amplifier 203 and the amplifier 208 to different values, a control signal based on two thresholds can be obtained. In order to prevent the beginning of the sound sent to the sound processing device from being lost, the response is speeded up by slightly increasing the high cutoff frequency of the band-pass filter, and the amplifier 20 for the AD conversion circuit 217 is used.
It is desirable to set the threshold value of 8 slightly lower, operate the AD conversion circuit 217 at a timing slightly earlier than the voice detection, and store data in the buffer memory 214 described later. If only the rise time delay of the controlled speech processing device is a problem, the amplifier 208, the comparator 209, and the timer 210 are omitted, and only one control signal is obtained. It may also serve as a control signal for the AD conversion circuit 217. Also, by finding only one control signal for the external audio processing device and constantly operating the AD conversion circuit 217, power consumption is increased, but loss of audio can be completely prevented.

【００６３】ＡＤ変換回路２１７に設けた電源制御器２
１１は、制御信号がＯＮになったときにのみ、クロック
２１２およびその他の回路に電源を供給する。ＡＤ変換
器２１３は、電源ＯＮのとき（制御信号がＯＮのとき）
にクロックを受け、入力信号をデジタルデータに変換し
てバッファメモリ２１４に格納する。インターフェイス
２１５は、外部の音声処理装置との通信を行って、バッ
ファメモリ２１４に蓄えられたデータを出力する。Power supply controller 2 provided in AD conversion circuit 217
11 supplies power to the clock 212 and other circuits only when the control signal is turned on. The AD converter 213 is turned on (when the control signal is turned on).
, And converts the input signal into digital data and stores it in the buffer memory 214. The interface 215 communicates with an external voice processing device and outputs data stored in the buffer memory 214.

【００６４】（実施形態３）図４は実施形態３の音声検
出制御装置の構成を説明するための図である。この音声
検出制御装置は、複数の帯域通過フィルタ３０１〜３０
３と複数の音声検出回路３０４〜３０６とＯＲ回路３１
３とタイマー３１４から構成されている。整流器３０
７、３１１、低域通過フィルタ３０８、帯域通過フィル
タ３１０、増幅器３０９および比較器３１２は、図１に
示した整流器１０１、１０５、低域通過フィルタ１０
２、帯域通過フィルタ１０４、増幅器１０３および比較
器１０６と同様の機能を有する。(Embodiment 3) FIG. 4 is a diagram for explaining the configuration of a voice detection control device according to Embodiment 3. This voice detection control device includes a plurality of bandpass filters 301 to 30.
3 and a plurality of voice detection circuits 304 to 306 and an OR circuit 31
3 and a timer 314. Rectifier 30
7 and 311, the low-pass filter 308, the band-pass filter 310, the amplifier 309, and the comparator 312 include the rectifiers 101 and 105 and the low-pass filter 10 shown in FIG.
2. It has the same functions as the band-pass filter 104, the amplifier 103, and the comparator 106.

【００６５】帯域通過フィルタ３０１〜３０３は、各々
異なる中心周波数を有し、フィルタバンクを構成してい
る。中心周波数の間隔は等間隔であってもよく、中心周
波数を対数化したものが等間隔になるようにしてもよ
い。また、メル特性やバーク特性等の聴覚特性を考慮し
た中心周波数と帯域幅のフィルタバンクを構成してもよ
い。帯域の分割数は特に制限されない。検出精度を高く
するためには帯域分割数が多いほど好ましいが、回路規
模の観点からは４帯域から１６帯域程度とするのが現実
的である。Each of the band-pass filters 301 to 303 has a different center frequency and constitutes a filter bank. The intervals between the center frequencies may be equal, and the logarithm of the center frequency may be equal. Further, a filter bank having a center frequency and a bandwidth in consideration of auditory characteristics such as a mel characteristic and a bark characteristic may be formed. The number of band divisions is not particularly limited. To increase the detection accuracy, it is preferable to increase the number of band divisions. However, from the viewpoint of the circuit scale, it is practical to set the number of bands to about 4 to 16 bands.

【００６６】各々の帯域通過フィルタを通過した入力信
号に対して、図１に示した音声検出制御装置と同様に、
整流器３０７〜比較器３１２までを用いて、帯域毎に音
声検出を行う。For the input signal passed through each band-pass filter, as in the voice detection control device shown in FIG.
Voice detection is performed for each band using the rectifier 307 to the comparator 312.

【００６７】ＯＲ回路３１２は、各帯域の音声検出結果
を総合し、例えば、どれか１つの帯域がＯＮになったと
き（比較器から音声検知信号が出力されたとき、音声検
知信号としてＯＮを出力する。なお、ＯＲ回路３１２
は、１つの帯域がＯＮになったときにＯＮを出力するの
ではなく、複数の帯域が同時にＯＮになったときにＯＮ
を出力するように構成してもよい。The OR circuit 312 integrates the sound detection results of the respective bands and, for example, when any one of the bands is turned on (when a sound detection signal is output from the comparator, when the band is turned on, the sound detection signal is turned on). The OR circuit 312 outputs
Does not output ON when one band is turned on, but turns on when multiple bands are turned on at the same time.
May be output.

【００６８】タイマー３１３は、ＯＲ回路３１２から音
声検知信号としてＯＮが入力されたときに、所定の時
間、制御信号としてＯＮを出力する。When ON is input as a sound detection signal from the OR circuit 312, the timer 313 outputs ON as a control signal for a predetermined time.

【００６９】（実施形態４）図５は実施形態４の音声検
出制御装置の構成を説明するための図である。この音声
検出制御装置は、複数の帯域通過フィルタ４０１〜４０
３と複数の音声検出回路４０４〜４０６とＡＤ変換回路
４１５とＯＲ回路４１３とタイマー４１４から構成され
ている。整流器４０７、４１１、低域通過フィルタ４０
８、帯域通過フィルタ４１０、増幅器４０９および比較
器４１２は、図１に示した整流器１０１、１０５、低域
通過フィルタ１０２、帯域通過フィルタ１０４、増幅器
１０３および比較器１０６と同様の機能を有する。ま
た、帯域通過フィルタ４０１〜４０３、ＯＲ回路４１３
およびタイマー４１４は、図４に示した帯域通過フィル
タ３０１〜３０３、ＯＲ回路３１３およびタイマー３１
４と同様の機能を有する。さらに、クロック４１７と電
源制御器４１６は、図３のクロック２１２と電源制御器
２１１と同様の機能を有する。(Embodiment 4) FIG. 5 is a diagram for explaining the configuration of a voice detection control device according to Embodiment 4. This voice detection control device includes a plurality of bandpass filters 401 to 40.
3 and a plurality of voice detection circuits 404 to 406, an AD conversion circuit 415, an OR circuit 413, and a timer 414. Rectifiers 407, 411, low-pass filter 40
8, the band-pass filter 410, the amplifier 409, and the comparator 412 have the same functions as the rectifiers 101 and 105, the low-pass filter 102, the band-pass filter 104, the amplifier 103, and the comparator 106 shown in FIG. Further, the band-pass filters 401 to 403 and the OR circuit 413
The timer 414 includes the band-pass filters 301 to 303, the OR circuit 313, and the timer 31 illustrated in FIG.
4 has the same function as that of FIG. Further, the clock 417 and the power controller 416 have the same functions as the clock 212 and the power controller 211 in FIG.

【００７０】ＡＤ変換回路４１５では、各音声検出回路
から出力される各帯域の振幅信号をマルチプレクサ４２
２によって切り替えながらＡＤ変換器４１８によってデ
ジタルデータに変換し、バッファメモリ４１９に格納す
る。格納したデータは、インターフェイス４２０を通し
て外部の音声処理装置へ転送される。このときのＡＤ変
換の速度は、全帯域データを５ｍｓから１０ｍｓの間隔
でＡＤ変換すればよいので、低いクロックで充分であ
る。The AD conversion circuit 415 converts the amplitude signal of each band output from each sound detection circuit into a multiplexer 42.
The data is converted into digital data by the AD converter 418 while being switched by 2 and stored in the buffer memory 419. The stored data is transferred to an external voice processing device through the interface 420. At this time, a low clock is sufficient for the AD conversion speed since the AD conversion of the entire band data may be performed at intervals of 5 ms to 10 ms.

【００７１】ＡＤ変換のためには、サンプリングによる
折り返しひずみ（サンプリング周波数の１／２よりも高
い周波数が入力されることによりデータに歪みが生じ、
精度が悪くなる現象）を避けるため、低域通過フィルタ
が必要である。このため、本実施形態では、図５に示し
た各音声検出検出回路に加えて、低域通過フィルタ４２
１を設けることにより、整流器４０７を通過した振幅信
号の高周波成分を遮断している。遮断周波数は、サンプ
リング定理より、サンプリング周波数の半分以下とする
必要がある。さらに、音声のピッチ周期による影響を除
くため、男性の低い声に相当すると言われる５０Ｈｚ程
度よりも低い周波数で遮断することが望ましい。以上か
ら、５ｍｓサンプリングの場合には５０Ｈｚ程度、１０
ｍｓサンプリングの場合には２５Ｈｚから５０Ｈｚ程度
の遮断周波数で良いと考えられる。For AD conversion, aliasing due to sampling (data is distorted by inputting a frequency higher than 1/2 of the sampling frequency,
A low-pass filter is required in order to avoid the phenomenon that accuracy is deteriorated). For this reason, in the present embodiment, in addition to the sound detection and detection circuits shown in FIG.
By providing 1, the high frequency component of the amplitude signal passing through the rectifier 407 is cut off. According to the sampling theorem, the cutoff frequency needs to be half or less of the sampling frequency. Furthermore, in order to eliminate the influence of the pitch cycle of the voice, it is desirable to cut off at a frequency lower than about 50 Hz, which is said to correspond to a male low voice. From the above, in the case of 5 ms sampling, about 50 Hz, 10
In the case of ms sampling, a cutoff frequency of about 25 Hz to about 50 Hz is considered to be sufficient.

【００７２】また、図５では、各帯域につき、１つの制
御信号を求めているが、帯域毎に図３に示したように増
幅器と比較器とを設けて、ＯＲ回路とタイマーを２個ず
つ設けることにより、２つの制御信号を求めて一方の制
御信号をＡＤ変換回路４１５の制御用に用いてもよい。In FIG. 5, one control signal is obtained for each band. However, as shown in FIG. 3, an amplifier and a comparator are provided for each band, and two OR circuits and two timers are provided. With this arrangement, two control signals may be obtained and one of the control signals may be used for controlling the AD conversion circuit 415.

【００７３】（実施形態５）図６は実施形態５の音声検
出制御装置の構成を説明するための図である。本実施形
態では、図５に示した音声検出制御装置において、図６
に示すように加算器４３２を低域通過フィルタ４０８、
４２１の出力に接続し、低域通過フィルタ４２１の出力
から、より低い遮断周波数を有する低域通過フィルタ４
０８の出力を加算器４３２を用いて減算することによ
り、その帯域の定常振幅値を０とした振幅値を求める。(Embodiment 5) FIG. 6 is a diagram for explaining the configuration of a voice detection control device according to Embodiment 5. In the present embodiment, in the voice detection control device shown in FIG.
The adder 432 is connected to the low-pass filter 408 as shown in FIG.
421, the output of the low-pass filter 421, the low-pass filter 4 having a lower cut-off frequency.
08 is subtracted using the adder 432 to obtain an amplitude value with the steady amplitude value of the band set to 0.

【００７４】このことは、定常的な雑音が存在する場合
についても可能であり、結果的に雑音を減算したのとほ
ぼ同等の効果を得ることができる。動作の安定性のため
には、定域通過フィルタ４０８の出力を若干減衰させて
減算しすぎないようにする方法や、減算した結果が負に
ならないような回路を追加することも効果がある。This is possible even in the case where stationary noise is present, and as a result, an effect almost equivalent to that obtained by subtracting noise can be obtained. To stabilize the operation, it is also effective to slightly attenuate the output of the constant-pass filter 408 so that the output is not excessively subtracted, or to add a circuit so that the result of the subtraction does not become negative.

【００７５】（実施形態６）図７は実施形態６の音声検
出制御装置を構成する部分を説明するための図であり、
入力信号を各帯域に分割して振幅信号に変換する部分の
構成例を、１帯域分だけ示したものである。(Embodiment 6) FIG. 7 is a diagram for explaining the parts constituting the voice detection control device of Embodiment 6.
The configuration example of a portion that divides an input signal into each band and converts it into an amplitude signal is shown for only one band.

【００７６】図７（ａ）に示す帯域振幅検出回路５０１
は、帯域通過フィルタ５０２と整流器５０３からなる。
この帯域通過フィルタ５０２および整流器５０３は、図
４および図５に示した帯域通過フィルタ３０１〜３０
３、４０１〜４０３および整流器３１１、４１１と同様
の機能を有する。The band amplitude detection circuit 501 shown in FIG.
Comprises a bandpass filter 502 and a rectifier 503.
The band-pass filter 502 and the rectifier 503 correspond to the band-pass filters 301 to 30 shown in FIGS.
3, 401 to 403 and rectifiers 311, 411.

【００７７】図７（ｂ）に示す帯域振幅検出回路５０４
は、５０１の回路を別の構成で実現したものである。正
弦波発振器５０５は、位相が９０度ずれた同じ周波数の
正弦波を生成する。乗算器５０６と５０８は、正弦波発
振器５０５からの位相が異なる２種類の正弦波を各々入
力信号に乗算する。乗算器５０７と５０９は、乗算器５
０６と５０８の乗算結果を各々自乗する。加算器５１０
は２つの乗算結果を加算する。低域通過フィルタ５１１
は、加算結果に含まれる低域成分のみを取り出す。以上
の処理により、正弦波発振器５０５の発振周波数を中心
周波数とし、低域通過フィルタ５１１の遮断周波数の２
倍を帯域幅とする帯域通過フィルタを通過した信号の短
時間エネルギーを求めることができる。この出力を、図
４と同様の帯域通過フィルタ（図４の３０８）と低域通
過フィルタ（図４の３１０）に各々通過させ、比較する
ことによって、各帯域の音声検出を行う。The band amplitude detection circuit 504 shown in FIG.
Is an implementation of the circuit 501 in another configuration. The sine wave oscillator 505 generates a sine wave having the same frequency and a phase difference of 90 degrees. Multipliers 506 and 508 respectively multiply the input signal by two types of sine waves having different phases from sine wave oscillator 505. The multipliers 507 and 509 are connected to the multiplier 5
The result of multiplication of 06 and 508 is squared. Adder 510
Adds two multiplication results. Low-pass filter 511
Extracts only low-frequency components included in the addition result. With the above processing, the oscillation frequency of the sine wave oscillator 505 is set as the center frequency, and the cutoff frequency of the low-pass filter 511 is set to 2
The short-term energy of a signal that has passed through a band-pass filter whose bandwidth is twice as long can be obtained. This output is passed through a band-pass filter (308 in FIG. 4) and a low-pass filter (310 in FIG. 4) similar to those shown in FIG. 4, and compared, thereby performing voice detection in each band.

【００７８】なお、図７（ａ）に示した帯域振幅検出回
路５０１では、各帯域のエネルギーではなく、振幅が求
められる。各帯域のエネルギーを求めるためには、図７
（ｃ）に示す帯域振幅検出回路５１２のように、帯域通
過フィルタ５１３および整流器５１４によって変換され
た帯域毎の振幅信号を、乗算器５１５によって自乗する
ことにより、可能である。音声検出のためには、信号の
振幅でも、エネルギーでも、同様に利用することができ
る。In the band amplitude detection circuit 501 shown in FIG. 7A, not the energy of each band but the amplitude is obtained. To find the energy for each band, see FIG.
As in the band amplitude detection circuit 512 shown in (c), the amplitude signal for each band converted by the band-pass filter 513 and the rectifier 514 can be squared by the multiplier 515. For voice detection, either the amplitude or the energy of the signal can be used as well.

【００７９】（実施形態７）図８は実施形態７の音声検
出制御装置の構成を説明するための図である。本実施形
態では、入力信号を整流器６０１によって整流して振幅
信号とした後、対数増幅器６０２によって対数化して、
帯域通過フィルタ６０３によって時間的に変動する成分
だけを取り出し、整流器６０４によって増加と減少を正
の値に変換する。この値を、比較器６０５によって固定
の閾値と比較して音声を検出し、タイマー６０６によっ
て制御信号を生成する。(Embodiment 7) FIG. 8 is a diagram for explaining the configuration of a voice detection control device according to Embodiment 7. In this embodiment, after the input signal is rectified by the rectifier 601 to be an amplitude signal, the input signal is logarithmized by the logarithmic amplifier 602,
The band-pass filter 603 extracts only the component that varies with time, and the rectifier 604 converts the increase and decrease into positive values. This value is compared with a fixed threshold value by the comparator 605 to detect sound, and the timer 606 generates a control signal.

【００８０】これまでの実施形態１〜実施形態６では、
入力のレンジに応じた音声検出を行うため、低域通過フ
ィルタによって入力信号の平均振幅から閾値を計算して
いたのに対して、本実施形態では、対数増幅器６０２を
用いることにより、固定閾値によって様々な入力のレン
ジに対応することが可能である。本実施形態でも、整流
器６０４は省略可能である。さらに、対数増幅器６０２
の増幅率を変化することも可能であるが、装置の設計段
階で固定してしまうことも可能である。In the first to sixth embodiments described above,
In order to perform voice detection according to the input range, the threshold is calculated from the average amplitude of the input signal by a low-pass filter. In the present embodiment, however, by using the logarithmic amplifier 602, a fixed threshold is used. It is possible to support various input ranges. Also in the present embodiment, the rectifier 604 can be omitted. Further, logarithmic amplifier 602
Can be changed, but it is also possible to fix it at the stage of designing the device.

【００８１】（実施形態８）図９は実施形態８の音声検
出制御装置の構成を説明するための図である。本実施形
態では、上述した実施形態１〜実施形態７の音声検出処
理をデジタル処理によって行う。(Eighth Embodiment) FIG. 9 is a diagram for explaining the configuration of a voice detection control device according to an eighth embodiment. In the present embodiment, the voice detection processing of the above-described first to seventh embodiments is performed by digital processing.

【００８２】入力信号をＡＤ変換器８０１によってデジ
タル信号に変換した後、プロセッサ８０２を用いて、整
流、低域通過フィルタの処理、帯域通過フィルタの処
理、比較器による加算と減算、および増幅器による乗算
を行い、制御信号を出力する。プロセッサ８０２として
は、デジタル信号処理プロセッサ（ＤＳＰ）、汎用のマ
イクロコンピュータ（ＣＰＵ）または論理回路を用いる
ことができる。音声データのサンプリング周波数は１０
ｋＨｚから２０ｋＨｚ程度であり、かつ、処理が比較的
単純であるため、低速のプロセッサを用いて消費電力を
少なくすることができる。After the input signal is converted into a digital signal by the AD converter 801, rectification, low-pass filter processing, band-pass filter processing, addition and subtraction by a comparator, and multiplication by an amplifier are performed using a processor 802. And outputs a control signal. As the processor 802, a digital signal processor (DSP), a general-purpose microcomputer (CPU), or a logic circuit can be used. The sampling frequency of audio data is 10
Since the frequency is about kHz to 20 kHz and the processing is relatively simple, it is possible to reduce power consumption by using a low-speed processor.

【００８３】さらに、ＡＤ変換した結果を有効利用する
ために、ＲＡＭ８０３を接続し、音声が検出されるまで
のデータをリングバッファ形式で格納し、必要に応じて
後続の音声処理装置と通信することにより、データの欠
落も防ぐことができる。音声を格納するためのバッファ
容量は１秒以下でもよく、プロセッサの内部メモリを利
用することも可能である。Further, in order to effectively use the result of the A / D conversion, a RAM 803 is connected, data until a voice is detected is stored in a ring buffer format, and communication with a subsequent voice processing device is performed as necessary. Thus, data loss can be prevented. The buffer capacity for storing audio may be 1 second or less, and the internal memory of the processor may be used.

【００８４】[0084]

【発明の効果】以上詳述したように、本発明によれば、
アナログ処理や比較的単純な論理回路、または低速のプ
ロセッサを用いて、環境に適応的な音声検出と、その音
声検出による外部機器の制御が可能となる。従って、音
声認識装置に代表される比較的消費電力の大きな音声処
理装置の動作・停止を制御することにより、低消費電力
でありながら、常時音声を監視して処理を行うセキュリ
ティー装置や音声リモコン等を実現することが可能とな
る。As described in detail above, according to the present invention,
Using analog processing, a relatively simple logic circuit, or a low-speed processor, it is possible to perform sound detection adaptive to the environment and to control an external device by the sound detection. Therefore, by controlling the operation / stop of a relatively high power consumption voice processing device represented by a voice recognition device, a security device or a voice remote control that constantly monitors and processes voice while using low power consumption. Can be realized.

【００８５】本発明は環境騒音に対する適応範囲が広
く、非常に静かな環境から雑音の多き環境まで、人間に
よる調整が不要である。また、音声が発声されてから、
音声を検出して処理が開始されるまでの時間にも、音声
データをバッファメモリに蓄えることができるので、デ
ータが欠落しにくく、欠落による誤動作を防ぐことがで
きる。さらに、フィルタバンクからの出力をＡＤ変換す
ることにより、外部の音声処理装置における処理を軽減
することができ、消費電力を低減することが可能であ
る。The present invention has a wide range of adaptation to environmental noise, and does not require adjustment by a person from a very quiet environment to a noisy environment. Also, after the voice is spoken,
Since the audio data can be stored in the buffer memory even during the time from when the audio is detected until the processing is started, the data is less likely to be lost, and malfunction due to the loss can be prevented. Furthermore, by performing AD conversion on the output from the filter bank, processing in an external audio processing device can be reduced, and power consumption can be reduced.

[Brief description of the drawings]

【図１】実施形態１の音声検出制御装置の構成を説明す
るための図である。FIG. 1 is a diagram for describing a configuration of a voice detection control device according to a first embodiment.

【図２】実施形態１の音声検出制御装置における内部処
理波形の例を示す図である。FIG. 2 is a diagram illustrating an example of an internal processing waveform in the voice detection control device according to the first embodiment.

【図３】実施形態２の音声検出制御装置の構成を説明す
るための図である。FIG. 3 is a diagram illustrating a configuration of a voice detection control device according to a second embodiment.

【図４】実施形態３の音声検出制御装置の構成を説明す
るための図である。FIG. 4 is a diagram illustrating a configuration of a voice detection control device according to a third embodiment.

【図５】実施形態４の音声検出制御装置の構成を説明す
るための図である。FIG. 5 is a diagram illustrating a configuration of a voice detection control device according to a fourth embodiment.

【図６】実施形態５の音声検出制御装置の構成を説明す
るための図である。FIG. 6 is a diagram illustrating a configuration of a voice detection control device according to a fifth embodiment.

【図７】（ａ）〜（ｃ）は、実施形態６の音声検出制御
装置における、入力信号を各帯域に分割して振幅信号に
変換する部分の構成例を示す図である。FIGS. 7A to 7C are diagrams illustrating a configuration example of a part of an audio detection control device according to a sixth embodiment, which divides an input signal into respective bands and converts the divided signals into amplitude signals.

【図８】実施形態７の音声検出制御装置の構成を説明す
るための図である。FIG. 8 is a diagram illustrating a configuration of a voice detection control device according to a seventh embodiment.

【図９】実施形態８の音声検出制御装置の構成を説明す
るための図である。FIG. 9 is a diagram illustrating a configuration of a voice detection control device according to an eighth embodiment.

[Explanation of symbols]

１０１整流器１０２低域通過フィルタ１０３増幅器１０４帯域通過フィルタ１０５整流器１０６比較器１０７タイマー２０１整流器２０２低域通過フィルタ２０３増幅器２０４帯域通過フィルタ２０５整流器２０６比較器２０７タイマー２０８増幅器２０９比較器２１０タイマー２１１電源制御器２１２クロック２１３ＡＤ変換器２１４バッファメモリ２１５インターフェイス２１６音声検出回路２１７ＡＤ変換回路３０１帯域通過フィルタ３０２帯域通過フィルタ３０３帯域通過フィルタ３０４音声検出回路３０５音声検出回路３０６音声検出回路３０７整流器３０８低域通過フィルタ３０９増幅器３１０帯域通過フィルタ３１１整流器３１２比較器３１３ＯＲ回路３１４タイマー４０１帯域通過フィルタ４０２帯域通過フィルタ４０３帯域通過フィルタ４０４音声検出回路４０５音声検出過炉４０６音声検出回路４０７整流器４０８低域通過フィルタ４０９増幅器４１０帯域通過フィルタ４１１整流器４１２比較器４１３ＯＲ回路４１４タイマー４１５ＡＤ変換回路４１６電源制御器４１７クロック４１８ＡＤ変換器４１９バッファメモリ４２０インターフェイス４２１低域通過フィルタ４２２マルチプレクサ４２３加算器５０１帯域振幅検出回路５０２帯域通過フィルタ５０３整流器５０４帯域振幅検出回路５０５正弦波発振器５０６乗算器５０７乗算器５０８乗算器５０９乗算器５１０加算器５１１低域通過フィルタ５１２帯域振幅検出回路５１３帯域通過フィルタ５１４整流器５１５乗算器６０１整流器６０２対数増幅器６０３帯域通過フィルタ６０４整流器６０５比較器６０６タイマー８０１ＡＤ変換器８０２ＣＰＵ８０３ＲＡＭ DESCRIPTION OF SYMBOLS 101 Rectifier 102 Low-pass filter 103 Amplifier 104 Band-pass filter 105 Rectifier 106 Comparator 107 Timer 201 Rectifier 202 Low-pass filter 203 Amplifier 204 Band-pass filter 205 Rectifier 206 Comparator 207 Timer 208 Amplifier 209 Comparator 210 Timer 211 Power supply control 212 clock 213 AD converter 214 buffer memory 215 interface 216 audio detection circuit 217 AD conversion circuit 301 band-pass filter 302 band-pass filter 303 band-pass filter 304 audio detection circuit 305 audio detection circuit 306 audio detection circuit 307 rectifier 308 low-pass Filter 309 Amplifier 310 Band-pass filter 311 Rectifier 312 Comparator 313 OR circuit 314 Timer 4 01 band pass filter 402 band pass filter 403 band pass filter 404 voice detection circuit 405 voice detection furnace 406 voice detection circuit 407 rectifier 408 low pass filter 409 amplifier 410 band pass filter 411 rectifier 412 comparator 413 OR circuit 414 timer 415 AD Conversion circuit 416 power controller 417 clock 418 AD converter 419 buffer memory 420 interface 421 low-pass filter 422 multiplexer 423 adder 501 band-amplitude detection circuit 502 band-pass filter 503 rectifier 504 band-amplitude detection circuit 505 sine-wave oscillator 506 multiplier 507 Multiplier 508 Multiplier 509 Multiplier 510 Adder 511 Low-pass filter 512 Band-amplitude detection circuit 513 Band-pass filter 514 rectifier 515 multiplier 601 rectifier 602 logarithmic amplifier 603 band pass filter 604 rectifier 605 comparator 606 timer 801 AD converter 802 CPU 803 RAM

Claims

[Claims]

1. A rectifier for rectifying an input signal to convert it into an amplitude signal, and two kinds of different characteristics having different characteristics for individually processing the amplitude signal from the rectifier and obtaining an amplitude change amount and a threshold value. By comparing the output from each of the filter means and the filter means, when the variation in the amplitude exceeds a threshold value, it is determined that there is a possibility that the input signal contains sound, and the sound detection signal is detected. An audio detection control device comprising: a comparing unit that outputs a signal; and a timer unit that outputs a control signal for a predetermined time when an audio detection signal is output from the comparing unit.

2. A rectifier for rectifying an input signal and converting the input signal into an amplitude signal, a logarithmic amplifier for logarithmically converting the input signal converted to an amplitude signal by the rectifier, and processing an output from the logarithmic amplifier. A band-pass filter means for performing the operation, and when the output from the band-pass filter means exceeds a predetermined threshold value, determine that there is a possibility that sound is included in the input signal and output a sound detection signal. An audio detection control device comprising: a comparison unit; and a timer unit that outputs a control signal for a predetermined time when an audio detection signal is output from the comparison unit.

A second rectifier for re-rectifying the amplitude signal processed by the band-pass filter, wherein an output from the band-pass filter is input to the comparator via the second rectifier. The voice detection control device according to claim 1 or 2, wherein:

4. An amplifying means for amplifying an amplitude signal processed by the low-pass filter means, wherein an amplification factor of the amplifying means can be adjusted from the outside, and an output from the low-pass filter means is output from the amplifying means. The voice detection control device according to claim 1, wherein the voice detection control device is input to the comparison unit via a control unit.

5. An analog-to-digital converter for converting the input signal into digital data, and a buffer memory, wherein the input signal before voice detection is stored as digital data in the buffer memory. The voice detection control device according to the above.

6. A plurality of amplifying means for amplifying an output from said low-pass filter means, a plurality of comparing means, and a plurality of timer means, and a plurality of control signals are set by setting amplification factors of the respective amplifying means to different values. 6. The voice detection control device according to claim 5, wherein a power supply and a clock of the analog-to-digital converter and the buffer memory are controlled by one of them.

7. A plurality of second band-pass filter means for separating an input signal for each frequency band, and a plurality of second band-pass filter means for converting each band signal separated by each second band-pass filter means into an amplitude signal. And a plurality of sets of different characteristics, each of which separately processes an amplitude signal from the rectifier and obtains an amplitude change amount and a threshold.
The type of filter means and the output from each filter means are compared for each frequency band, and when the amount of change in amplitude exceeds a threshold value, there is a possibility that voice may be included in the input signal. A plurality of comparing means for judging and outputting a sound detection signal; a total means for summing the sound detection signals from the respective comparing means; and the timer for outputting a control signal for a predetermined time based on an output from the synthesizing means. The voice detection control device according to claim 1, further comprising:

8. A plurality of second band-pass filter means for separating an input signal for each frequency band, and a plurality of band signals for converting each band signal separated by each second band-pass filter means into an amplitude signal. A plurality of logarithmic amplifiers that logarithmically convert an amplitude signal for each frequency band from the rectifiers, and a plurality of bandpass filters that process outputs from the logarithmic amplifiers for each frequency band. Means for outputting a voice detection signal by determining that voice may be included in an input signal of the frequency band when an output from the band-pass filter means exceeds a predetermined threshold. Said comparing means, comprehensive means for integrating the voice detection signals from each of the comparing means, and the timer means for outputting a control signal for a predetermined time based on the output from the comprehensive means. The voice detection control device according to claim 2.

9. A second low-pass filter for processing an amplitude signal for each frequency band converted by the second band-pass filter and the rectifier, and 9. The voice detection according to claim 7, further comprising: a multiplexer and an analog-to-digital converter for converting an output of the digital signal into digital data; and a buffer memory, wherein an amplitude signal for each frequency band is stored as digital data in the buffer memory. Control device.

10. An amplitude signal for each frequency band converted by said second band-pass filter means and said rectifying means, averaged by said low-pass filter means, is subtracted from said amplitude signal for each frequency band. The voice detection control device according to claim 9, wherein the digital data is converted into digital data after reducing the influence of noise.

11. A plurality of amplitude modulation means for amplitude-modulating an input signal at a plurality of different frequencies, and a plurality of third low-pass filter means each processing output from each of the amplitude modulation means and having predetermined characteristics. With
Further, the amplitude signal for each frequency band converted by the amplitude modulation means and the third low-pass filter means is individually processed, and a plurality of sets of two different characteristic sets for obtaining an amplitude change amount and a threshold value, respectively. The type of filter means and the output from each filter means are compared for each frequency band, and when the amount of change in amplitude exceeds a threshold value, there is a possibility that voice may be included in the input signal. A plurality of comparing means for judging and outputting a sound detection signal; a total means for summing the sound detection signals from the respective comparing means; and a timer means for outputting a control signal for a predetermined time based on an output from the summing means. A voice detection control device comprising:

12. A plurality of amplitude modulating means for modulating the amplitude of an input signal at a plurality of different frequencies, and a plurality of third low-pass filter means each processing output from each of the amplitude modulating means and having predetermined characteristics. With
A plurality of logarithmic amplifiers for logarithmically converting the amplitude signal of each frequency band from the rectifier; and a logarithmic conversion of the amplitude signal for each frequency band converted by the amplitude modulator and the third low-pass filter. A plurality of logarithmic amplifying means for converting, a plurality of bandpass filter means for processing an output from the logarithmic amplifying means for each frequency band, and when an output from the bandpass filter means exceeds a predetermined threshold, A plurality of comparing means for judging that there is a possibility that sound is included in the input signal of the frequency band and outputting a sound detection signal; a total means for summing the sound detection signals from the respective comparing means; A voice detection control device comprising: timer means for outputting a control signal for a predetermined time according to an output from the synthesis means.

13. The voice detection control device according to claim 1, further comprising an analog-to-digital converter and a digital processor, wherein the input signal is digitally processed by software.