JP2897628B2 - Voice detector

Info

Publication number: JP2897628B2
Application number: JP5328158A
Authority: JP
Inventors: 幸正杉野
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1993-12-24
Filing date: 1993-12-24
Publication date: 1999-05-31
Anticipated expiration: 2014-05-31
Also published as: JPH07181991A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a voice detector that adapts its threshold correctly even when the voice signal is at a low level, and thereby maintains good voice detection characteristics.

[0002]

2. Description of the Related Art

A conventional voice detector, disclosed for example in Japanese Patent Application Laid-Open No. 3-141740, is configured as shown in FIG. 11. The low-level detection unit 1 compares the low-level portions of the voice input signal 101 (consonant portions of speech and the like) with the adaptive threshold 110 from the threshold setting unit 9, decides between speech and silence, and outputs the decision as the low-level speech/silence decision result 102. The high-level detection unit 2 compares the high-level portions of the voice input signal 101 (vowel portions of speech and the like) with a predetermined threshold 111 fixed in advance, decides between speech and silence, and outputs the decision as the high-level speech/silence decision result 103. The OR operator 3 takes the logical OR of the low-level decision result 102 from the low-level detection unit 1 and the high-level decision result 103 from the high-level detection unit 2, and outputs it as the speech/silence decision result 104. The hangover adding unit 4 applies hangover processing to the decision result 104 from the OR operator 3 (that is, it holds the speech decision for a predetermined time after a transition from speech to silence) and outputs the voice detection output signal 105.

The noise level calculation unit 8a takes the mean absolute value of the samples of the voice input signal 101 within a predetermined block interval. It uses the comparison result 124 between the voice input signal 101 and the predetermined threshold 111, supplied by the high-level detection unit 2, as follows. If the voice input signal 101 never exceeds the predetermined threshold 111 within the block interval, the background noise level 109 is updated. If the signal exceeds the threshold at least once, the background noise level is not updated and the previously calculated value is kept. The threshold setting unit 9 sets the threshold 110 adaptively according to the background noise level 109 from the noise level calculation unit 8a.
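The update rule of the conventional noise level calculation unit 8a can be sketched as follows. This is a minimal illustration, not the circuit of the cited publication: the function name, the use of absolute sample values in the threshold comparison, and the handling of an empty block are assumptions made for the example.

```python
def update_noise_level_conventional(block, fixed_threshold, previous_level):
    """Return the background noise level after one block (sketch of unit 8a).

    block           -- samples of the voice input signal in one block interval
    fixed_threshold -- the predetermined threshold 111
    previous_level  -- the background noise level 109 before this block
    """
    if not block:
        # Assumption: an empty block leaves the estimate unchanged.
        return previous_level
    # Comparison result 124: did any sample exceed the fixed threshold?
    # (Comparing absolute values is an assumption of this sketch.)
    exceeded = any(abs(s) > fixed_threshold for s in block)
    if exceeded:
        # At least one excursion: keep the previously calculated level.
        return previous_level
    # No excursion: update with the mean absolute value of the block.
    return sum(abs(s) for s in block) / len(block)
```

Note that a quiet utterance whose samples all stay below `fixed_threshold` passes through the last branch and is averaged into the noise estimate, which is the weakness the description goes on to analyze.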

[0003] The conventional voice detector described above uses an adaptive-threshold voice detection scheme that can track even abrupt changes in the background noise level.

[0004] As shown in FIG. 12, the low-level detection unit 1 first uses the comparator 11 to compare the low-level portions of the voice input signal 101 with the adaptive threshold 110 from the threshold setting unit 9. Next, based on the comparison result 125 between the voice input signal 101 and the adaptive threshold 110, the first determiner 12 decides between speech and silence for each block of a predetermined duration. The second determiner 13 then declares speech only when a predetermined number of consecutive blocks have been judged as speech, and outputs the low-level speech/silence decision result 102.
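The two-stage decision of the low-level detection unit can be illustrated with the sketch below. The text does not state the rule the first determiner 12 applies inside a block, so the rule used here (a block counts as speech if any sample magnitude exceeds the threshold) is an assumption, as are the function names and the choice to drop a trailing partial block.

```python
def block_decisions(samples, threshold, block_len):
    """First determiner (sketch): one speech/silence decision per block.

    Assumption: a block is 'speech' if any sample magnitude exceeds the
    threshold. A trailing partial block is ignored.
    """
    decisions = []
    for start in range(0, len(samples) - block_len + 1, block_len):
        block = samples[start:start + block_len]
        decisions.append(any(abs(s) > threshold for s in block))
    return decisions


def require_consecutive(decisions, min_run):
    """Second determiner (sketch): speech only after min_run speech blocks in a row."""
    out = []
    run = 0
    for is_speech in decisions:
        run = run + 1 if is_speech else 0  # length of the current speech run
        out.append(run >= min_run)
    return out
```

For example, `require_consecutive([True, False, True, True, True, False], 2)` suppresses the isolated first block and reports speech only from the second block of the later run onward.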

[0005] As shown in FIG. 13, the high-level detection unit 2 first uses the comparator 21 to compare the high-level portions of the voice input signal 101 with the predetermined threshold 111, which is fixed in advance at a value higher than the adaptive threshold 110 (normally above the maximum background noise level that can be expected). Next, based on the comparison result 124 between the voice input signal 101 and the predetermined threshold 111, the first determiner 22 decides between speech and silence for each block of a predetermined duration. The second determiner 23 then declares speech only when a predetermined number of consecutive blocks have been judged as speech, and outputs the high-level speech/silence decision result 103. The comparator 21 also outputs the comparison result 124 between the voice input signal 101 and the predetermined threshold 111.

[0006] When the average level of the voice input signal 101 is extremely low, as in FIG. 14(a), the comparison result 124 between the voice input signal 101 and the predetermined threshold 111 becomes as shown in FIG. 14(c). The high-level detection unit 2 therefore judges even speech intervals as silence. In addition, the noise level calculation unit 8a keeps calculating the background noise level 109 during speech intervals as well as during silence, so the background noise level 109 becomes higher than the desired value, as shown in FIG. 14(b). The threshold setting unit 9 raises the adaptive threshold 110 as the calculated background noise level rises. The comparison result 125 between the voice input signal 101 and the adaptive threshold 110 then becomes as shown in FIG. 14(d), so the low-level detection unit 1 judges low-level speech intervals (low-level portions such as the beginning and end of an utterance) as silence. The OR operator 3 likewise outputs the speech/silence decision result 104 shown in FIG. 14(e).

[0007]

PROBLEMS TO BE SOLVED BY THE INVENTION

The conventional voice detector described above uses an adaptive-threshold voice detection scheme that can track even abrupt changes in the background noise level. When the voice signal level is extremely low, however, low-level speech intervals are also included in the background noise level calculation interval, which raises the adaptive threshold. As a result, low-level speech intervals are judged as silence.

[0008] The object of the present invention is to provide a voice detector that adapts its threshold correctly even when the voice signal is at a low level, and thereby maintains good voice detection characteristics.

[0009]

MEANS FOR SOLVING THE PROBLEMS

The voice detector according to claim 1 of the present invention comprises a low-level detector, a high-level detector, an OR operation unit, a hangover adding unit, a signal strength calculation unit, a smoothing unit, a comparator, a noise level calculation unit, and a threshold setting unit. The low-level detector compares the voice input signal with the adaptive threshold from the threshold setting unit and outputs a low-level speech/silence decision result. The high-level detector compares the voice input signal with a predetermined threshold, or with the adaptive threshold from the threshold setting unit, and outputs a high-level speech/silence decision result. The OR operation unit takes the logical OR of the low-level and high-level decision results and outputs a speech/silence decision result. The hangover adding unit applies hangover processing to the speech/silence decision result to produce the voice detection output signal. The signal strength calculation unit calculates the signal strength of the voice input signal for each predetermined interval and outputs it as the signal strength output. The smoothing unit smooths the signal strength output from the signal strength calculation unit. The comparator compares the signal strength output with the smoothed output and outputs the comparison result. The noise level calculation unit, depending on the comparison result of the comparator, either takes the arithmetic mean of the signal strength output and outputs it as the background noise level, or outputs the previously output value as the background noise level. The threshold setting unit calculates and outputs an adaptive threshold adapted to the background noise level.

[0010] In the voice detector according to claim 2 of the present invention, the smoothing unit comprises a smoothing-unit comparator and an up/down counter. The smoothing-unit comparator compares the signal strength output with the output of the up/down counter and outputs the comparison result. Based on that comparison result, the up/down counter decrements its current value when the signal strength output is smaller than the smoothed output and increments it when the signal strength output is larger, and the counter value is used as the smoothed output.

[0011] In the voice detector according to claim 3 of the present invention, the smoothing unit comprises a first smoothing-unit comparator, an up/down counter, a second smoothing-unit comparator, and a selector. The first smoothing-unit comparator compares the signal strength output with the output of the up/down counter and outputs the comparison result. Based on that comparison result, the up/down counter decrements its current value when the signal strength output is smaller than the smoothed output and increments it when the signal strength output is larger, and outputs the counter value. The second smoothing-unit comparator compares the output of the up/down counter with a constant A and outputs the comparison result. The selector, according to the comparison result of the second smoothing-unit comparator, uses either the output of the up/down counter or the constant A as the smoothed output.

[0012] In the voice detector according to claim 4 of the present invention, the smoothing unit comprises a first multiplier, a second multiplier, a first smoothing-unit comparator, a second smoothing-unit comparator, a third smoothing-unit comparator, an OR operator, and an up/down counter. The first multiplier multiplies a constant α by the output of the up/down counter to calculate an upper limit, and the second multiplier multiplies a constant β by the output of the up/down counter to calculate a lower limit. The first smoothing-unit comparator compares the signal strength output with the output of the up/down counter, the second smoothing-unit comparator compares the signal strength output with the output of the first multiplier, and the third smoothing-unit comparator compares the signal strength output with the output of the second multiplier; each outputs its comparison result. The OR operator takes the logical OR of the comparison results of the second and third smoothing-unit comparators and outputs, as a decision result, whether the signal strength lies within the predetermined range. When the decision result of the OR operator indicates that the signal strength is within the range, the up/down counter holds its current value. When the signal strength is outside the range, the up/down counter follows the comparison result of the first smoothing-unit comparator: it decrements its current value when the signal strength output is smaller than the counter output and increments it when the signal strength output is larger. The counter value is used as the smoothed output.

[0013] In the voice detector according to claim 5 of the present invention, the smoothing unit consists of a low-pass filter that forms a moving-average filter, and calculates and outputs the smoothed output from the signal strength output.

[0014] In the voice detector according to claim 6 of the present invention, the comparator keeps outputting the immediately preceding comparison result for a fixed time even when its comparison result changes.

[0015]

OPERATION

The voice detector of the present invention adapts its threshold correctly even when the voice signal is at a low level, and maintains good voice detection characteristics.

[0016]

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

In a voice detector according to one embodiment of the present invention, shown in FIG. 1, the low-level detection unit 1, the high-level detection unit 2, the OR operator 3, the hangover adding unit 4, and the threshold setting unit 9 correspond to those in FIG. 11 of the conventional example. The signal strength calculation unit 5 takes the arithmetic mean of the squared values of a predetermined number N of samples of the voice input signal 101 within each block interval of the signal strength calculation period t, and outputs it as the signal strength output 106. The smoothing unit 6 compares the signal strength output 106 from the signal strength calculation unit 5 with the current counter value at every signal strength calculation period t, increments or decrements the counter value according to the result, and outputs it as the smoothed output 107. The comparator 7 compares the signal strength output 106 from the signal strength calculation unit 5 with the smoothed output 107 from the smoothing unit 6, and outputs 1 (on) when the signal strength output 106 is larger than the smoothed output 107 and 0 (off) when it is smaller, as the comparison result 108. While the comparison result 108 from the comparator 7 is 0, the noise level calculation unit 8 takes the arithmetic mean of the signal strength output 106 from the signal strength calculation unit 5 and updates the background noise level 109 with it. While the comparison result is 1, it does not update and keeps the previously calculated background noise level.
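The gating performed by the comparator 7 and the noise level calculation unit 8 might look like the sketch below. The description says only that an arithmetic mean of the signal strength output is taken while the comparison result is 0; the averaging window (the most recent `window` accepted values), the class and parameter names, and the treatment of equality as 0 are all assumptions of this example.

```python
from collections import deque


class GatedNoiseLevel:
    """Sketch of comparator 7 plus noise level calculation unit 8."""

    def __init__(self, window=4, initial=0.0):
        # Assumption: average over the last `window` values accepted as noise.
        self._recent = deque(maxlen=window)
        self.level = initial  # background noise level 109

    def update(self, strength, smoothed):
        """Feed one signal strength output 106 and smoothed output 107."""
        # Comparison result 108: 1 when the strength is above the smoothed output.
        # Equality is treated as 0 here, which the text leaves unspecified.
        comparison = 1 if strength > smoothed else 0
        if comparison == 0:
            # Roughly judged silent: include this block in the average.
            self._recent.append(strength)
            self.level = sum(self._recent) / len(self._recent)
        # Otherwise keep the previously calculated level.
        return self.level
```

Unlike the conventional unit, the gate here depends on a reference that follows the signal itself rather than on a fixed threshold, so it can still separate speech from noise when both are weak.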

[0017] The voice detector of the above embodiment uses a threshold-adaptive voice detection scheme that adapts its threshold correctly even when the voice signal is at a low level, and maintains good voice detection characteristics.

[0018] As shown in FIG. 2, the signal strength calculation unit 5 first uses the squarer 51 to calculate the square of the voice input signal 101 at every sampling period. Next, for the predetermined number N of samples in each block interval of the signal strength calculation period t, the accumulator 52 adds each squared value to its own current value. The output of the accumulator 52 is reset to 0 at initialization (immediately after power-on) and immediately after its output has been latched. Finally, the multiplier 54 multiplies the output of the latch 53, which holds the accumulated sum, by the constant 1/N to obtain the arithmetic mean of the squared values, which becomes the signal strength output 106.
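The square, accumulate, latch, and scale sequence described above amounts to a per-block mean square. A minimal sketch follows; the function name is hypothetical, and dropping a trailing partial block is an assumption, since the text only describes complete blocks of N samples.

```python
def signal_strength_blocks(samples, n):
    """Return one mean-square value per complete block of n samples."""
    outputs = []
    acc = 0      # accumulator 52, reset to 0 at start-up
    count = 0
    for s in samples:
        acc += s * s          # squarer 51 feeding accumulator 52
        count += 1
        if count == n:
            # Latch 53 captures the sum; multiplier 54 scales it by 1/N.
            outputs.append(acc / n)
            acc = 0           # accumulator is reset right after the latch
            count = 0
    return outputs
```

Each returned value corresponds to one signal strength output 106, produced once per calculation period t.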

[0019] As shown in FIG. 3, at every signal strength calculation period t the smoothing unit 6 first uses the comparator 61 to compare the signal strength output 106 from the signal strength calculation unit 5 with the current value of the up/down counter 62, and outputs 1 (on) when the signal strength output 106 is larger and 0 (off) when it is smaller, as the comparison result 113. The up/down counter 62 then adds 1 to its current value while the comparison result 113 from the comparator 61 is 1 and subtracts 1 while it is 0, and the counter value is supplied to the comparator 7 as the smoothed output 107.
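The up/down counter acts as a slew-rate-limited tracker: it moves toward the signal strength by one count per period, regardless of how far away it is. A minimal sketch is shown below. The function name and the `step` parameter are hypothetical, and holding the counter when the two values are equal is an assumption (the text defines the comparator output only for "larger" and "smaller").

```python
def smooth_up_down(strengths, initial=0, step=1):
    """Track a sequence of signal strength values with an up/down counter."""
    counter = initial
    smoothed = []
    for p in strengths:
        if p > counter:
            counter += step   # comparison result 113 = 1: count up
        elif p < counter:
            counter -= step   # comparison result 113 = 0: count down
        # Assumption: when p equals the counter, the value is held.
        smoothed.append(counter)
    return smoothed
```

For instance, `smooth_up_down([5, 5, 5, 0, 0], initial=3)` climbs to 5, stays there, and then steps back down by one per period when the input drops.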

[0020] Consider the case in which the average level of the voice input signal 101 is extremely low, as in FIG. 4(a). When the voice input signal 101 changes from silence to speech, as in FIG. 4(b), the signal strength output 106 of the signal strength calculation unit 5 increases and becomes larger than the smoothed output 107. The smoothing unit 6 sets the comparison result 113 between the signal strength output 106 and the current counter value 107 to 1, as in FIG. 4(c), and keeps increasing the smoothed output 107 by 1 until it equals the signal strength output 106. When the state changes from speech to silence, the signal strength output 106 decreases and becomes smaller than the smoothed output 107. The comparison result 113 between the signal strength output 106 and the current counter value 107 becomes 0, as in FIG. 4(c), and the smoothed output 107 keeps decreasing by 1 until it equals the signal strength output 106.

If the signal strength calculation period t (the update period of the smoothed output 107) is set sufficiently long, the smoothed output 107 changes much more slowly than the voice signal strength does across speech/silence transitions, so the smoothed output 107 can be kept at a value between the signal strength during speech and that during silence. The comparator 7 compares the signal strength output 106 with the smoothed output 107 and roughly decides whether the voice input signal 101 is in the speech state or the silence state. The noise level calculation unit 8 calculates the background noise level 109 while the comparison result from the comparator 7 between the signal strength output 106 and the smoothed output 107 is 0 (the intervals in which the voice input signal 101 is judged to be silent). Speech intervals are therefore rarely included in the noise level calculation interval, and the error in the calculated background noise level can be kept small even when the voice signal level is low, as in FIG. 4(b).

Because the threshold setting unit 9 then sets the threshold 110 adaptively from the calculated noise level, the adaptive threshold 110 becomes, as in FIG. 4(a), low enough to judge the low-level beginning and end of an utterance as speech. The low-level speech/silence decision result 102 therefore marks the speech intervals correctly as speech, as in FIG. 4(e). The logical OR of the low-level decision result 102 and the high-level decision result 103 (see FIG. 4(d)) likewise yields a speech/silence decision result 104 in which the voice signal is detected correctly, as in FIG. 4(f).
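The effect described in this paragraph can be reproduced with a small simulation. The sketch below is illustrative only: the numbers are invented, the function names are hypothetical, and it assumes that the comparator 7 sees the counter value after the update for the current period, which the text does not specify.

```python
def rough_speech_flags(strengths, initial):
    """Return (flags, trace): comparator 7 outputs and the smoothed output."""
    counter = initial
    flags = []
    trace = []
    for p in strengths:
        # Up/down counter of the smoothing unit: one count per period.
        if p > counter:
            counter += 1
        elif p < counter:
            counter -= 1
        trace.append(counter)
        # Comparator 7 (assumed to use the updated counter value).
        flags.append(1 if p > counter else 0)
    return flags, trace


def gated_mean(strengths, flags):
    """Mean of the strength values in periods flagged 0 (judged silent)."""
    quiet = [p for p, f in zip(strengths, flags) if f == 0]
    return sum(quiet) / len(quiet) if quiet else None


# Invented example: weak noise (strength 2) around a weak utterance (strength 20).
strengths = [2] * 5 + [20] * 5 + [2] * 5
flags, trace = rough_speech_flags(strengths, initial=10)
noise_gated = gated_mean(strengths, flags)        # uses silent periods only
noise_ungated = sum(strengths) / len(strengths)   # what averaging everything gives
```

In this run the smoothed output stays between the two strength levels, the five speech periods are flagged 1, and the gated estimate equals the true noise strength of 2, whereas averaging every period would give 8.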

[0021] In the above embodiment, the smoothing unit 6 may be replaced, as shown in FIG. 5, by a smoothing unit 6a that additionally has a second comparator 63 and a selector 64. The second comparator 63 compares the smoothed output 107 from the up/down counter 62 with a predetermined constant A, and outputs 1 (on) when the smoothed output 107 is larger and 0 (off) when it is smaller, as the comparison result 115. The selector 64 then selects the constant A while the comparison result 115 from the second comparator 63 is 1 and the smoothed output 107 while it is 0, and supplies the selected value to the comparator 7 as the selected smoothed output 107a.

This modification resolves the following problem. For a signal with a long speech duration (such as a modem signal for fax transmission and reception), the smoothed output 107 keeps increasing throughout the speech interval. Once the smoothed output 107 has reached the signal strength output 106, the remaining speech interval is taken into the background noise level calculation interval, which causes an error in the calculated background noise level.

When the voice input signal 101 has a long speech duration, as in FIG. 6(a), and changes from silence to speech, as in FIG. 6(b), the signal strength output 106 of the signal strength calculation unit 5 becomes larger than the smoothed output 107. The smoothing unit 6a sets the comparison result 113 between the signal strength output 106 and the current counter value 107 to 1, as in FIG. 6(c), and starts increasing the smoothed output 107 by 1. Depending on whether the smoothed output 107 exceeds the constant A, the comparison result 115 between the smoothed output 107 and the constant A becomes 1 or 0, as in FIG. 6(d), and the constant A or the smoothed output 107, respectively, is selected as the selected smoothed output 107a, as in FIG. 6(b). Even when the speech duration is long, the constant A acts as an upper limit, and the selected smoothed output 107a never exceeds the signal strength output 106.
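The comparator-and-selector arrangement of FIG. 5 can be sketched as below. The function and parameter names are hypothetical, and holding the counter on equality is an assumption. Note that, following the description, the counter itself is not clamped; only the value passed on to the comparator 7 is limited to the constant A.

```python
def smooth_with_ceiling(strengths, ceiling_a, initial=0):
    """Up/down counter smoothing with the output limited to ceiling_a."""
    counter = initial
    selected = []
    for p in strengths:
        # Up/down counter 62 (smoothed output 107).
        if p > counter:
            counter += 1
        elif p < counter:
            counter -= 1
        # Second comparator 63 (result 115) and selector 64:
        # pass the constant A when the counter exceeds it, else the counter.
        selected.append(ceiling_a if counter > ceiling_a else counter)
    return selected
```

With a sustained input of strength 10 and `ceiling_a=3`, the selected output rises to 3 and stays there, so a long tone keeps being classified as above the reference instead of eventually being absorbed into the noise estimate.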

[0022] In the above embodiment, the smoothing unit 6 may also be replaced, as shown in FIG. 7, by a smoothing unit 6b comprising first, second, and third comparators 61a, 61b, and 61c, an up/down counter 62a, an OR operator 65, and first and second multipliers 66 and 67. This has the same effect as the configuration of FIG. 5.

At every signal strength calculation period t, the signal strength output 106 from the signal strength calculation unit 5 is compared by the first, second, and third comparators 61a, 61b, and 61c with, respectively, the current counter value 107b, the product 118 of the current counter value 107b and a predetermined constant α (α > 1), and the product 119 of the current counter value 107b and a predetermined constant β (0 < β < 1). The first comparator 61a outputs 1 (on) when the signal strength output 106 is larger than the current counter value 107b and 0 (off) when it is smaller, as the comparison result 120. The second comparator 61b outputs 1 (on) when the signal strength output 106 is larger than the first product 118 and 0 (off) when it is smaller, as the comparison result 121. The third comparator 61c outputs 0 (off) when the signal strength output 106 is larger than the second product 119 and 1 (on) when it is smaller, as the comparison result 122. The OR operator 65 takes the logical OR of the comparison results 121 and 122 from the second and third comparators 61b and 61c and outputs it as the signal strength decision result 123. When the signal strength decision result 123 from the OR operator 65 is 0, the up/down counter 62a holds the current counter value 107b. When it is 1, the counter adds 1 to the current counter value 107b if the comparison result 120 from the first comparator 61a is 1 and subtracts 1 if it is 0. The counter value is output as the decision smoothed output 107b.

When the voice input signal 101 has a long speech duration, as in FIG. 8(a), and changes from silence to speech, as in FIG. 8(b), the signal strength output 106 of the signal strength calculation unit 5 becomes larger than the decision smoothed output 107b. The smoothing unit 6b sets the comparison result 120, the comparison result 121, and the signal strength decision result 123 each to 1, as in FIGS. 8(c), 8(d), and 8(f), and starts increasing the decision smoothed output 107b by 1. When the signal strength output 106 becomes equal to the first product 118, the comparison result 121, the comparison result 122, and the decision result 123 each become 0, as in FIGS. 8(d), 8(e), and 8(f), and the decision smoothed output 107b is held at its previous value, as in FIG. 8(b).

When the state changes from speech to silence, the signal strength output 106 of the signal strength calculation unit 5 becomes smaller than the decision smoothed output 107b. The smoothing unit 6b sets the comparison result 120, the comparison result 122, and the signal strength decision result 123 to 0, 1, and 1, respectively, as in FIGS. 8(c), 8(e), and 8(f), and the decision smoothed output 107b starts decreasing by 1. When the signal strength output 106 becomes equal to the second product 119, the comparison result 121, the comparison result 122, and the decision result 123 each become 0, as in FIGS. 8(d), 8(e), and 8(f), and the decision smoothed output 107b is again held at its previous value, as in FIG. 8(b). Even when the speech duration is long, the first product 118 serves as the upper limit and the second product 119 as the lower limit, so the decision smoothed output 107b is controlled to take a value between the signal strength during speech and that during silence.
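The hold band of FIG. 7 can be sketched as follows. The counter moves only while the signal strength lies outside the band from β times the counter to α times the counter, and holds otherwise. The function name is hypothetical, and the sketch assumes a positive counter value so that the band is well ordered.

```python
def smooth_with_dead_band(strengths, alpha, beta, initial):
    """Up/down counter that holds while beta*c <= strength <= alpha*c.

    Assumes alpha > 1, 0 < beta < 1, and a positive counter value.
    """
    counter = initial
    out = []
    for p in strengths:
        above = p > alpha * counter   # comparison result 121 (comparator 61b)
        below = p < beta * counter    # comparison result 122 (comparator 61c)
        if above or below:            # decision result 123 = 1: outside the band
            if p > counter:           # comparison result 120 (comparator 61a)
                counter += 1
            else:
                counter -= 1
        # Decision result 123 = 0: the previous value is held.
        out.append(counter)
    return out
```

With `alpha=2` and `beta=0.5`, a sustained strength of 20 drives the counter up only until twice the counter reaches 20, and a following strength of 2 drives it down only until half the counter reaches 2, so the counter settles between the two levels in both cases.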

【００２３】また上記実施例で平滑部６として、図９の
ように第１〜３の加算器６００と６０１と６０２、第１
〜４の遅延素子６１１と６１２と６２１と６２２および
第１〜５の乗算器６３０と６３１と６３２と６４１と６
４２を備える低域通過フィルタを設けてもよい。上記図
５と同じ効果がある。図９でサンプリング間隔を信号強
度算出周期ｔ、時刻（ｋ×ｔ、ｋ＝０、±１、±２、
…）時の入力と出力信号をＸ_k とＹ_k とすると、たとえ
ばｂ _n ＝０の場合、次のとおり移動平均フィルタを形成
する。Ｙ_k ＝Σ_n ａ_n Ｘ_k-n （ｎ＝０〜ＮＮ：タップ数）＝１／Ｎ＋１ Σ_n Ｘ_k-n （ａ_n ＝１／Ｎ＋１の場合）入力信号Ｘ_k としての信号強度出力１０６に対し信号強
度算出周期ｔのブロック区間内でフィルタリング演算を
施すから、出力信号Ｙ_k としての低域通過平滑化出力１
０７ｃは、移動平均時間幅（Ｎ＋１）×ｔのブロック区
間内入力サンプルの２乗値相加平均となる。移動平均時
間幅（Ｎ＋１）×ｔを信号強度算出ブロック時間幅ｔに
比べ十分広くなるようにＮを設定すれば、移動平均時間
幅内に有音／無音とも含み、低域通過平滑化出力１０７
ｃは有音時音声レベルと無音時背景雑音レベル間の値を
とるようになる。In the above embodiment, the smoothing unit 6 may instead be a low-pass filter comprising first to third adders 600, 601, and 602, first to fourth delay elements 611, 612, 621, and 622, and first to fifth multipliers 630, 631, 632, 641, and 642, as shown in FIG. 9. This gives the same effect as the configuration of FIG. 5. In FIG. 9, let the sampling interval be the signal strength calculation cycle t, and let X_k and Y_k be the input and output signals at time k × t (k = 0, ±1, ±2, …). If, for example, b_n = 0, a moving average filter is formed as follows:

$$Y_k = \sum_{n=0}^{N} a_n X_{k-n} = \frac{1}{N+1} \sum_{n=0}^{N} X_{k-n} \qquad \left(\text{when } a_n = \frac{1}{N+1}\right)$$

where N is the number of taps. Because the filtering operation is applied to the signal strength output 106, taken as the input signal X_k and computed over block sections of the signal strength calculation cycle t, the low-pass smoothed output 107c, taken as the output signal Y_k, is the arithmetic mean of the squared values of the input samples within a block section of the moving-average time width (N + 1) × t. If N is set so that the moving-average time width (N + 1) × t is sufficiently wider than the signal strength calculation block time width t, the moving-average window contains both sound and silence, and the low-pass smoothed output 107c takes a value between the voice level during sound and the background noise level during silence.
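The moving average case (b_n = 0, a_n = 1/(N + 1)) can be illustrated with a short Python sketch. The function name is hypothetical, and treating samples before the start of the sequence as zero is an assumption made here for simplicity; the description does not state the initial condition.

```python
def moving_average(strengths, n):
    """Y_k = (1 / (N + 1)) * sum of X_(k-i) for i = 0..N.

    `strengths` plays the role of the signal strength output X_k, one
    value per calculation cycle t. Samples before the first one are
    treated as 0 (an assumption of this sketch).
    """
    window = n + 1
    out = []
    running = 0.0
    for k, x in enumerate(strengths):
        running += x
        if k >= window:
            running -= strengths[k - window]  # drop the sample leaving the window
        out.append(running / window)
    return out
```

When the window spans both quiet and loud blocks, the output falls between the two levels, which is the property the paragraph above relies on.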

【００２４】また上記実施例で比較器７に出力保持手段
を設け比較器７ｄとし、信号強度出力１０６と平滑化出
力１０７との比較結果１０８が１から０へ状態変化をす
ると、所定時間ｔ１を経過する時までは出力を１に保持
し、０から１へ状態変化をする時までは出力を０に保持
するように制御してもよい。雑音レベル算出部８で低レ
ベルの音声区間（語尾部位など）を背景雑音レベルの算
出区間に加えることを防ぐ効果がある。図１０（ａ）の
ように音声入力信号１０１の平均レベルが極端に低いと
き、まず図１０（ｂ）のように音声入力信号１０１が無
音から有音へ状態変化をすると、信号強度算出部５の信
号強度出力１０６は、増加し平滑化出力１０７より大き
くなる。平滑部６は、信号強度出力１０６に等しくなる
まで平滑化出力１０７を１ずつ増加し続ける。比較器７
は、図１０（ｃ）のように信号強度出力１０６と平滑化
出力１０７との比較結果１０８を１とする。有音から無
音へ状態変化をすると、信号強度出力１０６は減小し、
平滑化出力１０７より小さくなる。信号強度出力１０６
に等しくなるまで平滑化出力１０７を１ずつ減小し続け
る。信号強度出力１０６と平滑化出力１０７との比較結
果１０８を０とする。図１０（ｃ）のように比較結果１
０８が１から０へ状態変化をすると、所定時間ｔ₁ を経
過する時までは、図１０（ｄ）のように保持機能付加後
の信号強度出力１０６と平滑化出力１０７との比較結果
１０８ａを１とし、比較結果１０８が０から１へ状態変
化をする時までは、比較結果１０８ａを０とする。従っ
て図１０（ｃ）のように比較結果１０８の０区間をその
まま背景雑音レベル算出区間とすると、低レベルの音声
区間をも背景雑音レベル算出区間に加えることになる
が、図１０（ｄ）のように比較結果１０８ａの０区間を
背景雑音レベル算出区間とすれば防げる。In the above embodiment, the comparator 7 may be provided with output holding means to form a comparator 7d, controlled so that when the comparison result 108 between the signal strength output 106 and the smoothed output 107 changes from 1 to 0, the output is held at 1 until a predetermined time t_1 has elapsed, and is then held at 0 until the comparison result changes from 0 to 1. This prevents the noise level calculation unit 8 from including low-level voice sections (such as word endings) in the background noise level calculation section. When the average level of the audio input signal 101 is extremely low as shown in FIG. 10(a), and the audio input signal 101 first changes from silence to sound as shown in FIG. 10(b), the signal strength output 106 of the signal strength calculation unit 5 increases and becomes larger than the smoothed output 107. The smoothing unit 6 keeps increasing the smoothed output 107 by one at a time until it equals the signal strength output 106. The comparator 7 sets the comparison result 108 between the signal strength output 106 and the smoothed output 107 to 1 as shown in FIG. 10(c). When the state changes from sound to silence, the signal strength output 106 decreases and becomes smaller than the smoothed output 107. The smoothed output 107 is then decreased by one at a time until it equals the signal strength output 106, and the comparison result 108 between the signal strength output 106 and the smoothed output 107 is set to 0. When the comparison result 108 changes from 1 to 0 as shown in FIG. 10(c), the comparison result 108a obtained after adding the holding function is kept at 1 until the predetermined time t_1 has elapsed, as shown in FIG. 10(d), and is then kept at 0 until the comparison result 108 changes from 0 to 1. Therefore, if the 0 sections of the comparison result 108 in FIG. 10(c) were used directly as the background noise level calculation section, low-level voice sections would also be included in it; using the 0 sections of the comparison result 108a in FIG. 10(d) instead prevents this.
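The output holding function can be sketched in Python as below. This is a minimal sketch under stated assumptions: the function name is hypothetical, and the predetermined time t_1 is expressed as a whole number of signal strength calculation blocks (`hold_blocks`), which the description does not specify.

```python
def hold_comparison(bits, hold_blocks):
    """Derive comparison result 108a from comparison result 108.

    After each 1 -> 0 change of the input, the output stays at 1 for
    `hold_blocks` further blocks (the assumed stand-in for t_1), then
    stays at 0 until the input returns to 1.
    """
    out = []
    remaining = 0   # blocks of hold time still to run
    prev = 0
    for b in bits:
        if b == 1:
            remaining = 0
            out.append(1)
        else:
            if prev == 1:
                remaining = hold_blocks   # falling edge: start the hold
            if remaining > 0:
                out.append(1)
                remaining -= 1
            else:
                out.append(0)
        prev = b
    return out
```

For example, `hold_comparison([0, 1, 1, 0, 0, 0, 0, 1, 0], 2)` returns `[0, 1, 1, 1, 1, 0, 0, 1, 1]`: the two blocks following each falling edge are still reported as 1 and so are excluded from the background noise level calculation section.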

【００２５】また上記実施例で高レベル検出部２は、所
定閾値１１１の代わりに閾値設定部９で背景雑音レベル
に適応する閾値を用いてもよい。また信号強度算出部５
は、２乗値加算平均値の代わりに絶対値加算平均値また
はピーク値を信号強度出力１０６としてもよい。また平
滑部６は、平滑化出力１０７を１ずつではなく一定量ず
つ増加／減小してもよい。増加／減小時で一定量ではな
く異なる値をとってもよい。In the above embodiment, the high-level detection unit 2 may use a threshold adapted to the background noise level by the threshold setting unit 9 instead of the predetermined threshold 111. The signal strength calculation unit 5 may output, as the signal strength output 106, the mean of absolute values or the peak value instead of the mean of squared values. The smoothing unit 6 may increase or decrease the smoothed output 107 by a fixed amount rather than by one, and the amounts used for increasing and for decreasing may differ from each other.

【００２６】以上のようにこの一実施例では、まず音声
入力信号の低レベルおよび高レベルをそれぞれ適応閾値
および所定閾値と比較し、有音／無音判定結果に対し論
理和演算とハングオーバ付加処理を施し音声検出出力信
号とする。つぎに音声入力信号の信号強度算出周期ブロ
ック区間内所定数サンプルに対し、たとえば２乗値相加
平均を施す信号強度出力とカウンタ現在値との比較結果
でカウンタ現在値を加減し平滑化出力とする。または平
滑化出力と予め決める定数との比較結果で平滑化出力と
定数とのいずれかを選択し出力とする。または前記信号
強度出力とカウンタ現在値、第１の乗算結果（カウンタ
現在値と予め決める第１の定数との乗算結果）および第
２の乗算結果（カウンタ現在値と予め決める第２の定数
との乗算結果）との第１、第２および第３の比較結果に
従い、第２と第３の比較結果に論理和演算を施す信号強
度判定結果と第１の比較結果との組合せでカウンタ現在
値を加減または前値保持し出力する。または信号強度算
出周期ごとに入力する前記信号強度出力に対しフィルタ
リング演算を施し移動平均時間幅のブロック区間内で信
号強度を平滑化し出力する。さらに前記信号強度出力と
平滑化出力との比較結果でまたはその状態変化で出力保
持機能を制御し、信号強度出力に対し相加平均を施して
更新するか、しないで直前値を維持する背景雑音レベル
に適応し閾値を設定する。 As described above, in this embodiment, the low level and the high level of the audio input signal are first compared with the adaptive threshold and the predetermined threshold, respectively, and the sound/silence determination results are subjected to an OR operation and hangover addition processing to produce the voice detection output signal. Next, for a predetermined number of samples within each block section of the signal strength calculation cycle, a signal strength output is obtained by, for example, taking the arithmetic mean of the squared values; the counter current value is incremented or decremented according to the result of comparing this signal strength output with the counter current value, and the result is used as the smoothed output. Alternatively, either the smoothed output or a predetermined constant is selected as the output according to the result of comparing the smoothed output with that constant. Alternatively, first, second, and third comparison results are obtained by comparing the signal strength output with the counter current value, with the first multiplication result (the counter current value multiplied by a predetermined first constant), and with the second multiplication result (the counter current value multiplied by a predetermined second constant); the counter current value is then incremented, decremented, or held at its previous value according to the combination of the first comparison result and the signal strength determination result obtained by ORing the second and third comparison results. Alternatively, a filtering operation is applied to the signal strength output supplied every signal strength calculation cycle, so that the signal strength is smoothed within the block section of the moving-average time width. Furthermore, according to the result of comparing the signal strength output with the smoothed output, or with the output holding function controlled by a state change of that result, the background noise level is either updated by arithmetic averaging of the signal strength output or left at its immediately preceding value, and the threshold is set adaptively to that background noise level.
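The final step summarized above, updating the background noise level only in blocks where the signal strength output is below the smoothed output and then deriving the threshold from it, can be sketched in Python as follows. Several details are assumptions of this sketch and are not given in the description: the function name, averaging over the last `history` accepted blocks, the rule threshold = `margin` × noise level, and holding the previous value when the two inputs are equal.

```python
def track_noise_level(strengths, smoothed, initial_noise=0.0, history=4, margin=2.0):
    """Gate the background noise update by the comparison result.

    strengths: signal strength output per block.
    smoothed:  smoothed output per block.
    history and margin are illustrative values, not taken from the patent.
    """
    recent = []             # accepted (noise-only) strength values
    noise = initial_noise
    levels, thresholds = [], []
    for s, m in zip(strengths, smoothed):
        if s < m:
            # Strength below the smoothed output: treat the block as noise
            # and update the level by arithmetic averaging.
            recent = (recent + [s])[-history:]
            noise = sum(recent) / len(recent)
        # Otherwise keep the immediately preceding noise level.
        levels.append(noise)
        thresholds.append(margin * noise)
    return levels, thresholds
```

In this sketch a loud block (strength above the smoothed output) leaves the noise level and the threshold unchanged, which corresponds to the behaviour that keeps the calculated noise level from rising during speech.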

【００２７】[0027]

【発明の効果】以上のようなこの発明の音声検出器で
は、雑音レベル算出部は、音声入力信号の所定区間毎
に信号強度を算出し、この信号強度と、先に入力され、
平滑化された音声入力信号の信号強度と比較して背景雑
音レベルを定め、この背景雑音レベルに適応した適応閾
値を閾値設定部が算出し出力するので、音声信号が低レ
ベルでも雑音レベルの算出値が誤って上昇するのを防ぐ
ことができ、正常に閾値適応し、良好な音声検出特性を
維持することができる。As described above, in the voice detector according to the present invention, the noise level calculation unit calculates the signal strength of the audio input signal for each predetermined section and compares this signal strength with the smoothed signal strength of the previously input audio input signal to determine the background noise level, and the threshold setting unit calculates and outputs an adaptive threshold adapted to this background noise level. Consequently, even when the voice signal is at a low level, the calculated noise level is prevented from rising erroneously, threshold adaptation proceeds normally, and good voice detection characteristics are maintained.

[Brief description of the drawings]

【図１】この発明を示す一実施例の音声検出器の機能
ブロック図。FIG. 1 is a functional block diagram of a voice detector according to an embodiment of the present invention.

【図２】図１に示す信号強度算出部の機能ブロック
図。FIG. 2 is a functional block diagram of a signal strength calculator shown in FIG. 1;

【図３】図１に示す平滑部の機能ブロック図。FIG. 3 is a functional block diagram of a smoothing unit shown in FIG. 1;

【図４】図１に示す音声検出器の動作を説明する図。FIG. 4 is a view for explaining the operation of the voice detector shown in FIG. 1;

【図５】図１に示す平滑部の他の一実施例の機能ブロ
ック図。FIG. 5 is a functional block diagram of another embodiment of the smoothing unit shown in FIG. 1;

【図６】図５に示す平滑部の動作を説明する図。FIG. 6 is a view for explaining the operation of the smoothing unit shown in FIG. 5;

【図７】図１に示す平滑部の他の一実施例の機能ブロ
ック図。FIG. 7 is a functional block diagram of another embodiment of the smoothing unit shown in FIG. 1;

【図８】図７に示す平滑部の動作を説明する図。FIG. 8 is a view for explaining the operation of the smoothing unit shown in FIG. 7;

【図９】図１に示す平滑部として用いる低域通過フィ
ルタを説明する図。FIG. 9 is a diagram illustrating a low-pass filter used as a smoothing unit shown in FIG. 1;

【図１０】図１に示す比較器に付加する出力保持機能
を説明する図。FIG. 10 is a view for explaining an output holding function added to the comparator shown in FIG. 1;

【図１１】従来例の音声検出器の機能ブロック図。FIG. 11 is a functional block diagram of a conventional voice detector.

【図１２】図１１に示す低レベル検出部の機能ブロッ
ク図。FIG. 12 is a functional block diagram of a low-level detection unit shown in FIG. 11;

【図１３】図１１に示す高レベル検出部の機能ブロッ
ク図。FIG. 13 is a functional block diagram of a high-level detection unit shown in FIG. 11;

【図１４】図１１に示す音声検出器の動作を説明する
図。FIG. 14 is a view for explaining the operation of the voice detector shown in FIG. 11;

[Explanation of symbols]

１低レベル検出部、２高レベル検出部、３論理和
演算器、４ハングオーバ付加部、５信号強度算出
部、６平滑部、７比較器、８雑音レベル算出部、
９閾値設定部、１０１音声入力信号、１０２低レ
ベル有音／無音判定結果、１０３高レベル有音／無音
判定結果、１０４有音／無音判定結果、１０５音声
検出出力信号、１０６信号強度出力、１０７平滑化
出力、１０８信号強度出力と平滑化出力との比較結
果、１０９背景雑音レベル、１１０適応閾値、１１１
所定閾値、１１２定数１／Ｎ。1: low-level detection unit, 2: high-level detection unit, 3: OR operation unit, 4: hangover addition unit, 5: signal strength calculation unit, 6: smoothing unit, 7: comparator, 8: noise level calculation unit, 9: threshold setting unit, 101: audio input signal, 102: low-level sound/silence determination result, 103: high-level sound/silence determination result, 104: sound/silence determination result, 105: voice detection output signal, 106: signal strength output, 107: smoothed output, 108: comparison result between signal strength output and smoothed output, 109: background noise level, 110: adaptive threshold, 111: predetermined threshold, 112: constant 1/N.

フロントページの続き (58)調査した分野(Int.Cl.⁶，ＤＢ名) G10L 3/00 G10L 9/00 Continuation of the front page (58) Field surveyed (Int. Cl. ⁶ , DB name) G10L 3/00 G10L 9/00

Claims

(57) [Claims]

1. A voice detector comprising a low-level detection unit, a high-level detection unit, an OR operation unit, a hangover addition unit, a signal strength calculation unit, a smoothing unit, a comparator, a noise level calculation unit, and a threshold setting unit, wherein: the low-level detection unit compares an audio input signal with an adaptive threshold from the threshold setting unit and outputs a low-level sound/silence determination result; the high-level detection unit compares the audio input signal with a predetermined threshold or with the adaptive threshold from the threshold setting unit and outputs a high-level sound/silence determination result; the OR operation unit performs an OR operation on the low-level sound/silence determination result and the high-level sound/silence determination result and outputs a sound/silence determination result; the hangover addition unit applies hangover addition processing to the sound/silence determination result to produce a voice detection output signal; the signal strength calculation unit calculates the signal strength of the audio input signal for each predetermined section and outputs it as a signal strength output; the smoothing unit smooths the signal strength output from the signal strength calculation unit and outputs it as a smoothed output; the comparator compares the signal strength output with the smoothed output and outputs a comparison result; the noise level calculation unit, based on the comparison result of the comparator, applies arithmetic averaging to the signal strength output and outputs the result as a background noise level when the signal strength output is smaller than the smoothed output, and outputs the value output immediately before as the background noise level when the signal strength output is larger than the smoothed output; and the threshold setting unit calculates and outputs an adaptive threshold adapted to the background noise level.

2. The voice detector according to claim 1, wherein the smoothing unit comprises a smoothing-unit comparator and an up/down counter; the smoothing-unit comparator compares the signal strength output with the output of the up/down counter; and the up/down counter, based on the comparison result of the smoothing-unit comparator, subtracts from the counter current value when the signal strength output is smaller than the smoothed output and adds to the counter current value when the signal strength output is larger than the smoothed output, the result being the smoothed output.

3. The voice detector according to claim 1, wherein the smoothing unit comprises a smoothing-unit first comparator, an up/down counter, a smoothing-unit second comparator, and a selector; the smoothing-unit first comparator compares the signal strength output with the output of the up/down counter; the up/down counter, based on the comparison result of the smoothing-unit first comparator, subtracts from the counter current value when the signal strength output is smaller than the smoothed output, adds to the counter current value when the signal strength output is larger than the smoothed output, and outputs the result; the smoothing-unit second comparator compares the output of the up/down counter with a constant A and outputs a comparison result; and the selector, according to the comparison result of the smoothing-unit second comparator, takes either the output of the up/down counter or the constant A as the smoothed output.

4. The voice detector according to claim 1, wherein the smoothing unit comprises a first multiplier, a second multiplier, a smoothing-unit first comparator, a smoothing-unit second comparator, a smoothing-unit third comparator, an OR operation unit, and an up/down counter; the first multiplier multiplies a constant α by the output of the up/down counter to calculate an upper limit; the second multiplier multiplies a constant β by the output of the up/down counter to calculate a lower limit; the smoothing-unit first comparator compares the signal strength output with the output of the up/down counter and outputs a comparison result; the smoothing-unit second comparator compares the signal strength output with the output of the first multiplier and outputs a comparison result; the smoothing-unit third comparator compares the signal strength output with the output of the second multiplier and outputs a comparison result; the OR operation unit performs an OR operation on the comparison results of the smoothing-unit second comparator and the smoothing-unit third comparator and outputs the result as a determination result; and the up/down counter holds the current value when the determination result of the OR operation unit indicates that the signal strength output is within the specified range, and otherwise, based on the comparison result of the smoothing-unit first comparator, subtracts from the current value when the signal strength output is smaller than the output of the up/down counter and adds to the current value when the signal strength output is larger than the output of the up/down counter, the output of the up/down counter being the smoothed output.

5. The voice detector according to claim 1, wherein the smoothing unit comprises a low-pass filter forming a moving average filter, and calculates and outputs the smoothed output from the signal strength output.

6. The voice detector according to any one of claims 1 to 5, wherein, when the comparison result changes, the comparator holds and outputs the comparison result from immediately before the change for a fixed time.