JPH0832526A - Voice detector - Google Patents

Voice detector

Info

Publication number
JPH0832526A
JPH0832526A JP6186840A JP18684094A JPH0832526A JP H0832526 A JPH0832526 A JP H0832526A JP 6186840 A JP6186840 A JP 6186840A JP 18684094 A JP18684094 A JP 18684094A JP H0832526 A JPH0832526 A JP H0832526A
Authority
JP
Japan
Prior art keywords
voice
noise
prediction coefficient
threshold level
average value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP6186840A
Other languages
Japanese (ja)
Inventor
Ichiro Matsumoto
一郎 松本
Seiji Sasaki
誠司 佐々木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kokusai Electric Corp
Original Assignee
Kokusai Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kokusai Electric Corp filed Critical Kokusai Electric Corp
Priority to JP6186840A priority Critical patent/JPH0832526A/en
Publication of JPH0832526A publication Critical patent/JPH0832526A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To reduce an unpleasant sense of a reproduced voice by preventing missing of a voice frame due to noise/voice detection error and executing voice detection processing without giving an unpleasant sense to a recipient. CONSTITUTION:A VAD threshold level 1, that is, a voice detection threshold level 1 is equal to a prediction coefficient threshold level used to decide a noise region by a noise region decision device, and a VAD threshold level 2 and a VAD threshold level 3 are used to extract a region of higher frequency of occurrence than that of the VAD threshold level 1. Thus, they are set to decide a narrower region to be a noise (silence section). A voice/silence deciding device 35 sets a voice detection flag (u) to silence L when a mean value of prediction coefficients a1, a2 enters a specific range set by any of the VAD threshold levels 1, 2, 3 selected by a threshold level changeover device 41 among plural predetermined ranges and sets the flag to a voiced sound H in other cases. A final voice detection output (v) is obtained from the obtained result by using a hang-over processing circuit 36.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】本発明は、音声符号化方式に用いられる音
声検出器に関するものである。
The present invention relates to a voice detector used in a voice coding system.

【0002】[0002]

【従来の技術】携帯型の無線機等では、送信時の消費電
力を低減するために、音声があるときのみ送信し音声が
無いときには送信を中断するVOX(Voice Operate Sw
itch Exchange )制御が使用されており、これを用いる
と送信時の平均電力を削減することができる。このよう
なVOX機能を実行するために送信出力回路の前段に音
声信号の有無を検出する音声検出器が必要になる。この
ような音声検出器を本願発明者の一人が先に提案した
〔特開平4−301930号公報参照〕。
2. Description of the Related Art In a portable radio device or the like, in order to reduce power consumption during transmission, VOX (Voice Operate Swing) which transmits only when there is voice and suspends transmission when there is no voice
itch exchange) control is used, which can reduce the average power during transmission. In order to execute such a VOX function, a voice detector for detecting the presence / absence of a voice signal is required in the preceding stage of the transmission output circuit. One of the inventors of the present application previously proposed such a voice detector [see Japanese Patent Application Laid-Open No. 4-301930].

【0003】[0003]

【発明が解決しようとする課題】しかし、従来技術で
は、有音/無音判定用の予測係数しきい値が固定のため
背景雑音が大きくなるほど、音声の予測係数の発生頻度
分布領域が雑音の予測係数発生頻度分布領域に近づくた
め、音声区間であるのに雑音区間と判定される判定誤り
を生じ、送信出力に音声区間の欠落が起きてしまう。音
声区間の欠落は話者に対し不快感を与えるばかりではな
く、会話の明瞭度も低下させてしまう。このため雑音環
境下における音声検出としてはまだ十分とはいえない。
However, in the prior art, as the background noise becomes larger because the prediction coefficient threshold for determining the voice / silence is fixed, the occurrence frequency distribution region of the prediction coefficient of the voice predicts the noise. Since the region is close to the coefficient occurrence frequency distribution region, a determination error that a voice segment is determined to be a noise segment occurs, and a voice segment is missing in the transmission output. The lack of the voice section not only makes the speaker uncomfortable, but also reduces the intelligibility of the conversation. Therefore, it cannot be said to be sufficient for voice detection in a noisy environment.

【0004】本発明の目的は、従来技術の問題点である
雑音環境下でのVOX制御による音声信号の欠落をなく
し、受信側の再生音声の不快感を軽減した音声検出器を
提供することにある。
An object of the present invention is to provide a voice detector which eliminates the loss of voice signal due to VOX control in a noisy environment, which is a problem of the prior art, and reduces the discomfort of the reproduced voice on the receiving side. is there.

【0005】[0005]

【課題を解決するための手段】この目的を達成するため
に、本発明による音声検出器は、音声符号化装置の適応
予測器から得られる予測係数をフレーム化する第1のフ
レーム器と、該予測係数のフレーム区間の平均値を計算
する平均値計算器と、前記予測係数の発生分布から予め
求めた予測係数しきい値と前記平均値とを比較し雑音領
域発生頻度分布の発生頻度の高い部分に位置する区間を
見つけて雑音領域を判定する雑音領域判定器と、均一P
CM信号をフレーム化する第2のフレーム化器と、その
フレーム化された均一PCM信号が雑音領域と判定され
たときのみ該均一PCM信号の電力平均値を計算する電
力計算器と、前記雑音領域と判定されたときのみ、該均
一PCM信号の電力平均値がある決められた数種類の電
力しきい値により設定された複数の領域のいずれの領域
にあるかの判定結果を出力する電力判定器と、該判定結
果により前記電力平均値が高い領域にあることが判定さ
れたときに音声検出用しきい値を前記予測係数しきい値
より高くするように切換えるしきい値切換え器と、前記
均一PCM信号の有音/無音判定をするために前記予測
係数の平均値が前記しきい値切換え器により切換えられ
た音声検出用しきい値により設定される特定の範囲にあ
るかの判定により前記区間が有音区間であるか無音区間
であるかのいずれかを示す有音/無音フラグを出力する
有音/無音判定器と、該フラグに適当なハングオーバ処
理を行い音声検出出力信号を出力するハングオーバー処
理装置とを備えた構成を有している。
To achieve this object, a speech detector according to the present invention comprises a first framer for framing a prediction coefficient obtained from an adaptive predictor of a speech coding apparatus, and An average value calculator for calculating the average value of the prediction coefficient in the frame section, and a prediction coefficient threshold value previously obtained from the occurrence distribution of the prediction coefficient and the average value are compared to generate a high occurrence frequency of the noise region occurrence frequency distribution. A noise region determiner for determining a noise region by finding a section located in the portion, and a uniform P
A second framing device for framing the CM signal; a power calculator for calculating a power average value of the uniform PCM signal only when the framed uniform PCM signal is determined to be in the noise region; Only when it is determined that the power average value of the uniform PCM signal outputs a determination result as to which region among a plurality of regions set by a certain number of determined power thresholds. A threshold switch for switching the voice detection threshold to a value higher than the prediction coefficient threshold when it is determined from the determination result that the power average value is in a high region, and the uniform PCM By determining whether the average value of the prediction coefficient is within a specific range set by the voice detection threshold value switched by the threshold value switcher to determine whether the signal is voiced or not. A voice / sound determiner that outputs a voice / silence flag indicating whether the segment is a voice segment or a silence segment, and outputs a voice detection output signal by performing an appropriate hangover process on the flag. And a hangover processing device that operates.

【0006】[0006]

【実施例】実施例として、本発明をPHS(パーソナル
・ハンディホン・システム)の標準符号化方式である3
2kb/s(キロビット/秒)ADPCMに適用する例
を以下に示す。図2は本発明を適用する音声検出機能を
有するADPCM音声符号化装置のブロック図であり、
図1は本発明の音声検出器の実施例を示すブロック図で
ある。
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS As an embodiment, the present invention is a PHS (Personal Handyphone System) standard encoding system.
An example applied to 2 kb / s (kilobits / second) ADPCM is shown below. FIG. 2 is a block diagram of an ADPCM voice encoding device having a voice detection function to which the present invention is applied.
FIG. 1 is a block diagram showing an embodiment of a voice detector of the present invention.

【0007】まず、図2のADPCM符号化装置につい
て説明する。21は64kb/sのμ則PCM入力信号
を線形13ビットPCMの均一PCM信号oに変換する
均一PCM変換器である。22は均一PCM変換器21
の出力oから適応予測器23の出力である予測信号jを
差し引いて差分信号kを得る減算器である。24は差分
信号kを量子化する適応量子化器である。26は32k
b/sの音声データを逆量子化し量子化差分信号mを出
力する適応逆量子化器である。25は量子化差分信号m
と予測信号jを加算して再生信号nを出力する加算器で
ある。23は再生信号nと量子化差分信号mより予測信
号jを生成する適応予測器である。27は本発明に係る
音声検出器である。
First, the ADPCM coding apparatus shown in FIG. 2 will be described. Reference numeral 21 denotes a uniform PCM converter for converting a 64 kb / s μ-law PCM input signal into a uniform 13-bit PCM uniform PCM signal o. 22 is a uniform PCM converter 21
Is a subtractor that subtracts the prediction signal j, which is the output of the adaptive predictor 23, from the output o of the above to obtain the difference signal k. An adaptive quantizer 24 quantizes the difference signal k. 26 is 32k
It is an adaptive dequantizer that dequantizes b / s voice data and outputs a quantized difference signal m. 25 is the quantized difference signal m
And a predicted signal j and add the predicted signal j to output a reproduced signal n. Reference numeral 23 is an adaptive predictor that generates a prediction signal j from the reproduction signal n and the quantized difference signal m. 27 is a voice detector according to the present invention.

【0008】64kb/sのμ則PCM入力信号は線形
13ビットPCMの均一PCM信号oに均一変換器21
で変換される。減算器22で均一PCM信号oから適応
予測器23の出力である予測信号jを差し引いて、差分
信号kを得る。この差分信号kは適応量子化器24によ
り量子化され、ADPCM音声符号化装置の出力として
32kb/sの音声データが伝送路に送出される。一
方、適応逆量子化器26で32kb/sの音声データが
適応逆量子化され量子化差分信号mが出力される。加算
器25で、量子化差分信号mと予測信号jが加算され、
再生信号nが出力される。適応予測器23で、予測係数
a1,a2を算出してそれを用いて量子化差分信号m及
び再生信号nから予測信号jが生成される。適応予測器
23が予測信号jを生成するために算出する予測係数a
1,a2はある時点の標本値を隣接する過去の2つの標
本値で予測するための係数であり、その値は、自己相関
が大きい音声信号の場合と自己相関が小さい背景雑音の
場合とでは領域の異なった発生分布となる。この予測係
数a1,a2と均一PCM信号oが本発明の音声検出器
27に入力される。音声検出器27ではこれらの信号を
用い、有音/無音の判定をして音声検出出力信号を出力
する。
The 64 kb / s μ-law PCM input signal is converted into a uniform 13-bit PCM uniform PCM signal o.
Is converted by. The subtractor 22 subtracts the prediction signal j output from the adaptive predictor 23 from the uniform PCM signal o to obtain the difference signal k. This differential signal k is quantized by the adaptive quantizer 24, and 32 kb / s voice data is sent to the transmission line as the output of the ADPCM voice coding device. On the other hand, the adaptive dequantizer 26 adaptively dequantizes the voice data of 32 kb / s and outputs the quantized difference signal m. In the adder 25, the quantized difference signal m and the prediction signal j are added,
The reproduction signal n is output. The adaptive predictor 23 calculates the prediction coefficients a1 and a2 and uses them to generate the prediction signal j from the quantized difference signal m and the reproduction signal n. Prediction coefficient a calculated by the adaptive predictor 23 to generate the prediction signal j
1, a2 are coefficients for predicting a sample value at a certain time point with two adjacent sample values in the past, and the values thereof are different between a case of a speech signal with a large autocorrelation and a case of background noise with a small autocorrelation The distribution of occurrence is different in the area. The prediction coefficients a1 and a2 and the uniform PCM signal o are input to the voice detector 27 of the present invention. The voice detector 27 uses these signals to determine whether there is sound or no sound and outputs a voice detection output signal.

【0009】次に本発明の音声検出器について説明す
る。図1は本発明の音声検出器の構成例を示すブロック
図である。31,32は適応予測器23から得られる予
測係数a1,a2をそれぞれ5msecにフレーム化す
る第1のフレーム化器である。33,34は第1のフレ
ーム化器31,32の各出力の平均値をそれぞれ計算す
る平均値計算器である。35は平均値計算器33,34
の出力を与えられたしきい値と比較して有音/無音判定
を行う有音/無音判定器である。36は有音/無音判定
器35の結果に対してハングオーバー処理を行うハング
オーバー処理装置である。37は均一PCM信号oをフ
レーム化する第2のフレーム化器である。38は平均値
計算器33,34の各出力が予測係数a1,a2の発生
分布から図3のように雑音領域として予め定めた予測係
数しきい値の領域Aに含まれるか否かを比較し、その雑
音領域Aに含まれると判定したときに雑音領域信号tを
出力する雑音領域判定器である。39は第2のフレーム
化器37の出力の電力の平均値を計算する電力計算器で
ある。40は、有音/無音判定器35に与える音声検出
用しきい値すなわち、VAD(Voice Activity Detecti
on)のためのVADしきい値1,VADしきい値2,V
ADしきい値3を背景雑音の電力の大きさにより切換え
るために、電力計算器39の出力を予め設定された電力
しきい値(背景雑音電力判定用しきい値である Back Gr
ound Noise Thresholdとして用いられる bgnth_1, b
gnth_2)と比較し電力の大きさを判定する電力判定器
である。41は有音/無音判定器35に与える音声検出
用しきい値(VADしきい値1,VADしきい値2,V
ADしきい値3)を切換えるしきい値切換え器である。
Next, the voice detector of the present invention will be described. FIG. 1 is a block diagram showing a configuration example of a voice detector of the present invention. Reference numerals 31 and 32 are first framing devices for framing the prediction coefficients a1 and a2 obtained from the adaptive predictor 23 into 5 msec, respectively. 33 and 34 are average value calculators that calculate the average value of each output of the first framing devices 31 and 32, respectively. 35 is an average value calculator 33, 34
Is a voiced / non-voiced discriminator which compares the output of the above with a given threshold value to make a voiced / non-voiced determination. Reference numeral 36 denotes a hangover processing device that performs a hangover process on the result of the sound / silence determiner 35. 37 is a second framing device for framing the uniform PCM signal o. Reference numeral 38 compares whether or not each output of the average value calculators 33 and 34 is included in the area A of the predetermined prediction coefficient threshold value as a noise area from the occurrence distribution of the prediction coefficients a1 and a2. , A noise area determiner that outputs a noise area signal t when it is determined to be included in the noise area A. Reference numeral 39 is a power calculator that calculates the average value of the power output from the second framing device 37. Reference numeral 40 denotes a voice detection threshold value given to the voice / non-voice determination device 35, that is, VAD (Voice Activity Detecti).
on) VAD threshold 1, VAD threshold 2, V
In order to switch the AD threshold value 3 according to the power level of the background noise, the output of the power calculator 39 is set to a preset power threshold value (Back Gr
bgnth_1, b used as sound noise threshold
gnth_2) to determine the magnitude of power. Reference numeral 41 designates a voice detection threshold value (VAD threshold value 1, VAD threshold value 2, V
This is a threshold value switching device for switching the AD threshold value 3).

【0010】まず、予測係数a1,a2は第1のフレー
ム化器31,32でフレーム化され、平均値計算器3
3,34で平均値が計算される。雑音領域判定器38か
らは予測係数a1,a2のフレーム平均値を用いてその
フレームが雑音領域か否かの判定結果を電力計算器39
と電力判定器40に出力する。電力計算器39では雑音
領域判定器38からの雑音領域信号tを受けたときのみ
式(1)の電力平均値(移動平均値)を計算する。
First, the prediction coefficients a1 and a2 are framed by the first framers 31 and 32, and the average value calculator 3
At 3,34 the average value is calculated. From the noise area determiner 38, the power calculator 39 is used to determine whether the frame is in the noise area using the frame average values of the prediction coefficients a1 and a2.
And output to the power determiner 40. The power calculator 39 calculates the power average value (moving average value) of the equation (1) only when receiving the noise area signal t from the noise area determiner 38.

【数1】 bgnp=pbgnp × 0.95 +nbgnp × 0.05 ……(1) ここで、 bgnp :雑音電力平均値 pbgnp :前回の雑音領域フレームの電力平均値 nbgnp :今回の雑音領域フレームの電力平均値[Equation 1] bgnp = pbgnp × 0.95 + nbgnp × 0.05 (1) where, bgnp: average noise power value pbgnp: average power value of previous noise area frame nbgnp: average power value of current noise area frame

【0011】電力判定器40は雑音領域信号tを受けた
ときのみ、電力計算器39の出力(bgnp)を背景雑音電
力判定用電力しきい値と比較し、例えば、(2)式に従
い判定して、しきい値切換え器41を制御し、その判定
結果に対応するVADしきい値1,VADしきい値2又
はVADしきい値3が図4のように選択されて有音/無
音判定器35にVADしきい値として与えられるように
する。
The power determiner 40 compares the output (bgnp) of the power calculator 39 with the power threshold for background noise power determination only when it receives the noise region signal t, and makes a determination according to, for example, equation (2). The threshold switch 41 is controlled to select the VAD threshold 1, VAD threshold 2 or VAD threshold 3 corresponding to the determination result as shown in FIG. 35 as a VAD threshold.

【数2】 ここで、 bgnth_n(n=1,2)は、VADしきい値
1,2,3を切換えるため次式(3)の関係で固定され
電力判定器40にしきい値として与えられた電力であ
る。
[Equation 2] Here, bgnth_n (n = 1, 2) is the power fixed as the threshold to the power determiner 40 in order to switch the VAD thresholds 1, 2, and 3 by the relationship of the following expression (3).

【数3】 bgnth _1<bgnth _2 ……(3)[Equation 3] bgnth_1 <bgnth_2 (3)

【0012】VADしきい値1すなわち音声検出用しき
い値1は、雑音領域判定器38で雑音領域を判定するた
めの予測係数しきい値と等しく、VADしきい値2,V
ADしきい値3は、VADしきい値1より発生頻度の高
い領域を抽出するため、より狭い領域を雑音(無音区
間)と判定するように設定されている。有音/無音判定
器35は、予測係数a1,a2の平均値が図5に示すよ
うな予め定めた複数の範囲のうちしきい値切換え器41
で選択されたVADしきい値1,2又は3により設定さ
れる特定の範囲に入れば音声検出フラグuを無音(L)
に設定し、それ以外の場合は有音(H)に設定する。以
上で得られた結果に対してハングオーバー処理装置36
によりここでは例として100msecのハングオーバ
ー処理を施し最終的な音声検出出力信号vを得る。
The VAD threshold 1, that is, the voice detection threshold 1, is equal to the prediction coefficient threshold for determining the noise region by the noise region determiner 38, and the VAD thresholds 2, V
The AD threshold value 3 is set so that a narrower area is determined as noise (silent section) in order to extract an area having a higher occurrence frequency than the VAD threshold value 1. The voiced / non-voiced determination unit 35 uses the threshold value switching unit 41 among a plurality of predetermined ranges in which the average values of the prediction coefficients a1 and a2 are as shown in FIG.
If it falls within a specific range set by the VAD threshold value 1, 2 or 3 selected in step 3, the voice detection flag u is set to silence (L).
Otherwise, set to voice (H) otherwise. The hangover processing device 36 is applied to the results obtained above.
Therefore, here, as an example, a hangover process of 100 msec is performed to obtain a final voice detection output signal v.

【0013】[0013]

【発明の効果】以上詳細に説明したように、本発明によ
れば、雑音環境下での雑音/音声検出誤りによる音声フ
レームの欠落を防ぎ受話者に不快感を与えることなく音
声検出処理を実行することができるため、その効果は極
めて大きい。
As described above in detail, according to the present invention, the voice detection processing is executed without causing the listener to feel uncomfortable by preventing the voice frame from being dropped due to noise / voice detection error in a noisy environment. Therefore, the effect is extremely large.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の音声検出器の実施例を示すブロック図
である。
FIG. 1 is a block diagram showing an embodiment of a voice detector of the present invention.

【図2】音声検出機能を有するADPCM音声符号化装
置のブロック図である。
FIG. 2 is a block diagram of an ADPCM speech coding apparatus having a speech detection function.

【図3】本発明の動作を説明するための特性図である。FIG. 3 is a characteristic diagram for explaining the operation of the present invention.

【図4】本発明の動作を説明するための特性図である。FIG. 4 is a characteristic diagram for explaining the operation of the present invention.

【図5】本発明の動作を説明するための特性図である。FIG. 5 is a characteristic diagram for explaining the operation of the present invention.

【符号の説明】 31,32 第1のフレーム化器(予測係数用) 33,34 平均値計算器(予測係数用) 35 有音/無音判定器 36 ハングオーバー処理装置 37 第2のフレーム化器(均一PCM信号用) 38 雑音領域判定器 39 電力計算器 40 電力判定器 41 しきい値切換え器 t 雑音領域信号 u 音声検出フラグ v 音声検出出力信号 21 均一PCM変換器 22 減算器 23 適応予測器 24 適応量子化器 25 加算器 26 適応逆量子化器 27 音声検出器 j 予測信号 k 差分信号 m 量子化差分信号 n 再生信号 o 均一PCM信号[Explanation of Codes] 31, 32 First Framer (for Prediction Coefficient) 33, 34 Average Value Calculator (for Prediction Coefficient) 35 Sound / Silence Determiner 36 Hangover Processing Device 37 Second Framer (For uniform PCM signal) 38 Noise region determiner 39 Power calculator 40 Power determiner 41 Threshold switching device t Noise region signal u Speech detection flag v Speech detection output signal 21 Uniform PCM converter 22 Subtractor 23 Adaptive predictor 24 adaptive quantizer 25 adder 26 adaptive dequantizer 27 speech detector j prediction signal k difference signal m quantized difference signal n reproduced signal o uniform PCM signal

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 音声符号化装置の適応予測器から得られ
る予測係数をフレーム化する第1のフレーム器と、 該予測係数のフレーム区間の平均値を計算する平均値計
算器と、 前記予測係数の発生分布から予め求めた予測係数しきい
値と前記平均値とを比較し雑音領域発生頻度分布の発生
頻度の高い部分に位置する区間を見つけて雑音領域を判
定する雑音領域判定器と、 均一PCM信号をフレーム化する第2のフレーム化器
と、 そのフレーム化された均一PCM信号が雑音領域と判定
されたときのみ該均一PCM信号の電力平均値を計算す
る電力計算器と、 前記雑音領域と判定されたときのみ、該均一PCM信号
の電力平均値がある決められた数種類の電力しきい値に
より設定された複数の領域のいずれの領域にあるかの判
定結果を出力する電力判定器と、 該判定結果により前記電力平均値が高い領域にあること
が判定されたときに音声検出用しきい値を前記予測係数
しきい値より高くするように切換えるしきい値切換え器
と、 前記均一PCM信号の有音/無音判定をするために前記
予測係数の平均値が前記しきい値切換え器により切換え
られた音声検出用しきい値により設定される特定の範囲
にあるかの判定により前記区間が有音区間であるか無音
区間であるかのいずれかを示す有音/無音フラグを出力
する有音/無音判定器と、 該フラグに適当なハングオーバ処理を行い音声検出出力
信号を出力するハングオーバー処理装置とを備えた音声
検出器。
1. A first frame unit for framing a prediction coefficient obtained from an adaptive predictor of a speech coding apparatus, an average value calculator for calculating an average value of a frame section of the prediction coefficient, and the prediction coefficient. A noise region determiner that compares a prediction coefficient threshold previously obtained from the occurrence distribution of the above with the average value to find a section located in a high occurrence frequency portion of the noise region occurrence frequency distribution to determine the noise region, and a uniform A second framing device for framing the PCM signal; a power calculator for calculating a power average value of the uniform PCM signal only when the framed uniform PCM signal is determined to be in the noise region; Only when it is determined that the average power value of the uniform PCM signal is in a plurality of regions set by a certain number of predetermined power thresholds A determining device, and a threshold value switching device for switching the voice detection threshold value to be higher than the prediction coefficient threshold value when it is determined by the determination result that the power average value is in a high region, By determining whether the average value of the prediction coefficient is within a specific range set by the voice detection threshold value switched by the threshold value switcher to determine whether the uniform PCM signal is voiced or not. A voice / non-voice deciding device for outputting a voice / non-voice flag indicating whether the period is a voice period or a silence period, and a voice detection output signal by performing an appropriate hangover process on the flag. A voice detector having a hangover processing device.
JP6186840A 1994-07-18 1994-07-18 Voice detector Pending JPH0832526A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP6186840A JPH0832526A (en) 1994-07-18 1994-07-18 Voice detector

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP6186840A JPH0832526A (en) 1994-07-18 1994-07-18 Voice detector

Publications (1)

Publication Number Publication Date
JPH0832526A true JPH0832526A (en) 1996-02-02

Family

ID=16195558

Family Applications (1)

Application Number Title Priority Date Filing Date
JP6186840A Pending JPH0832526A (en) 1994-07-18 1994-07-18 Voice detector

Country Status (1)

Country Link
JP (1) JPH0832526A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005031632A (en) * 2003-06-19 2005-02-03 Advanced Telecommunication Research Institute International Utterance section detecting device, voice energy normalizing device, computer program, and computer
KR101327664B1 (en) * 2012-01-20 2013-11-13 세종대학교산학협력단 Method for voice activity detection and apparatus for thereof
CN108877778A (en) * 2018-06-13 2018-11-23 百度在线网络技术(北京)有限公司 Sound end detecting method and equipment

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005031632A (en) * 2003-06-19 2005-02-03 Advanced Telecommunication Research Institute International Utterance section detecting device, voice energy normalizing device, computer program, and computer
JP4521673B2 (en) * 2003-06-19 2010-08-11 株式会社国際電気通信基礎技術研究所 Utterance section detection device, computer program, and computer
KR101327664B1 (en) * 2012-01-20 2013-11-13 세종대학교산학협력단 Method for voice activity detection and apparatus for thereof
CN108877778A (en) * 2018-06-13 2018-11-23 百度在线网络技术(北京)有限公司 Sound end detecting method and equipment
CN108877778B (en) * 2018-06-13 2019-09-17 百度在线网络技术(北京)有限公司 Sound end detecting method and equipment
US10937448B2 (en) 2018-06-13 2021-03-02 Baidu Online Network Technology (Beijing) Co., Ltd. Voice activity detection method and apparatus

Similar Documents

Publication Publication Date Title
US6223154B1 (en) Using vocoded parameters in a staggered average to provide speakerphone operation based on enhanced speech activity thresholds
US9646621B2 (en) Voice detector and a method for suppressing sub-bands in a voice detector
US7165035B2 (en) Compressed domain conference bridge
RU2251750C2 (en) Method for detection of complicated signal activity for improved classification of speech/noise in audio-signal
FI119085B (en) A method and apparatus for selecting an encoding rate in a variable rate vocoder
US6122531A (en) Method for selectively including leading fricative sounds in a portable communication device operated in a speakerphone mode
US6389391B1 (en) Voice coding and decoding in mobile communication equipment
JPS62274941A (en) Audio coding system
JPH0644195B2 (en) Speech analysis and synthesis system having energy normalization and unvoiced frame suppression function and method thereof
WO2000025301A1 (en) Method and arrangement for providing comfort noise in communications systems
JP2000172283A (en) System and method for detecting sound
JP2006323230A (en) Noise level estimating method and device thereof
JPH11338499A (en) Noise canceller
RU2237296C2 (en) Method for encoding speech with function for altering comfort noise for increasing reproduction precision
JPH0832526A (en) Voice detector
JP2541484B2 (en) Speech coding device
JP3580906B2 (en) Voice decoding device
JP2762938B2 (en) Audio coding device
JP3081264B2 (en) Voice detector
JPH08202394A (en) Voice detector
JPH08130515A (en) Voice coding device
JPH07210199A (en) Method and device for voice encoding
JPH10341211A (en) Voice coding method and its system
JPH07115403A (en) Circuit for encoding and decoding silent section information
JPH07336311A (en) Voice decoder