JP2014052553A - Sound volume correction device - Google Patents

Info

Publication number
JP2014052553A
Authority
JP
Japan
Prior art keywords
gain
value
unit
input signal
calculated
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2012197869A
Other languages
Japanese (ja)
Other versions
JP6065308B2 (en)
Inventor
Teppei Washi
哲平 鷲
Minoru Fukushima
実 福島
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Corp
Original Assignee
Panasonic Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Abstract

PROBLEM TO BE SOLVED: To provide a sound volume correction device capable of reducing the amount of volume correction processing. SOLUTION: A buffer part 11 has a storage region capable of storing sound input signals for a predetermined period of time. When the storage region has no free space, the buffer part 11 overwrites the storage region holding the oldest sound input signals with new sound input signals. Every time all the sound input signals stored in the buffer part 11 have been replaced with new ones, a gain calculation section 13 calculates a gain based on a representative value of the sound input signals stored in the buffer part 11. A gain limitation section 14 calculates the amount of change between the gain previously calculated by the gain calculation section 13 and the gain calculated this time. When the absolute value of the amount of change is equal to or larger than a predetermined first threshold value, a limitation value, obtained by changing the previously calculated gain by a predetermined change upper limit so that it gets closer to the gain calculated this time, is used as the gain this time. A gain multiplication section 15 multiplies the sound input signals read out from the buffer part 11 by the gain set by the gain calculation section 13 and the gain limitation section 14, and outputs the result.

Description

The present invention relates to a volume correction device.

Conventionally, there has been a voice transmission system that performs real-time voice calls and includes a voice transmitting terminal device, which transmits a voice signal in packets, and a voice receiving terminal device, which outputs sound based on the received voice signal (see, for example, Patent Document 1).

In this voice transmission system, the voice receiving terminal device decodes one voice frame of the voice data contained in a packet received from the voice transmitting terminal device, and then applies abnormal-noise countermeasure processing to the decoded digital voice signal to suppress abnormal noise caused by transmission errors. Specifically, on the voice transmitting terminal device side, a first difference calculation means calculates the absolute value of the difference between the quantized value of each sample point of the digital voice signal and the quantized value of the immediately preceding sample point, and obtains the maximum of these absolute values over one voice frame. The abnormal-noise countermeasure processing means of the voice receiving terminal device then calculates the absolute value of the difference between the quantized value of each sample point of the digital voice signal and that of the immediately preceding sample point. When this absolute value exceeds the above maximum, the means judges that abnormal noise is present and performs processing to suppress it.

Patent Document 1: JP 2008-292961 A

In a voice transmission system that performs real-time voice calls as described above, the gain must be adjusted adaptively as the signal level of the audio input signal changes. Conventionally, therefore, each time the audio input signal was sampled, the amplitude of the overall sound was estimated from the sampled instantaneous value, and the gain was calculated from that estimate. Because this processing is performed at every sample, the signal-processing load becomes large, and a high-performance microcomputer had to be used for the arithmetic processing unit.

The present invention has been made in view of the above problem, and its object is to provide a volume correction device with a reduced volume-correction processing load.

The volume correction device of the present invention includes a buffer unit, a gain calculation unit, a gain limiting unit, and a gain multiplication unit. The buffer unit has a storage area capable of storing a predetermined time's worth of audio input signal; when the storage area has no free space, it overwrites the area holding the oldest audio input signal with the new audio input signal. Each time the audio input signal stored in the buffer unit has been entirely replaced, the gain calculation unit calculates a gain based on a representative value of the audio input signal stored in the buffer unit. The gain limiting unit obtains the change between the gain previously calculated by the gain calculation unit and the gain calculated this time; when the absolute value of the change is equal to or greater than a predetermined first threshold, it uses as the current gain a limit value obtained by changing the previously calculated gain by a predetermined fluctuation upper limit toward the gain calculated this time. When the gain has been set by the gain calculation unit and the gain limiting unit, the gain multiplication unit reads the audio input signal from the buffer unit in order from oldest to newest, multiplies the read signal by the gain, and outputs the result.

In this volume correction device, it is also preferable that the gain calculation unit obtain the maximum absolute value of the audio input signal stored in the buffer unit and use that maximum as the representative value.

In this volume correction device, it is also preferable that the gain calculation unit obtain the average of the absolute values of the audio input signal stored in the buffer unit and use that average as the representative value.

In this volume correction device, it is also preferable that the gain calculation unit set the gain so that the representative value multiplied by the gain equals a preset target value.

In this volume correction device, it is also preferable that, if the product of the gain and the representative value would exceed a predetermined second threshold when the gain limiting unit uses the limit value as the current gain, the gain limiting unit instead use as the current gain the gain calculated by the gain calculation unit such that the representative value multiplied by the gain equals a predetermined target value.

In this volume correction device, it is also preferable that the buffer unit reduce the size of the storage area when the absolute value of the change is equal to or greater than the first threshold and the gain limiting unit uses the limit value as the current gain.

In this volume correction device, it is also preferable that, if the representative value is at or below a determination level used to decide whether the signal is silent or sounded, the gain multiplication unit multiply the audio input signal read from the buffer unit by the gain previously calculated by the gain calculation unit and output the result.

It is also preferable that this volume correction device include a voice determination unit that determines whether the audio input signal input to the buffer unit is speech or non-speech. When the voice determination unit determines that all of the audio input signal input to the buffer unit is non-speech, the gain multiplication unit multiplies the audio input signal read from the buffer unit by the gain previously calculated by the gain calculation unit and outputs the result.

In this volume correction device, it is also preferable that the gain calculation unit set the gain to a predetermined upper limit value when the calculated gain exceeds that upper limit value.

According to the present invention, each time the audio input signal stored in the buffer unit has been entirely replaced, the gain calculation unit calculates the gain based on a representative value of the stored audio input signal. Compared with the conventional approach of calculating the gain at every sample, this reduces the volume-correction processing load. In addition, because the gain can be calculated with only the delay determined by the storage capacity of the buffer unit, the delay of the output sound can be reduced. Furthermore, the change between the gain previously calculated by the gain calculation unit and the gain calculated this time is obtained, and when the absolute value of this change is equal to or greater than the first threshold, the gain limiting unit uses as the current gain a limit value obtained by changing the previously calculated gain by a predetermined fluctuation upper limit toward the gain calculated this time. As a result, even when the calculated gain changes by the first threshold or more because the level of the input sound has changed abruptly, the gain change is held to the predetermined fluctuation upper limit, so abrupt changes in the output volume can be suppressed.

FIG. 1 is a block diagram of the present embodiment.

FIG. 2 is a waveform diagram explaining the operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

FIG. 3 is a waveform diagram explaining another operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

FIG. 4 is an explanatory diagram of a method of obtaining the representative value from the audio input signal stored in the buffer unit.

FIG. 5 is an explanatory diagram of another method of obtaining the representative value from the audio input signal stored in the buffer unit.

FIG. 6 is a waveform diagram explaining yet another operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

FIG. 7 is a waveform diagram explaining still another operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

FIG. 8 is a waveform diagram explaining a further operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

Embodiments of the present invention are described below with reference to the drawings.

FIG. 1 is a block diagram of an audio output device 1 that uses a volume correction device 10 according to the present invention.

The audio output device 1 includes a microphone 2, an A/D conversion unit 3, the volume correction device 10, an encoding/packetization processing unit 4, and a wireless transmission unit 5.

The microphone 2 converts the input sound into an electrical signal and outputs it to the A/D conversion unit 3.

The A/D conversion unit 3 A/D-converts the electrical signal input from the microphone 2. The audio input signal, converted into a digital signal by the A/D conversion unit 3, is output to the volume correction device 10.

The volume correction device 10 includes, as its main components, a buffer unit 11, a gain setting unit 12 (consisting of a gain calculation unit 13 and a gain limiting unit 14), and a gain multiplication unit 15, and corrects the volume level of the audio input signal input from the A/D conversion unit 3. The gain setting unit 12 and the gain multiplication unit 15 are realized, for example, by having a microcomputer execute an embedded program.

The buffer unit 11 has a storage area capable of storing a predetermined time's worth of the audio input signal input from the A/D conversion unit 3. When the storage area has no free space, the buffer unit 11 overwrites the area holding the oldest audio input signal with the new audio input signal.
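
The overwrite behaviour of the buffer unit 11 resembles a ring buffer. The following is a minimal sketch under that assumption; the class name `RingBuffer` and its methods are hypothetical and do not appear in the patent.

```python
class RingBuffer:
    """Fixed-capacity sample store; once full, a new sample overwrites the oldest."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.data = [0.0] * capacity
        self.write_pos = 0  # next slot to write; holds the oldest sample once full
        self.count = 0      # number of valid samples stored so far

    def push(self, sample):
        # Writing at write_pos replaces the oldest sample when the buffer is full.
        self.data[self.write_pos] = sample
        self.write_pos = (self.write_pos + 1) % self.capacity
        self.count = min(self.count + 1, self.capacity)

    def oldest_first(self):
        # Return the stored samples in order from oldest to newest.
        if self.count < self.capacity:
            return self.data[:self.count]
        return self.data[self.write_pos:] + self.data[:self.write_pos]
```

Reading with `oldest_first()` corresponds to the order in which the gain multiplication unit 15 is described as reading samples.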

The gain setting unit 12 consists of the gain calculation unit 13 and the gain limiting unit 14, and determines the gain each time the audio input signal stored in the buffer unit 11 has been entirely replaced (that is, at every predetermined time interval).

Each time the audio input signal stored in the buffer unit 11 has been entirely replaced, the gain calculation unit 13 calculates the gain based on a representative value of the audio input signal stored in the buffer unit 11. Specifically, the gain calculation unit 13 obtains the maximum absolute value of the audio input signal from the predetermined time's worth of signal stored in the buffer unit 11 and uses this maximum as the representative value. FIG. 4 shows an example waveform of the audio input signal (absolute value). The periods T1, T2, T3, and T4 in FIG. 4 each correspond to the time length of audio input signal that the buffer unit 11 can store, and in each of these periods the audio input signal stored in the buffer unit 11 is entirely replaced. At the end of each of the periods T1, T2, T3, and T4, the gain calculation unit 13 obtains the maximum values P1, P2, P3, and P4 of the audio input signal stored in the buffer unit 11 and uses them as the representative values of the audio input signal for the respective periods. The gain calculation unit 13 then calculates the gain so that the maximum value (representative value) of the audio input signal in each period, multiplied by the gain, equals a predetermined target value.
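
As an illustration of this step, the sketch below computes a peak representative value and the linear gain that maps it onto a target level. The function names and the handling of an all-zero block are assumptions made for the example, not details from the patent.

```python
def representative_peak(samples):
    """Representative value: the maximum absolute value over one buffer period."""
    return max(abs(s) for s in samples)


def gain_for_target(representative, target_level):
    """Linear gain such that representative * gain equals target_level."""
    if representative == 0:
        # Assumption: an all-zero block has no defined gain; the caller decides.
        raise ValueError("representative value is zero")
    return target_level / representative


# Example: a block peaking at 0.5 with a target level of 0.8.
block = [0.1, -0.5, 0.25]
peak = representative_peak(block)
gain = gain_for_target(peak, 0.8)
```

Here `peak` plays the role of P1 to P4 in FIG. 4, and `gain` is the value the gain calculation unit 13 would hand to the gain limiting unit 14.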

When the gain calculation unit 13 has calculated a gain, the gain limiting unit 14 obtains the change between the gain previously calculated by the gain calculation unit 13 and the gain calculated this time. When the absolute value of this change is equal to or greater than a predetermined first threshold, the gain limiting unit 14 outputs to the gain multiplication unit 15, as the current gain, a limit value obtained by changing the previously calculated gain by a predetermined fluctuation upper limit ΔG toward the gain calculated this time. When the absolute value of the change is less than the first threshold, the gain limiting unit 14 does not limit the gain and outputs the gain calculated this time by the gain calculation unit 13 to the gain multiplication unit 15 unchanged. The first threshold is set to a value at least larger than the fluctuation upper limit ΔG.
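
The limiting rule can be sketched as follows. The example assumes gains are expressed in dB, which is suggested by the text giving ΔG in dB and writing the limit value as a sum; the function name and the numeric values in the test are hypothetical.

```python
def limit_gain(prev_gain_db, new_gain_db, first_threshold_db, delta_g_db):
    """Return the gain to use this period, given the previous and newly calculated gains.

    Assumption: all values are in dB and first_threshold_db > delta_g_db.
    """
    change = new_gain_db - prev_gain_db
    if abs(change) >= first_threshold_db:
        # Move from the previous gain toward the new gain by delta_g_db only.
        step = delta_g_db if change > 0 else -delta_g_db
        return prev_gain_db + step
    # Small change: pass the newly calculated gain through unchanged.
    return new_gain_db
```

Because the first threshold is larger than ΔG, a limited step never overshoots the newly calculated gain.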

FIG. 2(b) shows an example waveform of the audio input signal, and FIG. 2(a) shows the gain determined from that signal. The gain calculation unit 13 calculates the gain each time the audio input signal stored in the buffer unit 11 has been entirely replaced; the gains calculated at the end of the periods T1, T2, T3, and T4 are G1, G2, G3, and G4, respectively.

When the gain calculation unit 13 calculates the gain at the end of the period T2, the gain limiting unit 14 obtains the change between the previously calculated gain G1 and the gain G2 calculated this time, and compares the absolute value of this change (|G1 − G2|) with the first threshold. Because the absolute value of the change between G1 and G2 is less than the first threshold, the gain limiting unit 14 sets the gain G2 calculated this time as the gain for the period T2 and outputs it to the gain multiplication unit 15.

Likewise, when the gain calculation unit 13 calculates the gain at the end of the period T3, the gain limiting unit 14 obtains the change between the previously calculated gain G2 (the gain for the period T2) and the gain G3 calculated this time, and compares the absolute value of this change with the first threshold. Because the absolute value of the change between G2 and G3 is less than the first threshold, the gain limiting unit 14 sets the gain G3 calculated this time as the gain for the period T3 and outputs it to the gain multiplication unit 15.

When the gain calculation unit 13 calculates the gain at the end of the period T4, on the other hand, the gain limiting unit 14 obtains the change between the previously calculated gain G3 (the gain for the period T3) and the gain G4 calculated this time, and compares the absolute value of this change with the first threshold. In the period T4 the maximum value of the audio input signal is much lower than in the period T3, so the absolute value of the change between G3 and G4 is equal to or greater than the first threshold. The gain limiting unit 14 therefore sets as the current gain a limit value G4b (= G3 + ΔG), obtained by changing the previously calculated gain G3 by the predetermined fluctuation upper limit ΔG in the direction of the gain G4 calculated this time. The fluctuation upper limit ΔG is preferably set to about 3 dB in view of human auditory characteristics. At that setting, the change in volume that accompanies the gain change is hard to notice, so the volume change is suppressed without the user perceiving anything unnatural.

When the gain has been set by the gain setting unit 12, the gain multiplication unit 15 reads the audio input signal from the buffer unit 11 in order from oldest to newest, multiplies the read signal by the gain input from the gain setting unit 12, and outputs the result to the encoding/packetization processing unit 4.

The encoding/packetization processing unit 4 applies audio encoding and packetization to the signal whose volume has been corrected by the volume correction device 10, and then outputs it to the wireless transmission unit 5.

The wireless transmission unit 5 wirelessly transmits the audio signal that has been encoded and packetized by the encoding/packetization processing unit 4.

As described above, in this embodiment the sound input through the microphone 2 is A/D-converted by the A/D conversion unit 3, its volume level is corrected by the volume correction device 10, and it is then encoded, packetized, and wirelessly transmitted from the wireless transmission unit 5.

In the volume correction device 10, each time the audio input signal stored in the buffer unit 11 has been entirely replaced, the gain calculation unit 13 calculates the gain based on a representative value of the stored audio input signal. Compared with the conventional approach of calculating the gain at every sample, this reduces the volume-correction processing load. In addition, because the gain can be calculated with only the delay determined by the storage capacity of the buffer unit 11, the delay of the output sound can be reduced. Furthermore, the change between the previously calculated gain and the gain calculated this time is obtained, and when the absolute value of this change is equal to or greater than the first threshold, the gain limiting unit 14 uses as the current gain a limit value obtained by changing the previously calculated gain by the predetermined fluctuation upper limit toward the gain calculated this time. As a result, even when the calculated gain changes by the first threshold or more because the level of the input sound has changed abruptly, the gain change is held to the predetermined fluctuation upper limit, so abrupt changes in the output volume can be suppressed.
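
To show how the per-period processing fits together, here is a rough end-to-end sketch that computes one gain per block instead of one per sample. It assumes gains are handled in dB, drops any trailing partial block, and reuses the previous gain for an all-zero block; these choices, the function name `correct_volume`, and all parameter names are assumptions for illustration only.

```python
import math


def correct_volume(samples, block_len, target_level, first_threshold_db, delta_g_db):
    """Apply one gain per block of block_len samples and return the corrected samples."""
    out = []
    prev_gain_db = None
    # Assumption: a trailing partial block is ignored in this sketch.
    for start in range(0, len(samples) - block_len + 1, block_len):
        block = samples[start:start + block_len]
        peak = max(abs(s) for s in block)  # representative value (maximum)
        if peak == 0.0:
            # Assumption: keep the previous gain (0 dB if none) for a zero block.
            gain_db = prev_gain_db if prev_gain_db is not None else 0.0
        else:
            gain_db = 20.0 * math.log10(target_level / peak)
            if prev_gain_db is not None:
                change = gain_db - prev_gain_db
                if abs(change) >= first_threshold_db:
                    # Limit the change to delta_g_db toward the new gain.
                    gain_db = prev_gain_db + math.copysign(delta_g_db, change)
        prev_gain_db = gain_db
        gain = 10.0 ** (gain_db / 20.0)
        out.extend(s * gain for s in block)  # oldest sample first
    return out
```

The max, logarithm, and comparison run once per block, which is the source of the reduced processing load that the text describes; only the multiplication remains per sample.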

If the time length of the audio signal stored in the buffer unit 11 is too short, the volume correction device 10 reacts excessively to changes in the level of the input sound and volume correction is performed frequently, which increases the processing load of the volume correction device 10. Conversely, if the time length is too long, a buffer unit 11 with a large storage capacity is required, and a delay equal to the time that can be stored in the buffer unit 11 is introduced.

It is also known that the average duration of one syllable, or one character, in ordinary speech is about 130 ms (reference: Shuzo Saito and Kazuo Nakata, "Basics of Speech Information Processing" (音声情報処理の基礎), Ohmsha, 1981). In this embodiment, the time length of the audio signal stored in the buffer unit 11 is set to about several hundred ms to several thousand ms so that the volume can be adjusted using data covering at least several syllables. This keeps the buffering delay short while preventing volume correction from being performed too frequently.
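
The sizing trade-off can be checked with simple arithmetic. In the sketch below, the 130 ms mean syllable duration comes from the text; the 8 kHz sampling rate and the 500 ms buffer length are hypothetical values chosen only for the example.

```python
def buffer_samples(duration_ms, sample_rate_hz):
    """Number of samples the buffer must hold for a given duration."""
    return sample_rate_hz * duration_ms // 1000


def syllables_covered(duration_ms, mean_syllable_ms=130):
    """Approximate number of syllables spanned by one buffer period."""
    return duration_ms / mean_syllable_ms


# Hypothetical configuration: 500 ms of audio sampled at 8 kHz.
n_samples = buffer_samples(500, 8000)   # storage needed, in samples
n_syllables = syllables_covered(500)    # a little under four syllables
```

The buffer duration is also the delay added to the output, so a longer buffer covers more syllables at the cost of more memory and latency.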

It is also preferable that the gain calculation unit 13 set the gain it outputs to a predetermined upper limit value when the calculated gain exceeds that upper limit value. Limiting the gain output from the gain calculation unit 13 to within the upper limit value prevents the audio input signal from being amplified with an excessive gain, which reduces the floor noise contained in the output sound. When the device is used in a call system such as an intercom, limiting the output level also makes howling less likely to occur.
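
A small sketch of the upper-limit rule follows, using linear gains. The function name and the numeric values are hypothetical; they only illustrate how a quiet, noise-only block would otherwise be raised to the target level.

```python
def clamp_gain(calculated_gain, upper_limit):
    """Return the calculated gain, capped at upper_limit."""
    return min(calculated_gain, upper_limit)


# Hypothetical example: a block containing only floor noise with peak 0.001.
noise_peak = 0.001
target_level = 0.5
raw_gain = target_level / noise_peak       # gain needed to reach the target
limited_gain = clamp_gain(raw_gain, 10.0)  # capped at the assumed upper limit
output_peak = noise_peak * limited_gain    # noise stays far below the target
```

Without the cap, the noise peak would be raised all the way to the target level.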

When the absolute value of the change between the previously calculated gain and the gain calculated this time is equal to or greater than the first threshold, the gain limiting unit 14 uses the limit value obtained as described above as the current gain. In addition, it is also preferable that the buffer unit 11 reduce the size of the storage area.

FIG. 3(b) shows an example waveform of the audio input signal, and FIG. 3(a) shows the gain determined from that signal. The gain calculation unit 13 calculates the gain each time the audio input signal stored in the buffer unit 11 has been entirely replaced; the gains calculated from the audio input signal stored in the buffer unit 11 at the end of the periods T11, T12, and T13 are G11, G12, and G13, respectively.

At the end of the period T12, the absolute value of the change between the previously calculated gain G11 and the gain G12 calculated this time (|G11 − G12|) is less than the first threshold, so the gain limiting unit 14 uses the calculated gain G12 as the current gain.

In period T13, by contrast, the volume level of the audio input signal drops sharply compared with period T12, so the absolute value of the change between the gain G12 calculated previously (in period T12) and the gain G13 calculated this time is equal to or greater than the first threshold. The gain limiting unit 14 therefore takes as the current gain a limit value G13b, obtained by changing the previously calculated gain G12 by the predetermined variation upper limit ΔG so that it approaches the currently calculated gain G13. The gain limiting unit 14 also controls the buffer unit 11 to reduce the size of the storage area used to store the audio input signal, which shortens the time length of audio input signal that can be stored. That is, compared with the time length D1 used in periods T11 to T13, the time length D2 of the audio input signal that can be stored in the buffer unit 11 is shorter from period T14 onward. At the end of periods T14 and T15 as well, the absolute value of the change between the gain calculated this time by the gain calculation unit 13 and the previous gain is equal to or greater than the first threshold, so the gain limiting unit 14 sets as the current gain the limit value obtained by adding the variation upper limit ΔG to the previous gain. At the end of period T16, the absolute value of the change between the gain calculated this time by the gain calculation unit 13 and the previous gain is less than the first threshold, so the gain calculation unit 13 sets the value calculated this time as the gain G16, which substantially matches the gain G13 calculated at the end of period T13.
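
The limiting rule described above can be sketched as follows. This is a minimal illustration rather than the patented implementation: the function name `limit_gain` and its parameter names are hypothetical, and the sketch assumes that the variation upper limit ΔG (`max_step`) is no larger than the first threshold, so a limited step never overshoots the computed gain.

```python
def limit_gain(previous: float, computed: float,
               first_threshold: float, max_step: float) -> float:
    """Return the gain to use for the current period.

    previous        -- gain used in the previous period
    computed        -- gain newly calculated from the buffered signal
    first_threshold -- first threshold on the absolute change in gain
    max_step        -- variation upper limit (delta G); assumed <= first_threshold
    """
    change = computed - previous
    if abs(change) < first_threshold:
        # Small change: accept the newly calculated gain unchanged.
        return computed
    # Large change: move from the previous gain toward the computed gain
    # by the variation upper limit only.
    return previous + (max_step if change > 0 else -max_step)
```

With `first_threshold=0.5` and `max_step=0.25`, a jump from a gain of 1.0 to a computed gain of 3.0 is held to 1.25, whereas a computed gain of 1.2 is accepted as is.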

Thus, from period T14 onward, the time length of the audio input signal stored in the buffer unit 11 is shorter, so the gain is updated at shorter intervals. Therefore, even when the change in gain is limited to within the variation upper limit ΔG, the gain can reach the target gain (the gain G13 calculated by the gain calculation unit 13 before the gain limiting unit 14 limits the change) in a shorter time. After the buffer unit 11 has reduced the size of the storage area, if the absolute value of the change between the gain calculated this time by the gain calculation unit 13 and the previous gain falls below the first threshold, the buffer unit 11 may return the size of the storage area used to store the audio input signal to its original size.
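
The effect of the shorter buffer on convergence time can be illustrated with a simplified simulation. The sketch below is an assumption-laden model, not the embodiment itself: the function `time_to_reach` is hypothetical, the target gain is held constant, the window length is the same for every update (the embodiment shortens it only after the first limited update), and ΔG is assumed not to exceed the first threshold.

```python
def time_to_reach(start: float, target: float, first_threshold: float,
                  max_step: float, window_seconds: float) -> float:
    """Return the elapsed time until the applied gain equals the target gain.

    One gain update happens per buffer window of length window_seconds.
    Assumes max_step <= first_threshold so the loop always terminates.
    """
    gain, elapsed = start, 0.0
    while gain != target:
        change = target - gain
        if abs(change) < first_threshold:
            # Change is small enough: adopt the target gain directly.
            gain = target
        else:
            # Change is limited to the variation upper limit per update.
            gain += max_step if change > 0 else -max_step
        elapsed += window_seconds
    return elapsed
```

Under these assumptions, moving from a gain of 1.0 to 2.0 with a threshold and step of 0.25 takes four updates: 4.0 seconds with a 1.0-second window, but 1.0 second with a 0.25-second window.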

In this embodiment, after the audio input signal stored in the buffer unit 11 has been entirely replaced, the gain calculation unit 13 obtains the maximum absolute value of the audio input signal stored in the buffer unit 11 and uses this maximum as the representative value. The gain calculation unit 13 then sets the gain so that the value obtained by multiplying the representative value (maximum) of the audio input signal by the gain equals a predetermined target value. This keeps the maximum value of the signal output from the volume correction device 10 constant.
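
As a minimal sketch of this peak-based calculation, the gain is simply the target value divided by the peak absolute sample value in the buffer. The function name `gain_from_peak` is hypothetical, and raising an error for an all-zero buffer is an assumption of this sketch, not something the description specifies.

```python
def gain_from_peak(samples: list[float], target: float) -> float:
    """Return the gain that scales the peak absolute sample to the target value."""
    peak = max(abs(s) for s in samples)  # representative value (maximum)
    if peak == 0:
        # Assumption of this sketch: an all-zero buffer has no defined gain.
        raise ValueError("buffer contains only zeros")
    return target / peak
```

For the buffer `[0.1, -0.5, 0.25]` and a target of 1.0 the peak is 0.5, so the gain is 2.0 and the largest output sample has magnitude 1.0.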

When the average level of the signal output from the volume correction device 10 is to be kept constant, the average value of the audio input signal stored in the buffer unit 11 may be used as the representative value, and the gain may be set so that the value obtained by multiplying this representative value (average) by the gain equals a predetermined target value. FIG. 5 shows an example waveform of the audio input signal (absolute value); the time lengths of periods T1, T2, T3, and T4 correspond to the time length of the audio input signal stored in the buffer unit 11. Each time the audio input signal stored in the buffer unit 11 has been entirely replaced (that is, at the end of each of periods T1, T2, T3, and T4), the gain calculation unit 13 obtains the average values A1, A2, A3, and A4 of the audio input signal in periods T1, T2, T3, and T4, respectively. The gain calculation unit 13 then uses the average obtained for each period as the representative value of the audio input signal in that period and calculates the gain so that the representative value multiplied by the gain equals the predetermined target value. In this way the average value of the signal output from the volume correction device 10 can be brought to the predetermined target value, and the average volume level can be kept constant.
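
The per-period, average-based calculation can be sketched as below. The function `gains_per_period` and its arguments are hypothetical; the sketch assumes the average is taken over absolute sample values (as in the waveform of FIG. 5), processes only complete windows, and treats an all-zero window as an error, since silent periods are handled separately in the description.

```python
def gains_per_period(samples: list[float], window: int,
                     target: float) -> list[float]:
    """Return one gain per complete window of `window` samples."""
    gains = []
    for start in range(0, len(samples) - window + 1, window):
        block = samples[start:start + window]
        # Representative value: mean of the absolute sample values.
        mean_abs = sum(abs(s) for s in block) / window
        if mean_abs == 0:
            # Assumption of this sketch: silent windows are not handled here.
            raise ValueError("window contains only zeros")
        gains.append(target / mean_abs)
    return gains
```

For example, with a window of two samples and a target of 0.5, the stream `[0.5, -0.5, 0.25, 0.25]` has period averages 0.5 and 0.25, giving gains 1.0 and 2.0.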

As shown in FIGS. 6(a) and 6(b), when the volume level of the audio input signal rises sharply in period T4, the gain G4 calculated by the gain calculation unit 13 is significantly lower than the previously calculated gain G3. If the absolute value of the change between gain G3 and gain G4 exceeds the first threshold, the gain limiting unit 14 sets as the current gain a value G4b obtained by changing the previously calculated gain G3 by the variation upper limit ΔG in the direction approaching the currently calculated gain G4. In this case the gain in period T4 has not been lowered sufficiently, so the product of the representative value of the audio input signal in period T4 and the gain G4b may exceed a predetermined second threshold. When the limit value is taken as the current gain and the product of this gain and the representative value exceeds the predetermined second threshold, the gain limiting unit 14 instead takes as the current gain the gain calculated by the gain calculation unit 13 such that the representative value multiplied by the gain equals the predetermined target value. As a result, when the signal level of the sound input to the microphone 2 rises sharply, the gain is changed (lowered) by more than the first threshold, which prevents a signal of excessive volume from being output. The second threshold need only be larger than the target value and be set to a value corresponding to the upper limit level of the output volume.
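
The interaction between the change limit and the second threshold can be sketched as follows. The function `select_gain` and its parameter names are hypothetical, and the sketch assumes the computed gain has already been derived from the same representative value and target.

```python
def select_gain(previous: float, computed: float, representative: float,
                first_threshold: float, max_step: float,
                second_threshold: float) -> float:
    """Return the current gain, bypassing the change limit on loud input."""
    change = computed - previous
    if abs(change) < first_threshold:
        return computed
    # Limited gain: previous gain moved toward the computed gain by delta G.
    limited = previous + (max_step if change > 0 else -max_step)
    if limited * representative > second_threshold:
        # The limited gain would still produce excessive output level,
        # so the computed gain is used despite the large change.
        return computed
    return limited
```

With a previous gain of 4.0, a representative value of 1.0, and a computed gain of 1.0, the limited gain 3.5 would give an output level of 3.5; if the second threshold is 1.5, the computed gain 1.0 is used instead.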

In this embodiment it is also preferable that, when the representative value of the audio input signal stored in the buffer unit 11 is at or below a determination level used to distinguish a silent state from a sound-present state, the gain calculation unit 13 does not calculate the gain and the gain multiplication unit 15 uses the gain previously calculated by the gain calculation unit 13.

FIG. 7(b) shows an example waveform of the audio input signal, and FIG. 7(a) shows the gain determined from that signal. In the illustrated example, the representative value (for example, the maximum) of the audio input signal stored in the buffer unit 11 exceeds the predetermined determination level L1 in periods T1, T2, and T3, so the gain calculation unit 13 calculates the gain from the representative value of the audio input signal in each period. In period T4, on the other hand, the representative value (maximum) of the audio input signal stored in the buffer unit 11 is at or below the determination level L1, so the gain calculation unit 13 does not calculate the gain. In period T4 the gain multiplication unit 15 uses the gain G3 previously calculated by the gain calculation unit 13, multiplies the audio input signal read from the buffer unit 11 by the gain G3, and outputs the result to the encoding and packetization processing unit 4.

If the gain calculation unit 13 calculated the gain when the representative value of the audio input signal is at or below the determination level L1, that is, in a silent state in which no meaningful signal is being input, the gain would be set to an excessive value. Because the gain calculation unit 13 does not calculate the gain when the representative value is at or below the determination level L1, and the audio input signal is instead multiplied by the previously calculated gain, the likelihood of floor noise being amplified with an excessive gain is reduced. The determination level L1 is preferably set to a value larger than the volume level of the floor noise and smaller than the lower limit of the representative value of a meaningful signal (that is, a speech signal), so that meaningful signals and floor noise can be reliably distinguished.
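
A minimal sketch of this gain hold during silence is shown below. The function `update_gain` and the parameter `silence_level` (standing in for the determination level L1) are hypothetical, and the peak absolute value is assumed as the representative value.

```python
def update_gain(samples: list[float], previous_gain: float,
                target: float, silence_level: float) -> float:
    """Return the gain for the current period, holding it during silence."""
    representative = max(abs(s) for s in samples)
    if representative <= silence_level:
        # At or below the determination level: skip the calculation and
        # keep the previous gain so floor noise is not over-amplified.
        return previous_gain
    return target / representative
```

With a determination level of 0.01, a buffer containing only floor noise such as `[0.001, -0.002]` leaves the previous gain unchanged, whereas recalculating from that buffer would yield a gain of 500 for a target of 1.0.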

In this embodiment it is also preferable to provide a speech determination unit (not shown) that determines whether the audio input signal input to the buffer unit 11 is speech or non-speech. In this case, when the speech determination unit determines that all of the audio input signal input to the buffer unit 11 is non-speech, the gain calculation unit 13 does not calculate the gain, and the gain multiplication unit 15 multiplies the audio input signal by the gain previously calculated by the gain calculation unit 13.

For example, for the audio input signal shown in FIG. 8(b), at least part of the audio input signal stored in the buffer unit 11 is determined to be speech in periods T1, T2, and T3. The gain calculation unit 13 therefore calculates the gain in periods T1, T2, and T3, and the gains in those periods are G1, G2, and G3, respectively (see FIG. 8(a)). In period T4, on the other hand, the speech determination unit determines that all of the audio input signal stored in the buffer unit 11 is non-speech, and in this case the gain calculation unit 13 does not calculate the gain. In period T4 the gain multiplication unit 15 uses the gain G3 previously calculated by the gain calculation unit 13, multiplies the audio input signal read from the buffer unit 11 by the gain G3, and outputs the result to the encoding and packetization processing unit 4. If the gain were obtained from the non-speech signal in period T4, the resulting gain G21 would be far larger than a gain obtained from a speech signal (for example, the gain G3 in period T3), and multiplying the audio input signal by this gain G21 would amplify noise with a high gain. In this embodiment, by contrast, the gain is not calculated in period T4, in which all of the audio input signal is determined to be non-speech, and the audio input signal is amplified using the previously obtained gain G3, so noise is kept from being amplified with a very high gain.

Thus, when the speech determination unit determines that all of the audio input signal input to the buffer unit 11 is non-speech, the gain multiplication unit 15 multiplies the audio input signal read from the buffer unit 11 by the gain previously calculated by the gain calculation unit 13. Noise other than speech is therefore kept from being amplified with a high gain.

The speech determination unit determines whether the signal is speech or non-speech as follows. It obtains a long-time average of the audio input signal received from the A/D conversion unit 3 over a relatively long time and a short-time average of the audio input signal over a relatively short time. The level of the short-time average is considered to be determined by the speech component of the audio input signal, and the level of the long-time average by the noise component (sound components other than speech). The speech determination unit determines that the signal is speech if the ratio of the short-time average to the long-time average is equal to or greater than a predetermined reference value and non-speech if the ratio is less than the reference value, which allows speech and non-speech to be reliably distinguished. The method of determining speech or non-speech is not limited to the above; for example, the determination may be made from the input of a switch that the speaker operates when speaking.
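
The ratio test described above can be sketched as follows. The function `is_speech` and its parameters are hypothetical; the sketch assumes both averages are taken over absolute sample values ending at the most recent sample, that at least `long_len` samples are available, and that an all-zero long window counts as non-speech.

```python
def is_speech(samples: list[float], short_len: int, long_len: int,
              reference_ratio: float) -> bool:
    """Classify the most recent samples as speech (True) or non-speech (False)."""
    # Short-time average: dominated by the speech component, if present.
    short_avg = sum(abs(s) for s in samples[-short_len:]) / short_len
    # Long-time average: taken as an estimate of the background noise level.
    long_avg = sum(abs(s) for s in samples[-long_len:]) / long_len
    if long_avg == 0:
        # Assumption of this sketch: pure digital silence is non-speech.
        return False
    return short_avg / long_avg >= reference_ratio
```

For instance, eight noise samples of magnitude 0.01 followed by two samples of magnitude 0.5 give a short-to-long ratio of roughly 4.6 with `short_len=2` and `long_len=10`, which is classified as speech for a reference ratio of 2.0; steady noise alone gives a ratio near 1 and is classified as non-speech.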

DESCRIPTION OF SYMBOLS
10 Volume correction device
11 Buffer unit
12 Gain setting unit
13 Gain calculation unit
14 Gain limiting unit
15 Gain multiplication unit

Claims (9)

1. A volume correction device comprising:
a buffer unit that has a storage area capable of storing a predetermined time's worth of audio input signal and that, when the storage area has no free space, overwrites the storage area holding the oldest audio input signal with a new audio input signal;
a gain calculation unit that calculates a gain based on a representative value of the audio input signal stored in the buffer unit each time the audio input signal stored in the buffer unit has been entirely replaced;
a gain limiting unit that obtains the change between the gain previously calculated by the gain calculation unit and the gain calculated this time and, when the absolute value of the change is equal to or greater than a predetermined first threshold, takes as the current gain a limit value obtained by changing the previously calculated gain by a predetermined variation upper limit so as to approach the gain calculated this time; and
a gain multiplication unit that, when the gain has been set by the gain calculation unit and the gain limiting unit, reads the audio input signal from the buffer unit in order from the oldest, multiplies the read audio input signal by the gain, and outputs the result.
2. The volume correction device according to claim 1, wherein the gain calculation unit obtains the maximum absolute value of the audio input signal stored in the buffer unit and uses the maximum as the representative value.
3. The volume correction device according to claim 1, wherein the gain calculation unit obtains the average of the absolute values of the audio input signal stored in the buffer unit and uses the average as the representative value.
4. The volume correction device according to any one of claims 1 to 3, wherein the gain calculation unit sets the gain so that the value obtained by multiplying the representative value by the gain equals a preset target value.
5. The volume correction device according to any one of claims 1 to 4, wherein, when the gain limiting unit takes the limit value as the current gain and the product of that gain and the representative value exceeds a predetermined second threshold, the gain limiting unit takes as the current gain the gain calculated by the gain calculation unit such that the value obtained by multiplying the representative value by the gain equals a predetermined target value.
6. The volume correction device according to any one of claims 1 to 5, wherein the buffer unit reduces the size of the storage area when the absolute value of the change is equal to or greater than the first threshold and the gain limiting unit takes the limit value as the current gain.
7. The volume correction device according to any one of claims 1 to 6, wherein, when the representative value is at or below a determination level for determining whether the state is silent or sound-present, the gain multiplication unit multiplies the audio input signal read from the buffer unit by the gain previously calculated by the gain calculation unit and outputs the result.
8. The volume correction device according to any one of claims 1 to 6, further comprising a speech determination unit that determines whether the audio input signal input to the buffer unit is speech or non-speech, wherein, when the speech determination unit determines that all of the audio input signal input to the buffer unit is non-speech, the gain multiplication unit multiplies the audio input signal read from the buffer unit by the gain previously calculated by the gain calculation unit and outputs the result.
9. The volume correction device according to any one of claims 1 to 8, wherein the gain calculation unit sets the gain to a predetermined upper limit value when the calculated gain exceeds the upper limit value.
JP2012197869A 2012-09-07 2012-09-07 Volume correction device Active JP6065308B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2012197869A JP6065308B2 (en) 2012-09-07 2012-09-07 Volume correction device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2012197869A JP6065308B2 (en) 2012-09-07 2012-09-07 Volume correction device

Publications (2)

Publication Number Publication Date
JP2014052553A true JP2014052553A (en) 2014-03-20
JP6065308B2 JP6065308B2 (en) 2017-01-25

Family

ID=50611069

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2012197869A Active JP6065308B2 (en) 2012-09-07 2012-09-07 Volume correction device

Country Status (1)

Country Link
JP (1) JP6065308B2 (en)

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001095082A (en) * 1999-09-24 2001-04-06 Yamaha Corp Directional loudspeaker
JP2004129187A (en) * 2002-07-29 2004-04-22 Matsushita Electric Works Ltd Automatic gain control instrument
JP2006042333A (en) * 2004-07-24 2006-02-09 Samsung Electronics Co Ltd Volume automatic correction device by channel change and its method
JP2007324741A (en) * 2006-05-30 2007-12-13 Oki Electric Ind Co Ltd Automatic gain controller
WO2007141923A1 (en) * 2006-06-02 2007-12-13 Nec Corporation Gain control system, gain control method, and gain control program
JP2008070877A (en) * 2006-09-15 2008-03-27 Aisin Seiki Co Ltd Voice signal pre-processing device, voice signal processing device, voice signal pre-processing method and program for voice signal pre-processing
JP2008070878A (en) * 2006-09-15 2008-03-27 Aisin Seiki Co Ltd Voice signal pre-processing device, voice signal processing device, voice signal pre-processing method and program for voice signal pre-processing
JP2008261978A (en) * 2007-04-11 2008-10-30 Toshiba Microelectronics Corp Reproduction volume automatically adjustment method
JP2009171057A (en) * 2008-01-11 2009-07-30 Sanyo Electric Co Ltd Amplifier circuit
WO2011074164A1 (en) * 2009-12-15 2011-06-23 パナソニック株式会社 Automatic gain control device, receiver, electronic device, and automatic gain control method
JP2012147326A (en) * 2010-12-20 2012-08-02 Panasonic Corp Outside line transfer apparatus for apartment house intercom system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JPN6016012860; Peter L. CHU: 'Voice-Activated AGC for Teleconferencing' Proc. ICASSP 1996 Vol.2, 19960507, pp.929-932, IEEE *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015219285A (en) * 2014-05-14 2015-12-07 沖電気工業株式会社 Speech enhancement device, program, and speech decryption device, program
JPWO2017042875A1 (en) * 2015-09-08 2018-07-19 パイオニア株式会社 Electronic auscultation apparatus, control method, computer program, and recording medium
US11191486B2 (en) 2017-09-19 2021-12-07 Ausculsciences, Inc. System and method for detecting decoupling of an auscultatory sound sensor from a test-subject
CN109716432A (en) * 2018-11-30 2019-05-03 深圳市汇顶科技股份有限公司 Gain process method and device thereof, electronic equipment, signal acquisition method and its system
JP2022530903A (en) * 2019-04-29 2022-07-04 北京声智科技有限公司 Automatic gain control method and its device, readable recording medium
JP7333972B2 (en) 2019-04-29 2023-08-28 北京声智科技有限公司 Automatic gain control method and device, readable recording medium
JP2021183201A (en) * 2020-02-27 2021-12-02 パイオニア株式会社 Electronic auscultation device, control method, computer program and recording medium
JP7357031B2 (en) 2020-02-27 2023-10-12 エア・ウォーター・バイオデザイン株式会社 Electronic auscultation device and control method

Also Published As

Publication number Publication date
JP6065308B2 (en) 2017-01-25

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20150310

A711 Notification of change in applicant

Free format text: JAPANESE INTERMEDIATE CODE: A711

Effective date: 20150312

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20160405

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20160606

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20161115

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20161209

R151 Written notification of patent or utility model registration

Ref document number: 6065308

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R151