JP2014052553A

JP2014052553A - Sound volume correction device

Info

Publication number: JP2014052553A
Application number: JP2012197869A
Authority: JP
Inventors: Teppei Washi; 哲平鷲; Minoru Fukushima; 実福島
Original assignee: Panasonic Corp
Current assignee: Panasonic Corp
Priority date: 2012-09-07
Filing date: 2012-09-07
Publication date: 2014-03-20
Anticipated expiration: 2032-09-07
Also published as: JP6065308B2

Abstract

PROBLEM TO BE SOLVED: To provide a sound volume correction device capable of reducing the amount of volume correction processing. SOLUTION: A buffer part 11 has a storage region capable of storing sound input signals for a predetermined period of time. When the storage region has no free space, the buffer part 11 overwrites the storage region holding the oldest sound input signals with new sound input signals. Every time all sound input signals stored in the buffer part 11 have been replaced with new ones, a gain calculation section 13 calculates a gain based on a representative value of the sound input signals stored in the buffer part 11. A gain limitation section 14 calculates the amount of change between the gain previously calculated by the gain calculation section 13 and the gain calculated this time. When the absolute value of the amount of change is larger than a predetermined first threshold value, a limitation value, obtained by changing the previously calculated gain by a predetermined upper limit of change so that it approaches the gain calculated this time, is used as the gain this time. A gain multiplication section 15 multiplies a sound input signal read out from the buffer part 11 by the gain set by the gain calculation section 13 and the gain limitation section 14, and outputs the result.

Description

The present invention relates to a volume correction device.

Conventionally, there has been a voice transmission system that includes a voice transmitting terminal device that transmits voice signals in packets and a voice receiving terminal device that outputs voice based on the received voice signals, and that carries out voice calls in real time (see, for example, Patent Document 1).

In this voice transmission system, the voice receiving terminal device decodes one voice frame of the voice data contained in a packet received from the voice transmitting terminal device, and then applies abnormal-noise countermeasure processing to the decoded digital voice signal to suppress abnormal noise caused by transmission errors. Specifically, on the voice transmitting terminal device side, a first difference calculation means calculates the absolute value of the difference between the quantized value of each sample point of the digital voice signal and the quantized value of the immediately preceding sample point, and obtains the maximum of these absolute values for one voice frame. The abnormal-noise countermeasure processing means of the voice receiving terminal device then calculates the absolute value of the difference between the quantized value of each sample point of the digital voice signal and that of the immediately preceding sample point. When this absolute value exceeds the above maximum value, it judges that abnormal noise is present and performs processing to suppress it.

Patent Document 1: JP 2008-292961 A

In a voice transmission system that carries out voice calls in real time as described above, the gain must be adjusted adaptively as the signal level of the audio input signal changes. Conventionally, therefore, every time the audio input signal was sampled, the amplitude of the voice as a whole was estimated from the sampled instantaneous value, and the gain was calculated from that estimate. Because this processing is performed for every sample, the amount of signal processing becomes large, and a high-performance microcomputer has to be used for the arithmetic processing unit.

The present invention has been made in view of the above problem, and its object is to provide a volume correction device that reduces the amount of processing required for volume correction.

The volume correction device of the present invention includes a buffer unit, a gain calculation unit, a gain limiting unit, and a gain multiplication unit. The buffer unit has a storage area capable of storing a predetermined time's worth of the audio input signal, and when the storage area has no free space, it overwrites the storage area holding the oldest audio input signal with the new audio input signal. The gain calculation unit calculates a gain based on a representative value of the audio input signal stored in the buffer unit every time the audio input signal stored in the buffer unit has been entirely replaced. The gain limiting unit obtains the change between the gain previously calculated by the gain calculation unit and the gain calculated this time, and when the absolute value of the change is equal to or greater than a predetermined first threshold, it sets, as the current gain, a limit value obtained by changing the previously calculated gain by a predetermined variation upper limit so that it approaches the gain calculated this time. When the gain has been set by the gain calculation unit and the gain limiting unit, the gain multiplication unit reads the audio input signal from the buffer unit in order from the oldest, multiplies the read audio input signal by the gain, and outputs the result.

In this volume correction device, it is also preferable that the gain calculation unit obtains the maximum of the absolute values of the audio input signal stored in the buffer unit and uses that maximum as the representative value.

In this volume correction device, it is also preferable that the gain calculation unit obtains the average of the absolute values of the audio input signal stored in the buffer unit and uses that average as the representative value.

In this volume correction device, it is also preferable that the gain calculation unit sets the gain so that the representative value multiplied by the gain equals a preset target value.

In this volume correction device, it is also preferable that, if the product of the gain and the representative value exceeds a predetermined second threshold when the gain limiting unit has set the limit value as the current gain, the gain limiting unit instead sets, as the current gain, the gain calculated by the gain calculation unit such that the representative value multiplied by the gain equals the predetermined target value.
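The second-threshold override described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `resolve_gain`, the use of a linear (multiplicative) gain, and the example threshold of 32767 (chosen here as a 16-bit full-scale value) are all assumptions introduced for illustration.

```python
def resolve_gain(limit_value, calculated_gain, representative, second_threshold):
    """Pick the gain to use when the limiter has produced `limit_value`.

    All gains are linear multipliers (an assumption of this sketch).
    If applying the limited gain to the representative value would exceed
    the second threshold, fall back to the gain that the gain calculation
    step produced for the target value.
    """
    if limit_value * representative > second_threshold:
        return calculated_gain
    return limit_value


# Hypothetical numbers: a limited gain of 50 applied to a representative
# value of 800 would give 40000, above the assumed threshold of 32767,
# so the freshly calculated gain (10) is used instead.
chosen = resolve_gain(limit_value=50.0, calculated_gain=10.0,
                      representative=800, second_threshold=32767)
```

The override matters when the input suddenly becomes loud: a gain that is only allowed to fall by one step could otherwise drive the output well past the intended level.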

In this volume correction device, it is also preferable that the buffer unit reduces the size of the storage area when the absolute value of the change is equal to or greater than the first threshold and the gain limiting unit has set the limit value as the current gain.

In this volume correction device, it is also preferable that, if the representative value is at or below a determination level used to judge whether the signal is silent or voiced, the gain multiplication unit multiplies the audio input signal read from the buffer unit by the gain previously calculated by the gain calculation unit and outputs the result.
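A minimal sketch of this silence handling is shown below. The function name `select_gain` and the numeric values in the example are hypothetical; the only behavior taken from the text is that the previous gain is reused when the representative value is at or below the determination level.

```python
def select_gain(representative, silence_level, previous_gain, new_gain):
    """Hold the previous gain while the block is judged to be silent.

    A block whose representative value is at or below `silence_level`
    is treated as silence, so its (very large) newly calculated gain
    is ignored and the previous gain is reused.
    """
    if representative <= silence_level:
        return previous_gain
    return new_gain


# Hypothetical values: a near-silent block keeps the earlier gain of 4.0
# instead of jumping to 400.0, which would mostly amplify background noise.
held = select_gain(representative=20, silence_level=50,
                   previous_gain=4.0, new_gain=400.0)
```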

This volume correction device preferably also includes a voice determination unit that determines whether the audio input signal input to the buffer unit is voice or non-voice. When the voice determination unit determines that all of the audio input signal held in the buffer unit is non-voice, the gain multiplication unit multiplies the audio input signal read from the buffer unit by the gain previously calculated by the gain calculation unit and outputs the result.

In this volume correction device, it is also preferable that the gain calculation unit sets the gain to a predetermined upper limit value when the calculated gain exceeds that upper limit value.

According to the present invention, the gain calculation unit calculates the gain based on the representative value of the audio input signal stored in the buffer unit every time the audio input signal stored in the buffer unit has been entirely replaced. Compared with the conventional approach of calculating the gain for every sample, the amount of volume correction processing can therefore be reduced, and because the gain can be calculated with only the delay that corresponds to the storage capacity of the buffer unit, the delay of the output voice can also be reduced. In addition, the change between the gain previously calculated by the gain calculation unit and the gain calculated this time is obtained, and when the absolute value of this change is equal to or greater than the first threshold, the gain limiting unit sets, as the current gain, a limit value obtained by changing the previously calculated gain by the predetermined variation upper limit so that it approaches the gain calculated this time. As a result, even when the calculated gain changes by the first threshold or more because the volume level of the input voice has changed abruptly, the change in gain is held to the predetermined variation upper limit, so abrupt changes in the volume of the output voice can be suppressed.

FIG. 1 is a block diagram of the present embodiment.

FIG. 2 is a waveform diagram explaining the operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

FIG. 3 is a waveform diagram explaining another operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

FIG. 4 is an explanatory diagram of a method of obtaining the representative value from the audio input signal stored in the buffer unit of the embodiment.

FIG. 5 is an explanatory diagram of another method of obtaining the representative value from the audio input signal stored in the buffer unit of the embodiment.

FIG. 6 is a waveform diagram explaining yet another operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

FIG. 7 is a waveform diagram explaining still another operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

FIG. 8 is a waveform diagram explaining a further operation of the embodiment, in which (a) shows the gain and (b) shows the absolute value of the audio input signal.

Embodiments of the present invention are described below with reference to the drawings.

FIG. 1 is a block diagram of an audio output device 1 that uses a volume correction device 10 according to the present invention.

The audio output device 1 includes a microphone 2, an A/D conversion unit 3, the volume correction device 10, an encoding/packetization processing unit 4, and a wireless transmission unit 5.

The microphone 2 converts the input sound into an electrical signal and outputs it to the A/D conversion unit 3.

The A/D conversion unit 3 performs A/D conversion on the electrical signal input from the microphone 2. The audio input signal converted into a digital signal by the A/D conversion unit 3 is then output to the volume correction device 10.

The volume correction device 10 has, as its main components, a buffer unit 11, a gain setting unit 12 (consisting of a gain calculation unit 13 and a gain limiting unit 14), and a gain multiplication unit 15, and corrects the volume level of the audio input signal input from the A/D conversion unit 3. The gain setting unit 12 and the gain multiplication unit 15 are realized, for example, by having a microcomputer execute an embedded program.

The buffer unit 11 has a storage area capable of storing a predetermined time's worth of the audio input signal input from the A/D conversion unit 3. When the storage area has no free space, the buffer unit 11 overwrites the storage area holding the oldest audio input signal with the new audio input signal.
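The overwrite-oldest behavior of the buffer unit 11 corresponds to a ring (circular) buffer. The sketch below is one possible illustration under that assumption; the class name `RingBuffer` and its methods are hypothetical and do not come from the patent.

```python
class RingBuffer:
    """Fixed-capacity sample store that overwrites the oldest sample when full."""

    def __init__(self, capacity):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self.data = [0] * capacity
        self.write_pos = 0   # slot that receives the next sample
        self.count = 0       # number of valid samples currently stored

    def push(self, sample):
        # Once the buffer is full, write_pos points at the oldest sample,
        # so this assignment overwrites it with the newest one.
        self.data[self.write_pos] = sample
        self.write_pos = (self.write_pos + 1) % self.capacity
        self.count = min(self.count + 1, self.capacity)

    def oldest_first(self):
        """Return the stored samples ordered from oldest to newest."""
        start = (self.write_pos - self.count) % self.capacity
        return [self.data[(start + i) % self.capacity] for i in range(self.count)]


buf = RingBuffer(3)
for sample in (1, 2, 3, 4):
    buf.push(sample)
ordered = buf.oldest_first()   # the first sample (1) has been overwritten
```

Reading with `oldest_first` matches the order in which the gain multiplication unit is described as consuming the samples.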

The gain setting unit 12 consists of the gain calculation unit 13 and the gain limiting unit 14, and determines the gain every time the audio input signal stored in the buffer unit 11 has been entirely replaced (that is, once every predetermined time).

The gain calculation unit 13 calculates the gain based on the representative value of the audio input signal stored in the buffer unit 11 every time the audio input signal stored in the buffer unit 11 has been entirely replaced. Specifically, the gain calculation unit 13 obtains the maximum of the absolute values of the audio input signal from the predetermined time's worth of audio input signal stored in the buffer unit 11, and uses this maximum as the representative value. FIG. 4 shows an example of the waveform of the audio input signal (absolute value). Each of the periods T1, T2, T3, and T4 in FIG. 4 corresponds to the time length of audio input signal that can be stored in the buffer unit 11, so the audio input signal stored in the buffer unit 11 is entirely replaced in each of the periods T1, T2, T3, and T4. At the end of each of the periods T1, T2, T3, and T4, the gain calculation unit 13 obtains the maximum value P1, P2, P3, or P4 of the audio input signal stored in the buffer unit 11, and uses it as the representative value of the audio input signal in that period. The gain calculation unit 13 then calculates the gain so that the maximum value (representative value) of the audio input signal in each period, multiplied by the gain, equals a predetermined target value.
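The gain calculation described here amounts to dividing the target value by the peak absolute value of each block. The following is a minimal sketch under that reading; the function names, the sample values, and the target of 8000 are illustrative assumptions, and the handling of an all-zero block (returning `None`) is a choice made for this sketch that the text does not specify.

```python
def peak_representative(samples):
    """Representative value: the maximum absolute sample value in the block."""
    return max(abs(s) for s in samples)


def gain_for_target(representative, target):
    """Linear gain g such that representative * g == target.

    Returns None for a zero representative value, where no finite gain
    exists (assumption of this sketch).
    """
    if representative == 0:
        return None
    return target / representative


# Hypothetical block: the peak is 400, so a target of 8000 needs a gain of 20.
block = [100, -400, 250]
peak = peak_representative(block)
gain = gain_for_target(peak, target=8000)
```

Because the calculation runs once per block rather than once per sample, its cost is one pass over the buffer plus a single division.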

When the gain calculation unit 13 has calculated a gain, the gain limiting unit 14 obtains the change between the gain previously calculated by the gain calculation unit 13 and the gain calculated this time. If the absolute value of this change is equal to or greater than the predetermined first threshold, the gain limiting unit 14 outputs to the gain multiplication unit 15, as the current gain, a limit value obtained by changing the previously calculated gain by a predetermined variation upper limit ΔG so that it approaches the gain calculated this time. If, on the other hand, the absolute value of the change is less than the first threshold, the gain limiting unit 14 does not limit the gain and outputs the gain calculated this time by the gain calculation unit 13 to the gain multiplication unit 15 unchanged. The first threshold is set to a value that is at least larger than the variation upper limit ΔG.
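The limiting rule can be sketched as a small function. The sketch assumes that gains are expressed in dB, which is suggested by the additive ΔG but not stated as a requirement; the function name `limit_gain` and the numeric values are hypothetical.

```python
import math


def limit_gain(previous_db, calculated_db, first_threshold_db, max_step_db):
    """Return the gain to use this time, in dB.

    If the change from the previous gain is at least the first threshold,
    move the previous gain by max_step_db toward the calculated gain;
    otherwise use the calculated gain as is.
    """
    if not first_threshold_db > max_step_db > 0:
        raise ValueError("first threshold must exceed the (positive) step")
    change = calculated_db - previous_db
    if abs(change) >= first_threshold_db:
        return previous_db + math.copysign(max_step_db, change)
    return calculated_db


# Hypothetical values: first threshold 6 dB, variation upper limit 3 dB.
rising = limit_gain(10.0, 20.0, 6.0, 3.0)    # large increase is limited
small = limit_gain(10.0, 12.0, 6.0, 3.0)     # small change passes through
falling = limit_gain(20.0, 5.0, 6.0, 3.0)    # large decrease is limited
```

Requiring the first threshold to exceed the step, as the text does, guarantees that a limited step never overshoots the calculated gain.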

FIG. 2(b) shows an example of the waveform of the audio input signal, and FIG. 2(a) shows the gain determined from that audio input signal. The gain calculation unit 13 calculates the gain every time the audio input signal stored in the buffer unit 11 has been entirely replaced, and the gains calculated at the end of the periods T1, T2, T3, and T4 are G1, G2, G3, and G4, respectively.

When the gain calculation unit 13 calculates a gain at the end of period T2, the gain limiting unit 14 obtains the change between the previously calculated gain G1 and the currently calculated gain G2, and compares the absolute value of this change (|G1 − G2|) with the first threshold. Because the absolute value of the change between gain G1 and gain G2 is less than the first threshold, the gain limiting unit 14 sets the currently calculated gain G2 as the gain for period T2 and outputs it to the gain multiplication unit 15.

Likewise, when the gain calculation unit 13 calculates a gain at the end of period T3, the gain limiting unit 14 obtains the change between the previously calculated gain G2 (the gain for period T2) and the currently calculated gain G3, and compares the absolute value of this change with the first threshold. Because the absolute value of the change between gain G2 and gain G3 is less than the first threshold, the gain limiting unit 14 sets the currently calculated gain G3 as the gain for period T3 and outputs it to the gain multiplication unit 15.

When the gain calculation unit 13 calculates a gain at the end of period T4, on the other hand, the gain limiting unit 14 obtains the change between the previously calculated gain G3 (the gain for period T3) and the currently calculated gain G4, and compares the absolute value of this change with the first threshold. In period T4 the maximum value of the audio input signal is much lower than in period T3, so the absolute value of the change between gain G3 and gain G4 is equal to or greater than the first threshold. The gain limiting unit 14 therefore sets, as the current gain, the limit value G4b (= G3 + ΔG) obtained by changing the previously calculated gain G3 by the predetermined variation upper limit ΔG in the direction that approaches the currently calculated gain G4. The variation upper limit ΔG is preferably set to about 3 dB in view of human auditory characteristics. At this setting the change in volume that accompanies a change in gain is hard to notice, so changes in volume can be kept small enough that the user does not find them unnatural.
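To give a sense of scale for a ΔG of about 3 dB, the sketch below converts between dB and a linear amplitude multiplier. It assumes the usual amplitude convention (20·log10 of the ratio); the text does not state which convention is intended, and the function names are hypothetical.

```python
import math


def db_to_amplitude_ratio(gain_db):
    """Convert a gain in dB to a linear amplitude multiplier (20*log10 convention)."""
    return 10 ** (gain_db / 20)


def amplitude_ratio_to_db(ratio):
    """Convert a linear amplitude multiplier to dB (20*log10 convention)."""
    return 20 * math.log10(ratio)


# One 3 dB step scales the sample amplitude by roughly 1.41.
step_ratio = db_to_amplitude_ratio(3.0)
```

Under this convention, doubling the amplitude corresponds to about 6 dB, so a 3 dB limit allows each update to change the amplitude by a factor of roughly 1.41 at most.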

When the gain has been set by the gain setting unit 12, the gain multiplication unit 15 reads the audio input signal from the buffer unit 11 in order from the oldest, multiplies the read audio input signal by the gain input from the gain setting unit 12, and outputs the result to the encoding/packetization processing unit 4.
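Putting the stages together, the per-block flow (fill the buffer, calculate a gain from the peak, limit the change, multiply in oldest-first order) might look like the sketch below. It is an illustration under several assumptions that are not in the text: gains are tracked in dB, a trailing partial block is dropped, an all-zero block reuses the previous gain, and the function name and parameters are hypothetical.

```python
import math


def correct_volume(samples, block_len, target, first_threshold_db, max_step_db):
    """Apply block-wise volume correction and return the scaled samples."""
    out = []
    previous_db = None
    # Process whole blocks only; a trailing partial block is ignored here.
    for start in range(0, len(samples) - block_len + 1, block_len):
        block = samples[start:start + block_len]
        peak = max(abs(s) for s in block)
        if peak == 0:
            # Assumption: keep the previous gain (0 dB if none) for silence.
            gain_db = previous_db if previous_db is not None else 0.0
        else:
            gain_db = 20 * math.log10(target / peak)
            if previous_db is not None:
                change = gain_db - previous_db
                if abs(change) >= first_threshold_db:
                    gain_db = previous_db + math.copysign(max_step_db, change)
        previous_db = gain_db
        multiplier = 10 ** (gain_db / 20)
        # Samples are emitted oldest first, each multiplied by the block gain.
        out.extend(s * multiplier for s in block)
    return out


# Hypothetical input: a quiet block followed by a much louder one.
result = correct_volume([1, -2, 100, -200], block_len=2, target=2.0,
                        first_threshold_db=6.0, max_step_db=3.0)
```

In this example the second block would need a gain of about −40 dB to hit the target, but the limiter only allows a 3 dB reduction from the first block's 0 dB.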

The encoding/packetization processing unit 4 applies voice encoding and packetization to the signal whose volume has been corrected by the volume correction device 10, and then outputs it to the wireless transmission unit 5.

The wireless transmission unit 5 wirelessly transmits the voice signal that has undergone voice encoding and packetization in the encoding/packetization processing unit 4.

As described above, in the present embodiment the sound input from the microphone 2 is A/D converted by the A/D conversion unit 3, its volume level is corrected by the volume correction device 10, and it is then subjected to voice encoding and packetization and wirelessly transmitted from the wireless transmission unit 5.

In the volume correction device 10, the gain calculation unit 13 calculates the gain based on the representative value of the audio input signal stored in the buffer unit 11 every time the audio input signal stored in the buffer unit 11 has been entirely replaced. Compared with the conventional approach of calculating the gain for every sample, the amount of volume correction processing can therefore be reduced, and because the gain can be calculated with only the delay that corresponds to the storage capacity of the buffer unit 11, the delay of the output voice can also be reduced. In addition, the change between the previously calculated gain and the currently calculated gain is obtained, and when the absolute value of this change is equal to or greater than the first threshold, the gain limiting unit 14 sets, as the current gain, a limit value obtained by changing the previously calculated gain by the predetermined variation upper limit so that it approaches the currently calculated gain. As a result, even when the calculated gain changes by the first threshold or more because the volume level of the input voice has changed abruptly, the change in gain is held to the predetermined variation upper limit, so abrupt changes in the volume of the output voice can be suppressed.

If the time length of the audio signal stored in the buffer unit 11 is too short, the volume correction device 10 reacts excessively to changes in the signal level of the input voice, and volume correction is performed so frequently that the processing load of the volume correction device 10 increases. If the time length of the audio signal stored in the buffer unit 11 is too long, a buffer unit 11 with a large storage capacity is required, and a delay equal to the time that can be stored in the buffer unit 11 occurs.

It is also known that, in ordinary speech, the average duration of one syllable or one character is about 130 ms (reference: "Basics of Speech Information Processing", Shuzo Saito and Kazuo Nakata, Ohmsha, 1981). In the present embodiment, the time length of the audio signal stored in the buffer unit 11 is set to between several hundred ms and several thousand ms so that the volume can be adjusted using at least several syllables of data. This keeps the buffering delay short while preventing volume correction from being performed too frequently.
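The relationship between buffer duration, buffer size in samples, and the number of syllables covered is simple arithmetic, sketched below. The 8 kHz sampling rate and the 500 ms and 520 ms durations used in the example are assumptions chosen for illustration; only the 130 ms average syllable duration comes from the text.

```python
def buffer_samples(duration_ms, sample_rate_hz):
    """Number of samples needed to hold duration_ms of audio."""
    return sample_rate_hz * duration_ms // 1000


def syllables_covered(duration_ms, syllable_ms=130):
    """Approximate number of average-length syllables in the buffer."""
    return duration_ms / syllable_ms


# Hypothetical configuration: 500 ms of audio at an assumed 8 kHz rate.
size = buffer_samples(500, 8000)
syllables = syllables_covered(520)
```

With these assumed figures, a buffer of about half a second holds 4000 samples and spans roughly four syllables, which is in line with the stated aim of using at least several syllables per gain update.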

It is also preferable that the gain calculation unit 13 sets the gain it outputs to a predetermined upper limit value when the calculated gain exceeds that upper limit value. Limiting the gain output from the gain calculation unit 13 to the upper limit value or less prevents the audio input signal from being amplified with an excessive gain, which reduces the floor noise contained in the output voice. When the device is used in a call system such as an intercom, holding down the output level also makes howling (acoustic feedback) less likely to occur.
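The upper limit can be applied as a clamp on the calculated gain, as in the sketch below. The function name `capped_gain`, the linear gain representation, and the numeric values are hypothetical; treating a zero representative value as "use the upper limit" is an assumption of this sketch.

```python
def capped_gain(representative, target, upper_limit):
    """Gain that maps the representative value to the target, capped at upper_limit.

    A representative value of zero would need an unbounded gain, so the
    cap is returned in that case (assumption of this sketch).
    """
    if representative <= 0:
        return upper_limit
    return min(target / representative, upper_limit)


# Hypothetical values: a very quiet block would need a gain of 800,
# but the cap holds it at 100, limiting how much floor noise is amplified.
quiet = capped_gain(representative=10, target=8000, upper_limit=100.0)
normal = capped_gain(representative=400, target=8000, upper_limit=100.0)
```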

When the absolute value of the change between the previously calculated gain and the currently calculated gain is equal to or greater than the first threshold, the gain limiting unit 14 sets the limit value obtained as described above as the current gain. In addition, it is also preferable that the buffer unit 11 reduces the size of the storage area in this case.

FIG. 3(b) shows an example of the waveform of the audio input signal, and FIG. 3(a) shows the gain determined from that audio input signal. The gain calculation unit 13 calculates the gain every time the audio input signal stored in the buffer unit 11 has been entirely replaced, and the gains calculated from the audio input signal stored in the buffer unit 11 at the end of the periods T11, T12, and T13 are G11, G12, and G13, respectively.

At the end of period T12, the absolute value of the change between the previously calculated gain G11 and the currently calculated gain G12 (|G11 − G12|) is less than the first threshold, so the gain limiting unit 14 uses the calculated gain G12 as the current gain.

In period T13, on the other hand, the volume level of the audio input signal is much lower than in period T12, so the absolute value of the change between the previously calculated gain G12 (calculated for period T12) and the currently calculated gain G13 is equal to or greater than the first threshold. The gain limiting unit 14 therefore sets, as the current gain, the limit value G13b obtained by changing the previously calculated gain G12 by the predetermined variation upper limit ΔG so that it approaches the currently calculated gain G13. The gain limiting unit 14 also controls the buffer unit 11 to reduce the size of the storage area used to store the audio input signal, which shortens the time length of audio input signal that can be stored. That is, from period T14 onward, the time length D2 of audio input signal that can be stored in the buffer unit 11 is shorter than the time length D1 used in periods T11 to T13. At the end of periods T14 and T15 as well, the absolute value of the change between the gain calculated this time by the gain calculation unit 13 and the previous gain is equal to or greater than the first threshold, so the gain limiting unit 14 sets, as the current gain, the limit value obtained by adding the variation upper limit ΔG to the previous gain. At the end of period T16, the absolute value of the change between the gain calculated this time by the gain calculation unit 13 and the previous gain is less than the first threshold, so the gain calculation unit 13 sets the value calculated this time as the gain G16, which substantially matches the gain G13 calculated at the end of period T13.

From period T14 onward, then, the time length of the audio input signal stored in the buffer unit 11 is shorter, so the gain is updated at shorter intervals. Even when the change in gain is limited to the variation upper limit ΔG, the gain can therefore reach the target gain (the gain G13 calculated by the gain calculation unit 13 before the change in gain was limited by the gain limiting unit 14) in a shorter time. After the buffer unit 11 has reduced the size of the storage area, the buffer unit 11 may return the storage area used to store the audio input signal to its original size once the absolute value of the change between the gain calculated this time by the gain calculation unit 13 and the previous gain falls below the first threshold.
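The benefit of shortening the buffer can be illustrated by counting how many gain updates are needed to reach a steady target and multiplying by the block duration. The sketch below assumes gains in dB, a target gain that stays constant while the gain converges, and hypothetical durations D1 = 1000 ms and D2 = 250 ms; none of these numbers come from the text.

```python
import math


def updates_to_reach(start_db, target_db, first_threshold_db, max_step_db):
    """Count gain updates until the limited gain equals the target gain."""
    if not first_threshold_db > max_step_db > 0:
        raise ValueError("first threshold must exceed the (positive) step")
    gain = start_db
    updates = 0
    while gain != target_db:
        change = target_db - gain
        if abs(change) >= first_threshold_db:
            gain += math.copysign(max_step_db, change)   # limited step
        else:
            gain = target_db                             # unrestricted update
        updates += 1
    return updates


def next_block_ms(was_limited, normal_ms, short_ms):
    """Use the shorter buffer after a limited update, else the normal one."""
    return short_ms if was_limited else normal_ms


# A 10 dB jump with a 4 dB threshold and 3 dB step takes three limited
# updates plus one final unrestricted update.
n = updates_to_reach(0.0, 10.0, 4.0, 3.0)
# After the first limited update, the remaining updates run at D2 or D1.
remaining_short_ms = (n - 1) * next_block_ms(True, 1000, 250)
remaining_long_ms = (n - 1) * next_block_ms(False, 1000, 250)
```

With these assumed figures the remaining convergence time drops from 3000 ms to 750 ms, which mirrors the sequence of three limited updates followed by a final update described for periods T13 to T16.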

また本実施形態では、バッファ部１１内に保存された音声入力信号が全て入れ替わった後に、利得算出部１３が、バッファ部１１に保存されている音声入力信号の絶対値の最大値を求め、この最大値を代表値としている。そして、利得算出部１３は、音声入力信号の代表値（最大値）に利得を乗算して得た値が所定の目標値となるように、利得を設定している。これにより、音量補正装置１０から出力される信号の最大値を一定にすることができる。 In this embodiment, after all the audio input signals stored in the buffer unit 11 are replaced, the gain calculation unit 13 obtains the maximum value of the absolute values of the audio input signals stored in the buffer unit 11. The maximum value is the representative value. The gain calculation unit 13 sets the gain so that a value obtained by multiplying the representative value (maximum value) of the audio input signal by the gain becomes a predetermined target value. Thereby, the maximum value of the signal output from the sound volume correction apparatus 10 can be made constant.
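As a sketch of the peak-based calculation described above, the gain can be written as the target value divided by the largest absolute sample in the buffer. The function name `peak_gain` and the handling of an all-zero buffer are assumptions made for this example.

```python
def peak_gain(samples, target_peak):
    """Gain that scales the largest absolute sample to target_peak."""
    peak = max(abs(s) for s in samples)
    if peak == 0:
        # Assumption: an all-zero buffer has no meaningful gain.
        raise ValueError("buffer contains only zeros")
    return target_peak / peak


buffer_contents = [0.125, -0.25, 0.0625]   # illustrative samples
gain = peak_gain(buffer_contents, target_peak=0.5)
output = [s * gain for s in buffer_contents]
```

After multiplication, the largest absolute output sample equals the target value, which is what keeps the output peak constant from one buffer period to the next.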

If the average level of the signal output from the volume correction device 10 is to be kept constant, the average value of the audio input signal stored in the buffer unit 11 may be used as the representative value, and the gain may be set so that the value obtained by multiplying this representative value (average value) by the gain equals a predetermined target value. FIG. 5 shows an example of the waveform of the audio input signal (absolute value); the time lengths of periods T1, T2, T3, and T4 correspond to the time length of the audio input signal stored in the buffer unit 11. Each time all the audio input signals stored in the buffer unit 11 have been replaced (that is, at the end of each of periods T1, T2, T3, and T4), the gain calculation unit 13 obtains the average values A1, A2, A3, and A4 of the audio input signal in periods T1, T2, T3, and T4, respectively. The gain calculation unit 13 then uses the average value obtained for each period as the representative value of the audio input signal in that period, and calculates the gain so that the product of the representative value and the gain equals the predetermined target value. As a result, the average value of the signal output from the volume correction device 10 can be brought to the predetermined target value, and the average volume level can be kept constant.
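The average-based variant can be sketched in the same way. The example assumes the average is taken over absolute sample values, as the absolute-value waveform of FIG. 5 and the claims suggest; the function name `mean_abs_gain` and the sample data are hypothetical.

```python
def mean_abs_gain(samples, target_mean):
    """Gain that scales the mean absolute sample value to target_mean."""
    mean = sum(abs(s) for s in samples) / len(samples)
    if mean == 0:
        # Assumption: an all-zero period has no meaningful gain.
        raise ValueError("period contains only zeros")
    return target_mean / mean


# One list per buffer period (illustrative data standing in for T1 and T2).
periods = [
    [0.25, -0.75, 0.5, -0.5],
    [0.125, -0.125, 0.125, -0.125],
]
gains = [mean_abs_gain(p, target_mean=0.25) for p in periods]
```

Each period gets its own gain, so a quiet period is raised and a loud period is lowered toward the same average level.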

As shown in FIGS. 6(a) and 6(b), when the volume level of the audio input signal rises sharply in period T4, the gain G4 calculated by the gain calculation unit 13 is much lower than the previously calculated gain G3. If the absolute value of the change between gain G3 and gain G4 exceeds the first threshold, the gain limiting unit 14 sets, as the current gain, the value G4b obtained by changing the previously calculated gain G3 by the variation upper limit ΔG in the direction approaching the currently calculated gain G4. In this case, because the gain in period T4 has not been lowered sufficiently, the product of the representative value of the audio input signal in period T4 and the gain G4b may exceed a predetermined second threshold. When adopting the limit value as the current gain would make the product of that gain and the representative value exceed the second threshold, the gain limiting unit 14 instead adopts, as the current gain, the gain calculated by the gain calculation unit 13 so that the representative value multiplied by the gain equals the predetermined target value. Thus, when the signal level of the sound input to the microphone 2 rises sharply, the gain is allowed to change (decrease) beyond the first threshold, which prevents an excessively loud signal from being output. The second threshold need only be set to a value that is larger than the target value and corresponds to the upper limit of the output volume.
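The interaction between the limit value and the second threshold can be sketched as a single selection function. The name `select_gain`, the parameter names, and the numbers are hypothetical; the comparison with the first threshold uses "equal to or greater than", as in the claims.

```python
def select_gain(prev_gain, representative, target,
                first_threshold, delta_g, second_threshold):
    """Choose the gain for this update (illustrative sketch).

    computed : gain that maps the representative value onto the target
    limited  : previous gain moved toward `computed` by delta_g
    The limited gain is rejected if it would push the output level
    (limited * representative) above second_threshold.
    """
    computed = target / representative
    change = computed - prev_gain
    if abs(change) < first_threshold:
        return computed
    limited = prev_gain + (delta_g if change > 0 else -delta_g)
    if limited * representative > second_threshold:
        return computed  # bypass the limiter to avoid excessive output
    return limited


# Sudden loud input: the limited gain would still be far too high.
loud = select_gain(prev_gain=4.0, representative=1.0, target=0.5,
                   first_threshold=1.0, delta_g=1.0, second_threshold=1.0)
# Moderate rise in gain: the limited gain keeps the output within bounds.
quiet = select_gain(prev_gain=1.0, representative=0.25, target=0.5,
                    first_threshold=0.5, delta_g=0.5, second_threshold=1.0)
```

In the first call the limiter is bypassed and the calculated gain is used directly; in the second call the limited gain is kept because the resulting output level stays below the second threshold.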

In this embodiment, it is also preferable that, when the representative value of the audio input signal stored in the buffer unit 11 is equal to or lower than a determination level for distinguishing a silent state from a voiced state, the gain calculation unit 13 does not calculate the gain and the gain multiplication unit 15 uses the gain previously calculated by the gain calculation unit 13.

FIG. 7(b) shows an example of the waveform of the audio input signal, and FIG. 7(a) shows the gain determined from that signal. In the illustrated example, the representative value (for example, the maximum value) of the audio input signal stored in the buffer unit 11 exceeds the predetermined determination level L1 in periods T1, T2, and T3, so the gain calculation unit 13 calculates the gain from the representative value of the audio input signal in each period. In period T4, on the other hand, the representative value (maximum value) of the audio input signal stored in the buffer unit 11 is equal to or lower than the determination level L1, so the gain calculation unit 13 does not calculate the gain. In period T4, the gain multiplication unit 15 therefore uses the gain G3 previously calculated by the gain calculation unit 13, multiplies the audio input signal read from the buffer unit 11 by the gain G3, and outputs the result to the encoding and packetization processing unit 4.

If the gain calculation unit 13 were to calculate the gain when the representative value of the audio input signal is equal to or lower than the determination level L1, that is, in a silent state in which no meaningful signal is input, the gain would be set to an excessively large value. As described above, however, when the representative value is equal to or lower than the determination level L1, the gain calculation unit 13 does not calculate the gain and the audio input signal is multiplied by the previously calculated gain, which reduces the possibility that floor noise is amplified with an excessive gain. The determination level L1 is preferably set to a value larger than the volume level of the floor noise and smaller than the lower limit of the representative value of a meaningful signal (that is, a voice signal), so that meaningful signals and floor noise can be reliably distinguished.
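The silence handling described above amounts to holding the previous gain whenever the representative value does not exceed L1. A minimal sketch follows, assuming the peak is used as the representative value and a non-empty buffer; `update_gain` and `level_l1` are hypothetical names.

```python
def update_gain(samples, prev_gain, target_peak, level_l1):
    """Recalculate the gain only when the buffer holds a meaningful signal."""
    representative = max(abs(s) for s in samples)  # peak as representative value
    if representative <= level_l1:
        # Silent state: skip the calculation and keep the previous gain,
        # so floor noise is not amplified by a very large gain.
        return prev_gain
    return target_peak / representative


held = update_gain([0.001, -0.002], prev_gain=3.0, target_peak=0.5, level_l1=0.01)
updated = update_gain([0.125, -0.25], prev_gain=3.0, target_peak=0.5, level_l1=0.01)
```

Without the check, the first call would compute a gain of 250 for what is only floor noise; with it, the previous gain is simply reused.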

In this embodiment, it is also preferable to provide a voice determination unit (not shown) that determines whether the audio input signal input to the buffer unit 11 is voice or non-voice. In this case, when the voice determination unit determines that all the audio input signals input to the buffer unit 11 are non-voice, the gain calculation unit 13 does not calculate the gain, and the gain multiplication unit 15 multiplies the audio input signal by the gain previously calculated by the gain calculation unit 13.

For example, in the case of the audio input signal shown in FIG. 8(b), at least part of the audio input signal stored in the buffer unit 11 is determined to be voice in periods T1, T2, and T3. The gain calculation unit 13 therefore calculates the gain in periods T1, T2, and T3, and the gains in these periods are G1, G2, and G3, respectively (see FIG. 8(a)). In period T4, on the other hand, the voice determination unit determines that all the audio input signals stored in the buffer unit 11 are non-voice, and in this case the gain calculation unit 13 does not calculate the gain. In period T4, the gain multiplication unit 15 uses the gain G3 previously calculated by the gain calculation unit 13, multiplies the audio input signal read from the buffer unit 11 by the gain G3, and outputs the result to the encoding and packetization processing unit 4. If the gain were obtained from the non-voice signal in period T4, the resulting gain G21 would be much larger than a gain obtained from a voice signal (for example, the gain G3 in period T3), and multiplying the audio input signal by this gain G21 would amplify the noise with a high gain. In this embodiment, by contrast, the gain is not calculated in period T4, in which all the audio input signals are determined to be non-voice, and the audio input signal is amplified using the previously obtained gain G3, so amplification of noise with a very high gain can be suppressed.

As described above, when the voice determination unit determines that all the audio input signals input to the buffer unit 11 are non-voice, the gain multiplication unit 15 multiplies the audio input signal read from the buffer unit 11 by the gain previously calculated by the gain calculation unit 13. This suppresses amplification of noise other than voice with a high gain.

The voice determination unit determines whether the signal is voice or non-voice by the following method. The voice determination unit obtains a long-time average value of the audio input signal input from the A/D conversion unit 3 over a relatively long time, and a short-time average value of the audio input signal over a relatively short time. The level of the short-time average value is considered to be determined by the voice component contained in the audio input signal, and the level of the long-time average value by the noise component (sound components other than voice) contained in the audio input signal. The voice determination unit therefore determines that the signal is voice if the ratio of the short-time average value to the long-time average value is equal to or greater than a predetermined reference value, and non-voice if the ratio is less than the reference value, so that voice and non-voice can be reliably distinguished. The method of determining voice or non-voice is not limited to the one described above; for example, the determination may be made from the input of a switch that the speaker operates when speaking.
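The ratio test described above can be sketched as follows. The example assumes that both averages are taken over absolute sample values at the end of the available signal, and that a signal whose long-time average is zero counts as non-voice; the function name `is_voice`, the window lengths, and the reference ratio are hypothetical.

```python
def is_voice(samples, short_len, long_len, reference_ratio):
    """Classify the most recent samples as voice (True) or non-voice (False).

    Compares the short-time average level against the long-time average
    level, both measured over the tail of `samples`.
    """
    long_window = samples[-long_len:]
    short_window = samples[-short_len:]
    long_avg = sum(abs(s) for s in long_window) / len(long_window)
    short_avg = sum(abs(s) for s in short_window) / len(short_window)
    if long_avg == 0:
        return False  # assumption: digital silence is treated as non-voice
    return short_avg / long_avg >= reference_ratio


steady_noise = [0.125] * 16
speech_onset = [0.125] * 12 + [1.0] * 4   # level jumps in the last samples

noise_result = is_voice(steady_noise, short_len=4, long_len=16, reference_ratio=2.0)
speech_result = is_voice(speech_onset, short_len=4, long_len=16, reference_ratio=2.0)
```

Steady noise gives a ratio of 1 and is classified as non-voice, while a sudden rise in level lifts the short-time average well above the long-time average and is classified as voice.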

DESCRIPTION OF SYMBOLS
10 Volume correction device
11 Buffer unit
12 Gain setting unit
13 Gain calculation unit
14 Gain limiting unit
15 Gain multiplication unit

Claims

A volume correction apparatus comprising:
a buffer unit that has a storage area capable of storing an audio input signal for a predetermined time and that, when the storage area has no free space, overwrites the storage area holding the oldest audio input signal with a new audio input signal;
a gain calculation unit that calculates a gain based on a representative value of the audio input signals stored in the buffer unit every time all the audio input signals stored in the buffer unit have been replaced;
a gain limiting unit that obtains the change between the gain previously calculated by the gain calculation unit and the gain calculated this time and, when the absolute value of the change is equal to or greater than a predetermined first threshold, sets, as the current gain, a limit value obtained by changing the previously calculated gain by a predetermined variation upper limit so that it approaches the currently calculated gain; and
a gain multiplication unit that, once the gain has been set by the gain calculation unit and the gain limiting unit, reads the audio input signals from the buffer unit in order from the oldest, multiplies the read audio input signals by the gain, and outputs the result.

The volume correction apparatus according to claim 1, wherein the gain calculation unit obtains the maximum of the absolute values of the audio input signals stored in the buffer unit and uses this maximum value as the representative value.

The volume correction apparatus according to claim 1, wherein the gain calculation unit calculates an average value of absolute values of audio input signals stored in the buffer unit, and uses the average value as the representative value.

The volume correction apparatus according to any one of claims 1 to 3, wherein the gain calculation unit sets the gain so that the value obtained by multiplying the representative value by the gain equals a preset target value.

The volume correction apparatus according to claim 1, wherein, when setting the limit value as the current gain would cause the product of that gain and the representative value to exceed a predetermined second threshold, the gain limiting unit sets, as the current gain, the gain calculated by the gain calculation unit such that the value obtained by multiplying the representative value by the gain equals a predetermined target value.

The volume correction apparatus according to any one of claims 1 to 5, wherein the buffer unit reduces the size of the storage area when the absolute value of the change is equal to or greater than the first threshold and the gain limiting unit sets the limit value as the current gain.

The volume correction apparatus according to claim 1, wherein, when the representative value is equal to or lower than a determination level for determining whether the signal is in a silent state or a voiced state, the gain multiplication unit multiplies the audio input signal read from the buffer unit by the gain previously calculated by the gain calculation unit and outputs the result.

The volume correction apparatus according to claim 1, further comprising a voice determination unit that determines whether the audio input signal input to the buffer unit is voice or non-voice, wherein, when the voice determination unit determines that all the audio input signals input to the buffer unit are non-voice, the gain multiplication unit multiplies the audio input signal read from the buffer unit by the gain previously calculated by the gain calculation unit and outputs the result.

The volume correction apparatus according to claim 1, wherein the gain calculation unit sets the gain to a predetermined upper limit value when the calculated gain exceeds that upper limit value.