KR100388454B1

KR100388454B1 - Method for controling voice output gain by predicting background noise

Info

Publication number: KR100388454B1
Application number: KR10-2001-0032342A
Authority: KR
Inventors: 김원철
Original assignee: 주식회사 하이닉스반도체
Priority date: 2001-06-09
Filing date: 2001-06-09
Publication date: 2003-06-25
Also published as: KR20020093540A

Abstract

1. 청구범위에 기재된 발명이 속한 기술분야1. TECHNICAL FIELD OF THE INVENTION

본 발명은 배경잡음 예측을 통한 음성 출력 이득 조정 방법과 상기 방법을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 관한 것임.The present invention relates to a method of adjusting a voice output gain through prediction of a background noise and a computer-readable recording medium having recorded thereon a program for realizing the method.

2. 발명이 해결하려고 하는 기술적 과제2. The technical problem to be solved by the invention

본 발명은, 현재 자신의 배경잡음 정도에 따라 상대방이 보내준 음성 출력에 대한 이득을 조정함으로써, 자동적으로 음성 출력 이득을 조정할 수 있도록 하는 음성 출력 이득 조정 방법과 상기 방법을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공하고자 함.The present invention provides a method for adjusting the audio output gain so as to automatically adjust the audio output gain by adjusting the gain for the audio output sent by the other party according to the background noise level of the present computer, and a computer for recording the program for realizing the method. To provide a recording medium that can be read by.

3. 발명의 해결방법의 요지3. Summary of Solution to Invention

본 발명은, 음성 출력 이득 조정 시스템에 적용되는 배경잡음 예측을 통한 음성 출력 이득 조정 방법에 있어서, 외부로부터 입력되는 음성 신호에 대하여 배경잡음(G(k))을 구하는 제 1 단계; 상기 구해진 배경잡음에 따라, 채널 에너지 측정기를 통해 각 채널의 에너지의 분산을 구하여 배경잡음을 추정하고, 상기 추정된 배경잡음 신호 구간을 통해 에너지를 파악하여, 현재 프레임에 대하여 출력되는 음성 신호에 대한 이득 조정 여부를 판단하는 제 2 단계; 및 상기 제 2 단계의 판단 결과에 따라, 이득 조정 플래그를 설정하여 여러 프레임에 대하여 출력되는 음성 신호에 대한 이득을 자동으로 조정하는 제 3 단계를 포함함.A voice output gain adjusting method using background noise prediction applied to a voice output gain adjusting system, comprising: a first step of obtaining a background noise (G (k)) with respect to a voice signal input from an outside; According to the obtained background noise, the energy of each channel is calculated through a channel energy meter to estimate the background noise, and the energy is determined through the estimated background noise signal interval, and the A second step of determining whether to adjust gain; And a third step of automatically adjusting gain for an audio signal output for several frames by setting a gain adjustment flag according to the determination result of the second step.

4. 발명의 중요한 용도4. Important uses of the invention

본 발명은 음성 출력 이득 조정 시스템 등에 이용됨.The present invention is used in a voice output gain adjustment system.

Description

Method for controling voice output gain by predicting background noise}

본 발명은 배경잡음 예측을 통한 음성 출력 이득 조정 방법과 상기 방법을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 관한 것으로, 특히 현재 자신의 배경잡음 정도에 따라 상대방이 보내준 음성 출력에 대한 이득을 조정함으로써, 자동적으로 음성 출력 이득을 조정할 수 있도록 하는 음성 출력 이득 조정 방법과 상기 방법을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 관한 것이다.The present invention relates to a method of adjusting a voice output gain through prediction of a background noise and a computer-readable recording medium recording a program for realizing the method. Particularly, the present invention relates to a voice output transmitted from a counterpart according to the current background noise level. An audio output gain adjustment method for automatically adjusting the audio output gain by adjusting the gain, and a computer-readable recording medium storing a program for realizing the method.

일반적으로, EVRC(Enhanced Variable Rate Coder)의 경우 입력 음에 대하여 NS(Noise Suppression)기능을 가지고 있으며, 이것은 입력 음에서 구간별로 배경잡음과 음성 신호 구간을 추정하여 신호 대 잡음비(SNR : Signal to Noise Ratio)에 따라 배경잡음이 클 경우 구간별 스펙트럴 이득(Spectral Gain)을 조정하여 잡음을 줄이도록 하는 방법이다. 이 방법을 이용하여 현재 자신의 단말에 대한 배경잡음을 추정하면 상대 단말측에서 보내온 음성 출력 이득을 조정하게 하는데 사용할 수 있다.In general, the EVRC (Enhanced Variable Rate Coder) has an NS (Noise Suppression) function for the input sound, which estimates the background noise and the voice signal interval for each section of the input sound, and then uses a signal-to-noise ratio (SNR). If the background noise is large according to the ratio, the spectral gain for each section is adjusted to reduce the noise. Using this method, it is possible to estimate the background noise of its own terminal and to adjust the voice output gain sent from the other terminal.

즉, 모든 음성 코덱에서 인코딩(Encoding)시 배경잡음 추정에 대한 계산을 더 함으로써, 이에 대한 정보를 이용하여 상대측에서 보내온 음성을 디코딩(Decoding)시 음성 출력의 크기를 조정할 수 있다.That is, by calculating the background noise estimation during encoding in all the speech codecs, the size of the speech output can be adjusted when decoding the speech sent from the other side by using the information about this.

예를 들어, 무선 단말 M1, M2 간에 서로 통화를 할 때, M1측의 배경잡음이 커질 경우 M2 측에서 말한 소리가 M1 측의 배경잡음에 묻혀 소리가 잘 들리지 않게 된다. 즉, 종래에는 배경잡음이 커진 통화 환경에서도 이에 대한 고려 사항없이 디코딩을 함으로써, 통화 청자가 시끄러운 환경에서 소리를 들을 때 음이 배경잡음에묻혀 소리를 잘 들을 수가 없었다. 이에 대한 해결 방법으로는 단순히 단말측에서 배경잡음에 상관없이 출력되는 소리를 키울 수는 있으나, 이는 사용자들이 환경에 따라 출력되는 소리의 크기를 계속 조정하게 함으로써, 많은 불편을 초래하였다.For example, when talking between the wireless terminals M1 and M2, if the background noise on the M1 side is increased, the sound spoken by the M2 side is buried in the background noise on the M1 side, so that the sound is hard to be heard. That is, conventionally, even in a call environment in which the background noise is increased, decoding is performed without consideration of this, and when the call listener hears the sound in the noisy environment, the sound is buried in the background noise and thus cannot hear the sound well. As a solution to this problem, it is possible to simply increase the sound output regardless of the background noise on the terminal side, but this has caused a lot of inconvenience by allowing users to continuously adjust the volume of the sound output according to the environment.

본 발명은, 상기한 바와 같은 문제점을 해결하기 위하여 제안된 것으로, 현재 자신의 배경잡음 정도에 따라 상대방이 보내준 음성 출력에 대한 이득을 조정함으로써, 자동적으로 음성 출력 이득을 조정할 수 있도록 하는 음성 출력 이득 조정 방법과 상기 방법을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공하는데 그 목적이 있다.The present invention has been proposed in order to solve the above-mentioned problems, and the present invention provides a voice output gain which enables the voice output gain to be automatically adjusted by adjusting the gain for the voice output sent by the other party according to the background noise level of the present person. It is an object of the present invention to provide a computer-readable recording medium that records an adjustment method and a program for realizing the method.

도 1 은 본 발명에 따른 배경잡음 예측을 통한 음성 출력 이득 조정 시스템의 일실시예 설명도.BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 illustrates an embodiment of a system for adjusting a voice output gain through background noise prediction according to the present invention.

도 2 는 본 발명에 따른 입력 음성에 대하여 스펙트럴 분석을 통한 정지신호와 비정지신호를 구분하는 과정에 대한 일예시도.2 is an exemplary view illustrating a process of classifying a stop signal and a non-stop signal through spectral analysis of an input voice according to the present invention;

도 3 은 본 발명에 따른 배경잡음 예측을 통한 음성 출력 이득 조정 방법에 대한 일실시예 설명도.3 is a diagram illustrating a method of adjusting a voice output gain through background noise prediction according to the present invention.

* 도면의 주요 부분에 대한 부호의 설명* Explanation of symbols for the main parts of the drawings

11 : 예측기 12 : 인코더11: predictor 12: encoder

13 : 배경잡음 추정 및 이득 조정 플래그 설정부 14 : 디코더13: background noise estimation and gain adjustment flag setting unit 14: decoder

상기 목적을 달성하기 위한 본 발명은, 음성 출력 이득 조정 시스템에 적용되는 배경잡음 예측을 통한 음성 출력 이득 조정 방법에 있어서, 외부로부터 입력되는 음성 신호에 대하여 배경잡음(G(k))을 구하는 제 1 단계; 상기 구해진 배경잡음에 따라, 채널 에너지 측정기를 통해 각 채널의 에너지의 분산을 구하여 배경잡음을 추정하고, 상기 추정된 배경잡음 신호 구간을 통해 에너지를 파악하여, 현재 프레임에 대하여 출력되는 음성 신호에 대한 이득 조정 여부를 판단하는 제 2 단계; 및 상기 제 2 단계의 판단 결과에 따라, 이득 조정 플래그를 설정하여 여러 프레임에 대하여 출력되는 음성 신호에 대한 이득을 자동으로 조정하는 제 3 단계를포함하여 이루어진 것을 특징으로 한다.According to the present invention for achieving the above object, a method for adjusting the voice output gain through the background noise prediction applied to the voice output gain adjustment system, the method for obtaining the background noise (G (k)) for the voice signal input from the outside; Stage 1; According to the obtained background noise, the energy of each channel is calculated through a channel energy meter to estimate the background noise, and the energy is determined through the estimated background noise signal interval, and the A second step of determining whether to adjust gain; And a third step of automatically setting a gain adjustment flag to automatically adjust gain for an audio signal output for several frames according to the determination result of the second step.

또한, 본 발명은, 프로세서를 구비한 음성 출력 이득 조정 시스템에, 외부로부터 입력되는 음성 신호에 대하여 배경잡음(G(k))을 구하는 제 1 기능; 상기 구해진 배경잡음에 따라, 채널 에너지 측정기를 통해 각 채널의 에너지의 분산을 구하여 배경잡음을 추정하고, 상기 추정된 배경잡음 신호 구간을 통해 에너지를 파악하여, 현재 프레임에 대하여 출력되는 음성 신호에 대한 이득 조정 여부를 판단하는 제 2 기능; 및 상기 제 2 기능의 판단 결과에 따라, 이득 조정 플래그를 설정하여 여러 프레임에 대하여 출력되는 음성 신호에 대한 이득을 자동으로 조정하는 제 3 기능을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공한다.The present invention also provides a voice output gain adjusting system having a processor, comprising: a first function of obtaining a background noise (G (k)) with respect to a voice signal input from the outside; According to the obtained background noise, the energy of each channel is calculated through a channel energy meter to estimate the background noise, and the energy is determined through the estimated background noise signal interval, and the A second function of determining whether to adjust gain; And setting a gain adjustment flag according to the determination result of the second function to realize a third function of automatically adjusting a gain for an audio signal output for a plurality of frames. To provide.

본 발명에서는 현재 상태에서 배경잡음 구간을 추정하고 배경잡음 구간에서 특정 에너지 문턱값 보다 클 경우 디코딩시 음성 출력 이득을 조정하도록 한다.In the present invention, the background noise section is estimated in the current state, and the speech output gain is adjusted during decoding when the background noise section is larger than a specific energy threshold.

현재 통화 환경에 대한 배경잡음은 입력 음성에 대하여 스펙트럴(Spectral) 특성을 파악하여 스펙트럴 에너지의 분산을 구하고, 지속적으로 관찰함으로써, 정지(Stationary) 신호와 비정지(Non-Stationary) 신호를 구분하는 방법을 이용하여 추정하도록 한다. 이렇게 해서, 추정한 배경잡음 신호 구간에서 에너지를 파악하여 디코더(Decoder)에서 현재 프레임에 대한 음성 출력에 이득을 조정할지 여부를 알려주는 플래그를 설정하도록 하면 디코더에서는 이 플래그를 보고 이득을 키울 필요가 있을 경우 여러 프레임에 대하여 출력 음성에 대한 이득을 높이도록 한다.The background noise of the current call environment distinguishes stationary and non-stationary signals by grasping the spectral characteristics of the input voice to obtain spectral energy dispersion and continuously observing them. The estimation method is used. In this way, if the energy is detected in the estimated background noise signal interval, and the decoder sets a flag indicating whether or not to adjust the gain in the speech output for the current frame, the decoder does not need to look at this flag and increase the gain. If so, increase the gain for the output voice over multiple frames.

따라서, 본 발명은 현재 배경잡음 상태에 따라 자동적으로 음성 출력 이득을조정하게 함으로써, 단말기 사용자에 대한 상대적인 통화 품질에 대한 만족감을 높여줄 수 있는 특징이 있다.Therefore, the present invention has a feature that can increase the satisfaction of the relative call quality to the terminal user by automatically adjusting the voice output gain in accordance with the current background noise.

상술한 목적, 특징들 및 장점은 첨부된 도면과 관련한 다음의 상세한 설명을 통하여 보다 분명해 질 것이다. 이하, 첨부된 도면을 참조하여 본 발명에 따른 바람직한 일실시예를 상세히 설명한다.The above objects, features and advantages will become more apparent from the following detailed description taken in conjunction with the accompanying drawings. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1 은 본 발명에 따른 배경잡음 예측을 통한 음성 출력 이득 조정 시스템의 일실시예 설명도이다.1 is a diagram illustrating an embodiment of a voice output gain adjustment system using background noise prediction according to the present invention.

도 1에 도시된 바와 같이, 외부로부터 입력되는 입력신호(s(n))가 예측기(11)로 전달되면 예측기(11)는 배경잡음을 추정하여 이득 조정 플래그(Gain Control Flag)를 설정하는 배경잡음 추정 및 이득 조정 플래그 설정부(13)로 입력신호(s(n))를 전달한다. 이렇게 설정된 이득 조정 플래그는 배경잡음이 추정된 단말의 디코더(14)로 전달된다. 이후, 디코더(14)를 통해 이득 조정 플래그가 셋팅(setting)되면 여러 프레임에 걸쳐 출력 음성 이득을 키워 출력신호(s'(n))를 내보내게 된다.As shown in FIG. 1, when an input signal s (n) input from the outside is transmitted to the predictor 11, the predictor 11 estimates background noise to set a gain control flag. The noise estimation and gain adjustment flag setting unit 13 transmits the input signal s (n). The gain adjustment flag thus set is transmitted to the decoder 14 of the terminal whose background noise is estimated. Subsequently, when the gain adjustment flag is set through the decoder 14, the output voice gain is increased over several frames to output the output signal s' (n).

상기한 바와 같은 구조를 갖는 본 발명의 배경잡음 예측을 통한 음성 출력 이득 조정 방법의 동작 과정을 상세하게 설명하면 다음과 같다.The operation of the voice output gain adjustment method through the background noise prediction of the present invention having the structure as described above will be described in detail as follows.

도 3 은 본 발명에 따른 배경잡음 예측을 통한 음성 출력 이득 조정 방법에 대한 일실시예 설명도이다.3 is a diagram illustrating a method of adjusting a voice output gain through background noise prediction according to the present invention.

먼저, 입력 음에 대하여 배경잡음(G(k))을 구한다. 이 방법은 도 3에 도시된 바와 같이, 일반 코덱(CODEC)에서 HPF(High Pass Filter)와 같은 전처리 과정을 통과한 신호를 이용하여 스펙트럴(Spectral) 상에서 배경잡음 정보를 추출한다.First, the background noise G (k) is obtained for the input sound. As shown in FIG. 3, the background noise information is extracted on a spectral by using a signal that passes a preprocessing process such as a high pass filter (HPF) in a general codec.

이와 같이, G(k)는 DFT(Discrete Fourier Transform)를 이용하여 하기의 [수학식 1]과 같이 구한다.As such, G (k) is obtained by using Equation 1 below using a Discrete Fourier Transform (DFT).

이렇게 구해진 G(k)는 도 3의 채널 에너지 측정기(Channel Energy Estimator)로 입력되고 하기의 [수학식 2]와 같이 각 채널의 에너지를 구한다.The G (k) obtained as described above is input to a channel energy estimator of FIG. 3 and the energy of each channel is obtained as shown in Equation 2 below.

여기서, m은 현재 프레임, Nc는 16 전체 채널의 수, E_min은 허용 최소 에너지 및α _ch (m)는 채널 에너지 평활 요소(smoothing factor)를 의미한다. 또한, i는 i^th채널로써 시작과 끝을 각각f _L (i),f _H (i)로 나타내며, 이는 하기의 [수학식 3]과 같다.Where m is the current frame, Nc is the total number of 16 channels, E _min is the allowable minimum energy and α _ch (m) is the channel energy smoothing factor. Also, i is represented by _{_{f L (i), f H}} (i) each of the beginning and the end as i ^th channel, which as shown in [Equation 3] below.

다음, 이렇게 추정되어진 채널 에너지들을 이용하여 SNR을 구하면 하기의 [수학식 4]와 같다.Next, the SNR is calculated using the channel energies estimated as shown in Equation 4 below.

여기서, SNR(i)은 0에서 89값으로 제한하였으며, 이 값을 음성 메트릭(Voice metric)에 넣어 정지신호와 비정지신호를 구분하는 파라미터 중에 하나로 이용한다.Here, the SNR (i) is limited to a value from 0 to 89, and this value is used as one of the parameters for distinguishing the stop signal from the non-stop signal by putting it in a voice metric.

이어서, 음성 메트릭(Voice metric)을 수식으로 나타내면 하기의 [수학식 5]와 같다.Then, the voice metric (Voice metric) is represented by the formula (5) below.

여기서, V는 하기의 [수학식 6]과 같다.Here, V is the same as [Equation 6] below.

현재, 채널 에너지는 다시 도 3의 스펙트럴 디비에이션 측정기(Spectral Deviation Estimator)로 입력되어 사용된다. 여기서, 스펙트럴 디비에이션 측정기는 하기의 [수학식 7]과 같이 E_tot와 DEV_E를 구한다.At present, the channel energy is input again to the Spectral Deviation Estimator of FIG. 3 and used. Here, the spectral division measuring device obtains E _tot and DEV _E as shown in Equation 7 below.

여기서,,그리고α(m)은 지수의 윈도윙 요소(exponential windowing factor)로써 현재 입력에 대한 함수를 나타내며, 큰 입력에 대하여 긴 지수의 윈도우(exponential window)가 적용되는 효과를 가지고 있다.here, , Α (m) is an exponential windowing factor of the exponent and represents a function for the current input, and has an effect that a long exponential window is applied to a large input.

현재까지 구해진 v(m)과 E_tot(m), 그리고 DEV_E(m)을 이용하여 도 2와 같은 방법으로 업데이트 플래그(UPDATE FLAG)를 결정한다. 업데이트 명령이 발생할 경우 채널 에너지는 평활 필터(Smoothing filter)를 이용하여 다음의 [수학식 8]과 같이프레임의 배경잡음을 구하게 된다.The update flag UPDATE FLAG is determined in the same manner as in FIG. 2 using v (m), E _tot (m), and DEV _E (m) obtained so far. When the update command occurs, the channel energy is obtained by using a smoothing filter to obtain the background noise of the frame as shown in Equation 8 below.

여기서,E _min=0.0625로 가장 작은 채널 허용 에너지를 나타내고,α _n =0.9로 채널 평활 요소(smoothing factor)를 나타낸다.Here, E _min = 0.0625 represents the smallest channel allowable energy, and α _n = 0.9 represents the channel smoothing factor.

도 2 는 본 발명에 따른 입력 음성에 대하여 스펙트럴 분석을 통한 정지신호와 비정지신호를 구분하는 과정에 대한 일예시도로서, 업데이트 결정 순서도에 대한 의사-코드(Pseudo-Code)는 다음과 같다.FIG. 2 is an exemplary view illustrating a process of distinguishing a stop signal from a non-stop signal through spectral analysis for an input voice according to the present invention, and a pseudo-code for an update decision flowchart is as follows. .

이렇게 구해진 E_n(m)은 배경잡음의 에너지를 의미하므로 이 값을 E_n(m)과"ENERGY_BGN_THLD"와 비교하여 클 경우 "GAIN_CTL_FLAG"를 셋팅한다.Since E _n (m) obtained is the energy of background noise, this value is set to "GAIN_CTL_FLAG" when it is large compared to E _n (m) and "ENERGY_BGN_THLD".

한편, 이득 조정 플래그를 셋팅하는 의사-코드(Pseudo-Code)는 다음과 같다. 여기서, "GAIN_CTL_FLAG"는 이득 조정 플래그를 의미하며, "ENERGY_BGN_THLD"는 배경잡음에 대한 문턱값을 의미한다.On the other hand, the pseudo-code for setting the gain adjustment flag is as follows. Here, "GAIN_CTL_FLAG" means a gain adjustment flag, and "ENERGY_BGN_THLD" means a threshold for background noise.

그리고 나서, 도 1에서와 같이 이득 조정 플래그를 디코더에 전송한다. 디코더에 전송된 이득 조정 플래그가 셋팅되었을 경우에는 음성 출력을 크게 하여 다시 외부로 출력하게 된다.Then, the gain adjustment flag is transmitted to the decoder as shown in FIG. When the gain adjustment flag sent to the decoder is set, the audio output is increased to output again.

상술한 바와 같은 본 발명의 방법은 프로그램으로 구현되어 컴퓨터로 읽을 수 있는 기록매체(씨디롬, 램, 롬, 플로피 디스크, 하드 디스크, 광자기 디스크 등)에 저장될 수 있다.The method of the present invention as described above may be implemented as a program and stored in a computer-readable recording medium (CD-ROM, RAM, ROM, floppy disk, hard disk, magneto-optical disk, etc.).

이상에서 설명한 본 발명은 전술한 실시예 및 첨부된 도면에 의해 한정되는 것이 아니고, 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 여러 가지 치환, 변형 및 변경이 가능하다는 것이 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 있어 명백할 것이다.The present invention described above is not limited to the above-described embodiments and the accompanying drawings, and various substitutions, modifications, and changes are possible in the art without departing from the technical spirit of the present invention. It will be apparent to those of ordinary knowledge.

상기한 바와 같은 본 발명은, 현재의 배경 상태에 따라 자동적으로 음성 출력 이득을 조정하게 함으로써, 단말기 사용자에 대한 상대적인 통화 품질에 대한 만족감을 높여줄 수 있는 효과가 있다.The present invention as described above, by automatically adjusting the voice output gain according to the current background state, there is an effect that can increase the satisfaction for the relative call quality for the terminal user.

Claims

In the audio output gain adjustment method through the background noise prediction applied to the audio output gain adjustment system,

A first step of obtaining a background noise (G (k)) with respect to an audio signal input from the outside;

According to the obtained background noise, the energy of each channel is calculated through a channel energy meter to estimate the background noise, and the energy is determined through the estimated background noise signal interval, and the A second step of determining whether to adjust gain; And

A third step of automatically adjusting a gain for an audio signal output for several frames by setting a gain adjustment flag according to the determination result of the second step

Voice output gain adjustment method through the background noise prediction comprising a.

The method of claim 1,

The background noise G (k) is,

In the codec, the background noise information is extracted on a spectral in accordance with a signal that has passed through a preprocessing process such as a high pass filter (HPF), and is represented by the following equation using a discrete fourier transform (DFT). A voice output gain adjusting method through the background noise prediction, characterized in that the.

The method according to claim 1 or 2,

The second step,

A fourth step of obtaining energy ( E _ch (m, i)) of each channel to measure channel energy for each estimated channel;

A fifth step of obtaining a signal-to-noise ratio (SNR) using the obtained channel energies and using the obtained value as a parameter for distinguishing a stop signal from a non-stop signal by putting the obtained value into a voice metric (V (m));

A sixth step of obtaining spectral division measurements E _tot (m) and DEV _E (m), respectively, using the channel energies; And

The update flag is determined using the voice metric (V (m)) and the spectral division measurement values E _tot (m) and DEV _E (m), and a smoothing filter is used when an update command occurs according to the determined result. To obtain the energy of the background noise ( E _n (m + 1, i)) for the next frame

The method of claim 3, wherein

The energy E _ch (m, i) of each channel is

A voice output gain adjustment method through background noise prediction, characterized by the following equation.

Where m is the current frame, Nc is the total number of 16 channels, E _min is the allowable minimum energy and α _ch (m) is the channel energy smoothing factor, and i is the start and end of the i ^th channel, respectively. f _L (i), f _H (i), and f _L (i), f _H (i) is being)

The method of claim 3, wherein

The signal-to-noise ratio (SNR) is

(In this case, SNR (i) is limited from 0 to 89, and this value is used as one of the parameters for distinguishing the stop signal from the non-stop signal by putting it in a voice metric.)

The method of claim 3, wherein

The voice metric (V (m)),

here, being)

The method of claim 3, wherein

The spectral division measurement (E _tot (m), DEV _E (m)),

A method of adjusting the voice output gain through the prediction of the background noise, each of which is represented by the following equation.

(here, , And α (m) is an exponential windowing factor of the exponential, representing a function for the current input and applying a long exponential window to large inputs)

The method of claim 3, wherein

The energy of the background noise ( E _n (m + 1, i)) for the next frame is

(Wherein E _min = 0.0625 represents the smallest channel allowable energy and α _n = 0.9 represents the channel smoothing factor)

The method according to any one of claims 1 to 8,

The third step,

Comparing the energy ( E _n (m)) of the background noise with the threshold value (ENERGY_BGN_THLD) for the background noise, and setting the gain adjustment flag to transmit to the decoder if the threshold value is large according to the comparison result. How to adjust the voice output gain through the prediction of background noise.

In the audio output gain adjustment system with a processor,

A first function of obtaining a background noise G (k) with respect to an audio signal input from the outside;

According to the obtained background noise, the energy of each channel is calculated through a channel energy meter to estimate the background noise, and the energy is determined through the estimated background noise signal interval, and the A second function of determining whether to adjust gain; And

A third function of automatically adjusting a gain for an audio signal output for several frames by setting a gain adjustment flag according to the determination result of the second function

A computer-readable recording medium having recorded thereon a program for realizing this.