KR20140052489A

KR20140052489A - Method and apparatus of adaptive audio gain adjustment in multiple mic based sound quality enhancement system

Info

Publication number: KR20140052489A
Application number: KR1020120118610A
Authority: KR
Inventors: 정성일; 김재훈; 신옥근
Original assignee: (주)트란소노
Priority date: 2012-10-24
Filing date: 2012-10-24
Publication date: 2014-05-07

Abstract

The present invention relates to sound quality improvement and provides a method and apparatus for adaptive audio gain adjustment in a multi-microphone based sound quality improvement system. According to the present invention, even when the kind of noise changes, it can adaptively react to provide a uniform tone, and has a small voice loss even in an abnormal noise environment of low SNR (Signal to Noise Ratio). It can cope with real time, and even if the kind of noise suddenly changes, it can be processed in real time. In addition, it uses the low power according to the small amount of computation, the solution is provided by the software type library, the general purpose is increased, and the short time tuning is required.

Description

BACKGROUND OF THE INVENTION 1. Field of the Invention [0001] The present invention relates to an adaptive audio gain adjustment method and apparatus for a multi-microphone based sound quality improvement system,

The present invention relates to sound quality improvement, and more particularly, to a method and apparatus for adaptive audio gain adjustment in a multi-microphone-based sound quality improvement system.

The ambient noise that occurs in real life always pollutes the pure voice and severely lowers the sound quality of mobile communication. Various methods have been studied for improving the sound quality of mobile communication by removing the effect of ambient noise for a long time. Recently, the elimination or solving of ambient noise in mobile communication has become a big issue.

A technical problem of the present invention is to extract only noise from a voice contaminated by ambient noise and to provide clean sound quality in mobile communication based on the extracted noise.

According to one aspect of the present invention, there is provided an adaptive audio gain adjustment method in a multiple microphone-based sound quality improvement system.

According to another aspect of the present invention, there is provided an adaptive audio gain adjustment apparatus in a multi-microphone-based sound quality improvement system.

According to the present invention, even if the type of noise changes, it can adaptively respond to provide a certain tone color, and has a small voice loss even in a noisy noise environment with low SNR (Signal to Noise Ratio).

Also, according to the present invention, impulse noise can be responded in real time, and even if the kind of noise suddenly changes, it can be processed in real time.

Further, according to the present invention, there is an advantage that a low power according to a small amount of computation is used, a solution is provided by a software type library, the versatility is increased, and short-term tuning is required.

1 shows a method for processing a noisy speech signal according to the present invention.
2 shows an example of a waveform and a spectrogram of speech according to the present invention.
Fig. 3 shows another example of the waveform and spectrogram of speech according to the present invention.
Fig. 4 shows another example of waveforms and spectrograms of speech according to the present invention.

Hereinafter, some embodiments of the present invention will be described in detail with reference to exemplary drawings. It should be noted that, in adding reference numerals to the constituent elements of the drawings, the same constituent elements are denoted by the same reference symbols as possible even if they are shown in different drawings. In the following description of the embodiments of the present invention, a detailed description of known configurations or functions will be omitted when it is determined that the description of the present invention may be omitted.

Various methods are proposed for solving the problem of ambient noise in mobile communication and providing clean sound quality. However, existing methods do not provide a uniform tone when the type of noise changes, and a lot of voice loss occurs in a non-static noise environment with low SNR. Also, according to the conventional methods, impulse noise occurring over a short period of time can not be coped with in real time, and a certain adaptation time is required when the kind of noise suddenly changes. In addition, the conventional methods require a high computation amount, so that high power is used in many cases, and a solution is provided with a chip of a hardware type, so that there is no general versatility and a case where a long tuning is required There were many.

In order to solve the above-mentioned problems, the present invention proposes the following method. The present invention may be referred to as ElectroVox-2.

1 shows a method for processing a noisy speech signal according to the present invention.

Referring to FIG. 1, a sound quality improvement apparatus according to the present invention acquires noisy speech through a first microphone and a second microphone, respectively. For example, the first microphone may be a rear mic, and the second microphone may be a front mic.

Then, the sound quality improvement apparatus performs pre-processing. The preprocessing step includes applying adaptive gain, performing mixture tracking, and performing spectrum whiting.

Then, the sound quality improvement apparatus performs noise estimation. The noise estimation step includes applying a probability weight, applying a psychoacoustic forgetting factor, and performing an estimated noise refine.

Then, the sound quality improvement device performs speech enhancement. The speech enhancement step includes performing noise masking and performing enhanced speech tuning.

Through the above process, the sound quality improvement apparatus can obtain improved speech. According to the technical advantages of the present invention as described above, even when the kind of noise changes, the system can adaptively respond to a certain tone color, and has a small voice loss even in an abnormal noise environment of low SNR (Signal to Noise Ratio). Also, according to the present invention, impulse noise can be responded in real time, and even if the kind of noise suddenly changes, it can be processed in real time. Further, according to the present invention, there is an advantage that a low power according to a small amount of computation is used, a solution is provided by a software type library, the versatility is increased, and short-term tuning is required.

2 shows an example of a waveform and a spectrogram of speech according to the present invention. 2 shows a case where the present invention is applied to a clear voice.

(B) is a spectrogram of the clean speech; (c) is a waveform of the speech processed by the present invention on the clean speech; (d) Represents a spectrogram about the processed speech. When a clear voice as shown in (a) is input, most of the existing solutions cause voice loss. However, according to the present invention, the spectrogram of the clean speech and the clean speech in (a) and (b) and the waveform of the processed speech in (c) and (d) It can be seen that the loss of the original signal hardly occurs when the spectrograms are compared with each other.

Fig. 3 shows another example of the waveform and spectrogram of speech according to the present invention. FIG. 3 shows a case where the present invention is applied to a voice contaminated with public noise.

Referring to FIG. 3, (a) is a waveform of a voice contaminated with public noise, (b) is a spectrogram of a voice contaminated with the public noise, and (c) The waveform of the speech processed by the invention, and (d) the spectrogram of the processed speech. It is technically very difficult to accurately remove noise from a voice polluted by a public noise generated in a restaurant such as the above (a). Generally, even when the noise is removed, the effect of haptic effect is not appreciable due to the noise remaining in the human audible range . However, when comparing (a) and (b) for speech contaminated with public noise and (c) and (d) for speech treated according to the present invention, It can be seen that there is an effect. As shown by the white dotted line in (d) of FIG. 3, the public noise is removed and replaced with a comport residual noise, which provides a highly audible noise canceling performance (or effect).

Fig. 4 shows another example of waveforms and spectrograms of speech according to the present invention. FIG. 4 shows a case where the present invention is applied to a voice contaminated with music noise.

Referring to FIG. 4, (a) is a waveform of a voice contaminated with music noise, (b) is a spectrogram of a voice contaminated with the music noise, and (c) The waveform of the speech processed by the invention, and (d) the spectrogram of the processed speech. It is technically very difficult to remove noise from a voice contaminated with music noise composed of an impulse component as in (a). However, when comparing (a) and (b) with respect to the voice contaminated with music noise and (c) and (d) with the voice treated according to the present invention, It can be seen that there is an effect. According to the present invention, as shown by the white dotted line in FIG. 4 (d), the reaction occurs in real time even in a region where noise suddenly occurs. In addition, as shown by the yellow dotted line in FIG. 4 (d), the impulse noise is largely replaced with the remnant noise after the impulse noise is removed.

The foregoing description is merely illustrative of the technical idea of the present invention and various changes and modifications may be made by those skilled in the art without departing from the essential characteristics of the present invention. Therefore, the embodiments disclosed in the present invention are intended to illustrate rather than limit the scope of the present invention, and the scope of the technical idea of the present invention is not limited by these embodiments. The scope of protection of the present invention should be construed according to the following claims, and all technical ideas falling within the scope of the same shall be construed as falling within the scope of the present invention.

Claims

Adaptive Audio Gain Control Difference Unit in Multi - microphone Based Sound Quality Improvement System.