WO2017104876A1 - Noise removal device and method therefor - Google Patents

Noise removal device and method therefor Download PDF

Info

Publication number
WO2017104876A1
WO2017104876A1 PCT/KR2015/013970 KR2015013970W WO2017104876A1 WO 2017104876 A1 WO2017104876 A1 WO 2017104876A1 KR 2015013970 W KR2015013970 W KR 2015013970W WO 2017104876 A1 WO2017104876 A1 WO 2017104876A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
noise
noise ratio
value
mixed
Prior art date
Application number
PCT/KR2015/013970
Other languages
French (fr)
Korean (ko)
Inventor
이석필
서지훈
한혁수
Original Assignee
상명대학교 서울산학협력단
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 상명대학교 서울산학협력단 filed Critical 상명대학교 서울산학협력단
Publication of WO2017104876A1 publication Critical patent/WO2017104876A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering

Definitions

  • the present invention relates to signal processing for speech enhancement, and more particularly, to a signal processing method and apparatus for improving the clarity of speech by removing wind noise included in the speech.
  • Apple's Siri and Google Now's Google Now are typical smartphone services using voice recognition.
  • the recognition rate of the voice recognition service is high, and even in a general call situation, the other party's voice can be heard well, but when the surroundings are noisy and the wind is mixed with the user's voice and input into the smartphone, the voice recognition is performed.
  • the voice recognition rate of the service may be lowered and the voice of the other party may not be recognized well.
  • the prior art attempts to reduce the wind noise by simply cutting out a specific band of a signal by using a low pass filter (LPF) or a high pass filter (HPF).
  • LPF low pass filter
  • HPF high pass filter
  • the present invention relates to a method for automatically removing the wind noise according to the level to filter the mixed signal with a low pass filter and to measure the level to generate a control signal according to the measured level
  • the invention is to remove wind noise through a high pass filter.
  • An object of the present invention is to provide a device and method for obtaining a filter coefficient using the preceding signal-to-noise ratio and the post-signal-to-noise ratio and removing wind noise using the same.
  • Noise reduction method for achieving the above object of the present invention, receiving a mixed signal including a voice signal and a noise signal; Obtaining the noise signal using a section in which the voice signal is absent among the mixed signals; Obtaining a post-signal-to-noise ratio using the noise signal and the mixed signal; Estimating a preceding signal-to-noise ratio of the current frame using the post-signal-to-noise ratio, the noise signal of the previous frame, and the preceding signal-to-noise ratio of the previous frame; Calculating a weight value using the estimated preceding signal to noise ratio; Calculating a filter value for each frequency using the calculated weight value; And multiplying the calculated filter value by the mixed signal to obtain the improved estimated speech signal.
  • an apparatus for removing noise comprising: an input unit configured to receive a mixed signal including a voice signal and a noise signal; A frequency signal converter for converting the mixed signal into a frequency domain signal; From the mixed signal, the noise signal is obtained using a section without the voice signal, and a post-signal-to-noise ratio is obtained using the noise signal and the mixed signal, and the post-signal-to-noise ratio, the noise signal of the previous frame, and the preceding of the previous frame.
  • the present invention by using a filter formed by using the signal-to-noise ratio and the signal-to-noise ratio before and after the signal mixed with the wind noise to provide a more improved speech enhancement technology to increase the speech recognition rate and the speech intelligibility There is.
  • FIG. 1 is a flowchart of a noise removing method according to an embodiment of the present invention.
  • Figure 2 is a structural diagram showing the flow of the signal of the noise removing method according to an embodiment of the present invention.
  • FIG. 3 is a structural diagram of a noise removing device according to another embodiment of the present invention.
  • FIG. 4 is a structural diagram of a computer device in which a noise canceling method according to another embodiment of the present invention is implemented.
  • FIG. 1 shows a flowchart of a noise removing method according to an embodiment of the present invention.
  • the mixed signal is first received (S110).
  • the input mixed signal is usually a time domain signal
  • an FFT (Fast Fourier Transform) operation is performed to convert the mixed signal into a frequency domain signal.
  • the signal changed into the frequency domain signal through the FFT operation is composed of a magnitude signal and a phase signal.
  • the phase signal is transmitted to the output side without modification since the calculation is performed only with the amplitude signal.
  • a noise signal, a mixed signal, and a posteriori SNR are required. Since only the mixed signal is input, the remaining noise signal and the post-signal noise ratio are estimated from the mixed signal.
  • a noise signal is obtained by using an interval without speech in a mixed signal.
  • a human voice does not always exist. Therefore, a short section after receiving a mixed signal input will not have a human voice. Therefore, it is assumed that only a noise signal exists.
  • a post-signal-to-noise ratio may be obtained using the noise signal and the mixed signal.
  • the post-signal-to-noise ratio may be obtained as in Equation 1 below (S120).
  • Post Signal to Noise Ratio Denotes the post-signal-to-noise ratio at the p-th frame and the k-th frequency index, and Y (p, k) and N (p, k) represent the mixed signal and the noise signal at the p-th frame and the k-th frequency index, respectively.
  • the noise signal uses the value assumed in the previous step.
  • the preceding signal-to-noise ratio is calculated using the calculated after-signal-to-noise ratio (S130), and is calculated as in Equation 2.
  • the speech signal before the calculation according to the present invention starts is initialized to 0, the speech signal of the corresponding frame is estimated, and used to calculate the preceding signal-to-noise ratio from the next frame. .
  • is a value of a preset coefficient that is used to adjust the influence of the estimated voice signal and the noise signal of the previous frame and the post signal-to-noise ratio accumulated from the first frame to the previous frame in estimating the voice signal.
  • is a value between 0 and 1, the closer to 1, the more affected by the value of the previous frame, and the closer to 0, the more affected by the accumulated value from the first frame to the previous frame. Means greater impact.
  • the weight value is calculated using this value (S140), and the weight value can be obtained by Equation 3.
  • is a weighting parameter. If the value of the preceding signal-to-noise ratio is large, it means that the size of the voice signal is large. Therefore, the weight value should be large. On the contrary, if the value of the preceding signal-to-noise ratio is small, the weight value is smaller than the noise signal. Should also be small.
  • the filter values H (p, k) used for noise reduction can be obtained using the two values (S150), which is shown in Equation 4.
  • Y (p, k) represents a mixed signal and the estimated speech signal thus obtained as described above Is used to find the preceding signal-to-noise ratio in the next frame.
  • FIG. 2 is a flowchart of a signal until a mixed signal mixed with a noise signal is filtered and outputs a signal in which the noise signal is attenuated.
  • the estimated speech signal is an amplitude signal of the speech signal, it is converted into a time domain signal by IFFT (Inverse Fast Fourier Transform) method together with the phase signal of the speech signal which has not been transformed to provide a signal from which noise is removed.
  • IFFT Inverse Fast Fourier Transform
  • the noise reduction effect is superior to that of the conventional LPF filter.
  • FIG. 3 is a structural diagram of a noise removing device according to another embodiment of the present invention.
  • the input unit 310 receives a mixed signal in which a voice signal and a noise signal are mixed.
  • the input unit may be composed of a microphone or the like, or may receive only a mixed signal, which is a voice signal, by receiving an input in the form of a file such as a voice file or a video file.
  • the frequency signal converter 320 converts the received signal into a frequency signal through a method such as an FFT.
  • the frequency signal transformation may use methods such as Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT), and Filterbank as well as FFT.
  • the calculator 330 extracts a filter value for noise removal from the input signal.
  • the signal to noise ratio is first obtained from the input mixed signal and the noise signal, and the process is as shown in Equation 1 above. Although it is impossible to distinguish between a speech signal and a noise signal in a mixed signal, it is assumed that a voice signal does not exist in the initial input signal, and then a signal of the post-noise ratio is calculated by assuming that the signal in this section is a noise signal.
  • the preceding signal-to-noise ratio is obtained by using the calculated post-signal-to-noise ratio, the average value of the speech signal estimated from the previous frame, and the previously obtained noise signal.
  • the proportional coefficient value may be used to adjust the rate at which the estimated value of the previous frame and the history value of previous frames including the previous frame affect the preceding signal-to-noise ratio.
  • Increasing the ratio of the previous frame value has the advantage of being sensitive to the change between frames, but it can cause inconvenience to the user due to frequent changes, and abrupt change can be suppressed when increasing the ratio of the history value. You can hear a natural voice signal, but can not respond quickly to a signal that changes quickly in time, so it can be used to determine the optimal value between the two by experiment.
  • Equation 2 can be used to obtain the weighted value by calculating the preceding signal-to-noise ratio.
  • the preceding signal-to-noise ratio is large, the speech signal is expected to be large. Therefore, the weighted value is increased and the weighted value is reduced to reduce the influence of the noise signal. to be.
  • the weight value is obtained by the equation (3).
  • the filter value can be finally obtained using the weight value and the preceding signal-to-noise ratio value.
  • the filter unit 340 multiplies the thus obtained filter value by the mixed signal to obtain a signal from which the noise is removed.
  • the filter unit 340 Since the signal from which the noise is removed through the filter unit 340 is a signal in the frequency domain, the user finally removes the noise by converting the voice signal into a time domain signal through the time signal converter 350 and providing it to the output unit. You can hear the signal.
  • the time signal converter 350 may convert a frequency domain signal into a time domain signal using a method such as IFFT, IDFT (Inverse DFT), IDCT (Inverse DCT), Inverse Filterbank, or the like.
  • a computer system includes at least one processor 421, a memory 423, a user input device 426, a data communication bus 422, a user output device 427, It may include a reservoir 428.
  • Each of the components described above communicates data via a data communication bus 422.
  • the computer system can further include a network interface 429 coupled to the network.
  • the processor 421 may be a central processing unit (CPU) or a semiconductor device that processes instructions stored in the memory 423 and / or the storage 428.
  • the memory 423 and the storage 428 may include various types of volatile or nonvolatile storage media.
  • the memory 423 may include a ROM 424 and a RAM 425.
  • the noise reduction method according to the embodiment of the present invention can be implemented in a computer executable method.
  • computer readable instructions may perform the recognition method according to the present invention.
  • the noise canceling method according to the present invention described above may be embodied as computer readable codes on a computer readable recording medium.
  • Computer-readable recording media include all kinds of recording media having data stored thereon that can be decrypted by a computer system. For example, there may be a read only memory (ROM), a random access memory (RAM), a magnetic tape, a magnetic disk, a flash memory, an optical data storage device, and the like.
  • the computer readable recording medium can also be distributed over computer systems connected over a computer network, stored and executed as readable code in a distributed fashion.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Noise Elimination (AREA)

Abstract

The present invention relates to a voice signal processing method, wherein a noise removal method, according to one aspect of the present invention, comprises the steps of: receiving, as input, a mixed signal comprising a voice signal and a noise signal; obtaining the noise signal by using a section in the mixed signal that does not include the voice signal; obtaining an a posteriori signal-to-noise ratio by using the noise signal and the mixed signal; estimating an a priori signal-to-noise ratio of a current frame by using the a posteriori signal-to-noise ratio, a noise signal of a previous frame and an a priori signal-to-noise ratio of the previous frame; calculating a weighted value by using the estimated a priori signal-to-noise ratio; calculating a filter value per each frequency by using the calculated weighted value; and by multiplying the calculated filter value by the mixed signal, obtaining the estimated voice signal which has been improved.

Description

잡음 제거장치 및 그 방법Noise canceller and method
본 발명은 음성향상을 위한 신호처리에 관한 것으로서, 보다 구체적으로는 음성에 포함된 바람소리를 제거하여 음성의 명료도를 향상시키기 위한 신호처리 방법 및 그 장치에 관한 것이다.The present invention relates to signal processing for speech enhancement, and more particularly, to a signal processing method and apparatus for improving the clarity of speech by removing wind noise included in the speech.
스마트폰의 보급이 늘어남에 따라 음성인식 기술이 다양하게 사용되고 있다. 애플(Apple)사의 시리(Siri)나 구글(Google)사의 구글 나우(Google Now) 등은 음성인식을 이용한 대표적인 스마트폰 서비스이다.As the spread of smart phones increases, voice recognition technology is being used in various ways. Apple's Siri and Google Now's Google Now are typical smartphone services using voice recognition.
주변이 조용한 상황에서는 이러한 음성인식 서비스의 인식률도 높고, 일반 통화상황에서도 상대방의 음성을 잘 들을 수 있으나, 주변이 시끄러운 상황이나 바람소리 등이 사용자의 음성에 섞여서 스마트폰에 입력되는 경우에는 음성인식 서비스의 음성 인식률이 떨어지고, 상대방의 목소리를 잘 인식할 수 없게 되기도 한다.When the surroundings are quiet, the recognition rate of the voice recognition service is high, and even in a general call situation, the other party's voice can be heard well, but when the surroundings are noisy and the wind is mixed with the user's voice and input into the smartphone, the voice recognition is performed. The voice recognition rate of the service may be lowered and the voice of the other party may not be recognized well.
이렇게 바람소리가 혼합된 경우에 종래 기술은 단순히 LPF(Low Pass Filter)나 HPF(High Pass Filter)를 사용하여 신호의 특정 대역을 깎아내는 방식으로 바람소리를 줄이려는 시도를 하였다.When the wind noise is mixed, the prior art attempts to reduce the wind noise by simply cutting out a specific band of a signal by using a low pass filter (LPF) or a high pass filter (HPF).
대한민국 출원번호 제10-2005-0120682호 발명은 바람소리를 레벨에 따라 자동으로 제거하는 방법에 관한 것으로 혼합신호를 로우패스필터로 필터링 하고 그 레벨을 측정하여 측정된 레벨에 따라 제어신호를 생성하고 하이패스필터를 거쳐 바람소리를 제거하려는 발명이다.Republic of Korea Application No. 10-2005-0120682 The present invention relates to a method for automatically removing the wind noise according to the level to filter the mixed signal with a low pass filter and to measure the level to generate a control signal according to the measured level The invention is to remove wind noise through a high pass filter.
그러나 이렇게 단순한 필터링 방법에 의할 경우 바람소리뿐 아니라 사용자의 음성 대역에도 손실이 생기기 때문에 음성 인식률이 향상되지 못하는 문제점이 존재한다.However, there is a problem that the speech recognition rate is not improved because the simple filtering method causes loss not only in wind noise but also in user's voice band.
본 발명은 전술한 바와 같은 기술적 배경에서 안출된 것으로서, 선행 신호대잡음비와 사후 신호대잡음비를 이용하여 필터계수를 구하고 이를 이용하여 바람소리를 제거하는 장치와 방법을 제공하는 것을 그 목적으로 한다.An object of the present invention is to provide a device and method for obtaining a filter coefficient using the preceding signal-to-noise ratio and the post-signal-to-noise ratio and removing wind noise using the same.
본 발명의 목적은 이상에서 언급한 목적으로 제한되지 않으며, 언급되지 않은 또 다른 목적들은 아래의 기재로부터 당업자에게 명확하게 이해될 수 있을 것이다.The object of the present invention is not limited to the above-mentioned object, and other objects that are not mentioned will be clearly understood by those skilled in the art from the following description.
전술한 본 발명의 목적을 달성하기 위한 본 발명의 일면에 따른 잡음제거 방법은, 음성신호와 잡음신호를 포함하는 혼합신호를 입력받는 단계; 상기 혼합신호 중 상기 음성 신호가 없는 구간을 이용하여 상기 잡음신호를 구하는 단계; 상기 잡음신호와 상기 혼합신호를 이용하여 사후 신호대잡음비를 구하는 단계; 상기 사후 신호대 잡음비와 이전 프레임의 잡음신호와 이전프레임의 선행 신호대잡음비를 이용하여 현재 프레임의 선행 신호대잡음비를 추정하는 단계; 상기 추정된 선행 신호대잡음비를 이용하여 가중치 값을 계산하는 단계; 상기 계산된 가중치값을 이용하여 각 주파수 별 필터값을 계산하는 단계; 및 상기 계산된 필터값을 상기 혼합신호에 곱하여 향상된 상기 추정 음성신호를 구하는 단계를 포함하는 것을 특징으로 한다.Noise reduction method according to an aspect of the present invention for achieving the above object of the present invention, receiving a mixed signal including a voice signal and a noise signal; Obtaining the noise signal using a section in which the voice signal is absent among the mixed signals; Obtaining a post-signal-to-noise ratio using the noise signal and the mixed signal; Estimating a preceding signal-to-noise ratio of the current frame using the post-signal-to-noise ratio, the noise signal of the previous frame, and the preceding signal-to-noise ratio of the previous frame; Calculating a weight value using the estimated preceding signal to noise ratio; Calculating a filter value for each frequency using the calculated weight value; And multiplying the calculated filter value by the mixed signal to obtain the improved estimated speech signal.
본 발명의 다른 일면에 따른 잡음제거 장치는, 하나이상의 프로세서를 포함하고, 상기 프로세서는 음성신호와 잡음신호를 포함하는 혼합신호를 입력받는 입력부; 상기 혼합신호를 주파수 영역 신호로 변환하는 주파수 신호 변환부; 상기 혼합신호 중 상기 음성 신호가 없는 구간을 이용하여 상기 잡음신호를 구하고, 상기 잡음신호와 상기 혼합신호를 이용하여 사후 신호대잡음비를 구하고, 상기 사후 신호대 잡음비와 이전 프레임의 잡음신호와 이전프레임의 선행 신호대잡음비를 이용하여 현재 프레임의 선행 신호대잡음비를 추정하고, 상기 추정된 선행 신호대잡음비를 이용하여 가중치 값을 계산하고, 상기 계산된 가중치값을 이용하여 각 주파수 별 필터값을 계산하는연산부; 상기 계산된 필터값을 상기 혼합신호에 곱하여 향상된 음성신호를 구하는 필터부; 상기 향상된 음성신호를 시간 영역 신호로 변환하는 시간 영역 신호 변환부; 를 포함하여 구현하는 것을 특징으로 한다.According to another aspect of the present invention, there is provided an apparatus for removing noise, comprising: an input unit configured to receive a mixed signal including a voice signal and a noise signal; A frequency signal converter for converting the mixed signal into a frequency domain signal; From the mixed signal, the noise signal is obtained using a section without the voice signal, and a post-signal-to-noise ratio is obtained using the noise signal and the mixed signal, and the post-signal-to-noise ratio, the noise signal of the previous frame, and the preceding of the previous frame. An estimator for estimating a preceding signal-to-noise ratio of the current frame using a signal-to-noise ratio, calculating a weight value using the estimated preceding signal-to-noise ratio, and calculating a filter value for each frequency using the calculated weight value; A filter unit to obtain an improved speech signal by multiplying the calculated filter value by the mixed signal; A time domain signal converter for converting the enhanced voice signal into a time domain signal; Characterized by the implementation including.
본 발명에 따르면, 선행 신호대잡음비와 사후 신호대잡음비를 이용하여 형성된 필터를 이용하여 바람소리가 섞인 신호를 필터링 함으로써 보다 향상된 음성향상 기술을 제공함으로써 음성인식률을 높이고 통화 시 음성의 명료도를 높일 수 있는 효과가 있다.According to the present invention, by using a filter formed by using the signal-to-noise ratio and the signal-to-noise ratio before and after the signal mixed with the wind noise to provide a more improved speech enhancement technology to increase the speech recognition rate and the speech intelligibility There is.
도 1은 본 발명의 일실시예에 따른 잡음제거 방법의 흐름도.1 is a flowchart of a noise removing method according to an embodiment of the present invention;
도 2는 본 발명의 일실시예에 따른 잡음제거 방법의 신호의 흐름을 나타낸 구조도.Figure 2 is a structural diagram showing the flow of the signal of the noise removing method according to an embodiment of the present invention.
도 3은 본 발명의 다른 실시예에 따른 잡음제거장치의 구조도.3 is a structural diagram of a noise removing device according to another embodiment of the present invention;
도 4는 본 발명에 또 다른 실시예에 따른 잡음제거 방법이 구현되는 컴퓨터장치의 구조도.4 is a structural diagram of a computer device in which a noise canceling method according to another embodiment of the present invention is implemented.
본 발명의 이점 및 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 후술되어 있는 실시예들을 참조하면 명확해질 것이다. 그러나 본 발명은 이하에서 개시되는 실시예들에 한정되는 것이 아니라 서로 다른 다양한 형태로 구현될 것이며, 단지 본 실시예들은 본 발명의 개시가 완전하도록 하며, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 발명의 범주를 완전하게 알려주기 위해 제공되는 것이며, 본 발명은 청구항의 범주에 의해 정의될 뿐이다. 한편, 본 명세서에서 사용된 용어는 실시예들을 설명하기 위한 것이며 본 발명을 제한하고자 하는 것은 아니다. 본 명세서에서, 단수형은 문구에서 특별히 언급하지 않는 한 복수형도 포함한다. 명세서에서 사용되는 "포함한다(comprises)" 및/또는 "포함하는(comprising)"은 언급된 구성소자, 단계, 동작 및/또는 소자는 하나 이상의 다른 구성소자, 단계, 동작 및/또는 소자의 존재 또는 추가를 배제하지 않는다.Advantages and features of the present invention and methods for achieving them will be apparent with reference to the embodiments described below in detail with the accompanying drawings. However, the present invention is not limited to the embodiments disclosed below, but may be implemented in various forms. It is provided to fully convey the scope of the invention to those skilled in the art, and the present invention is defined only by the scope of the claims. Meanwhile, the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. In this specification, the singular also includes the plural unless specifically stated otherwise in the phrase. As used herein, “comprises” and / or “comprising” refers to a component, step, operation and / or device that is present in one or more other components, steps, operations and / or elements. Or does not exclude additions.
이하, 본 발명의 바람직한 실시예에 대하여 첨부한 도면을 참조하여 상세히 설명하기로 한다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.
도 1은 본 발명의 일실시예에 따른 잡음제거 방법의 흐름도를 나타낸다.1 shows a flowchart of a noise removing method according to an embodiment of the present invention.
음성신호 향상을 위한 잡음제거를 위해 우선 혼합신호를 입력 받는다(S110).In order to remove the noise for improving the voice signal, the mixed signal is first received (S110).
입력 받은 혼합신호는 보통 시간 영역 신호이므로 이를 주파수 영역 신호로 바꾸기 위해 FFT(Fast Fourier Transform)연산을 수행한다. FFT연산을 거쳐 주파수 영역 신호로 바뀐 신호는 진폭(Magnitude)신호와 위상(Phase)신호로 구성되는데 본 발명에서는 진폭신호만으로 연산을 수행하므로 위상신호는 변형 없이 그대로 출력측으로 전달된다.Since the input mixed signal is usually a time domain signal, an FFT (Fast Fourier Transform) operation is performed to convert the mixed signal into a frequency domain signal. The signal changed into the frequency domain signal through the FFT operation is composed of a magnitude signal and a phase signal. In the present invention, the phase signal is transmitted to the output side without modification since the calculation is performed only with the amplitude signal.
선행 신호대잡음비(a priori SNR)를 구하기 위해서는 잡음신호와 혼합신호와 사후 신호대잡음비(a posteriori SNR)가 필요한데, 혼합신호만을 입력으로 받으므로 나머지 잡음신호와 사후 신호대잡음비는 혼합신호로부터 추정한다.In order to obtain a priori SNR, a noise signal, a mixed signal, and a posteriori SNR are required. Since only the mixed signal is input, the remaining noise signal and the post-signal noise ratio are estimated from the mixed signal.
우선 잡음신호는 혼합신호에서 음성이 없는 구간을 이용하여 구한다. 혼합신호에서 항상 사람의 목소리가 존재하는 것은 아니고, 따라서 혼합신호입력을 받기 시작한 후 짧은 구간은 사람의 목소리가 존재하지 않을 것이므로 이 구간을 잡음신호만 존재하는 것으로 가정하여 잡음신호를 구한다.First, a noise signal is obtained by using an interval without speech in a mixed signal. In the mixed signal, a human voice does not always exist. Therefore, a short section after receiving a mixed signal input will not have a human voice. Therefore, it is assumed that only a noise signal exists.
잡음신호를 구한 후 잡음신호와 혼합신호를 이용하여 사후 신호대잡음비를 구할 수 있는데, 사후 신호대잡음비는 다음 수학식 1과 같이 구할 수 있다(S120).After obtaining the noise signal, a post-signal-to-noise ratio may be obtained using the noise signal and the mixed signal. The post-signal-to-noise ratio may be obtained as in Equation 1 below (S120).
Figure PCTKR2015013970-appb-M000001
Figure PCTKR2015013970-appb-M000001
사후 신호대잡음비인
Figure PCTKR2015013970-appb-I000001
은 p번째 프레임과 k번째 주파수 인덱스에서의 사후 신호대잡음비를 나타내고, Y(p,k)와 N(p,k)는 각각 p번째 프레임과 k번째 주파수 인덱스에서의 혼합신호와 잡음신호를 나타낸다. 잡음신호는 전단계에서 가정한 값을 사용한다.
Post Signal to Noise Ratio
Figure PCTKR2015013970-appb-I000001
Denotes the post-signal-to-noise ratio at the p-th frame and the k-th frequency index, and Y (p, k) and N (p, k) represent the mixed signal and the noise signal at the p-th frame and the k-th frequency index, respectively. The noise signal uses the value assumed in the previous step.
계산한 사후 신호대잡음비를 이용하여 선행 신호대잡음비를 계산하고(S130), 수학식 2와 같이 구한다.The preceding signal-to-noise ratio is calculated using the calculated after-signal-to-noise ratio (S130), and is calculated as in Equation 2.
Figure PCTKR2015013970-appb-M000002
Figure PCTKR2015013970-appb-M000002
Figure PCTKR2015013970-appb-I000002
는 혼합신호에서 잡음신호를 제거한 추정된 음성신호를 말하는데 본 발명에 의한 계산이 시작되기 전의 음성신호는 0으로 초기화하고 해당 프레임의 음성신호를 추정한 후 다음 프레임부터 선행 신호대잡음비를 계산하는데 이용된다.
Figure PCTKR2015013970-appb-I000002
Refers to the estimated speech signal from which the noise signal is removed from the mixed signal. The speech signal before the calculation according to the present invention starts is initialized to 0, the speech signal of the corresponding frame is estimated, and used to calculate the preceding signal-to-noise ratio from the next frame. .
α는 미리 설정해놓는 비례계수 값으로, 음성신호를 추정함에 있어 직전 프레임의 추정 음성신호 및 잡음신호의 영향과 첫 번째 프레임부터 이전 프레임까지 누적된 사후 신호대잡음비의 영향을 조절하기 위한 값이다.α is a value of a preset coefficient that is used to adjust the influence of the estimated voice signal and the noise signal of the previous frame and the post signal-to-noise ratio accumulated from the first frame to the previous frame in estimating the voice signal.
즉 α는 0에서 1사이의 값인데 1에 가까울수록 직전 프레임의 값에 많은 영향을 받게 되고 0에 가까울수록 첫 번째 프레임부터 직전프레임까지 누적된 값에 의한 영향을 받게 되는, 다시말해 히스토리에 의한 영향이 커지는 것을 의미한다.That is, α is a value between 0 and 1, the closer to 1, the more affected by the value of the previous frame, and the closer to 0, the more affected by the accumulated value from the first frame to the previous frame. Means greater impact.
선행 신호대잡음비를 추출하면 이 값을 이용하여 가중치값을 계산하게 되고(S140), 가중치값은 수학식 3에 의해 구할 수 있다.When the preceding signal-to-noise ratio is extracted, the weight value is calculated using this value (S140), and the weight value can be obtained by Equation 3.
Figure PCTKR2015013970-appb-M000003
Figure PCTKR2015013970-appb-M000003
μ값은 가중치 파라미터이고, 선행 신호대잡음비 값이 크게 추정되면 음성신호의 크기가 크다는 의미이므로 가중치값도 커져야 하고 반대로 선행 신호대잡음비의 값이 작으면 잡음신호에 비해 음성신호도 작다는 의미이므로 가중치값도 작아져야 한다.μ is a weighting parameter. If the value of the preceding signal-to-noise ratio is large, it means that the size of the voice signal is large. Therefore, the weight value should be large. On the contrary, if the value of the preceding signal-to-noise ratio is small, the weight value is smaller than the noise signal. Should also be small.
이렇게 가중치값과 선행 신호대잡음비 값을 구하면 두 값을 이용하여 잡음제거에 이용되는 필터값 H(p,k)를 구할 수 있고(S150), 이는 수학식 4와 같다.When the weight value and the preceding signal-to-noise ratio value are obtained, the filter values H (p, k) used for noise reduction can be obtained using the two values (S150), which is shown in Equation 4.
Figure PCTKR2015013970-appb-M000004
Figure PCTKR2015013970-appb-M000004
Y(p,k)는 혼합신호를 나타내고, 전술한 바와 같이 이렇게 구한 추정된 음성신호
Figure PCTKR2015013970-appb-I000003
는 다음 프레임에서 선행 신호대잡음비를 구하는데 사용된다.
Y (p, k) represents a mixed signal and the estimated speech signal thus obtained as described above
Figure PCTKR2015013970-appb-I000003
Is used to find the preceding signal-to-noise ratio in the next frame.
도 2는 잡음신호가 섞인 혼합신호가 필터링을 거쳐 잡음신호가 감쇄된 신호를 출력하기까지 신호의 흐름도를 나타낸다.2 is a flowchart of a signal until a mixed signal mixed with a noise signal is filtered and outputs a signal in which the noise signal is attenuated.
최종적으로 추정된 음성신호는 음성신호의 진폭신호이므로 변형을 거치지 않은 음성신호의 위상신호와 함께 IFFT(Inverse Fast Fourier Transform)방법으로 시간 영역 신호로 바뀌어 사용자에게 잡음이 제거된 신호를 제공하게 된다.Finally, since the estimated speech signal is an amplitude signal of the speech signal, it is converted into a time domain signal by IFFT (Inverse Fast Fourier Transform) method together with the phase signal of the speech signal which has not been transformed to provide a signal from which noise is removed.
이렇게 선행 신호대잡음비를 추정하여 잡음을 제거를 하는 경우 기존 LPF 등의 단순한 필터로 잡음을 제거하는 것 보다 더 뛰어난 잡음제거 효과를 얻을 수 있다.When the noise is removed by estimating the preceding signal-to-noise ratio, the noise reduction effect is superior to that of the conventional LPF filter.
도 3은 본 발명의 다른 실시예에 따른 잡음제거 장치의 구조도를 나타낸다.3 is a structural diagram of a noise removing device according to another embodiment of the present invention.
입력부(310)는 음성신호와 잡음신호가 혼재된 혼합신호를 입력받는다. 입력부는 마이크 등으로 구성될 수도 있고 음성파일이나 동영상파일 등 파일형태의 입력을 받아 음성신호인 혼합신호만을 추출할 수도 있다.The input unit 310 receives a mixed signal in which a voice signal and a noise signal are mixed. The input unit may be composed of a microphone or the like, or may receive only a mixed signal, which is a voice signal, by receiving an input in the form of a file such as a voice file or a video file.
본 발명은 신호를 주파수 영역에서 처리하기 때문에 주파수신호 변환부(320)는 입력받은 신호를 FFT 등의 방법을 통해 주파수 신호로 변환한다. 주파수 신호 변환은 FFT뿐 아니라 DFT(Discrete Fourier Transform), DCT(Discrete Cosine Transform), Filterbank 등의 방법을 사용할 수 있다.In the present invention, since the signal is processed in the frequency domain, the frequency signal converter 320 converts the received signal into a frequency signal through a method such as an FFT. The frequency signal transformation may use methods such as Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT), and Filterbank as well as FFT.
연산부(330)는 잡음제거를 위한 필터값을 입력신호로부터 추출한다.The calculator 330 extracts a filter value for noise removal from the input signal.
입력받은 혼합신호와 잡음신호로부터 우선 사후 신호대잡음비를 구하고 이 과정은 전술한 수학식 1과 같다. 혼합신호에서 음성신호와 잡음신호를 구분하는 것은 불가능하지만, 초기 입력신호에서는 음성신호가 존재하지 않는 것으로 가정하여 이 구간의 신호를 잡음신호로 가정하여 사후 신호대잡음비를 계산하는 것이다.The signal to noise ratio is first obtained from the input mixed signal and the noise signal, and the process is as shown in Equation 1 above. Although it is impossible to distinguish between a speech signal and a noise signal in a mixed signal, it is assumed that a voice signal does not exist in the initial input signal, and then a signal of the post-noise ratio is calculated by assuming that the signal in this section is a noise signal.
이렇게 계산한 사후 신호대잡음비와 이전 프레임에서 추정한 음성신호와 앞에서 구한 잡음신호의 평균값을 이용하여 선행 신호대잡음비를 구한다. 이 과정에서 비례계수값을 이용하여 직전프레임의 추정값과 직전프레임을 포함한 이전 프레임들의 히스토리값이 선행 신호대잡음비에 영향을 주는 비율을 조절할 수 있다.The preceding signal-to-noise ratio is obtained by using the calculated post-signal-to-noise ratio, the average value of the speech signal estimated from the previous frame, and the previously obtained noise signal. In this process, the proportional coefficient value may be used to adjust the rate at which the estimated value of the previous frame and the history value of previous frames including the previous frame affect the preceding signal-to-noise ratio.
직전프레임 값의 비율을 높이는 경우에는 프레임 간 변화에 민감하게 변화할 수 있는 장점이 있으나 잦은 변화로 인해 사용자에게 불편함을 초래할 수 있고, 히스토리값의 비율을 높이는 경우에는 급격한 변화를 억제할 수 있어 자연스러운 음성신호를 들을 수 있으나 시간적으로 빨리 변화하는 신호에 신속하게 대응하지 못하는 단점이 있으므로 실험에 의해 둘 사이의 최적값을 결정하여 사용할 수 있다.Increasing the ratio of the previous frame value has the advantage of being sensitive to the change between frames, but it can cause inconvenience to the user due to frequent changes, and abrupt change can be suppressed when increasing the ratio of the history value. You can hear a natural voice signal, but can not respond quickly to a signal that changes quickly in time, so it can be used to determine the optimal value between the two by experiment.
수학식 2에 의해 선행 신호대잡음비를 구하면 가중치값을 구할 수 있는데, 선행 신호대잡음비가 크면 음성신호가 큰것으로 예상되는 것이므로 가중치값을 크게하고 반대인 경우 가중치값을 작게 하여 잡음신호의 영향을 줄이기 위함이다. 가중치값은 수학식 3에 의해 구한다.Equation 2 can be used to obtain the weighted value by calculating the preceding signal-to-noise ratio. When the preceding signal-to-noise ratio is large, the speech signal is expected to be large. Therefore, the weighted value is increased and the weighted value is reduced to reduce the influence of the noise signal. to be. The weight value is obtained by the equation (3).
가중치값과 선행 신호대잡음비 값을 이용하여 최종적으로 필터값을 구할 수 있고 이는 수학식 4와 같다.The filter value can be finally obtained using the weight value and the preceding signal-to-noise ratio value.
필터부(340)는 이렇게 구한 필터값을 혼합신호에 곱하여 잡음이 제거된 신호를 구하게 되고 이 과정은 수학식 5와 같다.The filter unit 340 multiplies the thus obtained filter value by the mixed signal to obtain a signal from which the noise is removed.
필터부(340)를 거쳐 잡음이 제거된 신호는 주파수 영역의 신호이기 때문에 마지막으로 시간신호 변환부(350)를 거쳐 음성신호를 시간영역 신호로 변환하여 출력부에 제공함으로써 사용자가 잡음이 제거된 신호를 들을 수 있다.Since the signal from which the noise is removed through the filter unit 340 is a signal in the frequency domain, the user finally removes the noise by converting the voice signal into a time domain signal through the time signal converter 350 and providing it to the output unit. You can hear the signal.
시간신호 변환부(350)는 IFFT, IDFT(Inverse DFT), IDCT(Inverse DCT), Inverse Filterbank등의 방법을 사용하여 주파수 영역 신호를 시간영역 신호로 변환할 수 있다.The time signal converter 350 may convert a frequency domain signal into a time domain signal using a method such as IFFT, IDFT (Inverse DFT), IDCT (Inverse DCT), Inverse Filterbank, or the like.
한편, 본 발명의 실시예에 잡음제거 방법은 컴퓨터 시스템에서 구현되거나, 또는 기록매체에 기록될 수 있다. 도 4에 도시된 바와 같이, 컴퓨터 시스템은 적어도 하나 이상의 프로세서(421)와, 메모리(423)와, 사용자 입력 장치(426)와, 데이터 통신 버스(422)와, 사용자 출력 장치(427)와, 저장소(428)를 포함할 수 있다. 전술한 각각의 구성 요소는 데이터 통신 버스(422)를 통해 데이터 통신을 한다.On the other hand, the noise reduction method in an embodiment of the present invention may be implemented in a computer system or recorded on a recording medium. As shown in FIG. 4, a computer system includes at least one processor 421, a memory 423, a user input device 426, a data communication bus 422, a user output device 427, It may include a reservoir 428. Each of the components described above communicates data via a data communication bus 422.
컴퓨터 시스템은 네트워크에 커플링된 네트워크 인터페이스(429)를 더 포함할 수 있다. 상기 프로세서(421)는 중앙처리 장치(central processing unit (CPU))이거나, 혹은 메모리(423) 및/또는 저장소(428)에 저장된 명령어를 처리하는 반도체 장치일 수 있다. The computer system can further include a network interface 429 coupled to the network. The processor 421 may be a central processing unit (CPU) or a semiconductor device that processes instructions stored in the memory 423 and / or the storage 428.
상기 메모리(423) 및 상기 저장소(428)는 다양한 형태의 휘발성 혹은 비휘발성 저장매체를 포함할 수 있다. 예컨대, 상기 메모리(423)는 ROM(424) 및 RAM(425)을 포함할 수 있다.The memory 423 and the storage 428 may include various types of volatile or nonvolatile storage media. For example, the memory 423 may include a ROM 424 and a RAM 425.
따라서, 본 발명의 실시예에 따른 잡음제거 방법은 컴퓨터에서 실행 가능한 방법으로 구현될 수 있다. 본 발명의 실시예에 따른 잡음제거 방법이 컴퓨터 장치에서 수행될 때, 컴퓨터로 판독 가능한 명령어들이 본 발명에 따른 인식 방법을 수행할 수 있다.Therefore, the noise reduction method according to the embodiment of the present invention can be implemented in a computer executable method. When the noise canceling method according to an embodiment of the present invention is performed in a computer device, computer readable instructions may perform the recognition method according to the present invention.
한편, 상술한 본 발명에 따른 잡음제거 방법은 컴퓨터로 읽을 수 있는 기록매체에 컴퓨터가 읽을 수 있는 코드로서 구현되는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록 매체로는 컴퓨터 시스템에 의하여 해독될 수 있는 데이터가 저장된 모든 종류의 기록 매체를 포함한다. 예를 들어, ROM(Read Only Memory), RAM(Random Access Memory), 자기 테이프, 자기 디스크, 플래시 메모리, 광 데이터 저장장치 등이 있을 수 있다. 또한, 컴퓨터로 판독 가능한 기록매체는 컴퓨터 통신망으로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 읽을 수 있는 코드로서 저장되고 실행될 수 있다.Meanwhile, the noise canceling method according to the present invention described above may be embodied as computer readable codes on a computer readable recording medium. Computer-readable recording media include all kinds of recording media having data stored thereon that can be decrypted by a computer system. For example, there may be a read only memory (ROM), a random access memory (RAM), a magnetic tape, a magnetic disk, a flash memory, an optical data storage device, and the like. The computer readable recording medium can also be distributed over computer systems connected over a computer network, stored and executed as readable code in a distributed fashion.
이상, 본 발명의 구성에 대하여 첨부 도면을 참조하여 상세히 설명하였으나, 이는 예시에 불과한 것으로서, 본 발명이 속하는 기술분야에 통상의 지식을 가진자라면 본 발명의 기술적 사상의 범위 내에서 다양한 변형과 변경이 가능함은 물론이다. 따라서 본 발명의 보호 범위는 전술한 실시예에 국한되어서는 아니되며 이하의 특허청구범위의 기재에 의하여 정해져야 할 것이다.In the above, the configuration of the present invention has been described in detail with reference to the accompanying drawings, which are merely examples, and those skilled in the art to which the present invention pertains various modifications and changes within the scope of the technical idea of the present invention. Of course this is possible. Therefore, the protection scope of the present invention should not be limited to the above-described embodiment but should be defined by the following claims.

Claims (10)

  1. 음성신호와 잡음신호를 포함하는 혼합신호를 입력받는 단계;Receiving a mixed signal including a voice signal and a noise signal;
    상기 혼합신호 중 상기 음성 신호가 없는 구간을 이용하여 상기 잡음신호를 구하는 단계;Obtaining the noise signal using a section in which the voice signal is absent among the mixed signals;
    상기 잡음신호와 상기 혼합신호를 이용하여 사후 신호대잡음비를 구하는 단계;Obtaining a post-signal-to-noise ratio using the noise signal and the mixed signal;
    상기 사후 신호대 잡음비와 이전 프레임의 잡음신호와 이전프레임의 선행 신호대잡음비를 이용하여 현재 프레임의 선행 신호대잡음비를 추정하는 단계;Estimating a preceding signal-to-noise ratio of the current frame using the post-signal-to-noise ratio, the noise signal of the previous frame, and the preceding signal-to-noise ratio of the previous frame;
    상기 추정된 선행 신호대잡음비를 이용하여 가중치 값을 계산하는 단계;Calculating a weight value using the estimated preceding signal to noise ratio;
    상기 계산된 가중치값을 이용하여 각 주파수 별 필터값을 계산하는 단계; 및Calculating a filter value for each frequency using the calculated weight value; And
    상기 계산된 필터값을 상기 혼합신호에 곱하여 향상된 상기 추정 음성신호를 구하는 단계;Multiplying the calculated filter value by the mixed signal to obtain an improved estimated speech signal;
    를 포함하는 잡음제거 방법.Noise reduction method comprising a.
  2. 제1항에 있어서, 상기 사후 신호대잡음비를 구하는 단계는The method of claim 1, wherein the post-signal to noise ratio is calculated by
    상기 혼합신호의 크기를 상기 잡음신호의 크기로 나눈 값을 사후 신호대잡음비로 하는 것Dividing the magnitude of the mixed signal by the magnitude of the noise signal as a post-signal-to-noise ratio
    인 잡음제거 방법.Noise reduction method.
  3. 제1항에 있어서, 상기 현재 프레임의 선행 신호대 잡음비는The method of claim 1, wherein the preceding signal to noise ratio of the current frame is
    상기 이전프레임의 추정된 음성신호의 크기를 제곱한 값을 상기 잡음신호의 크기를 제곱한 값의 평균값으로 나눈 다음 기설정된 비례계수를 곱한값과,A value obtained by dividing the squared magnitude of the estimated speech signal of the previous frame by an average value of the squared magnitude of the noise signal, and multiplying a predetermined proportional coefficient by
    상기 사후 신호대잡음비에서 1을 뺀 값과 0 중 큰 값에 1에서 상기 기설정된 비례계수를 뺀 값을 곱한 값을 첫 번째 프레임에서 이전 프레임까지 모두 더한 값을 더한 값으로 하는 것A value obtained by subtracting 1 from the post-signal-to-noise ratio and a larger value of 0 times a value obtained by subtracting the predetermined proportional coefficient from 1 is obtained by adding up the sum of the first frame to the previous frame.
    인 잡음제거 방법.Noise reduction method.
  4. 제1항에 있어서, 상기 가중치 값은The method of claim 1, wherein the weight value is
    상기 현재 프레임의 선행 신호대잡음비를 제곱한 값과 상기 현재 프레임의 선행 신호대잡음비의 절대값을 더한 값의 제곱근 값을 상기 현재 프레임의 선행 신호대잡음비의 절대값으로 나눈 값으로 하는 것The square root of the sum of the squares of the preceding signal-to-noise ratios of the current frame and the absolute value of the preceding signal-to-noise ratios of the current frame is divided by the absolute value of the preceding signal-to-noise ratios of the current frame.
    인 잡음제거 방법.Noise reduction method.
  5. 제1항에 있어서, 상기 필터값은The method of claim 1, wherein the filter value
    상기 선행 신호대잡음비와 상기 가중치를 곱한 값을 상기 선행 신호대잡음비와 상기 가중치를 곱한 값에 1을 더한 값으로 나눈 값으로 하는 것The value obtained by multiplying the preceding signal-to-noise ratio by the weight is divided by the value obtained by multiplying the preceding signal-to-noise ratio by the weight.
    인 잡음제거 방법.Noise reduction method.
  6. 하나이상의 프로세서를 포함하는 잡음제거 장치에 있어서, 상기 프로세서는A noise reduction device comprising at least one processor, the processor comprising:
    음성신호와 잡음신호를 포함하는 혼합신호를 입력받는 입력부;An input unit configured to receive a mixed signal including a voice signal and a noise signal;
    상기 혼합신호를 주파수 영역 신호로 변환하는 주파수 신호 변환부;A frequency signal converter for converting the mixed signal into a frequency domain signal;
    상기 혼합신호 중 상기 음성 신호가 없는 구간을 이용하여 상기 잡음신호를 구하고, 상기 잡음신호와 상기 혼합신호를 이용하여 사후 신호대잡음비를 구하고, 상기 사후 신호대 잡음비와 이전 프레임의 잡음신호와 이전프레임의 선행 신호대잡음비를 이용하여 현재 프레임의 선행 신호대잡음비를 추정하고, 상기 추정된 선행 신호대잡음비를 이용하여 가중치 값을 계산하고, 상기 계산된 가중치값을 이용하여 각 주파수 별 필터값을 계산하는연산부;From the mixed signal, the noise signal is obtained using a section without the voice signal, and a post-signal-to-noise ratio is obtained using the noise signal and the mixed signal, and the post-signal-to-noise ratio, the noise signal of the previous frame, and the preceding of the previous frame. An estimator for estimating a preceding signal-to-noise ratio of the current frame using a signal-to-noise ratio, calculating a weight value using the estimated preceding signal-to-noise ratio, and calculating a filter value for each frequency using the calculated weight value;
    상기 계산된 필터값을 상기 혼합신호에 곱하여 향상된 음성신호를 구하는 필터부;A filter unit to obtain an improved speech signal by multiplying the calculated filter value by the mixed signal;
    상기 향상된 음성신호를 시간 영역 신호로 변환하는 시간 영역 신호 변환부;A time domain signal converter for converting the enhanced voice signal into a time domain signal;
    를 포함하여 구현하는 것인 잡음제거 장치.Noise canceling device to implement including.
  7. 제6항에 있어서, 상기 연산부는The method of claim 6, wherein the operation unit
    상기 혼합신호의 크기를 상기 잡음신호의 크기로 나눈 값을 사후 신호대잡음비로 하는 것Dividing the magnitude of the mixed signal by the magnitude of the noise signal as a post-signal-to-noise ratio
    인 잡음제거 장치.Noise reduction device.
  8. 제6항에 있어서, 상기 연산부는The method of claim 6, wherein the operation unit
    상기 이전프레임의 추정된 음성신호의 크기를 제곱한 값을 상기 잡음신호의 크기를 제곱한 값의 평균값으로 나눈 다음 기설정된 비례계수를 곱한값과,A value obtained by dividing the squared magnitude of the estimated speech signal of the previous frame by an average value of the squared magnitude of the noise signal, and multiplying a predetermined proportional coefficient by
    상기 사후 신호대잡음비에서 1을 뺀 값과 0 중 큰 값에 1에서 상기 기설정된 비례계수를 뺀 값을 곱한 값을 첫 번째 프레임에서 이전 프레임까지 모두 더한 값을 상기 선행 신호대 잡음비로 하는 것The preceding signal-to-noise ratio is obtained by subtracting 1 from the post-signal-to-noise ratio and multiplying the larger value of 0 by the value obtained by subtracting the predetermined proportional coefficient from 1 to the previous frame.
    인 잡음제거 장치.Noise reduction device.
  9. 제6항에 있어서, 상기 연산부는The method of claim 6, wherein the operation unit
    상기 현재 프레임의 선행 신호대잡음비를 제곱한 값과 상기 현재 프레임의 선행 신호대잡음비의 절대값을 더한 값의 제곱근 값을 상기 현재 프레임의 선행 신호대잡음비의 절대값으로 나눈 값을 상기 가중치값으로 하는 것The weighted value is obtained by dividing the square root of the sum of the square of the preceding signal-to-noise ratio of the current frame and the absolute value of the preceding signal-to-noise ratio of the current frame by the absolute value of the preceding signal-to-noise ratio of the current frame.
    인 잡음제거 장치.Noise reduction device.
  10. 제6항에 있어서, 상기 연산부는The method of claim 6, wherein the operation unit
    상기 선행 신호대잡음비와 상기 가중치를 곱한 값을 상기 선행 신호대잡음비와 상기 가중치를 곱한 값에 1을 더한 값으로 나눈 값을 상기 필터값으로 하는 것The filter value is obtained by dividing the value obtained by multiplying the preceding signal-to-noise ratio by the weight by the value obtained by multiplying the value obtained by multiplying the preceding signal-to-noise ratio by the weight.
    인 잡음제거 장치.Noise reduction device.
PCT/KR2015/013970 2015-12-18 2015-12-18 Noise removal device and method therefor WO2017104876A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020150181624A KR101741141B1 (en) 2015-12-18 2015-12-18 Apparatus for suppressing noise and method thereof
KR10-2015-0181624 2015-12-18

Publications (1)

Publication Number Publication Date
WO2017104876A1 true WO2017104876A1 (en) 2017-06-22

Family

ID=59053261

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2015/013970 WO2017104876A1 (en) 2015-12-18 2015-12-18 Noise removal device and method therefor

Country Status (2)

Country Link
KR (1) KR101741141B1 (en)
WO (1) WO2017104876A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115881155A (en) * 2022-12-02 2023-03-31 宁波硕正电子科技有限公司 Transient noise suppression method, device, equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20080044743A (en) * 2006-11-17 2008-05-21 한국전자통신연구원 Apparatus and method to reduce a noise for voip service
US20080167866A1 (en) * 2007-01-04 2008-07-10 Harman International Industries, Inc. Spectro-temporal varying approach for speech enhancement
KR20080075362A (en) * 2007-02-12 2008-08-18 인하대학교 산학협력단 A method for obtaining an estimated speech signal in noisy environments
KR20130024156A (en) * 2011-08-30 2013-03-08 광주과학기술원 Apparatus and method for eliminating noise
CN103871421A (en) * 2014-03-21 2014-06-18 厦门莱亚特医疗器械有限公司 Self-adaptive denoising method and system based on sub-band noise analysis

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20080044743A (en) * 2006-11-17 2008-05-21 한국전자통신연구원 Apparatus and method to reduce a noise for voip service
US20080167866A1 (en) * 2007-01-04 2008-07-10 Harman International Industries, Inc. Spectro-temporal varying approach for speech enhancement
KR20080075362A (en) * 2007-02-12 2008-08-18 인하대학교 산학협력단 A method for obtaining an estimated speech signal in noisy environments
KR20130024156A (en) * 2011-08-30 2013-03-08 광주과학기술원 Apparatus and method for eliminating noise
CN103871421A (en) * 2014-03-21 2014-06-18 厦门莱亚特医疗器械有限公司 Self-adaptive denoising method and system based on sub-band noise analysis

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115881155A (en) * 2022-12-02 2023-03-31 宁波硕正电子科技有限公司 Transient noise suppression method, device, equipment and storage medium

Also Published As

Publication number Publication date
KR101741141B1 (en) 2017-05-29

Similar Documents

Publication Publication Date Title
US11056130B2 (en) Speech enhancement method and apparatus, device and storage medium
CA2679476C (en) Method of estimating noise levels in a communication system
US9343056B1 (en) Wind noise detection and suppression
EP1086453B1 (en) Noise suppression using external voice activity detection
US6968064B1 (en) Adaptive thresholds in acoustic echo canceller for use during double talk
US20110124380A1 (en) Method and system for double-end talk detection, and method and system for echo elimination
CN102160296B (en) Method and apparatus for detecting double talk
CN112004177B (en) Howling detection method, microphone volume adjustment method and storage medium
US5848151A (en) Acoustical echo canceller having an adaptive filter with passage into the frequency domain
WO1995023477A1 (en) Doubletalk detection by means of spectral content
JP6833616B2 (en) Echo suppression device, echo suppression method and echo suppression program
CN110556125B (en) Feature extraction method and device based on voice signal and computer storage medium
CN111383647B (en) Voice signal processing method and device and readable storage medium
WO2015184499A1 (en) Reducing instantaneous wind noise
CN103824563A (en) Hearing aid denoising device and method based on module multiplexing
CN110503973B (en) Audio signal transient noise suppression method, system and storage medium
RU2767297C1 (en) Echo cancellation device, echo cancellation method and echo cancellation program
WO2017104876A1 (en) Noise removal device and method therefor
CN108053834B (en) Audio data processing method, device, terminal and system
CN110890099A (en) Sound signal processing method, device and storage medium
CN112637833B (en) Communication terminal information detection method and equipment
WO2012053809A2 (en) Method and system based on voice communication for eliminating interference noise
US10692514B2 (en) Single channel noise reduction
WO2021091357A1 (en) Auditory compensation method for hearing aid
CN112489669A (en) Audio signal processing method, device, equipment and medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15910799

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15910799

Country of ref document: EP

Kind code of ref document: A1