CN104867498A - Mobile communication terminal and voice enhancement method and module thereof - Google Patents
Mobile communication terminal and voice enhancement method and module thereof Download PDFInfo
- Publication number
- CN104867498A CN104867498A CN201510164111.0A CN201510164111A CN104867498A CN 104867498 A CN104867498 A CN 104867498A CN 201510164111 A CN201510164111 A CN 201510164111A CN 104867498 A CN104867498 A CN 104867498A
- Authority
- CN
- China
- Prior art keywords
- voice
- noise
- subband
- detection
- ratio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Telephone Function (AREA)
Abstract
The invention provides a voice enhancement method comprising the following steps that S101, each frame of input voice is transformed to a frequency domain and divided into multiple sub-bands; S103, signal-to-noise ratio of each sub-band is estimated; S105, voice and noise detection is performed; S107, and gain of the sub-bands is modified according to the size of the signal-to-noise ratio of each sub-band. The invention also provides a voice enhancement module and a mobile communication terminal. The voice enhancement method and module have significant noise suppression effect, and noise can be eliminated so that computation amount of the whole system is greatly reduced. A better voice communication effect can be acquired by utilizing the voice enhancement module and the mobile communication terminal so that environment applicability is relatively high.
Description
Technical field
The present invention relates to the communication technology, particularly relate to a kind of communication terminal and sound enhancement method thereof and module.
Background technology
In speech communication, neighbourhood noise (as air-conditioning, fan, computing machine, noisy environment etc.) very easily produces interference to the voice of speaker, thus voice quality is declined, and affects the performance of whole communication system.For solving this problem, usually adopt speech enhan-cement (or being called squelch) technology.
Since the latter stage seventies, studying various speech enhancement technique both at home and abroad always, propose spectrum subtraction method, Wiener Filtering, kalman filter method etc., and be applied in actual communication system, these technology can improve the voice quality in communication preferably.
As shown in Figure 1, for the process flow diagram of noise estimation method followed the tracks of based on minimum value adopted in prior art, the method first carries out filtering with an optimal smoothing filtering to the power spectrum of noisy speech, obtain the guestimate of a noise, then the minimum value in the certain hour window in rough noise is found out, finally some drift correction are carried out to this minimum value, namely obtain the variance of the noise that will estimate.
But there is following shortcoming in this method: one, system operations amount are large; Two, require ground unrest held stationary, signal to noise ratio (S/N ratio) is higher, is difficult to match with actual conditions.
Summary of the invention
Based on this, be necessary the problems referred to above existed for prior art, provide a kind of communication terminal and sound enhancement method thereof and module, to solve prior art Problems existing.
A kind of sound enhancement method, it comprises the steps: S101, every frame is inputted phonetic modification to frequency domain, and is divided into multiple subband; S103, estimate the signal to noise ratio (S/N ratio) of each described subband; S105, carry out the detection of voice and noise; S 107, size according to the signal to noise ratio (S/N ratio) of each described subband, the gain of amendment subband.
In the present invention one better embodiment, in step S101, by every frame input phonetic modification to frequency domain, and be divided into 16 subbands.
In the present invention one better embodiment, in step S105, utilize voice to estimate detection that mechanism carries out voice and noise.
In the present invention one better embodiment, adopt update_flag as the mark of speech detection, advanced row speech enhan-cement, then enters speech detection, then according to the result of speech detection and the feature of voice signal, carries out auto level control.
The present invention provides a kind of speech enhan-cement module in addition, it comprises the voice-input unit, Audio Processing Unit and the voice-output unit that connect successively, voice input described Audio Processing Unit from described voice-input unit, every frame is inputted phonetic modification to frequency domain by described Audio Processing Unit, and be divided into multiple subband, then the signal to noise ratio (S/N ratio) of each described subband is estimated, carry out the detection of voice and noise again, finally according to the size of the signal to noise ratio (S/N ratio) of each described subband, the gain of amendment subband, and export via described voice-output unit.
In the present invention one better embodiment, every frame is inputted phonetic modification to frequency domain by described Audio Processing Unit, and is divided into 16 subbands.
In the present invention one better embodiment, utilize voice the to estimate detection that mechanism carries out voice and noise of described Audio Processing Unit.
The present invention also provides a kind of communication terminal, and it comprises above-mentioned speech enhan-cement module.
Compared to prior art, sound enhancement method provided by the invention and module tool have the following advantages: one, noise suppression effect are remarkable, can stress release treatment; Two, in low signal-to-noise ratio situation, system performance declines less; Three, not only can suppress common noise, also can suppress narrow band noise, unexpected very noisy, and have the ability suppressing nonstationary noise; Four, voice enhancement algorithm, VAD, ALC three organically combine together, greatly reduce the calculated amount of whole system.Utilize the communication terminal of described speech enhan-cement module can obtain preferably speech communication effect, the applicability of environment is higher.
Accompanying drawing explanation
Fig. 1 is the process flow diagram of the noise estimation method based on minimum value tracking that prior art adopts;
Fig. 2 is the process flow diagram of sound enhancement method provided by the invention;
Fig. 3 is the workflow diagram of sound enhancement method described in Fig. 2;
Fig. 4 is the schematic diagram of speech enhan-cement module provided by the invention.
Embodiment
For the ease of understanding the present invention, below with reference to relevant drawings, the present invention is described more fully.Better embodiment of the present invention is given in accompanying drawing.But the present invention can realize in many different forms, is not limited to embodiment described herein.On the contrary, provide the object of these embodiments be make to disclosure of the present invention understand more thorough comprehensively.
It should be noted that, when element is called as " being fixed on " another element, directly can there is element placed in the middle in it on another element or also.When an element is considered to " connection " another element, it can be directly connected to another element or may there is centering elements simultaneously.Term as used herein " vertical ", " level ", " left side ", " right side " and similar statement just for illustrative purposes, do not represent it is unique embodiment.
Unless otherwise defined, all technology used herein and scientific terminology are identical with belonging to the implication that those skilled in the art of the present invention understand usually.The object of term used in the description of the invention herein just in order to describe concrete embodiment, is not intended to be restriction the present invention.Term as used herein " and/or " comprise arbitrary and all combinations of one or more relevant Listed Items.
Refer to Fig. 2, the invention provides a kind of sound enhancement method, it comprises the steps: S101, every frame is inputted phonetic modification to frequency domain, and is divided into multiple subband; S103, estimate the signal to noise ratio (S/N ratio) of each described subband; S105, carry out the detection of voice and noise; S107, size according to the signal to noise ratio (S/N ratio) of each described subband, the gain of amendment subband.
In the present embodiment, by every frame input phonetic modification to frequency domain, and be divided into 16 subbands; Then the signal to noise ratio (S/N ratio) of each subband is estimated; The detection that (Voice Metric) mechanism carries out voice and noise estimated in recycling voice, realizes the accurate estimation of ground unrest; Finally according to the size of the signal to noise ratio (S/N ratio) of each described subband, the gain of amendment subband, thus realize the object of squelch.
Further, refer to Fig. 3, the present invention, by computing in subband after Fourier transform, then obtains time domain output signal through inverse Fourier transform.Particularly, for making full use of the resource of voice enhancement algorithm, the present invention adopts the update_flag in voice enhancement algorithm as the mark of VAD (Voice Activity Detection, voice activity detection, also known as speech terminals detection or speech endpoint detection).Advanced row speech enhan-cement, then carries out VAD, thus makes VAD better effects if herein.Again according to the result of VAD and the feature of voice signal, effective auto level control (Automatic Level Control, ALC) can be carried out.
The ultimate principle of auto level control is: current energy and long-term average energy are compared, thus determines it is strengthen or decay voice, upgrades according to current energy to long-term average energy simultaneously.
Described sound enhancement method adopts the voice enhancement algorithm of the improvement based on spectrum subtraction and subband combine with technique, by speech detection method and automatic level control method based on the parameter all in voice enhancement algorithm, thus make speech enhan-cement, VAD, ALC three organically combines together, greatly reduce the calculated amount of whole system, not only can suppress common noise, also can suppress narrow band noise, unexpected very noisy, and have the ability suppressing nonstationary noise.
Refer to Fig. 4, the present invention provides a kind of speech enhan-cement module 100 in addition, it comprises the voice-input unit 10 connected successively, Audio Processing Unit 20 and voice-output unit 30, voice input described Audio Processing Unit 20 from described voice-input unit 10, every frame is inputted phonetic modification to frequency domain by described Audio Processing Unit 20, and be divided into multiple subband, then the signal to noise ratio (S/N ratio) of each described subband is estimated, carry out the detection of voice and noise again, finally according to the size of the signal to noise ratio (S/N ratio) of each described subband, the gain of amendment subband, and export via described voice-output unit 30.
In the present embodiment, every frame is inputted phonetic modification to frequency domain by described Audio Processing Unit 20, and is divided into 16 subbands; Then the signal to noise ratio (S/N ratio) of each subband is estimated; The detection that mechanism carries out voice and noise estimated in recycling voice, realizes the accurate estimation of ground unrest; Finally according to the size of the signal to noise ratio (S/N ratio) of each described subband, the gain of amendment subband, thus realize the object of squelch.
Further, see also Fig. 3, the present invention, by computing in subband after Fourier transform, then obtains time domain output signal through inverse Fourier transform.Particularly, for making full use of the resource of voice enhancement algorithm, described Audio Processing Unit 20 adopts the update_flag in voice enhancement algorithm as the mark of VAD.Advanced row speech enhan-cement, then carries out VAD, thus makes VAD better effects if herein.Again according to the result of VAD and the feature of voice signal, effective auto level control can be carried out.
Described speech enhan-cement module 100 adopts the voice enhancement algorithm of the improvement based on spectrum subtraction and subband combine with technique, by speech detection method and automatic level control method based on the parameter all in voice enhancement algorithm, thus make speech enhan-cement, VAD, ALC three organically combines together, greatly reduce the calculated amount of whole system, not only can suppress common noise, also can suppress narrow band noise, unexpected very noisy, and have the ability suppressing nonstationary noise.
The present invention also provides a kind of communication terminal, and it comprises above-mentioned speech enhan-cement module 100.Utilize the communication terminal of described speech enhan-cement module 100 can obtain preferably speech communication effect.
Compared to prior art, sound enhancement method provided by the invention and module tool have the following advantages: one, noise suppression effect are remarkable, can stress release treatment; Two, in low signal-to-noise ratio situation, system performance declines less; Three, not only can suppress common noise, also can suppress narrow band noise, unexpected very noisy, and have the ability suppressing nonstationary noise; Four, voice enhancement algorithm, VAD, ALC three organically combine together, greatly reduce the calculated amount of whole system, and the applicability of environment is higher.
The above embodiment only have expressed several embodiment of the present invention, and it describes comparatively concrete and detailed, but therefore can not be interpreted as the restriction to the scope of the claims of the present invention.It should be pointed out that for the person of ordinary skill of the art, without departing from the inventive concept of the premise, can also make some distortion and improvement, these all belong to protection scope of the present invention.Therefore, the protection domain of patent of the present invention should be as the criterion with claims.
Claims (8)
1. a sound enhancement method, is characterized in that, comprises the steps:
S101, by every frame input phonetic modification to frequency domain, and be divided into multiple subband;
S103, estimate the signal to noise ratio (S/N ratio) of each described subband;
S105, carry out the detection of voice and noise;
S107, size according to the signal to noise ratio (S/N ratio) of each described subband, the gain of amendment subband.
2. sound enhancement method as claimed in claim 1, is characterized in that, in step S101, by every frame input phonetic modification to frequency domain, and is divided into 16 subbands.
3. sound enhancement method as claimed in claim 1, is characterized in that, in step S105, and utilize voice to estimate detection that mechanism carries out voice and noise.
4. sound enhancement method as claimed in claim 1, is characterized in that, adopts update_flag as the mark of speech detection, advanced row speech enhan-cement, then enter speech detection, then according to the result of speech detection and the feature of voice signal, carry out auto level control.
5. a speech enhan-cement module, it is characterized in that, comprise the voice-input unit, Audio Processing Unit and the voice-output unit that connect successively, voice input described Audio Processing Unit from described voice-input unit, every frame is inputted phonetic modification to frequency domain by described Audio Processing Unit, and be divided into multiple subband, then the signal to noise ratio (S/N ratio) of each described subband is estimated, carry out the detection of voice and noise again, finally according to the size of the signal to noise ratio (S/N ratio) of each described subband, the gain of amendment subband, and export via described voice-output unit.
6. speech enhan-cement module as claimed in claim 5, is characterized in that, every frame is inputted phonetic modification to frequency domain by described Audio Processing Unit, and is divided into 16 subbands.
7. speech enhan-cement module as claimed in claim 5, is characterized in that, utilize voice the to estimate detection that mechanism carries out voice and noise of described Audio Processing Unit.
8. a communication terminal, is characterized in that, described communication terminal comprises the speech enhan-cement module described in any one of claim 5 ~ 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510164111.0A CN104867498A (en) | 2014-12-26 | 2015-04-09 | Mobile communication terminal and voice enhancement method and module thereof |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410833773 | 2014-12-26 | ||
CN2014108337738 | 2014-12-26 | ||
CN201510164111.0A CN104867498A (en) | 2014-12-26 | 2015-04-09 | Mobile communication terminal and voice enhancement method and module thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104867498A true CN104867498A (en) | 2015-08-26 |
Family
ID=53913290
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510164111.0A Pending CN104867498A (en) | 2014-12-26 | 2015-04-09 | Mobile communication terminal and voice enhancement method and module thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104867498A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108806712A (en) * | 2018-04-27 | 2018-11-13 | 深圳市沃特沃德股份有限公司 | Reduce the method and apparatus of frequency domain treating capacity |
CN111415685A (en) * | 2020-03-26 | 2020-07-14 | 腾讯科技(深圳)有限公司 | Audio signal detection method, device, equipment and computer readable storage medium |
CN112349277A (en) * | 2020-09-28 | 2021-02-09 | 紫光展锐(重庆)科技有限公司 | Feature domain voice enhancement method combined with AI model and related product |
CN114023352A (en) * | 2021-11-12 | 2022-02-08 | 华南理工大学 | Voice enhancement method and device based on energy spectrum depth modulation |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1684143A (en) * | 2004-04-14 | 2005-10-19 | 华为技术有限公司 | Method for strengthening sound |
CN101415045A (en) * | 2007-10-17 | 2009-04-22 | 北京三星通信技术研究有限公司 | Method and apparatus for implementing intelligent automatic level control in communication network |
CN101582264A (en) * | 2009-06-12 | 2009-11-18 | 瑞声声学科技(深圳)有限公司 | Method and voice collecting system for speech enhancement |
CN101599274A (en) * | 2009-06-26 | 2009-12-09 | 瑞声声学科技(深圳)有限公司 | The method that voice strengthen |
CN101625870A (en) * | 2009-08-06 | 2010-01-13 | 杭州华三通信技术有限公司 | Automatic noise suppression (ANS) method, ANS device, method for improving audio quality of monitoring system and monitoring system |
CN101976566A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Voice enhancement method and device using same |
US20140148111A1 (en) * | 2012-11-26 | 2014-05-29 | Hon Hai Precision Industry Co., Ltd. | Electronic device and method for avoiding rf interference |
CN103871421A (en) * | 2014-03-21 | 2014-06-18 | 厦门莱亚特医疗器械有限公司 | Self-adaptive denoising method and system based on sub-band noise analysis |
-
2015
- 2015-04-09 CN CN201510164111.0A patent/CN104867498A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1684143A (en) * | 2004-04-14 | 2005-10-19 | 华为技术有限公司 | Method for strengthening sound |
CN101415045A (en) * | 2007-10-17 | 2009-04-22 | 北京三星通信技术研究有限公司 | Method and apparatus for implementing intelligent automatic level control in communication network |
CN101582264A (en) * | 2009-06-12 | 2009-11-18 | 瑞声声学科技(深圳)有限公司 | Method and voice collecting system for speech enhancement |
CN101599274A (en) * | 2009-06-26 | 2009-12-09 | 瑞声声学科技(深圳)有限公司 | The method that voice strengthen |
CN101625870A (en) * | 2009-08-06 | 2010-01-13 | 杭州华三通信技术有限公司 | Automatic noise suppression (ANS) method, ANS device, method for improving audio quality of monitoring system and monitoring system |
CN101976566A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Voice enhancement method and device using same |
US20140148111A1 (en) * | 2012-11-26 | 2014-05-29 | Hon Hai Precision Industry Co., Ltd. | Electronic device and method for avoiding rf interference |
CN103871421A (en) * | 2014-03-21 | 2014-06-18 | 厦门莱亚特医疗器械有限公司 | Self-adaptive denoising method and system based on sub-band noise analysis |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108806712A (en) * | 2018-04-27 | 2018-11-13 | 深圳市沃特沃德股份有限公司 | Reduce the method and apparatus of frequency domain treating capacity |
CN108806712B (en) * | 2018-04-27 | 2020-08-18 | 深圳市沃特沃德股份有限公司 | Method and apparatus for reducing frequency domain processing |
CN111415685A (en) * | 2020-03-26 | 2020-07-14 | 腾讯科技(深圳)有限公司 | Audio signal detection method, device, equipment and computer readable storage medium |
CN112349277A (en) * | 2020-09-28 | 2021-02-09 | 紫光展锐(重庆)科技有限公司 | Feature domain voice enhancement method combined with AI model and related product |
CN112349277B (en) * | 2020-09-28 | 2023-07-04 | 紫光展锐(重庆)科技有限公司 | Feature domain voice enhancement method combined with AI model and related product |
CN114023352A (en) * | 2021-11-12 | 2022-02-08 | 华南理工大学 | Voice enhancement method and device based on energy spectrum depth modulation |
CN114023352B (en) * | 2021-11-12 | 2022-12-16 | 华南理工大学 | Voice enhancement method and device based on energy spectrum depth modulation |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10482896B2 (en) | Multi-band noise reduction system and methodology for digital audio signals | |
CN111418010B (en) | Multi-microphone noise reduction method and device and terminal equipment | |
US9438992B2 (en) | Multi-microphone robust noise suppression | |
CN109643554A (en) | Adaptive voice Enhancement Method and electronic equipment | |
CN103871421A (en) | Self-adaptive denoising method and system based on sub-band noise analysis | |
CN110634500B (en) | Method for calculating prior signal-to-noise ratio, electronic device and storage medium | |
CN105280193B (en) | Priori signal-to-noise ratio estimation method based on MMSE error criterion | |
KR20120114327A (en) | Adaptive noise reduction using level cues | |
US20180308503A1 (en) | Real-time single-channel speech enhancement in noisy and time-varying environments | |
CN104505099A (en) | Method and equipment for removing known interference in voice signal | |
CN104867498A (en) | Mobile communication terminal and voice enhancement method and module thereof | |
CN113539285B (en) | Audio signal noise reduction method, electronic device and storage medium | |
CN104867499A (en) | Frequency-band-divided wiener filtering and de-noising method used for hearing aid and system thereof | |
CN103813251A (en) | Hearing-aid denoising device and method allowable for adjusting denoising degree | |
CN101587712A (en) | A kind of directional speech enhancement method based on minitype microphone array | |
DE102011004338B3 (en) | Method and device for estimating a noise | |
US10149047B2 (en) | Multi-aural MMSE analysis techniques for clarifying audio signals | |
Sadiq et al. | Spectral subtraction for speech enhancement in modulation domain | |
US20190348060A1 (en) | Apparatus and method for enhancing a wanted component in a signal | |
Lu et al. | Reduction of musical residual noise using hybrid median filter | |
Rao et al. | Speech enhancement using perceptual Wiener filter combined with unvoiced speech—A new Scheme | |
Unoki et al. | Unified denoising and dereverberation method used in restoration of MTF-based power envelope | |
KR101958006B1 (en) | Apparatus and method for speech enhancement, and recording medium thereof | |
KR20130112287A (en) | Apparatus and method for adaptive noise processing | |
Nguyen et al. | An MC-SPP approach for noise reduction in dual microphone case with power level difference |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
EXSB | Decision made by sipo to initiate substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20150826 |
|
RJ01 | Rejection of invention patent application after publication |