CN106531175A - Network telephone soft noise generation method - Google Patents
Network telephone soft noise generation method
- Publication number
- CN106531175A (application CN201610996520.1A)
- Authority
- CN
- China
- Prior art keywords
- noise
- voice
- decoding
- speech
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/012—Comfort noise or silence coding
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The invention discloses a method for generating comfort noise in a network telephone. Without changing the standard protocol, a random adaptive codebook and a random fixed codebook are added to white noise. The method detects whether the payload packet signal source is active speech or inactive speech, builds a speech signal model through noise decoding and linear predictive coding calculation, and generates comfort noise through a linear prediction filter. The resulting comfort noise better reflects the background noise of the actual environment and sounds continuous and stable to the listener.
Description
Technical field
The present invention relates to a digital audio signal processing system in a network telephone, and in particular to a method for generating comfort noise in a network telephone. The invention belongs to the technical fields of embedded computer systems, network communication, and media information processing.
Background art
Appendix II of the speech codec standard G.711 (ITU-T G.711), issued by the Telecommunication Standardization Sector of the International Telecommunication Union (ITU-T), defines two methods of generating comfort noise. G.711 is a speech compression standard formulated by ITU-T that specifies logarithmic PCM (logarithmic pulse-code modulation) sampling. It is mainly used in telephony: audio is sampled with pulse-code modulation at 8,000 samples per second, and the speech signal is carried over a 64 kbps uncompressed channel. The compression ratio is 1:2, that is, 16-bit data is compressed to 8 bits. G.711 is the mainstream waveform speech codec. The two methods defined under the G.711 standard are described in turn below:
As shown in Fig. 1, a schematic diagram of the conventional noise generation method that carries both noise gain and spectral parameters, the first method sends a signal whose payload length is 11 bytes. The first byte is the gain parameter of the comfort noise, and the following 10 bytes are the spectral parameters of the noise. At the receiving end, it is enough to decode the noise gain and the spectral parameters (the linear prediction coefficients) from the payload packet and to use random white noise as the excitation source in order to obtain comfort noise.
As shown in Fig. 2, a schematic diagram of the conventional noise generation method that carries only the noise gain, the second method sends a signal whose payload length is only 1 byte. The payload packet contains only the gain parameter of the comfort noise and none of the spectral parameters used in the first method.
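The two payload layouts described above can be sketched as a small parser. This is a minimal illustration only: the names `ComfortNoisePayload` and `parse_comfort_noise_payload` are hypothetical, and the bytes are kept raw because the text does not specify how the gain or spectral bytes are quantized.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ComfortNoisePayload:
    gain_byte: int                   # first byte: comfort-noise gain parameter (raw)
    spectral_bytes: Optional[bytes]  # 10 raw spectral-parameter bytes, or None


def parse_comfort_noise_payload(payload: bytes) -> ComfortNoisePayload:
    """Split a comfort-noise payload into its gain byte and spectral bytes."""
    if len(payload) == 11:
        # First method: 1 gain byte followed by 10 spectral-parameter bytes.
        return ComfortNoisePayload(payload[0], bytes(payload[1:]))
    if len(payload) == 1:
        # Second method: gain byte only, no spectral information.
        return ComfortNoisePayload(payload[0], None)
    raise ValueError(f"unexpected comfort-noise payload length: {len(payload)}")
```

With the second layout the receiver has no spectral information at all, which is the limitation the following paragraphs discuss.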
To reduce the size of the payload packet, comfort noise in current practice is generated with the second method. Because comfort noise generated in the 1-byte mode has no spectral parameters, the "comfort" noise this method produces is not actually comfortable.
In addition, both methods use white noise as the excitation source for the comfort noise. Comfort noise generated from a white noise excitation also sounds poor and is perceived as discontinuous.
Therefore, a new method for generating comfort noise in a network telephone is proposed, which solves the problem of the stability of the effective (RMS) value of the noise.
Summary of the invention
To overcome the above drawbacks of the prior art, the purpose of this technical solution is, without changing the standard protocol, to add a random adaptive codebook and a random fixed codebook to white noise, to detect whether the payload packet signal source is active speech or inactive speech, to build a speech signal model through noise decoding and linear predictive coding calculation, and then to generate a more comfortable noise through a linear prediction filter. This noise better reflects the background noise of the actual environment and sounds stable to the listener.
The purpose of the invention is achieved through the following technical solution:
A method for generating comfort noise in a network telephone, characterized in that: during speech recognition processing in the network telephone, it is detected whether the payload packet signal source is active speech or inactive speech; the noise of inactive speech is noise-decoded so that the output noise carries varying spectral characteristics, and then enters a linear prediction filter; an excitation signal source is provided to which a random adaptive codebook and a random fixed codebook are added, and the output noise with varying spectral characteristics and the excitation signal pass through the linear prediction filter; meanwhile, the speech signal obtained by speech-decoding the active speech serves as the speech output of the vocoder, or as the speech input to the linear predictive coding calculation, after which the linear prediction filter outputs the comfort noise signal.
The invention has the following distinctive advantages and beneficial effects:
The method for generating comfort noise in a network telephone proposed by this technical solution, on the premise of not changing the size of the payload packet, uses a signal that better matches the generation model of speech signals as the excitation source, so that the output comfort noise is more comfortable and the user experience of voice calls is improved.
Brief description of the drawings
Fig. 1 is a schematic diagram of the conventional noise generation method with noise gain and spectral parameters;
Fig. 2 is a schematic diagram of the conventional noise generation method with noise gain only;
Fig. 3 is an architecture diagram of the method for generating comfort noise in a network telephone according to the invention;
Fig. 4 is the formula for calculating the noise gain in the method for generating comfort noise in a network telephone according to the invention.
Detailed description of the embodiments
The technical solution of the invention is described in detail below with reference to the accompanying drawings:
As shown in Fig. 3, a method for generating comfort noise in a network telephone is characterized in that: during speech recognition processing in the network telephone, it is detected whether the payload packet signal source is active speech or inactive speech; the noise of inactive speech is noise-decoded so that the output noise carries varying spectral characteristics, and then enters a linear prediction filter; an excitation signal source is provided to which a random adaptive codebook and a random fixed codebook are added, and the output noise with varying spectral characteristics and the excitation signal pass through the linear prediction filter; meanwhile, the speech signal obtained by speech-decoding the active speech serves as the speech output of the vocoder, or as the speech input to the linear predictive coding calculation, after which the linear prediction filter outputs the comfort noise signal.
Further, during speech recognition processing in the network telephone, the payload speech signal is received, and whether the payload packet signal source is an active speech signal is determined from the length of the payload packet. The payload packet of noise, that is, of inactive speech, is set to a length of 1 byte or 0 bytes. When it is 1 byte, the noise gain is obtained by decoding. When it is 0 bytes, the noise gain of the current frame is unchanged and the noise gain of the previous frame is used instead. The payload packet of active speech is 160 bytes long. If the frame is an active speech frame, processing proceeds to speech decoding; otherwise it proceeds to noise decoding.
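The length-based detection described above can be sketched as follows. The names `FrameKind` and `classify_payload` are hypothetical, the gain is carried as a raw byte because its decoding formula is given only in Fig. 4, and rejecting any other length with an error is an assumption of this sketch rather than something the text states.

```python
from enum import Enum
from typing import Optional, Tuple

SPEECH_PAYLOAD_LEN = 160  # active speech payload length given in the text


class FrameKind(Enum):
    SPEECH = "speech"
    NOISE = "noise"


def classify_payload(
    payload: bytes, previous_gain_byte: Optional[int]
) -> Tuple[FrameKind, Optional[int]]:
    """Return the frame kind and the gain byte to use for noise generation."""
    n = len(payload)
    if n == SPEECH_PAYLOAD_LEN:
        # Active speech: goes to speech decoding; the stored gain is untouched.
        return FrameKind.SPEECH, previous_gain_byte
    if n == 1:
        # Inactive speech with a fresh gain parameter.
        return FrameKind.NOISE, payload[0]
    if n == 0:
        # Inactive speech, gain unchanged: reuse the previous frame's gain.
        return FrameKind.NOISE, previous_gain_byte
    # Assumption: other lengths are treated as malformed in this sketch.
    raise ValueError(f"unexpected payload length: {n}")
```

A receiver loop would call this once per frame and feed the returned gain byte back in as `previous_gain_byte` for the next frame.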
Further, the active speech payload packet proceeds to speech decoding, which uses the speech compression scheme of the G.711 standard. The speech decoding process is a conversion from μ-law or A-law to linear pulse-code modulation. The resulting group of speech samples serves, on the one hand, as the output of the vocoder: once decoding of the current frame is finished, decoding moves on to the next frame. On the other hand, it serves as the speech input to the linear predictive coder calculation.
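As an illustration of the conversion to linear PCM, here is a minimal sketch of μ-law expansion for one frame. It assumes μ-law rather than A-law and uses the commonly seen expansion with a bias of 0x84; the function names are hypothetical and nothing here is taken from the patent itself.

```python
from typing import List

ULAW_BIAS = 0x84  # bias used by the common mu-law expansion


def ulaw_to_linear(code: int) -> int:
    """Expand one 8-bit mu-law code word to a signed linear PCM sample."""
    u = ~code & 0xFF              # mu-law code words are stored bit-inverted
    sign = u & 0x80               # top bit: sign
    exponent = (u >> 4) & 0x07    # next three bits: segment number
    mantissa = u & 0x0F           # low four bits: step within the segment
    magnitude = (((mantissa << 3) + ULAW_BIAS) << exponent) - ULAW_BIAS
    return -magnitude if sign else magnitude


def decode_ulaw_frame(payload: bytes) -> List[int]:
    """Decode a whole payload (for example 160 bytes) to linear samples."""
    return [ulaw_to_linear(b) for b in payload]
```

The decoded samples can then be played out directly and also buffered as input for the linear prediction analysis.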
Further, the noise of inactive speech is noise-decoded: the noise energy in the payload packet is decoded to obtain the noise gain G, following Appendix II of the speech codec standard ITU-T G.711. The gain is calculated by the formula shown in Fig. 4, where E is the noise energy obtained by decoding the payload.
Next, the speech signal obtained by speech-decoding the active speech serves as the speech input to the linear predictive coder calculation, and the linear predictive coding coefficients are determined using code-excited linear prediction (CELP) coding.
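The text does not say how the coefficients are computed inside the CELP analysis. One common approach, shown here purely as an assumed sketch, is the autocorrelation method with the Levinson-Durbin recursion. The function names, the sign convention A(z) = 1 + a1·z⁻¹ + … + ap·z⁻ᵖ, and any particular prediction order are choices of this illustration.

```python
from typing import List, Sequence, Tuple


def autocorrelation(x: Sequence[float], order: int) -> List[float]:
    """Autocorrelation r[0..order] of one frame of samples."""
    return [
        sum(x[n] * x[n - k] for n in range(k, len(x)))
        for k in range(order + 1)
    ]


def levinson_durbin(r: Sequence[float], order: int) -> Tuple[List[float], float]:
    """Solve for LPC coefficients a[0..order] (a[0] == 1) and residual energy."""
    a = [1.0] + [0.0] * order
    err = float(r[0])
    if err <= 0.0:
        return a, 0.0  # silent frame: no prediction possible
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        updated = a[:]
        for j in range(1, i):
            updated[j] = a[j] + k * a[i - j]
        updated[i] = k
        a = updated
        err *= 1.0 - k * k                  # prediction error shrinks each step
        if err <= 0.0:
            break
    return a, err
```

For a decoded frame `x`, `levinson_durbin(autocorrelation(x, 10), 10)` would give a 10th-order predictor; the order 10 is only an example value.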
Then, an excitation signal source is set up in which a random adaptive codebook and a random fixed codebook are added to white noise, using the calculation method of section B.4.4 of Annex B of the ITU-T speech compression standard G.729. The length is determined by the frame length of the speech codec standard G.711, giving a sequence e[80] as the output speech signal.
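To make the idea of this excitation concrete, the sketch below mixes Gaussian white noise with a randomly lagged copy of past excitation (standing in for the random adaptive codebook) and a few randomly placed signed pulses (standing in for the random fixed codebook). It is not the procedure of G.729 Annex B section B.4.4: the lag range, pulse count, gains, and unit-RMS normalization are all assumptions chosen for illustration, and `make_excitation` is a hypothetical name.

```python
import math
import random
from typing import List, Sequence

FRAME_LEN = 80               # length of the e[80] sequence in the text
MIN_LAG, MAX_LAG = 40, 103   # assumed range for the random pitch lag


def make_excitation(
    past: Sequence[float],
    rng: random.Random,
    n_pulses: int = 4,               # assumed number of fixed-codebook pulses
    max_adaptive_gain: float = 0.5,  # assumed upper bound on adaptive gain
    fixed_gain: float = 1.0,         # assumed fixed-codebook gain
) -> List[float]:
    """Build one frame of excitation, normalized to unit RMS."""
    lag = rng.randint(MIN_LAG, MAX_LAG)
    adaptive_gain = rng.uniform(0.0, max_adaptive_gain)

    # History buffer, zero-padded so a sample MAX_LAG back always exists.
    buf = list(past[-MAX_LAG:])
    buf = [0.0] * (MAX_LAG - len(buf)) + buf

    # Random fixed codebook: a few unit pulses with random position and sign.
    fixed = [0.0] * FRAME_LEN
    for pos in rng.sample(range(FRAME_LEN), n_pulses):
        fixed[pos] = rng.choice((-1.0, 1.0))

    frame = []
    for n in range(FRAME_LEN):
        adaptive = buf[len(buf) - lag]   # excitation sample `lag` steps back
        sample = rng.gauss(0.0, 1.0) + adaptive_gain * adaptive + fixed_gain * fixed[n]
        frame.append(sample)
        buf.append(sample)

    rms = math.sqrt(sum(s * s for s in frame) / FRAME_LEN)
    return [s / rms for s in frame] if rms > 0.0 else frame
```

Normalizing to unit RMS lets a separate noise gain set the output level; passing the previous frame as `past` keeps the adaptive contribution continuous across frames.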
Finally, with the calculated e[80] as the excitation source, the noise gain G obtained from noise decoding, and the speech signal calculated by linear predictive coding, filtering through the linear prediction filter yields the comfort noise output signal.
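The final filtering step can be sketched as an all-pole synthesis filter driven by the scaled excitation. This assumes the convention A(z) = 1 + a1·z⁻¹ + … + ap·z⁻ᵖ with the filter 1/A(z), so that y[n] = G·e[n] − Σ a_k·y[n−k]; the function name and the way filter state is carried between frames are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple


def synthesize_comfort_noise(
    excitation: Sequence[float],
    lpc: Sequence[float],
    gain: float,
    state: Optional[Sequence[float]] = None,
) -> Tuple[List[float], List[float]]:
    """Filter gain-scaled excitation through 1/A(z).

    lpc is [1, a1, ..., ap]; state holds the last p outputs, newest first.
    Returns the output frame and the updated state for the next frame.
    """
    order = len(lpc) - 1
    history = list(state) if state is not None else [0.0] * order
    out = []
    for e in excitation:
        y = gain * e - sum(lpc[k] * history[k - 1] for k in range(1, order + 1))
        out.append(y)
        history.insert(0, y)   # newest output goes to the front
        history.pop()          # drop the oldest so the length stays at `order`
    return out, history
```

Calling it frame by frame and passing the returned state back in avoids discontinuities at frame boundaries.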
In summary, the method for generating comfort noise in a network telephone proposed by the invention adds a random adaptive codebook and a random fixed codebook to white noise and builds a speech signal model, so that the output comfort noise is more comfortable.
The above technical solution is only a specific application example of the invention. In practical applications, alternative devices may be selected as appropriate to the circumstances, but this does not limit the scope of protection of the invention in any way.
Claims (7)
1. A method for generating comfort noise in a network telephone, characterized in that: during speech recognition processing in the network telephone, it is detected whether the payload packet signal source is active speech or inactive speech; the noise of inactive speech is noise-decoded so that the output noise carries varying spectral characteristics, and then enters a linear prediction filter; an excitation signal source is provided to which a random adaptive codebook and a random fixed codebook are added, and the output noise with varying spectral characteristics and the excitation signal pass through the linear prediction filter; meanwhile, the speech signal obtained by speech-decoding the active speech serves as the speech output of the vocoder, or as the speech input to the linear predictive coding calculation, after which the linear prediction filter outputs the comfort noise signal.
2. The method for generating comfort noise in a network telephone according to claim 1, characterized in that: during speech recognition processing in the network telephone, the payload speech signal is received, and whether the payload packet signal source is an active speech signal is determined from the length of the payload packet; the payload packet of noise, that is, of inactive speech, is set to a length of 1 byte or 0 bytes; when it is 1 byte, the noise gain is obtained by decoding; when it is 0 bytes, the noise gain of the current frame is unchanged and the noise gain of the previous frame is used instead; the payload packet of active speech is 160 bytes long; if the frame is an active speech frame, processing proceeds to speech decoding, and otherwise it proceeds to noise decoding.
3. The method for generating comfort noise in a network telephone according to claim 1, characterized in that: the active speech payload packet proceeds to speech decoding; the speech decoding process is a conversion from μ-law or A-law to linear pulse-code modulation; the resulting group of speech samples serves, on the one hand, as the output of the vocoder, and once decoding of the current frame is finished, decoding moves on to the next frame; on the other hand, it serves as the speech input to the linear predictive coding calculation.
4. The method for generating comfort noise in a network telephone according to claim 1, characterized in that: the noise of inactive speech is noise-decoded, that is, the noise energy in the payload packet is decoded to obtain the noise gain G, following the protocol of the speech codec technical standard.
5. The method for generating comfort noise in a network telephone according to claim 1, characterized in that: the speech signal obtained by speech-decoding the active speech serves as the speech input to the linear predictive coder calculation, and the linear predictive coding coefficients are determined using code-excited linear prediction (CELP) coding.
6. The method for generating comfort noise in a network telephone according to claim 1, characterized in that: an excitation signal source is set up in which a random adaptive codebook and a random fixed codebook are added to white noise, using the calculation method of section B.4.4 of Annex B of the ITU-T speech compression standard G.729; the length is determined by the frame length of the speech codec standard G.711, giving a sequence e[80] as the output speech signal.
7. The method for generating comfort noise in a network telephone according to claim 1, characterized in that: with the calculated e[80] as the excitation source, the noise gain G obtained from noise decoding, and the speech signal calculated by linear predictive coding, filtering through the linear prediction filter yields the comfort noise output signal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610996520.1A CN106531175B (en) | 2016-11-13 | 2016-11-13 | A kind of method that network phone comfort noise generates |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610996520.1A CN106531175B (en) | 2016-11-13 | 2016-11-13 | A kind of method that network phone comfort noise generates |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106531175A true CN106531175A (en) | 2017-03-22 |
CN106531175B CN106531175B (en) | 2019-09-03 |
Family
ID=58351327
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610996520.1A Active CN106531175B (en) | 2016-11-13 | 2016-11-13 | A kind of method that network phone comfort noise generates |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106531175B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108694938A (en) * | 2017-03-31 | 2018-10-23 | 英特尔公司 | System and method for carrying out energy efficient and the identification of low-power distributed automatic speech on wearable device |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050203733A1 (en) * | 2004-03-15 | 2005-09-15 | Ramkummar Permachanahalli S. | Method of comfort noise generation for speech communication |
CN101339767A (en) * | 2008-03-21 | 2009-01-07 | 华为技术有限公司 | Background noise excitation signal generating method and apparatus |
CN101632119A (en) * | 2007-03-05 | 2010-01-20 | 艾利森电话股份有限公司 | Method and arrangement for smoothing of stationary background noise |
CN102930872A (en) * | 2012-11-05 | 2013-02-13 | 深圳广晟信源技术有限公司 | Method and device for postprocessing pitch enhancement in broadband speech decoding |
CN103137133A (en) * | 2011-11-29 | 2013-06-05 | 中兴通讯股份有限公司 | In-activated sound signal parameter estimating method, comfortable noise producing method and system |
CN103680509A (en) * | 2013-12-16 | 2014-03-26 | 重庆邮电大学 | Method for discontinuous transmission of voice signals and generation of background noise |
CN104978970A (en) * | 2014-04-08 | 2015-10-14 | 华为技术有限公司 | Noise signal processing and generation method, encoder/decoder and encoding/decoding system |
Also Published As
Publication number | Publication date |
---|---|
CN106531175B (en) | 2019-09-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102985969B (en) | Coding device, decoding device, and methods thereof | |
US20100010812A1 (en) | Speech codecs | |
CN1815558B (en) | Low bit-rate coding of unvoiced segments of speech | |
CN103187065B (en) | The disposal route of voice data, device and system | |
US8190440B2 (en) | Sub-band codec with native voice activity detection | |
ZA200606713B (en) | Classification of audio signals | |
CN103137133B (en) | Inactive sound modulated parameter estimating method and comfort noise production method and system | |
WO2015154397A1 (en) | Noise signal processing and generation method, encoder/decoder and encoding/decoding system | |
CN108231083A (en) | A kind of speech coder code efficiency based on SILK improves method | |
US20050143984A1 (en) | Multirate speech codecs | |
EP2202726B1 (en) | Method and apparatus for judging dtx | |
CN101246688B (en) | Method, system and device for coding and decoding ambient noise signal | |
KR100847391B1 (en) | Method of comfort noise generation for speech communication | |
JPH0850500A (en) | Voice encoder and voice decoder as well as voice coding method and voice encoding method | |
CN102985968A (en) | Method and device for processing audio signal | |
CN101783142B (en) | Transcoding method, device and communication equipment | |
CN103680509B (en) | A kind of voice signal discontinuous transmission and ground unrest generation method | |
US8949121B2 (en) | Method and means for encoding background noise information | |
CN1244090C (en) | Speech coding with background noise reproduction | |
CN105009208A (en) | Methods and apparatuses for dtx hangover in audio coding | |
CN106531175A (en) | Network telephone soft noise generation method | |
CN101170590B (en) | A method, system and device for transmitting encoding stream under background noise | |
US7584096B2 (en) | Method and apparatus for encoding speech | |
JP4437011B2 (en) | Speech encoding device | |
CN112992166A (en) | Method, device and storage medium for dynamically adjusting LC3 audio coding rate |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||