CN106531175A - Network telephone soft noise generation method - Google Patents

Network telephone soft noise generation method Download PDF

Info

Publication number
CN106531175A
CN106531175A CN201610996520.1A CN201610996520A CN106531175A CN 106531175 A CN106531175 A CN 106531175A CN 201610996520 A CN201610996520 A CN 201610996520A CN 106531175 A CN106531175 A CN 106531175A
Authority
CN
China
Prior art keywords
noise
voice
decoding
speech
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610996520.1A
Other languages
Chinese (zh)
Other versions
CN106531175B (en
Inventor
丁海忠
何延伟
叶成竞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Hanlong Technology Co Ltd
Original Assignee
Nanjing Hanlong Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Hanlong Technology Co Ltd filed Critical Nanjing Hanlong Technology Co Ltd
Priority to CN201610996520.1A priority Critical patent/CN106531175B/en
Publication of CN106531175A publication Critical patent/CN106531175A/en
Application granted granted Critical
Publication of CN106531175B publication Critical patent/CN106531175B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention discloses a network telephone soft noise generation method. In the condition of not changing a standard protocol, a random adaptive codebook and a random fixed codebook are added in a white noise, through detecting whether a load packet signal source is active speech or non-active speech, a speech signal model is generated through noise decoding and linear prediction encoding calculation, and a soft noise is generated through a linear prediction filter. The technical scheme of the invention has the advantages that the background noise of an actual environment can be reflected well, and the continuity and stability are obtained in auditory feeling.

Description

A kind of method that network phone comfort noise is produced
Technical field
A kind of a kind of the present invention relates to audio digital signals processing system in network telephone, more particularly to network telephone The method that middle comfort noise is produced, the invention belongs to embedded computer system, network service, media information treatment technology neck Domain.
Background technology
Audio coder & decoder (codec) technical standard ITU-T issued in International Telecommunication Union communication standardization tissue is G.711 In Appendix II annex 2 of agreement (ITU-T G.711), the production method of two kinds of comfort noises is defined, it is international electricity A set of voice compression that letter alliance ITU-T formulates out, represents logarithm PCM (logarithmic pulse-code Modulation) sampling standard;It is mainly used in phone, using pulse code modulation to audio sample, sample rate is that 8k is per second, profit With the uncompressed channel transfer speech sound signal of 64Kbps, compression ratio is 1:2, i.e., 16 data are compressed into 8, are G.711 marked Standard is the waveform sound codec of main flow, G.711 mainly has two kinds of compression algorithms under standard, is described below respectively:
As shown in figure 1, traditional noise generation method schematic diagram with noise gain and frequency spectrum parameter, first method is Send the voice signal that payload length is 11 bytes, the gain parameter of wherein the 1st byte for comfort noise, 10 words below Save the frequency spectrum parameter for noise, in receiving terminal, as long as from load bag in decoding obtain noise gain and frequency spectrum parameter it is linearly pre- Code coefficient is surveyed, by the use of random white noise as driving source, you can obtain comfort noise.
As shown in Fig. 2 traditional only noise generation method schematic diagram with noise gain, second method is to send load Length only has the voice signal of 1 byte, the only gain parameter comprising comfort noise in load bag, without carrying in first method The frequency spectrum parameter for arriving.
In order to reduce the capacity of load bag, so at present actual, adopt is all comfort noise that second method is produced, As the comfort noise produced under 1 byte mode does not have frequency spectrum parameter, so " soft " noise that actual this method is produced is simultaneously It is not soft.
In addition, in above-mentioned two ways, due to comfort noise driving source using be all white noise, using white Noise is also bad as the comfort noise effect produced by driving source, acoustically also has discontinuous sensation.
Therefore, in this case, it is proposed that a kind of method that comfort noise is produced in new network phone, solves to make an uproar The stability problem of sound effective value.
The content of the invention
In order to solve the drawbacks described above of prior art, the technical program purpose is, under the feelings for not changing standard agreement, Using random adaptive codebook and random fixed codebook is added in white noise, by detecting whether load bag signal source is living Property voice or nonactive voice, produce voice signal model after calculating through noise decoding and linear predictive coding, then pass through The milder noise of Linear Prediction filter producing ratio, can preferably reflect the background noise of actual environment, make acoustically to feel tool There is stability.
The purpose of the present invention is achieved through the following technical solutions:
A kind of method that network phone comfort noise is produced, it is characterised in that:In the voice recognition processing mistake of network phone Whether Cheng Zhong, detection load bag signal source are active speech or nonactive voice, and the noise of nonactive voice is through noise decoding Afterwards, output noise band is made to change spectral characteristic, then into linear prediction filter;The random adaptive code of one group of addition is set This exciting signal source with random fixed codebook, band change the output noise of spectral characteristic and exciting signal source through linear pre- Survey wave filter;Meanwhile, voice output of the voice signal that active speech is obtained after tone decoding as vocoder, Huo Zhezuo For the phonetic entry that linear predictive coding is calculated, then after linear prediction filter, comfort noise signal is exported.
The present invention has the advantages that unique and has the beneficial effect that:
The method that a kind of network phone comfort noise that the technical program is proposed is produced, before not changing the capacity of load bag Put, the signal by the use of the generation model for more meeting voice signal is used as driving source so that the comfort noise of output is more comfortable, Improve the Consumer's Experience of voice call.
Description of the drawings
Fig. 1 is traditional noise generation method schematic diagram with noise gain and frequency spectrum parameter;
Fig. 2 is traditional only noise generation method schematic diagram with noise gain;
Fig. 3 is the Organization Chart of the method that a kind of network phone comfort noise of the invention is produced;
Fig. 4 is the computing formula of the noise gain of the method that a kind of network phone comfort noise of the invention is produced.
Specific embodiment
With reference to Figure of description in detail technical solution of the present invention is described in detail:
As shown in figure 3, a kind of method that network phone comfort noise is produced, it is characterised in that:In the voice of network phone In identification processing procedure, whether detection load bag signal source is active speech or nonactive voice, the noise Jing of nonactive voice After crossing noise decoding, output noise band is made to change spectral characteristic, then into linear prediction filter;Arrange one group of addition with The exciting signal source of machine adaptive codebook and random fixed codebook, band change the output noise of spectral characteristic and exciting signal source Through linear prediction filter;Meanwhile, voice of the voice signal that active speech is obtained after tone decoding as vocoder Output, or as the phonetic entry that linear predictive coding is calculated, then after linear prediction filter, output comfort noise letter Number.
Further, during the voice recognition processing of network phone, receive load source speech signal, according to load bag Length detection judges to load whether bag signal source is active speech signal, and setting noise as the length of nonactive speech payloads bag is 1 byte is 0 byte, when for 1 byte when, decode and obtain noise gain, when for 0 byte when, then the noise gain of present frame It is constant, then replaced with the noise gain of previous frame, and the length of active speech load bag is 160 bytes, be such as active speech frame, Tone decoding is then proceeded to, otherwise turns noise decoding.
Further, described active speech load bag proceeds to tone decoding, using the compress speech side in G.711 standard Formula, speech decoding process is for the obtaining group voice signal from μ rates or A rates to the conversion of Linear Pulse Code Modulation, a side Output of the face as vocoder, current frame speech decoding terminate, then turn to do next frame decoding;On the other hand compile as linear prediction The phonetic entry that code device is calculated.
Further, the noise of described nonactive voice carries out noise decoding, is by the noise energy solution in load bag Code, obtains noise gain G, using audio coder & decoder (codec) technical standard ITU-T G.711 II protocol modes of Appendix, its meter Calculation mode isWherein E is the noise energy that load decoding is obtained.
And then, the voice signal that described active speech is obtained after tone decoding is used as Linear Predictive Coder meter The phonetic entry of calculation, determines linear forecast coding coefficient using CELP QCELP Qualcomms.
Then, one group of exciting signal source for adding random adaptive codebook and random fixed codebook in white noise is set, Using International Telecommunication Union's voice compression algorithm ITU-T computational methods that G.729 B.4.4 annex B agreements save, according to Audio coder & decoder (codec) technical standard G.711 in every frame length determining, obtain e [80] sequences as output voice signal.
Finally, the noise gain G that the e [80] for being obtained by the use of after calculating is obtained as driving source, noise decoding, by linear The calculated voice signal of predictive coding is filtered through linear prediction filter, obtains comfort noise output signal.
In sum, the method that a kind of network phone comfort noise proposed by the present invention is produced, is added in white noise Random adaptive codebook and random fixed codebook, produce voice signal model so that the comfort noise of output is softer.
Above-mentioned technical proposal is only the concrete application example of the present invention, can be drunk as the case may be in actual application Feelings select alternate device device, but the protection domain to inventing to be not limited in any way.

Claims (7)

1. a kind of method that network phone comfort noise is produced, it is characterised in that:In the voice recognition processing process of network phone In, whether detection load bag signal source is active speech or nonactive voice, the noise of nonactive voice after noise decoding, Output noise band is made to change spectral characteristic, then into linear prediction filter;The random adaptive codebook of one group of addition is set With the exciting signal source of random fixed codebook, band changes the output noise of spectral characteristic and exciting signal source through linear prediction Wave filter;Meanwhile, voice output of the voice signal that active speech is obtained after tone decoding as vocoder, or conduct The phonetic entry that linear predictive coding is calculated, then after linear prediction filter, export comfort noise signal.
2. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that:Network phone During voice recognition processing, receive load source speech signal, judge that load bag signal source is according to the length detection of load bag No it is 1 byte or for 0 byte to set noise as the length of nonactive speech payloads bag for active speech signal, when for 1 byte When, decoding obtain noise gain, when for 0 byte when, then the noise gain of present frame is constant, then with the noise gain generation of previous frame Replace, and the length of active speech load bag is 160 bytes, be such as active speech frame, then proceed to tone decoding, otherwise turn noise solution Code.
3. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that:Described activity Speech payloads bag proceeds to tone decoding, and speech decoding process is, from μ rates or A rates to the conversion of Linear Pulse Code Modulation, to obtain The one group of voice signal for arriving, on the one hand as the output of vocoder, current frame speech decoding terminates, then turns to do next frame decoding; On the other hand the phonetic entry for calculating as linear predictive coding.
4. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that:Described non-live Property voice noise carry out noise decoding, be will load bag in noise energy decoding, obtain noise gain G, using voice coder Decoder technique standard agreement mode.
5. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that:Described activity The phonetic entry that the voice signal that voice is obtained after tone decoding is calculated as Linear Predictive Coder, is swashed using CELP codes Encourage linear predictive coding to determine linear forecast coding coefficient.
6. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that:Arrange one group The exciting signal source of random adaptive codebook and random fixed codebook is added in white noise, using International Telecommunication Union's compress speech The canonical algorithm ITU-T computational methods that G.729 B.4.4 annex B agreements save, according to audio coder & decoder (codec) technical standard The length of the every frame in G.711 obtains e [80] sequences as output voice signal determining.
7. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that:After calculating The noise gain G that the e [80] for obtaining is obtained as driving source, noise decoding, by the calculated voice of linear predictive coding Signal is filtered through linear prediction filter, obtains comfort noise output signal.
CN201610996520.1A 2016-11-13 2016-11-13 A kind of method that network phone comfort noise generates Active CN106531175B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610996520.1A CN106531175B (en) 2016-11-13 2016-11-13 A kind of method that network phone comfort noise generates

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610996520.1A CN106531175B (en) 2016-11-13 2016-11-13 A kind of method that network phone comfort noise generates

Publications (2)

Publication Number Publication Date
CN106531175A true CN106531175A (en) 2017-03-22
CN106531175B CN106531175B (en) 2019-09-03

Family

ID=58351327

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610996520.1A Active CN106531175B (en) 2016-11-13 2016-11-13 A kind of method that network phone comfort noise generates

Country Status (1)

Country Link
CN (1) CN106531175B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108694938A (en) * 2017-03-31 2018-10-23 英特尔公司 System and method for carrying out energy efficient and the identification of low-power distributed automatic speech on wearable device

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050203733A1 (en) * 2004-03-15 2005-09-15 Ramkummar Permachanahalli S. Method of comfort noise generation for speech communication
CN101339767A (en) * 2008-03-21 2009-01-07 华为技术有限公司 Background noise excitation signal generating method and apparatus
CN101632119A (en) * 2007-03-05 2010-01-20 艾利森电话股份有限公司 Method and arrangement for smoothing of stationary background noise
CN102930872A (en) * 2012-11-05 2013-02-13 深圳广晟信源技术有限公司 Method and device for postprocessing pitch enhancement in broadband speech decoding
CN103137133A (en) * 2011-11-29 2013-06-05 中兴通讯股份有限公司 In-activated sound signal parameter estimating method, comfortable noise producing method and system
CN103680509A (en) * 2013-12-16 2014-03-26 重庆邮电大学 Method for discontinuous transmission of voice signals and generation of background noise
CN104978970A (en) * 2014-04-08 2015-10-14 华为技术有限公司 Noise signal processing and generation method, encoder/decoder and encoding/decoding system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050203733A1 (en) * 2004-03-15 2005-09-15 Ramkummar Permachanahalli S. Method of comfort noise generation for speech communication
CN101632119A (en) * 2007-03-05 2010-01-20 艾利森电话股份有限公司 Method and arrangement for smoothing of stationary background noise
CN101339767A (en) * 2008-03-21 2009-01-07 华为技术有限公司 Background noise excitation signal generating method and apparatus
CN103137133A (en) * 2011-11-29 2013-06-05 中兴通讯股份有限公司 In-activated sound signal parameter estimating method, comfortable noise producing method and system
CN102930872A (en) * 2012-11-05 2013-02-13 深圳广晟信源技术有限公司 Method and device for postprocessing pitch enhancement in broadband speech decoding
CN103680509A (en) * 2013-12-16 2014-03-26 重庆邮电大学 Method for discontinuous transmission of voice signals and generation of background noise
CN104978970A (en) * 2014-04-08 2015-10-14 华为技术有限公司 Noise signal processing and generation method, encoder/decoder and encoding/decoding system

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108694938A (en) * 2017-03-31 2018-10-23 英特尔公司 System and method for carrying out energy efficient and the identification of low-power distributed automatic speech on wearable device

Also Published As

Publication number Publication date
CN106531175B (en) 2019-09-03

Similar Documents

Publication Publication Date Title
CN102985969B (en) Coding device, decoding device, and methods thereof
US20100010812A1 (en) Speech codecs
CN1815558B (en) Low bit-rate coding of unvoiced segments of speech
CN103187065B (en) The disposal route of voice data, device and system
US8190440B2 (en) Sub-band codec with native voice activity detection
ZA200606713B (en) Classification of audio signals
CN103137133B (en) Inactive sound modulated parameter estimating method and comfort noise production method and system
WO2015154397A1 (en) Noise signal processing and generation method, encoder/decoder and encoding/decoding system
CN108231083A (en) A kind of speech coder code efficiency based on SILK improves method
US20050143984A1 (en) Multirate speech codecs
EP2202726B1 (en) Method and apparatus for judging dtx
CN101246688B (en) Method, system and device for coding and decoding ambient noise signal
KR100847391B1 (en) Method of comfort noise generation for speech communication
JPH0850500A (en) Voice encoder and voice decoder as well as voice coding method and voice encoding method
CN102985968A (en) Method and device for processing audio signal
CN101783142B (en) Transcoding method, device and communication equipment
CN103680509B (en) A kind of voice signal discontinuous transmission and ground unrest generation method
US8949121B2 (en) Method and means for encoding background noise information
CN1244090C (en) Speech coding with background noise reproduction
CN105009208A (en) Methods and apparatuses for dtx hangover in audio coding
CN106531175A (en) Network telephone soft noise generation method
CN101170590B (en) A method, system and device for transmitting encoding stream under background noise
US7584096B2 (en) Method and apparatus for encoding speech
JP4437011B2 (en) Speech encoding device
CN112992166A (en) Method, device and storage medium for dynamically adjusting LC3 audio coding rate

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant