CN106531175A

CN106531175A - Network telephone soft noise generation method

Info

Publication number: CN106531175A
Application number: CN201610996520.1A
Authority: CN
Inventors: 丁海忠; 何延伟; 叶成竞
Original assignee: Nanjing Hanlong Technology Co Ltd
Current assignee: Nanjing Hanlong Technology Co Ltd
Priority date: 2016-11-13
Filing date: 2016-11-13
Publication date: 2017-03-22
Anticipated expiration: 2036-11-13
Also published as: CN106531175B

Abstract

The invention discloses a network telephone soft noise generation method. In the condition of not changing a standard protocol, a random adaptive codebook and a random fixed codebook are added in a white noise, through detecting whether a load packet signal source is active speech or non-active speech, a speech signal model is generated through noise decoding and linear prediction encoding calculation, and a soft noise is generated through a linear prediction filter. The technical scheme of the invention has the advantages that the background noise of an actual environment can be reflected well, and the continuity and stability are obtained in auditory feeling.

Description

A kind of method that network phone comfort noise is produced

Technical field

A kind of a kind of the present invention relates to audio digital signals processing system in network telephone, more particularly to network telephone The method that middle comfort noise is produced, the invention belongs to embedded computer system, network service, media information treatment technology neck Domain.

Background technology

Audio coder ＆ decoder (codec) technical standard ITU-T issued in International Telecommunication Union communication standardization tissue is G.711 In Appendix II annex 2 of agreement (ITU-T G.711), the production method of two kinds of comfort noises is defined, it is international electricity A set of voice compression that letter alliance ITU-T formulates out, represents logarithm PCM (logarithmic pulse-code Modulation) sampling standard；It is mainly used in phone, using pulse code modulation to audio sample, sample rate is that 8k is per second, profit With the uncompressed channel transfer speech sound signal of 64Kbps, compression ratio is 1:2, i.e., 16 data are compressed into 8, are G.711 marked Standard is the waveform sound codec of main flow, G.711 mainly has two kinds of compression algorithms under standard, is described below respectively：

As shown in figure 1, traditional noise generation method schematic diagram with noise gain and frequency spectrum parameter, first method is Send the voice signal that payload length is 11 bytes, the gain parameter of wherein the 1st byte for comfort noise, 10 words below Save the frequency spectrum parameter for noise, in receiving terminal, as long as from load bag in decoding obtain noise gain and frequency spectrum parameter it is linearly pre- Code coefficient is surveyed, by the use of random white noise as driving source, you can obtain comfort noise.

As shown in Fig. 2 traditional only noise generation method schematic diagram with noise gain, second method is to send load Length only has the voice signal of 1 byte, the only gain parameter comprising comfort noise in load bag, without carrying in first method The frequency spectrum parameter for arriving.

In order to reduce the capacity of load bag, so at present actual, adopt is all comfort noise that second method is produced, As the comfort noise produced under 1 byte mode does not have frequency spectrum parameter, so " soft " noise that actual this method is produced is simultaneously It is not soft.

In addition, in above-mentioned two ways, due to comfort noise driving source using be all white noise, using white Noise is also bad as the comfort noise effect produced by driving source, acoustically also has discontinuous sensation.

Therefore, in this case, it is proposed that a kind of method that comfort noise is produced in new network phone, solves to make an uproar The stability problem of sound effective value.

The content of the invention

In order to solve the drawbacks described above of prior art, the technical program purpose is, under the feelings for not changing standard agreement, Using random adaptive codebook and random fixed codebook is added in white noise, by detecting whether load bag signal source is living Property voice or nonactive voice, produce voice signal model after calculating through noise decoding and linear predictive coding, then pass through The milder noise of Linear Prediction filter producing ratio, can preferably reflect the background noise of actual environment, make acoustically to feel tool There is stability.

The purpose of the present invention is achieved through the following technical solutions：

A kind of method that network phone comfort noise is produced, it is characterised in that：In the voice recognition processing mistake of network phone Whether Cheng Zhong, detection load bag signal source are active speech or nonactive voice, and the noise of nonactive voice is through noise decoding Afterwards, output noise band is made to change spectral characteristic, then into linear prediction filter；The random adaptive code of one group of addition is set This exciting signal source with random fixed codebook, band change the output noise of spectral characteristic and exciting signal source through linear pre- Survey wave filter；Meanwhile, voice output of the voice signal that active speech is obtained after tone decoding as vocoder, Huo Zhezuo For the phonetic entry that linear predictive coding is calculated, then after linear prediction filter, comfort noise signal is exported.

The present invention has the advantages that unique and has the beneficial effect that：

The method that a kind of network phone comfort noise that the technical program is proposed is produced, before not changing the capacity of load bag Put, the signal by the use of the generation model for more meeting voice signal is used as driving source so that the comfort noise of output is more comfortable, Improve the Consumer's Experience of voice call.

Description of the drawings

Fig. 1 is traditional noise generation method schematic diagram with noise gain and frequency spectrum parameter；

Fig. 2 is traditional only noise generation method schematic diagram with noise gain；

Fig. 3 is the Organization Chart of the method that a kind of network phone comfort noise of the invention is produced；

Fig. 4 is the computing formula of the noise gain of the method that a kind of network phone comfort noise of the invention is produced.

Specific embodiment

With reference to Figure of description in detail technical solution of the present invention is described in detail：

As shown in figure 3, a kind of method that network phone comfort noise is produced, it is characterised in that：In the voice of network phone In identification processing procedure, whether detection load bag signal source is active speech or nonactive voice, the noise Jing of nonactive voice After crossing noise decoding, output noise band is made to change spectral characteristic, then into linear prediction filter；Arrange one group of addition with The exciting signal source of machine adaptive codebook and random fixed codebook, band change the output noise of spectral characteristic and exciting signal source Through linear prediction filter；Meanwhile, voice of the voice signal that active speech is obtained after tone decoding as vocoder Output, or as the phonetic entry that linear predictive coding is calculated, then after linear prediction filter, output comfort noise letter Number.

Further, during the voice recognition processing of network phone, receive load source speech signal, according to load bag Length detection judges to load whether bag signal source is active speech signal, and setting noise as the length of nonactive speech payloads bag is 1 byte is 0 byte, when for 1 byte when, decode and obtain noise gain, when for 0 byte when, then the noise gain of present frame It is constant, then replaced with the noise gain of previous frame, and the length of active speech load bag is 160 bytes, be such as active speech frame, Tone decoding is then proceeded to, otherwise turns noise decoding.

Further, described active speech load bag proceeds to tone decoding, using the compress speech side in G.711 standard Formula, speech decoding process is for the obtaining group voice signal from μ rates or A rates to the conversion of Linear Pulse Code Modulation, a side Output of the face as vocoder, current frame speech decoding terminate, then turn to do next frame decoding；On the other hand compile as linear prediction The phonetic entry that code device is calculated.

Further, the noise of described nonactive voice carries out noise decoding, is by the noise energy solution in load bag Code, obtains noise gain G, using audio coder ＆ decoder (codec) technical standard ITU-T G.711 II protocol modes of Appendix, its meter Calculation mode isWherein E is the noise energy that load decoding is obtained.

And then, the voice signal that described active speech is obtained after tone decoding is used as Linear Predictive Coder meter The phonetic entry of calculation, determines linear forecast coding coefficient using CELP QCELP Qualcomms.

Then, one group of exciting signal source for adding random adaptive codebook and random fixed codebook in white noise is set, Using International Telecommunication Union's voice compression algorithm ITU-T computational methods that G.729 B.4.4 annex B agreements save, according to Audio coder ＆ decoder (codec) technical standard G.711 in every frame length determining, obtain e [80] sequences as output voice signal.

Finally, the noise gain G that the e [80] for being obtained by the use of after calculating is obtained as driving source, noise decoding, by linear The calculated voice signal of predictive coding is filtered through linear prediction filter, obtains comfort noise output signal.

In sum, the method that a kind of network phone comfort noise proposed by the present invention is produced, is added in white noise Random adaptive codebook and random fixed codebook, produce voice signal model so that the comfort noise of output is softer.

Above-mentioned technical proposal is only the concrete application example of the present invention, can be drunk as the case may be in actual application Feelings select alternate device device, but the protection domain to inventing to be not limited in any way.

Claims

1. a kind of method that network phone comfort noise is produced, it is characterised in that：In the voice recognition processing process of network phone In, whether detection load bag signal source is active speech or nonactive voice, the noise of nonactive voice after noise decoding, Output noise band is made to change spectral characteristic, then into linear prediction filter；The random adaptive codebook of one group of addition is set With the exciting signal source of random fixed codebook, band changes the output noise of spectral characteristic and exciting signal source through linear prediction Wave filter；Meanwhile, voice output of the voice signal that active speech is obtained after tone decoding as vocoder, or conduct The phonetic entry that linear predictive coding is calculated, then after linear prediction filter, export comfort noise signal.

2. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that：Network phone During voice recognition processing, receive load source speech signal, judge that load bag signal source is according to the length detection of load bag No it is 1 byte or for 0 byte to set noise as the length of nonactive speech payloads bag for active speech signal, when for 1 byte When, decoding obtain noise gain, when for 0 byte when, then the noise gain of present frame is constant, then with the noise gain generation of previous frame Replace, and the length of active speech load bag is 160 bytes, be such as active speech frame, then proceed to tone decoding, otherwise turn noise solution Code.

3. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that：Described activity Speech payloads bag proceeds to tone decoding, and speech decoding process is, from μ rates or A rates to the conversion of Linear Pulse Code Modulation, to obtain The one group of voice signal for arriving, on the one hand as the output of vocoder, current frame speech decoding terminates, then turns to do next frame decoding； On the other hand the phonetic entry for calculating as linear predictive coding.

4. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that：Described non-live Property voice noise carry out noise decoding, be will load bag in noise energy decoding, obtain noise gain G, using voice coder Decoder technique standard agreement mode.

5. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that：Described activity The phonetic entry that the voice signal that voice is obtained after tone decoding is calculated as Linear Predictive Coder, is swashed using CELP codes Encourage linear predictive coding to determine linear forecast coding coefficient.

6. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that：Arrange one group The exciting signal source of random adaptive codebook and random fixed codebook is added in white noise, using International Telecommunication Union's compress speech The canonical algorithm ITU-T computational methods that G.729 B.4.4 annex B agreements save, according to audio coder ＆ decoder (codec) technical standard The length of the every frame in G.711 obtains e [80] sequences as output voice signal determining.

7. the method that a kind of network phone comfort noise according to claim 1 is produced, it is characterised in that：After calculating The noise gain G that the e [80] for obtaining is obtained as driving source, noise decoding, by the calculated voice of linear predictive coding Signal is filtered through linear prediction filter, obtains comfort noise output signal.