CN102760441B - Background noise coding/decoding device and method as well as communication equipment - Google Patents

Background noise coding/decoding device and method as well as communication equipment Download PDF

Info

Publication number
CN102760441B
CN102760441B CN201210194567.8A CN201210194567A CN102760441B CN 102760441 B CN102760441 B CN 102760441B CN 201210194567 A CN201210194567 A CN 201210194567A CN 102760441 B CN102760441 B CN 102760441B
Authority
CN
China
Prior art keywords
background noise
unit
silence description
discontinuous transmission
frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210194567.8A
Other languages
Chinese (zh)
Other versions
CN102760441A (en
Inventor
王喆
张立斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201210194567.8A priority Critical patent/CN102760441B/en
Publication of CN102760441A publication Critical patent/CN102760441A/en
Application granted granted Critical
Publication of CN102760441B publication Critical patent/CN102760441B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

本发明实施例提供一种背景噪声编码/解码装置、方法和通信设备,以实现根据通信环境的背景噪声情况灵活调整SID帧编码方式和非连续传输机制。本发明技术方案可以根据背景噪声的不同类型灵活采用不同的SID帧编码方法和非连续传输机制,在合理利用通信系统实际资源的情况下自适应于不同类型背景噪声所要求的编码精度和时间分辨力,来达到在合理利用信道资源的前提下在解码端高质量的提供各种类型背景噪声的目的。

Figure 201210194567

Embodiments of the present invention provide a background noise coding/decoding device, method and communication equipment, so as to flexibly adjust the SID frame coding method and the discontinuous transmission mechanism according to the background noise of the communication environment. The technical scheme of the present invention can flexibly adopt different SID frame coding methods and discontinuous transmission mechanisms according to different types of background noise, and adapt to the coding accuracy and time resolution required by different types of background noise under the condition of rational use of actual resources of the communication system To achieve the purpose of providing various types of background noise at the decoding end with high quality under the premise of rational use of channel resources.

Figure 201210194567

Description

A kind of background noise encoding/decoding device, method and communication facilities
Technical field
The present invention relates to voice communication technology, particularly a kind of background noise encoding/decoding device, method and communication facilities.
Background technology
In voice communication course, the voice signal being transmitted can be divided into audible signal and without acoustical signal two classes, wherein: audible signal is the voice signal sending when efficient voice being detected, mainly comprises communication party's voice and the information such as ground unrest at that time; The ambient noise signal sending in the gap that efficient voice do not detected without acoustical signal, mainly comprises diversity of settings noise, such as white noise, background brouhaha and quiet etc.Concerning communicating pair, audible signal is useful signal, is the carrier of the main communication information, and is garbage signal without acoustical signal, does not comprise the main communication information.
In voice communication course, what mainly pay close attention to due to communicating pair is useful audible signal, therefore considers the pressure of bandwidth, does not wish to transmit useless without acoustical signal, to reduce transmission bandwidth, increases power system capacity.If but only pass audible signal, can cause the discontinuous of ground unrest, make a side who listens to feel very uncomfortable, especially more obvious in the situation that ground unrest is stronger, even can make voice indigestion, so can only reduce as far as possible transmit useless without acoustical signal, on the basis that guarantees user's susceptibility, reduce bandwidth as far as possible, CNG (Comfort Noise generation has arisen at the historic moment for this reason, comfort noise generates) technology, in CNG technology, transmitting terminal adopts silence compression coding techniques coding background noise frames in noiseless signal phase, and adopt DTX (Discontinuous Transmission, discontinuous transmission) mode is transmitted, receiving end adopts the mode of interpolation to produce continuous ambient noise signal, to increase the comfort of listener.
As shown in Figure 1, for adopting the communication system principle schematic diagram of silence compression coding techniques, wherein, transmitting terminal comprises:
VAD (Voice Activity Detector, speech detector), original signal is inputted VAD after certain pre-service, VAD is conventionally according to the eigenwert of each frame signal, signal to noise ratio (S/N ratio) for example, in detection signal, whether comprise efficient voice, and according to testing result, signal is sent into different scramblers and encode;
Speech coder, for the audible signal that comprises efficient voice being encoded to speech frame output, speech frame is transferred to receiving end by communication channel;
Non-voice scrambler, adopting silence compression technology is background noise frames by the noiseless Signal coding that does not comprise efficient voice;
Background noise frames transmitter, the DTX mode of adopt setting by background noise frames by traffic channel to receiving end, for example every several frames, send a background noise frames, or determine whether to send a background noise frames according to the characteristic parameter situation of change of ground unrest;
Accordingly, receiving end comprises:
Voice decoder, for the speech frame of decoding and receiving from communication channel, and exports after being reconstructed into audible signal;
Non-voice demoder, for the background noise frames of decoding and receiving from communication channel;
Reconstructor, for adopting the mode of interpolation that decoded data is reconstructed into continuous ambient noise signal, to increase the comfort of listener.
The non-voice scrambler of transmitting terminal and background noise frames transmitter form DTX unit, and the non-voice demoder of receiving end and reconstructor form CNG unit, and silence compression technology is by being positioned at VAD and the DTX of transmitting terminal, and the CNG realization that is positioned at receiving end.
With CELP (Code-Excited Linear Prediction, the Code Excited Linear Prediction) speech coding technology commonly used, be example, CELP is a kind of high efficiency speech coding technology.Owing to having adopted perceptual weighting, analyze the technology such as synthetic, vector quantization and rear filtering, CELP can provide high-quality synthetic speech in middle low rate.In CELP speech coding technology, at transmitting terminal, carry out speech signal analysis to obtain the characteristic parameter of voice, the characteristic parameter of voice is encoded in code stream and is sent; Receiving end carries out phonetic synthesis, reconstructed speech signal according to the characteristic parameter in code stream.Voice signal s (n) is divided into the short time period, is called frame, and the speech analysis of transmitting terminal and the phonetic synthesis of receiving end are all carried out frame by frame.
At transmitting terminal, original signal s (n) can regard the output that a pumping signal e (n) encourages a wave filter v (n) to produce as, be s (n)=e (n) * v (n), receiving end is used identical wave filter to synthesize voice, is called composite filter.Therefore the characteristic parameter transmitting in the encoding code stream of voice signal comprises composite filter parameter and excitation parameters, wherein: composite filter parameter is mainly line spectral frequencies LSF (Line Spectrum Frequency) quantization parameter, and pumping signal parameter comprises: adaptive codebook parameter (comprising pitch delay parameter and fundamental tone gain parameter) and fixed codebook parameters (position and symbol and the fixed codebook gain parameter that comprise non-zero pulses in fixed codebook).
Different from speech coding parameters, what ground unrest coding parameter was described is ground unrest characteristic, because the pumping signal of ground unrest can be thought simple noise random series, and these sequences all can simply produce with random noise generation device at encoding and decoding end, then with energy parameter, control the amplitude of these sequences, just can produce final pumping signal, therefore pumping signal characteristic parameter can simply represent with energy parameter, and do not need to further describe with other some characteristic parameters, so different from speech frame in ground unrest encoding code stream, be, its excitation parameters is the energy parameter of current background noise frame, and identical with speech frame in ground unrest encoding code stream, the composite filter parameter in encoding code stream is also line spectral frequencies LSF quantization parameter, is concrete quantization method difference to some extent.By above analysis, can think that to the coded system of ground unrest be exactly a kind of simple voice coding modes in essence.
Through DTX compressed encoding and by the background noise frames of discontinuous transmission, be commonly referred to as SID (Silence Descriptor, silence description frames) frame.In current various speech coding systems, each intrasystem ambient noise signal is all used SID frame format and discontinuous transmission mechanism.
Wherein, G.729B silence compression scheme is a kind of representative silence compression technology, G.729B based on CELP algorithm model, so its ground unrest parameter of transmitting also obtains based on CELP model, comprises for describing composite filter parameter and the excitation parameters of ground unrest.At coding side, for example the court verdict of VAD is 0 o'clock, represent that current signal is noise, through certain pretreated original signal, be admitted to ground unrest code device, at ground unrest code device, ground unrest parameter is extracted and quantized, form SID frame, then ground unrest code device determines whether sending a frame SID frame according to the parameter situation of change in SID frame, for example, when the filter parameter in SID frame and energy parameter change greatly, sends a frame SID frame.G.729B, in, the interval of two adjacent S ID frames is at least 2 frames.In decoding end, if enter the ground unrest decoding and reconstituting stage, in ground unrest decoding device, according to the SID frame receiving, synthesize the comfort noise of description encoding end ground unrest characteristic.G.729B the bit number that in SID frame is each parametric distribution is as shown in table 1 below:
Table 1.
Bit distributes (from a high position to low level) Parametric description
S1 Quantitative prediction device index
S2-S6 One-level LSF quantisation vector index
S7-S10 Secondary LSF quantisation vector index
S11-S15 Gain (energy)
AMR (Adaptive Multi-Rate speech codec, self-adaptation multi code Rate of Chinese character speech coder) and AMR-WB (Adaptive Multi-Rate Wide Band speech codec, wideband adaptive multi code Rate of Chinese character speech coder) DTX in silence compression scheme adopts fixing transmit mechanism, no matter how the characteristic of ground unrest changes, and every 8 frames send a SID frame.The bit number that in the SID frame of AMR is each parametric distribution is as shown in table 2 below, is that the bit number of each parametric distribution is as shown in table 3 below in the SID frame of AMR-WB.
Table 2.
Bit distributes (from a high position to low level) Parametric description
S1-S3 Quantitative prediction device index
S4-S11 One-level LSF quantisation vector index
S12-S20 Secondary LSF quantisation vector index
S21-S29 Three grades of LSF quantisation vector index
S30-S35 Logarithm frame energy index
Table 3.
Bit distributes (from a high position to low level) Parametric description
S1-S6 One-level LSF quantisation vector index
S7-S12 Secondary LSF quantisation vector index
S13-S18 Three grades of LSF quantisation vector index
S19-S23 Level Four LSF quantisation vector index
S24-S28 Pyatyi LSF quantisation vector index
S29-S34 Logarithm frame energy index
S35 Shake sign
Existing ground unrest coded system can be carried out simple code to ground unrest, and synthesizes simple comfortable ground unrest in decoding end, can better meet to a certain extent the requirement of answering quality.Yet along with the development of the communication technology, voice coding quality is more and more higher, corresponding, the coding quality of the ground unrest in sound stage is also more and more higher, in this case, the ground unrest when making voice communication is more continuous, also needs to improve the coding quality of silent period ground unrest.Existing comparatively single SID frame coding mode and discontinuous transmission mechanism obviously cannot meet the specific requirement of multiple communication environment.
Summary of the invention
The embodiment of the present invention provides a kind of background noise encoding/decoding device, method and communication facilities, to realize according to the ground unrest situation of communication environment, adjusts flexibly SID frame coding mode and discontinuous transmission mechanism.
A ground unrest code device, comprising:
Background noise analysis unit, at least one signal characteristic parameter value of background extraction noise signal;
Ground unrest taxon, for determining ground unrest type according to described at least one signal characteristic parameter value;
Coding switch unit and at least two discontinuous transmission unit, described coding switch unit is for inputting corresponding discontinuous transmission unit according to described ground unrest type by ambient noise signal, each discontinuous transmission unit is for being encoded to by ambient noise signal silence description frames the discontinuous transmission that comprises discontinuous transmission unit indication information, and wherein the silence description frames coded system of each discontinuous transmission unit and/or silence description frames discontinuous transmission mechanism are not identical.
A ground unrest decoding device, comprising:
The first recognition unit, for identifying the discontinuous transmission unit indication information of silence description frames;
Decoding switch unit and at least two comfort noise generation units, described decoding switch unit is used for according to the recognition result of described recognition unit, described silence description frames is inputted to corresponding comfort noise generation unit, each comfort noise generation unit is according to the corresponding decoding of the coded system of the silence description frames receiving silence description frames, and reconstruct ambient noise signal.
A communication facilities, comprising:
The first speech detector, for detection of whether comprising efficient voice classification output in the signal gathering;
Speech coder, for being speech frame continuous wave output by the Signal coding that comprises efficient voice;
Also comprise ground unrest code device, this ground unrest code device specifically comprises:
Background noise analysis unit, for obtaining at least one signal characteristic parameter value of the ambient noise signal that does not comprise efficient voice;
Ground unrest taxon, for determining ground unrest type according to described at least one signal characteristic parameter value;
Coding switch unit and at least two discontinuous transmission unit, described coding switch unit is for inputting corresponding discontinuous transmission unit according to described ground unrest type by ambient noise signal, each discontinuous transmission unit is for being encoded to by ambient noise signal silence description frames the discontinuous transmission that comprises discontinuous transmission unit indication information, and wherein the silence description frames coded system of each discontinuous transmission unit and/or silence description frames discontinuous transmission mechanism are not identical.
A communication facilities, comprises ground unrest decoding device, and described ground unrest decoding device comprises:
The first recognition unit, for identifying the discontinuous transmission unit indication information of silence description frames;
Decoding switch unit and at least two comfort noise generation units, described decoding switch unit is used for according to the recognition result of described recognition unit, described silence description frames is inputted to corresponding comfort noise generation unit, each comfort noise generation unit is according to the corresponding decoding of the coded system of the silence description frames receiving silence description frames, and reconstruct ambient noise signal.
A ground unrest coding method, comprising:
At least one signal characteristic parameter value of background extraction noise signal;
According to described at least one signal characteristic parameter value, determine ground unrest type;
According to a kind of discontinuous transmission mechanism of described ground unrest type selecting, ambient noise signal is encoded to silence description frames the discontinuous transmission that comprises discontinuous transmission mechanism indication information, wherein the silence description frames coded system in each discontinuous transmission mechanism and/or the discontinuous send mode of silence description frames are not identical.
A coding/decoding method, comprising:
Discontinuous transmission mechanism indication information in identification silence description frames, wherein the silence description frames coded system in each discontinuous transmission mechanism is not identical;
According to described discontinuous transmission mechanism indication information, adopt corresponding decoding process decoding silence description frames, and be reconstructed into ambient noise signal.
Application technical solution of the present invention, can be according to the different SID frame coding method of the dissimilar flexible employing of ground unrest and discontinuous transmission mechanism, in the situation that rationally utilizing communication system real resource, be adaptive to the desired encoding precision of dissimilar ground unrest and time sense, reach under the prerequisite of rationally utilizing channel resource in the high-quality object that all kinds ground unrest is provided of decoding end.
Accompanying drawing explanation
Fig. 1 is the communication system principle schematic diagram of existing employing silence compression coding techniques;
Fig. 2, Fig. 3, Fig. 5 and Fig. 6 are respectively a kind of ground unrest code device primary structure schematic diagram that the embodiment of the present invention provides;
The switch mode principle schematic that Fig. 4 provides for the embodiment of the present invention;
Fig. 7, Fig. 8 and Fig. 9 are respectively a kind of communication facilities primary structure schematic diagram that the embodiment of the present invention provides.
Embodiment
Further analyze existing ground unrest coding, in each coding/decoding system, all adopt single ground unrest frame coding mode, cannot describe targetedly the concrete ambient noise signal of the communication environment of various characteristics, thereby meet the multifarious demand of communication environment; Single discontinuous transmission mechanism also cannot be transmitted background noise frames targetedly to the background noise frames of different qualities, so that cannot produce targetedly good comfort noise in decoding end; And the unicity of discontinuous transmission mechanism also cannot be adaptive to the actual state of communication system, for example communication bandwidth, can not rationally be applied down to the resource of communication system.
Therefore, a kind of encoding and decoding technique scheme that the embodiment of the present invention provides, can according to ground unrest characteristic, carry out respective handling targetedly, at coding side, ground unrest is analyzed, according to the characteristic of ground unrest, adopt different DTX scheme codings to send SID frame, and adopt corresponding CNG scheme reconstructed background noise signal at receiving end.
As shown in Figure 2, be a kind of ground unrest code device 20 that the embodiment of the present invention provides, this ground unrest code device 20 is for silence compression navamander, and primary structure comprises:
At least two DTX unit 21, comprise 211,DTX unit, DTX unit 2 12......DTX unit 21n, each DTX unit 21 is not continuous wave output for ambient noise signal being encoded to SID frame according to different DTX mechanism, wherein, difference between different DTX mechanism comprises: SID frame coding mode is not identical, SID frame discontinuous transmission mechanism is not identical, or the two is not identical;
Background noise analysis unit 22, for calculating at least one signal characteristic parameter value according to ambient noise signal;
Ground unrest taxon 23, for determining ground unrest type according to described at least one signal characteristic parameter value;
Coding switch unit 24, for inputting corresponding DTX unit 21 according to described ground unrest type by the ambient noise signal of reception.
The function of background noise analysis unit 22 is to analyze the ambient noise signal of input to obtain the characteristic parameter of background signal, analytical approach is to calculate one or more signal characteristic parameter values that can characterize ambient noise signal, and these signal characteristic parameters can be one of following parameter or combination in any: when long, SNR (signal to Noise Ratio), ground unrest are estimated level, ground unrest energy hunting estimation, ground unrest spectrum fluctuation estimation, ground unrest vibration frequency etc.Other can characterization signal background the parameter of size, kind (stable state/unstable state, changes size, changes speed), SNR (when long, in short-term, peak value) also can be calculated for analyzing.Except ambient noise signal being inputted background noise analysis unit 22, VAD can also be adjudicated to current demand signal is the indicator signal input background noise analysis unit 22 of background signal, and background noise analysis unit 22 triggers and calculates signal characteristic parameter value according to ambient noise signal indication information.
The function of ground unrest taxon 23 is the signal characteristic parameter values that calculate according to background noise analysis unit 22, divides the type of current background noise signal.Ground unrest type at least defines two kinds, can define according to whether stablizing of ground unrest, for example: ground unrest is divided into stationary background noise signal and unstable state ambient noise signal two classes; Whether stablizing and the size definition of SNR when long according to ground unrest and for example: ground unrest is divided into the large SNR ambient noise signal of stable state, the little SNR ambient noise signal of stable state, the large SNR ambient noise signal of unstable state, little SNR ambient noise signal four classes of unstable state.Can also be according to the size definition ground unrest type of ground unrest institute inclusion information meaning, such as ambient noise signal can be divided into melodious ambient noise signal (as music, flowing water, bird's twitters etc.), environmental background noise signal is (as cocktail party, curbside, airport etc.) or other ambient noise signal three classes.In a word, the classified types of ground unrest can be set as required, does not limit the enforcement of technical solution of the present invention.
Coding is the pre-stored applicable DTX unit 21 of each ground unrest type in switch unit 24, the ground unrest type definite according to ground unrest taxon 23, between at least two DTX unit 21, switch, select the DTX unit 21 corresponding with current background noise type to process.
Generally each ground unrest type is distinguished corresponding a kind of different DTX scheme, but consider demand or the communication channel resources situation of change of concrete communication scenes, can also set different switch modes, the switching controls scheme that each switch mode is corresponding different, can the corresponding concrete switching controls scheme of each switch mode is pre-stored in coding switch unit 24, and by the required switch mode of switch mode steering order indication coding switch unit 24, switch mode steering order can be limited instruction set of the current channel resource situation of reflection, the corresponding a kind of switch mode of each specific switch mode steering order, it is the distribution method of a kind of DTX unit.Also can, when switch mode steering order each time, concrete switching controls scheme corresponding to required switch mode be sent to coding switch unit 24 simultaneously.
Coding switch unit 24 is determined corresponding switch mode according to the indication information of concrete reason.To those skilled in the art, concrete switch mode switching control method is a lot, will not enumerate here.
Therefore as shown in Figure 3, in the code device of ground unrest shown in Fig. 2 20, can also increase command reception unit 25 is set, receive switch mode steering order and be transmitted to coding switch unit 24, switch mode steering order can be operating personnel from outside input, can be also transmitting terminal according to the communication channel resource conditions generation detecting etc.
Coding switch unit 24 can be selected appropriate switch mode according to switch mode steering order, has at least DTX unit 21 corresponding to ground unrest type not identical between any two switch modes.The ground unrest code device 20 of take comprises that four DTX unit 21 are example, the setting example of switch mode as shown in Figure 4:
When command reception unit 25 is not set, coding switch unit 24 is only that ground unrest is selected appropriate DTX unit according to ground unrest type, and for example switch mode 1;
When being provided with command reception unit 25, coding switch unit 24 is determined the pattern of selecting DTX unit according to switch mode steering order.For example:
While receiving switch mode steering order 1, according to switch mode 1, switch;
While receiving switch mode steering order 2, according to switch mode 2, switch;
While receiving switch mode steering order 3, according to switch mode 3, switch.
Switch mode steering order can be sent according to current communication channel resource conditions, for example, when current channel resource is nervous, wherein the ground unrest of part ground unrest type may be used samely compared with the DTX unit of low rate, and even all ground unrest types are all used minimum speed limit DTX unit.Again for example, when there is no switch mode steering order, only have wherein part DTX unit to be used, but when switch mode steering order represents that current channel resource is abundant or nervous, another part DTX unit is used.
Example is as shown in Figure 4 visible:
Instruction 1 indication enters switch mode 1, each ground unrest type corresponding a kind of different DTX unit all now, and instruction 1 can mean that current channel resource is normal;
Instruction 2 indications enter switch mode 2, the DTX unit that now part ground unrest type can be corresponding identical, as ground unrest Class1 and the equal corresponding DTX of ground unrest type 2 unit 212, ground unrest type 3 and the equal corresponding DTX of ground unrest type 4 unit 214, instruction 2 can represent current channel resource, and abundant (212 compare DTX unit, DTX unit 211 code rates are high, take resource many; 214 compare DTX unit, DTX unit 213 code rates are high, take resource many), or can represent that more nervous (212 compare DTX unit, DTX unit 211 code rates low, takies resource few to current channel resource; 214 compare DTX unit, DTX unit 213 code rates are low, take resource few);
Instruction 3 indications enter switch mode 3, the equal corresponding DTX unit 214 only of all ground unrest types now, instruction 3 can represent current channel resource, and very abundant (DTX unit 214 code rates are the highest, take resource maximum), or can represent current channel resource very nervous (DTX unit 214 code rates are minimum, take resource minimum).
As shown in Figure 5, when very sufficient or ground unrest is melodious especially at current channel resource, can also speech coder be set in ground unrest code device 20 again, utilize speech coder to carry out full-rate codes to specific background noise.Satisfied the imposing a condition of signal characteristic parameter value of specific background noise, whether ground unrest taxon 23 meets and imposes a condition to judge specific background noise type according at least one signal characteristic parameter value, and indicate coding switch unit 24 when specific background noise, to be switched to speech coder, speech coder is encoded to speech frame continuous wave output by this specific background noise signal.Certainly, also can utilize the existing speech coder of transmitting terminal to bear this function.
Consult shown in Fig. 1, each DTX unit 21 comprises two parts, and one is non-voice scrambler, and for ambient noise signal being encoded to SID frame, it two is background noise frames transmitter, for sending background noise frames according to the discontinuous mode of setting.Difference between DTX unit can be the difference of SID frame coding mode, can be also the difference of discontinuous transmission mechanism, or the two is neither same.Wherein: the difference of SID frame coding mode can be embodied in the following aspects:
1, the difference of coding parameter
Different SID frames can adopt the parameter of different types of parameter and varying number;
2, the difference of identical parameters quantified precision
For the different SID frames of same parameters, can adopt different quantified precisions;
Therefore, the structure of the SID frame that different SID frame coding modes produce may be different with length, and for example: the coding parameter of stationary background noise signal noise can be relatively simple, and the quantified precision of parameter can be relatively low, the length of SID frame can be relatively short.Corresponding, the coding parameter of unstable state ambient noise signal can be relatively many, a kind of method is to increase the excitation parameters in speech coding parameters in existing SID frame, as adaptive codebook parameter (pitch delay and gain) and fixed codebook parameters (position of non-zero pulses and symbol and gain) etc., the quantified precision of parameter can be relatively high, as increasing quantified precision of existing SID frame median filter parameter and excitation parameters etc., the length of SID frame also can correspondingly increase.And for example, the SID frame of melodious ambient noise signal should be used the most complicated coded system (can be full-rate codes), the SID frame of environmental background noise signal is used time complicated coded system, and the SID frame of other ambient noise signal is used better simply coded system.
Different ground unrest types can also be used different discontinuous transmission mechanism, and the difference of the SID frame transmit mechanism of different background noise type is mainly reflected on the transmission intercal of adjacent two SID frames.For example, for stationary background noise signal, because the change of background is less, the Quality Down susceptibility after DTX/CNG is not high to such ground unrest for human auditory system, only needs with relatively long discontinuous transmission interval.For unstable state ambient noise signal, because background change is in time larger, in order to guarantee that such ground unrest can have a higher subjective quality after DTX/CNG, need to use relatively short discontinuous transmission interval or even continuous transmission.And for example, for melodious background, due to the non-Noise Characteristic of itself, the requirement of subjective quality is quite high, so DTX system just need to be carried out continuous transmission.For environmental background noise, due to itself, carried certain environmental information, in order to ensure not losing these information, DTX system also need to be upgraded SID frame with higher speed.For other ground unrest, due to true appearance the non-staple object of reduction background, so the renewal of carrying out SID frame that DTX system just can be relatively slow.
The transmission intercal of SID frame is divided into two kinds, constant transmissions interval and Adaptive Transmission interval.Under the mechanism at constant transmissions interval, the interval of adjacent two SID interframe is fixed, and the size at interval just depends on the type of ground unrest.Under the mechanism at Adaptive Transmission interval, although the transmission of SID frame is to be adaptive to the speed that noise changes, also there is the restriction of minimum time interval.In the embodiment of the present invention, according to different background noise type, can determine to adopt different minimum intervals.
As shown in Figure 6, the primary structure schematic diagram for the first communication facilities provided by the invention, comprising: VAD10, ground unrest code device 20, speech coder 30 and communication interface 40, wherein:
VAD10, for detection of efficient voice, sends audible signal into speech coder 30 and is encoded to speech frame, and communication interface sends speech frame by communication channel; Ambient noise signal is sent into ground unrest code device 20, ground unrest code device 20 output SID frames, communication interface sends SID frame by communication channel.
In ground unrest code device 20, coding switch unit 24 switches applicable DTX unit 21 according to the type of ambient noise signal.
As shown in Figure 7, the second communication facilities primary structure schematic diagram providing for the embodiment of the present invention, wherein: the coding switch unit 24 in ground unrest code device 20 is determined switch mode according to the instruction of command reception unit 25, and switch applicable DTX unit 21 according to the switch mode of the type of ambient noise signal and current use.
Shown in Figure 5, in the ground unrest code device 20 in the communication facilities shown in Fig. 6 and Fig. 7, speech coder can also be set, speech coder completes the full-rate codes of ground unrest.
As shown in Figure 8, if utilize existing speech coder 30 to complete the full-rate codes of ground unrest, the switch unit 24 of encoding is connected on speech coder 30, ambient noise signal is sent into speech frame the continuous wave output that is encoded to full rate in speech coder 30.
The technical scheme that adopts the embodiment of the present invention to provide, the coding send mode of corresponding each the SID frame of needs arranges corresponding DTX unit indication information in SID frame, and DTX unit indication information is determined corresponding decoding and reconstituting scheme for decoding end.For speech frame, by Voice decoder, decoded and continuous wave output.
Due to 22 pairs of background noise analysis unit, calculating the calculating of background characteristics parameter value calculates identical with the part of each DTX unit 21 when encoding SID frame, therefore background noise analysis unit 22 can also connect each DTX unit 21, analysis result is exported to each DTX unit 21, thereby part reduces the calculated amount of DTX unit 21.
As shown in Figure 9, a kind of communication apparatus construction schematic diagram providing for the embodiment of the present invention, the unit that completes receiving end function mainly comprises: ground unrest decoding device 50, recognition unit 60, decoding switch unit 70, Voice decoder 80 and communication interface, wherein:
This ground unrest decoding device 50 comprises at least two CNG unit (51,52...5n), and each CNG unit is according to a kind of decoding and reconstituting scheme decoding SID frame of setting and be reconstructed into ground unrest;
Recognition unit 60, for identifying speech frame and the SID frame receiving by communication interface, and further identifies the DTX unit indication information in SID frame;
Decoding switch unit 70, for switching according to the recognition result of recognition unit 60, sends speech frame into voice decipher 80, SID frame is sent into corresponding CNG unit and carry out decoding and reconstituting.
Wherein, consult shown in Fig. 1, each CNG unit comprises non-voice demoder, for the background noise frames of decoding and receiving from communication channel; Reconstructor, for adopting the mode of interpolation that decoded data is reconstructed into continuous ambient noise signal, to increase the comfort of listener.
If first recognition unit 60 identifies speech frame or SID frame and SID frame that the coded frame receiving is full-rate codes, further identify the DTX unit indication information in this SID frame., should have 1 or several for carrying the bit of DTX unit indication information in the code stream of SID frame here, bit bit position both can also can be at end in the beginning of SID frame, or for example, at other assigned address of SID frame, shown in following table 3 or table 4:
Table 3.
Bit distributes (from a high position to low level) Parametric description
S1 DTX unit indication information
S2 Quantitative prediction device index
S3-S7 One-level LSF quantisation vector index
S8-S11 Secondary LSF quantisation vector index
S12-S16 Gain (energy)
Table 4.
Bit distributes (from a high position to low level) Parametric description
S1 Quantitative prediction device index
S2-S6 One-level LSF quantisation vector index
S7-S10 Secondary LSF quantisation vector index
S11-S15 Gain (energy)
S16-S17 DTX unit indication information
In recognition unit 60, can comprise two recognin unit, a recognin unit is for identifying speech frame or the SID frame that the coded frame receiving is full-rate codes, if identify speech frame, notify decoding switch unit 70, decoding switch unit is sent speech frame into Voice decoder 80 decoding continuous wave outputs; If identifying is SID frame, SID frame is sent into another recognin unit and further identified the DTX unit indication information in SID frame.
Because different ground unrest types has different SID frame format or update mechanism, according to DTX unit indication information, select suitable DTX decoding scheme decoding SID frame, noise interpolating method in different decoding schemes is different, the comfort noise being finally restored out.
Communication facilities can possess transmitting terminal function, also concrete receiving end function, structure shown in Fig. 9 conventionally and the merging of structure shown in Fig. 7 or Fig. 8 be arranged in a communication facilities.
With specific embodiment, be elaborated again below.
Embodiment mono-
At coding side, ambient noise signal is inputted background noise analysis unit 22 frame by frame, background noise analysis unit 22 calculates the subband level level[k of each current demand signal frame, i], wherein k and i represent to be respectively the level of i frame k subband and to be kept in internal memory.The calculating of subband can be realized also and can be realized by the method for conversion by bank of filters.
When the enough somes of signal frame accumulation, background noise analysis unit 22 starts according to being kept at the subband level in internal memory, calculates ground unrest spectrum fluctuation parameters spec_var[i], and after upgrade result of calculation while receiving each frame background noise frames.
Ground unrest spectrum fluctuation parameters spec_var[i] computing method be:
spec _ var [ i ] = Σ m = i - L + 1 i ( Σ n = i - L + 1 , n ≠ m i ( Σ k = 1 N ( ( level [ k , m ] - level [ k , n ] ) / ( 1 L · N Σ m = i - L + 1 i Σ k = 1 N level [ k , i ] ) ) 2 ) )
The quantity that wherein N is subband, L is chosen as to do ambient noise signal number of frames average when long.Ground unrest spectrum fluctuation parameters spec_var[i] calculating can be also based on LSF coefficient, that is:
spec _ var [ i ] = Σ m = i - L + 1 i ( Σ n = i - L + 1 , n ≠ m i ( Σ k = 1 N ( ( lsf [ k , m ] - lsf [ k , n ] ) / ( 1 L · N Σ m = i - L + 1 i Σ k = 1 N lsf [ k , i ] ) ) 2 ) )
Lsf[k wherein, i] represent k lsf coefficient of i frame, N is lsf number of coefficients, L is chosen as to do ambient noise signal number of frames average when long.
Background noise analysis unit 22 exports the ground unrest spectrum fluctuation parameters calculating to ground unrest taxon 23.
Coding switch unit 24 is that current background noise frame is selected corresponding DTX unit according to the ground unrest type information of ground unrest taxon 23 outputs.
In the present embodiment, two kinds of background type have been defined: stationary background noise signal and unstable state ambient noise signal.Ground unrest taxon 23 compares and incorporates current background noise into in stable state or unstable state a kind of by ground unrest being composed to fluctuation parameters and thresholding spec_var_thr.If spec_var[i] < spec_var_thr, current background noise signal is classified as stationary background noise signal, otherwise is classified as unstable state ambient noise signal.
In the present embodiment, DTX system has two kinds of different programmes of work:
Scheme one adopts every 8 frames to send the discontinuous transmission mechanism of a SID frame, only comprises wave filter and energy parameter in SID frame;
Scheme two adopts every 3 frames to send the discontinuous transmission mechanism of a SID frame, and in SID frame, except comprising wave filter, energy parameter has also increased fixed codebook parameters (belonging to excitation parameters) outward.
When current background noise belongs to stable state class, coding switch unit 24 is selected programme of work one.When current background noise belongs to unstable state class, coding switch unit 24 is selected programme of work two.
During every kind of SID frame assembling, a DTX unit indication information is set in frame to state which kind of DTX unit this SID frame belongs to.In the present embodiment, DTX unit indication information can represent with 1 bit, and 0 and 1 represents respectively Yi HeDTX unit, DTX unit two.
In decoding end, first recognition unit 60 judges the DTX unit under this SID frame according to the DTX unit indication information in SID frame, and decoding switch unit 70 is inputted corresponding CNG unit 50 by SID frame and carried out decoding and reconstituting.Wherein:
If DTX unit indication information is 0, corresponding CNG unit obtains filter parameter and energy parameter and 8 ambient noise signal frames is thereafter carried out to interpolation according to the SID frame format decoding in above-mentioned DTX programme of work one;
If DTX unit indication information is 1, corresponding CNG unit obtains filter parameter, energy parameter and fixed codebook parameters and 3 ambient noise signal frames is thereafter carried out to interpolation according to the SID frame format decoding in above-mentioned DTX programme of work two.
Embodiment bis-
At coding side, ambient noise signal is inputed to background noise analysis unit 22 frame by frame, and background noise analysis unit 22 calculates the subband level level[k of current background noise signal frame, i], frame energy pow[i] and signal to noise ratio (S/N ratio) snr[i in short-term], and be kept in internal memory.Wherein:
Subband level level[k, i] calculating can realize also and can realize by the method for conversion by bank of filters;
The account form of frame energy is:
Figure BDA00001759760300171
wherein N represents total sub band number;
The account form of signal to noise ratio (S/N ratio) is in short-term:
Figure BDA00001759760300172
bckr_noise_pow[i wherein] ground unrest energy is estimated when long.
When the enough somes of ambient noise signal frame accumulation, background noise analysis unit 22 starts according to being kept at short-time characteristic parameter (the subband level in internal memory, frame energy, signal to noise ratio (S/N ratio) in short-term) calculate ground unrest energy hunting pow_var[i], signal to noise ratio (S/N ratio) snr_long[i when long], ground unrest energy is estimated bckr_noise_pow[i when long], and when receiving each ambient noise signal frame, upgrade result of calculation.
When long signal to noise ratio (S/N ratio), when the renewal of all the other calculation of parameter results occurs over just current demand signal frame and is ambient noise signal frame, when when long, the renewal of signal to noise ratio (S/N ratio) occurs over just current demand signal frame and is non-ambient noise signal frame.
Signal to noise ratio (S/N ratio) snr_long[i when long] computing method be:
Figure BDA00001759760300173
wherein L is chosen as to do non-ambient noise signal number of frames average when long.
Ground unrest energy hunting pow_var[i] computing method be:
pow _ var [ i ] = 1 L * &Sigma; m = i - L + 1 i ( pow [ m ] - 1 L * &Sigma; m = i - L + 1 i pow [ m ] ) 2 , Wherein L is chosen as to do ambient noise signal number of frames average when long.
When long, ground unrest energy is estimated bckr_noise_pow[i] computing method be:
Bckr_noise_pow[i]=(1-α) * bckr_noise_pow[i-1]+α * pow[i], wherein α is the scale factor of 0 to 1, in the present embodiment, value is 5%.
Background noise analysis unit 22 is by the ground unrest energy hunting pow_var[i calculating], signal to noise ratio (S/N ratio) snr_long[i when long] export ground unrest taxon 23 to.
In the present embodiment, four kinds of background type have been defined: the large SNR ambient noise signal of stable state, the little SNR ambient noise signal of stable state, the large SNR ambient noise signal of unstable state and the little SNR ambient noise signal of unstable state.
Ground unrest taxon 23 is by by pow_var[i] and snr_long[i] compare with an energy hunting thresholding pow_var_thr and a signal-noise ratio threshold snr_thr respectively, it is a class of 4 kinds of ground unrest type kinds that current background noise is incorporated into, wherein:
As pow_var[i] when > pow_var_thr and snr_long > snr_thr, current background is the large SNR ambient noise signal of unstable state;
As pow_var[i] when > pow_var_thr and snr_long < snr_thr, current background is the little SNR ambient noise signal of unstable state;
As pow_var[i] when < pow_var_thr and snr_long > snr_thr, current background is the large SNR ambient noise signal of stable state;
As pow_var[i] when < pow_var_thr and snr_long < snr_thr, current background is the little SNR ambient noise signal of stable state.
In the present embodiment, one has four kinds of different DTX unit, wherein:
DTX unit one adopts every 8 frames to send the discontinuous transmission mechanism of a SID frame, only comprises filter parameter and energy parameter in SID frame;
DTX unit two adopts every 8 frames to send the discontinuous transmission mechanism of a SID frame, also comprises fixed codebook parameters in SID frame except comprising filter parameter and energy parameter;
DTX unit three adopts every 3 frames to send the discontinuous transmission mechanism of a SID frame, also comprises fixed codebook parameters in SID frame except comprising filter parameter and energy parameter;
DTX unit four adopts the continuous transmission mechanism of full-rate codes.
Coding switch unit 24, except receiving the ground unrest type information of ground unrest taxon 23, is also connected with a switch mode steering order interface.Switch mode steering order has 5 kinds of indications, represent respectively that bandwidth is normal, bandwidth is more, bandwidth is less, bandwidth is a lot of and bandwidth seldom.
The switch mode relation of switch mode steering order and DTX system is as follows:
When switch mode steering order represents that bandwidth is normal, the switch mode of DTX system is: the large SNR class of stable state background is adopted to DTX unit one; The little SNR class of stable state background is adopted to DTX unit two; The large SNR class of unstable state background is adopted to DTX unit three; The little SNR class of unstable state background is adopted to DTX unit four.
When switch mode steering order represents that bandwidth is more, the switch mode of DTX system is: stable state large SNR class background and the little SNR class of stable state background are all adopted to DTX unit two; Unstable state large SNR class background and the little SNR class of unstable state background are all adopted to DTX unit four;
When switch mode steering order represents that bandwidth is less, the switch mode of DTX system is: stable state large SNR class background and the little SNR class of stable state background are all adopted to DTX unit one; Unstable state large SNR class background and the little SNR class of unstable state background are all adopted to DTX unit three;
When switch mode steering order represents that bandwidth is a lot, the switch mode of DTX system is: four kinds of ground unrest types are all adopted to DTX unit four;
When switch mode steering order represents that bandwidth seldom, the switch mode of DTX system is: four kinds of ground unrest types are all adopted to DTX unit one.
During every kind of SID frame assembling, a DTX unit indication information is set in frame to state which kind of DTX unit this SID frame belongs to.DTX unit indication information in the present embodiment can represent with 2 bits, wherein: 00,01 and 10 represent respectively Er HeDTX unit, Yi,DTX unit, DTX unit three, so scheme four due to be full-rate codes without sign out.
In decoding end, first recognition unit 60 judges the DTX unit under this SID frame according to the DTX unit indication information in SID frame, and indicates corresponding CNG unit decodes reconstruct, wherein:
If DTX unit indication information is 00, according to the SID frame format decoding in above-mentioned DTX programme of work one, obtains filter parameter and energy parameter and 8 ambient noise signal frames are thereafter carried out to interpolation;
If DTX unit indication information is 01, according to the SID frame format decoding in above-mentioned DTX programme of work two, obtain excitation parameters, filter parameter and energy parameter also carry out interpolation to 8 ambient noise signal frames thereafter.
If DTX unit indication information is 10, according to the SID frame format decoding in above-mentioned DTX programme of work three, obtains filter parameter, energy parameter and fixed codebook parameters and 3 ambient noise signal frames are thereafter carried out to interpolation.
Those skilled in the art can be flexibly according to signal characteristic parameter dividing background noise type, and sets switch mode, will not enumerate here.
In sum, the embodiment of the present invention provides a kind of coding method, comprising:
Obtain at least one signal characteristic parameter value of ambient noise signal to be sent;
According to described at least one signal characteristic parameter value, determine ground unrest type;
According to a kind of DTX mechanism of described ground unrest type selecting, ambient noise signal is encoded to SID frame the discontinuous transmission that comprises DTX mechanism indication information, wherein the SID frame coding mode in each DTX mechanism and/or the discontinuous send mode of SID frame are not identical.
Further, described coding method also comprises: set different DTX mechanism preference pattern, i.e. different switch mode, has at least DTX mechanism corresponding to ground unrest type not identical between any two DTX mechanism preference pattern; And according to selected DTX mechanism preference pattern, for each ground unrest type is determined corresponding DTX mechanism.
A nearlyer step, described method also comprises: when setting value that described at least one signal characteristic parameter value is, ambient noise signal is encoded to the speech frame of full rate and sends continuously.
Wherein, signal characteristic parameter value comprises one of following or combination in any: when long, SNR, ground unrest estimate that level, ground unrest energy hunting estimation, the fluctuation of ground unrest spectrum estimate and ground unrest vibration frequency.
The embodiment of the present invention also provides a kind of method that corresponding SID frame is decoded, and comprising:
DTX mechanism indication information in identification SID frame, wherein the SID frame coding mode in each DTX mechanism is not identical;
According to described DTX mechanism indication information, adopt corresponding decoding process decoding SID frame, and be reconstructed into ambient noise signal.
The technical scheme that the application embodiment of the present invention proposes, can be according to the actual state of the dissimilar and communication system of ground unrest, adopt different SID frame coding methods and discontinuous transmission mechanism, in the situation that rationally utilizing communication system real resource, be adaptive to the desired encoding precision of dissimilar ground unrest and time sense, reach under the prerequisite of rationally utilizing channel resource in the high-quality object that all kinds ground unrest is provided of decoding end.
Obviously, those skilled in the art can carry out various changes and modification and not depart from the spirit and scope of the present invention the present invention.Like this, if within of the present invention these are revised and modification belongs to the scope of the claims in the present invention and equivalent technologies thereof, the present invention is also intended to comprise these changes and modification interior.

Claims (31)

1.一种背景噪声编码装置,其特征在于,包括:1. A background noise encoding device, characterized in that, comprising: 背景噪声分析单元,用于获取背景噪声信号的至少一个信号特征参数值;所述背景噪声信号包括背景噪声,所述背景噪声包括白噪声、背景嘈杂声和静音;A background noise analysis unit, configured to obtain at least one signal characteristic parameter value of a background noise signal; the background noise signal includes background noise, and the background noise includes white noise, background noise and silence; 背景噪声分类单元,用于根据所述至少一个信号特征参数值确定背景噪声类型;A background noise classification unit, configured to determine the type of background noise according to the at least one signal characteristic parameter value; 编码切换单元和至少两个非连续传输单元,所述编码切换单元用于根据所述背景噪声类型将背景噪声信号输入对应的非连续传输单元,每一个非连续传输单元用于将背景噪声信号编码为包含非连续传输单元指示信息的静音描述帧并非连续传输,其中各非连续传输单元的静音描述帧编码方式和/或静音描述帧非连续传输机制不相同,各非连续传输单元的静音描述帧编码方式不相同具体为:不同静音描述帧的编码参数不相同,或者同一编码参数不同静音描述帧的量化精度不相同,各非连续传输单元的静音描述帧非连续传输机制不相同具体为:发送的相邻两个静音描述帧的传输间隔不相同。A coding switching unit and at least two discontinuous transmission units, the coding switching unit is used to input the background noise signal into the corresponding discontinuous transmission unit according to the background noise type, and each discontinuous transmission unit is used to code the background noise signal The silence description frame containing the indication information of the discontinuous transmission unit is not transmitted continuously, and the encoding method and/or discontinuous transmission mechanism of the silence description frame of each discontinuous transmission unit is different, and the silence description frame of each discontinuous transmission unit The encoding methods are different, specifically: the encoding parameters of different silence description frames are different, or the quantization accuracy of different silence description frames with the same encoding parameter is different, and the discontinuous transmission mechanisms of the silence description frames of each discontinuous transmission unit are different. Specifically: send The transmission intervals of two adjacent silence description frames are different. 2.如权利要求1所述的背景噪声编码装置,其特征在于,还包括:指令接收单元,用于接收切换模式控制指令并转发给编码切换单元,所述切换模式控制指令用于指示编码切换单元使用一种切换模式,任意两个切换模式之间至少有一个背景噪声类型对应不同的非连续传输单元。2. The background noise encoding device according to claim 1, further comprising: an instruction receiving unit, configured to receive a switching mode control instruction and forward it to the encoding switching unit, and the switching mode control instruction is used to instruct encoding switching The unit uses a switching pattern, and at least one background noise type between any two switching patterns corresponds to a different discontinuous transmission unit. 3.如权利要求1所述的背景噪声编码装置,其特征在于,还包括:语音编码器,所述编码切换单元根据背景噪声分类单元识别出的其中一种指定背景噪声类型,将背景噪声信号输入语音编码器编码为语音帧并连续输出。3. The background noise encoding device according to claim 1, further comprising: a speech encoder, wherein the encoding switching unit converts the background noise signal according to one of the specified background noise types recognized by the background noise classification unit The input speech encoder encodes speech frames and outputs them continuously. 4.如权利要求1、2或3所述的背景噪声编码装置,其特征在于,所述非连续传输单元具体包括:4. The background noise encoding device according to claim 1, 2 or 3, wherein the discontinuous transmission unit specifically comprises: 非语音编码器,用于将背景噪声信号编码为静音描述帧;A non-speech encoder for encoding background noise signals into silence description frames; 静音描述帧发送器,用于非连续发送所述静音描述帧。The silence description frame sender is used for discontinuously sending the silence description frame. 5.一种对权利要求1所述背景噪声编码装置输出的静音描述帧进行解码的背景噪声解码装置,其特征在于,包括:5. A background noise decoding device for decoding the silence description frame output by the background noise coding device according to claim 1, characterized in that it comprises: 第一识别单元,用于识别静音描述帧中的非连续传输单元指示信息;The first identification unit is used to identify the DTU indication information in the silence description frame; 解码切换单元和至少两个舒适噪声生成单元,所述解码切换单元用于根据所述识别单元的识别结果,将所述静音描述帧输入对应的舒适噪声生成单元,每一个舒适噪声生成单元根据接收的静音描述帧的编码方式对应解码静音描述帧,并重构背景噪声信号。A decoding switching unit and at least two comfort noise generating units, the decoding switching unit is used to input the silence description frame into the corresponding comfort noise generating unit according to the identification result of the identifying unit, and each comfort noise generating unit receives The encoding method of the silence description frame corresponds to decoding the silence description frame and reconstructing the background noise signal. 6.如权利要求5所述的背景噪声解码装置,其特征在于,还包括:6. The background noise decoding device according to claim 5, further comprising: 第二识别单元,所述第二识别单元用于识别语音帧和静音描述帧,并将所述静音描述帧送入第一识别单元,或者在识别出语音帧时通知所述解码切换单元;A second identification unit, the second identification unit is used to identify the speech frame and the silence description frame, and send the silence description frame to the first identification unit, or notify the decoding switching unit when the speech frame is recognized; 语音解码器,所述解码切换单元将第二识别单元识别出的语音帧送入语音解码器解码并连续输出。Speech decoder, the decoding switching unit sends the speech frame recognized by the second recognition unit to the speech decoder for decoding and continuous output. 7.如权利要求6所述的背景噪声解码装置,其特征在于,所述第一识别单元和第二识别单元合并设置。7. The background noise decoding device according to claim 6, wherein the first identification unit and the second identification unit are combined. 8.一种通信设备,包括:8. A communication device comprising: 第一语音检测器,用于检测采集的信号中是否包含有效语音并分类输出;The first voice detector is used to detect whether valid voice is included in the collected signal and to classify and output; 第一语音编码器,用于将包含有效语音的信号编码为语音帧并连续输出;The first speech coder is used for encoding the signal comprising effective speech into speech frames and outputting them continuously; 其特征在于,还包括背景噪声编码装置,该背景噪声编码装置具体包括:It is characterized in that it also includes a background noise coding device, and the background noise coding device specifically includes: 背景噪声分析单元,用于获取不包含有效语音的背景噪声信号的至少一个信号特征参数值;所述背景噪声信号包括背景噪声,所述背景噪声包括白噪声、背景嘈杂声和静音;A background noise analysis unit, configured to obtain at least one signal characteristic parameter value of a background noise signal that does not contain effective speech; the background noise signal includes background noise, and the background noise includes white noise, background noise and silence; 背景噪声分类单元,用于根据所述至少一个信号特征参数值确定背景噪声类型;A background noise classification unit, configured to determine the type of background noise according to the at least one signal characteristic parameter value; 编码切换单元和至少两个非连续传输单元,所述编码切换单元用于根据所述背景噪声类型将背景噪声信号输入对应的非连续传输单元,每一个非连续传输单元用于将背景噪声信号编码为包含非连续传输单元指示信息的静音描述帧并非连续传输,其中各非连续传输单元的静音描述帧编码方式和/或静音描述帧非连续传输机制不相同,各非连续传输单元的静音描述帧编码方式不相同具体为:不同静音描述帧的编码参数不相同,或者同一编码参数不同静音描述帧的量化精度不相同,各非连续传输单元的静音描述帧非连续传输机制不相同具体为:发送的相邻两个静音描述帧的传输间隔不相同。A coding switching unit and at least two discontinuous transmission units, the coding switching unit is used to input the background noise signal into the corresponding discontinuous transmission unit according to the background noise type, and each discontinuous transmission unit is used to code the background noise signal The silence description frame containing the indication information of the discontinuous transmission unit is not transmitted continuously, and the encoding method and/or discontinuous transmission mechanism of the silence description frame of each discontinuous transmission unit is different, and the silence description frame of each discontinuous transmission unit The encoding methods are different, specifically: the encoding parameters of different silence description frames are different, or the quantization accuracy of different silence description frames with the same encoding parameter is different, and the discontinuous transmission mechanisms of the silence description frames of each discontinuous transmission unit are different. Specifically: send The transmission intervals of two adjacent silence description frames are different. 9.如权利要求8所述的通信设备,其特征在于,所述背景噪声编码装置还包括:指令接收单元,用于接收切换模式控制指令并转发给编码切换单元,所述切换模式控制指令用于指示编码切换单元使用一种切换模式,任意两个切换模式之间至少有一个背景噪声类型对应不同的非连续传输单元。9. The communication device according to claim 8, wherein the background noise encoding device further comprises: an instruction receiving unit, configured to receive the switching mode control instruction and forward it to the encoding switching unit, and the switching mode control instruction uses In order to instruct the coding switching unit to use a switching mode, at least one background noise type between any two switching modes corresponds to different discontinuous transmission units. 10.如权利要求8所述的通信设备,其特征在于,所述编码切换单元还连接所述第一语音编码器,所述编码切换单元根据背景噪声分类单元识别出的其中一种指定背景噪声类型,将背景噪声信号输入第一语音编码器编码为语音帧并连续输出。10. The communication device according to claim 8, wherein the coding switching unit is further connected to the first speech coder, and the coding switching unit recognizes one of the specified background noises according to the background noise classification unit Type, the background noise signal is input into the first speech coder to be encoded into speech frames and output continuously. 11.如权利要求8所述的通信设备,其特征在于,所述背景噪声编码装置还包括:第二语音编码器,所述编码切换单元根据背景噪声分类单元识别出的其中一种指定背景噪声类型,将背景噪声信号输入第二语音编码器编码为语音帧并连续输出。11. The communication device according to claim 8, wherein the background noise coding device further comprises: a second speech coder, and the coding switching unit recognizes one of the specified background noises according to the background noise classification unit type, the background noise signal is input into the second speech coder to be encoded into speech frames and output continuously. 12.如权利要求8、9、10或11所述的通信设备,其特征在于,所述非连续传输单元具体包括:12. The communication device according to claim 8, 9, 10 or 11, wherein the discontinuous transmission unit specifically comprises: 非语音编码器,用于根据设定的静音描述帧编码方式将背景噪声信号编码为静音描述帧;A non-speech coder, used to encode the background noise signal into a silence description frame according to the set silence description frame encoding method; 静音描述帧发送器,用于根据设定的非连续传输机制发送所述静音描述帧。The silence description frame sender is configured to send the silence description frame according to the set discontinuous transmission mechanism. 13.如权利要求8所述的通信设备,其特征在于,还包括:13. The communication device of claim 8, further comprising: 第一识别单元,用于识别静音描述帧中的非连续传输单元指示信息;The first identification unit is used to identify the DTU indication information in the silence description frame; 解码切换单元和至少两个舒适噪声生成单元,所述解码切换单元用于根据所述识别单元的识别结果,将所述静音描述帧输入对应的舒适噪声生成单元,每一个舒适噪声生成单元根据接收的静音描述帧的编码方式对应解码静音描述帧,并重构背景噪声信号。A decoding switching unit and at least two comfort noise generating units, the decoding switching unit is used to input the silence description frame into the corresponding comfort noise generating unit according to the identification result of the identifying unit, and each comfort noise generating unit receives The encoding method of the silence description frame corresponds to decoding the silence description frame and reconstructing the background noise signal. 14.如权利要求13所述的通信设备,其特征在于,还包括:14. The communication device of claim 13, further comprising: 第二识别单元,所述第二识别单元用于识别语音帧和静音描述帧,并将所述静音描述帧送入第一识别单元,或者在识别出语音帧时通知所述解码切换单元;A second identification unit, the second identification unit is used to identify the speech frame and the silence description frame, and send the silence description frame to the first identification unit, or notify the decoding switching unit when the speech frame is recognized; 语音解码器,所述解码切换单元将第二识别单元识别出的语音帧送入语音解码器解码并连续输出。Speech decoder, the decoding switching unit sends the speech frame recognized by the second recognition unit to the speech decoder for decoding and continuous output. 15.如权利要求14所述的通信设备,其特征在于,所述第一识别单元和第二识别单元合并设置。15. The communication device according to claim 14, wherein the first identification unit and the second identification unit are combined. 16.一种通信设备,包括背景噪声解码装置,其特征在于,所述背景噪声解码装置包括:16. A communication device, comprising a background noise decoding device, characterized in that the background noise decoding device comprises: 第一识别单元,用于识别静音描述帧中的非连续传输单元指示信息;The first identification unit is used to identify the DTU indication information in the silence description frame; 解码切换单元和至少两个舒适噪声生成单元,所述解码切换单元用于根据所述识别单元的识别结果,将所述静音描述帧输入对应的舒适噪声生成单元,每一个舒适噪声生成单元根据接收的静音描述帧的编码方式对应解码静音描述帧,并重构背景噪声信号;所述背景噪声信号包括背景噪声,所述背景噪声包括白噪声、背景嘈杂声和静音。A decoding switching unit and at least two comfort noise generating units, the decoding switching unit is used to input the silence description frame into the corresponding comfort noise generating unit according to the identification result of the identifying unit, and each comfort noise generating unit receives The encoding method of the silence description frame corresponds to decoding the silence description frame, and reconstructing the background noise signal; the background noise signal includes background noise, and the background noise includes white noise, background noise and silence. 17.如权利要求16所述的通信设备,其特征在于,还包括:17. The communication device of claim 16, further comprising: 第二识别单元,所述第二识别单元用于识别语音帧和静音描述帧,并将所述静音描述帧送入第一识别单元,或者在识别出静音描述帧时通知所述解码切换单元;A second identification unit, the second identification unit is used to identify the speech frame and the silence description frame, and send the silence description frame to the first identification unit, or notify the decoding switching unit when the silence description frame is recognized; 语音解码器,所述解码切换单元将第二识别单元识别出的语音帧送入语音解码器解码并输出。A speech decoder, wherein the decoding switching unit sends the speech frame recognized by the second recognition unit to the speech decoder for decoding and outputting. 18.如权利要求17所述的通信设备,其特征在于,所述第一识别单元和第二识别单元合并设置。18. The communication device according to claim 17, wherein the first identification unit and the second identification unit are combined. 19.一种背景噪声编码方法,其特征在于,包括:19. A background noise encoding method, comprising: 获取背景噪声信号的至少一个信号特征参数值;所述背景噪声信号包括背景噪声,所述背景噪声包括白噪声、背景嘈杂声和静音;Obtaining at least one signal characteristic parameter value of a background noise signal; the background noise signal includes background noise, and the background noise includes white noise, background noise and silence; 根据所述至少一个信号特征参数值确定背景噪声类型;determining a background noise type according to the at least one signal characteristic parameter value; 根据所述背景噪声类型选择一种非连续传输机制将背景噪声信号编码为包含非连续传输机制指示信息的静音描述帧并非连续发送,其中每一种非连续传输机制中的静音描述帧编码方式和/或静音描述帧非连续发送方式不相同,各非连续传输单元的静音描述帧编码方式不相同具体为:不同静音描述帧的编码参数不相同,或者同一编码参数不同静音描述帧的量化精度不相同,各非连续传输单元的静音描述帧非连续传输机制不相同具体为:发送的相邻两个静音描述帧的传输间隔不相同。Select a discontinuous transmission mechanism according to the background noise type to encode the background noise signal into a silence description frame containing the discontinuous transmission mechanism indication information and send it non-continuously, wherein the silence description frame encoding method in each discontinuous transmission mechanism and /or the non-continuous transmission modes of the silence description frames are different, and the coding methods of the silence description frames of each discontinuous transmission unit are different, specifically: the encoding parameters of different silence description frames are different, or the quantization accuracy of the silence description frames with the same encoding parameters is different Similarly, the discontinuous transmission mechanisms of the silence description frames of each discontinuous transmission unit are different, specifically: the transmission intervals of two adjacent silence description frames to be sent are different. 20.如权利要求19所述的编码方法,其特征在于,所述根据背景噪声类型选择一种非连续传输机制是:根据设定的非连续传输机制选择模式中背景噪声类型和非连续传输机制之间的对应关系进行选择的,任意两个切换模式之间至少有一个背景噪声类型对应的非连续传输机制不相同。20. The encoding method according to claim 19, wherein said selecting a discontinuous transmission mechanism according to the background noise type is: selecting the background noise type and the discontinuous transmission mechanism in the mode according to the set discontinuous transmission mechanism If the corresponding relationship is selected, at least one discontinuous transmission mechanism corresponding to the background noise type is different between any two switching modes. 21.如权利要求19所述的编码方法,其特征在于,还包括:当所述至少一个信号特征参数值满足设定条件时,将背景噪声信号编码为全速率的语音帧并连续发送。21. The encoding method according to claim 19, further comprising: when the at least one signal characteristic parameter value satisfies a set condition, encoding the background noise signal into a full-rate speech frame and sending it continuously. 22.如权利要求19、20或21所述的编码方法,其特征在于,所述信号特征参数包括如下之一或任意组合:长时SNR、背景噪声估计电平、背景噪声能量波动估计、背景噪声谱波动估计和背景噪声波动频率。22. The encoding method according to claim 19, 20 or 21, wherein the signal characteristic parameters include one or any combination of the following: long-term SNR, background noise estimation level, background noise energy fluctuation estimation, background noise Noise spectral fluctuation estimation and background noise fluctuation frequency. 23.如权利要求22所述的编码方法,其特征在于,根据各信号特征参数值的大小,所述背景噪声类型包括稳态背景噪声信号和非稳态背景噪声信号。23. The encoding method according to claim 22, characterized in that, according to the value of each signal characteristic parameter, the background noise type includes a steady-state background noise signal and a non-stationary background noise signal. 24.如权利要求23所述的编码方法,其特征在于,当所述信号特征参数包括长时SNR时:24. The encoding method according to claim 23, wherein when the signal characteristic parameter comprises a long-term SNR: 所述稳态背景噪声信号进一步包括:稳态大SNR背景噪声信号和稳态小SNR背景噪声信号;The steady-state background noise signal further includes: a steady-state large SNR background noise signal and a steady-state small SNR background noise signal; 所述非稳态背景噪声信号进一步包括:非稳态大SNR背景噪声信号和非稳态小SNR背景噪声信号。The unsteady background noise signal further includes: an unsteady large SNR background noise signal and an unsteady small SNR background noise signal. 25.如权利要求19所述的编码方法,其特征在于,不同静音描述帧编码方式之间的区别包括:至少一个静音描述帧编码参数不相同,或者同一静音描述帧编码参数的量化精度不相同。25. The encoding method according to claim 19, wherein the differences between different silence description frame encoding methods include: at least one silence description frame encoding parameter is different, or the quantization precision of the same silence description frame encoding parameter is different . 26.如权利要求25所述的编码方法,其特征在于,所述的静音描述帧编码参数包括合成滤波器参数和激励参数。26. The encoding method according to claim 25, characterized in that, said silence description frame encoding parameters include synthesis filter parameters and excitation parameters. 27.如权利要求26所述的编码方法,其特征在于,所述的激励参数包括自适应码本参数和/或固定码本参数。27. The encoding method according to claim 26, wherein the excitation parameters include adaptive codebook parameters and/or fixed codebook parameters. 28.如权利要求19所述的编码方法,其特征在于,不同静音描述帧非连续发送方式之间的区别包括:发送的相邻两个静音描述帧之间的传输间隔不相同。28. The encoding method according to claim 19, wherein the difference between discontinuous transmission modes of different silence description frames comprises: the transmission interval between two adjacent silence description frames sent is different. 29.如权利要求28所述的编码方法,其特征在于,所述的传输间隔根据背景信号的变化自适应确定,并且不同静音描述帧非连续发送方式之间的最小间隔不相同。29. The encoding method according to claim 28, wherein the transmission interval is adaptively determined according to the change of the background signal, and the minimum intervals between the discontinuous transmission modes of different silence description frames are different. 30.如权利要求28所述的编码方法,其特征在于,各静音描述帧非连续发送方式的传输间隔为不相同的固定值。30. The encoding method according to claim 28, wherein the transmission intervals of the discontinuous transmission modes of each silence description frame are different fixed values. 31.一种对权利要求19所述方法生成的静音描述帧进行解码的方法,其特征在于,包括:31. A method for decoding the silence description frame generated by the method according to claim 19, comprising: 识别静音描述帧中的非连续传输机制指示信息,其中每一种非连续传输机制中的静音描述帧编码方式不相同;Identify the discontinuous transmission mechanism indication information in the silence description frame, where the encoding methods of the silence description frame in each discontinuous transmission mechanism are different; 根据所述非连续传输机制指示信息,采用对应的解码方式解码静音描述帧,并重构为背景噪声信号。According to the discontinuous transmission mechanism indication information, the silence description frame is decoded in a corresponding decoding manner, and reconstructed into a background noise signal.
CN201210194567.8A 2007-06-05 2007-06-05 Background noise coding/decoding device and method as well as communication equipment Active CN102760441B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210194567.8A CN102760441B (en) 2007-06-05 2007-06-05 Background noise coding/decoding device and method as well as communication equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210194567.8A CN102760441B (en) 2007-06-05 2007-06-05 Background noise coding/decoding device and method as well as communication equipment

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN 200710110573 Division CN101320563B (en) 2007-06-05 2007-06-05 Background noise encoding/decoding device, method and communication equipment

Publications (2)

Publication Number Publication Date
CN102760441A CN102760441A (en) 2012-10-31
CN102760441B true CN102760441B (en) 2014-03-12

Family

ID=47054882

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210194567.8A Active CN102760441B (en) 2007-06-05 2007-06-05 Background noise coding/decoding device and method as well as communication equipment

Country Status (1)

Country Link
CN (1) CN102760441B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12354613B2 (en) 2013-02-22 2025-07-08 Telefonaktiebolaget Lm Ericsson (Publ) Methods and apparatuses for DTX hangover in audio coding

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104978970B (en) 2014-04-08 2019-02-12 华为技术有限公司 A noise signal processing and generating method, codec and codec system
CN113593602B (en) * 2021-07-19 2023-12-05 深圳市雷鸟网络传媒有限公司 Audio processing method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1168751A (en) * 1994-12-05 1997-12-24 诺基亚电信公司 Method for substituting bar speech frames in digital communication system
CN1327574A (en) * 1998-12-07 2001-12-19 三菱电机株式会社 Sound decoding device and sound decoding method
JP2003501925A (en) * 1999-06-07 2003-01-14 エリクソン インコーポレイテッド Comfort noise generation method and apparatus using parametric noise model statistics
CN101320563B (en) * 2007-06-05 2012-06-27 华为技术有限公司 Background noise encoding/decoding device, method and communication equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1168751A (en) * 1994-12-05 1997-12-24 诺基亚电信公司 Method for substituting bar speech frames in digital communication system
CN1327574A (en) * 1998-12-07 2001-12-19 三菱电机株式会社 Sound decoding device and sound decoding method
JP2003501925A (en) * 1999-06-07 2003-01-14 エリクソン インコーポレイテッド Comfort noise generation method and apparatus using parametric noise model statistics
CN101320563B (en) * 2007-06-05 2012-06-27 华为技术有限公司 Background noise encoding/decoding device, method and communication equipment

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12354613B2 (en) 2013-02-22 2025-07-08 Telefonaktiebolaget Lm Ericsson (Publ) Methods and apparatuses for DTX hangover in audio coding

Also Published As

Publication number Publication date
CN102760441A (en) 2012-10-31

Similar Documents

Publication Publication Date Title
CN101320563B (en) Background noise encoding/decoding device, method and communication equipment
CN101131817B (en) Method and apparatus for robust speech classification
CN1703737B (en) Method for interoperation between adaptive multi-rate wideband (AMR-WB) and multi-mode variable bit-rate wideband (VMR-WB) codecs
KR100964402B1 (en) Method and apparatus for determining encoding mode of audio signal and method and apparatus for encoding / decoding audio signal using same
CN1828722B (en) Complex signal activated detection for improved speech/noise classification of an audio signal
KR100883656B1 (en) Method and apparatus for classifying audio signals and method and apparatus for encoding / decoding audio signals using the same
CN103247293B (en) Coding method and decoding method for voice data
CN1954364B (en) Audio encoding with different encoding frame lengths
CN101335000B (en) Encoding method and device
CN100485337C (en) Selection of coding models for encoding an audio signal
CN1815558B (en) Low bit-rate coding of unvoiced segments of speech
AU2005215744A1 (en) Classification of audio signals
CN103050122B (en) MELP-based (Mixed Excitation Linear Prediction-based) multi-frame joint quantization low-rate speech coding and decoding method
CN104517612A (en) Variable-bit-rate encoder, variable-bit-rate decoder, variable-bit-rate encoding method and variable-bit-rate decoding method based on AMR (adaptive multi-rate)-NB (narrow band) voice signals
EP1617416B1 (en) Method and apparatus for subsampling phase spectrum information
CN101217037A (en) Method and system for source control of encoding rate of audio signal
CN102760441B (en) Background noise coding/decoding device and method as well as communication equipment
CN101572090B (en) Self-adapting multi-rate narrowband coding method and coder
Bhatt et al. Overall performance evaluation of adaptive multi rate 06.90 speech codec based on code excited linear prediction algorithm using MATLAB
EP1035538B1 (en) Multimode quantizing of the prediction residual in a speech coder
Gournay et al. A 1200 bits/s HSX speech coder for very-low-bit-rate communications
KR20080091305A (en) Audio encoding with different coding models
KR20070017379A (en) Selection of Coding Models for Coding Audio Signals
Liang et al. A new 1.2 kb/s speech coding algorithm and its real-time implementation on TMS320LC548
KR20070017378A (en) Audio encoding with different coding models

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant