CN101453517A - Noise generating apparatus and method - Google Patents

Noise generating apparatus and method Download PDF

Info

Publication number
CN101453517A
CN101453517A CNA2008101896425A CN200810189642A CN101453517A CN 101453517 A CN101453517 A CN 101453517A CN A2008101896425 A CNA2008101896425 A CN A2008101896425A CN 200810189642 A CN200810189642 A CN 200810189642A CN 101453517 A CN101453517 A CN 101453517A
Authority
CN
China
Prior art keywords
parameter
noise
frame
reconstruction
noise parameter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2008101896425A
Other languages
Chinese (zh)
Other versions
CN101453517B (en
Inventor
张德明
代金良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN 200810189642 priority Critical patent/CN101453517B/en
Publication of CN101453517A publication Critical patent/CN101453517A/en
Application granted granted Critical
Publication of CN101453517B publication Critical patent/CN101453517B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Mobile Radio Communication Systems (AREA)

Abstract

The invention discloses a noise generating method, which comprises the following steps: acquiring an initial value of a reconstruction parameter according to a pre-acquired noise parameter; acquiring a random value-taking range according to the initial value of the reconstruction parameter; randomly taking a value as a reconstructed noise parameter within the random value-taking range; and generating noise according to the reconstructed noise parameter. The invention also discloses a noise generating device, which comprises an initial value unit, a range unit, a reconstruction unit and a synthesis unit, wherein the initial value unit is used for acquiring the initial value of the reconstruction parameter according to the pre-acquired noise parameter; the range unit is used for acquiring the random value-taking range according to initial value of the reconstruction parameter; the reconstruction unit is used for randomly taking the value as the reconstructed noise parameter; and the synthesis unit is used for synthesizing the noise according to the reconstructed noise parameter. The method and the device can adapt to a plurality of standard protocols, and ensure that a decoding end can recover the noise which makes users feel more comfortable.

Description

Noise generating apparatus, and method
Technical field
The present invention relates to communication technical field, relate in particular to a kind of noise generating apparatus, reach method.
Background technology
In the process of transferring voice, can use speech coding technology that voice messaging is compressed usually, to increase capability of communication system.
Because when carrying out voice communication, have only time of about 40% to comprise voice, all be quiet or background noise At All Other Times, and the people that carry out voice communication usually are concerned about all is the content of voice, to time of having only quiet or background noise and be indifferent to, therefore when voice messaging is compressed, can encode according to diverse ways and transmit at voice, quiet or background noise, with further raising capability of communication system.Discontinuous transmission system/comfort noise generates (DTX/CNG, DiscontinuousTransmission System/Comfortable Noise Generation), and a kind of technology that is used for further improving capacity of communication system comes to this.
The DTX/CNG technology is commonly referred to quiet insertion and describes (SID the encode frame that obtains of background noise, Silence Insertion Descriptor) frame, in common speech frame, can comprise spectrum parameter, signal energy gain parameter, fixed codebook, reach the relevant parameter of adaptive codebook, decoding end just can recover original speech data according to these information after receiving speech frame, and generally only comprising spectrum parameter and signal energy gain parameter in the SID frame, decoding end is only carried out the recovery of background noise according to spectrum parameter and signal energy gain parameter.This be because the user usually and be indifferent to have comprised what information in the background noise, therefore the SID frame can only transmit very a spot of reference information, also promptly compose parameter and signal energy gain parameter, decoding end is carried out the recovery of background noise according to these reference informations, make the user can roughly recognize the other side and be in what environment, and the acoustical quality that can obviously not influence the user gets final product.When carrying out voice transfer, some frames of being separated by just send the SID frame one time, and coding parameter does not send or the frame of at all encoding is commonly referred to tone-off (NO_DATA) frame.
In the speech coding standard that each big organisations and institutions formulates, all there is the concrete application of DTX/CNG technology in recent years.
At third generation partnership project (3GPP, Third Generation Partnership Projects) speech coding standard---adaptive multi-rate vocoder (AMR, Adaptive Multi-Rate) the DTX/CNG technology that adopts in, be according to per 8 frames of fixed intervals and send a SID frame, the parameter that the two continuous frames SID frame decoding that utilization receives goes out, also be signal energy gain parameter and spectrum parameter, carry out linear interpolation, to estimate the synthetic parameters needed of noise, be with equation expression:
P n + k = 8 - k 8 P sid ( n - 1 ) + k 8 P sid ( n ) ( k = 1 , · · · , 8 )
P wherein N+kThe estimated value of representing the CNG parameter of n SID frame k frame afterwards, P Sid (n-1)The parameter of n-1 the SID frame that the expression decoding end receives, P Sid (n)The parameter of n the SID frame that the expression decoding end receives.When n=0, P Sid (1)Mean value for hangover stages 8 frame speech frame spectrum parameter and signal energy gain parameter.
At (the ITU of International Telecommunications Union, International Telecommunication Union) speech coding standard---in the silence compression scheme of conjugated structure algebraic codebook Excited Linear Prediction vocoder definition, the DTX/CNG technology that adopts, be in the situation of change of coding side according to noise parameter, determine whether to send SID adaptively, the interval minimum of front and back two frame SID is 20 milliseconds, and maximum is not then limit.The CNG algorithm that adopts in decoding end can be expressed as with formula:
Reconstruction to the signal energy gain parameter:
Figure A200810189642D00072
Reconstruction to the spectrum parameter:
Figure A200810189642D00073
LSF t,sub_2=LSF sid_new
Wherein
Figure A200810189642D00074
The signal energy gain parameter that the up-to-date SID frame decoding that the expression decoding end receives goes out, LSF Sid_lastThe spectrum parameter that the SID that the expression decoding end last time receives decodes, LSF Sid_newThe spectrum parameter that the up-to-date SID that receives of expression decoding end decodes.
In research and practice process to prior art, the inventor finds that there is following problem in prior art:
The speech coding standard of 3GPP---the DTX/CNG technology that adopts among the AMR only sends the situation of SID frame at coding side according to fixed intervals, and what use at coding side is self adaptation when sending the SID frame at interval, can't operate as normal.
The speech coding standard of ITU---the DTX/CNG technology that adopts in the silence compression scheme of conjugated structure algebraic codebook Excited Linear Prediction vocoder definition, when present frame is SID, spectrum parameter that use decodes and Last SID frame on average go out the spectrum parameter of first subframe of present frame, and the spectrum parameter of second subframe is then directly used the spectrum parameter that decodes; Tone-off frame between next SID frame arrives, the then direct spectrum parameter reconstruction noise of using nearest SID frame decoding to go out, when the spectrum parameter of next SID frame arrival and spectrum parameter that decodes and former frame SID frame has difference, discontinuity will appear, and because the spectrum parameter is an amount that is in the continuous variation, therefore former and later two spectrum parameters are normally differentiated, so spectrum of the comfort noise of rebuilding, be easy to occur discontinuity, and then have influence on acoustical quality, especially obvious when former and later two spectrum parameter difference are big.
Summary of the invention
The technical problem that the embodiment of the invention will solve provides a kind of noise generating apparatus, reaches method, can adapt to the multiple standards agreement, decoding end is recovered make the user feel more comfortable noise.
For solving the problems of the technologies described above, the embodiment of the invention provides a kind of noise generation method on the one hand, and described method comprises:
According to the noise parameter that obtains in advance, obtain the reconstruction parameter initial value; Obtain the random value scope according to described reconstruction parameter initial value; Random value is as the noise parameter of rebuilding in described random value scope; Noise parameter generted noise according to described reconstruction.
On the other hand, provide a kind of noise generating apparatus, described device comprises:
The initial value unit is used for obtaining the reconstruction parameter initial value according to the noise parameter that obtains in advance;
Range cells is used for obtaining the random value scope according to described reconstruction parameter initial value;
Reconstruction unit is used in described random value scope random value as the noise parameter of rebuilding;
Synthesis unit is used for the noise parameter composite noise according to described reconstruction.
Above technical scheme as can be seen, the consensus standard that the embodiment of the invention is used coding side without limits, no matter coding side sends the SID frame according to fixed intervals, or self adaptation sends the SID frame at interval, can operate as normal.
And because after receiving first SID frame, when receiving new SID frame once more, the capital is taken at noise parameter that the former frame of the up-to-date SID frame of receiving rebuilds as described reconstruction parameter initial value, and with reference to the noise parameter of this reconstruction parameter initial value and the up-to-date SID frame of receiving, determine a random value scope, random value is as noise parameter in this scope, and the noise transition of generation is more natural, can bring the sense of hearing preferably to experience to the user.
Description of drawings
Noise generation method embodiment one flow chart that Fig. 1, the embodiment of the invention provide;
Noise generation method embodiment two flow charts that Fig. 2, the embodiment of the invention provide;
Noise generation method embodiment three flow charts that Fig. 3, the embodiment of the invention provide;
Noise generation method embodiment four flow charts that Fig. 4, the embodiment of the invention provide;
The noise generating apparatus example structure figure that Fig. 5, the embodiment of the invention provide.
Embodiment
The embodiment of the invention provides a kind of noise generating apparatus, has reached method, can adapt to the multiple standards agreement, decoding end is recovered make the user feel more comfortable noise.
The noise generation method embodiment that the embodiment of the invention provides by the noise parameter in a spot of SID frame, rebuilds the noise parameter of change at random, curve smoothing in decoding end, makes the user feel more comfortable noise to recover.
Noise generation method embodiment one flow process that the embodiment of the invention provides comprises as shown in Figure 1:
Step 101, obtain the noise parameter that carries in the SID frame.
After the beginning voice communication, decoding end is deciphered out frame information from the audio data stream that receives, and then the form of this frame is judged, if this frame is a speech frame, then enters the speech frame handling process; If non-speech frame as SID frame or tone-off frame, then enters the noise generation method embodiment flow process that present embodiment provides.
When handling non-speech frame, owing to do not comprise any speech data in the tone-off frame, directly enter step 102, when receiving the SID frame, will obtain the noise parameter that carries in this SID frame, i.e. signal energy gain parameter and spectrum parameter.
Step 102, rebuild continuous noise parameter according to prediction direction change at random, curve smoothing according to the noise parameter that obtains, comprise signal energy gain parameter and spectrum parameter, present frame is a non-speech frame, comprises SID frame and tone-off frame.
Can not depart from actual value too far away for the noise parameter that makes reconstruction, at first will determine a central value for the change curve of the noise parameter rebuild, and the noise parameter value of reconstruction is moved about near this central value, and this central value can be called as the center C of moving about k, also to determine simultaneously the scope of moving about, the noise parameter value that makes reconstruction is with C kBe the center, in this scope, move about that this scope of moving about can be called the radius Δ that moves about.
The move about method of radius Δ of acquisition has a variety ofly, and present embodiment provides wherein two kinds: a kind ofly be the time interval k acquisition according to noise parameter increment dP, predicting interval length l ength and present frame and the up-to-date SID frame of receiving; A kind of is to obtain according to noise parameter increment dP, predicting interval length l ength.
When obtaining to move about the radius Δ according to first method, the radius Δ that moves about of present frame noise parameter can be expressed as with formula:
Δ = dP 2 ( | k - length | + 1 )
Wherein length supposes promptly that for the up-to-date SID frame of receiving of prediction and the gap length between the next SID frame elapsed time length can receive next frame SID frame.
When present frame is a decoding end when receiving the first frame SID frame after voice segments, noise parameter increment dP can utilize the up-to-date SID frame noise parameter P that receives Sid, or the energy gain parameter of several frame speech frames of the past of storing in the buffer area and the acquisition of spectrum parameter.
When decoding end received the first frame non-speech frame after speech frame, present embodiment provided two kinds of methods that obtain the noise parameter increment:
Method one, the energy gain parameter of utilizing a few frame speech frames of the past of storing in the buffer area and spectrum parameter estimate over average energy gain parameter and compose parameter, as reconstruction parameter initial value P Ref, with the up-to-date noise parameter P that receives SidWith reconstruction parameter initial value P RefDifference as noise parameter increment dP, this moment, noise parameter increment dP can be with formulae express:
dP=P sid-P ref
Estimation reconstruction parameter initial value P RefMode can be to adopt the mean value of former frame energy gain parameters and spectrum parameter as reconstruction parameter initial value P Ref, also can be to adopt the weighted average of former frame energy gain parameters and spectrum parameter as reconstruction parameter initial value P Ref
Method two, the energy gain parameter and the spectrum parameter that directly adopt the up-to-date SID frame of receiving to carry, rebuild this SID frame to the noise between the next SID frame, when receiving the next SID frame of this SID frame, begin the reconstruction noise parameter again, energy gain parameter that the first frame SID frame carries after the employing speech frame and spectrum parameter are as reconstruction parameter initial value P Ref, with the up-to-date noise parameter P that receives SidWith reconstruction parameter initial value P RefDifference as noise parameter increment dP, this moment, noise parameter increment dP can be with formulae express:
dP=P sid-P ref
If SID frame or the tone-off frame received after the first frame SID frame, present embodiment provide two kinds of methods that obtain the noise parameter increment:
Method one, the noise parameter P that rebuilds with the up-to-date SID frame former frame that receives K-1For rebuilding initial parameter value P Ref, the up-to-date SID frame noise parameter P that receives SidWith reconstruction parameter initial value P RefDifference as noise parameter increment dP, this moment, noise parameter increment dP can be with formulae express:
dP=P sid-P ref
The difference of the noise parameter that method two, the noise parameter that carries with the up-to-date SID frame of receiving and former frame SID frame carry is that the n frame is an example as noise parameter increment dP with the up-to-date SID frame of receiving, noise parameter increment dP can be expressed as with formula:
dP=P sid(n)-P sid(n-1)
Before receiving next SID frame, when being two tone-off frame reconstruction noise parameters between the SID frame, the noise parameter increment dP that can use the SID frame of receiving recently is the tone-off frame radius Δ of determining to move about, also can be when at every turn being new tone-off frame reconstruction noise, upgrade noise parameter increment dP, present embodiment provides two kinds of methods of upgrading noise parameter increment dP:
Method one, the up-to-date SID frame noise parameter P that receives SidWith reconstruction parameter initial value P RefDifference as noise parameter increment dP, when being tone-off frame reconstruction noise parameter, the noise parameter P that rebuilds with former frame K-1Upgrade reconstruction parameter initial value P Ref, then use reconstruction parameter initial value P RefThe noise parameter increment dP that obtains also can correspondingly be updated.
Method two, be d with the noise parameter of the SID frame that receives recently and the difference of the noise parameter that former frame SID frame carries 0, the noise parameter of rebuilding with the former frame of the SID frame that receives recently is P 0, present frame is the k frame of the up-to-date SID frame that receives of distance, the noise parameter increment of present frame is d k, with d 0Deduct reconstruction parameter initial value P RefWith P 0Difference obtain the noise parameter increment d of present frame k, make d k=dP, d at this moment kCan be expressed as with formula:
d k=d 0-(P ref-P 0)
When being tone-off frame reconstruction noise parameter, with the noise parameter P of former frame reconstruction K-1Upgrade reconstruction parameter initial value P Ref, then use reconstruction parameter initial value P RefThe noise parameter increment d that obtains kAlso can correspondingly be updated.
Just the move about value direction of radius Δ of the prediction direction of change curve, and the value direction of the radius Δ that moves about is subjected to the influence of noise parameter increment dP, when noise parameter increment dP was "+", the Δ value was "+"; When noise parameter increment dP was "-", the Δ value was "-".
When present frame was the SID frame, k was " 0 ",
2(|k-length|+1)=2(length+1)
Δ = dP 2 ( length + 1 )
The duration of the no segment that constitutes along with the tone-off frame is elongated, and the k value slowly becomes greatly, when noise parameter increment dP is constant, 2 (| value k-length|+1) will slowly diminish, and it is big that the value of Δ then can slowly become.
Work as k=length, when promptly present frame is a length frame behind the up-to-date SID frame of receiving,
2(|k-length|+1)=2
Δ = dP 2
If also do not receive new SID frame behind this frame, the k value continues to increase, when noise parameter increment dP is constant, 2 (| value k-length|+1) will slowly become greatly, and the value of Δ then can slowly diminish.
So be two tone-off frame reconstruction noise parameters between the SID frame, when noise parameter increment dP was constant, the value of Δ was that an initial value equals
Figure A200810189642D00123
Maximum equals Then slow numerical value of decaying.If noise parameter increment dP is also changing thereupon, then the variation of the value of Δ will be subjected to corresponding influence.
When obtaining to move about the radius Δ according to second method, the radius Δ that moves about of present frame noise parameter can be expressed as with formula:
Δ = dP 2 * length
It is basic identical to obtain the move about method of radius Δ of the method for noise parameter increment dP and predicting interval length l ength and first kind of acquisition mentioned above.
At this moment, the value direction of the radius Δ that moves about still is subjected to the influence of noise parameter increment dP, and when noise parameter increment dP was "+", the Δ value was "+"; When noise parameter increment dP was "-", the Δ value was "-".
The center C of moving about of present frame noise parameter kCan pass through reconstruction parameter initial value P RefObtain the center C of moving about with the radius Δ that moves about of present frame noise parameter kCan be expressed as with formula:
C k=P ref+2Δ
Wherein, reconstruction parameter initial value P RefCan upgrade when reconstruction noise parameter each time, be P with current noise parameter k, then with P K-1Upgrade P Ref, the center C of moving about this moment kCan be expressed as with formula:
C k=P k-1+2Δ
With C kBe the center, at [C k-| Δ |, C k+ | Δ |] adopt the method for random value in interval, reconstruct the noise parameter P of present frame k, noise parameter P kCan be expressed as with formula:
P k=rand(C k-|Δ|,C k+|Δ|)
When present frame is the SID frame, when the Δ value is "+", C kAlso greater than the noise parameter P of former frame K-1, [C k-| Δ |, C k+ | Δ |] following being limited to:
C k-|Δ|=P k-1
[C k-| Δ |, C k+ | Δ |] lower limit compare P K-1High Δ, when adopting first method to obtain Δ, the value initial value of Δ equals Be noise parameter increment dP Relative noise parameter increase dP is very little value, therefore [a C k-| Δ |, C k+ | Δ |] lower limit be one and compare P K-1High slightly numerical value.When adopting second method to obtain Δ, Δ = P sid - P k - 1 2 * length , The value of Δ is the noise parameter increment
Figure A200810189642D00134
Relative noise parameter increase dP is very little value, therefore [a C k-| Δ |, C k+ | Δ |] lower limit also be one and compare P K-1High slightly numerical value.
[C k-| Δ |, C k+ | Δ |] on be limited to:
C k+|Δ|=P k-1+3Δ
[C k-| Δ |, C k+ | Δ |] the upper limit compare P K-1High 3 Δs when adopting first method to obtain Δ, are that " 2 " are example with the length value, and the value of 3 Δs is noise parameter increment dP's
Figure A200810189642D00135
Still be less than noise parameter increment dP, i.e. [C k-| Δ |, C k+ | Δ |] the upper limit less than P K-1With noise parameter increment dP and.
When adopting second method to obtain Δ, be that " 2 " are example with the length value, the value of 3 Δs is P SidWith P K-1Difference Still be less than noise parameter increment dP, i.e. [C k-| Δ |, C k+ | Δ |] the upper limit less than P K-1, with noise parameter increment dP's and, and second method is applied to adopting fixed intervals to send the occasion of SID frame usually, length generally can be more much bigger than " 2 " in the time of this, the value of 3 Δs is just littler.
In like manner, if present frame is the SID frame, when the Δ value is "-", [C k-| Δ |, C k+ | Δ |] lower limit can be than the up-to-date SID frame noise parameter P that receives SidHeight, the upper limit can be than the noise parameter P of former frame K-1Low slightly.
Therefore when present frame is the SID frame, at [C k-| Δ |, C k+ | Δ |] interval in the noise parameter P of random value k, can be a noise parameter P who compares former frame K-1Vicissitudinous slightly parameter, this variation is by the up-to-date SID frame noise parameter P that receives SidInfluenced, even gentle variation is the up-to-date SID frame noise parameter P that receives SidNoise parameter P with former frame K-1, difference is very big, P kAlso can be an excessive more level and smooth value, according to P kThe noise that generates also can change comparatively mitigation, can bring the user and experience preferably.
When present frame is the tone-off frame, reconstruction parameter initial value P RefNoise parameter P for the reconstruction of former frame K-1, the center C of moving about kBe subjected to reconstruction parameter initial value P RefInfluence, can mild variation take place to the value direction of the radius Δ that moves about, at [C k-| Δ |, C k+ | Δ |] interval in the noise parameter P of random value k, can be a noise parameter P who compares former frame K-1The continuous noise parameter P that reconstructs between the vicissitudinous slightly parameter, two SID frames kCan be an excessive more level and smooth value, according to P kThe noise that generates also can change comparatively mitigation, can bring the user and experience preferably.
Further, the radius Δ that moves about between two SID frames may be changed by the influence of k value or dP value, and the scope of random value also can correspondingly change, the continuous noise parameter P that reconstructs between two SID frames kCan be variation curve more at random, according to P kMore different variation also can take place in the noise that generates, and can bring the user and experience preferably.
In some cases, when present frame is the tone-off frame, may before arriving, next frame SID frame not upgrade reconstruction parameter initial value P yet Ref, will rely on this moment the variation of the radius Δ that moves about to change the scope of random value.
In the present embodiment, reconstruction parameter initial value P RefComprise: reconstruction signal energy gain initial parameter value, reconstruction spectrum initial parameter value.
The noise parameter generted noise that step 103, utilization are rebuild.
Decoding end utilizes random sequence generator to synthesize pumping signal, this pumping signal is when reconstruction noise, be equivalent to the SID frame and compare the content that normal speech frame lacks, as fixed codebook, and the relevant parameter of adaptive codebook etc., decoding end is according to the general character of noise, utilize random sequence generator to synthesize pumping signal, in order to reconstruction noise.
Utilize the method for the noise parameter generted noise of pumping signal and reconstruction to have two kinds:
First kind, decoding end are with the spectrum parameter in the noise parameter of rebuilding, be converted to the composite filter coefficient, pumping signal is carried out synthetic filtering, obtain noise signal, then synthetic noise signal is carried out the time domain shaping with the energy gain parameter in the noise parameter of rebuilding, carry out reprocessing, promptly may be output as final reconstruction noise.
The synthetic pumping signal of energy gain parameter in the noise parameter that second kind, decoding end utilization are rebuild and random sequence generator, then with the spectrum parameter in the noise parameter of rebuilding, be converted to the composite filter coefficient, pumping signal is carried out synthetic filtering, obtain noise signal.
In the present embodiment, the consensus standard that coding side is used without limits, no matter coding side sends the SID frame according to fixed intervals, or self adaptation sends the SID frame at interval, can operate as normal.
And owing to receive noise parameter that new SID frame all can rebuild with reference to former frame, and the noise parameter newly received at every turn, the reconstruction noise parameter, the noise transition that generates is more natural, the user has the sense of hearing preferably and experiences, simultaneously also with reference to the influence of actual noise parameter, make the user can tell roughly voice environment; Further when handling the tone-off frame, according to the change direction of the noise parameter of the distance between tone-off frame and the nearest SID frame, nearest SID frame, and the noise parameter of nearest SID frame and the difference of reconstruction parameter initial value, for the small noise parameter of variation is compared in this tone-off frame reconstruction with former frame, make that the noise parameter change curve that reconstructs is comparatively level and smooth, therefore nature is also compared in the transition between the every frame of noise that generates, and can bring the sense of hearing preferably to experience to the user.
The noise generation method embodiment two that the embodiment of the invention provides, coding side adopt self adaptation to send the SID frame at interval, and flow process comprises as shown in Figure 2:
Step 201, reception SID frame obtain the noise parameter that wherein carries.
After the beginning voice communication, decoding end is deciphered out frame information from the audio data stream that receives, and then the form of this frame is judged, if this frame is a speech frame, then enters the speech frame handling process; If non-speech frame as SID frame or tone-off frame, then enters the noise generation method embodiment flow process that present embodiment provides.
When handling non-speech frame, owing to do not comprise any speech data in the tone-off frame, directly enter step 202 usually, when receiving the SID frame, will obtain the noise parameter that wherein carries, i.e. signal energy gain parameter G SidWith spectrum parameter l sf Sid
Step 202, acquisition reconstruction parameter initial value.
Decoding end is detecting frame type when speech frame switches to non-speech frame, when promptly receiving the first frame SID frame, by the past N that stores in the buffering area pThe energy gain parameter of frame and spectrum parameter calculate average energy gain parameter G RefWith spectrum parameter l sf RefAs reconstruction parameter initial value, N herein pValue is the integer greater than 0, for example N p=5, the frame in past can be a speech frame, also can be the SID frame.Rebuild energy gain initial parameter value G RefCompose initial parameter value lsf with rebuilding RefAs follows with equation expression:
lsf ref = 1 N p Σ i = 1 N p lsf i
G ref = 1 N p Σ i = 1 N p G i
If the SID frame that receives is not the first frame SID frame, the energy gain parameter of rebuilding with this SID frame former frame and compose parameter then as the reconstruction parameter initial value.
When being tone-off frame reconstruction noise parameter, the energy gain parameter and the spectrum parameter update reconstruction parameter initial value that can all use former frame to rebuild can not upgrade the reconstruction parameter initial value at every turn yet before next frame SID frame arrives in the present embodiment.
Step 203, reconstruction noise parameter.
When changing the noise section over to, when also promptly receiving behind the speech frame the first frame SID frame, the length initial value is changed to N from voice segments p, when receiving the SID frame once more afterwards, get the gap length between up-to-date SID frame and its previous SID frame.In order to guarantee the efficient of DTX, in general can the transmission of SID frame be limited at interval, promptly length must be more than or equal to a natural number, and for example regulation length must be more than or equal to 2 in the agreement of version G.729B.
The energy gain parameter that decoding obtains from nearest SID frame is G Sid, the spectrum parameter is lsf Sid, for k frame behind this SID frame, the noise parameter increment d of its energy gain parameter K, GCan be expressed as with formula:
d kG=G sid-G ref
The radius Δ that moves about of its energy gain parameter GCan be expressed as with formula:
Δ G = d k , G 2 ( | k - length | + 1 )
The noise parameter increment d of its spectrum parameter K, lsfCan be expressed as with formula:
d k,lsf=lsf sid-lsf ref
The radius Δ that moves about of its spectrum parameter i LsfCan be expressed as with formula:
Δ i lsf = d k , lsf 2 ( | k - length | + 1 ) i = 1,2 , · · · , M
Wherein M is the exponent number of spectrum linear-in-the-parameter prediction.
Then rebuild the center C of moving about of energy gain parameter in the reconstruction noise parameter of present frame G, kCan be expressed as with formula:
C G,k=G ref+2Δ G
Rebuild the center of moving about of spectrum parameter in the reconstruction noise parameter of present frame
Figure A200810189642D00173
Can be expressed as with formula:
C lsf , k i = lsf ref + 2 Δ i lsf
Rebuild energy gain parameter G in the reconstruction noise parameter of present frame kCan be expressed as with formula:
G k=rand(C G,k-|Δ G|,C G,k+|Δ G|)
Rebuild the spectrum parameter in the reconstruction noise parameter of present frame
Figure A200810189642D00175
Can be expressed as with formula:
lsf k i = rand ( C lsf , k i - | Δ i lsf | , C lsf , k i + | Δ i lsf | )
Wherein (a b) is meant in interval [a, b] and gets equally distributed random number function rand.
If when receiving new SID frame, correlated variables is upgraded with following algorithm:
length=k-1;
G ref=G k-1
lsf ref = lsf k - 1 i ;
Make k=1 at last;
If what receive is the tone-off frame, when upgrading the reconstruction parameter initial value, make:
G ref=G k
lsf ref=lsf k
Upgrade rebuilding initial parameter value, make k=k+1 then.
Continue to reconstruct the noise parameter of this frame, up to receiving new SID frame.
The noise parameter generted noise that step 204, utilization are rebuild.
Adopt random sequence to generate white-noise excitation signal e (n);
With the spectrum parameter l sf that rebuilds kStructure composite filter a k(z);
Pumping signal composite filter synthetic filtering with generation:
y k(n)=e(n) *a k(n)
Then with synthetic noise y k(n) use the energy gain parameter G that rebuilds kCarry out the time domain shaping:
y ( n ) = y k ( n ) × G k Σ i = 0 N - 1 y k 2 ( n )
Wherein N is a frame length, can recover comfort noise in decoding end.
The method of the noise parameter generted noise that the utilization that present embodiment step 204 adopts is rebuild is the method one of utilizing the noise parameter generted noise of pumping signal and reconstruction mentioned above.
In the present embodiment, the consensus standard that coding side is used without limits, no matter coding side sends the SID frame according to fixed intervals, or self adaptation sends the SID frame at interval, can operate as normal.
And because when turning to the noise section from voice segments, adopt the average energy gain parameter of last voice segments and compose parameter as initial value, with reference to the noise parameter of newly receiving, the reconstruction noise parameter, when this had just guaranteed to switch from voice segments to the noise section, the noise of generation and the transition of voice segments were more natural, and the user has the sense of hearing preferably and experiences, simultaneously because with reference to the influence of actual noise parameter, make the user can tell roughly voice environment; Receive that at every turn new SID frame all can adopt the noise parameter of former frame reconstruction as initial value, with reference to the noise parameter of newly receiving, the reconstruction noise parameter, the noise transition that generates is more natural, the user has the sense of hearing preferably and experiences, simultaneously also with reference to the influence of actual noise parameter, make the user can tell roughly voice environment; Further when handling the tone-off frame, according to the change direction of the noise parameter of the distance between tone-off frame and the nearest SID frame, nearest SID frame, and the noise parameter of nearest SID frame and the difference of reconstruction parameter initial value, for the small noise parameter of variation is compared in this tone-off frame reconstruction with former frame, make that the noise parameter change curve that reconstructs is comparatively level and smooth, therefore nature is also compared in the transition between the every frame of noise that generates, and can bring the sense of hearing preferably to experience to the user.
The noise generation method embodiment three that the embodiment of the invention provides, coding side adopt fixed intervals to send the SID frame, and its flow process comprises as shown in Figure 3:
Step 301, reception SID frame obtain the noise parameter that wherein carries.
After the beginning voice communication, decoding end is deciphered out frame information from the audio data stream that receives, and then the form of this frame is judged, if this frame is a speech frame, then enters the speech frame handling process; If non-speech frame as SID frame or tone-off frame, then enters the noise generation method embodiment flow process that present embodiment provides.
When handling non-speech frame, owing to do not comprise any speech data in the tone-off frame, directly enter step 302 usually, when receiving the SID frame, will obtain the noise parameter that wherein carries, i.e. signal energy gain parameter G Sid, spectrum parameter l sf Sid
Step 302, acquisition reconstruction parameter initial value.
Coding side adopts fixing SID frame period to send the SID frame, supposes that here SID interframe is divided into LENGTH, and the LENGTH value is the natural number greater than 0.
Decoding end is detecting frame type when speech frame switches to non-speech frame, when promptly receiving the first frame SID frame, and with the reconstruction noise parameter of the noise parameter in the SID frame that receives as following LENGTH frame, and as reconstruction noise energy gain parameter G RefWith spectrum parameter l sf RefInitial value, rebuild energy gain initial parameter value G RefCompose initial parameter value lsf with rebuilding RefAs follows with equation expression:
lsf ref=lsf sid(1)
G ref=G sid(1)
Step 303, reconstruction noise parameter.
The reconstruction noise parameter is after receiving second SID frame, and the energy gain parameter that decoding obtains from nearest SID frame is G Sid, the spectrum parameter is lsf Sid, for k frame behind this SID frame, the noise parameter increment d of its energy gain parameter K, GCan be expressed as with formula:
d kG=G sid-G ref
The radius Δ that moves about of its energy gain parameter GCan be expressed as with formula:
Δ G = d k , G 2 * LENGTH
The noise parameter increment d of its spectrum parameter K, lsfCan be expressed as with formula:
d k,lsf=lsf sid-lsf ref
The radius Δ that moves about of its spectrum parameter i LsfCan be expressed as with formula:
Δ i lsf = d k , lsf 2 * LENGTH i = 1,2 , · · · , M
Wherein M is the exponent number of linear prediction.
Then rebuild the center C of moving about of energy gain parameter in the reconstruction noise parameter of present frame G, kCan be expressed as with formula:
C G,k=G ref+2Δ G
Rebuild the center of moving about of spectrum parameter in the reconstruction noise parameter of present frame Can be expressed as with formula:
C lsf , k i = lsf ref + 2 Δ i lsf
Rebuild energy gain parameter G in the reconstruction noise parameter of present frame kCan be expressed as with formula:
G k=rand(C G,k-|Δ G|,C G,k+|Δ G|)
Rebuild the spectrum parameter in the reconstruction noise parameter of present frame
Figure A200810189642D00205
Can be expressed as with formula:
lsf k i = rand ( C lsf , k i - | Δ i lsf | , C lsf , k i + | Δ i lsf | )
Wherein (a b) is meant in interval [a, b] and gets equally distributed random number function rand.
If when receiving new SID frame, correlated variables is upgraded with following algorithm:
length=k-1;
G ref=G k-1
lsf ref=lsf k-1
Make k=1 at last;
If what receive is the tone-off frame, when upgrading the reconstruction parameter initial value, make:
G ref=G k
lsf ref=lsf k
Upgrade rebuilding initial parameter value, make k=k+1 then.
Continue to reconstruct the noise parameter of this frame, up to receiving new SID frame.
The noise parameter generted noise that step 304, utilization are rebuild.
Use the energy gain parameter G of random sequence generator and reconstruction kSynthetic white-noise excitation signal e (n);
With the spectrum parameter l sf that rebuilds kStructure composite filter a k(z);
Pumping signal composite filter synthetic filtering with generation:
y k(n)=e(n) *a k(n)
Filtering Processing after the warp can recover comfort noise in decoding end again.
The method of the noise parameter generted noise that the utilization that present embodiment step 304 adopts is rebuild is the method two that utilizes the noise parameter generted noise of pumping signal and reconstruction mentioned above.
In the present embodiment, the consensus standard that coding side is used without limits, no matter coding side sends the SID frame according to fixed intervals, still self adaptation sends the SID frame at interval, can reconstruct and change smoother noise parameter, comprise energy gain parameter, spectrum parameter etc., and then generate the comfort noise of nature.
Because when changing the noise section over to from voice segments, adopt the noise parameter of the up-to-date SID frame of receiving to generate the first frame SID frame to the noise between the next SID frame, receive that at every turn new SID frame all can adopt the noise parameter of former frame reconstruction as initial value, with reference to the noise parameter of newly receiving, the reconstruction noise parameter, generted noise, because when voice segments changes the noise section over to, the SID frame that sends is very near from voice segments, so directly use the noise parameter of the up-to-date SID frame of receiving to generate the first frame SID frame to the noise between the next SID frame, it is more natural that voice segments changes the transition meeting of noise section over to, and the interval of two frame SID frames is very short, noise does not change in the of short duration time, is that ordinary people's the sense of hearing can't be found, the user has the sense of hearing preferably and experiences; Receive that at every turn new SID frame all can adopt the noise parameter of former frame reconstruction as initial value, with reference to the noise parameter of newly receiving, the reconstruction noise parameter, the noise transition that generates is more natural, the user has the sense of hearing preferably and experiences, simultaneously also with reference to the influence of actual noise parameter, make the user can tell roughly voice environment; Further when handling the tone-off frame, according to the change direction of the noise parameter of the distance between tone-off frame and the nearest SID frame, nearest SID frame, and the noise parameter of nearest SID frame and the difference of reconstruction parameter initial value, for the small noise parameter of variation is compared in this tone-off frame reconstruction with former frame, make that the noise parameter change curve that reconstructs is comparatively level and smooth, therefore nature is also compared in the transition between the every frame of noise that generates, and can bring the sense of hearing preferably to experience to the user.
The noise generation method embodiment four that the embodiment of the invention provides, coding side adopt self adaptation to send the SID frame at interval, and flow process comprises as shown in Figure 4:
Step 401, reception SID frame obtain the noise parameter that wherein carries.
After the beginning voice communication, decoding end is deciphered out frame information from the audio data stream that receives, and then the form of this frame is judged, if this frame is a speech frame, then enters the speech frame handling process; If non-speech frame as SID frame or tone-off frame, then enters the noise generation method embodiment flow process that present embodiment provides.
When handling non-speech frame, owing to do not comprise any speech data in the tone-off frame, directly enter step 402 usually, when receiving the SID frame, will obtain the noise parameter that wherein carries, i.e. signal energy gain parameter G SidWith spectrum parameter l sf Sid
Step 402, acquisition reconstruction parameter initial value.
Decoding end is detecting frame type when speech frame switches to non-speech frame, when promptly receiving the first frame SID frame, supposes that the signal energy gain parameter that obtains this moment is G from this frame Sid (1), the spectrum parameter is lsf Sid (1), then rebuild energy gain initial parameter value G RefCompose initial parameter value lsf with rebuilding RefAvailable equation expression is:
G ref=G sid(1)
lsf ref=lsf sid(1)
If the SID frame that receives is not the first frame SID frame, the energy gain parameter of rebuilding with this SID frame former frame and compose parameter then as the reconstruction parameter initial value.
When being tone-off frame reconstruction noise parameter, the energy gain parameter and the spectrum parameter update reconstruction parameter initial value that can all use former frame to rebuild can not upgrade the reconstruction parameter initial value at every turn yet before next frame SID frame arrives in the present embodiment.
Step 403, reconstruction noise parameter.
When changing the noise section over to, when also promptly receiving behind the speech frame the first frame SID frame, the length initial value is changed to N from voice segments p, when receiving the SID frame once more afterwards, get the gap length between up-to-date SID frame and its previous SID frame.In order to guarantee the efficient of DTX, in general can the transmission of SID frame be limited at interval, promptly length must be more than or equal to a natural number, and for example regulation length must be more than or equal to 2 in the agreement of version G.729B.
The decoder energy gain parameter that obtains of decoding from receive up-to-date SID frame is G Sid (n), the spectrum parameter is lsf Sid (n), (n=1,2 ...), make:
d 0,G=G sid(n)-G sid(n-1)
d 0,lsf=lsf sid(n)-lsf sid(n-1)
Then for k frame behind n the SID frame, the noise parameter increment d of its energy gain parameter K, GCan be expressed as with formula:
d k,G=d 0,G-(G ref-G 0)
Wherein, G RefBe the reconstruction parameter initial value of energy gain parameter, G 0The energy gain parameter of rebuilding for the former frame of the SID frame that receives recently.
When this SID frame that receives recently is the first frame SID frame, G 0Be the past N that stores in the buffering area pThe weighted average G of the energy gain parameter of frame Sid (0)G Sid (0)Available equation expression is as follows:
G sid ( 0 ) = Σ i = 1 N p w i × G i
W wherein iBe weights, satisfy relation Σ i = 1 N p w i = 1 .
The radius Δ that moves about of its energy gain parameter GCan be expressed as with formula:
Δ G = d k , G 2 ( | k - length | + 1 )
The noise parameter increment of its spectrum parameter
Figure A200810189642D00234
Can be expressed as with formula:
d k , lsf i = d 0 , lsf - ( lsf ref - lsf 0 )
Wherein, lsf RefBe the reconstruction parameter initial value of spectrum parameter, lsf 0The spectrum parameter of rebuilding for the former frame of the SID frame that receives recently.
When this SID frame that receives recently is the first frame SID frame, lsf 0Be the past N that stores in the buffering area pThe weighted average lsf of the energy gain parameter of frame Sid (0)Lsf Sid (0)Available equation expression is as follows:
lsf sid ( 0 ) = lsf 0 = Σ i = 1 N p w i × lsf i
W wherein iBe weights, satisfy relation Σ i = 1 N p w i = 1 .
The radius that moves about of its spectrum parameter
Figure A200810189642D00243
Can be expressed as with formula:
Δ i lsf = d k , lsf i 2 ( | k - length | + 1 ) i = 1,2 , · · · , M
Wherein M is the exponent number of spectrum linear-in-the-parameter prediction.
Then rebuild the center C of moving about of energy gain parameter in the reconstruction noise parameter of present frame G, kCan be expressed as with formula:
C G,k=G ref+2Δ G
Rebuild the center of moving about of spectrum parameter in the reconstruction noise parameter of present frame
Figure A200810189642D00245
Can be expressed as with formula:
C lsf , k i = lsf ref + 2 Δ i lsf
Rebuild energy gain parameter G in the reconstruction noise parameter of present frame kCan be expressed as with formula:
G k=rand(C G,k-|Δ G|,C G,k+|Δ G|)
Rebuild the spectrum parameter in the reconstruction noise parameter of present frame
Figure A200810189642D00247
Can be expressed as with formula:
lsf k i = rand ( C lsf , k i - | Δ i lsf | , C lsf , k i + | Δ i lsf | )
Wherein (a b) is meant in interval [a, b] and gets equally distributed random number function rand.
If when receiving new SID frame, correlated variables is upgraded with following algorithm:
length=k-1;
G ref=G k-1
lsf ref = lsf k - 1 i ;
Make k=1 at last;
If what receive is the tone-off frame, when upgrading the reconstruction parameter initial value, make:
G ref=G k
lsf ref=lsf k
Upgrade rebuilding initial parameter value, make k=k+1 then.
Continue to reconstruct the noise parameter of this frame, up to receiving new SID frame.
The noise parameter generted noise that step 404, utilization are rebuild.
Adopt random sequence to generate white-noise excitation signal e (n);
With the spectrum parameter l sf that rebuilds kStructure composite filter a k(z);
Pumping signal composite filter synthetic filtering with generation:
y k(n)=e(n) *a k(n)
Then with synthetic noise y k(n) use the energy gain parameter G that rebuilds kCarry out the time domain shaping:
y ( n ) = y k ( n ) × G k Σ i = 0 N - 1 y k 2 ( n )
Wherein N is a frame length, can recover comfort noise in decoding end.
The method of the noise parameter generted noise that the utilization that present embodiment step 404 adopts is rebuild is the method one of utilizing the noise parameter generted noise of pumping signal and reconstruction mentioned above.
In the present embodiment, the consensus standard that coding side is used without limits, no matter coding side sends the SID frame according to fixed intervals, still self adaptation sends the SID frame at interval, can reconstruct and change smoother noise parameter, comprise energy gain parameter, spectrum parameter etc., and then generate the comfort noise of nature.
Because when changing the noise section over to from voice segments, the noise parameter that adopts the up-to-date SID frame of receiving is as initial value, with reference to the noise parameter of newly receiving, the reconstruction noise parameter, because when voice segments changes the noise section over to, the SID frame that sends is very near from voice segments, thus the noise parameter that directly uses the up-to-date SID frame of receiving as initial value, it is more natural that voice segments changes the transition meeting of noise section over to; Receive that at every turn new SID frame all can adopt the noise parameter of former frame reconstruction as initial value, with reference to the noise parameter of newly receiving, the reconstruction noise parameter, the noise transition that generates is more natural, the user has the sense of hearing preferably and experiences, simultaneously also with reference to the influence of actual noise parameter, make the user can tell roughly voice environment; Further influence the noise parameter increment of reconstruction noise stochastic parameter span, it is difference according to nearest SID frame and former frame SID frame, and the difference acquisition of the noise parameter rebuild of reconstruction parameter initial value and nearest SID frame former frame, level and smooth variation can take place compared with former frame in the span by this noise parameter increment influence, the reconstruction noise parameter of random value also can be subjected to corresponding influence in this scope, make that the noise parameter change curve that reconstructs is comparatively level and smooth, therefore nature is also compared in the transition between the every frame of noise that generates, and can bring the sense of hearing preferably to experience to the user.
The noise generating apparatus embodiment that the embodiment of the invention provides is usually located at decoding end, can rebuild the noise parameter of change at random, curve smoothing by the noise parameter in a spot of SID frame, makes the user feel more comfortable noise to recover.
The noise generating apparatus example structure that the embodiment of the invention provides comprises as shown in Figure 5:
Initial value unit 5100 is used for obtaining the reconstruction parameter initial value according to the noise parameter that obtains in advance;
Range cells 5200 is used for obtaining the random value scope according to described reconstruction parameter initial value;
Reconstruction unit 5300 is used in described random value scope random value as the noise parameter of rebuilding;
Synthesis unit 5400 is used for the noise parameter composite noise according to described reconstruction.
Decoding end utilizes random sequence generator to synthesize pumping signal, this pumping signal is when reconstruction noise, be equivalent to the SID frame and compare the content that normal speech frame lacks, as fixed codebook, and the relevant parameter of adaptive codebook etc., decoding end is according to the general character of noise, utilize random sequence generator to synthesize pumping signal, in order to reconstruction noise.
Synthesis unit 5400 utilizes the method for the noise parameter generted noise of pumping signal and reconstruction to have two kinds:
First kind, synthesis unit 5400 are with the spectrum parameter in the noise parameter of rebuilding, be converted to the composite filter coefficient, pumping signal is carried out synthetic filtering, obtain noise signal, then synthetic noise signal is carried out the time domain shaping with the energy gain parameter in the noise parameter of rebuilding, carry out reprocessing, promptly may be output as final reconstruction noise.
Second kind, synthesis unit 5400 utilizes energy gain parameter and the synthetic pumping signal of random sequence generator in the noise parameter of rebuilding, then with the spectrum parameter in the noise parameter of rebuilding, be converted to the composite filter coefficient, pumping signal is carried out synthetic filtering, obtain noise signal.
Wherein, initial value unit 5100 comprises:
The first initial value unit 5101 is used for when receiving first quiet insertion descriptor frame, and the mean value of getting the noise parameter of a predetermined number frame before the described quiet insertion descriptor frame is as the reconstruction parameter initial value;
The second initial value unit 5102, be used for after receiving first quiet insertion descriptor frame, during when receiving quiet insertion descriptor frame once more or for tone-off frame reconstruction noise parameter, be taken at noise parameter that the former frame of the up-to-date quiet insertion descriptor frame of receiving rebuilds as described reconstruction parameter initial value.
Range cells 5200 comprises:
Increment unit 5210 is used for obtaining the noise parameter increment according to the noise parameter that obtains from quiet insertion descriptor frame;
Interval acquiring unit 5220 is used to obtain predicting interval length;
Radius acquiring unit 5230 obtains the radius that moves about according to described predicting interval length and described noise parameter increment;
The center acquiring unit is used for obtaining the center of moving about according to described reconstruction parameter initial value and the described radius that moves about;
Arithmetic element 5240, being used for the described center of moving about is the center of described random value scope, is the radius of described random value scope with the described radius that moves about, and determines described random value scope.
Wherein increment unit 5210 comprises:
First increment unit 5211 is used for difference with the noise parameter that obtains from the quiet insertion descriptor frame that obtains recently and described reconstruction parameter initial value as described noise parameter increment;
Or second increment unit 5212, be used for difference with noise parameter that obtains from the quiet insertion descriptor frame that obtains recently and the noise parameter that from the quiet insertion descriptor frame of former frame, obtains as described noise parameter increment;
Or the 3rd increment unit 5213, be used for the noise parameter that obtains from the quiet insertion descriptor frame that obtains recently difference with the noise parameter that from the quiet insertion descriptor frame of former frame, obtains, with the difference of described reconstruction parameter initial value and the difference of the reconstruction noise parameter of the quiet insertion descriptor frame former frame of obtaining recently as described noise parameter increment.
Radius acquiring unit 5230 comprises:
The first radius acquiring unit 5231, being used for being divided by with described noise parameter increment, with the described predicting interval length of twice obtains the described radius that moves about;
Or the second radius acquiring unit 5232, be used for obtaining the described radius that moves about according to the distance of described noise parameter increment, described predicting interval length, present frame and the up-to-date quiet insertion descriptor frame of receiving.
Interval acquiring unit 5220 comprises:
The first interval acquiring unit 5221 is used for when receiving first quiet insertion descriptor frame, with predetermined value as described gap length;
Or, the second interval acquiring unit 5222, be used for when receiving first quiet insertion descriptor frame, insert descriptor frame at interval as described gap length with the transmission sound of default.
The 3rd interval acquiring unit 5223, be used for after receiving first quiet insertion descriptor frame, during when receiving quiet insertion descriptor frame once more or for tone-off frame reconstruction noise parameter, be described predicting interval length with gap length between described up-to-date quiet insertion descriptor frame of receiving and the quiet insertion descriptor frame last time received.
The noise generation method embodiment that the method for operation of the noise generating apparatus embodiment that the embodiment of the invention provides and the embodiment of the invention mentioned above provide is similar substantially, no longer is repeated in this description at this.
In the present embodiment, the consensus standard that coding side is used without limits, no matter coding side sends the SID frame according to fixed intervals, or self adaptation sends the SID frame at interval, can operate as normal.
And owing to receive noise parameter that new SID frame all can rebuild with reference to former frame, and the noise parameter newly received at every turn, the reconstruction noise parameter, the noise transition that generates is more natural, the user has the sense of hearing preferably and experiences, simultaneously also with reference to the influence of actual noise parameter, make the user can tell roughly voice environment; Further when handling the tone-off frame, according to the change direction of the noise parameter of the distance between tone-off frame and the nearest SID frame, nearest SID frame, and the noise parameter of nearest SID frame and the difference of reconstruction parameter initial value, for the small noise parameter of variation is compared in this tone-off frame reconstruction with former frame, make that the noise parameter change curve that reconstructs is comparatively level and smooth, therefore nature is also compared in the transition between the every frame of noise that generates, and can bring the sense of hearing preferably to experience to the user.
One of ordinary skill in the art will appreciate that all or part of step that realizes in the foregoing description method is to instruct relevant hardware to finish by program, described program can be stored in a kind of computer-readable recording medium, this program is when carrying out, the above-mentioned storage medium of mentioning can be a read-only memory, disk or CD etc.
More than to a kind of noise generating apparatus provided by the present invention, and method be described in detail, used specific case herein principle of the present invention and execution mode are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that all can change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (17)

1, a kind of noise generation method is characterized in that, described method comprises:
According to the noise parameter that obtains in advance, obtain the reconstruction parameter initial value; Obtain the random value scope according to described reconstruction parameter initial value; Random value is as the noise parameter of rebuilding in described random value scope; Noise parameter generted noise according to described reconstruction.
2, noise generation method as claimed in claim 1 is characterized in that, when receiving first quiet insertion descriptor frame, obtains described reconstruction parameter initial value and comprises:
Get the mean value of the noise parameter of a predetermined number frame before described first quiet insertion descriptor frame or weighted average as described reconstruction parameter initial value.
3, noise generation method as claimed in claim 1 or 2 is characterized in that, after receiving first quiet insertion descriptor frame, during when receiving quiet insertion descriptor frame once more or for tone-off frame reconstruction noise parameter, obtains described reconstruction parameter initial value and comprises:
Be taken at noise parameter that the former frame of the up-to-date quiet insertion descriptor frame of receiving rebuilds as described reconstruction parameter initial value.
4, noise generation method as claimed in claim 1 is characterized in that, obtains the random value scope according to described reconstruction parameter initial value and comprises:
Obtain the noise parameter increment according to the noise parameter that from quiet insertion descriptor frame, obtains; Obtain predicting interval length, obtain the radius that moves about according to predicting interval length and described noise parameter increment; Obtain the center of moving about according to described reconstruction parameter initial value and the described radius that moves about; With the described center of moving about is the center of described random value scope, is the radius of described random value scope with the described radius that moves about, and determines described random value scope.
5, noise generation method as claimed in claim 4 is characterized in that, obtains the center of moving about according to described reconstruction parameter initial value and the described radius that moves about and comprises:
With described reconstruction parameter initial value and the described radius that moves about of twice and for the described center of moving about.
6, noise generation method as claimed in claim 4 is characterized in that, the noise parameter that described basis is obtained from quiet insertion descriptor frame obtains the noise parameter increment and comprises:
With the difference of the noise parameter that from the quiet insertion descriptor frame that obtains recently, obtains and described reconstruction parameter initial value as described noise parameter increment;
Or with the difference of the noise parameter that from the quiet insertion descriptor frame that obtains recently, obtains and the noise parameter that from the quiet insertion descriptor frame of former frame, obtains as described noise parameter increment;
Or with the noise parameter that from the quiet insertion descriptor frame that obtains recently, obtains difference with the noise parameter that from the quiet insertion descriptor frame of former frame, obtains, with the difference of described reconstruction parameter initial value and the difference of the reconstruction noise parameter of the quiet insertion descriptor frame former frame of obtaining recently as described noise parameter increment.
7, noise generation method as claimed in claim 4 is characterized in that, describedly obtains the radius that moves about according to predicting interval length and described noise parameter increment and comprises:
With
Figure A200810189642C00031
Be the described radius that moves about;
Or with
Figure A200810189642C00032
Be the described radius that moves about;
Wherein, dP is that described noise parameter increment, length are that described predicting interval length, k are the distance of present frame and the up-to-date quiet insertion descriptor frame of receiving.
8, noise generation method as claimed in claim 4 is characterized in that, when receiving first quiet insertion descriptor frame, obtains described predicting interval length and comprises:
With predetermined value as described predicting interval length;
Transmission sound with default inserts descriptor frame at interval as described predicting interval length.
9, as claim 4 or 7 described noise generation methods, it is characterized in that, after receiving first quiet insertion descriptor frame, during when receiving quiet insertion descriptor frame once more or for tone-off frame reconstruction noise parameter, obtain described predicting interval length and comprise:
With gap length between described up-to-date quiet insertion descriptor frame of receiving and the quiet insertion descriptor frame last time received is described predicting interval length.
10, a kind of noise generating apparatus is characterized in that, described device comprises:
The initial value unit is used for obtaining the reconstruction parameter initial value according to the noise parameter that obtains in advance;
Range cells is used for obtaining the random value scope according to described reconstruction parameter initial value;
Reconstruction unit is used in described random value scope random value as the noise parameter of rebuilding;
Synthesis unit is used for the noise parameter composite noise according to described reconstruction.
11, noise generating apparatus as claimed in claim 10 is characterized in that, described initial value unit comprises:
The first initial value unit is used for when receiving first quiet insertion descriptor frame, and the mean value of getting the noise parameter of a predetermined number frame before the described quiet insertion descriptor frame is as the reconstruction parameter initial value.
As claim 10 or 11 described noise generating apparatus, it is characterized in that 12, described initial value unit comprises:
The second initial value unit, be used for after receiving first quiet insertion descriptor frame, during when receiving quiet insertion descriptor frame once more or for tone-off frame reconstruction noise parameter, be taken at noise parameter that the former frame of the up-to-date quiet insertion descriptor frame of receiving rebuilds as described reconstruction parameter initial value.
13, noise generating apparatus as claimed in claim 10 is characterized in that, described range cells comprises:
Increment unit is used for obtaining the noise parameter increment according to the noise parameter that obtains from quiet insertion descriptor frame;
The interval acquiring unit is used to obtain predicting interval length;
The radius acquiring unit obtains the radius that moves about according to described predicting interval length and described noise parameter increment;
The center acquiring unit is used for obtaining the center of moving about according to described reconstruction parameter initial value and the described radius that moves about;
Arithmetic element, being used for the described center of moving about is the center of described random value scope, is the radius of described random value scope with the described radius that moves about, and determines described random value scope.
14, noise generating apparatus as claimed in claim 13 is characterized in that, described increment unit comprises:
First increment unit is used for difference with the noise parameter that obtains from the quiet insertion descriptor frame that obtains recently and described reconstruction parameter initial value as described noise parameter increment;
Or second increment unit, be used for difference with noise parameter that obtains from the quiet insertion descriptor frame that obtains recently and the noise parameter that from the quiet insertion descriptor frame of former frame, obtains as described noise parameter increment;
Or the 3rd increment unit, be used for the noise parameter that obtains from the quiet insertion descriptor frame that obtains recently difference with the noise parameter that from the quiet insertion descriptor frame of former frame, obtains, with the difference of described reconstruction parameter initial value and the difference of the reconstruction noise parameter of the quiet insertion descriptor frame former frame of obtaining recently as described noise parameter increment.
15, noise generating apparatus as claimed in claim 13 is characterized in that, described radius acquiring unit comprises:
The first radius acquiring unit, being used for being divided by with described noise parameter increment, with the described predicting interval length of twice obtains the described radius that moves about;
Or the second radius acquiring unit, be used for obtaining the described radius that moves about according to the distance of described noise parameter increment, described predicting interval length, present frame and the up-to-date quiet insertion descriptor frame of receiving.
16, noise generating apparatus as claimed in claim 13 is characterized in that, described interval acquiring unit comprises:
The first interval acquiring unit is used for when receiving first quiet insertion descriptor frame, with predetermined value as described gap length;
Or, the second interval acquiring unit, be used for when receiving first quiet insertion descriptor frame, insert descriptor frame at interval as described gap length with the transmission sound of default.
As claim 13 or 16 described noise generating apparatus, it is characterized in that 17, described interval acquiring unit comprises:
The 3rd interval acquiring unit, be used for after receiving first quiet insertion descriptor frame, during when receiving quiet insertion descriptor frame once more or for tone-off frame reconstruction noise parameter, be described predicting interval length with gap length between described up-to-date quiet insertion descriptor frame of receiving and the quiet insertion descriptor frame last time received.
CN 200810189642 2007-09-28 2007-09-28 Noise generating apparatus and method Active CN101453517B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200810189642 CN101453517B (en) 2007-09-28 2007-09-28 Noise generating apparatus and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200810189642 CN101453517B (en) 2007-09-28 2007-09-28 Noise generating apparatus and method

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN2007101514089A Division CN101335003B (en) 2007-09-28 2007-09-28 Noise generating apparatus and method

Publications (2)

Publication Number Publication Date
CN101453517A true CN101453517A (en) 2009-06-10
CN101453517B CN101453517B (en) 2013-08-07

Family

ID=40735530

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200810189642 Active CN101453517B (en) 2007-09-28 2007-09-28 Noise generating apparatus and method

Country Status (1)

Country Link
CN (1) CN101453517B (en)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003501925A (en) * 1999-06-07 2003-01-14 エリクソン インコーポレイテッド Comfort noise generation method and apparatus using parametric noise model statistics
US7536298B2 (en) * 2004-03-15 2009-05-19 Intel Corporation Method of comfort noise generation for speech communication
CN101335003B (en) * 2007-09-28 2010-07-07 华为技术有限公司 Noise generating apparatus and method

Also Published As

Publication number Publication date
CN101453517B (en) 2013-08-07

Similar Documents

Publication Publication Date Title
CN101335003B (en) Noise generating apparatus and method
CN101483042B (en) Noise generating method and noise generating apparatus
CA2349944C (en) Speech coding with comfort noise variability feature for increased fidelity
KR102132798B1 (en) Noise signal processing and noise signal generation method, encoder, decoder and encoding and decoding system
JP6849619B2 (en) Add comfort noise to model background noise at low bitrates
JP4489959B2 (en) Speech synthesis method and speech synthesizer for synthesizing speech from pitch prototype waveform by time synchronous waveform interpolation
CN104584120B (en) Generate comfort noise
JP2007538283A (en) Audio coder mode switching support
JP5361909B2 (en) Method and means for encoding background noise information
WO2005041416A2 (en) Method and system for pitch contour quantization in audio coding
KR20160039297A (en) Apparatus and method for encoding and decoding an audio signal using an aligned look-ahead portion
CN104299614B (en) Coding/decoding method and decoding apparatus
EP1649453A1 (en) Low bit-rate audio encoding
CN112133315A (en) Determining budget for encoding LPD/FD transition frames
US5978761A (en) Method and arrangement for producing comfort noise in a linear predictive speech decoder
CN101393742A (en) Noise generating apparatus and method
CN101453517B (en) Noise generating apparatus and method
KR101166650B1 (en) Method and means for decoding background noise information
Gournay et al. Performance analysis of a decoder-based time scaling algorithm for variable jitter buffering of speech over packet networks
EP1387351A1 (en) Speech encoding device and method having TFO (Tandem Free Operation) function
JP2001094507A (en) Pseudo-backgroundnoise generating method
MX2008008477A (en) Method and device for efficient frame erasure concealment in speech codecs

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant