WO2010111876A1 - 一种信号去噪的方法和装置及音频解码系统 - Google Patents
一种信号去噪的方法和装置及音频解码系统 Download PDFInfo
- Publication number
- WO2010111876A1 WO2010111876A1 PCT/CN2009/076155 CN2009076155W WO2010111876A1 WO 2010111876 A1 WO2010111876 A1 WO 2010111876A1 CN 2009076155 W CN2009076155 W CN 2009076155W WO 2010111876 A1 WO2010111876 A1 WO 2010111876A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- adjusted
- frame
- coefficient
- spectral
- weighting
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 238000001228 spectrum Methods 0.000 claims abstract description 102
- 230000003595 spectral effect Effects 0.000 claims description 272
- 238000013139 quantization Methods 0.000 claims description 61
- 238000012937 correction Methods 0.000 claims description 10
- 230000001052 transient effect Effects 0.000 claims description 4
- 238000004260 weight control Methods 0.000 claims description 2
- 238000005516 engineering process Methods 0.000 abstract description 2
- 230000015572 biosynthetic process Effects 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- 238000010586 diagram Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 230000005236 sound signal Effects 0.000 description 7
- 238000004364 calculation method Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
Definitions
- the present invention relates to the field of audio codec technology, and in particular to a method and apparatus for signal denoising and an audio decoding system. Background technique
- BWE Band Width Extension
- the BWE parameter coding is characterized by less bit usage and bandwidth. Guaranteed, the quality is acceptable;
- the code rate is high, the spectrum of the broadband or ultra-wideband part is quantized and encoded. The characteristics of the quantization code are more bits, higher precision and better quality.
- FIG. 1 is a structural diagram of a prior art audio coding system supporting broadband or ultra-wideband.
- the coding system adopts a hierarchical structure: a core encoder encodes low frequency information, and outputs a first layer code stream.
- the BWE encoder encodes the high-band spectrum with less bits and outputs the second layer stream; the quantization encoder quantizes the high-band spectrum using the remaining bits and outputs the third-layer stream.
- FIG. 2 is a structural diagram of a prior art audio decoding system supporting broadband or ultra-wideband.
- the decoding system also adopts a hierarchical structure: a core decoder is used to decode a low-frequency first layer code stream. Information; BWE decoder is used to decode the bandwidth extended second layer stream The dequantization decoder is used to decode and dequantize the information of the third layer code stream of the remaining bits of the high frequency band; finally, the decoding system synthesizes the frequency bands of the three layers of code streams, and outputs the synthesized audio signal of the frequency band, due to the general core decoder.
- the output signal is a time domain signal
- the signal output by the BWE decoder and the dequantization decoder is a frequency domain signal, so in the frequency band synthesis, the frequency domain signals of the second and third layer code streams are converted into time domain signals.
- the audio signal of the time domain after the frequency band is synthesized.
- the decoding system can decode only the second layer code stream when the code rate is low, and obtain the BWE coded information to ensure the basic high-band quality; In the high case, the third layer code stream can be further decoded to obtain better high band quality.
- the quantizer performs bit allocation, and allocates a large number of bits to some important frequency bands for high-accuracy quantization. And assigning fewer bits to some less important bands for less accurate quantization, even assigning bits to some less important bands, that is, not doing more important band quantizers for this part. Quantify.
- the embodiment of the invention provides a signal denoising method and device and an audio decoding system, which can reduce the noise of the frequency band synthesis after decoding and improve the hearing effect.
- the method for denoising a signal includes: selecting at least two spectral coefficients having high correlation with a spectral coefficient to be adjusted according to an inter-frame correlation of a frame in which a spectral coefficient to be adjusted is located; The at least two spectral coefficients are weighted with the to-be-adjusted spectral coefficients to obtain a predicted value of the spectral coefficient to be adjusted; the decoded signal is spectrally adjusted by using the obtained predicted value, and the adjusted decoded signal is output.
- the device for denoising a signal includes: a selecting unit, configured to select at least two spectral coefficients having high correlation with a spectral coefficient to be adjusted according to an inter-frame correlation of a frame in which a spectral coefficient to be adjusted is located; a weighting unit, configured to perform weighting by using at least two spectral coefficients selected by the selecting unit and the spectral coefficient to be adjusted, to obtain a predicted value of the spectral coefficient to be adjusted; and adjusting an output unit, configured to use the prediction obtained by using the weighting unit The value is spectrally adjusted to the decoded signal, and the adjusted decoded signal is output.
- the audio decoding system includes a core decoder, a bandwidth extension decoder, a dequantization decoder, and the foregoing signal denoising device, wherein the core decoder is configured to decode information of a low frequency first layer code stream.
- the bandwidth extension decoder is configured to decode information of the bandwidth-spread second layer code stream;
- the dequantization decoder is configured to decode information of a third layer code stream that is used to dequantize the remaining bits of the high frequency band;
- Means configured to receive the bandwidth extension decoder and the Decoding the decoded information output by the decoder, determining the spectral coefficients to be adjusted in the decoded information, and adjusting the spectral coefficients in the decoded information according to the obtained predicted values of the spectral coefficients to be adjusted.
- the predicted value of the spectral coefficient to be adjusted is obtained by weighting at least two related spectral coefficients of the spectral coefficient to be adjusted, and the decoded signal is adjusted according to the predicted value of the spectral coefficient to be adjusted.
- the spectrum is adapted to the predicted spectral coefficients (ie, the predicted values of the spectral coefficients to be adjusted) and other related spectral coefficients, so that the spectral coefficients obtained by different quantization precisions are mutually adapted, thereby increasing the smoothness of the decoded signal spectrum and reducing Understand the noise of the band synthesis after the code, so that the audio signal after the band synthesis can achieve better hearing effect.
- FIG. 1 is a structural diagram of a conventional audio coding system
- FIG. 2 is a structural diagram of an existing audio decoding system
- FIG. 3 is a schematic flowchart of a signal denoising method according to Embodiment 1 of the present invention
- FIG. 4 is a schematic flowchart of a signal denoising method according to Embodiment 2 of the present invention
- FIG. 6 is a structural diagram of an audio decoding system according to Embodiment 5 of the present invention. detailed description
- an embodiment of the present invention provides a method for signal denoising, including:
- Step 31 Select at least two spectral coefficients having high correlation with the spectral coefficients to be adjusted according to the inter-frame correlation of the frame in which the spectral coefficients to be adjusted are located;
- Step 32 Perform weighting by using at least two selected spectral coefficients and the spectral coefficients to be adjusted, and obtain a predicted value of the spectral coefficient to be adjusted;
- Step 33 Perform spectrum adjustment on the decoded signal by using the obtained predicted value, and output the adjusted decoded signal.
- the signal denoising method provided by the embodiment of the present invention obtains a predicted value of the spectral coefficient to be adjusted by weighting at least two related spectral coefficients of the adjusted spectral coefficient, and adjusts the decoded signal according to the predicted value of the spectral coefficient to be adjusted.
- the spectrum is adapted to the predicted spectral coefficients (ie, the predicted values of the spectral coefficients to be adjusted) and other related spectral coefficients, so that the spectral coefficients obtained by different quantization precisions are mutually adapted, thereby increasing the smoothness of the decoded signal spectrum and reducing Understand the noise of the band synthesis after the code, so that the audio signal after the band synthesis can achieve better hearing effect.
- a method for signal denoising includes: Step 41: Determine a to-be-adjusted spectral coefficient in a decoded signal according to a quantization precision of a spectral coefficient.
- a decoded signal is output, which is a low frequency signal output by the core decoder and output by the B WE decoder.
- the bandwidth extension high frequency signal and the other high frequency signals of the dequantization decoder output, wherein the bandwidth extension high frequency signal output by the BWE decoder and the other high frequency signals output by the dequantization decoder are frequency domain signals.
- the determined spectral coefficients to be adjusted may include: unquantized spectral coefficients, and/or spectral coefficients whose quantization accuracy is lower than a certain quantization precision threshold, where the quantization precision threshold may be set as needed.
- Lbit can only represent the symbol information of the frequency point.
- No bit ie, O bit
- the frequency of the bit rate is 1 bit/frequency. There is no amplitude information.
- the quantization accuracy of the frequency can be considered. If it is 0), the frequency point is not quantized, and it is determined that the frequency point of the bit rate is 1 bit/frequency point is the frequency point to be adjusted.
- the average quantization precision of the vector where the frequency point is located may be first determined. If the quantization precision is less than a certain lower limit, such as 0.5 bit/frequency, it is determined that all frequency points in the vector need to be adjusted; if the average quantization precision is greater than a certain The upper threshold, such as 2 bit/frequency, determines that all frequency points in the vector do not need to be adjusted; if the average quantization precision is between the two, such as between 0.5 bit/frequency and 2 bit/frequency In between, it is further determined whether there are frequency points in the vector that are not vector quantized, and if so, these frequency points that are not vector quantized are determined to be adjusted, otherwise no adjustment is needed. Step 42: Select a weighting mode from three weighting modes of high inter-frame correlation, low inter-frame correlation, and medium inter-frame correlation according to the inter-frame correlation of the frame in which the spectral coefficient to be adjusted is located.
- a certain lower limit such as 0.5 bit/frequency
- the upper threshold such as 2 bit/frequency
- the correlation between the frames can be judged according to the parameters related to the correlation, for example, a BWE algorithm, which uses the frame type to characterize the inter-frame correlation: the transient type frame represents low inter-frame correlation; Harmonic type frames represent high inter-frame correlation; normal type frames represent inter-frame correlation.
- the frame type is a correlation-related parameter, and the correlation between the frames can be determined according to the frame type, thereby determining the weighting mode.
- the correlation between the frames can also be determined by calculation. For example, first, a certain correlation calculation method calculates the correlation between the frame of the spectrum coefficient to be adjusted and the adjacent frame. If the correlation is greater than the upper threshold, the value is to be adjusted. The inter-frame correlation of the frame in which the spectral coefficient is located is high; if the correlation is less than the lower threshold, the inter-frame correlation of the frame in which the spectral coefficient is to be adjusted is low; in other cases, for example, the correlation is between the upper threshold and the lower threshold In between, the inter-frame correlation of the frame in which the spectral coefficients are to be adjusted is adjusted.
- step 42 different weighting modes are selected according to the inter-frame correlation: when the inter-frame correlation is high, the high inter-frame correlation weighting mode is selected; when the inter-frame correlation is low, the low inter-frame correlation is selected. Weighting mode; When inter-frame correlation is selected, the inter-frame correlation weighting mode is selected. Different weighting modes correspond to different weighting weights for weighting the inter-frame spectral coefficients and the intra-frame spectral coefficients. In general, the higher the inter-frame correlation, the higher the weighting weight of the inter-frame spectral coefficients, and the lower the weighting weight of the intra-frame frequency coefficients; the lower the inter-frame correlation, the lower the weighting weight of the inter-frame spectral coefficients. The weighting weight of the intra-frame spectral coefficients is higher.
- the weighted weight of the inter-frame spectral coefficients is proportional to the inter-frame correlation, and the weighted weight of the intra-frame spectral information is inversely proportional to the inter-frame correlation.
- the interframe frequency The spectral coefficient weight is large, the intra-frame spectral coefficient weight is small or zero; for low inter-frame correlation, the intra-frame spectral coefficient weight is large, and the inter-frame spectral coefficient weight is small or zero;
- the correlation frame, the size of the intra-frame and inter-frame frequency coefficient weights can be determined by the comparison of the inter-frame and intra-frame correlation.
- Step 43 Determine at least two spectral coefficients having high correlation with the spectral coefficients to be adjusted according to the selected weighting mode.
- determining at least two spectral coefficients having high correlation with the spectral coefficients to be adjusted according to the weighting mode may be: when a high inter-frame correlation weighting mode is selected, indicating that the inter-frame correlation is high, At least two spectral coefficients may be determined from a frame adjacent to a frame in which the spectral coefficient to be adjusted is located; when a low inter-frame correlation weighting mode is selected, it indicates that the inter-frame correlation is low, and the frame in which the spectral coefficient to be adjusted is located may be Determining at least two spectral coefficients in the middle; when the inter-frame correlation weighting mode is selected, indicating that the inter-frame correlation can be simultaneously from the frame in which the spectral coefficient to be adjusted is located and adjacent to the frame in which the spectral coefficient to be adjusted is located At least two spectral coefficients are determined in the frame.
- Step 44 Perform weighting by using the determined at least two spectral coefficients and the spectral coefficients to be adjusted, and obtain a predicted value of the spectral coefficient to be adjusted.
- the method for weighting the determined at least two spectral coefficients and the spectral coefficients to be adjusted may be predicted by using weighting values of at least one of the following information: 1. Dequantizing the quantized spectral coefficients output by the decoder; 2. Outputting by the BWE decoder Bandwidth spread spectrum coefficient; 3. The predicted value of the frequency coefficient obtained by the prediction. The product of the frequency coefficient and its corresponding weighting weight is the weighting value of the frequency coefficient; since the spectral coefficient to be adjusted may be the spectral coefficient corresponding to the unquantized frequency point, at least two spectral coefficients are used in step 44.
- the weighting value of the spectral coefficient to be adjusted may be 0, that is, only the determined weighting values of at least two spectral coefficients are used to obtain the predicted value of the spectral coefficient to be adjusted.
- the frequency coefficient is predicted according to the weighting value of at least one of the following information: (1) the predicted value of the previous frame; (2) the quantized frequency coefficient of the previous frame; (3) The bandwidth spread spectrum factor of the previous frame.
- the spectral coefficients are predicted according to the weighting values of at least one of the following: (1) the quantized spectral coefficients of the current frame; (2) the bandwidth spread spectrum coefficients of the current frame; (3) the current frame has Some predictions.
- the intra-frame correlation weighting mode predicts the spectral coefficients according to the weighting values of at least one of the following information: (1) the predicted values existing in the previous frame or the current frame; (2) the quantized spectral coefficients of the previous frame or the current frame; (3) The bandwidth spread spectrum factor of the previous frame or the current frame.
- the weighting weight of each of the above spectrum information may also be adjusted according to the quantization precision of the frequency to be adjusted.
- the quantized result can still be weighted and the weighted weight is proportional to the quantization precision of the spectral coefficient.
- Step 45 Control the energy of the obtained predicted value, and perform spectrum adjustment on the decoded signal.
- the upper limit of the energy of the spectral coefficient to be adjusted is first determined, and then the energy of the adjusted spectral coefficient is controlled to be less than or equal to the upper limit.
- the upper threshold can be determined according to the quantization error or the minimum non-zero quantization value in the range of the spectral coefficient to be adjusted, wherein the quantization error or the minimum non-zero quantization value can be obtained by the prior art, and details are not described herein again.
- the energy of the obtained predicted value is controlled, and the spectrum adjustment of the decoded signal may be: according to the upper limit, correcting the predicted value of the spectral coefficient to be adjusted, and obtaining the corrected value of the spectral coefficient to be adjusted,
- the energy of the correction value is within a range less than or equal to the upper limit threshold, and the modified signal is used for spectrum adjustment, wherein when the predicted value is less than or equal to the upper limit, the correction value is equal to the predicted value, and when the predicted value is greater than The correction value is equal to the upper threshold when the upper limit is exceeded.
- the threshold value a can be determined using an empirical value based on experimental statistics, or the quantization precision can be used to control the size of a.
- the value of the control threshold coefficient a is from 1 to a value less than one.
- the spectral coefficients to be adjusted are determined by the quantization precision of the spectral coefficients, and different weighting modes are selected according to the inter-frame correlation of the frame in which the spectral coefficients to be adjusted are located, and determined according to the selected weighting mode.
- the coefficient (ie, the predicted value of the spectral coefficient to be adjusted) is adapted to other related spectral coefficients, so that the spectral coefficients obtained by different quantization precisions are adapted to each other, and the spectrum of the decoded signal is increased.
- the smoothness reduces the noise of the synthesized frequency band after decoding, and enables the audio signal after the band synthesis to achieve a better hearing effect.
- the embodiment provides a method for performing weighted prediction on the adjusted spectral coefficients, and describes frequency information that can be used in different weighting modes, including:
- the intra-frame spectral information is f_ inner[n]
- the weighting weight in the frame is w_ inner [n]
- the inter-frame spectral information is f-inter[n]
- the weighted weight between frames is w-inter[n].
- N is the maximum number of frequency points that a frame has; if the spectral coefficient of the frequency point n is the spectral coefficient to be adjusted, the predicted value f[n] of the frequency coefficient of the frequency point n is expressed by a formula such as Formula 1 :
- f[n] w_inner[0] xf_inner[0] + w inner [ 1 ] x f inner [ 1 ] + ... +
- the weight weight w- inner [n] in the frame is proportional to the intra-frame correlation; the weighted weight w-inter[n] between frames is proportional to the inter-frame correlation; and the sum of all weighted weights is 1 .
- the following is a specific example of how to perform weighted prediction on the adjusted frequency coefficient.
- the quantized spectral coefficient fQ [n] of the current frame intermediate frequency point n is determined as the spectral coefficient to be adjusted, and the bandwidth spread spectrum coefficient of the current frame intermediate frequency point n is ffi[n]; the quantized spectrum of the intermediate frame of the current frame of the current frame n
- the coefficient is expressed as fS[l][n]
- the quantized spectral coefficient of the intermediate frequency point n of the previous frame is expressed as fS[0][n]
- the prediction of the quantized frequency coefficient of the intermediate frequency point n is f[n].
- the above frequency coefficient or predicted value may be 0 or a non-zero number.
- step 41 If the frequency point 17 needs to be adjusted according to the step 41 in the second embodiment, and the frame in which the frequency point is located, different weighting modes are selected according to step 42, for different weighting modes.
- the following processing is performed, wherein the frequency point 16 and the frequency point 18 are adjacent frequency points of the frequency point 17:
- fQ[17] (fB[17]+fQ[16]+fQ[18] )/3 , at this time, ffi[17], fQ[16], fQ[18] are The determined spectral coefficients with high correlation with the spectral coefficients to be adjusted, B[17], fQ[16], and fQ[18] have weighted weights of 1/3, 1/3, and 1/3, respectively, and other weighted prediction formulas below. The meaning in it is similar to here, no longer praised;
- f[17] (0.4 X fB[17]+fQ[17]+0.8 ⁇ fQ[16]+0.8 x fQ[18] )/3;
- f[17] ( 0.3 ⁇ fS[0][17]+0.7 ⁇ fS[l][17]+fQ[17] )12;
- f[17] ( fB[17]+fQ[16]+fQ[18]+ fS[l][16]+ fS[l][17]+ fS[l] [18] )/6;
- weighted weights and the frequency range of the values are derived from the experimental results, that is, the empirical values, and in practical applications of different scenarios, the choice of weighting weights and frequency points will vary from scene to scene, for example, different core encoders will have Different bandwidth extension ranges. Therefore, the above-mentioned inter-frame spectral information, the value range of the intra-frame spectral information, and the specific value of the weighting weight may be rooted. According to experiments in different scenarios.
- the method for performing weighted prediction on the adjusted spectral coefficients provided in the third embodiment is described by using specific weighting weights, frequency coefficients and calculation formulas. These specific weighting weights, frequency coefficients and calculation formulas are only based on experience. The preferred implementation manners do not constitute a limitation on the scope of the present invention. In practice, these specific weighting weights, spectral coefficients, and calculation formulas can be flexibly adjusted according to specific situations, which are extensions and modifications without departing from the invention. All belong to the scope of protection of the present invention.
- the method for performing weighted prediction on the adjusted spectral coefficients provided in the third embodiment can be applied to the embodiments of the present invention, and the weighted prediction is performed on the adjusted frequency coefficient, and the predicted value of the spectral coefficient to be adjusted is obtained.
- a signal denoising method is provided, where the BWE algorithm and the 8-dimensional trellis vector quantization are used as an example for description, but the present invention is not limited thereto.
- the method can also be applied to other vector quantization, such as 4-dimensional quantization.
- the two weights here can be the empirical values obtained from the experiment.
- the frame in which the spectrum coefficient to be adjusted is located is referred to as the current frame.
- the current frame and the previous frame are both harmonic frames, that is, there is a high interframe correlation. Then, the current frame vector has spectral coefficients decoded, and the vector of the corresponding frequency band of the current frame has no spectral coefficients decoded.
- the method for recovering the spectral coefficients to be adjusted may be: if the amplitude of the quantized spectral coefficients of the previous frame of the previous frame is larger than the amplitude of the quantized spectral coefficients corresponding to the previous frame by a given multiple (eg, twice),
- the amplitude of the frequency coefficient is the weighted sum of the amplitude of the BWE spectral coefficient of the current frame and the amplitude of the quantized spectral coefficient corresponding to the previous frame, and the symbol is the symbol of the BWE spectral coefficient of the current frame; otherwise, if the previous frame of the previous frame
- the amplitude of the corresponding quantized frequency coefficient is not greater than the amplitude of the quantized spectral coefficient corresponding to the previous frame
- the amplitude of the spectral coefficient to be adjusted is the amplitude of the quantized frequency coefficient corresponding to the previous frame of the previous frame, before The weight of the corresponding quantized frequency coefficient of one frame, and the weighted sum of the amplitude
- the method for recovering the spectral coefficient to be adjusted of the frequency point may be: obtaining a weighted average of the amplitude of the BWE spectral coefficient of the current frequency point and the amplitude of the quantized spectral coefficient of the adjacent frequency point.
- the value En as the magnitude of the spectral coefficient to be adjusted.
- the current frequency point is the frequency point at which the spectrum coefficient to be adjusted is located, and may be referred to as the frequency point to be adjusted.
- the adjacent frequency point may be a frequency point in the same frame that is higher than the frequency to be adjusted or lower in frequency, and may be one or several One. If En is greater than the threshold thr[i], then En is set to thr[i], and the amplitude of the spectral coefficient to be adjusted is set to thr[i].
- the sign of the spectral coefficient to be adjusted is the sign of the BWE frequency coefficient of the frequency.
- the amplitude of the spectral coefficient to be adjusted is multiplied by the sign of the spectral coefficient to be adjusted as the adjustment result of the frequency point.
- the recovery method of the spectral coefficient to be adjusted of the frequency point may be: the amplitude of the BWE spectral coefficient of the current frequency point, and the BWE of the current frequency point adjacent to the current frequency point.
- the current frequency point is the frequency point at which the spectrum coefficient to be adjusted is located, and may be referred to as the frequency point to be adjusted.
- the adjacent frequency point may be a frequency point in the same frame that is higher than the frequency to be adjusted or lower in frequency, and may be one or several One. If En is greater than the threshold thr[i], then En is set to thr[i], and the amplitude of the spectral coefficient to be adjusted is set to thr[i].
- the symbol of the spectral coefficient to be adjusted is the sign of the BWE frequency coefficient of the frequency point.
- the amplitude of the spectral coefficient to be adjusted is multiplied by the sign of the spectral coefficient to be adjusted as the result of the frequency adjustment.
- the weighting coefficients in the weighting operation may be different to control the degree of spectral coefficient adjustment so that it does not affect the auditory resolution of the quantized spectral coefficients, and does not introduce additional noise.
- the present invention further provides an apparatus for signal denoising.
- the method includes:
- the selecting unit 51 is configured to select at least two spectral coefficients having high correlation with the spectral coefficient to be adjusted according to the inter-frame correlation of the frame in which the spectral coefficient to be adjusted is located;
- a weighting unit 52 configured to perform weighting by using at least two spectral coefficients selected by the selecting unit 51 and the to-be-adjusted spectral coefficients, to obtain a predicted value of the spectral coefficient to be adjusted;
- the adjustment output unit 53 is configured to perform spectrum adjustment on the decoded signal by using the predicted value obtained by the weighting unit 52, and output the adjusted decoded signal.
- the device further comprises: a prediction point determining unit 50, configured to determine a spectral coefficient to be adjusted according to a quantization coding precision of the spectral coefficient, where the determined spectral coefficient to be adjusted includes: an unquantized spectral coefficient, and/or a spectrum whose quantization precision is lower than a quantization precision threshold coefficient.
- the selecting unit 51 includes:
- the weighting mode selection module 511 is configured to select one of three weighting modes: high inter-frame correlation, low inter-frame correlation, or medium inter-frame correlation according to the inter-frame correlation of the frame in which the spectral coefficient to be adjusted is located.
- Weighting mode high inter-frame correlation, low inter-frame correlation, or medium inter-frame correlation according to the inter-frame correlation of the frame in which the spectral coefficient to be adjusted is located.
- the correlation spectrum selection module 512 is configured to determine, according to the weighting mode selected by the weighting mode selection module 51 1 , at least two spectral coefficients having high correlation with the spectral coefficient to be adjusted.
- the weighting unit 52 includes any of the following modules:
- the high correlation weighting module 521 is configured to obtain a prediction value of the to-be-adjusted spectral coefficient according to a weighting value of at least one of the following information: (1) a predicted value of the previous frame; (2) The quantized spectral coefficient of the frame; (3) the bandwidth spread spectrum coefficient of the previous frame; or, the low correlation weighting module 522, the weighting mode for the correlation between the low frames, obtaining the to-be-tuned according to the weighting value of at least one of the following information
- the predicted value of the spectral coefficient (1) the quantized spectral coefficient of the current frame; (2) the bandwidth spread spectrum coefficient of the current frame; (3) the predicted value existing in the current frame; or, the correlation weighting module 523, used for centering a weighting mode of inter-frame correlation, obtaining a predicted value of the spectral coefficient to be adjusted according to a weighting value of at least one of the following information: (1) a predicted value of a previous frame or a current frame; (2) a quantization frequency of a previous
- the weighting weight of the spectrum information used in each of the above correlation weighting modules is controlled by the quantization precision of the spectrum coefficient to be adjusted, and the higher the quantization precision of the frequency information is
- the weighting weight should be larger, and its weighting weight is proportional to the quantization precision of the spectral coefficient.
- the product of the spectral coefficient and its corresponding weighted weight is the weighted value of the spectral coefficient.
- the weighting unit 52 described above further includes:
- the weight control module 520 is configured to control the weighting weight of the spectrum information according to the quantization precision of the spectrum coefficient to be adjusted. The higher the quantization precision of the spectrum information, the larger the weighting weight corresponding to the corresponding information.
- the above adjustment output unit 53 further includes:
- the correction module 530 is configured to generate a correction value of the to-be-adjusted spectral coefficient according to the upper threshold value of the spectral coefficient energy to be adjusted and the obtained predicted value, and perform spectrum adjustment on the decoded signal by using the modified value; wherein, the correction of the spectral coefficient to be adjusted The energy of the value is less than or equal to the upper threshold of the energy of the spectral coefficient to be adjusted.
- the apparatus for denoising a signal by using a weighting unit to weight the at least two related spectral coefficients selected by the selecting unit to adjust the spectral coefficient, to obtain a predicted value of the spectral coefficient to be adjusted, and the adjusted output unit according to the After the predicted value of the spectrum coefficient is adjusted, the spectrum of the decoded signal is adjusted, and the adjusted decoded signal is output; the predicted spectral coefficient (ie, the predicted value of the spectral coefficient to be adjusted) is adapted to other related spectral coefficients, thereby enabling different quantization.
- the spectral coefficients obtained by the precision are mutually adapted, the smoothness of the spectrum of the decoded signal is increased, the noise of the synthesized frequency band after decoding is reduced, and the audio signal after the frequency band synthesis can achieve a better hearing effect.
- an embodiment of the present invention further provides an audio decoding system.
- a core decoder 61 a bandwidth extension decoder 62, a dequantization decoder 63, and a signal denoising device 60 are included, wherein the core decoder 61 is configured to decode a low frequency first layer code stream.
- Information the bandwidth extension decoder 62, configured to decode information of the bandwidth-spread second layer code stream; the dequantization decoder 63, configured to decode the third layer code stream of the de-quantized high-band remaining bits information;
- the signal denoising device 60 may be the signal denoising device provided by the embodiment of the present invention, for receiving the decoded information output by the bandwidth extension decoder and the dequantization decoder, according to the decoded second
- the information of the layer code stream and the layer 3 code stream determines the spectrum coefficient to be adjusted, and adjusts the spectrum coefficient in the information of the decoded third layer code stream according to the obtained predicted value of the spectrum coefficient to be adjusted.
- the method in the embodiment of the present invention may be implemented in the form of a software function module, and when the software function module is sold or used as a stand-alone product, it may also be stored in a computer readable storage medium.
- the above-mentioned storage medium may be a read only memory, a magnetic disk or an optical disk or the like.
- the functional units in the embodiments of the present invention may be integrated into one processing module, or each unit may exist physically separately, or two or more units may be integrated into one module.
- the above integrated modules can be implemented in the form of hardware or in the form of software functional modules.
- the integrated modules, if implemented in the form of software functional modules and sold or used as separate products, may also be stored in a computer readable storage medium.
- the storage medium mentioned above may be a read only memory, a magnetic disk or an optical disk or the like.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012502425A JP5459688B2 (ja) | 2009-03-31 | 2009-12-28 | 復号信号のスペクトルを調整する方法、装置、および音声復号システム |
KR1020117024686A KR101390433B1 (ko) | 2009-03-31 | 2009-12-28 | 신호 잡음 제거 방법, 신호 잡음 제거 장치, 및 오디오 디코딩 시스템 |
EP09842532A EP2407965B1 (en) | 2009-03-31 | 2009-12-28 | Method and device for audio signal denoising |
KR1020137015052A KR101320963B1 (ko) | 2009-03-31 | 2009-12-28 | 신호 잡음 제거 방법, 신호 잡음 제거 장치, 및 오디오 디코딩 시스템 |
US13/248,725 US8965758B2 (en) | 2009-03-31 | 2011-09-29 | Audio signal de-noising utilizing inter-frame correlation to restore missing spectral coefficients |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200910133808 | 2009-03-31 | ||
CN200910133808.6 | 2009-03-31 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/248,725 Continuation US8965758B2 (en) | 2009-03-31 | 2011-09-29 | Audio signal de-noising utilizing inter-frame correlation to restore missing spectral coefficients |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2010111876A1 true WO2010111876A1 (zh) | 2010-10-07 |
Family
ID=42827479
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2009/076155 WO2010111876A1 (zh) | 2009-03-31 | 2009-12-28 | 一种信号去噪的方法和装置及音频解码系统 |
Country Status (5)
Country | Link |
---|---|
US (1) | US8965758B2 (zh) |
EP (2) | EP2555191A1 (zh) |
JP (1) | JP5459688B2 (zh) |
KR (2) | KR101390433B1 (zh) |
WO (1) | WO2010111876A1 (zh) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5997592B2 (ja) * | 2012-04-27 | 2016-09-28 | 株式会社Nttドコモ | 音声復号装置 |
EP2720222A1 (en) | 2012-10-10 | 2014-04-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for efficient synthesis of sinusoids and sweeps by employing spectral patterns |
US9602841B2 (en) * | 2012-10-30 | 2017-03-21 | Texas Instruments Incorporated | System and method for decoding scalable video coding |
CN105976824B (zh) * | 2012-12-06 | 2021-06-08 | 华为技术有限公司 | 信号解码的方法和设备 |
CN106463143B (zh) * | 2014-03-03 | 2020-03-13 | 三星电子株式会社 | 用于带宽扩展的高频解码的方法及设备 |
SG11201609834TA (en) | 2014-03-24 | 2016-12-29 | Samsung Electronics Co Ltd | High-band encoding method and device, and high-band decoding method and device |
EP2980792A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating an enhanced signal using independent noise-filling |
KR20200107125A (ko) * | 2019-03-06 | 2020-09-16 | 삼성전자주식회사 | 간소화된 상관을 위한 전자 장치, 방법, 및 컴퓨터 판독가능 매체 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1276896A (zh) * | 1997-09-18 | 2000-12-13 | 马特拉诺泰尔通讯公司 | 数字语音信号的去噪声方法 |
CN1463422A (zh) * | 2001-06-06 | 2003-12-24 | 三菱电机株式会社 | 噪声抑制装置 |
CN1892822A (zh) * | 2005-05-31 | 2007-01-10 | 日本电气株式会社 | 噪声抑制方法和设备 |
CN101010727A (zh) * | 2004-06-28 | 2007-08-01 | 索尼株式会社 | 信号编码器件和方法与信号解码器件和方法 |
WO2007099222A1 (fr) * | 2006-03-01 | 2007-09-07 | Parrot | Procede de debruitage d'un signal audio |
CN101046964A (zh) * | 2007-04-13 | 2007-10-03 | 清华大学 | 基于重叠变换压缩编码的错误隐藏帧重建方法 |
CN101067650A (zh) * | 2007-06-08 | 2007-11-07 | 骆建华 | 基于部分频谱数据信号重构的信号去噪方法 |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6014544A (ja) | 1983-07-05 | 1985-01-25 | Omron Tateisi Electronics Co | スペクトラム拡散通信の受信回路 |
US5237587A (en) | 1992-11-20 | 1993-08-17 | Magnavox Electronic Systems Company | Pseudo-noise modem and related digital correlation method |
JPH09261065A (ja) * | 1996-03-25 | 1997-10-03 | Mitsubishi Electric Corp | 量子化装置及び逆量子化装置及び量子化逆量子化システム |
JPH1020011A (ja) | 1996-07-02 | 1998-01-23 | Oki Electric Ind Co Ltd | 方位分析方法及び装置 |
JP3613303B2 (ja) * | 1996-08-08 | 2005-01-26 | 富士通株式会社 | 音声情報圧縮蓄積方法及び装置 |
DE69712537T2 (de) | 1996-11-07 | 2002-08-29 | Matsushita Electric Industrial Co., Ltd. | Verfahren zur Erzeugung eines Vektorquantisierungs-Codebuchs |
US5940435A (en) | 1996-11-21 | 1999-08-17 | Dsp Group, Inc. | Method for compensating filtering delays in a spread-spectrum receiver |
EP0878790A1 (en) * | 1997-05-15 | 1998-11-18 | Hewlett-Packard Company | Voice coding system and method |
US6548465B2 (en) | 2000-03-10 | 2003-04-15 | General Electric Company | Siloxane dry cleaning composition and process |
US6931373B1 (en) * | 2001-02-13 | 2005-08-16 | Hughes Electronics Corporation | Prototype waveform phase modeling for a frequency domain interpolative speech codec system |
US7447631B2 (en) * | 2002-06-17 | 2008-11-04 | Dolby Laboratories Licensing Corporation | Audio coding system using spectral hole filling |
KR100524065B1 (ko) | 2002-12-23 | 2005-10-26 | 삼성전자주식회사 | 시간-주파수 상관성을 이용한 개선된 오디오 부호화및/또는 복호화 방법과 그 장치 |
JP4311034B2 (ja) * | 2003-02-14 | 2009-08-12 | 沖電気工業株式会社 | 帯域復元装置及び電話機 |
KR100608062B1 (ko) * | 2004-08-04 | 2006-08-02 | 삼성전자주식회사 | 오디오 데이터의 고주파수 복원 방법 및 그 장치 |
WO2006062202A1 (ja) * | 2004-12-10 | 2006-06-15 | Matsushita Electric Industrial Co., Ltd. | 広帯域符号化装置、広帯域lsp予測装置、帯域スケーラブル符号化装置及び広帯域符号化方法 |
BRPI0607646B1 (pt) * | 2005-04-01 | 2021-05-25 | Qualcomm Incorporated | Método e equipamento para encodificação por divisão de banda de sinais de fala |
US8112286B2 (en) * | 2005-10-31 | 2012-02-07 | Panasonic Corporation | Stereo encoding device, and stereo signal predicting method |
JP2008033269A (ja) * | 2006-06-26 | 2008-02-14 | Sony Corp | デジタル信号処理装置、デジタル信号処理方法およびデジタル信号の再生装置 |
JP4769673B2 (ja) * | 2006-09-20 | 2011-09-07 | 富士通株式会社 | オーディオ信号補間方法及びオーディオ信号補間装置 |
JP5294086B2 (ja) * | 2007-02-28 | 2013-09-18 | 日本電気株式会社 | 重み係数学習システム及び音声認識システム |
JP2009047831A (ja) | 2007-08-17 | 2009-03-05 | Toshiba Corp | 特徴量抽出装置、プログラムおよび特徴量抽出方法 |
PT2186089T (pt) * | 2007-08-27 | 2019-01-10 | Ericsson Telefon Ab L M | Método e dispositivo para descodificação espetral percetual de um sinal áudio que inclui preenchimento de buracos espetrais |
-
2009
- 2009-12-28 JP JP2012502425A patent/JP5459688B2/ja active Active
- 2009-12-28 EP EP12190501A patent/EP2555191A1/en not_active Withdrawn
- 2009-12-28 EP EP09842532A patent/EP2407965B1/en active Active
- 2009-12-28 KR KR1020117024686A patent/KR101390433B1/ko active IP Right Grant
- 2009-12-28 KR KR1020137015052A patent/KR101320963B1/ko active IP Right Grant
- 2009-12-28 WO PCT/CN2009/076155 patent/WO2010111876A1/zh active Application Filing
-
2011
- 2011-09-29 US US13/248,725 patent/US8965758B2/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1276896A (zh) * | 1997-09-18 | 2000-12-13 | 马特拉诺泰尔通讯公司 | 数字语音信号的去噪声方法 |
CN1463422A (zh) * | 2001-06-06 | 2003-12-24 | 三菱电机株式会社 | 噪声抑制装置 |
CN101010727A (zh) * | 2004-06-28 | 2007-08-01 | 索尼株式会社 | 信号编码器件和方法与信号解码器件和方法 |
CN1892822A (zh) * | 2005-05-31 | 2007-01-10 | 日本电气株式会社 | 噪声抑制方法和设备 |
WO2007099222A1 (fr) * | 2006-03-01 | 2007-09-07 | Parrot | Procede de debruitage d'un signal audio |
CN101046964A (zh) * | 2007-04-13 | 2007-10-03 | 清华大学 | 基于重叠变换压缩编码的错误隐藏帧重建方法 |
CN101067650A (zh) * | 2007-06-08 | 2007-11-07 | 骆建华 | 基于部分频谱数据信号重构的信号去噪方法 |
Non-Patent Citations (1)
Title |
---|
See also references of EP2407965A4 * |
Also Published As
Publication number | Publication date |
---|---|
EP2407965B1 (en) | 2012-12-12 |
KR20120000091A (ko) | 2012-01-03 |
KR20130086634A (ko) | 2013-08-02 |
KR101320963B1 (ko) | 2013-10-23 |
US8965758B2 (en) | 2015-02-24 |
EP2555191A1 (en) | 2013-02-06 |
EP2407965A1 (en) | 2012-01-18 |
JP5459688B2 (ja) | 2014-04-02 |
US20120022878A1 (en) | 2012-01-26 |
JP2012522272A (ja) | 2012-09-20 |
KR101390433B1 (ko) | 2014-04-29 |
EP2407965A4 (en) | 2012-01-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5203929B2 (ja) | スペクトルエンベロープ表示のベクトル量子化方法及び装置 | |
WO2010111876A1 (zh) | 一种信号去噪的方法和装置及音频解码系统 | |
US9424851B2 (en) | Frame error concealment method and apparatus and decoding method and apparatus using the same | |
US8612219B2 (en) | SBR encoder with high frequency parameter bit estimating and limiting | |
AU2017201872B2 (en) | Audio encoder and decoder | |
JP6452759B2 (ja) | 先進量子化器 | |
JP6368029B2 (ja) | 雑音信号処理方法、雑音信号生成方法、符号化器、復号化器、並びに符号化および復号化システム | |
RU2707727C1 (ru) | Устройство обработки аудиосигнала, способ обработки аудиосигнала и программа обработки аудиосигнала | |
WO2015196837A1 (zh) | 一种音频编码方法和装置 | |
AU2012361423A1 (en) | Method, apparatus, and system for processing audio data | |
US9131290B2 (en) | Audio coding device, audio coding method, and computer-readable recording medium storing audio coding computer program | |
JP5340965B2 (ja) | 定常的な背景雑音の平滑化を行うための方法及び装置 | |
CN101853664B (zh) | 一种信号去噪的方法和装置及音频解码系统 | |
US10504531B2 (en) | Audio parameter quantization | |
KR102243217B1 (ko) | 오디오 신호 부호화 방법 및 장치 | |
AL-Rawi | ADPCM: US Patents from 2010 to 2016 | |
JP2005301002A (ja) | 音声符号化情報処理装置および音声符号化情報処理プログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 09842532 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2012502425 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2009842532 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 20117024686 Country of ref document: KR Kind code of ref document: A |