WO2015007075A1 - 一种声道间电平差处理方法及装置 - Google Patents
一种声道间电平差处理方法及装置 Download PDFInfo
- Publication number
- WO2015007075A1 WO2015007075A1 PCT/CN2014/070131 CN2014070131W WO2015007075A1 WO 2015007075 A1 WO2015007075 A1 WO 2015007075A1 CN 2014070131 W CN2014070131 W CN 2014070131W WO 2015007075 A1 WO2015007075 A1 WO 2015007075A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frame
- icld
- subframe
- subband
- sub
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 76
- 230000005236 sound signal Effects 0.000 claims abstract description 105
- 238000004364 calculation method Methods 0.000 claims description 19
- 230000005540 biological transmission Effects 0.000 abstract description 22
- 238000013139 quantization Methods 0.000 description 10
- 230000000694 effects Effects 0.000 description 6
- 238000010586 diagram Methods 0.000 description 3
- 230000011218 segmentation Effects 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 230000001052 transient effect Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
Definitions
- the input stereo audio signal is first parsed to obtain an Inter-Channe l Leve l Difference (ICLD) value in each sub-band of the frame carrying the stereo audio signal. Then, the obtained ICLD value is compared with the ICLD value obtained in the previous frames.
- the stereo audio signal carried by the frame is a transient Transient, otherwise the frame
- the stereo audio signal carried is the normal Norma 1.
- the Trans ient is divided into 2 frames for transmission, that is, the ICLDs of the odd subbands and the even subbands are respectively transmitted.
- the transmission is divided into 4 frames, that is, the ICLD of 1/4 subband is transmitted per frame.
- ref inement processing is further performed on Norma l.
- Embodiments of the present invention provide a method and apparatus for processing an inter-channel level difference, which can ensure a decoded stereo audio quality in a case where a signal changes rapidly or is lost, and implements a stereo audio signal. Low bit rate transmission.
- an embodiment of the present invention provides a method for processing an inter-channel level difference, including: receiving a stereo audio signal, and parsing the stereo audio signal frame by frame to obtain each sub-frame of the stereo audio signal.
- the ICLD weight value of each subband in the any frame is calculated by the first weighting method.
- the second weighting manner is used to calculate the ICLD weight value of each subband in the any frame.
- the determining, by using the first weighting manner, the ICLD weight value of each subband in the any frame includes:
- the calculating according to the number of subframes included in any one of the frames, calculating a number of the bth subband in the i th subframe A weighting factor, including:
- the determining, by using the second weighting manner, the ICLD weight value of each subband in the any frame includes:
- calculating, according to the calculated sum of absolute values of ICLDs of each subframe in the any frame of the stereo audio signal, The second weighting factor of the bth subband in the ith subframe includes:
- Fac 2 (i,b) sum(i) / Vsum(j)
- the calculating, according to the ICLD of the b-th sub-band in the i-th subframe of the any frame, The second weighting factor of the bth subband in the subframe includes:
- an embodiment of the present invention provides an inter-channel level difference processing apparatus, including: a receiving unit, configured to receive a stereo audio signal;
- a parsing unit configured to parse the stereo audio signal frame by frame according to the stereo audio information received by the receiving unit, to obtain inter-channel power of each sub-band of each sub-frame in each frame of the stereo audio signal Adjusting the ICLD, the frame includes at least two subframes;
- a calculating unit configured to calculate, according to an ICLD of each sub-band of each subframe in each frame of the stereo audio signal obtained by the parsing unit, calculate an ICLD of each subframe in any frame of the stereo audio signal The sum of absolute values;
- a first weighting processing unit configured to calculate the any frame by using a first weighting manner when an absolute value of a difference between a sum of absolute values of ICLDs of any two of the frames is less than a preset threshold ICLD weighting value for each subband in the middle;
- a second weighting processing unit configured to calculate the any frame by using a second weighting manner when an absolute value of a difference between a sum of absolute values of ICLDs of any two of the frames is not less than a preset threshold The ICLD weighting value for each subband in the middle.
- the first weighting processing unit includes:
- a first calculating module configured to calculate, according to the number of subframes included in the any frame, a first weighting factor of a b-th sub-band in the i-th subframe, where i represents any one of the frames included in any one of the frames Subframe, 0 ⁇ i ⁇ L, the L is the number of subframes included in the any frame, and L > 2, the b represents any one of the i-th subframes, 0 ⁇ b K , K represents the number of subbands included in any one subframe, K > 12;
- a second calculating module configured to calculate the b-th sub-band in the any frame according to the first weighting factor and an ICLD of the b-th sub-band in the i-th subframe of any one of the frames The ICLD weighted value.
- the first computing module is configured to:
- the second weighting processing unit includes:
- a third calculating module configured to calculate a second weighting factor of the bth subband in the i-th subframe according to the sum of absolute values of ICLDs of each subframe in the any frame of the stereo audio signal
- i represents any one of the subframes included in any one of the frames, 0 ⁇ i L
- the L is the number of subframes included in the any frame, and L > 2,
- b represents the Any one of the i-th sub-frames, 0 ⁇ b ⁇ K
- the ⁇ indicates the number of sub-bands included in any one of the sub-frames, ⁇ > 12;
- the fourth calculation module is configured to use any one of the frames Calculating a second weighting factor of the bth subband in the ith subframe in the ICLD of the bth subband in the i th subframe;
- a fifth calculating module configured to calculate the b-th sub-band in the any frame according to the second weighting factor and an ICLD of the b-th sub-band in the i-th subframe of any one of the frames The ICLD weighted value.
- the third computing module is configured to:
- Fac 2 (i,b) sum(i) / Vsum(j)
- the fourth calculation module is used for:
- the fifth calculating module is configured to:
- Embodiments of the present invention provide a method for processing an inter-channel level difference by receiving a stereo audio signal and parsing the stereo audio signal frame by frame to obtain each sub-frame of each frame of the stereo audio signal.
- Inter-channel level difference ICLD of the band the frame includes at least two sub-frames; the stereo audio is calculated according to the ICLD of each sub-band of each sub-frame in each frame of the obtained stereo audio signal.
- the sum of the absolute values of the ICLD of each subframe in any frame of the signal when the absolute value of the difference between the sum of the absolute values of the ICLDs of any two of the frames is less than the preset threshold, Calculating an ICLD weighting value of each subband in any one of the frames in a weighting manner; when the absolute value of the difference between the sum of the absolute values of the ICLDs of any two of the frames is not less than a preset threshold,
- the second weighting method calculates an ICLD weighting value for each subband in the any frame.
- FIG. 1 is a flowchart of a method for processing an inter-channel level difference according to an embodiment of the present invention
- FIG. 1 is a flowchart of a method for processing an inter-channel level difference according to another embodiment of the present invention
- 3 is a flowchart of a method for processing an inter-channel level difference according to another embodiment of the present invention
- FIG. 4 is a flowchart of another method for processing an inter-channel level difference according to another embodiment of the present invention
- FIG. 5 is a block diagram of an inter-channel level difference processing apparatus according to an embodiment of the present invention
- FIG. 6 is a block diagram of another inter-channel level difference processing apparatus according to an embodiment of the present invention
- An embodiment of the present invention provides a method for processing a level difference between channels, which is applied to an encoding end of a stereo audio. As shown in FIG. 1, the method includes:
- Step 101 Receive a stereo audio signal, and parse the stereo audio signal frame by frame to obtain a An inter-channel level difference ICLD of each sub-band of each sub-frame in each frame of the stereo audio signal, wherein each frame includes at least two sub-frames.
- a stereo audio signal can be made up of many frames.
- each frame can be further divided into a plurality of subframes, and each subframe can be divided into a plurality of sub-bands.
- the number of subframes included in each frame is an even number, and each frame may include at least two subframes, for example, one subframe includes two subframes, or one frame. It includes 4 subframes, or 6 subframes in one frame, and so on. At least 12 sub-bands are included in each subframe.
- ICLD is used to distinguish the horizontal direction of the stereo source and describes the difference in intensity between the channels, which will affect the frequency content of the entire spectrum. Further optionally, ICLD is the power ratio of the two input channel signals, for example, the ICLD value of the bth subband is:
- Level(b) 101og 10 , where XL(P) and X R (P) are the frequency domain coefficients of the left and right channels, respectively.
- P is the frequency point value
- the calculation range of level (b) is A ⁇ p Ab , , A b respectively represents the frequency point value corresponding to different frequencies
- b represents any one of the sub-subframes, 0 ⁇ b K
- the K represents the number of sub-bands included in any one subframe, K > 12.
- the ⁇ indicates that in order to approximate the auditory characteristics of the human ear, the spectral coefficients having the same frequency resolution can be divided into ⁇ non-overlapping sub-bands according to the critical band theory.
- Step 102 Calculate a sum of absolute values of ICLDs of each subframe in any frame of the stereo audio signal according to the obtained ICLD of each sub-band of each subframe in each frame of the stereo audio signal.
- Step 103 When the absolute value of the difference between the sum of the absolute values of the ICLDs of any two of the frames is less than a preset threshold, calculate, by using the first weighting method, each subband of any one of the frames. ICLD weighted value.
- the preset threshold is a threshold obtained based on actual experience.
- the step includes: calculating, according to the number of subframes included in any frame, a first weighting factor of a b-th sub-band in the i-th subframe;
- the first weighting factor of the b-th sub-band in the i-th subframe is calculated according to the number of the subframes included in the any one of the frames, including:
- Step 104 When the absolute value of the difference between the sum of the absolute values of the ICLDs of any two of the frames is not less than a preset threshold, calculate a second weighting manner for each subband of the any frame. ICLD weighted value.
- the second weighting manner is used to calculate each of the subframes.
- the absolute value of the difference between the sum of the absolute values of the ICLDs of any two subframes included in any frame is not less than the preset threshold. It can be understood that: the absolute values of all the differences are greater than or equal to the preset threshold. , or department The absolute value of the difference is less than the preset threshold, and the absolute value of the remaining difference is greater than or equal to the preset threshold.
- the step includes: obtaining, according to the calculated absolute value of the ICLD of each subframe in the any frame of the stereo audio signal, or according to the bth in the i-th subframe in any one of the frames
- the ICLD of the subbands calculates a second weighting factor of the bth subband in the i-th subframe, where i represents any one of the subframes included in any one of the frames, 0 ⁇ i L, the L is The number of subframes included in any frame, and L > 2, where b represents any one of the i-th subframes, 0 ⁇ b K, and the K represents a sub-band included in any one of the subframes Quantity, K >12;
- Fac 2 (i,b) sum(i) / Vsum(j)
- one subframe may be divided into K subbands, for example, subband 1, subband 2, ... subband K, where 1, 2, ... K is the index number of the subband.
- L is rounded down
- II is rounded up.
- the obtained ICLD weighting value is quantized.
- the quantization process is performed by comparing the preset codebook with the ICLD weighting value to obtain a value in the preset codebook that is closest to the ICLD weighting value. For example, if the ICLD weighting value is 1. 4 and the default codebook includes 0 and 2 values, the value obtained by quantizing the ICLD weighting value is 2.
- the bit transmission is reduced, and when the quantization process is performed, the difference between the current ICLD weight value and the previous ICLD weight value may be calculated, and the calculated difference is obtained.
- the values are quantized. For example, one sub-frame is divided into 15 sub-bands, and the ICLD weighting value of the first sub-band is directly quantized.
- the ICLD weighting value of the first sub-band is quantized, the ICLD weighting value of the second sub-band is first calculated.
- the difference between the ICLD weights of the first subband, and then the calculated difference is quantized to obtain the ICLD weight of the second subband.
- the quantized result of the value can be used to calculate the quantized result of the ICLD weighting value of the other sub-bands in turn according to the method of calculating the quantized result of the ICLD weighting value of the second sub-band.
- An embodiment of the present invention provides a method for processing an inter-channel level difference, by calculating any frame of the stereo audio signal according to an ICLD of each sub-band of each subframe in each frame of the obtained stereo audio signal.
- the frame length carrying stereo audio is long, when processing stereo audio of 10 ms, if Norma l is processed in 4 frames, it is equivalent to performing ICLD update every 40 ms (4*10 ms), resulting in signal change.
- the quality of the decoded stereo audio cannot be guaranteed in the case of rapid or packet loss.
- the ICLD is transmitted frame by frame, the low bit rate transmission of the stereo audio signal cannot be achieved.
- the present invention The I CLD of the subframe of the frame is weighted so that the decoded stereo audio quality can be guaranteed in the case of rapid signal change or packet loss, and low bit rate transmission of the stereo audio signal can be realized.
- Embodiments of the present invention provide a method for processing a level difference between channels.
- a frame includes two subframes for detailed description.
- a frame includes a first subframe and a second subframe.
- the method includes:
- Step 201 Receive a stereo audio signal, and parse the stereo audio signal frame by frame to obtain an inter-channel level difference I CLD of each sub-band of each sub-frame in each frame of the stereo audio signal.
- two subframes are included in each frame. That is, the stereo audio signal is received, and the stereo audio signal is parsed frame by frame, and the inter-channel level difference ICLD of each sub-band of the first sub-frame and the second sub-frame in each frame of the stereo audio signal can be obtained.
- the specific processing manner of parsing the stereo audio signal on a frame-by-frame basis to obtain the ICLD of each sub-band in each sub-frame can be referred to the description in step 101 in FIG. 1, and is not described here.
- Step 202 According to the obtained sub-bands of each subframe in each frame of the stereo audio signal The ICLD calculates a sum of absolute values of inter-channel level differences ICLD of the first sub-frame and the second sub-frame in any one of the stereo audio signals.
- Step 203 Determine whether the absolute value of the difference between the sum of the absolute values of the ICLDs of the first subframe and the absolute value of the ICLD of the second subframe is at a preset threshold.
- the absolute value of the difference between the sum of the absolute values of the ICLDs of the first subframe and the absolute values of the ICLDs of the second subframe is determined according to I sum (1)-sum (2) I ⁇ TH Both are less than the preset threshold.
- TH is the preset threshold, and the preset threshold can be set according to actual experience.
- Step 204 When the absolute value of the difference between the sum of the absolute values of the ICLD of the first subframe and the absolute value of the ICLD of the second subframe is less than a preset threshold, the first weighting method is used to calculate The ICLD weighting value for each subband in any of the frames.
- the ICLD weighting of each subband in the any frame may be calculated by using the first weighting manner. value.
- the step includes: calculating, according to the number of subframes included in the any frame, a first weighting factor of the b-th sub-band in the i-th subframe, where the i indicates any one included in the any frame.
- the first weighting factor of the b-th sub-band in the i-th subframe is calculated according to the number of subframes included in any one of the frames, including:
- L is 2
- the first weighting factor of the first subframe or the second subframe in any frame is fac ⁇ iW ⁇ / I ⁇ OJ.
- Step 205 When the absolute value of the difference between the sum of the absolute values of the ICLDs of any two of the frames is not less than a preset threshold, calculate a second weighting manner for each subband of the any frame. ICLD weighted value.
- the second weighting manner may be used to calculate each of the subframes. I CLD weighted value with band.
- the second weighting manner is used to calculate the ICLD weighting value of each subband in the any frame, which can be calculated in the following three manners: mode one, integer frame weighting processing; mode two, based on the current subband Subband weighting processing; Mode 3, based on sub-subband weighting processing of adjacent M subbands.
- the step when performing weighting processing by using mode 1, the step includes:
- Fac 2 (i,b) sum(i) / Vsum(j)
- the step includes:
- the first subframe or the second subframe of the second subframe is calculated according to the ICLD of the first subframe included in any frame of the calculated stereo audio or the second subframe of the second subframe.
- fac 2 (b)
- the level (b) represents an ICLD weighting value of the b-th sub-band in the any frame, 0 ⁇ b K, the K Representing the number of subbands included in any one subframe, ⁇ >12
- the fac 2 (i, b) is the second weighting factor of the bth subband in the i th subframe, 0 ⁇ i L
- the l eve l(i,b) represents the ICLD of the bth subband of the i-th subframe.
- the step includes:
- the first subframe or the second subframe of the second subframe is calculated according to the ICLD of the first subframe included in any frame of the calculated stereo audio or the second subframe of the second subframe.
- the second weighting factor of the subbands including:
- one subframe may be divided into ⁇ subbands, for example, subband 1, subband 2, ... subband K, where 1, 2, ... K is the index number of the subband.
- Fac 2 (b)
- Step 206 Perform quantization processing on the obtained ICLD weighting value.
- the quantization process is performed by comparing the preset codebook with the ICLD weight value to obtain a value in the preset codebook that is closest to the ICLD weighting value. For example, if the ICLD weighting value is 1. 4 and the default codebook includes 0 and 2 values, the value obtained by quantizing the ICLD weighting value is 2.
- the bit transmission is reduced, and when the quantization process is performed, the current ICLD weight value and the previous ICLD weight value may be calculated.
- the difference is quantized by the calculated difference. For example, one sub-frame is divided into 15 sub-bands, and the ICLD weighting value of the first sub-band is directly quantized. When the ICLD weighting value of the first sub-band is quantized, the ICLD weighting value of the second sub-band is first calculated.
- the difference between the ICLD weighting values of the first sub-band, and then the calculated difference is quantized to obtain the quantized result of the ICLD weighting value of the second sub-band, which can be quantized according to the calculation of the ICLD weighting value of the second sub-band.
- the resulting method sequentially calculates the quantized results of the ICLD weighting values of the other sub-bands.
- Step 301 Receive a stereo audio signal, and parse the stereo audio signal frame by frame to obtain each frame of the stereo audio signal.
- the four subframes are included in each frame.
- the specific processing manner of parsing the stereo audio signal on a frame-by-frame basis and obtaining the ICLD of each sub-band in each sub-frame can be referred to the description in step 101 in FIG. 1, and is not described here.
- Step 302 Calculate a sum of absolute values of ICLDs of each subframe in any frame of the stereo audio signal according to the obtained ICLD of each sub-band of each subframe in each frame of the stereo audio signal.
- the ⁇ represents the number of subbands included in any one of the subframes, ⁇ > 12, and the i represents any one of the subframes included in any frame, in this embodiment, Q ⁇ i 4.
- Step 303 Determine whether an absolute value of a difference between a sum of absolute values of ICLDs of any two of the frames is less than a preset threshold.
- the absolute value of the difference between the sum of the absolute values of the ICLDs of the i-th subframe and the j-th subframe in any one frame is determined according to I s ⁇ (i) -sum (j) I ⁇ TH Preset threshold.
- TH is the preset threshold, and the preset threshold can be set according to actual experience.
- Step 304 When the absolute value of the difference between the sum of the absolute values of the ICLDs of any two subframes in any one of the frames is less than a preset threshold, the first weighting manner is used to calculate each subband of the any frame. ICLD weighted value.
- I sum (i) -sum (j) I ⁇ TH indicating that the i-th subframe is close to the ICLD of the j-th subframe
- the I of each sub-band in the any frame may be calculated by using the first weighting manner.
- CLD weighted value When the absolute value of the difference between the sum of the absolute values of the ICLDs of any two subframes in any one of the frames is less than a preset threshold, the first weighting manner is used to calculate each subband of the any frame.
- ICLD weighted value When I sum (i) -sum (j) I ⁇ TH, indicating that the i-th subframe is close to the ICLD of the j-th subframe
- this step includes:
- the first weighting factor of the b-th sub-band in the i-th subframe is calculated according to the number of subframes included in any one of the frames, including:
- the ICLD weight of the b-th subband of any frame is: w .
- the second weighting manner may be used to calculate each sub-band of the any frame.
- ICLD weighted value the absolute value of the difference between the sum of the absolute values of the ICLDs of any two subframes included in any frame is not less than the preset threshold. It can be understood that: the absolute values of all the differences are greater than or equal to the preset threshold. , or the absolute value of the partial difference is less than the preset threshold, and the absolute value of the remaining difference is greater than or equal to the preset threshold.
- I ⁇ TH the ICLD weighting value of any one of the sub-bands can be calculated by the second weighting method.
- the second weighting manner is used to calculate the ICLD weighting value of each subband in the any frame, which can be calculated in the following three manners: mode one, integer frame weighting processing; mode two, based on the current subband Subband weighting processing; Mode 3, based on sub-subband weighting processing of adjacent M subbands.
- the step when performing weighting processing by using mode 1, the step includes:
- calculating a second weighting factor of the b-th sub-band in the ith subframe according to the sum of absolute values of ICLDs of each subframe in the any frame of the stereo audio signal including:
- Fac 2 (i,b) sum(i) / Vsum(j)
- Fac 2 (i, b) sum(i) I ⁇ sum(j)
- the step includes:
- this step includes:
- the second weighting factor of the b-th sub-band in the i-th subframe is calculated according to the ICLD of the b-th sub-band in the i-th subframe of the any one of the frames, including:
- one subframe can be divided into K subbands, such as subband 1, subband 2, and subband K.
- 1, 2, ... K is the index number of the subband.
- the second weighting factor of the b-th sub-band in the i-th subframe is calculated based on the ICLD of the adjacent one sub-band:
- the second weighting factor of the b-th sub-band in the i-th subframe is calculated based on the ICLD of the adjacent three sub-bands:
- Step 306 Perform quantization processing on the obtained ICLD weighting value.
- the quantization process is performed by comparing the preset codebook with the ICLD weight value to obtain a value in a preset codebook that is closest to the ICLD weighting value. For example, if the ICLD weighting value is 1.4, and the preset codebook includes 0 and 2 values, the value obtained by quantizing the ICLD weighting value is 2.
- the bit transmission is reduced, and when the quantization process is performed, the difference between the current ICLD weight value and the previous ICLD weight value may be calculated, and the calculated difference is performed.
- Quantitative processing For example, one sub-frame is divided into 15 sub-bands, and the ICLD weighting value of the first sub-band is directly quantized. When the ICLD weighting value of the second sub-band is quantized, the ICLD weighting value of the second sub-band is first calculated.
- the difference between the ICLD weighting values of the first sub-band, and then the calculated difference is quantized to obtain the quantized result of the ICLD weighting value of the second sub-band, which can be quantized according to the calculation of the ICLD weighting value of the second sub-band.
- the resulting method sequentially calculates the quantized results of the ICLD weighting values of the other sub-bands.
- the segmentation signal-to-noise ratio obtained by the prior art is obtained when there is no packet loss during stereo audio signal transmission (
- the SSNR obtained is 3.73 dB.
- the SSNR is 3.73 dB.
- the SSNR obtained by the prior art is 3.59 dB
- the obtained SSNR is 3.72 dB
- the test result is no packet loss when there is a packet loss during the transmission of the stereo audio signal. The test results obtained are similar.
- the embodiment of the invention provides a method for processing the inter-channel level difference, which performs weighting processing on the ICLD of any one subframe based on the inter-subframe correlation, and quantizes the ICLD weighting value, so that the coding bit can be effectively reduced.
- the rate and in the case of rapid signal changes and packet loss, can guarantee the quality of the decoded stereo audio, and achieve the desired effect.
- An embodiment of the present invention provides a method for processing an inter-channel level difference.
- an input quantized stereo audio bit stream is received, and the bit stream is parsed to obtain K (K > 12).
- the ICLDs of the subbands sequentially use the ICLD values of the respective subbands as the ICLDs of the respective subbands of the current sub-frames, thereby obtaining the decoded stereo audio signal.
- the ICLD of subband 1 obtained after parsing the received bit stream is A
- the ICLD of subband 2 is B
- the ICLD of the subband 3 is C or the like.
- the ICLD of the subband 1 of each subframe is A
- the ICLD of the subband 2 is B
- the ICLD of the subband 3 is C or the like.
- An inter-channel level difference processing method provided by an embodiment of the present invention, after decoding the received quantized stereo audio bit stream, can obtain higher quality stereo audio, and achieve low stereo audio signal Bit rate transmission.
- the embodiment of the present invention provides an inter-channel level difference processing device, which may be an encoding end for processing stereo audio. As shown in FIG. 5, the device includes: a receiving unit 501, an analyzing unit 502, and a calculating unit 503, a weighting processing unit 504, a second weighting processing unit 505;
- a receiving unit 501 configured to receive a stereo audio signal
- the parsing unit 502 is configured to parse the stereo audio signal frame by frame according to the stereo audio information received by the receiving unit 501, and obtain a channel of each sub-band of each sub-frame in each frame of the stereo audio signal.
- Inter-level difference ICLD wherein each frame includes at least two subframes;
- a stereo audio signal can be made up of many frames.
- each frame can be further divided into a plurality of subframes, and each subframe can be divided into a plurality of sub-bands.
- the number of subframes included in each frame is an even number, and each frame may include at least two subframes, for example, one subframe includes two subframes, or one frame. It includes 4 subframes, or 6 subframes in one frame, and so on. At least 12 sub-bands are included in each subframe.
- ICLD is used to distinguish the horizontal direction of the stereo source and describes the difference in intensity between the channels, which will affect the frequency content of the entire spectrum.
- the calculating unit 503 is configured to calculate, according to the ICLD of each sub-band of each subframe in each frame of the stereo audio signal obtained by the parsing unit 502, calculate each subframe in any frame of the stereo audio signal.
- a first weighting processing unit 504 configured to calculate any one of the first weighting manners when an absolute value of a difference between a sum of absolute values of ICLDs of any two of the frames is less than a preset threshold ICLD weighting value for each subband in the frame;
- a second weighting processing unit 505 configured to calculate, according to a second weighting manner, when the absolute value of the difference between the sum of the absolute values of the ICLDs of any two of the frames is not less than a preset threshold Every frame The ICLD weighting value of the subbands.
- the absolute value of the difference between the sum of the absolute values of the ICLDs of any two subframes included in any frame is not less than the preset threshold. It can be understood that: the absolute values of all the differences are greater than or equal to the preset threshold. , or the absolute value of the partial difference is less than the preset threshold, and the absolute value of the remaining difference is greater than or equal to the preset threshold.
- the calculation unit 503 calculates, according to the ICLD of each sub-band of each subframe in each frame of the stereo audio signal obtained by the parsing unit, each sub-frame of the stereo audio signal.
- the first weighting processing unit 504 adopts the first Before the weighted manner calculates the ICLD weighting value of each subband in any one of the frames, or when the absolute value of the difference between the sum of the absolute values of the ICLDs of any two of the frames is not less than a preset threshold
- the second weighting processing unit 505 calculates the ICLD weighting value of each subband in the any frame by using the second weighting manner, the difference between the absolute values of the ICLDs of any two subframes in the any frame may be determined. Whether the absolute value is less than the preset threshold.
- the first weighting processing unit 504 includes: a first calculating module 5041 and a second calculating module 5042.
- a first calculating module 5041 configured to calculate, according to the number of subframes included in the any frame, a first weighting factor of a b-th sub-band in the i-th subframe, where the i indicates any one of the frames included in the any one of the frames One subframe, 0 ⁇ i L, the L is the number of subframes included in the any frame, and L > 2, the b represents any one of the i-th subframes, 0 ⁇ b K , K represents the number of subbands included in any one subframe, K > 12;
- the first calculating module 5041 is configured to:
- a second calculating module 5042 configured to calculate, according to the first weighting factor and an ICLD of the bth subband in the i th subframe of any one of the frames, the bth sub The ICLD weighted value of the band.
- the second weighting processing unit 505 includes: a third calculating module 5051, a fourth calculating module 5052, and a fifth calculating module 5053.
- a third calculating module 5051 configured to calculate, according to the calculated sum of absolute values of ICLDs of each subframe in the any frame of the stereo audio signal, a second weighting of the b-th sub-band in the i-th subframe a factor, where i represents any one of the subframes included in any one of the frames, 0 ⁇ i L, the L is the number of subframes included in the any frame, and L>2, where b represents Any one of the i-th sub-frames, 0 ⁇ b ⁇ K, the K represents the number of sub-bands included in any one of the sub-frames, K>12; or, the fourth calculation module 5052 is configured to An ICLD of a b-th sub-band in an i-th subframe of a frame, and calculating a second weighting factor of the b-th sub-band in the i-th subframe;
- a fifth calculating module 5053 configured to calculate, according to the second weighting factor and an ICLD of the bth subband in the i th subframe of any one of the frames, the bth subunit in any one of the frames The ICLD weighted value of the band.
- the third calculating module 5051 is configured to:
- Fac 2 (i,b) sum(i) / Vsum(j)
- the sub-subband weighting processing manner is performed according to the IC sub-bands of the adjacent M sub-bands, and when the second adding factor weight of the b-th sub-band in the i-th subframe is calculated, the fourth calculating module 5052 uses In:
- one subframe can be divided into K subbands, for example, subband 1, subband 2, ... subband K, where 1, 2, ... K is the index number of the subband.
- the segmentation signal-to-noise ratio (SSNR) obtained by the prior art is 3.63 dB
- the present invention is used when there is no packet loss during the transmission of the stereo audio signal.
- the SSNR obtained was 3.73 dB after the method provided in the example. It should be noted that the larger the SSNR value, the closer the codec processing audio is to the original audio, that is, the better the effect. ldB ⁇ Thus, the method provided by the embodiment of the present invention is higher than the prior art.
- the SSNR obtained by the prior art is 3.59 dB
- the obtained SSNR is 3.72 dB
- the test result is no packet loss when there is a packet loss during the transmission of the stereo audio signal. The test results obtained are similar.
- An embodiment of the present invention provides an inter-channel level difference processing apparatus, which performs weighting processing on an ICLD of any one subframe by using inter-subframe correlation, and performs quantization processing on an ICLD weighting value, so that coding bits can be effectively reduced.
- the rate and in the case of rapid signal changes and packet loss, can guarantee the quality of the decoded stereo audio, and achieve the desired effect.
- the embodiment of the present invention provides an inter-channel level difference processing device, which may be an encoding end for processing stereo audio.
- the device includes: a receiver 701, a memory 702, and a processor 703. 701, configured to receive a stereo audio signal;
- a memory 702 configured to store information including a program
- the processor 703 is connected to the receiver 701 and the memory 702, and is configured to control the execution of the program, and specifically includes: parsing the stereo audio signal frame by frame according to the received stereo audio information, to obtain the stereo audio signal.
- the ICLD of each subband of each subframe calculating the sum of the absolute values of the ICLD of each subframe in any frame of the stereo audio signal; and the absolute value of the ICLD of any two subframes in any one of the frames
- the ICLD weight value of each subband in the any frame is calculated by using the first weighting manner; when the absolute of the ICLD of any two subframes in any one of the frames When the absolute value of the difference of the sum of the values is not less than the preset threshold, the ICLD weight value of each subband in the any frame is
- the absolute value of the difference between the sum of the absolute values of the ICLDs of any two subframes in any frame is not less than the preset threshold. It can be understood that: the absolute values of all the differences are greater than or equal to the preset threshold, or The absolute value of the partial difference is less than the preset threshold, and the absolute value of the remaining difference is greater than or equal to the preset threshold.
- the stereo audio signal can be composed of many frames. In this step, each frame can be divided into multiple subframes, and each subframe can be divided into multiple sub-bands.
- each frame may include at least two subframes, for example, one subframe includes two subframes, or one frame. It includes 4 subframes, or 6 subframes in one frame, and so on. At least 12 sub-bands are included in each subframe.
- ICLD is used to distinguish the horizontal direction of the stereo source and describes the difference in intensity between the channels, which will affect the frequency content of the entire spectrum.
- the processor 703 is further configured to calculate a sum of absolute values of ICLDs of each subframe in any frame of the stereo audio signal, and in any two of the subframes When the absolute value of the difference between the absolute values of the ICLDs is less than the preset threshold, the first weighting method is used to calculate the ICLD weight value of each subband in the any frame, or in any of the frames.
- the second weighting manner is used to calculate the ICLD weighting value of each subband in the any frame, and is used to determine the Whether the absolute value of the difference between the sum of the absolute values of the ICLDs of any two subframes in any frame is less than a preset threshold.
- the processor 703 calculates the ICLD weight value of each sub-band in the any frame by using the first weighting manner
- the processor calculates the i-th sub-frame according to the number of sub-frames included in any one of the frames.
- a first weighting factor of the bth subband in the frame the i indicating any one of the subframes included in the any frame, 0 ⁇ i ⁇ L, the L is the number of subframes included in the any frame
- the b represents any one of the i-th subframes, 0 ⁇ b K
- the K represents the number of sub-bands included in any one of the subframes, K > 12
- calculating by the first weighting factor and the ICLD of the b-th sub-band in the i-th subframe of the any frame, an ICLD weighting value of the b-th sub-band in the any frame.
- the processor 703 calculates the ICLD weight value of each subband in the any frame by using the second weighting manner
- the processor 703 is configured to use each of the frames according to the calculated stereo audio signal.
- Calculating a second weighting factor of the b-th sub-band in the i-th subframe where the i represents any one of the sub-frames included in the any one of the frames, 0 ⁇ i L, L is the number of subframes included in any of the frames, and L > 2
- the b represents any one of the i-th subframes, 0 ⁇ b ⁇ K
- the K represents any one of the subframes
- the processor 703 calculates, according to the sum of the absolute values of the ICLDs of each subframe in the any frame of the stereo audio signal, the second sub-band of the i-th subframe. For the second weighting factor,
- Fac 2 (i,b) sum(i) / Vsum(j)
- the processor 703 calculates, according to the ICLD of the b-th sub-band in the i-th subframe of the any frame, the second weighting factor of the b-th sub-band in the i-th subframe, to:
- one subframe may be divided into K subbands, for example, subband 1, subband 2, ... subband K, where 1, 2, ... K is the index number of the subband.
- the processor 703 calculates, according to the second weighting factor and the ICLD of the b-th sub-band in the i-th subframe of any one of the frames, the number of the any one of the frames When the ICLD weights of b subbands are used,
- the segmentation signal-to-noise ratio obtained by the prior art is obtained when there is no packet loss during stereo audio signal transmission (
- the SSNR obtained is 3.73 dB.
- the SSNR is 3.73 dB.
- the larger the SSNR value The closer the audio is decoded to the original audio, the better the effect. ldB ⁇
- the method provided by the embodiment of the present invention is higher than the prior art.
- the SSNR obtained by the prior art is 3.59 dB
- the obtained SSNR is 3.72 dB
- the test result is no packet loss when there is a packet loss during the transmission of the stereo audio signal.
- the test results obtained are similar.
- the embodiment of the present invention provides an inter-channel level difference processing apparatus, which performs weighting processing on an ICLD of any one subframe according to correlation between subframes, and performs quantization processing on the ICLD weighting value, so as to effectively reduce
- the encoding bit rate, and in the case of rapid signal changes and packet loss, can guarantee the decoding of stereo audio quality, and achieve the desired effect.
- the device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separated, and the components displayed as the cells may or may not be physical. Units can be located in one place, or they can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the objectives of the embodiments of the present invention. Those of ordinary skill in the art can understand and implement without any creative effort.
- the present invention can be implemented by means of software plus necessary general hardware, and of course, dedicated hardware, dedicated CPU, dedicated memory, dedicated memory, Special components are used to achieve this, but in many cases the former is a better implementation.
- the technical solution of the present invention which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a readable storage medium, such as a floppy disk of a computer.
- U disk mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Acces s Memory), disk or optical disk, etc., including a number of instructions to make a computer device (can It is a personal computer, a server, or a network device, etc.) that performs the methods described in various embodiments of the present invention.
- a computer device can It is a personal computer, a server, or a network device, etc.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereophonic System (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2016520245A JP6106336B2 (ja) | 2013-07-16 | 2014-01-04 | チャネル間レベル差処理方法及び装置 |
KR1020157032153A KR101730362B1 (ko) | 2013-07-16 | 2014-01-04 | 채널간 레벨 차이 계산 방법 및 장치 |
EP14826895.6A EP2977984B1 (en) | 2013-07-16 | 2014-01-04 | Method and device for processing inter-channel voltage level difference |
BR112015030201A BR112015030201A2 (pt) | 2013-07-16 | 2014-01-04 | método e aparelho de processamento de diferença de nível intercanal |
US14/931,975 US10002615B2 (en) | 2013-07-16 | 2015-11-04 | Inter-channel level difference processing method and apparatus |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310298100.2A CN104299615B (zh) | 2013-07-16 | 2013-07-16 | 一种声道间电平差处理方法及装置 |
CN201310298100.2 | 2013-07-16 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/931,975 Continuation US10002615B2 (en) | 2013-07-16 | 2015-11-04 | Inter-channel level difference processing method and apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015007075A1 true WO2015007075A1 (zh) | 2015-01-22 |
Family
ID=52319314
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2014/070131 WO2015007075A1 (zh) | 2013-07-16 | 2014-01-04 | 一种声道间电平差处理方法及装置 |
Country Status (7)
Country | Link |
---|---|
US (1) | US10002615B2 (zh) |
EP (1) | EP2977984B1 (zh) |
JP (1) | JP6106336B2 (zh) |
KR (1) | KR101730362B1 (zh) |
CN (1) | CN104299615B (zh) |
BR (1) | BR112015030201A2 (zh) |
WO (1) | WO2015007075A1 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10109284B2 (en) * | 2016-02-12 | 2018-10-23 | Qualcomm Incorporated | Inter-channel encoding and decoding of multiple high-band audio signals |
GB201909715D0 (en) | 2019-07-05 | 2019-08-21 | Nokia Technologies Oy | Stereo audio |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102027535A (zh) * | 2008-04-11 | 2011-04-20 | 诺基亚公司 | 信号处理 |
CN102157151A (zh) * | 2010-02-11 | 2011-08-17 | 华为技术有限公司 | 一种多声道信号编码方法、解码方法、装置和系统 |
US20110206209A1 (en) * | 2008-10-03 | 2011-08-25 | Nokia Corporation | Apparatus |
CN102598122A (zh) * | 2009-11-12 | 2012-07-18 | 皇家飞利浦电子股份有限公司 | 参量编码和解码 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE538469T1 (de) * | 2008-07-01 | 2012-01-15 | Nokia Corp | Vorrichtung und verfahren zum justieren von räumlichen hinweisinformationen eines mehrkanaligen audiosignals |
KR101710113B1 (ko) * | 2009-10-23 | 2017-02-27 | 삼성전자주식회사 | 위상 정보와 잔여 신호를 이용한 부호화/복호화 장치 및 방법 |
EP2375410B1 (en) * | 2010-03-29 | 2017-11-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | A spatial audio processor and a method for providing spatial parameters based on an acoustic input signal |
-
2013
- 2013-07-16 CN CN201310298100.2A patent/CN104299615B/zh active Active
-
2014
- 2014-01-04 KR KR1020157032153A patent/KR101730362B1/ko active IP Right Grant
- 2014-01-04 EP EP14826895.6A patent/EP2977984B1/en not_active Not-in-force
- 2014-01-04 JP JP2016520245A patent/JP6106336B2/ja not_active Expired - Fee Related
- 2014-01-04 WO PCT/CN2014/070131 patent/WO2015007075A1/zh active Application Filing
- 2014-01-04 BR BR112015030201A patent/BR112015030201A2/pt not_active IP Right Cessation
-
2015
- 2015-11-04 US US14/931,975 patent/US10002615B2/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102027535A (zh) * | 2008-04-11 | 2011-04-20 | 诺基亚公司 | 信号处理 |
US20110206209A1 (en) * | 2008-10-03 | 2011-08-25 | Nokia Corporation | Apparatus |
CN102598122A (zh) * | 2009-11-12 | 2012-07-18 | 皇家飞利浦电子股份有限公司 | 参量编码和解码 |
CN102157151A (zh) * | 2010-02-11 | 2011-08-17 | 华为技术有限公司 | 一种多声道信号编码方法、解码方法、装置和系统 |
Also Published As
Publication number | Publication date |
---|---|
BR112015030201A2 (pt) | 2017-07-25 |
EP2977984A4 (en) | 2016-06-29 |
KR101730362B1 (ko) | 2017-04-26 |
CN104299615B (zh) | 2017-11-17 |
US20160055856A1 (en) | 2016-02-25 |
US10002615B2 (en) | 2018-06-19 |
KR20150140802A (ko) | 2015-12-16 |
EP2977984A1 (en) | 2016-01-27 |
EP2977984B1 (en) | 2018-12-12 |
JP6106336B2 (ja) | 2017-03-29 |
JP2016522451A (ja) | 2016-07-28 |
CN104299615A (zh) | 2015-01-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7106711B2 (ja) | マルチチャネル信号符号化方法、マルチチャネル信号復号方法、エンコーダ、およびデコーダ | |
US9263050B2 (en) | Allocation, by sub-bands, of bits for quantifying spatial information parameters for parametric encoding | |
RU2439718C1 (ru) | Способ и устройство для обработки звукового сигнала | |
CN104485111B (zh) | 音频/语音编码装置、音频/语音解码装置及其方法 | |
US8831960B2 (en) | Audio encoding device, audio encoding method, and computer-readable recording medium storing audio encoding computer program for encoding audio using a weighted residual signal | |
ES2966665T3 (es) | Dispositivo y método de codificación de audio | |
CN112119457A (zh) | 可截断的预测编码 | |
EP3039675A1 (en) | Hybrid waveform-coded and parametric-coded speech enhancement | |
CN108140393B (zh) | 一种处理多声道音频信号的方法、装置和系统 | |
CN110462733B (zh) | 多声道信号的编解码方法和编解码器 | |
KR102288841B1 (ko) | 채널 간 위상차 파라미터 추출 방법 및 장치 | |
EP2705516B1 (en) | Encoding of stereophonic signals | |
EP3637417B1 (en) | Signal processing method and device | |
US9214158B2 (en) | Audio decoding device and audio decoding method | |
WO2015007075A1 (zh) | 一种声道间电平差处理方法及装置 | |
JP2020525847A (ja) | チャネル間位相差パラメータ符号化方法および装置 | |
CA2947360A1 (en) | Audio coding method and related apparatus | |
KR20200090856A (ko) | 오디오 인코딩 및 디코딩 방법 및 관련 제품 | |
CN114467138A (zh) | 音频编码和音频解码 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14826895 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2014826895 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 20157032153 Country of ref document: KR Kind code of ref document: A |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112015030201 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 2016520245 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 112015030201 Country of ref document: BR Kind code of ref document: A2 Effective date: 20151202 |