EP3588497B1 - Multi-channel signal encoding and decoding method and codec - Google Patents
Multi-channel signal encoding and decoding method and codec Download PDFInfo
- Publication number
- EP3588497B1 EP3588497B1 EP18776186.1A EP18776186A EP3588497B1 EP 3588497 B1 EP3588497 B1 EP 3588497B1 EP 18776186 A EP18776186 A EP 18776186A EP 3588497 B1 EP3588497 B1 EP 3588497B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- channel signal
- signal
- energy
- target
- downmixed
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims description 107
- 230000000694 effects Effects 0.000 description 15
- 238000010586 diagram Methods 0.000 description 12
- 230000005236 sound signal Effects 0.000 description 8
- 238000004364 calculation method Methods 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 5
- 238000010606 normalization Methods 0.000 description 5
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000009499 grossing Methods 0.000 description 3
- 238000004891 communication Methods 0.000 description 2
- 230000001427 coherent effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000036962 time dependent Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
Definitions
- This application relates to the audio encoding field, and more specifically, to a multi-channel signal encoding method, a multi-channel signal decoding method, an encoder, and a decoder.
- stereo audio provides a sense of orientation and a sense of distribution for each acoustic source, and provides improved clarity, intelligibility, and on-site feeling of sound. Therefore, stereo audio is very popular.
- Stereo processing technologies mainly include mid/side (Mid/Sid, MS) encoding, intensity stereo (Intensity Stereo, IS) encoding, parametric stereo (Parametric Stereo, PS) encoding, and the like.
- an encoder side when PS encoding is used to encode a channel signal, an encoder side performs spatial parameter analysis on a plurality of channel signals to obtain reverberation gain parameters and other spatial parameters of the plurality of channel signals, and encodes the reverberation gain parameters and the other spatial parameters of the plurality of channel signals, so that a decoder side can perform, based on the reverberation gain parameters of the channel signals during decoding, reverberation processing on the plurality of channel signals obtained through decoding, so as to improve auditory effects.
- Patent application EP2840811A1 relates to spatial audio coding and discloses a correlation-based time-dependent scaling method that determines an adequate amplitude of the late reverberation.
- the reverberation is based on a stereo downmix of the audio input signal adaptively scaled in amplitude.
- Patent application WO 2010/070016 A1 deals with applying reverb on multichannel downmixed signals by using different reverb impulse responses for each of the individual channels, after upmixing the audio.
- This application provides a multi-channel signal encoding method, a multi-channel signal decoding method, an encoder, and a decoder, so as to improve quality of a channel signal
- the scope of the protection is defined in accordance with a multichannel signal encoding method according to claim 1, a multichannel signal decoding method according to claim 8, a multichannel signal encoding device according to claim 14 and a multichannel signal decoding device according to claim 21. Further aspects are set forth in the dependent claims.
- a multi-channel signal encoding method includes: determining a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal; determining a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter; and quantizing the first channel signal and the second channel signal based on the downmixed signal and the target reverberation gain parameter, and writing a quantized first channel signal and a quantized second channel signal into a bitstream.
- the correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on a difference between energy of the first channel signal or energy of the second channel signal and energy of the downmixed signal, or may be determined based on a difference between an amplitude of the first channel signal or an amplitude of the second channel signal and an amplitude of the downmixed signal.
- the first channel signal, the second channel signal, and the downmixed signal are channel signals obtained after normalization processing.
- the determining a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter includes: determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal; and adjusting the initial reverberation gain parameter based on the target attenuation factor to obtain the target reverberation gain parameter.
- the initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- the correlations between the first channel signal, the second channel signal, and the downmixed signal can be conveniently measured by using the energy of the channel signal, that is, the target attenuation factor can be conveniently determined by comparing the difference between the energy of the channel signal and the energy of the downmixed signal.
- the difference between the energy of the first channel signal or the energy of the second channcl signal and the energy of the downmixed signal is relatively large (greater than a given threshold)
- it may be considered that the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal are relatively weak. In this case, a relatively large target attenuation factor may be determined.
- the difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal is relatively small (less than the given threshold)
- it may be considered that the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal are relatively strong.
- a relatively small target attenuation factor may be determined.
- the determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal may be calculating the target attenuation factor based on the correlations between the channel signals and the downmixed signal, or may be directly determining a preset attenuation factor as the target attenuation factor after the correlations between the channel signals and the downmixed signal are considered.
- each of the first channel signal and the second channel signal includes a plurality of frequency bins
- the determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal includes: determining difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determining the target attenuation factor based on the difference values.
- the difference between the energy of the first channel signal and the energy of the downmixed signal and the difference between the energy of the second channel signal and the energy of the downmixed signal can be conveniently determined by comparing the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins, and the attenuation factor is further determined. Therefore, it is unnecessary to compare differences between energy of the first channel signal and energy of the downmixed signal and differences between energy of the second channel signal and energy of the downmixed signal in all frequency bands.
- the determining difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins includes: determining a first difference value between the energy of the first channel signal and the energy of the downmixed signal, where the first difference value indicates a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins; and determining a second difference value between the energy of the second channel signal and the energy of the downmixed signal, where the second difference value indicates a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins; and the determining the target attenuation factor based on the difference values includes: determining the target attenuation factor based on a ratio between the first difference value and the second difference value
- the target attenuation factor may be directly determined based on the first difference value and the second difference value.
- the method before the determining the target attenuation factor based on the difference values, the method further includes: determining that the difference values are greater than a preset threshold.
- the target attenuation factor is determined, and the initial reverberation gain parameter is adjusted based on the target attenuation factor.
- the initial reverberation gain parameter may not be adjusted, thereby improving encoding efficiency.
- initial reverberation gain parameter of the plurality of channel signals may be directly determined as target reverberation gain parameter of the plurality of channel signals.
- the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- the energy of the downmixed signal can be calculated by using the energy of the first channel signal and the energy of the second channel signal, and a calculation process can be simplified without using the downmixed signal itself.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the multi-channel signal, and any subband corresponds to only one attenuation factor.
- a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- each of frequency bands in which the first channel signal and the second channel signal are located includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- a multi-channel signal encoding method includes: determining a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal; determining identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal, where the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; and quantizing the first channel signal and the second channel signal based on the downmixed signal, the initial reverberation gain parameter, and the identification information, and writing a quantized first channel signal and a quantized second channel signal into a bitstream.
- the correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on a difference between energy of the first channel signal or energy of the second channel signal and energy of the downmixed signal, or may be determined based on a difference between an amplitude of the first channel signal or an amplitude of the second channel signal and an amplitude of the downmixed signal.
- a channel signal whose initial reverberation gain parameter needs to be adjusted can be determined based on a correlation between the channel signal and the downmixed signal, so that a decoder side can first adjust initial reverberation gain parameter of some channel signals and then perform reverberation processing on these channel signals, thereby improving quality of a channel signal obtained after reverberation processing.
- the determining identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal includes: determining the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal.
- the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal can be conveniently measured by using the energy of the channel signals and the energy of the downmixed signal, so that a channel signal whose initial reverberation gain parameter needs to be adjusted can be conveniently determined.
- the determining the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal includes: determining a first difference value and a second difference value, where the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value.
- energy values of the first channel signal, the second channel signal, and the downmixed signal may be values obtained after normalization processing.
- the difference between the energy of the first channel signal and the energy of the downmixed signal and the difference between the energy of the second channel signal and the energy of the downmixed signal can be conveniently determined by comparing the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins, so as to determine a channel signal whose initial reverberation gain parameter needs to be adjusted. Therefore, it is unnecessary to compare differences between energy of the first channel signal and energy of the downmixed signal and differences between energy of the second channel signal and energy of the downmixed signal in all frequency bands.
- the determining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value includes: determining the larger difference value in the first difference value and the second difference value as a target difference value; and determining the identification information based on the target difference value, where the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted.
- the method further includes: determining a target attenuation factor based on the first difference value and the second difference value, where the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; and quantizing the target attenuation factor, and writing a quantized target attenuation factor into the bitstream.
- the initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- the target channel signal includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- the energy of the downmixed signal is estimated or deduced by using energy of a plurality of channel signals, which can reduce calculation.
- a multi-channel signal decoding method includes: obtaining a bitstream; determining a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, and identification information of the first channel signal and the second channel signal based on the bitstream, where the identification information indicates a channel signal that is in the first channel signal and the second channcl signal and whose initial reverberation gain parameter needs to be adjusted; determining, as a target channel signal based on the identification information, the channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; and adjusting the initial reverberation gain parameter of the target channel signal.
- the channel signal whose initial reverberation gain parameter needs to be adjusted can be determined by using the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before reverberation processing is performed on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- the adjusting an initial reverberation gain parameter of the target channel signal includes: determining a target attenuation factor; and adjusting the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal.
- the initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- the determining a target attenuation factor includes: determining a preset attenuation factor as the target attenuation factor.
- a process of determining the target attenuation factor can be simplified by presetting the attenuation factor, thereby improving decoding efficiency.
- the determining a target attenuation factor includes: obtaining the target attenuation factor based on the bitstream.
- the target attenuation factor may be directly obtained from the bitstream, and the process of determining the target attenuation factor can be also simplified, thereby improving decoding efficiency.
- the determining a target attenuation factor includes: obtaining an inter-channel level difference between the first channel signal and the second channel signal from the bitstream; and determining the target attenuation factor based on the inter-channel level difference, or determining the target attenuation factor based on the inter-channel level difference and the downmixed signal.
- the target attenuation factor can be more flexibly and accurately determined based on the inter-channel level difference, the downmixed signal, and the like, so that an initial reverberation gain parameter of a channel signal can be more accurately adjusted based on the attenuation factor.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- the target channel signal includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- an encoder includes a module or a unit configured to perform the method in the first aspect or various implementations of the first aspect.
- an encoder includes a module or a unit configured to perform the method in the second aspect or various implementations of the second aspect.
- a decoder includes a module or a unit configured to perform the method in the third aspect or various implementations of the third aspect.
- an encoder includes a memory and a processor, where the memory is configured to store a program, the processor is configured to execute the program, and when the program is executed, the processor performs the method in the first aspect or various implementations of the first aspect.
- an encoder includes a memory and a processor, where the memory is configured to store a program, the processor is configured to execute the program, and when the program is executed, the processor performs the method in the second aspect or various implementations of the second aspect.
- a decoder includes a memory and a processor, where the memory is configured to store a program, the processor is configured to execute the program, and when the program is executed, the processor performs the method in the third aspect or various implementations of the third aspect.
- a computer readable medium stores program code to be executed by a device, and the program code includes an instruction used to perform the method in the first aspect or various implementations of the first aspect.
- a computer readable medium stores program code to be executed by a device, and the program code includes an instruction used to perform the method in the second aspect or various implementations of the second aspect.
- a computer readable medium stores program code to be executed by a device, and the program code includes an instruction used to perform the method in the third aspect or various implementations of the third aspect.
- FIG. 1 shows a process of encoding a left-channel signal and a right-channel signal in the prior art.
- the encoding process shown in FIG. 1 specifically includes the following steps.
- step 110 specifically includes: performing spatial parameter analysis on the left-channel signal and the right-channel signal to obtain a spatial parameter of the left-channel signal and a spatial parameter of the right-channel signal; and performing downmixing processing on the left-channel signal and the right-channel signal to obtain a downmixed signal (where the downmixed signal obtained after downmixing processing is a mono audio signal, and the original two channels of audio signals are converted into one channel of audio signal through downmixing processing).
- the spatial parameter (may be also referred to as a spatial sensing parameter) includes an inter-channel correlation (Inter-channel Coherent, IC), an inter-channel level difference (Inter-channel Level Difference, ILD), an inter-channel time difference (Inter-channel Time Difference, ITD), an inter-channel phase difference (Inter-channel Phase Difference, IPD), and the like.
- IC Inter-channel Coherent
- ILD inter-channel level difference
- ITD inter-channel Time Difference
- IPD inter-channel Phase Difference
- the IC describes an inter-channel cross-correlation or coherence. This parameter determines sensing of a sound field range, and can improve spatial sense and sound stability of an audio signal.
- the ILD is used to distinguish a horizontal direction angle of a stereo source and describes an inter-channel intensity difference, and this parameter affects frequency components of an entire spectrum.
- the ITD and the IPD are spatial parameters representing horizontal directions of a sound source. They describe inter-channel time and phase differences. The parameters mainly affect frequency components below 2 kHz.
- the ITD may represent a time delay between a left-channel signal and a right-channel signal of a stereo
- the IPD may represent a waveform similarity of the left-channel signal and the right-channel signal of the stereo after time alignment.
- the ILD, the ITD, and the IPD can determine human ears' sensing of a location of a sound source, effectively determine a sound field location, and play an important role in stereo signal restoration.
- the bitstream obtained through encoding may be stored or transmitted to a decoder-side device.
- FIG. 2 shows a process of decoding a left-channel signal and a right-channel signal in the prior art.
- the decoding process shown in FIG. 2 specifically includes the following steps.
- the spatial parameters include an IC of the left-channel signal and the right-channel signal.
- the left-channel signal and the right-channel signal are obtained based on a decoded downmixed signal and the de-correlation signal of the current frame.
- left-channel signal and right-channel signal (respectively represented by L' and R' in FIG. 2 ) based on the spatial parameters, the left-channel signal, and the right-channel signal.
- left-channel signal and the right-channel signal (respectively represented by L' and R' in FIG. 2 ) in step 240 are obtained through decoding, and may be distorted to some extent compared with a left-channel signal and a right-channel signal that are encoded on an encoder side.
- the downmixed signal may be filtered, and then an inter-channel correlation parameter is used to correct a filtered downmixed signal to obtain a de-correlation signal.
- a purpose of generating the de-correlation signal is to improve a sense of reverberation of a finally generated stereo signal on a decoder side, and increase a sound field width of the stereo signal, so that an output audio signal is more mellow and full in terms of auditory sense.
- the sense of reverberation is essentially an effect of delaying such as reflecting and refracting an original audio signal differently and then superimposing the reflected and refracted audio signals on the original audio signal to enter a human ear.
- a correlation of different channel signals is not considered so as to adaptively adjust the IC.
- a relatively poor auditory effect may be caused.
- quality of a channel signal finally output by the decoder side is relatively poor.
- the embodiments of this application provide a multi-channel signal encoding or decoding method.
- a reverberation gain parameter can be correspondingly adjusted based on a correlation between different channel signals, and a de-correlation signal is corrected by using an adjusted reverberation gain parameter.
- reverberation processing is performed on different channel signals by using the de-correlation signal.
- the correlation between different channel signals is considered, so that quality of an output channel signal is better.
- FIG. 3 is a schematic flowchart of a multi-channel signal encoding method according to an embodiment of this application.
- the method in FIG. 3 may be performed by an encoder-side device or an encoder.
- the method in FIG. 3 includes the following steps.
- a sequence of determining the downmixed signal and determining the initial reverberation gain parameter is not limited, and the downmixed signal and the initial reverberation gain parameter may be determined simultaneously or successively.
- the initial reverberation gain parameter may be reverberation gain parameter obtained after spatial parameter analysis is performed on the first channel signal and the second channel signal.
- the downmixed signal may be obtained by performing downmixing processing on the plurality of channel signals.
- a spatial parameter of the first channel signal and a spatial parameter of the second channel signal are obtained by performing spatial parameter analysis on the first channel signal and the second channel signal, where the spatial parameters include the initial reverberation gain parameter of the first channel signal and the second channel signal.
- first channel signal and the second channel signal may correspond to a same spatial parameter, and correspondingly, the first channel signal and the second channel signal may also correspond to a same initial reverberation gain parameter. That is, the spatial parameter of the first channel signal and the spatial parameter of the second channel signal may be the same, and the initial reverberation gain parameter of the first channel signal and the second channel signal may also be the same.
- each of the first channel signal and the second channel signal includes 10 subbands, and each subband corresponds to one reverberation gain parameter, reverberation gain parameters corresponding to subbands, whose index values are the same, of the first channel signal and the second channel signal may be the same.
- first channel signal, the second channel signal, and the downmixed signal may be channel signals obtained after normalization processing.
- 320 Determine a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter.
- the correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on a difference between energy of the first channel signal or energy of the second channel signal and energy of the downmixed signal, or may be determined based on a difference between an amplitude of the first channcl signal or an amplitude of the second channcl signal and an amplitude of the downmixed signal.
- the correlation between the first channel signal and the downmixed signal is relative large.
- the difference between the energy or the amplitude of the first channel signal and the energy or the amplitude of the downmixed signal is relatively small.
- the difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal may be specifically a difference value between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal.
- the difference between the amplitude of the first channel signal or the amplitude of the second channel signal and the amplitude of the downmixed signal may be specifically a difference value between the amplitude of the first channel signal or the amplitude of the second channel signal and the amplitude of the downmixed signal.
- the correlation between the first channel signal or the second channel signal and the downmixed signal may alternatively refer to a difference between a phase, a period, or the like of the first channel signal or the second channel signal and a phase, a period, or the like of the downmixed signal.
- the multi-channel signal has more than two channel signals
- the multi-channel signal includes the first channel signal, the second channel signal, a third channel signal, and a fourth channel signal
- the first channel signal and the second channel signal may be processed by using the method in FIG. 3
- the third channel signal and the fourth channel signal are also processed by using the method in FIG. 3 .
- the determining a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter includes: determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal; and adjusting the initial reverberation gain parameter based on the target attenuation factor, to obtain the target reverberation gain parameter.
- the determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal may be calculating the target attenuation factor based on the correlations between the channel signals and the downmixed signal, or may be directly determining a preset attenuation factor as the target attenuation factor after the correlations between the channel signals and the downmixed signal are considered.
- the initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- a target attenuation factor with a relatively small value may be determined.
- a target attenuation factor with a relatively large value may be determined.
- correlations between the plurality of channel signals and the downmixed signal may refer to differences between energy of the plurality of channel signals and the energy of the downmixed signal, or differences between amplitudes of the plurality of channel signals and the amplitude of the downmixed signal.
- the differences between the energy of the plurality of channel signals and the energy of the downmixed signal may be specifically difference values between the energy of the plurality of channel signals and the energy of the downmixed signal.
- the differences between the amplitudes of the plurality of channel signals and the amplitude of the downmixed signal may be specifically difference values between the amplitudes of the plurality of channel signals and the amplitude of the downmixed signal.
- the correlations between the plurality of channel signals and the downmixed signal may alternatively refer to differences between phases, periods, or the like of the plurality of channel signals and the phase, the period, or the like of the downmixed signal.
- the correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on the difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal, and further the target attenuation factor is determined.
- the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal can be conveniently measured by using the energy of the channel signals and the energy of the downmixed signal, that is, the target attenuation factor can be conveniently determined by comparing the difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal.
- both the first channel signal and the second channel signal include a plurality of frequency bins
- the determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal includes: determining difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determining the target attenuation factor based on the difference values.
- the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins may be difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of same frequency bins.
- the first channel signal includes three frequency bins (a first frequency channel number, a second frequency channel number, and a third frequency channel number).
- difference values between energy of the first channel signal and energy of the downmixed signal at the three frequency bins are specifically a difference value between the first channel signal and the downmixed signal at the first frequency channel number, a difference value between the first channel signal and the downmixed signal at the second frequency channel number, and a difference value between the first channel signal and the downmixed signal at the third frequency channel number.
- the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins may be difference values between energy of the second channel signal and energy of the downmixed signal at a plurality of same frequency bins.
- the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins may be a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins.
- the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins may be a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins.
- energy values of the first channel signal, the second channel signal, and the downmixed signal may be values obtained after normalization processing.
- the difference between the energy of the first channel signal and the energy of the downmixed signal and the difference between the energy of the second channel signal and the energy of the downmixed signal can be conveniently determined by comparing the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins, and the attenuation factor is further determined. Therefore, it is unnecessary to compare differences between energy of the first channel signal and energy of the downmixed signal and differences between energy of the second channel signal and energy of the downmixed signal in all frequency bands.
- the determining difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins includes: determining a first difference value between the energy of the first channel signal and the energy of the downmixed signal, where the first difference value indicates a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins; determining a second difference value between the energy of the second channel signal and the energy of the downmixed signal, where the second difference value indicates a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins; and determining the target attenuation factor based on the first difference value and the second difference value.
- the determining the target attenuation factor based on the first difference value and the second difference value may include: determining the target attenuation factor based on a ratio between the first difference value and the second difference value.
- the ratio between the first difference value and the second difference value may be directly determined as the target attenuation factor.
- the first difference value is a
- the second difference value is b.
- some smoothing processing may be performed on the target attenuation factor and an attenuation factor of a previous frame, and a target attenuation factor obtained after smoothing processing is used to further adjust the initial reverberation gain parameter of the plurality of channel signals.
- the method in FIG. 3 further includes: determining that the difference values are greater than a preset threshold.
- the difference values are greater than the preset threshold herein may mean that the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal are greater than a same preset threshold, or may mean that the difference between the energy of the first channel signal and the energy of the downmixed signal is greater than a preset first threshold, and the difference between the energy of the second channel signal and the energy of the downmixed signal is greater than a preset second threshold.
- the target attenuation factor is determined, and the initial reverberation gain parameter is adjusted based on the target attenuation factor.
- the initial reverberation gain parameter may not be adjusted, thereby improving encoding efficiency.
- the difference value between the energy of the first channel signal and the energy of the downmixed signal is greater than M (where M is between 0.5 and 1) times the energy of the first channel signal
- M is between 0.5 and 1 times the energy of the first channel signal
- the preset threshold is M times the energy of the first channel signal.
- a ratio of the difference value between the energy of the first channel signal and the energy of the downmixed signal to the energy of the first channel signal is greater than M, it may also be considered that the difference value between the energy of the first channel signal and the energy of the downmixed signal is greater than the preset threshold.
- initial reverberation gain parameter of the plurality of channel signals may be directly determined as target reverberation gain parameter of the plurality of channel signals.
- the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- the energy of the downmixed signal can be calculated by using the energy of the first channel signal and the energy of the second channel signal, and a calculation process can be simplified without using the downmixed signal itself.
- the energy of the downmixed signal may alternatively be directly calculated based on the downmixed signal itself.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the multi-channel signal, and any subband corresponds to only one attenuation factor.
- indexes of subbands included in each of the first channel signal and the second channel signal are 0 to 9.
- Both the first channel signal and the second channel signal include 10 reverberation gain parameters, each subband corresponds to one reverberation gain parameter, the target attenuation factor includes five attenuation factors, and each attenuation factor corresponds to two subbands; or the target attenuation factor includes 10 attenuation factors, and each attenuation factor corresponds to one subband.
- a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor. For example, reverberation gain parameters corresponding to subbands, whose indexes are 0 to 4, of a plurality of channel signals need to be adjusted slightly, but reverberation gain parameters corresponding to subbands, whose indexes are 5 to 9, of a channel signal need to be adjusted greatly.
- relatively small attenuation factors may be set for the reverberation gain parameters corresponding to the subbands whose indexes are 0 to 4, and relatively large attenuation factors are set for the reverberation gain parameters corresponding to the subbands whose indexes are 5 to 9.
- each of the first channel signal and the second channel signal (where a frequency band occupied by the first channel signal and a frequency band occupied by the second channel signal are the same) includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- each of frequency bands in which the first channel signal and the second channel signal are located includes a low frequency part and a high frequency part
- the target attenuation factor includes a plurality of attenuation factors.
- the low frequency part corresponds to at least one attenuation factor
- the high frequency part corresponds to at least one attenuation factor
- the attenuation factor corresponding to the low frequency part is less than the attenuation factor corresponding to the high frequency part.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- FIG. 4 is a schematic flowchart of a multi-channel signal encoding method according to an embodiment of this application.
- channel signals include a left-channel signal and a right-channel signal, and a process of encoding the left-channel signal and the right-channel signal specifically includes the following steps.
- the spatial parameters include initial reverberation gain parameter of the left-channel signal and the right-channel signal, and another spatial parameter.
- each of the left-channel signal and the right-channel signal may be divided into a high frequency part and a low frequency part, and difference values between energy of the left-channel signal and energy of the downmixed signal and between energy of the right-channel signal and energy of the downmixed signal at the high frequency part are determined as the difference values between the energy of the left-channel signal and the energy of the downmixed signal and between the energy of the right-channel signal and the energy of the downmixed signal.
- Adjust reverberation gain parameters of the left-channel signal and the right-channel signal based on the difference values between the energy of the left-channel signal and the energy of the downmixed signal and between the energy of the right-channel signal and the energy of the downmixed signal.
- an encoder side may determine a target attenuation factor based on the difference values between the energy of the left-channel signal and the energy of the downmixed signal and between the energy of the right-channel signal and the energy of the downmixed signal, and adjust the reverberation gain parameters of the left-channel signal and the right-channel signal based on the target attenuation factor.
- FIG. 5 is a schematic flowchart of a multi-channel signal decoding method according to an embodiment of this application.
- channel signals include a left-channel signal and a right-channel signal.
- the bitstream generated through encoding in the encoding method in FIG. 4 may be decoded.
- a decoding process in FIG. 5 specifically includes the following steps:
- the spatial parameter includes a reverberation gain parameter adjusted by an encoder side, that is, the encoder side encodes the adjusted reverberation gain parameter.
- a decoder side obtains the reverberation gain parameter adjusted by the encoder side.
- Step 520 and step 530 are not performed in a sequence, and may be performed simultaneously.
- the reverberation gain parameter based on which reverberation processing is performed on the left-channel signal and the right-channel signal has been adjusted based on correlations between the left-channel signal and the downmixed signal and between the right-channel signal and the downmixed signal.
- corresponding reverberation processing can be performed based on a difference between the left-channel signal and the right-channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- the encoder side determines whether an initial reverberation gain parameter of a channel signal needs to be adjusted. If the initial reverberation gain parameter of the channel signal needs to be adjusted, the encoder side adjusts the initial reverberation gain parameter of the channel signal, and encodes an adjusted reverberation gain parameter, so that the decoder side directly performs reverberation processing based on a reverberation gain parameter obtained through decoding.
- the encoder side may alternatively determine only whether the initial reverberation gain parameter of the channel signal needs to be adjusted. If the initial reverberation gain parameter of the channel signal needs to be adjusted, the encoder side sends corresponding indication information to the decoder side. After receiving the indication information, the decoder side adjusts the initial reverberation gain parameter of the channel signal.
- FIG. 6 is a schematic flowchart of a multi-channel signal encoding method according to an embodiment of this application. The method in FIG. 6 includes the following steps.
- the downmixed signal may be obtained by performing downmixing processing on the first channel signal and the second channel signal, and spatial parameters are obtained by performing spatial parameter analysis on the first channcl signal and the second channcl signal, where the spatial parameters include the initial reverberation gain parameter of the first channel signal and the second channel signal.
- the downmixed signal and the initial reverberation gain parameter may be determined simultaneously or successively.
- first channel signal and the second channel signal may correspond to a same spatial parameter, and specifically, the first channel signal and the second channel signal also correspond to a same initial reverberation gain parameter. That is, a spatial parameter of the first channel signal and a spatial parameter of the second channel signal are the same, and the initial reverberation gain parameter of the first channel signal and the second channel signal are the same.
- each of the first channel signal and the second channel signal includes 10 subbands, and each subband corresponds to one reverberation gain parameter, reverberation gain parameters corresponding to subbands, whose index values are the same, of the first channel signal and the second channel signal may be the same.
- the correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on a difference between energy of the first channel signal or energy of the second channel signal and energy of the downmixed signal, or may be determined based on a difference between an amplitude of the first channel signal or an amplitude of the second channel signal and an amplitude of the downmixed signal.
- the correlation between the first channel signal and the downmixed signal is relative large.
- the difference between the energy or the amplitude of the first channel signal and the energy or the amplitude of the downmixed signal is relatively small.
- the difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal may be specifically a difference value between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal.
- the difference between the amplitude of the first channel signal or the amplitude of the second channel signal and the amplitude of the downmixed signal may be specifically a difference value between the amplitude of the first channel signal or the amplitude of the second channel signal and the amplitude of the downmixed signal.
- the correlation between the first channel signal or the second channel signal and the downmixed signal may alternatively refer to a difference between a phase, a period, or the like of the first channel signal or the second channel signal and a phase, a period, or the like of the downmixed signal.
- the first channel signal, the second channel signal, and the downmixed signal may be channel signals obtained after normalization processing.
- the identification information may indicate that the first channel signal or the second channel signal is a channel signal whose initial reverberation gain parameter needs to be adjusted, or may indicate that the first channel signal and the second channel signal are channel signals whose initial reverberation gain parameters need to be adjusted, or may indicate that a reverberation gain parameter does not need to be adjusted for both the first channel signal and the second channel signal.
- the identification information may indicate, by using a value of an identifier field, a channel signal that is in a plurality of channel signals and whose initial reverberation gain parameter needs to be adjusted.
- the identifier field of the identification information occupies two bits. When the value of the identifier field is 00, it indicates that neither the initial reverberation gain parameter of the first channel signal nor the initial reverberation gain parameter of the second channel signal needs to be adjusted. When the value of the identifier field is 01, it indicates that only the initial reverberation gain parameter of the first channel signal needs to be adjusted. When the value of the identifier field is 10, it indicates that only the initial reverberation gain parameter of the second channel signal needs to be adjusted. When the value of the identifier field is 11, it indicates that both the initial reverberation gain parameter of the first channel signal and the second channel signal need to be adjusted.
- the determining identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal includes: determining the identification information of the first channel signal and the second channel signal based on correlations between the energy of the first channel signal and the energy of the downmixed signal and between the energy of the second channel signal and the energy of the downmixed signal.
- the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal can be conveniently measured by using the energy of the channel signals and the energy of the downmixed signal, so that a channel signal whose initial reverberation gain parameter needs to be adjusted can be conveniently determined.
- the energy or amplitude of the downmixed signal may be calculated based on the energy of the first channel signal and the energy of the second channel signal, thereby simplifying a calculation process.
- the energy of the downmixed signal may be directly calculated based on the downmixed signal itself.
- the channel signal can be determined as a channel signal whose initial reverberation gain parameter needs to be adjusted, when the energy of the channel signal is greatly different from the energy of the downmixed signal. Therefore, a decoder side can first adjust an initial reverberation gain parameter of the channel signal and then perform reverberation processing on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- the determining the identification information of the first channel signal and the second channel signal based on correlations between the energy of the first channel signal and the energy of the downmixed signal and between the energy of the second channel signal and the energy of the downmixed signal includes: determining a first difference value and a second difference value, where the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value.
- the difference between the energy of the first channel signal and the energy of the downmixed signal and the difference between the energy of the second channel signal and the energy of the downmixed signal can be conveniently determined by comparing the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins, so as to determine a channel signal whose initial reverberation gain parameter needs to be adjusted. Therefore, it is unnecessary to compare differences between energy of the first channel signal and energy of the downmixed signal and differences between energy of the second channel signal and energy of the downmixed signal in all frequency bands.
- the determining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value includes: determining the larger difference value in the first difference value and the second difference value as a target difference value; and determining the identification information based on the target difference value, where the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted.
- the first channel signal may be determined as a channel signal whose initial reverberation gain parameter needs to be adjusted.
- the determining the identification information of the first channel signal and the second channel signal based on the sum of the absolute values of the difference values between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins includes: generating first identification information when the sum of the absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins is greater than the preset threshold, where the first identification information indicates that the initial reverberation gain parameter of the first channel signal needs to be adjusted; and generating second identification information when the sum of the absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins is greater than the preset threshold, where the second identification information indicates that the initial reverberation gain parameter of the second channel signal needs to be adjusted.
- the channel signal can be determined as a channel signal whose initial reverberation gain parameter needs to be adjusted, when the energy of the channel signal is greatly different from the energy of the downmixed signal. Therefore, a decoder side can first adjust an initial reverberation gain parameter of the channel signal and then perform reverberation processing on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- the identification information of the first channel signal and the second channel signal may be one piece of identification information or two pieces of identification information.
- the identification information of the first channel signal and the second channel signal may be one piece of identification information, and the identification information indicates that both the initial reverberation gain parameter of the first channel signal and the second channel signal need to be adjusted.
- the identification information of the first channel signal and the second channel signal is two pieces of identification information: first identification information and second identification information respectively, the first identification information indicates that the initial reverberation gain parameter of the first channel signal needs to be adjusted, and the second identification information indicates that the initial reverberation gain parameter of the second channel signal needs to be adjusted.
- first identification information indicates that the initial reverberation gain parameter of the first channel signal needs to be adjusted
- second identification information indicates that the initial reverberation gain parameter of the second channel signal needs to be adjusted.
- the method in FIG. 6 further includes: determining a target attenuation factor based on the first difference value and the second difference value, where the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; and quantizing the target attenuation factor, and writing a quantized target attenuation factor into the bitstream.
- the initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- first difference value and the second difference value may be calculated by referring to Formula (1) and Formula (2) in the foregoing.
- the target attenuation factor may be determined based on a ratio between the first difference value and the second difference value.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- the multi-channel signal includes a plurality of subbands, and adjacent subbands may correspond to one attenuation factor.
- a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- the target channel signal includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- a frequency band in which the target channel signal is located includes a low frequency part and a high frequency part, and the target attenuation factor includes a plurality of attenuation factors.
- the low frequency part corresponds to at least one attenuation factor
- the high frequency part corresponds to at least one attenuation factor
- the attenuation factor corresponding to the low frequency part is less than the attenuation factor corresponding to the high frequency part.
- the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- the energy of the downmixed signal can be calculated by using the energy of the first channel signal and the energy of the second channel signal, and a calculation process can be simplified without using the downmixed signal itself.
- FIG. 7 is a schematic flowchart of a multi-channel signal decoding method according to an embodiment of this application.
- the method in FIG. 7 may be performed by a decoder-side device or a decoder.
- the method in FIG. 7 specifically includes the following steps:
- the channel signal whose initial reverberation gain parameter needs to be adjusted can be determined by using the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before reverberation processing is performed on the channel signal, thereby improving quality of a 2 channel signal obtained after reverberation processing.
- the adjusting an initial reverberation gain parameter of the target channel signal includes: determining a target attenuation factor, and adjusting the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal.
- the initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a size of a correlation between the channel signal and the downmixed signal by using the attenuation factor.
- the decoder side may determine a preset attenuation factor as the target attenuation factor.
- the decoder side directly adjusts the initial reverberation gain parameter of the target channel signal based on a preset attenuation factor.
- a process of determining the target attenuation factor can be simplified by presetting the attenuation factor, thereby improving decoding efficiency.
- the decoder side may obtain the target attenuation factor from bitstreams of a plurality of channel signals, that is, obtain the target attenuation factor by decoding the bitstreams of the plurality of channel signals.
- an encoder side has determined the target attenuation factor, and encodes the target attenuation factor to obtain and transmit the bitstream to the decoder side. In this way, the decoder side does not need to calculate the target attenuation factor any more, but directly decodes the bitstream to obtain the target attenuation factor.
- the target attenuation factor may be directly obtained from the bitstream, and the process of determining the target attenuation factor can be also simplified, thereby improving decoding efficiency.
- the determining a target attenuation factor specifically includes: obtaining an inter-channel level difference between the first channel signal and the second channel signal from the bitstream; and determining the target attenuation factor based on the inter-channel level difference, or determining the target attenuation factor based on the inter-channel level difference and the downmixed signal.
- the target attenuation factor can be more flexibly and accurately determined based on the inter-channel level difference, the downmixed signal, and the like, so that an initial reverberation gain parameter of a channel signal can be more accurately adjusted based on the attenuation factor.
- the inter-channel level difference when the inter-channel level difference is relatively large, it may be considered that a difference between the first channel signal and the second channel signal is relatively large, and a correlation between the first channel signal and the second channel signal is relatively small.
- an attenuation factor with a relatively large value may be determined as the target attenuation factor.
- the target attenuation factor when the target attenuation factor is being determined based on the downmixed signal, the target attenuation factor may be determined by using periodicity and harmonicity of the downmixed signal. For example, when the periodicity or the harmonicity of the downmixed signal is good, it may be considered that the difference between the first channel signal and the second channel signal is relatively small, and the correlation between the first channel signal and the second channel signal is relatively large. In this case, an attenuation factor with a relatively small value may be determined as the target attenuation factor.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- each of the first channel signal and the second channel signal includes a plurality of subbands, and a plurality of adjacent subbands may correspond to one attenuation factor.
- a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- the target channel signal includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- a frequency band in which the target channel signal is located includes a low frequency part and a high frequency part, and the target attenuation factor includes a plurality of attenuation factors.
- the low frequency part corresponds to at least one attenuation factor
- the high frequency part corresponds to at least one attenuation factor
- the attenuation factor corresponding to the low frequency part is less than the attenuation factor corresponding to the high frequency part.
- FIG. 8 is a schematic block diagram of an encoder according to an embodiment of this application.
- An encoder 800 in FIG. 8 includes:
- the encoder 800 may correspond to the multi-channel signal encoding method in FIG. 3 , and the encoder 800 may perform the multi-channel signal encoding method in FIG. 3 .
- the processing unit 810 is specifically configured to determine a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal; and adjust the initial reverberation gain parameter based on the target attenuation factor to obtain the target reverberation gain parameter.
- each of the first channel signal and the second channel signal includes a plurality of frequency bins
- the processing unit 810 is specifically configured to: determine difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins, and determine the target attenuation factor based on the difference values.
- the processing unit 810 is specifically configured to: determine a first difference value between the energy of the first channel signal and the energy of the downmixed signal, where the first difference value indicates a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins; determine a second difference value between the energy of the second channel signal and the energy of the downmixed signal, where the second difference value indicates a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins; and determine the target attenuation factor based on a ratio between the first difference value and the second difference value.
- the processing unit 810 before determining the target attenuation factor based on the difference values, is further specifically configured to: determine that the difference values are greater than a preset threshold.
- the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the multi-channel signal, and any subband corresponds to only one attenuation factor.
- FIG. 9 is a schematic block diagram of an encoder according to an embodiment of this application.
- An encoder 900 in FIG. 9 includes:
- a channel signal whose initial reverberation gain parameter needs to be adjusted can be determined based on a correlation between the channel signal and the downmixed signal, so that a decoder side can first adjust initial reverberation gain parameter of some channel signals and then perform reverberation processing on these channel signals, thereby improving quality of a channel signal obtained after reverberation processing.
- the encoder 900 may correspond to the multi-channel signal encoding method in FIG. 6 , and the encoder 900 may perform the multi-channel signal encoding method in FIG. 6 .
- the processing unit 910 is specifically configured to determine the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal.
- the processing unit 910 is specifically configured to: determine a first difference value and a second difference value, where the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determine the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value.
- the processing unit 910 is specifically configured to determine the larger difference value in the first difference value and the second difference value as a target difference value, and determine the identification information based on the target difference value, where the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted.
- the processing unit 910 is further specifically configured to: determine a target attenuation factor based on the first difference value and the second difference value, where the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; and quantize the target attenuation factor, and write a quantized target attenuation factor into the bitstream.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- FIG. 10 is a schematic block diagram of a decoder according to an embodiment of this application.
- a decoder 1000 in FIG. 10 includes:
- the channel signal whose initial reverberation gain parameter needs to be adjusted can be determined by using the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before reverberation processing is performed on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- the decoder 1000 may correspond to the multi-channel signal decoding method in FIG. 7 , and the decoder 1000 may perform the multi-channel signal decoding method in FIG. 7 .
- the processing unit 1020 is specifically configured to determine a target attenuation factor, and adjust the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal.
- the processing unit 1020 is specifically configured to determine a preset attenuation factor as the target attenuation factor.
- the processing unit 1020 is specifically configured to obtain the target attenuation factor based on the bitstream.
- the processing unit 1020 is specifically configured to obtain an inter-channel level difference between the first channel signal and the second channel signal from the bitstream, and determine the target attenuation factor based on the inter-channel level difference, or determine the target attenuation factor based on the inter-channel level difference and the downmixed signal.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- FIG. 11 is a schematic block diagram of an encoder according to an embodiment of this application.
- An encoder 1100 in FIG. 11 includes:
- the encoder 1100 may correspond to the multi-channel signal encoding method in FIG. 3 , and the encoder 1100 may perform the multi-channel signal encoding method in FIG. 3 .
- the processor 1120 is specifically configured to determine a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal; and adjust the initial reverberation gain parameter based on the target attenuation factor to obtain the target reverberation gain parameter.
- each of the first channel signal and the second channel signal includes a plurality of frequency bins
- the processor 1120 is specifically configured to: determine difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins, and determine the target attenuation factor based on the difference values.
- the processor 1120 is specifically configured to: determine a first difference value between the energy of the first channel signal and the energy of the downmixed signal, where the first difference value indicates a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins; determine a second difference value between the energy of the second channel signal and the energy of the downmixed signal, where the second difference value indicates a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins; and determine the target attenuation factor based on a ratio between the first difference value and the second difference value.
- the processor 1120 before determining the target attenuation factor based on the difference values, is further specifically configured to: determine that the difference values are greater than a preset threshold.
- the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the multi-channel signal, and any subband corresponds to only one attenuation factor.
- FIG. 12 is a schematic block diagram of an encoder according to an embodiment of this application.
- An encoder 1200 in FIG. 12 includes:
- a channel signal whose initial reverberation gain parameter needs to be adjusted can be determined based on a correlation between the channel signal and the downmixed signal, so that a decoder side can first adjust initial reverberation gain parameter of some channel signals and then perform reverberation processing on these channel signals, thereby improving quality of a channel signal obtained after reverberation processing.
- the encoder 1200 may correspond to the multi-channel signal encoding method in FIG. 6 , and the encoder 1200 may perform the multi-channel signal encoding method in FIG. 6 .
- the processor 1220 is specifically configured to determine the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal.
- the processor 1220 is specifically configured to: determine a first difference value and a second difference value, where the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determine the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value.
- the processor 1220 is specifically configured to determine the larger difference value in the first difference value and the second difference value as a target difference value, and determine the identification information based on the target difference value, where the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted.
- the processor 1220 is further specifically configured to: determine a target attenuation factor based on the first difference value and the second difference value, where the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; and quantize the target attenuation factor, and write a quantized target attenuation factor into the bitstream.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- FIG. 13 is a schematic block diagram of a decoder according to an embodiment of this application.
- a decoder 1300 in FIG. 13 includes:
- the channel signal whose initial reverberation gain parameter needs to be adjusted can be determined by using the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before reverberation processing is performed on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- the decoder 1300 may correspond to the multi-channel signal decoding method in FIG. 7 , and the decoder 1300 may perform the multi-channel signal decoding method in FIG. 7 .
- the processor 1320 is specifically configured to determine a target attenuation factor, and adjust the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal.
- the processor 1320 is specifically configured to determine a preset attenuation factor as the target attenuation factor.
- the processor 1320 is specifically configured to obtain the target attenuation factor based on the bitstream.
- the processor 1320 is specifically configured to obtain an inter-channel level difference between the first channel signal and the second channel signal from the bitstream, and determine the target attenuation factor based on the inter-channel level difference, or determine the target attenuation factor based on the inter-channel level difference and the downmixed signal.
- the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- the disclosed system, apparatus, and method may be implemented in other manners.
- the described apparatus embodiment is merely an example.
- the unit division is merely logical function division and may be other division in actual implementation.
- a plurality of units or components may be combined or integrated into another system, or some features may be ignored or not performed.
- the displayed or discussed mutual couplings or direct couplings or communication connections may be implemented by using some interfaces.
- the indirect couplings or communication connections between the apparatuses or units may be implemented in electronic, mechanical, or other forms.
- the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one position, or may be distributed on a plurality of network units. Some or all of the units may be selected based on actual requirements to achieve the objectives of the solutions of the embodiments.
- the functions When the functions are implemented in the form of a software functional unit and sold or used as an independent product, the functions may be stored in a computer-readable storage medium. Based on such an understanding, the technical solutions of this application essentially, or the part contributing to the prior art, or some of the technical solutions may be implemented in a form of a software product.
- the computer software product is stored in a storage medium, and includes several instructions for instructing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of this application.
- the foregoing storage medium includes: any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk, or an optical disc.
- program code such as a USB flash drive, a removable hard disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk, or an optical disc.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Description
- This application relates to the audio encoding field, and more specifically, to a multi-channel signal encoding method, a multi-channel signal decoding method, an encoder, and a decoder.
- As living quality is improved, people have increasing demands on high-quality audio. Compared with mono audio, stereo audio provides a sense of orientation and a sense of distribution for each acoustic source, and provides improved clarity, intelligibility, and on-site feeling of sound. Therefore, stereo audio is very popular.
- Stereo processing technologies mainly include mid/side (Mid/Sid, MS) encoding, intensity stereo (Intensity Stereo, IS) encoding, parametric stereo (Parametric Stereo, PS) encoding, and the like.
- In the prior art, when PS encoding is used to encode a channel signal, an encoder side performs spatial parameter analysis on a plurality of channel signals to obtain reverberation gain parameters and other spatial parameters of the plurality of channel signals, and encodes the reverberation gain parameters and the other spatial parameters of the plurality of channel signals, so that a decoder side can perform, based on the reverberation gain parameters of the channel signals during decoding, reverberation processing on the plurality of channel signals obtained through decoding, so as to improve auditory effects. However, in some cases, for example, when a correlation between a plurality of channel signals is relatively low, worse auditory effects are caused when reverberation processing is performed, based on reverberation gain parameters corresponding to the plurality of channcl signals, on the plurality of channcl signals obtained through decoding Patent application
EP2840811A1 relates to spatial audio coding and discloses a correlation-based time-dependent scaling method that determines an adequate amplitude of the late reverberation. The reverberation is based on a stereo downmix of the audio input signal adaptively scaled in amplitude. Patent applicationWO 2010/070016 A1 deals with applying reverb on multichannel downmixed signals by using different reverb impulse responses for each of the individual channels, after upmixing the audio. - This application provides a multi-channel signal encoding method, a multi-channel signal decoding method, an encoder, and a decoder, so as to improve quality of a channel signal The scope of the protection is defined in accordance with a multichannel signal encoding method according to claim 1, a multichannel signal decoding method according to claim 8, a multichannel signal encoding device according to claim 14 and a multichannel signal decoding device according to claim 21. Further aspects are set forth in the dependent claims.
- According to a first aspect, a multi-channel signal encoding method is provided, where the method includes: determining a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal; determining a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter; and quantizing the first channel signal and the second channel signal based on the downmixed signal and the target reverberation gain parameter, and writing a quantized first channel signal and a quantized second channel signal into a bitstream.
- In this application, when a target reverberation gain parameter of a channel signal is being determined, a correlation between the channel signal and the downmixed signal is considered. In this way, a better processing effect can be obtained when reverberation processing is performed on the channel signal based on the target reverberation gain parameter, thereby improving quality of a channel signal obtained after reverberation processing.
- The correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on a difference between energy of the first channel signal or energy of the second channel signal and energy of the downmixed signal, or may be determined based on a difference between an amplitude of the first channel signal or an amplitude of the second channel signal and an amplitude of the downmixed signal.
- With reference to the first aspect, in some implementations of the first aspect, the first channel signal, the second channel signal, and the downmixed signal are channel signals obtained after normalization processing.
- With reference to the first aspect, in some implementations of the first aspect, the determining a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter includes: determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal; and adjusting the initial reverberation gain parameter based on the target attenuation factor to obtain the target reverberation gain parameter.
- The initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- The correlations between the first channel signal, the second channel signal, and the downmixed signal can be conveniently measured by using the energy of the channel signal, that is, the target attenuation factor can be conveniently determined by comparing the difference between the energy of the channel signal and the energy of the downmixed signal. Specifically, when the difference between the energy of the first channel signal or the energy of the second channcl signal and the energy of the downmixed signal is relatively large (greater than a given threshold), it may be considered that the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal are relatively weak. In this case, a relatively large target attenuation factor may be determined. However, when the difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal is relatively small (less than the given threshold), it may be considered that the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal are relatively strong. In this case, a relatively small target attenuation factor may be determined.
- The determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal may be calculating the target attenuation factor based on the correlations between the channel signals and the downmixed signal, or may be directly determining a preset attenuation factor as the target attenuation factor after the correlations between the channel signals and the downmixed signal are considered.
- With reference to the first aspect, in some implementations of the first aspect, each of the first channel signal and the second channel signal includes a plurality of frequency bins, and the determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal includes: determining difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determining the target attenuation factor based on the difference values.
- The difference between the energy of the first channel signal and the energy of the downmixed signal and the difference between the energy of the second channel signal and the energy of the downmixed signal can be conveniently determined by comparing the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins, and the attenuation factor is further determined. Therefore, it is unnecessary to compare differences between energy of the first channel signal and energy of the downmixed signal and differences between energy of the second channel signal and energy of the downmixed signal in all frequency bands.
- With reference to the first aspect, in some implementations of the first aspect, the determining difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins includes: determining a first difference value between the energy of the first channel signal and the energy of the downmixed signal, where the first difference value indicates a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins; and determining a second difference value between the energy of the second channel signal and the energy of the downmixed signal, where the second difference value indicates a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins; and the determining the target attenuation factor based on the difference values includes: determining the target attenuation factor based on a ratio between the first difference value and the second difference value.
- Alternatively, the target attenuation factor may be directly determined based on the first difference value and the second difference value.
- With reference to the first aspect, in some implementations of the first aspect, before the determining the target attenuation factor based on the difference values, the method further includes: determining that the difference values are greater than a preset threshold.
- Only when the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal are relatively large, the target attenuation factor is determined, and the initial reverberation gain parameter is adjusted based on the target attenuation factor. When the difference values are relatively small, the initial reverberation gain parameter may not be adjusted, thereby improving encoding efficiency.
- When difference values between energy of a plurality of channel signals and the energy of the downmixed signal are less than the preset threshold, initial reverberation gain parameter of the plurality of channel signals may be directly determined as target reverberation gain parameter of the plurality of channel signals.
- With reference to the first aspect, in some implementations of the first aspect, the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- The energy of the downmixed signal can be calculated by using the energy of the first channel signal and the energy of the second channel signal, and a calculation process can be simplified without using the downmixed signal itself.
- With reference to the first aspect, in some implementations of the first aspect, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the multi-channel signal, and any subband corresponds to only one attenuation factor.
- When the target attenuation factor includes a plurality of attenuation factors, a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- With reference to the first aspect, in some implementations of the first aspect, each of frequency bands in which the first channel signal and the second channel signal are located includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- According to a second aspect, a multi-channel signal encoding method is provided, where the method includes: determining a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal; determining identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal, where the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; and quantizing the first channel signal and the second channel signal based on the downmixed signal, the initial reverberation gain parameter, and the identification information, and writing a quantized first channel signal and a quantized second channel signal into a bitstream.
- The correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on a difference between energy of the first channel signal or energy of the second channel signal and energy of the downmixed signal, or may be determined based on a difference between an amplitude of the first channel signal or an amplitude of the second channel signal and an amplitude of the downmixed signal.
- In this application, a channel signal whose initial reverberation gain parameter needs to be adjusted can be determined based on a correlation between the channel signal and the downmixed signal, so that a decoder side can first adjust initial reverberation gain parameter of some channel signals and then perform reverberation processing on these channel signals, thereby improving quality of a channel signal obtained after reverberation processing.
- With reference to the second aspect, in some implementations of the second aspect, the determining identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal includes: determining the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal.
- The correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal can be conveniently measured by using the energy of the channel signals and the energy of the downmixed signal, so that a channel signal whose initial reverberation gain parameter needs to be adjusted can be conveniently determined.
- With reference to the second aspect, in some implementations of the second aspect, the determining the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal includes: determining a first difference value and a second difference value, where the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value.
- It should be understood that energy values of the first channel signal, the second channel signal, and the downmixed signal may be values obtained after normalization processing.
- The difference between the energy of the first channel signal and the energy of the downmixed signal and the difference between the energy of the second channel signal and the energy of the downmixed signal can be conveniently determined by comparing the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins, so as to determine a channel signal whose initial reverberation gain parameter needs to be adjusted. Therefore, it is unnecessary to compare differences between energy of the first channel signal and energy of the downmixed signal and differences between energy of the second channel signal and energy of the downmixed signal in all frequency bands.
- With reference to the second aspect, in some implementations of the second aspect, the determining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value includes: determining the larger difference value in the first difference value and the second difference value as a target difference value; and determining the identification information based on the target difference value, where the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted.
- With reference to the second aspect, in some implementations of the second aspect, the method further includes: determining a target attenuation factor based on the first difference value and the second difference value, where the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; and quantizing the target attenuation factor, and writing a quantized target attenuation factor into the bitstream.
- The initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- With reference to the second aspect, in some implementations of the second aspect, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- When the target attenuation factor includes a plurality of attenuation factors, a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- With reference to the second aspect, in some implementations of the second aspect, the target channel signal includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- With reference to the second aspect, in some implementations of the second aspect, the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- The energy of the downmixed signal is estimated or deduced by using energy of a plurality of channel signals, which can reduce calculation.
- According to a third aspect, a multi-channel signal decoding method is provided, where the method includes: obtaining a bitstream; determining a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, and identification information of the first channel signal and the second channel signal based on the bitstream, where the identification information indicates a channel signal that is in the first channel signal and the second channcl signal and whose initial reverberation gain parameter needs to be adjusted; determining, as a target channel signal based on the identification information, the channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; and adjusting the initial reverberation gain parameter of the target channel signal.
- In this application, the channel signal whose initial reverberation gain parameter needs to be adjusted can be determined by using the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before reverberation processing is performed on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- With reference to the third aspect, in some implementations of the third aspect, the adjusting an initial reverberation gain parameter of the target channel signal includes: determining a target attenuation factor; and adjusting the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal.
- The initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- With reference to the third aspect, in some implementations of the third aspect, the determining a target attenuation factor includes: determining a preset attenuation factor as the target attenuation factor.
- A process of determining the target attenuation factor can be simplified by presetting the attenuation factor, thereby improving decoding efficiency.
- With reference to the third aspect, in some implementations of the third aspect, the determining a target attenuation factor includes: obtaining the target attenuation factor based on the bitstream.
- When the bitstream includes the target attenuation factor, the target attenuation factor may be directly obtained from the bitstream, and the process of determining the target attenuation factor can be also simplified, thereby improving decoding efficiency.
- With reference to the third aspect, in some implementations of the third aspect, the determining a target attenuation factor includes: obtaining an inter-channel level difference between the first channel signal and the second channel signal from the bitstream; and determining the target attenuation factor based on the inter-channel level difference, or determining the target attenuation factor based on the inter-channel level difference and the downmixed signal.
- The target attenuation factor can be more flexibly and accurately determined based on the inter-channel level difference, the downmixed signal, and the like, so that an initial reverberation gain parameter of a channel signal can be more accurately adjusted based on the attenuation factor.
- With reference to the third aspect, in some implementations of the third aspect, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- When the target attenuation factor includes a plurality of attenuation factors, a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- With reference to the third aspect, in some implementations of the third aspect, the target channel signal includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- According to a fourth aspect, an encoder is provided, and the encoder includes a module or a unit configured to perform the method in the first aspect or various implementations of the first aspect.
- According to a fifth aspect, an encoder is provided, and the encoder includes a module or a unit configured to perform the method in the second aspect or various implementations of the second aspect.
- According to a sixth aspect, a decoder is provided, and the decoder includes a module or a unit configured to perform the method in the third aspect or various implementations of the third aspect.
- According to a seventh aspect, an encoder is provided. The encoder includes a memory and a processor, where the memory is configured to store a program, the processor is configured to execute the program, and when the program is executed, the processor performs the method in the first aspect or various implementations of the first aspect.
- According to an eighth aspect, an encoder is provided. The encoder includes a memory and a processor, where the memory is configured to store a program, the processor is configured to execute the program, and when the program is executed, the processor performs the method in the second aspect or various implementations of the second aspect.
- According to a ninth aspect, a decoder is provided. The decoder includes a memory and a processor, where the memory is configured to store a program, the processor is configured to execute the program, and when the program is executed, the processor performs the method in the third aspect or various implementations of the third aspect.
- According to a tenth aspect, a computer readable medium is provided, the computer readable medium stores program code to be executed by a device, and the program code includes an instruction used to perform the method in the first aspect or various implementations of the first aspect.
- According to an eleventh aspect, a computer readable medium is provided, the computer readable medium stores program code to be executed by a device, and the program code includes an instruction used to perform the method in the second aspect or various implementations of the second aspect.
- According to a twelfth aspect, a computer readable medium is provided, the computer readable medium stores program code to be executed by a device, and the program code includes an instruction used to perform the method in the third aspect or various implementations of the third aspect.
-
-
FIG. 1 is a schematic flowchart of encoding a left-channel signal and a right-channel signal in the prior art; -
FIG. 2 is a schematic flowchart of decoding a left-channel signal and a right-channel signal in the prior art; -
FIG. 3 is a schematic flowchart of a multi-channel signal encoding method according to an embodiment of this application; -
FIG. 4 is a schematic flowchart of a multi-channel signal encoding method according to an embodiment of this application; -
FIG. 5 is a schematic flowchart of a multi-channel signal decoding method according to an embodiment of this application; -
FIG. 6 is a schematic flowchart of a multi-channel signal encoding method according to an embodiment of this application; -
FIG. 7 is a schematic flowchart of a multi-channel signal decoding method according to an embodiment of this application; -
FIG. 8 is a schematic block diagram of an encoder according to an embodiment of this application; -
FIG. 9 is a schematic block diagram of an encoder according to an embodiment of this application; -
FIG. 10 is a schematic block diagram of a decoder according to an embodiment of this application; -
FIG. 11 is a schematic block diagram of an encoder according to an embodiment of this application; -
FIG. 12 is a schematic block diagram of an encoder according to an embodiment of this application; and -
FIG. 13 is a schematic block diagram of a decoder according to an embodiment of this application. - The following describes technical solutions of this application with reference to accompanying drawings. To better understand a multi-channel signal encoding method and a multi-channel signal decoding method in embodiments of this application, the following first briefly describes a multi-channel signal encoding method and a multi-channel signal decoding method in the prior art with reference to
FIG. 1 andFIG. 2 . -
FIG. 1 shows a process of encoding a left-channel signal and a right-channel signal in the prior art. The encoding process shown inFIG. 1 specifically includes the following steps. - 110. Perform spatial parameter analysis and downmixing processing on a left-channel signal (represented by L in the figure) and a right-channel signal (represented by R in the figure).
- Specifically, step 110 specifically includes: performing spatial parameter analysis on the left-channel signal and the right-channel signal to obtain a spatial parameter of the left-channel signal and a spatial parameter of the right-channel signal; and performing downmixing processing on the left-channel signal and the right-channel signal to obtain a downmixed signal (where the downmixed signal obtained after downmixing processing is a mono audio signal, and the original two channels of audio signals are converted into one channel of audio signal through downmixing processing).
- The spatial parameter (may be also referred to as a spatial sensing parameter) includes an inter-channel correlation (Inter-channel Coherent, IC), an inter-channel level difference (Inter-channel Level Difference, ILD), an inter-channel time difference (Inter-channel Time Difference, ITD), an inter-channel phase difference (Inter-channel Phase Difference, IPD), and the like.
- The IC describes an inter-channel cross-correlation or coherence. This parameter determines sensing of a sound field range, and can improve spatial sense and sound stability of an audio signal. The ILD is used to distinguish a horizontal direction angle of a stereo source and describes an inter-channel intensity difference, and this parameter affects frequency components of an entire spectrum. The ITD and the IPD are spatial parameters representing horizontal directions of a sound source. They describe inter-channel time and phase differences. The parameters mainly affect frequency components below 2 kHz. For two channel signals, the ITD may represent a time delay between a left-channel signal and a right-channel signal of a stereo, and the IPD may represent a waveform similarity of the left-channel signal and the right-channel signal of the stereo after time alignment. The ILD, the ITD, and the IPD can determine human ears' sensing of a location of a sound source, effectively determine a sound field location, and play an important role in stereo signal restoration.
- 120. Encode the downmixed signal to obtain a bitstream.
- 130. Encode the spatial parameters to obtain a bitstream.
- 140. Multiplex the bitstream obtained by encoding the downmixed signal and the bitstream obtained by encoding the spatial parameters to obtain a bitstream.
- The bitstream obtained through encoding may be stored or transmitted to a decoder-side device.
-
FIG. 2 shows a process of decoding a left-channel signal and a right-channel signal in the prior art. The decoding process shown inFIG. 2 specifically includes the following steps. - 210. Demultiplex a bitstream to separately obtain a bitstream obtained by encoding a downmixed signal and a bitstream obtained by encoding a spatial parameter.
- 220. Decode the bitstreams to obtain a downmixed signal of a left-channel signal and a right-channel signal, a spatial parameter of the left-channel signal, and a spatial parameter of the right-channel signal.
- The spatial parameters include an IC of the left-channel signal and the right-channel signal.
- 230. Obtain a de-correlation signal based on a downmixed signal and a spatial parameter of a current frame.
- The left-channel signal and the right-channel signal are obtained based on a decoded downmixed signal and the de-correlation signal of the current frame.
- 240. Obtain finally output left-channel signal and right-channel signal (respectively represented by L' and R' in
FIG. 2 ) based on the spatial parameters, the left-channel signal, and the right-channel signal. - It should be understood that the left-channel signal and the right-channel signal (respectively represented by L' and R' in
FIG. 2 ) in step 240 are obtained through decoding, and may be distorted to some extent compared with a left-channel signal and a right-channel signal that are encoded on an encoder side. - Specifically, the downmixed signal may be filtered, and then an inter-channel correlation parameter is used to correct a filtered downmixed signal to obtain a de-correlation signal.
- A purpose of generating the de-correlation signal is to improve a sense of reverberation of a finally generated stereo signal on a decoder side, and increase a sound field width of the stereo signal, so that an output audio signal is more mellow and full in terms of auditory sense. The sense of reverberation is essentially an effect of delaying such as reflecting and refracting an original audio signal differently and then superimposing the reflected and refracted audio signals on the original audio signal to enter a human ear.
- In the prior art, after the IC is obtained, a correlation of different channel signals is not considered so as to adaptively adjust the IC. In this case, when reverberation processing is performed on the channel signal based on the previously obtained IC, a relatively poor auditory effect may be caused. For example, when a correlation between different channel signals is relatively low, if the previously obtained IC is still used to correct a de-correlation signal, and then the de-correlation signal is used to perform same reverberation processing on the different channel signals, quality of a channel signal finally output by the decoder side is relatively poor. That is, because a difference between different channel signals is relatively large, if reverberation processing is performed on different channel signals by still using the de-correlation signal corrected by the previous relatively large IC, a reverberation effect of the channel signal is not increased, but the output channel signal may be distorted.
- Therefore, the embodiments of this application provide a multi-channel signal encoding or decoding method. In this method, a reverberation gain parameter can be correspondingly adjusted based on a correlation between different channel signals, and a de-correlation signal is corrected by using an adjusted reverberation gain parameter. Then, reverberation processing is performed on different channel signals by using the de-correlation signal. In this way, when reverberation processing is performed on different channel signals, the correlation between different channel signals is considered, so that quality of an output channel signal is better.
-
FIG. 3 is a schematic flowchart of a multi-channel signal encoding method according to an embodiment of this application. The method inFIG. 3 may be performed by an encoder-side device or an encoder. The method inFIG. 3 includes the following steps. - 310. Determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal.
- It should be understood that, in this embodiment of this application, a sequence of determining the downmixed signal and determining the initial reverberation gain parameter is not limited, and the downmixed signal and the initial reverberation gain parameter may be determined simultaneously or successively.
- The initial reverberation gain parameter may be reverberation gain parameter obtained after spatial parameter analysis is performed on the first channel signal and the second channel signal.
- Specifically, the downmixed signal may be obtained by performing downmixing processing on the plurality of channel signals. A spatial parameter of the first channel signal and a spatial parameter of the second channel signal are obtained by performing spatial parameter analysis on the first channel signal and the second channel signal, where the spatial parameters include the initial reverberation gain parameter of the first channel signal and the second channel signal.
- It should be understood that the first channel signal and the second channel signal may correspond to a same spatial parameter, and correspondingly, the first channel signal and the second channel signal may also correspond to a same initial reverberation gain parameter. That is, the spatial parameter of the first channel signal and the spatial parameter of the second channel signal may be the same, and the initial reverberation gain parameter of the first channel signal and the second channel signal may also be the same.
- Further, assuming that each of the first channel signal and the second channel signal includes 10 subbands, and each subband corresponds to one reverberation gain parameter, reverberation gain parameters corresponding to subbands, whose index values are the same, of the first channel signal and the second channel signal may be the same.
- In addition, the first channel signal, the second channel signal, and the downmixed signal may be channel signals obtained after normalization processing.
- 320. Determine a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter.
- Optionally, the correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on a difference between energy of the first channel signal or energy of the second channel signal and energy of the downmixed signal, or may be determined based on a difference between an amplitude of the first channcl signal or an amplitude of the second channcl signal and an amplitude of the downmixed signal.
- Specifically, when the difference between the energy or the amplitude of the first channel signal and the energy or the amplitude of the downmixed signal is relatively small, it may be considered that the correlation between the first channel signal and the downmixed signal is relative large. When the difference between the energy or the amplitude of the first channel signal and the energy or the amplitude of the downmixed signal is relatively large, it may be considered that the correlation between the first channel signal and the downmixed signal is relatively small.
- The difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal may be specifically a difference value between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal. Similarly, the difference between the amplitude of the first channel signal or the amplitude of the second channel signal and the amplitude of the downmixed signal may be specifically a difference value between the amplitude of the first channel signal or the amplitude of the second channel signal and the amplitude of the downmixed signal.
- In addition, the correlation between the first channel signal or the second channel signal and the downmixed signal may alternatively refer to a difference between a phase, a period, or the like of the first channel signal or the second channel signal and a phase, a period, or the like of the downmixed signal.
- 330. Quantize the first channel signal and the second channel signal based on the downmixed signal and the target reverberation gain parameter, and write a quantized first channel signal and a quantized second channel signal into a bitstream.
- It should be understood that when the multi-channel signal has more than two channel signals, for example, when the multi-channel signal includes the first channel signal, the second channel signal, a third channel signal, and a fourth channel signal, the first channel signal and the second channel signal may be processed by using the method in
FIG. 3 , and the third channel signal and the fourth channel signal are also processed by using the method inFIG. 3 . - In this application, when a target reverberation gain parameter of a channel signal is being determined, a correlation between the channel signal and the downmixed signal is considered. In this way, a better processing effect can be obtained when reverberation processing is performed on the channel signal based on the target reverberation gain parameter, thereby improving quality of a channel signal obtained after reverberation processing.
- Optionally, in an embodiment, the determining a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter includes: determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal; and adjusting the initial reverberation gain parameter based on the target attenuation factor, to obtain the target reverberation gain parameter.
- Specifically, the determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal may be calculating the target attenuation factor based on the correlations between the channel signals and the downmixed signal, or may be directly determining a preset attenuation factor as the target attenuation factor after the correlations between the channel signals and the downmixed signal are considered.
- The initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- For example, when the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal are relatively large (in this case, it may also be considered that the first channel signal is relatively similar to the second channel signal), a target attenuation factor with a relatively small value may be determined. However, when the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal are relatively small (in this case, it may also be considered that the first channel signal is relatively different from the second channel signal), a target attenuation factor with a relatively large value may be determined.
- In some embodiments, correlations between the plurality of channel signals and the downmixed signal may refer to differences between energy of the plurality of channel signals and the energy of the downmixed signal, or differences between amplitudes of the plurality of channel signals and the amplitude of the downmixed signal. The differences between the energy of the plurality of channel signals and the energy of the downmixed signal may be specifically difference values between the energy of the plurality of channel signals and the energy of the downmixed signal. Similarly, the differences between the amplitudes of the plurality of channel signals and the amplitude of the downmixed signal may be specifically difference values between the amplitudes of the plurality of channel signals and the amplitude of the downmixed signal. In addition, the correlations between the plurality of channel signals and the downmixed signal may alternatively refer to differences between phases, periods, or the like of the plurality of channel signals and the phase, the period, or the like of the downmixed signal.
- In some embodiments, the correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on the difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal, and further the target attenuation factor is determined.
- The correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal can be conveniently measured by using the energy of the channel signals and the energy of the downmixed signal, that is, the target attenuation factor can be conveniently determined by comparing the difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal.
- Optionally, in an embodiment, both the first channel signal and the second channel signal include a plurality of frequency bins, and the determining a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal includes: determining difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determining the target attenuation factor based on the difference values.
- The difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins may be difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of same frequency bins. For example, the first channel signal includes three frequency bins (a first frequency channel number, a second frequency channel number, and a third frequency channel number). In this case, difference values between energy of the first channel signal and energy of the downmixed signal at the three frequency bins are specifically a difference value between the first channel signal and the downmixed signal at the first frequency channel number, a difference value between the first channel signal and the downmixed signal at the second frequency channel number, and a difference value between the first channel signal and the downmixed signal at the third frequency channel number.
- Similarly, the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins may be difference values between energy of the second channel signal and energy of the downmixed signal at a plurality of same frequency bins.
- Optionally, the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins may be a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins. Similarly, the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins may be a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins.
- It should be understood that energy values of the first channel signal, the second channel signal, and the downmixed signal may be values obtained after normalization processing.
- The difference between the energy of the first channel signal and the energy of the downmixed signal and the difference between the energy of the second channel signal and the energy of the downmixed signal can be conveniently determined by comparing the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins, and the attenuation factor is further determined. Therefore, it is unnecessary to compare differences between energy of the first channel signal and energy of the downmixed signal and differences between energy of the second channel signal and energy of the downmixed signal in all frequency bands.
- Optionally, in an embodiment, the determining difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins includes: determining a first difference value between the energy of the first channel signal and the energy of the downmixed signal, where the first difference value indicates a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins; determining a second difference value between the energy of the second channel signal and the energy of the downmixed signal, where the second difference value indicates a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins; and determining the target attenuation factor based on the first difference value and the second difference value.
- The determining the target attenuation factor based on the first difference value and the second difference value may include: determining the target attenuation factor based on a ratio between the first difference value and the second difference value.
- Specifically, when the first channel signal is a left-channel signal and the second channel signal is a right-channel signal, the first difference value and the second difference value may be calculated according to the following formula:
- When the target attenuation factor is being determined based on the first difference value and the second difference value, the ratio between the first difference value and the second difference value may be directly determined as the target attenuation factor. For example, the first difference value is a, and the second difference value is b. When a < b, a/b is determined as the target attenuation factor, or when a > b, b/a is determined as the target attenuation factor. In addition, after the target attenuation factor is determined based on the first difference value and the second difference value, some smoothing processing may be performed on the target attenuation factor and an attenuation factor of a previous frame, and a target attenuation factor obtained after smoothing processing is used to further adjust the initial reverberation gain parameter of the plurality of channel signals.
- Optionally, in an embodiment, before the target attenuation factor is determined based on the foregoing difference values, the method in
FIG. 3 further includes: determining that the difference values are greater than a preset threshold. - It should be understood that, that the difference values are greater than the preset threshold herein may mean that the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal are greater than a same preset threshold, or may mean that the difference between the energy of the first channel signal and the energy of the downmixed signal is greater than a preset first threshold, and the difference between the energy of the second channel signal and the energy of the downmixed signal is greater than a preset second threshold.
- Only when the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal are relatively large, the target attenuation factor is determined, and the initial reverberation gain parameter is adjusted based on the target attenuation factor. When the difference values are relatively small, the initial reverberation gain parameter may not be adjusted, thereby improving encoding efficiency.
- For example, when the difference value between the energy of the first channel signal and the energy of the downmixed signal is greater than M (where M is between 0.5 and 1) times the energy of the first channel signal, it may be considered that the difference value between the energy of the first channel signal and the energy of the downmixed signal is greater than the preset threshold. In this case, the preset threshold is M times the energy of the first channel signal. Alternatively, when a ratio of the difference value between the energy of the first channel signal and the energy of the downmixed signal to the energy of the first channel signal is greater than M, it may also be considered that the difference value between the energy of the first channel signal and the energy of the downmixed signal is greater than the preset threshold.
- When difference values between energy of a plurality of channel signals and the energy of the downmixed signal are less than the preset threshold, initial reverberation gain parameter of the plurality of channel signals may be directly determined as target reverberation gain parameter of the plurality of channel signals.
- Optionally, in an embodiment, the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- The energy of the downmixed signal can be calculated by using the energy of the first channel signal and the energy of the second channel signal, and a calculation process can be simplified without using the downmixed signal itself.
- Certainly, in this embodiment of this application, the energy of the downmixed signal may alternatively be directly calculated based on the downmixed signal itself.
- Optionally, in an embodiment, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the multi-channel signal, and any subband corresponds to only one attenuation factor.
- For example, indexes of subbands included in each of the first channel signal and the second channel signal are 0 to 9. Both the first channel signal and the second channel signal include 10 reverberation gain parameters, each subband corresponds to one reverberation gain parameter, the target attenuation factor includes five attenuation factors, and each attenuation factor corresponds to two subbands; or the target attenuation factor includes 10 attenuation factors, and each attenuation factor corresponds to one subband.
- In addition, when the target attenuation factor includes a plurality of attenuation factors, a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor. For example, reverberation gain parameters corresponding to subbands, whose indexes are 0 to 4, of a plurality of channel signals need to be adjusted slightly, but reverberation gain parameters corresponding to subbands, whose indexes are 5 to 9, of a channel signal need to be adjusted greatly. In this case, relatively small attenuation factors may be set for the reverberation gain parameters corresponding to the subbands whose indexes are 0 to 4, and relatively large attenuation factors are set for the reverberation gain parameters corresponding to the subbands whose indexes are 5 to 9.
- Optionally, in an embodiment, each of the first channel signal and the second channel signal (where a frequency band occupied by the first channel signal and a frequency band occupied by the second channel signal are the same) includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- For example, each of frequency bands in which the first channel signal and the second channel signal are located includes a low frequency part and a high frequency part, and the target attenuation factor includes a plurality of attenuation factors. The low frequency part corresponds to at least one attenuation factor, the high frequency part corresponds to at least one attenuation factor, and the attenuation factor corresponding to the low frequency part is less than the attenuation factor corresponding to the high frequency part.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
-
FIG. 4 is a schematic flowchart of a multi-channel signal encoding method according to an embodiment of this application. InFIG. 4 , channel signals include a left-channel signal and a right-channel signal, and a process of encoding the left-channel signal and the right-channel signal specifically includes the following steps. - 410. Calculate a spatial parameter of the left-channel signal and a spatial parameter of the right-channel signal.
- The spatial parameters include initial reverberation gain parameter of the left-channel signal and the right-channel signal, and another spatial parameter.
- 420. Perform downmixing processing on the left-channel signal (represented by L in the figure) and the right-channel signal (represented by R in the figure) to obtain a downmixed signal.
- 430. Determine difference values between energy of the left-channel signal and energy of the downmixed signal and between energy of the right-channel signal and energy of the downmixed signal.
- Specifically, each of the left-channel signal and the right-channel signal may be divided into a high frequency part and a low frequency part, and difference values between energy of the left-channel signal and energy of the downmixed signal and between energy of the right-channel signal and energy of the downmixed signal at the high frequency part are determined as the difference values between the energy of the left-channel signal and the energy of the downmixed signal and between the energy of the right-channel signal and the energy of the downmixed signal.
- 440. Adjust reverberation gain parameters of the left-channel signal and the right-channel signal based on the difference values between the energy of the left-channel signal and the energy of the downmixed signal and between the energy of the right-channel signal and the energy of the downmixed signal.
- Specifically, an encoder side may determine a target attenuation factor based on the difference values between the energy of the left-channel signal and the energy of the downmixed signal and between the energy of the right-channel signal and the energy of the downmixed signal, and adjust the reverberation gain parameters of the left-channel signal and the right-channel signal based on the target attenuation factor.
- 450. Quantize the downmixed signal, adjusted reverberation gain parameters, and another spatial parameter to obtain a bitstream.
-
FIG. 5 is a schematic flowchart of a multi-channel signal decoding method according to an embodiment of this application. InFIG. 5 , channel signals include a left-channel signal and a right-channel signal. InFIG. 5 , the bitstream generated through encoding in the encoding method inFIG. 4 may be decoded. A decoding process inFIG. 5 specifically includes the following steps: - 510. Obtain a bitstream of the left-channel signal and the right-channel signal.
- 520. Decode the bitstream to obtain a downmixed signal.
- 530. Decode the bitstream to obtain spatial parameters of the left-channel signal and the right-channel signal.
- The spatial parameter includes a reverberation gain parameter adjusted by an encoder side, that is, the encoder side encodes the adjusted reverberation gain parameter. In this way, after decoding the bitstream, a decoder side obtains the reverberation gain parameter adjusted by the encoder side.
- Step 520 and step 530 are not performed in a sequence, and may be performed simultaneously.
- 540. Perform subsequent processing (for example, smoothing filtering) on the spatial parameters obtained through decoding.
- 550. Obtain a de-correlation signal based on the downmixed signal and the reverberation gain parameter that are obtained through decoding (where the reverberation gain parameter is the reverberation gain parameter adjusted by the encoder side).
- 560. Perform upmixing processing based on the spatial parameters and the downmixed signal processed in step 540 to obtain the left-channel signal and the right-channel signal.
- 570. Separately perform reverberation processing on the left-channel signal and the right-channel signal based on the de-correlation signal.
- In the method shown in
FIG. 5 , the reverberation gain parameter based on which reverberation processing is performed on the left-channel signal and the right-channel signal has been adjusted based on correlations between the left-channel signal and the downmixed signal and between the right-channel signal and the downmixed signal. In this way, corresponding reverberation processing can be performed based on a difference between the left-channel signal and the right-channel signal, thereby improving quality of a channel signal obtained after reverberation processing. - In the encoding method in
FIG. 3 , the encoder side determines whether an initial reverberation gain parameter of a channel signal needs to be adjusted. If the initial reverberation gain parameter of the channel signal needs to be adjusted, the encoder side adjusts the initial reverberation gain parameter of the channel signal, and encodes an adjusted reverberation gain parameter, so that the decoder side directly performs reverberation processing based on a reverberation gain parameter obtained through decoding. - Actually, the encoder side may alternatively determine only whether the initial reverberation gain parameter of the channel signal needs to be adjusted. If the initial reverberation gain parameter of the channel signal needs to be adjusted, the encoder side sends corresponding indication information to the decoder side. After receiving the indication information, the decoder side adjusts the initial reverberation gain parameter of the channel signal.
-
FIG. 6 is a schematic flowchart of a multi-channel signal encoding method according to an embodiment of this application. The method inFIG. 6 includes the following steps. - 610. Determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal.
- Specifically, the downmixed signal may be obtained by performing downmixing processing on the first channel signal and the second channel signal, and spatial parameters are obtained by performing spatial parameter analysis on the first channcl signal and the second channcl signal, where the spatial parameters include the initial reverberation gain parameter of the first channel signal and the second channel signal.
- It should be understood that the downmixed signal and the initial reverberation gain parameter may be determined simultaneously or successively.
- It should be understood that the first channel signal and the second channel signal may correspond to a same spatial parameter, and specifically, the first channel signal and the second channel signal also correspond to a same initial reverberation gain parameter. That is, a spatial parameter of the first channel signal and a spatial parameter of the second channel signal are the same, and the initial reverberation gain parameter of the first channel signal and the second channel signal are the same.
- Further, assuming that each of the first channel signal and the second channel signal includes 10 subbands, and each subband corresponds to one reverberation gain parameter, reverberation gain parameters corresponding to subbands, whose index values are the same, of the first channel signal and the second channel signal may be the same.
- 620. Determine identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal, where the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted.
- Optionally, the correlation between the first channel signal or the second channel signal and the downmixed signal may be determined based on a difference between energy of the first channel signal or energy of the second channel signal and energy of the downmixed signal, or may be determined based on a difference between an amplitude of the first channel signal or an amplitude of the second channel signal and an amplitude of the downmixed signal.
- Specifically, when the difference between the energy or the amplitude of the first channel signal and the energy or the amplitude of the downmixed signal is relatively small, it may be considered that the correlation between the first channel signal and the downmixed signal is relative large. When the difference between the energy or the amplitude of the first channel signal and the energy or the amplitude of the downmixed signal is relatively large, it may be considered that the correlation between the first channel signal and the downmixed signal is relatively small.
- The difference between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal may be specifically a difference value between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal. Similarly, the difference between the amplitude of the first channel signal or the amplitude of the second channel signal and the amplitude of the downmixed signal may be specifically a difference value between the amplitude of the first channel signal or the amplitude of the second channel signal and the amplitude of the downmixed signal.
- In addition, the correlation between the first channel signal or the second channel signal and the downmixed signal may alternatively refer to a difference between a phase, a period, or the like of the first channel signal or the second channel signal and a phase, a period, or the like of the downmixed signal.
- The first channel signal, the second channel signal, and the downmixed signal may be channel signals obtained after normalization processing.
- Specifically, the identification information may indicate that the first channel signal or the second channel signal is a channel signal whose initial reverberation gain parameter needs to be adjusted, or may indicate that the first channel signal and the second channel signal are channel signals whose initial reverberation gain parameters need to be adjusted, or may indicate that a reverberation gain parameter does not need to be adjusted for both the first channel signal and the second channel signal.
- In some embodiments, the identification information may indicate, by using a value of an identifier field, a channel signal that is in a plurality of channel signals and whose initial reverberation gain parameter needs to be adjusted. For example, the identifier field of the identification information occupies two bits. When the value of the identifier field is 00, it indicates that neither the initial reverberation gain parameter of the first channel signal nor the initial reverberation gain parameter of the second channel signal needs to be adjusted. When the value of the identifier field is 01, it indicates that only the initial reverberation gain parameter of the first channel signal needs to be adjusted. When the value of the identifier field is 10, it indicates that only the initial reverberation gain parameter of the second channel signal needs to be adjusted. When the value of the identifier field is 11, it indicates that both the initial reverberation gain parameter of the first channel signal and the second channel signal need to be adjusted.
- In some embodiments, the determining identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal includes: determining the identification information of the first channel signal and the second channel signal based on correlations between the energy of the first channel signal and the energy of the downmixed signal and between the energy of the second channel signal and the energy of the downmixed signal.
- The correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal can be conveniently measured by using the energy of the channel signals and the energy of the downmixed signal, so that a channel signal whose initial reverberation gain parameter needs to be adjusted can be conveniently determined.
- In some embodiments, the energy or amplitude of the downmixed signal may be calculated based on the energy of the first channel signal and the energy of the second channel signal, thereby simplifying a calculation process. Alternatively, the energy of the downmixed signal may be directly calculated based on the downmixed signal itself.
- 630. Quantize the first channel signal and the second channel signal based on the downmixed signal, the initial reverberation gain parameter, and the identification information, and write a quantized first channel signal and a quantized second channel signal into a bitstream.
- In this application, by determining a relationship between a preset threshold and a size of a difference value between energy of a channel signal and the energy of the downmixed signal, the channel signal can be determined as a channel signal whose initial reverberation gain parameter needs to be adjusted, when the energy of the channel signal is greatly different from the energy of the downmixed signal. Therefore, a decoder side can first adjust an initial reverberation gain parameter of the channel signal and then perform reverberation processing on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- Optionally, in an embodiment, the determining the identification information of the first channel signal and the second channel signal based on correlations between the energy of the first channel signal and the energy of the downmixed signal and between the energy of the second channel signal and the energy of the downmixed signal includes: determining a first difference value and a second difference value, where the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value.
- The difference between the energy of the first channel signal and the energy of the downmixed signal and the difference between the energy of the second channel signal and the energy of the downmixed signal can be conveniently determined by comparing the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins and the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins, so as to determine a channel signal whose initial reverberation gain parameter needs to be adjusted. Therefore, it is unnecessary to compare differences between energy of the first channel signal and energy of the downmixed signal and differences between energy of the second channel signal and energy of the downmixed signal in all frequency bands.
- Optionally, the determining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value includes: determining the larger difference value in the first difference value and the second difference value as a target difference value; and determining the identification information based on the target difference value, where the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted.
- Specifically, when the sum of the absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins is greater than the sum of the absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins, the first channel signal may be determined as a channel signal whose initial reverberation gain parameter needs to be adjusted.
- In addition, when both the sum of the absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins, and the sum of the absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins are relatively large (for example, both are greater than the preset threshold), another piece of identification information may be determined, and the identification information indicates that both the initial reverberation gain parameter of the first channel signal and the second channel signal need to be adjusted.
- Specifically, in some embodiments, the determining the identification information of the first channel signal and the second channel signal based on the sum of the absolute values of the difference values between the energy of the first channel signal or the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins includes: generating first identification information when the sum of the absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins is greater than the preset threshold, where the first identification information indicates that the initial reverberation gain parameter of the first channel signal needs to be adjusted; and generating second identification information when the sum of the absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins is greater than the preset threshold, where the second identification information indicates that the initial reverberation gain parameter of the second channel signal needs to be adjusted.
- By determining a relationship between the preset threshold and a size of a difference value between energy of a channel signal and the energy of the downmixed signal, the channel signal can be determined as a channel signal whose initial reverberation gain parameter needs to be adjusted, when the energy of the channel signal is greatly different from the energy of the downmixed signal. Therefore, a decoder side can first adjust an initial reverberation gain parameter of the channel signal and then perform reverberation processing on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- It should be understood that the identification information of the first channel signal and the second channel signal may be one piece of identification information or two pieces of identification information. For example, when both the initial reverberation gain parameter of the first channel signal and the second channel signal need to be adjusted, the identification information of the first channel signal and the second channel signal may be one piece of identification information, and the identification information indicates that both the initial reverberation gain parameter of the first channel signal and the second channel signal need to be adjusted. Alternatively, the identification information of the first channel signal and the second channel signal is two pieces of identification information: first identification information and second identification information respectively, the first identification information indicates that the initial reverberation gain parameter of the first channel signal needs to be adjusted, and the second identification information indicates that the initial reverberation gain parameter of the second channel signal needs to be adjusted. When a channel signal has no corresponding identification information, it indicates that an initial reverberation gain parameter of the channel signal does not need to be adjusted. That is, when the identification information includes only the first identification information, the initial reverberation gain parameter of only the first channel signal in the first channel signal and the second channel signal needs to be adjusted.
- Optionally, in some embodiments, when the initial reverberation gain parameter of the first channel signal needs to be adjusted, the method in
FIG. 6 further includes: determining a target attenuation factor based on the first difference value and the second difference value, where the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; and quantizing the target attenuation factor, and writing a quantized target attenuation factor into the bitstream. - The initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a value of the correlation between the channel signal and the downmixed signal by using the attenuation factor.
- It should be understood that the first difference value and the second difference value may be calculated by referring to Formula (1) and Formula (2) in the foregoing.
- When the target attenuation factor is being determined based on the first difference value and the second difference value, the target attenuation factor may be determined based on a ratio between the first difference value and the second difference value.
- In some embodiments, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor. For example, the multi-channel signal includes a plurality of subbands, and adjacent subbands may correspond to one attenuation factor.
- When the target attenuation factor includes a plurality of attenuation factors, a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- In some other embodiments, the target channel signal includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- For example, a frequency band in which the target channel signal is located includes a low frequency part and a high frequency part, and the target attenuation factor includes a plurality of attenuation factors. The low frequency part corresponds to at least one attenuation factor, the high frequency part corresponds to at least one attenuation factor, and the attenuation factor corresponding to the low frequency part is less than the attenuation factor corresponding to the high frequency part.
- In some embodiments, the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- The energy of the downmixed signal can be calculated by using the energy of the first channel signal and the energy of the second channel signal, and a calculation process can be simplified without using the downmixed signal itself.
- The foregoing describes the encoding method in the embodiment of this application in detail with reference to
FIG. 6 . The following describes a decoding method in the embodiment of this application with reference toFIG. 7 . It should be understood that the decoding method inFIG. 7 corresponds to the encoding method inFIG. 6 . For brevity, repeated descriptions are properly omitted below. -
FIG. 7 is a schematic flowchart of a multi-channel signal decoding method according to an embodiment of this application. The method inFIG. 7 may be performed by a decoder-side device or a decoder. The method inFIG. 7 specifically includes the following steps: - 710. Obtain a bitstream.
- 720. Determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, and identification information of the first channel signal and the second channel signal based on the bitstream, where the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted.
- 730. Determine, as a target channel signal based on the identification information, the channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted.
- 740. Adjust the initial reverberation gain parameter of the target channel signal.
- In this application, the channel signal whose initial reverberation gain parameter needs to be adjusted can be determined by using the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before reverberation processing is performed on the channel signal, thereby improving quality of a 2channel signal obtained after reverberation processing.
- Optionally, in an embodiment, the adjusting an initial reverberation gain parameter of the target channel signal includes: determining a target attenuation factor, and adjusting the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal.
- The initial reverberation gain parameter of the channel signal can be flexibly adjusted based on a size of a correlation between the channel signal and the downmixed signal by using the attenuation factor.
- When determining the attenuation factor, the decoder side may determine a preset attenuation factor as the target attenuation factor. Alternatively, the decoder side directly adjusts the initial reverberation gain parameter of the target channel signal based on a preset attenuation factor.
- A process of determining the target attenuation factor can be simplified by presetting the attenuation factor, thereby improving decoding efficiency.
- In some embodiments, the decoder side may obtain the target attenuation factor from bitstreams of a plurality of channel signals, that is, obtain the target attenuation factor by decoding the bitstreams of the plurality of channel signals. In this case, an encoder side has determined the target attenuation factor, and encodes the target attenuation factor to obtain and transmit the bitstream to the decoder side. In this way, the decoder side does not need to calculate the target attenuation factor any more, but directly decodes the bitstream to obtain the target attenuation factor.
- When the bitstream includes the target attenuation factor, the target attenuation factor may be directly obtained from the bitstream, and the process of determining the target attenuation factor can be also simplified, thereby improving decoding efficiency.
- Optionally, in an embodiment, the determining a target attenuation factor specifically includes: obtaining an inter-channel level difference between the first channel signal and the second channel signal from the bitstream; and determining the target attenuation factor based on the inter-channel level difference, or determining the target attenuation factor based on the inter-channel level difference and the downmixed signal.
- The target attenuation factor can be more flexibly and accurately determined based on the inter-channel level difference, the downmixed signal, and the like, so that an initial reverberation gain parameter of a channel signal can be more accurately adjusted based on the attenuation factor.
- Specifically, when the inter-channel level difference is relatively large, it may be considered that a difference between the first channel signal and the second channel signal is relatively large, and a correlation between the first channel signal and the second channel signal is relatively small. In this case, an attenuation factor with a relatively large value may be determined as the target attenuation factor.
- In addition, when the target attenuation factor is being determined based on the downmixed signal, the target attenuation factor may be determined by using periodicity and harmonicity of the downmixed signal. For example, when the periodicity or the harmonicity of the downmixed signal is good, it may be considered that the difference between the first channel signal and the second channel signal is relatively small, and the correlation between the first channel signal and the second channel signal is relatively large. In this case, an attenuation factor with a relatively small value may be determined as the target attenuation factor.
- Optionally, in an embodiment, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor. For example, each of the first channel signal and the second channel signal includes a plurality of subbands, and a plurality of adjacent subbands may correspond to one attenuation factor.
- When the target attenuation factor includes a plurality of attenuation factors, a reverberation gain parameter can be more flexibly adjusted based on the target attenuation factor.
- In some other embodiments, the target channel signal includes a first frequency band and a second frequency band, an attenuation factor corresponding to a subband in the first frequency band is less than or equal to an attenuation factor corresponding to a subband in the second frequency band, and a frequency of the first frequency band is less than a frequency of the second frequency band.
- Reverberation gain parameters corresponding to a high frequency subband and a low frequency subband can be adjusted to different degrees by setting attenuation factors of different sizes for the reverberation gain parameters corresponding to the high frequency subband and the low frequency subband, and a better processing effect can be obtained during reverberation processing.
- For example, a frequency band in which the target channel signal is located includes a low frequency part and a high frequency part, and the target attenuation factor includes a plurality of attenuation factors. The low frequency part corresponds to at least one attenuation factor, the high frequency part corresponds to at least one attenuation factor, and the attenuation factor corresponding to the low frequency part is less than the attenuation factor corresponding to the high frequency part.
- The foregoing describes the encoding method and the decoding method in the embodiments of this application in detail with reference to
FIG. 3 to FIG. 7 . The following describes an encoder and a decoder in the embodiments of this application with reference toFIG. 8 to FIG. 13 . It should be understood that the encoder and the decoder inFIG. 8 to FIG. 13 can implement steps performed by the encoder and the decoder in the encoding method and the decoding method in the embodiments of this application. For brevity, repeated descriptions are properly omitted below. -
FIG. 8 is a schematic block diagram of an encoder according to an embodiment of this application. Anencoder 800 inFIG. 8 includes: - a
processing unit 810, configured to determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal; where - the
processing unit 810 is further configured to determine a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter; and - an
encoding unit 820, configured to quantize the first channel signal and the second channel signal based on the downmixed signal and the target reverberation gain parameter, and write a quantized first channel signal and a quantized second channel signal into a bitstream. - The
encoder 800 may correspond to the multi-channel signal encoding method inFIG. 3 , and theencoder 800 may perform the multi-channel signal encoding method inFIG. 3 . - In this application, when a target reverberation gain parameter of a channel signal is being determined, a correlation between the channel signal and the downmixed signal is considered. In this way, a better processing effect can be obtained when reverberation processing is performed on the channel signal based on the target reverberation gain parameter, thereby improving quality of a channel signal obtained after reverberation processing.
- Optionally, in an embodiment, the
processing unit 810 is specifically configured to determine a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal; and adjust the initial reverberation gain parameter based on the target attenuation factor to obtain the target reverberation gain parameter. - Optionally, in an embodiment, each of the first channel signal and the second channel signal includes a plurality of frequency bins, and the
processing unit 810 is specifically configured to: determine difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins, and determine the target attenuation factor based on the difference values. - Optionally, in an embodiment, the
processing unit 810 is specifically configured to: determine a first difference value between the energy of the first channel signal and the energy of the downmixed signal, where the first difference value indicates a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins; determine a second difference value between the energy of the second channel signal and the energy of the downmixed signal, where the second difference value indicates a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins; and determine the target attenuation factor based on a ratio between the first difference value and the second difference value. - Optionally, in an embodiment, before determining the target attenuation factor based on the difference values, the
processing unit 810 is further specifically configured to: determine that the difference values are greater than a preset threshold. - Optionally, in an embodiment, the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- Optionally, in an embodiment, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the multi-channel signal, and any subband corresponds to only one attenuation factor.
-
FIG. 9 is a schematic block diagram of an encoder according to an embodiment of this application. Anencoder 900 inFIG. 9 includes: - a
processing unit 910, configured to determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal; where - the
processing unit 910 is further configured to determine identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal, where the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; and - an
encoding unit 920, configured to quantize the first channel signal and the second channel signal based on the downmixed signal, the initial reverberation gain parameter, and the identification information, and write a quantized first channel signal and a quantized second channel signal into a bitstream. - In this application, a channel signal whose initial reverberation gain parameter needs to be adjusted can be determined based on a correlation between the channel signal and the downmixed signal, so that a decoder side can first adjust initial reverberation gain parameter of some channel signals and then perform reverberation processing on these channel signals, thereby improving quality of a channel signal obtained after reverberation processing.
- It should be understood that the
encoder 900 may correspond to the multi-channel signal encoding method inFIG. 6 , and theencoder 900 may perform the multi-channel signal encoding method inFIG. 6 . - Optionally, in an embodiment, the
processing unit 910 is specifically configured to determine the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal. - Optionally, in an embodiment, the
processing unit 910 is specifically configured to: determine a first difference value and a second difference value, where the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determine the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value. - Optionally, in an embodiment, the
processing unit 910 is specifically configured to determine the larger difference value in the first difference value and the second difference value as a target difference value, and determine the identification information based on the target difference value, where the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted. - Optionally, in an embodiment, the
processing unit 910 is further specifically configured to: determine a target attenuation factor based on the first difference value and the second difference value, where the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; and quantize the target attenuation factor, and write a quantized target attenuation factor into the bitstream. - Optionally, in an embodiment, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- Optionally, in an embodiment, the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
-
FIG. 10 is a schematic block diagram of a decoder according to an embodiment of this application. Adecoder 1000 inFIG. 10 includes: - an obtaining
unit 1010, configured to obtain a bitstream; and - a
processing unit 1020, configured to determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, and identification information of the first channel signal and the second channel signal based on the bitstream, where the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; where - the
processing unit 1020 is further configured to determine, as a target channel signal based on the identification information, the channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; and - the
processing unit 1020 is further configured to adjust the initial reverberation gain parameter of the target channel signal. - In this application, the channel signal whose initial reverberation gain parameter needs to be adjusted can be determined by using the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before reverberation processing is performed on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- It should be understood that the
decoder 1000 may correspond to the multi-channel signal decoding method inFIG. 7 , and thedecoder 1000 may perform the multi-channel signal decoding method inFIG. 7 . - Optionally, in an embodiment, the
processing unit 1020 is specifically configured to determine a target attenuation factor, and adjust the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal. - Optionally, in an embodiment, the
processing unit 1020 is specifically configured to determine a preset attenuation factor as the target attenuation factor. - Optionally, in an embodiment, the
processing unit 1020 is specifically configured to obtain the target attenuation factor based on the bitstream. - Optionally, in an embodiment, the
processing unit 1020 is specifically configured to obtain an inter-channel level difference between the first channel signal and the second channel signal from the bitstream, and determine the target attenuation factor based on the inter-channel level difference, or determine the target attenuation factor based on the inter-channel level difference and the downmixed signal. - Optionally, in an embodiment, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
-
FIG. 11 is a schematic block diagram of an encoder according to an embodiment of this application. Anencoder 1100 inFIG. 11 includes: - a
memory 1110, configured to store a program; and - a
processor 1120, configured to execute the program, and when the program is executed, theprocessor 1120 is configured to determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal; determine a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter; and quantize the first channel signal and the second channel signal based on the downmixed signal and the target reverberation gain parameter, and write a quantized first channel signal and a quantized second channel signal into a bitstream. - The
encoder 1100 may correspond to the multi-channel signal encoding method inFIG. 3 , and theencoder 1100 may perform the multi-channel signal encoding method inFIG. 3 . - In this application, when a target reverberation gain parameter of a channel signal is being determined, a correlation between the channel signal and the downmixed signal is considered. In this way, a better processing effect can be obtained when reverberation processing is performed on the channel signal based on the target reverberation gain parameter, thereby improving quality of a channel signal obtained after reverberation processing.
- Optionally, in an embodiment, the
processor 1120 is specifically configured to determine a target attenuation factor based on the correlation between the first channel signal and the downmixed signal and the correlation between the second channel signal and the downmixed signal; and adjust the initial reverberation gain parameter based on the target attenuation factor to obtain the target reverberation gain parameter. - Optionally, in an embodiment, each of the first channel signal and the second channel signal includes a plurality of frequency bins, and the
processor 1120 is specifically configured to: determine difference values between energy of the first channel signal and energy of the downmixed signal at the plurality of frequency bins and between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins, and determine the target attenuation factor based on the difference values. - Optionally, in an embodiment, the
processor 1120 is specifically configured to: determine a first difference value between the energy of the first channel signal and the energy of the downmixed signal, where the first difference value indicates a sum of absolute values of the difference values between the energy of the first channel signal and the energy of the downmixed signal at the plurality of frequency bins; determine a second difference value between the energy of the second channel signal and the energy of the downmixed signal, where the second difference value indicates a sum of absolute values of the difference values between the energy of the second channel signal and the energy of the downmixed signal at the plurality of frequency bins; and determine the target attenuation factor based on a ratio between the first difference value and the second difference value. - Optionally, in an embodiment, before determining the target attenuation factor based on the difference values, the
processor 1120 is further specifically configured to: determine that the difference values are greater than a preset threshold. - Optionally, in an embodiment, the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- Optionally, in an embodiment, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the multi-channel signal, and any subband corresponds to only one attenuation factor.
-
FIG. 12 is a schematic block diagram of an encoder according to an embodiment of this application. Anencoder 1200 inFIG. 12 includes: - a
memory 1210, configured to store a program; and - a
processor 1220, configured to execute the program, and when the program is executed, theprocessor 1220 is configured to determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal; determine identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal, where the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; and quantize the first channel signal and the second channel signal based on the downmixed signal, the initial reverberation gain parameter, and the identification information, and write a quantized first channel signal and a quantized second channel signal into a bitstream. - In this application, a channel signal whose initial reverberation gain parameter needs to be adjusted can be determined based on a correlation between the channel signal and the downmixed signal, so that a decoder side can first adjust initial reverberation gain parameter of some channel signals and then perform reverberation processing on these channel signals, thereby improving quality of a channel signal obtained after reverberation processing.
- It should be understood that the
encoder 1200 may correspond to the multi-channel signal encoding method inFIG. 6 , and theencoder 1200 may perform the multi-channel signal encoding method inFIG. 6 . - Optionally, in an embodiment, the
processor 1220 is specifically configured to determine the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal. - Optionally, in an embodiment, the
processor 1220 is specifically configured to: determine a first difference value and a second difference value, where the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; and determine the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value. - Optionally, in an embodiment, the
processor 1220 is specifically configured to determine the larger difference value in the first difference value and the second difference value as a target difference value, and determine the identification information based on the target difference value, where the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted. - Optionally, in an embodiment, the
processor 1220 is further specifically configured to: determine a target attenuation factor based on the first difference value and the second difference value, where the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; and quantize the target attenuation factor, and write a quantized target attenuation factor into the bitstream. - Optionally, in an embodiment, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- Optionally, in an embodiment, the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
-
FIG. 13 is a schematic block diagram of a decoder according to an embodiment of this application. Adecoder 1300 inFIG. 13 includes: - a
memory 1310, configured to store a program; and - a
processor 1320, configured to execute the program, and when the program is executed, theprocessor 1320 is configured to obtain a bitstream; determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, and identification information of the first channel signal and the second channel signal based on the bitstream, where the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; determine, as a target channel signal based on the identification information, the channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; and adjust the initial reverberation gain parameter of the target of channel signal. - In this application, the channel signal whose initial reverberation gain parameter needs to be adjusted can be determined by using the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before reverberation processing is performed on the channel signal, thereby improving quality of a channel signal obtained after reverberation processing.
- It should be understood that the
decoder 1300 may correspond to the multi-channel signal decoding method inFIG. 7 , and thedecoder 1300 may perform the multi-channel signal decoding method inFIG. 7 . - Optionally, in an embodiment, the
processor 1320 is specifically configured to determine a target attenuation factor, and adjust the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal. - Optionally, in an embodiment, the
processor 1320 is specifically configured to determine a preset attenuation factor as the target attenuation factor. - Optionally, in an embodiment, the
processor 1320 is specifically configured to obtain the target attenuation factor based on the bitstream. - Optionally, in an embodiment, the
processor 1320 is specifically configured to obtain an inter-channel level difference between the first channel signal and the second channel signal from the bitstream, and determine the target attenuation factor based on the inter-channel level difference, or determine the target attenuation factor based on the inter-channel level difference and the downmixed signal. - Optionally, in an embodiment, the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- A person of ordinary skill in the art may be aware that, in combination with the examples of units and algorithm steps described in the embodiments disclosed in this specification, the embodiments may be implemented by electronic hardware or a combination of computer software and electronic hardware. Whether the functions are performed by hardware or software depends on particular applications and design constraint conditions of the technical solutions. A person skilled in the art may use different methods to implement the described functions for each particular application, but it should not be considered that the implementation goes beyond the scope of this application.
- It may be clearly understood by a person skilled in the art that, for the purpose of convenient and brief description, for a detailed working process of the foregoing system, apparatus, and unit, refer to a corresponding process in the foregoing method embodiments, and details are not described herein again.
- In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus, and method may be implemented in other manners. For example, the described apparatus embodiment is merely an example. For example, the unit division is merely logical function division and may be other division in actual implementation. For example, a plurality of units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the displayed or discussed mutual couplings or direct couplings or communication connections may be implemented by using some interfaces. The indirect couplings or communication connections between the apparatuses or units may be implemented in electronic, mechanical, or other forms.
- The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one position, or may be distributed on a plurality of network units. Some or all of the units may be selected based on actual requirements to achieve the objectives of the solutions of the embodiments.
- In addition, functional units in the embodiments of this application may be integrated into one processing unit, or each of the units may exist alone physically, or two or more units are integrated into one unit.
- When the functions are implemented in the form of a software functional unit and sold or used as an independent product, the functions may be stored in a computer-readable storage medium. Based on such an understanding, the technical solutions of this application essentially, or the part contributing to the prior art, or some of the technical solutions may be implemented in a form of a software product. The computer software product is stored in a storage medium, and includes several instructions for instructing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of this application. The foregoing storage medium includes: any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk, or an optical disc.
- The foregoing descriptions are merely specific implementations of this application, but are not intended to limit the protection scope of this application. Any variation or replacement readily figured out by a person skilled in the art within the technical scope disclosed in this application shall fall within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.
Claims (26)
- A multi-channel signal encoding method, comprising:determining (610) a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal;determining (620) identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal, wherein the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; andquantizing (630) the first channel signal and the second channel signal based on the downmixed signal, the initial reverberation gain parameter, and the identification information, and writing a quantized first channel signal and a quantized second channel signal into a bitstream.
- The method according to claim 1, wherein the determining identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal comprises:
determining the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal. - The method according to claim 2, wherein the determining the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal comprises:determining a first difference value and a second difference value, wherein the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; anddetermining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value.
- The method according to claim 3, wherein the determining the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value comprises:determining the larger difference value in the first difference value and the second difference value as a target difference value; anddetermining the identification information based on the target difference value, wherein the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted.
- The method according to claim 4, wherein the method further comprises:determining a target attenuation factor based on the first difference value and the second difference value, wherein the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; andquantizing the target attenuation factor, and writing a quantized target attenuation factor into the bitstream.
- The method according to claim 5, wherein the target attenuation factor comprises a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- The method according to any one of claims 2 to 6, wherein the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- A multi-channel signal decoding method, comprising:obtaining (710) a bitstream;determining (720) a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, and identification information of the first channel signal and the second channel signal based on the bitstream, wherein the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted;determining (730), as a target channel signal based on the identification information, the channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; andadjusting (740) the initial reverberation gain parameter of the target channel signal.
- The method according to claim 8, wherein the adjusting an initial reverberation gain parameter of the target channel signal comprises:determining a target attenuation factor; andadjusting the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal.
- The method according to claim 9, wherein the determining a target attenuation factor comprises:
determining a preset attenuation factor as the target attenuation factor. - The method according to claim 9, wherein the determining a target attenuation factor comprises:
obtaining the target attenuation factor based on the bitstream. - The method according to claim 9, wherein the determining a target attenuation factor comprises:obtaining an inter-channel level difference between the first channel signal and the second channel signal from the bitstream; anddetermining the target attenuation factor based on the inter-channel level difference; ordetermining the target attenuation factor based on the inter-channel level difference and the downmixed signal.
- The method according to any one of claims 9 to 12, wherein the target attenuation factor comprises a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- An encoder (900), comprising:a processing unit (910), configured to determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal; whereinthe processing unit (910) is further configured to determine identification information of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, and a correlation between the second channel signal and the downmixed signal, wherein the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; andan encoding unit (920), configured to quantize the first channel signal and the second channel signal based on the downmixed signal, the initial reverberation gain parameter, and the identification information, and write a quantized first channel signal and a quantized second channel signal into a bitstream.
- The encoder according to claim 14, wherein the processing unit is specifically configured to:
determine the identification information of the first channel signal and the second channel signal based on a correlation between energy of the first channel signal and energy of the downmixed signal and a correlation between energy of the second channel signal and the energy of the downmixed signal. - The encoder according to claim 15, wherein the processing unit is specifically configured to:determine a first difference value and a second difference value, wherein the first difference value is a sum of absolute values of difference values between energy of the first channel signal and energy of the downmixed signal at a plurality of frequency bins, and the second difference value is a sum of absolute values of difference values between energy of the second channel signal and energy of the downmixed signal at the plurality of frequency bins; anddetermine the identification information of the first channel signal and the second channel signal based on the first difference value and the second difference value.
- The encoder according to claim 16, wherein the processing unit is specifically configured to:determine the larger difference value in the first difference value and the second difference value as a target difference value; anddetermine the identification information based on the target difference value, wherein the identification information indicates a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal whose initial reverberation gain parameter needs to be adjusted.
- The encoder according to claim 17, wherein the processing unit is further configured to:determine a target attenuation factor based on the first difference value and the second difference value, wherein the target attenuation factor is used to adjust an initial reverberation gain parameter of a target channel signal; andquantize the target attenuation factor, and write a quantized target attenuation factor into the bitstream.
- The encoder according to claim 18, wherein the target attenuation factor comprises a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
- The encoder according to any one of claims 15 to 19, wherein the energy of the downmixed signal is determined based on the energy of the first channel signal and the energy of the second channel signal.
- A decoder (1000), comprising:an obtaining unit (1010), configured to obtain a bitstream; anda processing unit (1020), configured to determine a downmixed signal of a first channel signal and a second channel signal in a multi-channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, and identification information of the first channel signal and the second channel signal based on the bitstream, wherein the identification information indicates a channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; whereinthe processing unit (1020) is further configured to determine, as a target channel signal based on the identification information, the channel signal that is in the first channel signal and the second channel signal and whose initial reverberation gain parameter needs to be adjusted; andthe processing unit (1020) is further configured to adjust the initial reverberation gain parameter of the target channel signal.
- The decoder according to claim 21, wherein the processing unit is specifically configured to:determine a target attenuation factor; andadjust the initial reverberation gain parameter of the target channel signal based on the target attenuation factor, to obtain a target reverberation gain parameter of the target channel signal.
- The decoder according to claim 22, wherein the processing unit is specifically configured to:
determine a preset attenuation factor as the target attenuation factor. - The decoder according to claim 22, wherein the processing unit is specifically configured to:
obtain the target attenuation factor based on the bitstream. - The decoder according to claim 22, wherein the processing unit is specifically configured to:obtain an inter-channel level difference between the first channel signal and the second channel signal from the bitstream; anddetermine the target attenuation factor based on the inter-channel level difference; ordetermine the target attenuation factor based on the inter-channel level difference and the downmixed signal.
- The decoder according to any one of claims 22 to 25, wherein the target attenuation factor comprises a plurality of attenuation factors, each of the plurality of attenuation factors corresponds to at least one subband of the target channel signal, and any subband corresponds to only one attenuation factor.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP24152513.8A EP4375994A2 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder |
EP21170071.1A EP3917171B1 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710205821.2A CN108665902B (en) | 2017-03-31 | 2017-03-31 | Coding and decoding method and coder and decoder of multi-channel signal |
PCT/CN2018/077782 WO2018177066A1 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding and decoding method and codec |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP21170071.1A Division EP3917171B1 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder |
EP24152513.8A Division EP4375994A2 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder |
Publications (3)
Publication Number | Publication Date |
---|---|
EP3588497A1 EP3588497A1 (en) | 2020-01-01 |
EP3588497A4 EP3588497A4 (en) | 2020-01-15 |
EP3588497B1 true EP3588497B1 (en) | 2021-05-12 |
Family
ID=63674221
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP18776186.1A Active EP3588497B1 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding and decoding method and codec |
EP24152513.8A Pending EP4375994A2 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder |
EP21170071.1A Active EP3917171B1 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP24152513.8A Pending EP4375994A2 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder |
EP21170071.1A Active EP3917171B1 (en) | 2017-03-31 | 2018-03-01 | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder |
Country Status (8)
Country | Link |
---|---|
US (3) | US11386907B2 (en) |
EP (3) | EP3588497B1 (en) |
JP (4) | JP6804666B2 (en) |
KR (1) | KR102281097B1 (en) |
CN (2) | CN108665902B (en) |
BR (1) | BR112019020468A2 (en) |
ES (1) | ES2882626T3 (en) |
WO (1) | WO2018177066A1 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108665902B (en) * | 2017-03-31 | 2020-12-01 | 华为技术有限公司 | Coding and decoding method and coder and decoder of multi-channel signal |
CN108694955B (en) | 2017-04-12 | 2020-11-17 | 华为技术有限公司 | Coding and decoding method and coder and decoder of multi-channel signal |
CN111654745B (en) * | 2020-06-08 | 2022-10-14 | 海信视像科技股份有限公司 | Multi-channel signal processing method and display device |
CN113985780B (en) * | 2021-10-28 | 2024-01-12 | 中国人民解放军战略支援部队信息工程大学 | Multi-channel remote control device and method, storage medium and electronic equipment |
Family Cites Families (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ITMI20031258A1 (en) | 2003-06-20 | 2004-12-21 | Nextec Srl | PROCESS AND MACHINE FOR WATERPROOFING SEMI-FINISHED PRODUCTS OF FOOTWEAR, CLOTHING AND ACCESSORIES, AND SEMI-FINISHED PRODUCTS OBTAINED BY SUCH PROCEDURE OR MACHINE. |
SE0400998D0 (en) * | 2004-04-16 | 2004-04-16 | Cooding Technologies Sweden Ab | Method for representing multi-channel audio signals |
CA2572805C (en) * | 2004-07-02 | 2013-08-13 | Matsushita Electric Industrial Co., Ltd. | Audio signal decoding device and audio signal encoding device |
EP1946294A2 (en) * | 2005-06-30 | 2008-07-23 | LG Electronics Inc. | Apparatus for encoding and decoding audio signal and method thereof |
JP2007025290A (en) * | 2005-07-15 | 2007-02-01 | Matsushita Electric Ind Co Ltd | Device controlling reverberation of multichannel audio codec |
TWI396188B (en) | 2005-08-02 | 2013-05-11 | Dolby Lab Licensing Corp | Controlling spatial audio coding parameters as a function of auditory events |
US8184817B2 (en) * | 2005-09-01 | 2012-05-22 | Panasonic Corporation | Multi-channel acoustic signal processing device |
EP1946310A4 (en) * | 2005-10-26 | 2011-03-09 | Lg Electronics Inc | Method for encoding and decoding multi-channel audio signal and apparatus thereof |
ATE476732T1 (en) * | 2006-01-09 | 2010-08-15 | Nokia Corp | CONTROLLING BINAURAL AUDIO SIGNALS DECODING |
US8560303B2 (en) * | 2006-02-03 | 2013-10-15 | Electronics And Telecommunications Research Institute | Apparatus and method for visualization of multichannel audio signals |
US7965848B2 (en) * | 2006-03-29 | 2011-06-21 | Dolby International Ab | Reduced number of channels decoding |
US8027479B2 (en) | 2006-06-02 | 2011-09-27 | Coding Technologies Ab | Binaural multi-channel decoder in the context of non-energy conserving upmix rules |
CN101166377A (en) * | 2006-10-17 | 2008-04-23 | 施伟强 | A low code rate coding and decoding scheme for multi-language circle stereo |
KR20080052813A (en) * | 2006-12-08 | 2008-06-12 | 한국전자통신연구원 | Apparatus and method for audio coding based on input signal distribution per channels |
KR20080066537A (en) * | 2007-01-12 | 2008-07-16 | 엘지전자 주식회사 | Encoding/decoding an audio signal with a side information |
CN101149925B (en) * | 2007-11-06 | 2011-02-16 | 武汉大学 | Space parameter selection method for parameter stereo coding |
CN101572088A (en) * | 2008-04-30 | 2009-11-04 | 北京工业大学 | Stereo encoding and decoding method, a coder-decoder and encoding and decoding system |
EP2144229A1 (en) * | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Efficient use of phase information in audio encoding and decoding |
KR101614160B1 (en) | 2008-07-16 | 2016-04-20 | 한국전자통신연구원 | Apparatus for encoding and decoding multi-object audio supporting post downmix signal |
CN103561378B (en) | 2008-07-31 | 2015-12-23 | 弗劳恩霍夫应用研究促进协会 | The signal of binaural signal generates |
CN101673548B (en) * | 2008-09-08 | 2012-08-08 | 华为技术有限公司 | Parametric stereo encoding method, parametric stereo encoding device, parametric stereo decoding method and parametric stereo decoding device |
CN102257562B (en) * | 2008-12-19 | 2013-09-11 | 杜比国际公司 | Method and apparatus for applying reverb to a multi-channel audio signal using spatial cue parameters |
US8219408B2 (en) | 2008-12-29 | 2012-07-10 | Motorola Mobility, Inc. | Audio signal decoder and method for producing a scaled reconstructed audio signal |
KR101137361B1 (en) | 2009-01-28 | 2012-04-26 | 엘지전자 주식회사 | A method and an apparatus for processing an audio signal |
CN102307323B (en) * | 2009-04-20 | 2013-12-18 | 华为技术有限公司 | Method for modifying sound channel delay parameter of multi-channel signal |
JP5793675B2 (en) * | 2009-07-31 | 2015-10-14 | パナソニックIpマネジメント株式会社 | Encoding device and decoding device |
CN102714038B (en) * | 2009-11-20 | 2014-11-05 | 弗兰霍菲尔运输应用研究公司 | Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-cha |
US8540177B2 (en) * | 2009-11-20 | 2013-09-24 | Penta TMR Inc. | Vertical feed mixer having cutout edge |
JP5333257B2 (en) | 2010-01-20 | 2013-11-06 | 富士通株式会社 | Encoding apparatus, encoding system, and encoding method |
CN102157151B (en) * | 2010-02-11 | 2012-10-03 | 华为技术有限公司 | Encoding method, decoding method, device and system of multichannel signals |
JP5299327B2 (en) | 2010-03-17 | 2013-09-25 | ソニー株式会社 | Audio processing apparatus, audio processing method, and program |
US9424852B2 (en) * | 2011-02-02 | 2016-08-23 | Telefonaktiebolaget Lm Ericsson (Publ) | Determining the inter-channel time difference of a multi-channel audio signal |
KR101842258B1 (en) | 2011-09-14 | 2018-03-27 | 삼성전자주식회사 | Method for signal processing, encoding apparatus thereof, and decoding apparatus thereof |
RU2015121941A (en) | 2012-11-09 | 2017-01-10 | Стормингсвисс Сарл | NONLINEAR REVERSE CODING OF MULTI-CHANNEL SIGNALS |
JP6160072B2 (en) | 2012-12-06 | 2017-07-12 | 富士通株式会社 | Audio signal encoding apparatus and method, audio signal transmission system and method, and audio signal decoding apparatus |
US9715880B2 (en) * | 2013-02-21 | 2017-07-25 | Dolby International Ab | Methods for parametric multi-channel encoding |
EP2840811A1 (en) * | 2013-07-22 | 2015-02-25 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method for processing an audio signal; signal processing unit, binaural renderer, audio encoder and audio decoder |
CN103700372B (en) * | 2013-12-30 | 2016-10-05 | 北京大学 | A kind of parameter stereo coding based on orthogonal decorrelation technique, coding/decoding method |
CN104995915B (en) * | 2015-02-05 | 2018-11-30 | 华为技术有限公司 | Decoding method and codec |
CN105405445B (en) * | 2015-12-10 | 2019-03-22 | 北京大学 | A kind of parameter stereo coding, coding/decoding method based on transmission function between sound channel |
CN108665902B (en) | 2017-03-31 | 2020-12-01 | 华为技术有限公司 | Coding and decoding method and coder and decoder of multi-channel signal |
-
2017
- 2017-03-31 CN CN201710205821.2A patent/CN108665902B/en active Active
-
2018
- 2018-03-01 EP EP18776186.1A patent/EP3588497B1/en active Active
- 2018-03-01 EP EP24152513.8A patent/EP4375994A2/en active Pending
- 2018-03-01 KR KR1020197029632A patent/KR102281097B1/en active IP Right Grant
- 2018-03-01 EP EP21170071.1A patent/EP3917171B1/en active Active
- 2018-03-01 BR BR112019020468A patent/BR112019020468A2/en unknown
- 2018-03-01 CN CN201880022744.XA patent/CN110462733B/en active Active
- 2018-03-01 WO PCT/CN2018/077782 patent/WO2018177066A1/en unknown
- 2018-03-01 JP JP2019553260A patent/JP6804666B2/en active Active
- 2018-03-01 ES ES18776186T patent/ES2882626T3/en active Active
-
2019
- 2019-09-27 US US16/586,128 patent/US11386907B2/en active Active
-
2020
- 2020-12-01 JP JP2020199446A patent/JP7035154B2/en active Active
-
2022
- 2022-03-02 JP JP2022031743A patent/JP7436541B2/en active Active
- 2022-06-10 US US17/837,558 patent/US11894001B2/en active Active
-
2023
- 2023-12-22 US US18/393,866 patent/US20240135938A1/en active Pending
-
2024
- 2024-02-08 JP JP2024018177A patent/JP2024059683A/en active Pending
Non-Patent Citations (1)
Title |
---|
None * |
Also Published As
Publication number | Publication date |
---|---|
EP3917171B1 (en) | 2024-04-24 |
KR102281097B1 (en) | 2021-07-22 |
JP2022084671A (en) | 2022-06-07 |
CN108665902B (en) | 2020-12-01 |
JP2020512590A (en) | 2020-04-23 |
EP3588497A1 (en) | 2020-01-01 |
US20240135938A1 (en) | 2024-04-25 |
JP2021047432A (en) | 2021-03-25 |
KR20190122839A (en) | 2019-10-30 |
JP7035154B2 (en) | 2022-03-14 |
WO2018177066A1 (en) | 2018-10-04 |
JP6804666B2 (en) | 2020-12-23 |
EP4375994A2 (en) | 2024-05-29 |
ES2882626T3 (en) | 2021-12-02 |
US11386907B2 (en) | 2022-07-12 |
US20220310104A1 (en) | 2022-09-29 |
CN110462733A (en) | 2019-11-15 |
CN108665902A (en) | 2018-10-16 |
EP3917171A1 (en) | 2021-12-01 |
EP3588497A4 (en) | 2020-01-15 |
US20200027466A1 (en) | 2020-01-23 |
JP7436541B2 (en) | 2024-02-21 |
US11894001B2 (en) | 2024-02-06 |
JP2024059683A (en) | 2024-05-01 |
BR112019020468A2 (en) | 2020-04-28 |
CN110462733B (en) | 2022-05-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11178505B2 (en) | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder | |
RU2704733C1 (en) | Device and method of encoding or decoding a multichannel signal using a broadband alignment parameter and a plurality of narrowband alignment parameters | |
US11894001B2 (en) | Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
17P | Request for examination filed |
Effective date: 20190923 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
AX | Request for extension of the european patent |
Extension state: BA ME |
|
A4 | Supplementary search report drawn up and despatched |
Effective date: 20191212 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: H04S 5/00 20060101ALI20191206BHEP Ipc: G10L 19/04 20130101AFI20191206BHEP Ipc: G10L 19/008 20130101ALI20191206BHEP Ipc: H04S 1/00 20060101ALI20191206BHEP |
|
DAV | Request for validation of the european patent (deleted) | ||
DAX | Request for extension of the european patent (deleted) | ||
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: GRANT OF PATENT IS INTENDED |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 19/008 20130101ALI20201123BHEP Ipc: H04S 5/00 20060101ALI20201123BHEP Ipc: G10L 19/04 20130101AFI20201123BHEP Ipc: H04S 1/00 20060101ALI20201123BHEP |
|
INTG | Intention to grant announced |
Effective date: 20201211 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE PATENT HAS BEEN GRANTED |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 602018017094 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: REF Ref document number: 1392708 Country of ref document: AT Kind code of ref document: T Effective date: 20210615 |
|
REG | Reference to a national code |
Ref country code: SE Ref legal event code: TRGR |
|
REG | Reference to a national code |
Ref country code: LT Ref legal event code: MG9D |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: MK05 Ref document number: 1392708 Country of ref document: AT Kind code of ref document: T Effective date: 20210512 |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: MP Effective date: 20210512 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210812 Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210913 Ref country code: RS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: NO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210812 Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210813 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210912 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2882626 Country of ref document: ES Kind code of ref document: T3 Effective date: 20211202 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: SM Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 602018017094 Country of ref document: DE |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20220215 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210912 Ref country code: AL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
REG | Reference to a national code |
Ref country code: BE Ref legal event code: MM Effective date: 20220331 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20220301 Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20220331 Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20220301 Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20220331 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20220331 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20230208 Year of fee payment: 6 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: SE Payment date: 20230210 Year of fee payment: 6 Ref country code: IT Payment date: 20230213 Year of fee payment: 6 |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20230524 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: ES Payment date: 20230406 Year of fee payment: 6 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20210512 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20240130 Year of fee payment: 7 Ref country code: GB Payment date: 20240201 Year of fee payment: 7 |