WO2018177066A1 - 多声道信号的编解码方法和编解码器 - Google Patents

多声道信号的编解码方法和编解码器 Download PDF

Info

Publication number
WO2018177066A1
WO2018177066A1 PCT/CN2018/077782 CN2018077782W WO2018177066A1 WO 2018177066 A1 WO2018177066 A1 WO 2018177066A1 CN 2018077782 W CN2018077782 W CN 2018077782W WO 2018177066 A1 WO2018177066 A1 WO 2018177066A1
Authority
WO
WIPO (PCT)
Prior art keywords
channel signal
signal
target
channel
difference value
Prior art date
Application number
PCT/CN2018/077782
Other languages
English (en)
French (fr)
Inventor
刘泽新
苗磊
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to KR1020197029632A priority Critical patent/KR102281097B1/ko
Priority to EP21170071.1A priority patent/EP3917171B1/en
Priority to JP2019553260A priority patent/JP6804666B2/ja
Priority to EP24152513.8A priority patent/EP4375994A2/en
Priority to EP18776186.1A priority patent/EP3588497B1/en
Priority to BR112019020468A priority patent/BR112019020468A2/pt
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to CN201880022744.XA priority patent/CN110462733B/zh
Priority to ES18776186T priority patent/ES2882626T3/es
Publication of WO2018177066A1 publication Critical patent/WO2018177066A1/zh
Priority to US16/586,128 priority patent/US11386907B2/en
Priority to US17/837,558 priority patent/US11894001B2/en
Priority to US18/393,866 priority patent/US20240135938A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S5/00Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation 

Definitions

  • the present application relates to the field of audio coding, and more particularly to a codec method and codec for multi-channel signals.
  • stereo audio has the sense of orientation and distribution of each sound source, which can improve the clarity, intelligibility and presence of sound, and is therefore favored by people.
  • Stereo processing technologies mainly include Mid/Sid (MS) encoding, Intensity Stereo (IS) encoding, and Parametric Stereo (PS) encoding.
  • MS Mid/Sid
  • IS Intensity Stereo
  • PS Parametric Stereo
  • the encoding end when the PS signal is used to encode the channel signal, the encoding end performs spatial parameter analysis on the plurality of channel signals to obtain reverberation gain parameters and other spatial parameters of the plurality of channel signals, and multiple sounds.
  • the reverberation gain parameter of the channel signal and other spatial parameters are encoded, so that the decoding end can perform reverberation processing on the decoded plurality of channel signals according to the reverberation gain parameter of the channel signal during decoding to increase the auditory effect.
  • the reverberation processing is performed on the decoded plurality of channel signals according to the reverberation gain parameters corresponding to the plurality of channel signals. On the contrary, it will lead to a worse hearing effect.
  • the present application provides a codec method and codec for multi-channel signals to improve the quality of channel signals.
  • a method for encoding a multi-channel signal comprising: determining a downmix signal of a first channel signal and a second channel signal of the multichannel signal and the first channel signal And an initial reverberation gain parameter of the second channel signal; a correlation between the first channel signal and the second channel signal and the downmix signal, respectively, and the initial reverberation gain parameter Determining a target reverberation gain parameter of the first channel signal and the second channel signal; and the first channel signal and the according to the downmix signal and the target reverberation gain parameter
  • the second channel signal is quantized and the quantized first channel signal and second channel signal are written to the code stream.
  • the correlation between the channel signal and the downmix signal is considered, so that it is possible to obtain a reverberation process for the channel signal according to the target reverberation gain parameter.
  • the correlation between the first channel signal or the second channel signal and the downmix signal may be determined according to the difference between the energy of the first channel signal or the second channel signal and the energy of the downmix signal, or may be according to the first The difference between the amplitude of the channel signal or the second channel signal and the amplitude of the downmix signal is determined.
  • the first channel signal, the second channel signal, and the downmix signal are channel signals obtained after normalization.
  • the correlation between the first channel signal and the second channel signal and the downmix signal, and the initial mixing Determining a target reverberation gain parameter of the first channel signal and the second channel signal, including: according to the first channel signal, the second channel signal and the lower The correlation of the mixed signals is determined, and the target attenuation factor is determined; and the initial reverberation gain parameter is adjusted according to the target attenuation factor to obtain the target reverberation gain parameter.
  • the initial reverberation gain parameter of the channel signal can be flexibly adjusted according to the magnitude of the correlation between the channel signal and the downmix signal by the attenuation factor.
  • the energy of the channel signal can more conveniently measure the correlation between the first channel signal, the second channel signal and the downmix signal, that is, by comparing the difference between the energy of the channel signal and the downmix signal, it can be conveniently Determine the target attenuation factor. Specifically, when the difference between the energy of the first channel signal or the second channel signal and the downmix signal is larger (greater than a given threshold), the first channel signal and the second channel signal and the downmix signal may be considered.
  • the correlation is weak, and a larger target attenuation factor can be determined at this time, and when the difference between the energy of the first channel signal or the second channel signal and the downmix signal is small (less than a given threshold), the first The correlation between the first channel signal and the second channel signal and the downmix signal is weak, and a smaller target attenuation factor can be determined.
  • determining the target attenuation factor may calculate the target attenuation factor according to the correlation between the channel signal and the lower signal, or may consider the sound. After the correlation between the channel signal and the downmix signal, the preset attenuation factor is directly determined as the target attenuation factor.
  • the first channel signal and the second channel signal each include a plurality of frequency points, according to the first channel signal, Determining a correlation between the second channel signal and the downmix signal, respectively, determining a target attenuation factor, comprising: determining that the first channel signal and the second channel signal are respectively associated with the downmix signal a difference value of energy of the plurality of frequency points; determining the target attenuation factor according to the difference value.
  • the difference in energy of the mixed signal determines the attenuation factor without having to compare the difference in energy between the first channel signal and the second channel signal and the downmix signal over the entire frequency band.
  • the determining the energy of the first channel signal and the second channel signal and the downmix signal at the plurality of frequency points, respectively a difference value comprising: determining a first difference value of an energy of the first channel signal and an energy of the downmix signal, the first difference value being used to indicate the first channel signal and the lower a sum of absolute values of differences in energy of the mixed signal at the plurality of frequency points; a second difference value of the energy of the second channel signal and the energy of the downmix signal, the second difference value being used And a sum of absolute values indicating a difference between energy of the second channel signal and the downmix signal respectively at a plurality of frequency points; determining the target attenuation factor according to the difference value, including: according to Determining the ratio of the first difference value and the second difference value to determine the target attenuation factor.
  • the target attenuation factor may also be determined directly from the first difference value and the second difference value.
  • the method prior to determining the target attenuation factor based on the difference value, the method further comprises determining that the difference value is greater than a predetermined threshold.
  • the target attenuation factor is determined only when the difference between the energy of the first channel signal and the second channel signal and the downmix signal is relatively large, and the initial reverberation gain parameter is adjusted according to the target attenuation factor, and the difference is In the case of a small value, the initial reverberation gain parameter may not be adjusted, thereby improving coding efficiency.
  • the initial reverberation gain parameter of the plurality of channel signals may be directly determined as the target reverberation gain parameter of the plurality of channel signals.
  • the energy of the downmix signal is determined based on energy of the first channel signal and the second channel signal.
  • the energy of the downmix signal can be calculated by the energy of the first channel signal and the second channel signal, without being calculated by the downmix signal itself, which simplifies a certain calculation process.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least the plurality of channel signals One subband, and any subband only corresponds to one attenuation factor.
  • the reverberation gain parameter can be more flexibly adjusted according to the target attenuation factor.
  • the frequency band in which the first channel signal and the second channel signal are located includes a first frequency band and a second frequency band, and the subband in the first frequency band
  • the corresponding attenuation factor is less than or equal to the attenuation factor corresponding to the subband in the second frequency band, wherein the frequency of the first frequency band is smaller than the frequency of the second frequency band.
  • the reverberation gain parameters corresponding to the high frequency and low frequency sub-bands can be adjusted to different degrees, and reverberation processing can be performed. Get better processing results.
  • a method for decoding a multi-channel signal comprising: determining a downmix signal of a first channel signal and a second channel signal of the multichannel signal and the first channel signal And an initial reverberation gain parameter of the second channel signal; determining the first channel signal according to a correlation between the first channel signal and the second channel signal and the downmix signal respectively And the identification information of the second channel signal, the identifier information is used to indicate a channel signal of the first channel signal and the second channel signal that needs to adjust an initial reverberation gain parameter; a downmix signal, the initial reverberation gain parameter, and the identification information, quantizing the first channel signal and the second channel signal, and quantizing the first channel signal and the second sound
  • the track signal is written to the code stream.
  • the correlation between the first channel signal or the second channel signal and the downmix signal may be determined according to the difference between the energy of the first channel signal or the second channel signal and the energy of the downmix signal, or may be according to the first The difference between the amplitude of the channel signal or the second channel signal and the amplitude of the downmix signal is determined.
  • the channel signal that needs to adjust the initial reverberation gain parameter can be determined according to the correlation between the channel signal and the downmix signal, so that the decoding end can first adjust the initial reverberation gain parameter of some channel signals.
  • the reverberation processing of these channel signals is performed to improve the quality of the channel signal after the reverberation processing.
  • the identification information of the channel signal and the second channel signal includes: determining, according to the correlation between the energy of the first channel signal and the second channel signal and the energy of the downmix signal, respectively Identification information of the first channel signal and the second channel signal.
  • the correlation between the first channel signal and the second channel signal and the downmix signal can be conveniently measured, thereby making it easier to determine the sound that needs to adjust the initial reverberation gain parameter. Signal.
  • the correlation between the first channel signal and the second channel signal and the energy of the downmix signal respectively determines the first sound
  • the identification information of the track signal and the second channel signal includes: determining a first difference value and a second difference value, wherein the first difference value is that the first channel signal and the downmix signal are respectively a sum of absolute values of differences in energy of the plurality of frequency points, the second difference value being a sum of absolute values of differences in energy of the second channel signal and the downmix signal respectively at a plurality of frequency points And determining identification information of the first channel signal and the second channel signal according to the first difference value and the second difference value.
  • the energy values of the first channel signal, the second channel signal, and the downmix signal may be normalized values.
  • the first channel signal, the second channel signal, and the downmix signal can be conveniently determined.
  • the difference in energy determines the channel signal that needs to adjust the initial reverberation gain parameter without having to compare the difference in energy between the first channel signal and the second channel signal and the downmix signal over the entire frequency band.
  • determining the identification information of the first channel signal and the second channel signal according to the first difference value and the second difference value includes: determining a maximum difference value of the first difference value and the second difference value as a target difference value; determining the identifier information according to the target difference value, where the identifier information is specifically used to indicate the target difference value
  • the channel signals corresponding to the target difference values are channel signals that need to adjust an initial reverberation gain parameter.
  • the method further comprising: determining a target attenuation factor based on the first difference value and the second difference value, the target attenuation factor being used for Adjusting an initial reverberation gain parameter of the target channel signal; quantizing the target attenuation factor, and writing the quantized target attenuation factor to the code stream.
  • the initial reverberation gain parameter of the channel signal can be flexibly adjusted according to the magnitude of the correlation between the channel signal and the downmix signal by the attenuation factor.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one of the target channel signals Subbands, and any subband only corresponds to one attenuation factor.
  • the reverberation gain parameter can be more flexibly adjusted according to the target attenuation factor.
  • the target channel signal includes a first frequency band and a second frequency band, and a sub-band in the first frequency band corresponds to an attenuation factor that is less than or equal to a second frequency band.
  • the reverberation gain parameters corresponding to the high frequency and low frequency sub-bands can be adjusted to different degrees, and reverberation processing can be performed. Get better processing results.
  • the energy of the downmix signal is determined based on the energy of the first channel signal and the second channel signal.
  • a method for decoding a multi-channel signal comprising: acquiring a code stream; and determining a downmix of the first channel signal and the second channel signal in the multichannel signal according to the code stream An initial reverberation gain parameter of the signal, the first channel signal and the second channel signal, and identification information of the first channel signal and the second channel signal, wherein the identification information is used Determining, in the first channel signal and the second channel signal, a channel signal that needs to adjust an initial reverberation gain parameter; determining the first channel signal and the second channel according to the identification information
  • the channel signal in the signal that needs to adjust the initial reverberation gain parameter is the target channel signal; the initial reverberation gain parameter of the target channel signal is adjusted.
  • the channel signal that needs to adjust the initial reverberation gain parameter can be determined by the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before the channel signal is subjected to the reverberation processing, thereby improving the reverberation processing.
  • the quality of the channel signal afterwards.
  • the adjusting an initial reverberation gain parameter of the target channel signal comprises: determining a target attenuation factor; The initial reverberation gain parameter of the target channel signal is adjusted to obtain a target reverberation gain parameter of the target channel signal.
  • the initial reverberation gain parameter of the channel signal can be flexibly adjusted according to the magnitude of the correlation between the channel signal and the downmix signal by the attenuation factor.
  • the determining a target attenuation factor includes determining a predetermined attenuation factor as the target attenuation factor.
  • the process of determining the target attenuation factor can be simplified, thereby improving the efficiency of decoding.
  • the determining the target attenuation factor comprises: obtaining the target attenuation factor based on the code stream.
  • the process of determining the target attenuation factor can be simplified by directly obtaining the target attenuation factor in the code stream, and the decoding efficiency can be improved.
  • the determining a target attenuation factor includes: acquiring, between the channels of the first channel signal and the second channel signal, from the code stream Level difference; determining the target attenuation factor according to the inter-channel level difference, or determining the target attenuation factor according to the inter-channel level difference and the downmix signal.
  • the target attenuation factor can be more flexibly and accurately determined according to the level difference between the channels, the downmix signal, etc., and the initial reverberation parameter of the channel signal can be more accurately adjusted according to the attenuation factor.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one of the target channel signals Subbands, and any subband only corresponds to one attenuation factor.
  • the reverberation gain parameter can be more flexibly adjusted according to the target attenuation factor.
  • the target channel signal includes a first frequency band and a second frequency band, and a sub-band in the first frequency band corresponds to an attenuation factor that is less than or equal to the first The attenuation factor corresponding to the subband in the second frequency band, wherein the frequency of the first frequency band is smaller than the frequency of the second frequency band.
  • the reverberation gain parameters corresponding to the high frequency and low frequency sub-bands can be adjusted to different degrees, and reverberation processing can be performed. Get better processing results.
  • an encoder comprising a module or unit for performing the method of the first aspect or various implementations thereof.
  • an encoder comprising a module or unit for performing the method of the second aspect or various implementations thereof.
  • a decoder comprising a module or unit for performing the method of the third aspect or various implementations thereof.
  • an encoder comprising: a memory for storing a program, the processor for executing a program, and when the program is executed, the processor performs the first aspect or The method in its various implementations.
  • an encoder comprising: a memory for storing a program, the processor for executing a program, and when the program is executed, the processor performs a second aspect or The method in its various implementations.
  • a decoder comprising: a memory for storing a program, the processor for executing a program, and when the program is executed, the processor performs a third aspect or The method in its various implementations.
  • a computer readable medium storing program code for device execution, the program code comprising instructions for performing the method of the first aspect or various implementations thereof.
  • a computer readable medium storing program code for device execution, the program code comprising instructions for performing the method of the second aspect or various implementations thereof .
  • a twelfth aspect a computer readable medium storing program code for device execution, the program code comprising instructions for performing the method of the third aspect or various implementations thereof .
  • FIG. 1 is a schematic flow chart of encoding left and right channel signals in the prior art.
  • FIG. 2 is a schematic flow chart of decoding left and right channel signals in the prior art.
  • FIG. 3 is a schematic flowchart of a method for encoding a multi-channel signal according to an embodiment of the present application.
  • FIG. 4 is a schematic flowchart of a method for decoding a multi-channel signal according to an embodiment of the present application.
  • FIG. 5 is a schematic flowchart of a method for encoding a multi-channel signal according to an embodiment of the present application.
  • FIG. 6 is a schematic flowchart of a method for encoding a multi-channel signal according to an embodiment of the present application.
  • FIG. 7 is a schematic flowchart of a method for decoding a multi-channel signal according to an embodiment of the present application.
  • FIG. 8 is a schematic block diagram of an encoder according to an embodiment of the present application.
  • FIG. 9 is a schematic block diagram of an encoder according to an embodiment of the present application.
  • FIG. 10 is a schematic block diagram of a decoder of an embodiment of the present application.
  • FIG. 11 is a schematic block diagram of an encoder of an embodiment of the present application.
  • FIG. 12 is a schematic block diagram of an encoder of an embodiment of the present application.
  • FIG. 13 is a schematic block diagram of a decoder of an embodiment of the present application.
  • FIG. 1 shows a process of encoding left and right channel signals in the prior art.
  • the encoding process shown in Figure 1 specifically includes:
  • the step 110 specifically includes: performing spatial parameter analysis on the left channel signal and the right channel signal, obtaining spatial parameters of the left channel signal and the right channel signal; and downmixing the left channel signal and the right channel signal. Processing, obtaining a downmix signal (the downmix signal obtained after the downmix processing is a mono audio signal, and the original two channel audio signal is converted into a channel audio signal by downmix processing).
  • Spatial parameters also known as spatially perceptible parameters
  • IC Inter-channel Coherent
  • ITD Inter-channel Level Difference
  • IPD Inter-channel Phase Difference
  • the IC describes the cross-correlation or coherence between the channels. This parameter determines the perception of the sound field range and can improve the spatial sense of the audio signal and the stability of the sound.
  • ILD is used to distinguish the horizontal direction of the stereo source and describes the difference in intensity between the channels, which will affect the frequency content of the entire spectrum.
  • ITD and IPD are spatial parameters that represent the horizontal orientation of the sound source. They describe the time and phase differences between the channels. This parameter mainly affects the frequency components below 2 kHz.
  • the ITD can represent the time delay between the left and right channel signals of the stereo
  • the IPD can represent the waveform similarity of the stereo left and right channel signals after time alignment.
  • ILD, ITD and IPD can determine the human ear's perception of the sound source position, can effectively determine the sound field position, and play an important role in the recovery of stereo signals.
  • the bit stream obtained by encoding the downmix signal and the spatial parameter is multiplexed to obtain a code stream.
  • the encoded code stream can be stored or transmitted to the decoding end device.
  • Fig. 2 shows a process of decoding left and right channel signals in the prior art.
  • the decoding process shown in FIG. 2 specifically includes:
  • the above spatial parameters include an IC of a left channel signal and a right channel signal.
  • the left and right channel signals of the final output are obtained (indicated by L' and R' in Fig. 2, respectively).
  • left channel signal and the right channel signal (represented by L' and R' in FIG. 2, respectively) in step 240 are decoded, and may be certain compared to the left and right channel signals encoded by the encoding end. Distortion.
  • the de-correlation signal can be obtained by filtering the downmix signal and then correcting the filtered downmix signal by using the inter-channel correlation parameter.
  • the purpose of generating the decorrelated signal is to increase the reverberation of the finally generated stereo signal at the decoding end, increase the sound field width of the stereo signal, and make the output audio signal more rounded and full.
  • the so-called reverberation is essentially an effect of the original audio signal being superimposed with the original audio signal into the human ear after a delay of different reflection and refraction.
  • the IC is adaptively adjusted without considering the correlation of signals between different channels, so that the reverberation processing of the channel signal according to the originally acquired IC may cause poor
  • the auditory effect For example, when the correlation between different channel signals is low, if the previously acquired IC is still used to correct the decorrelated signal, and then the decorrelated signal is used to perform the same reverberation processing on different channel signals, The quality of the channel signal that results in the final output of the decoder is poor. That is to say, because the difference between the signals of different channels is large, if the re-correlation signal of the previous larger IC correction is still used for reverberation processing of different channel signals, not only will the mixing of the channel signals be increased. The effect of the sound may also cause distortion of the output channel signal.
  • the embodiment of the present application proposes a coding and decoding method for a multi-channel signal, which can adjust the reverberation gain parameter according to the correlation between different channel signals, and utilize the adjusted reverberation.
  • the gain parameter corrects the decorrelated signal, and then uses the decorrelated signal to perform reverberation processing on different channel signals, so that reverberation processing of different channel signals is taken into consideration between different channel signals. Correlation makes the quality of the output channel signal better.
  • FIG. 3 is a schematic flowchart of a method for encoding a multi-channel signal according to an embodiment of the present application.
  • the method of FIG. 3 may be performed by an encoding device or an encoder, and the method of FIG. 3 includes:
  • the order of determining the downmix signal and determining the initial reverberation gain parameter is not limited, and may be performed simultaneously or sequentially.
  • the above initial reverberation gain parameter may refer to a reverberation gain parameter obtained by performing spatial parameter analysis on the first channel signal and the second channel signal.
  • the downmix signal may be obtained by performing downmix processing on the plurality of channel signals; and acquiring the first channel signal and the second by performing spatial parameter analysis on the first channel signal and the second channel signal.
  • first channel signal and the second channel signal may correspond to the same spatial parameter, and accordingly, the first channel signal and the second channel signal may also correspond to the same initial reverberation gain parameter. That is, the spatial parameter of the first channel signal and the spatial parameter of the second channel signal may be the same, and the initial reverberation gain parameter of the first channel signal and the initial reverberation gain parameter of the second channel signal may also be the same. .
  • first channel signal and the second channel signal each comprise 10 sub-bands, each of which corresponds to one reverberation gain parameter
  • index values of the first channel signal and the second channel signal are the same.
  • the reverberation gain parameters corresponding to the subbands can be the same.
  • first channel signal, the second channel signal, and the downmix signal may be channel signals obtained after normalization processing.
  • the correlation between the first channel signal or the second channel signal and the downmix signal may be determined according to the difference between the energy of the first channel signal or the second channel signal and the energy of the downmix signal, or It is determined according to the difference between the amplitude of the first channel signal or the second channel signal and the amplitude of the downmix signal.
  • the correlation between the first channel signal and the downmix signal may be considered to be large, and when the first channel signal is When the difference in energy or amplitude from the energy or amplitude of the downmix signal is large, it can be considered that the correlation between the first channel signal and the downmix signal is small.
  • the energy difference between the energy of the first channel signal or the second channel signal and the downmix signal may specifically refer to a difference between the energy of the first channel signal or the second channel signal and the energy of the downmix signal.
  • the difference between the amplitudes of the first channel signal and the second channel signal and the amplitude of the downmix signal may specifically refer to a difference between the amplitude of the first channel signal or the second channel signal and the amplitude of the downmix signal.
  • the correlation between the first channel signal or the second channel signal and the downmix signal may also refer to a phase of the first channel signal or the second channel signal, a phase and a phase difference of the downmix signal, and a period difference. Wait.
  • the quantized first channel signal and second channel signal are written to the code stream.
  • the number of multi-channel signals is more than two, for example, when the multi-channel signal includes the first channel signal, the second channel signal, the third channel signal, and the fourth channel signal,
  • the first channel signal and the second channel signal are processed by the method of FIG. 3, and the third channel signal and the fourth channel signal are also processed by the method of FIG.
  • the correlation between the channel signal and the downmix signal is considered, so that the reverberation processing of the channel signal according to the target reverberation gain parameter can be obtained.
  • the target reverberation gain parameter comprises: determining a target attenuation factor according to a correlation between the first channel signal and the second channel signal and the downmix signal respectively; and adjusting the initial reverberation gain parameter according to the target attenuation factor to obtain Target reverberation gain parameter.
  • determining the target attenuation factor may calculate the target attenuation factor according to the correlation between the channel signal and the lower signal, or The predetermined attenuation factor is directly determined as the target attenuation factor after considering the correlation between the channel signal and the downmix signal.
  • the initial reverberation gain parameter of the channel signal can be flexibly adjusted according to the magnitude of the correlation between the channel signal and the downmix signal by the attenuation factor.
  • the correlation between the first channel signal and the second channel signal and the downmix signal is large (it may also be considered that the first channel signal is similar to the second channel signal), a value may be determined. a smaller target attenuation factor, and when the correlation between the first channel signal and the second channel signal and the downmix signal is small (this may also be considered as a large difference between the first channel signal and the second channel signal) ), you can determine a larger target attenuation factor.
  • the correlation between the plurality of channel signals and the downmix signal may refer to a difference between the energy of the plurality of channel signals and the energy of the downmix signal, or the amplitude of the plurality of channel signals.
  • the energy difference between the energy of the plurality of channel signals and the downmix signal may specifically be a difference between the energy of the plurality of channel signals and the energy of the downmix signal, and similarly, the amplitudes and downmixes of the plurality of channel signals are The difference in amplitude of the signal may specifically be the difference between the amplitude of the plurality of channel signals and the amplitude of the downmix signal.
  • the correlation between the plurality of channel signals and the downmix signal may be a phase or a period of the plurality of channel signals and a phase or period difference of the downmix signal.
  • the correlation between the first channel signal or the second channel signal and the downmix signal may be determined according to the difference between the energy of the first channel signal or the second channel signal and the energy of the downmix signal, and further Then determine the target attenuation factor.
  • the correlation between the first channel signal and the second channel signal and the downmix signal can be conveniently measured by the energy of the channel signal and the downmix signal, that is, by comparing the first channel signal or the second channel.
  • the difference in energy between the signal and the downmix signal makes it easier to determine the target attenuation factor.
  • the first channel signal and the second channel signal each include multiple frequency points
  • the target is determined according to the correlation between the first channel signal and the second channel signal and the downmix signal respectively.
  • the attenuation factor includes: determining a difference value between the first channel signal and the second channel signal and the energy of the downmix signal at the plurality of frequency points respectively; determining the target attenuation factor according to the difference value.
  • the difference value of the energy of the first channel signal and the downmix signal at the plurality of frequency points may refer to a difference value of energy of the plurality of same frequency points of the first channel signal and the downmix signal, respectively.
  • the first channel signal includes three frequency points (the first frequency point, the second frequency point, and the third frequency point), then the difference between the energy of the first channel signal and the downmix signal at the three frequency points.
  • the value specifically refers to the difference value of the first channel signal and the downmix signal at the first frequency point, the difference value between the first channel signal and the downmix signal at the second frequency point, and the first channel signal and the downmix signal are at The difference value of the third frequency point.
  • the difference value of the energy of the second channel signal and the downmix signal at the plurality of frequency points may refer to a difference value of the energy of the plurality of same frequency points of the second channel signal and the downmix signal, respectively.
  • the difference value of the energy of the first channel signal and the downmix signal at the plurality of frequency points may refer to an absolute value of a difference between the energy of the first channel signal and the downmix signal at the plurality of frequency points respectively.
  • the difference value of the energy of the second channel signal and the downmix signal at the plurality of frequency points may refer to the absolute difference of the energy of the second channel signal and the downmix signal respectively at the plurality of frequency points. The sum of the values.
  • the energy values of the first channel signal, the second channel signal, and the downmix signal may be normalized values.
  • the difference in energy of the mixed signal determines the attenuation factor without having to compare the difference in energy between the first channel signal and the second channel signal and the downmix signal over the entire frequency band.
  • determining a difference value of the energy of the first channel signal and the second channel signal and the energy of the downmix signal at the plurality of frequency points respectively includes: determining the first channel signal a first difference value between the energy and the energy of the downmix signal, the first difference value being used to indicate a sum of absolute values of the difference values of the energy of the first channel signal and the downmix signal respectively at the plurality of frequency points; determining the second sound a second difference between the energy of the track signal and the energy of the downmix signal, the second difference value being used to indicate the sum of the absolute values of the differences of the energy of the first channel signal and the downmix signal respectively at the plurality of frequency points; The first difference value and the second difference value determine a target attenuation factor.
  • Determining the target attenuation factor according to the first difference value and the second difference value may include determining the target attenuation factor according to a ratio of the first difference value and the second difference value.
  • the first difference value and the second difference value may be calculated according to the following formula.
  • diff_l_h is the first difference value
  • diff_r_h is the second difference value
  • the frequency band of the left channel signal and the right channel signal includes a high frequency part and a low frequency part
  • M1 is a starting frequency point of the high frequency part
  • M2 is a high frequency part Part of the end frequency
  • mag_l[k] is the energy or amplitude value of the left channel signal at a certain frequency between M1 and M2
  • mag_r[k] is the index of the right channel signal between M1 and M2.
  • mag_dmx[k] is the energy or amplitude value of the frequency of the index of the downmix signal between M1 and M2, where mag_dmx[k] can be calculated by the downmix signal itself
  • the result can also be calculated from the energy or amplitude values of the left and right channel signals.
  • the ratio of the first difference value to the second difference value may be directly determined as the target attenuation factor.
  • the first difference value is a
  • the second difference value is b
  • the target attenuation factor and the attenuation factor of the previous frame may be further smoothed, and the smoothed target attenuation factor is used to further The initial reverberation gain parameter of the channel signal is adjusted.
  • the method of FIG. 3 before determining the target attenuation factor according to the difference value, the method of FIG. 3 further includes: determining that the difference value is greater than a preset threshold.
  • the difference value herein is greater than the preset threshold, which may mean that the difference between the energy of the first channel signal and the second channel signal and the energy of the downmix signal is greater than the same preset threshold, or may refer to the first sound.
  • the difference between the energy of the channel signal and the downmix signal is greater than a preset first threshold, and the difference between the energy of the second channel signal and the downmix signal is greater than a preset second threshold.
  • the target attenuation factor is determined only when the difference between the energy of the first channel signal and the second channel signal and the downmix signal is relatively large, and the initial reverberation gain parameter is adjusted according to the target attenuation factor, and the difference is In the case of a small value, the initial reverberation gain parameter may not be adjusted, thereby improving coding efficiency.
  • the difference value of the energy of the first channel signal and the downmix signal is greater than the M of the first channel signal energy (M is between 0.5-1) times
  • M is between 0.5-1) times
  • the energy of the first channel and the downmix signal can be considered.
  • the difference value is greater than a preset threshold.
  • the preset threshold is M times the energy of the first channel signal.
  • the difference between the energy of the first channel and the downmix signal may be considered to be greater than a preset threshold.
  • the initial reverberation gain parameter of the plurality of channel signals may be directly determined as the target reverberation gain parameter of the plurality of channel signals.
  • the energy of the downmix signal is determined based on the energy of the first channel signal and the second channel signal.
  • the energy of the downmix signal can be calculated by the energy of the first channel signal and the second channel signal, without being calculated by the downmix signal itself, which simplifies a certain calculation process.
  • the energy of the downmix signal can also be calculated directly according to the downmix signal itself.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one subband of the plurality of channel signals, and any one of the plurality of channel signals The band corresponds to only one attenuation factor.
  • the first channel signal and the second channel signal include sub-bands with an index of 0-9, and the first channel signal and the second channel signal each include 10 reverberation gain parameters, one for each sub-band.
  • Reverberation gain parameter the target attenuation factor contains 5 attenuation factors, each attenuation factor corresponds to two sub-bands, or the target attenuation factor contains 10 attenuation factors, and each attenuation factor corresponds to one sub-band.
  • the reverberation gain parameter can be more flexibly adjusted according to the target attenuation factor. For example, a reverberation gain parameter corresponding to a sub-band with an index of 0-4 of a plurality of channel signals needs to be adjusted slightly, and a reverberation gain parameter corresponding to an index of the channel signal of 5-9 needs to be larger. Adjustment, then, a smaller attenuation factor can be set for the reverberation gain parameter corresponding to the subband with index 0-4, and a larger attenuation factor is set for the reverberation gain parameter corresponding to the subband with index 5-9.
  • the first channel signal and the second channel signal include a first frequency band and a second frequency band, and the first frequency band
  • the attenuation factor corresponding to the subband in the second frequency band is less than or equal to the attenuation factor corresponding to the subband in the second frequency band, wherein the frequency of the first frequency band is smaller than the frequency of the second frequency band.
  • the frequency band in which the first channel signal and the second channel signal are located includes a low frequency portion and a high frequency portion
  • the target attenuation factor includes a plurality of attenuation factors, wherein the low frequency portion corresponds to at least one attenuation factor, and the high frequency portion corresponds to at least one The attenuation factor, the attenuation factor corresponding to the low frequency portion is smaller than the attenuation factor corresponding to the high frequency portion.
  • the reverberation gain parameters corresponding to the high frequency and low frequency sub-bands can be adjusted to different degrees, and reverberation processing can be performed. Get better processing results.
  • FIG. 4 is a schematic flowchart of a method for encoding a multi-channel signal according to an embodiment of the present application.
  • the channel signal includes a left channel signal and a right channel signal
  • the process of encoding the left channel signal and the right channel signal specifically includes:
  • the spatial parameters described above include initial reverberation gain parameters for the left and right channel signals, as well as other spatial parameters.
  • the left and right channel signals can be divided into a high frequency portion and a low frequency portion, and the difference between the left and right channel signals and the energy of the downmix signal in the high frequency portion can be determined as the energy of the left and right channel signals and the energy of the downmix signal. The difference.
  • the encoding end may determine the target attenuation factor according to the difference between the energy of the left and right channel signals and the energy of the downmix signal, and adjust the reverberation gain parameters of the left and right channel signals according to the target attenuation factor.
  • FIG. 5 is a schematic flowchart of a method for decoding a multi-channel signal according to an embodiment of the present application.
  • the channel signal includes a left channel signal and a right channel signal
  • FIG. 5 can decode the code stream generated by the encoding method in FIG. 4.
  • the decoding process of FIG. 5 specifically includes:
  • the decoded code stream acquires spatial parameters of the left channel signal and the right channel signal.
  • the spatial parameter includes a reverberation gain parameter adjusted by the encoding end, that is, the encoding end encodes the adjusted reverberation gain parameter, so that the decoding end obtains the encoding end after decoding the code stream. Adjusted reverberation gain parameters.
  • Step 520 and step 530 have no sequential relationship and can be performed simultaneously.
  • the reverberation gain parameter is a reverberation gain parameter adjusted by the encoding end.
  • step 560 Perform the upmix processing according to the spatial parameters and the downmix signal processed in step 540 to obtain a left channel signal and a right channel signal.
  • the reverberation gain parameter according to the reverberation processing of the left channel signal and the right channel signal has been adjusted according to the correlation between the left and right channel signals and the downmix signal, so that Corresponding reverberation processing is performed according to the difference between the left and right channel signals, and the quality of the channel signal obtained after the reverberation processing is improved.
  • the encoding method of FIG. 3 is that the encoding end determines whether it is necessary to adjust the initial reverberation gain parameter of the channel signal, and if necessary, adjusts the initial reverberation gain parameter of the channel signal at the encoding end, and adjusts the adjusted
  • the reverberation gain parameter is encoded so that the decoding end directly performs the reverberation processing according to the reverberation gain parameter obtained by the decoding.
  • the encoding end can also determine whether the initial reverberation gain parameter of the channel signal needs to be adjusted. If adjustment is needed, the corresponding indication information is sent to the encoding end, and after receiving the indication information, the decoding end is completed by the decoding end. The initial reverberation gain parameter of the channel signal is adjusted.
  • FIG. 6 is a schematic flowchart of a method for encoding a multi-channel signal according to an embodiment of the present application.
  • the method of Figure 6 includes:
  • the downmix signal can be obtained by performing downmix processing on the first channel signal and the second channel signal, and spatial parameters are obtained by performing spatial parameter analysis on the first channel signal and the second channel signal, wherein the spatial parameter The initial reverberation gain parameter of the first channel signal and the second channel signal is included.
  • determining the downmix signal and determining the initial reverberation gain parameter may be performed simultaneously or sequentially.
  • first channel signal and the second channel signal may correspond to the same spatial parameter.
  • first channel signal and the second channel signal also correspond to the same initial reverberation gain parameter. That is, the spatial parameter of the first channel signal is the same as the spatial parameter of the second channel signal, and the initial reverberation gain parameter of the first channel signal is the same as the initial reverberation gain parameter of the second channel signal.
  • first channel signal and the second channel signal each comprise 10 sub-bands, each of which corresponds to one reverberation gain parameter
  • index values of the first channel signal and the second channel signal are the same.
  • the reverberation gain parameters corresponding to the subbands can be the same.
  • the channel signal of the initial reverberation gain parameter needs to be adjusted in the track signal and the second channel signal.
  • the correlation between the first channel signal or the second channel signal and the downmix signal may be determined according to the difference between the energy of the first channel signal or the second channel signal and the energy of the downmix signal, or It is determined according to the difference between the amplitude of the first channel signal or the second channel signal and the amplitude of the downmix signal.
  • the correlation between the first channel signal and the downmix signal may be considered to be large, and when the first channel signal is When the difference in energy or amplitude from the energy or amplitude of the downmix signal is large, it can be considered that the correlation between the first channel signal and the downmix signal is small.
  • the difference between the energy of the first channel signal or the second channel signal and the downmix signal may specifically be the difference between the energy of the first channel signal or the second channel signal and the energy of the downmix signal.
  • the difference between the amplitudes of the first channel signal and the second channel signal and the amplitude of the downmix signal may specifically refer to a difference between the amplitude of the first channel signal or the second channel signal and the amplitude of the downmix signal.
  • the correlation between the first channel signal or the second channel signal and the downmix signal may also refer to a phase of the first channel signal or the second channel signal, a phase and a phase difference of the downmix signal, and a period difference. Wait.
  • the first channel signal, the second channel signal, and the downmix signal may be channel signals obtained after normalization.
  • the identifier information may indicate that the first channel signal or the second channel signal is a channel signal that needs to adjust an initial reverberation gain parameter, and may also indicate that the first channel signal and the second channel signal are adjusted for need.
  • the channel signal of the gain parameter may be, or it may be indicated that neither the first channel signal nor the second channel signal need to adjust the reverberation gain parameter.
  • the identification information may indicate a channel signal of the plurality of channel signals that requires adjustment of the initial reverberation gain parameter by the value of the identification bit.
  • the identifier of the identification information occupies two bits. When the value of the identifier is 00, it indicates that the initial reverberation gain parameters of the first channel signal and the second channel signal do not need to be adjusted; when the identifier bit is taken A value of 01 indicates that only the initial reverberation gain parameter of the first channel signal needs to be adjusted; when the value of the flag bit is 10, it indicates that only the initial reverberation gain parameter of the second channel signal needs to be adjusted; when the value of the identifier bit is used; It is indicated at 11 o'clock that the initial reverberation gain parameters of the first channel signal and the second channel signal need to be adjusted.
  • determining the identification information of the first channel signal and the second channel signal according to the correlation between the first channel signal and the second channel signal and the downmix signal, respectively including: according to the first channel The correlation between the energy of the signal and the second channel signal and the energy of the downmix signal respectively determines the identification information of the first channel signal and the second channel signal.
  • the correlation between the first channel signal and the second channel signal and the downmix signal can be conveniently measured, thereby making it easier to determine the sound that needs to adjust the initial reverberation gain parameter. Signal.
  • the energy or amplitude of the downmix signal can be calculated from the energy of the first channel signal and the second channel signal, thereby simplifying certain computational processes.
  • the energy of the downmix signal can be calculated directly from the downmix signal itself.
  • the channel signal by determining the relationship between the magnitude of the difference between the energy of the channel signal and the downmix signal and the preset threshold, the channel signal can be obtained when the energy difference between the channel signal and the downmix signal is large. It is determined that the channel signal of the reverberation gain parameter needs to be adjusted, so that the decoding end can first adjust the initial reverberation gain parameter of the channel signal, and then perform reverberation processing on the channel signal, which can improve the reverberation process. The quality of the channel signal.
  • determining, according to the correlation between the first channel signal and the second channel signal and the energy of the downmix signal, the identification information of the first channel signal and the second channel signal including: Determining a first difference value and a second difference value, the first difference value is a sum of absolute values of differences between energy of the first channel signal and the downmix signal respectively at the plurality of frequency points, and the second difference value is the second sound value The sum of the absolute values of the differences of the energy of the track signal and the downmix signal respectively at the plurality of frequency points; and the identification information of the first channel signal and the second channel signal are determined according to the first difference value and the second difference value.
  • the first channel signal, the second channel signal, and the downmix signal can be conveniently determined.
  • the difference in energy determines the channel signal that needs to adjust the initial reverberation gain parameter without having to compare the difference in energy between the first channel signal and the second channel signal and the downmix signal over the entire frequency band.
  • determining the identification information of the first channel signal and the second channel signal according to the first difference value and the second difference value including: determining the maximum difference value of the first difference value and the second difference value as the target a difference value; determining the identifier information according to the target difference value, where the identifier information is specifically used to indicate a target channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value needs to adjust an initial reverberation gain The channel signal of the parameter.
  • the sum of the absolute values of the differences between the energy of the first channel signal and the downmix signal at the plurality of frequency points is greater than the absolute value of the difference between the energy of the second channel signal and the downmix signal at the plurality of frequency points.
  • the first channel signal and the first channel signal are determined according to a sum of absolute values of the difference values of the energy of the first channel signal or the second channel signal and the downmix signal respectively at the plurality of frequency points.
  • the identification information of the two-channel signal includes: generating the first identification information if the sum of the absolute values of the differences between the energy of the first channel signal and the downmix signal respectively at the plurality of frequency points is greater than a preset threshold, The first identification information is used to indicate that the initial reverberation gain parameter of the first channel signal needs to be adjusted; and the sum of the absolute values of the differences between the energy of the second channel signal and the downmix signal respectively at the plurality of frequency points is greater than a preset In the case of the threshold, the second identification information is generated, and the second identification information is used to indicate that the initial reverberation gain parameter of the second channel signal needs to be adjusted.
  • the channel signal By determining the relationship between the magnitude of the difference between the energy of the channel signal and the downmix signal and the preset threshold, the channel signal can be determined to be adjusted if the energy difference between the channel signal and the downmix signal is large.
  • the channel signal of the reverberation gain parameter enables the decoding end to first adjust the initial reverberation gain parameter of the channel signal, and then perform reverberation processing on the channel signal, thereby improving the channel signal after the reverberation processing. quality.
  • the identification information of the first channel signal and the second channel signal may be either one identification information or two identification information.
  • the identification information of the first channel signal and the second channel signal may be an identification information indicating The initial reverberation gain parameters of the first channel signal and the second channel signal need to be adjusted; or, the identification information of the first channel signal and the second channel signal are two identification information, which are the first identification information and the first
  • the second identification information is used to indicate that the initial reverberation gain parameter of the first channel signal needs to be adjusted
  • the second identification information is used to indicate that the initial reverberation gain parameter of the second channel signal needs to be adjusted.
  • the method of FIG. 6 further includes: determining a target attenuation factor according to the first difference value and the second difference value, where The target attenuation factor is used to adjust the initial reverberation gain parameter of the target channel signal; the target attenuation factor is quantized, and the quantized target attenuation factor is written into the code stream.
  • the initial reverberation gain parameter of the channel signal can be flexibly adjusted according to the magnitude of the correlation between the channel signal and the downmix signal by the attenuation factor.
  • first difference value and second difference value can be calculated with reference to the formula (1) and the formula (2) in the above.
  • the target attenuation factor may be determined according to a ratio of the first difference value and the second difference value when the target attenuation factor is determined according to the first difference value and the second difference value.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one subband of the target channel signal, and any one of the subbands corresponds to only one attenuation factor.
  • a multi-channel signal contains multiple sub-bands, and adjacent sub-bands may correspond to one attenuation factor.
  • the reverberation gain parameter can be more flexibly adjusted according to the target attenuation factor.
  • the target channel signal includes a first frequency band and a second frequency band
  • the attenuation factor corresponding to the sub-band in the first frequency band is less than or equal to an attenuation factor corresponding to the sub-band in the second frequency band, where the first The frequency of the frequency band is less than the frequency of the second frequency band.
  • the reverberation gain parameters corresponding to the high frequency and low frequency sub-bands can be adjusted to different degrees, and reverberation processing can be performed. Get better processing results.
  • the frequency band in which the target channel signal is located includes a low frequency portion and a high frequency portion
  • the target attenuation factor includes a plurality of attenuation factors, wherein the low frequency portion corresponds to at least one attenuation factor, the high frequency portion corresponds to at least one attenuation factor, and the low frequency portion corresponds to The attenuation factor is less than the attenuation factor corresponding to the high frequency portion.
  • the energy of the downmix signal is determined based on the energy of the first channel signal and the second channel signal.
  • the energy of the downmix signal can be calculated by the energy of the first channel signal and the second channel signal, without being calculated by the downmix signal itself, which simplifies a certain calculation process.
  • the encoding method of the embodiment of the present application is described in detail with reference to FIG. 6.
  • the decoding method of the embodiment of the present application is described below with reference to FIG. 7. It should be understood that the decoding method in FIG. 7 is the same as the encoding method in FIG. Correspondingly, for the sake of brevity, the repeated description is omitted as appropriate below.
  • FIG. 7 is a schematic flowchart of a method for decoding a multi-channel signal according to an embodiment of the present application.
  • the method of FIG. 7 may be performed by a decoding device or a decoder, and the method of FIG. 7 specifically includes:
  • 720. Determine, according to the code stream, a downmix signal of the first channel signal and the second channel signal of the multichannel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, and a first channel.
  • the identification information of the signal and the second channel signal wherein the identification information is used to indicate a channel signal of the first channel signal and the second channel signal that needs to adjust an initial reverberation gain parameter.
  • the channel signal that needs to adjust the initial reverberation gain parameter can be determined by the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before the channel signal is subjected to the reverberation processing, thereby improving the reverberation processing.
  • the quality of the channel signal afterwards.
  • the initial reverberation gain parameter of the target channel signal is adjusted, including: determining a target attenuation factor; and adjusting an initial reverberation gain parameter of the target channel signal according to the target attenuation factor to obtain a target.
  • the target reverberation gain parameter of the channel signal is adjusted, including: determining a target attenuation factor; and adjusting an initial reverberation gain parameter of the target channel signal according to the target attenuation factor to obtain a target.
  • the target reverberation gain parameter of the channel signal is adjusted, including: determining a target attenuation factor; and adjusting an initial reverberation gain parameter of the target channel signal according to the target attenuation factor to obtain a target.
  • the initial reverberation gain parameter of the channel signal can be flexibly adjusted according to the magnitude of the correlation between the channel signal and the downmix signal by the attenuation factor.
  • the decoding end determines the attenuation factor
  • the preset attenuation factor can be determined as the target attenuation factor.
  • the decoding end directly adjusts the initial reverberation gain parameter of the target channel signal according to a preset attenuation factor.
  • the process of determining the target attenuation factor can be simplified, thereby improving the efficiency of decoding.
  • the decoding end may obtain a target attenuation factor from a code stream of a plurality of channel signals, that is, obtain a target attenuation factor by decoding a code stream of the plurality of channel signals, in which case decoding
  • the target has determined the target attenuation factor, and the target attenuation factor is encoded to obtain the code stream to be transmitted to the decoding end, so that the decoding end does not need to calculate the target attenuation factor, but directly decodes the coded stream to obtain the target attenuation factor.
  • the process of determining the target attenuation factor can be simplified by directly obtaining the target attenuation factor in the code stream, and the decoding efficiency can be improved.
  • determining the target attenuation factor specifically includes: acquiring an inter-channel level difference between the first channel signal and the second channel signal from the code stream; and determining a target attenuation factor according to the inter-channel level difference.
  • the target attenuation factor is determined based on the inter-channel level difference and the downmix signal.
  • the target attenuation factor can be more flexibly and accurately determined according to the level difference between the channels, the downmix signal, etc., and the initial reverberation parameter of the channel signal can be more accurately adjusted according to the attenuation factor.
  • the difference between the first channel signal and the second channel signal can be considered to be large, and the correlation is small. At this time, a larger attenuation factor can be determined. As the target attenuation factor.
  • the periodicity and harmonicity of the downmix signal can be utilized to determine the target attenuation factor.
  • the periodicity or harmonicity of the downmix signal is good, it can be considered that the difference between the first channel signal and the second channel signal is small, and the correlation is large, and a small value can be determined at this time.
  • the attenuation factor is used as the target attenuation factor.
  • the target attenuation factor includes a plurality of attenuation factors, wherein each of the plurality of attenuation factors respectively corresponds to at least one subband of the target channel signal, and any one of the subbands
  • the band corresponds to only one attenuation factor.
  • the first channel signal and the second channel signal comprise a plurality of sub-bands, and the adjacent plurality of sub-bands may correspond to an attenuation factor.
  • the reverberation gain parameter can be more flexibly adjusted according to the target attenuation factor.
  • the target channel signal includes a first frequency band and a second frequency band
  • the attenuation factor corresponding to the sub-band in the first frequency band is less than or equal to an attenuation factor corresponding to the sub-band in the second frequency band, where the first The frequency of the frequency band is less than the frequency of the second frequency band.
  • the reverberation gain parameters corresponding to the high frequency and low frequency sub-bands can be adjusted to different degrees, and reverberation processing can be performed. Get better processing results.
  • the frequency band in which the target channel signal is located includes a low frequency portion and a high frequency portion
  • the target attenuation factor includes a plurality of attenuation factors, wherein the low frequency portion corresponds to at least one attenuation factor, the high frequency portion corresponds to at least one attenuation factor, and the low frequency portion corresponds to attenuation.
  • the factor is less than the attenuation factor corresponding to the high frequency portion.
  • the encoding and decoding method of the embodiment of the present application is described in detail with reference to FIG. 3 to FIG. 7 .
  • the encoder and the decoder of the embodiment of the present application are described below with reference to FIG. 8 to FIG. 13 . It should be understood that FIG. 8 to FIG. 13
  • the encoder and decoder in the implementation can perform the steps performed by the encoder and the decoder in the codec method of the embodiment of the present application. For the sake of brevity, the repeated description is appropriately omitted below.
  • FIG. 8 is a schematic block diagram of an encoder according to an embodiment of the present application.
  • the encoder 800 of Figure 8 includes:
  • the processing unit 810 is configured to determine a downmix signal of the first channel signal and the second channel signal in the multichannel signal, and initial reverberation gain parameters of the first channel signal and the second channel signal ;
  • the processing unit 810 is further configured to determine the first sound according to a correlation between the first channel signal, the second channel signal, and the downmix signal, and the initial reverberation gain parameter. a target reverberation gain parameter of the track signal and the second channel signal;
  • the encoding unit 820 is configured to quantize the first channel signal and the second channel signal according to the downmix signal and the target reverberation gain parameter, and quantize the first channel signal And the second channel signal is written to the code stream.
  • the encoder 800 described above may correspond to the encoding method of the multi-channel signal of FIG. 3, and the encoder 800 may perform the encoding method of the multi-channel signal of FIG.
  • the correlation between the channel signal and the downmix signal is considered, so that the reverberation processing of the channel signal according to the target reverberation gain parameter can be obtained.
  • the processing unit 810 is specifically configured to: determine a target attenuation factor according to a correlation between the first channel signal and the second channel signal and the downmix signal, respectively;
  • the initial reverberation gain parameter is adjusted according to the target attenuation factor to obtain the target reverberation gain parameter.
  • the first channel signal and the second channel signal each include multiple frequency points
  • the processing unit 810 is specifically configured to: determine the first channel signal and the Determining a difference value between the energy of the second channel signal and the energy of the downmix signal at the plurality of frequency points; determining the target attenuation factor according to the difference value.
  • the processing unit 810 is specifically configured to: determine a first difference value between an energy of the first channel signal and an energy of the downmix signal, where the first difference value is used to Determining a sum of absolute values of differences in energy of the first channel signal and the downmix signal respectively at a plurality of frequency points; determining energy of the second channel signal and energy of the downmix signal a second difference value, the second difference value being used to indicate a sum of absolute values of differences between energy of the first channel signal and the downmix signal respectively at a plurality of frequency points; according to the first difference The ratio of the value to the second difference value determines the target attenuation factor.
  • the processing unit 810 is further configured to: determine that the difference value is greater than a preset threshold.
  • the energy of the downmix signal is determined according to the energy of the first channel signal and the second channel signal.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one subband of the plurality of channel signals, and any one The subband only corresponds to one attenuation factor.
  • FIG. 9 is a schematic block diagram of an encoder according to an embodiment of the present application.
  • the encoder 900 of Figure 9 includes:
  • the processing unit 910 is configured to determine a downmix signal of the first channel signal and the second channel signal in the multichannel signal, and initial reverberation gain parameters of the first channel signal and the second channel signal ;
  • the processing unit 910 is further configured to determine the first channel signal and the second channel according to a correlation between the first channel signal and the second channel signal and the downmix signal, respectively. Identification information of the signal, the identification information being used to indicate a channel signal of the first channel signal and the second channel signal that needs to be adjusted with an initial reverberation gain parameter;
  • the encoding unit 920 is configured to quantize the first channel signal and the second channel signal according to the downmix signal, the initial reverberation gain parameter, and the identification information, and quantize the The first channel signal and the second channel signal are written to the code stream.
  • the channel signal that needs to adjust the initial reverberation gain parameter can be determined according to the correlation between the channel signal and the downmix signal, so that the decoding end can first adjust the initial reverberation gain parameter of some channel signals.
  • the reverberation processing of these channel signals is performed to improve the quality of the channel signal after the reverberation processing.
  • the above encoder 900 may correspond to the encoding method of the multi-channel signal of FIG. 6, and the encoder 900 may perform the encoding method of the multi-channel signal in FIG.
  • the processing unit 910 is specifically configured to determine, according to the correlation between the energy of the first channel signal and the second channel signal and the energy of the downmix signal, respectively. Identification information of the first channel signal and the second channel signal.
  • the processing unit 910 is specifically configured to: determine a first difference value and a second difference value, where the first difference value is that the first channel signal and the downmix signal are respectively a sum of absolute values of differences in energy of the plurality of frequency points, the second difference value being a sum of absolute values of differences in energy of the second channel signal and the downmix signal respectively at a plurality of frequency points And determining identification information of the first channel signal and the second channel signal according to the first difference value and the second difference value.
  • the processing unit 910 is specifically configured to: determine a maximum difference value of the first difference value and the second difference value as a target difference value; and determine the identifier information according to the target difference value.
  • the identifier information is specifically used to indicate a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal that needs to adjust an initial reverberation gain parameter.
  • the processing unit 910 is further configured to: determine, according to the first difference value and the second difference value, a target attenuation factor, where the target attenuation factor is used to target the target sound The initial reverberation gain parameter of the channel signal is adjusted; the target attenuation factor is quantized, and the quantized target attenuation factor is written to the code stream.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one subband of the target channel signal, and any one of the subbands The band corresponds to only one attenuation factor.
  • the energy of the downmix signal is determined according to the energy of the first channel signal and the second channel signal.
  • FIG. 10 is a schematic block diagram of a decoder of an embodiment of the present application.
  • the decoder 1000 of FIG. 10 includes:
  • An obtaining unit 1010 configured to acquire a code stream
  • the processing unit 1020 is configured to determine, according to the code stream, a downmix signal of the first channel signal and the second channel signal of the multichannel signal, the first channel signal, and the second channel signal An initial reverberation gain parameter and identification information of the first channel signal and the second channel signal, wherein the identification information is used to indicate the first channel signal and the second channel signal Need to adjust the channel signal of the initial reverberation gain parameter;
  • the processing unit 1020 is further configured to determine, according to the identifier information, a channel signal that needs to adjust an initial reverberation gain parameter in the first channel signal and the second channel signal as a target channel signal;
  • the processing unit 1020 is further configured to adjust an initial reverberation gain parameter of the target channel signal.
  • the channel signal that needs to adjust the initial reverberation gain parameter can be determined by the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before the channel signal is subjected to the reverberation processing, thereby improving the reverberation processing.
  • the quality of the channel signal afterwards.
  • the above decoder 1000 may correspond to the decoding method of the multi-channel signal of FIG. 7, and the decoder 1000 may perform the decoding method of the multi-channel signal in FIG.
  • the processing unit 1020 is specifically configured to: determine a target attenuation factor; adjust an initial reverberation gain parameter of the target channel signal according to the target attenuation factor, to obtain the target sound The target reverberation gain parameter of the channel signal.
  • the processing unit 1020 is specifically configured to: determine a preset attenuation factor as the target attenuation factor.
  • the processing unit 1020 is specifically configured to: acquire the target attenuation factor according to the code stream.
  • the processing unit 1020 is specifically configured to: acquire an inter-channel level difference between the first channel signal and the second channel signal from the code stream;
  • the inter-channel level difference determines the target attenuation factor, or the target attenuation factor is determined based on the inter-channel level difference and the downmix signal.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one subband of the target channel signal, and any one of the subbands The band corresponds to only one attenuation factor.
  • FIG. 11 is a schematic block diagram of an encoder of an embodiment of the present application.
  • the encoder 1100 of Figure 11 includes:
  • a memory 1110 configured to store a program
  • a processor 1120 configured to execute a program, when the program is executed, the processor 1120 is configured to determine a downmix signal of the first channel signal and the second channel signal in the multichannel signal, and the An initial reverberation gain parameter of the first channel signal and the second channel signal; a correlation between the first channel signal and the second channel signal and the downmix signal, respectively, and the initial a reverberation gain parameter determining a target reverberation gain parameter of the first channel signal and the second channel signal; and the first channel according to the downmix signal and the target reverberation gain parameter
  • the signal and the second channel signal are quantized, and the quantized first channel signal and second channel signal are written to the code stream.
  • the above encoder 1100 may correspond to the encoding method of the multi-channel signal of FIG. 3, and the encoder 1100 may perform the encoding method of the multi-channel signal in FIG.
  • the correlation between the channel signal and the downmix signal is considered, so that it is possible to obtain a reverberation process for the channel signal according to the target reverberation gain parameter.
  • the processor 1120 is specifically configured to: determine a target attenuation factor according to a correlation between the first channel signal and the second channel signal and the downmix signal respectively; The initial reverberation gain parameter is adjusted according to the target attenuation factor to obtain the target reverberation gain parameter.
  • the first channel signal and the second channel signal each include multiple frequency points
  • the processor 1120 is specifically configured to: determine the first channel signal and the Determining a difference value between the energy of the second channel signal and the energy of the downmix signal at the plurality of frequency points; determining the target attenuation factor according to the difference value.
  • the processor 1120 is specifically configured to: determine a first difference value between an energy of the first channel signal and an energy of the downmix signal, where the first difference value is used to Determining a sum of absolute values of differences in energy of the first channel signal and the downmix signal respectively at a plurality of frequency points; determining energy of the second channel signal and energy of the downmix signal a second difference value, the second difference value being used to indicate a sum of absolute values of differences between energy of the first channel signal and the downmix signal respectively at a plurality of frequency points; according to the first difference The ratio of the value to the second difference value determines the target attenuation factor.
  • the processor 1120 before determining the target attenuation factor according to the difference value, is further configured to: determine that the difference value is greater than a preset threshold.
  • the energy of the downmix signal is determined according to the energy of the first channel signal and the second channel signal.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one subband of the plurality of channel signals, and any one The subband only corresponds to one attenuation factor.
  • FIG. 12 is a schematic block diagram of an encoder of an embodiment of the present application.
  • the encoder 1200 of Figure 12 includes:
  • a memory 1210 configured to store a program
  • a processor 1220 configured to execute a program, when the program is executed, the processor 1220 is configured to determine a downmix signal of the first channel signal and the second channel signal in the multichannel signal, and the An initial reverberation gain parameter of the first channel signal and the second channel signal; determining the first according to a correlation between the first channel signal and the second channel signal and the downmix signal Identification information of a first channel signal and the second channel signal, the identification information being used to indicate a channel signal of the first channel signal and the second channel signal that needs to adjust an initial reverberation gain parameter And quantizing the first channel signal and the second channel signal according to the downmix signal, the initial reverberation gain parameter, and the identification information, and quantizing the first channel signal And the second channel signal is written to the code stream.
  • the channel signal that needs to adjust the initial reverberation gain parameter can be determined according to the correlation between the channel signal and the downmix signal, so that the decoding end can first adjust the initial reverberation gain parameter of some channel signals.
  • the reverberation processing of these channel signals is performed to improve the quality of the channel signal after the reverberation processing.
  • the above encoder 1200 may correspond to the encoding method of the multi-channel signal of FIG. 6, and the encoder 1200 may perform the encoding method of the multi-channel signal in FIG.
  • the processor 1220 is specifically configured to: determine, according to the correlation between the energy of the first channel signal and the second channel signal and the energy of the downmix signal, respectively Identification information of the first channel signal and the second channel signal.
  • the processor 1220 is specifically configured to: determine a first difference value and a second difference value, where the first difference value is that the first channel signal and the downmix signal are respectively a sum of absolute values of differences in energy of the plurality of frequency points, the second difference value being a sum of absolute values of differences in energy of the second channel signal and the downmix signal respectively at a plurality of frequency points And determining identification information of the first channel signal and the second channel signal according to the first difference value and the second difference value.
  • the processor 1220 is specifically configured to: determine a maximum difference value of the first difference value and the second difference value as a target difference value; and determine the identifier information according to the target difference value.
  • the identifier information is specifically used to indicate a channel signal corresponding to the target difference value, and the channel signal corresponding to the target difference value is a channel signal that needs to adjust an initial reverberation gain parameter.
  • the processor 1220 is further configured to: determine, according to the first difference value and the second difference value, a target attenuation factor, where the target attenuation factor is used to target the target sound The initial reverberation gain parameter of the channel signal is adjusted; the target attenuation factor is quantized, and the quantized target attenuation factor is written to the code stream.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one subband of the target channel signal, and any one of the subbands The band corresponds to only one attenuation factor.
  • the energy of the downmix signal is determined according to the energy of the first channel signal and the second channel signal.
  • FIG. 13 is a schematic block diagram of a decoder of an embodiment of the present application.
  • the decoder 1300 of Figure 13 includes:
  • a memory 1310 configured to store a program
  • a processor 1320 configured to execute a program, when the program is executed, the processor 1320 is configured to acquire a code stream, and determine, according to the code stream, a first channel signal and a second channel in the multi-channel signal a downmix signal of the signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, and identification information of the first channel signal and the second channel signal, wherein The identification information is used to indicate a channel signal of the first channel signal and the second channel signal that needs to adjust an initial reverberation gain parameter; determining the first channel signal and the The channel signal of the second channel signal that needs to adjust the initial reverberation gain parameter is the target channel signal; and the initial reverberation gain parameter of the plurality of channel signals is adjusted.
  • the channel signal that needs to adjust the initial reverberation gain parameter can be determined by the identification information, and the initial reverberation gain parameter of the channel signal is adjusted before the channel signal is subjected to the reverberation processing, thereby improving the reverberation processing.
  • the quality of the channel signal afterwards.
  • the above decoder 1300 may correspond to the decoding method of the multi-channel signal of FIG. 7, and the decoder 1300 may perform the decoding method of the multi-channel signal in FIG.
  • the processor 1320 is specifically configured to: determine a target attenuation factor; adjust an initial reverberation gain parameter of the target channel signal according to the target attenuation factor, to obtain the target sound The target reverberation gain parameter of the channel signal.
  • the processor 1320 is specifically configured to: determine a preset attenuation factor as the target attenuation factor.
  • the processor 1320 is specifically configured to: acquire the target attenuation factor according to the code stream.
  • the processor 1320 is specifically configured to: obtain an inter-channel level difference between the first channel signal and the second channel signal from the code stream;
  • the inter-channel level difference determines the target attenuation factor, or the target attenuation factor is determined based on the inter-channel level difference and the downmix signal.
  • the target attenuation factor includes a plurality of attenuation factors, each of the plurality of attenuation factors respectively corresponding to at least one subband of the target channel signal, and any one of the subbands The band corresponds to only one attenuation factor.
  • the disclosed systems, devices, and methods may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the functions may be stored in a computer readable storage medium if implemented in the form of a software functional unit and sold or used as a standalone product.
  • the technical solution of the present application which is essential or contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product, which is stored in a storage medium, including
  • the instructions are used to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present application.
  • the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

一种多声道信号的编解码方法和编解码器,该编码方法包括:确定多声道信号中的第一声道信号和第二声道信号的下混信号以及第一声道信号和第二声道信号的初始混响增益参数;根据第一声道信号、第二声道信号分别与下混信号的相关性,以及初始混响增益参数,确定第一声道信号和第二声道信号的目标混响增益参数;根据下混信号和目标混响增益参数,对第一声道信号和第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。该编解码方法和编解码器能够提高混响处理后得到的声道信号的质量。

Description

多声道信号的编解码方法和编解码器
本申请要求于2017年03月31日提交中国专利局、申请号为201710205821.2、申请名称为“多声道信号的编解码方法和编解码器”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及音频编码领域,并且更具体地,涉及一种多声道信号的编解码方法和编解码器。
背景技术
随着生活质量的提高,人们对高质量音频的需求不断增加。相对于单声道音频,立体声音频具有各声源的方位感和分布感,能够提高声音的清晰度、可懂度及临场感,因而备受人们青睐。
立体声处理技术主要有和差(Mid/Sid,MS)编码、强度立体声(Intensity Stereo,IS)编码以及参数立体声(Parametric Stereo,PS)编码等。
现有技术在采用PS编码对声道信号进行编码时,编码端会对多个声道信号进行空间参数分析,得到多个声道信号的混响增益参数以及其它空间参数,并对多个声道信号的混响增益参数以及其它空间参数进行编码,以便解码端在解码时能够根据声道信号的混响增益参数对解码得到的多个声道信号进行混响处理,以增加听觉效果。但是,在某些情况下,例如,当多声道信号之间的相关性较低时,根据多个声道信号对应的混响增益参数对解码得到的多个声道信号进行混响处理时,反而会导致更差的听觉效果。
发明内容
本申请提供一种多声道信号的编解码方法和编解码器,以提高声道信号的质量。
第一方面,提供了一种多声道信号的编码方法,该方法包括:确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,以及所述初始混响增益参数,确定所述第一声道信号和所述第二声道信号的目标混响增益参数;根据所述下混信号和所述目标混响增益参数,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
本申请中,在确定声道信号的目标混响增益参数时,考虑到了声道信号与下混信号的相关性,这样能够在根据目标混响增益参数对声道信号进行混响处理时取得更好的处理效果,从而提升混响处理后的声道信号的质量。
上述第一声道信号或者第二声道信号与下混信号的相关性可以根据第一声道信号或者第二声道信号的能量与下混信号的能量的差异来确定,也可以根据第一声道信号或者第 二声道信号的幅度与下混信号的幅度的差异来确定。
结合第一方面,在第一方面的某些实现方式中,所述第一声道信号、所述第二声道信号以及所述下混信号是经过归一化处理之后得到的声道信号。
结合第一方面,在第一方面的某些实现方式中,所述根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,以及所述初始混响增益参数,确定所述第一声道信号和所述第二声道信号的目标混响增益参数,包括:根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定目标衰减因子;根据所述目标衰减因子对所述初始混响增益参数进行调整,得到所述目标混响增益参数。
通过衰减因子能够实现根据声道信号与下混信号的相关性的大小来灵活调整声道信号的初始混响增益参数。
通过声道信号的能量能够较为方便地衡量第一声道信号、第二声道信号与下混信号的相关性,也就是说通过比较声道信号与下混信号的能量的差异,能够方便地确定目标衰减因子。具体地,当第一声道信号或者第二声道信号的能量与下混信号的相差较大(大于给定阈值)时可以认为第一声道信号和第二声道信号与下混信号的相关性较弱,这时可以确定一个较大的目标衰减因子,而当第一声道信号或者第二声道信号的能量与下混信号的相差较小(小于给定阈值)时可以认为第一声道信号和第二声道信号与下混信号的相关性较弱,这时可以确定一个较小的目标衰减因子。
上述根据第一声道信号、第二声道信号分别与下混信号的相关性,确定目标衰减因子既可以根据声道信号与下信号的相关性来计算目标衰减因子,也可以是考虑到声道信号与下混信号的相关性之后直接将预设的衰减因子确定为目标衰减因子。
结合第一方面,在第一方面的某些实现方式中,所述第一声道信号和所述第二声道信号均包含多个频点,所述根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定目标衰减因子,包括:确定所述第一声道信号和所述第二声道信号分别与所述下混信号在所述多个频点的能量的差异值;根据所述差异值确定所述目标衰减因子。
通过比较第一声道信号、第二声道信号分别与下混信号在多个频点的能量的差异值,能够较为方便地确定第一声道信号、第二声道信号的能量分别与下混信号的能量的差异,进而确定衰减因子,而不必比较第一声道信号、第二声道信号分别与下混信号在全部频带上的能量的差异。
结合第一方面,在第一方面的某些实现方式中,所述确定所述第一声道信号和所述第二声道信号分别与所述下混信号在所述多个频点的能量的差异值,包括:确定所述第一声道信号的能量与所述下混信号的能量的第一差异值,所述第一差异值用于指示所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;确定所述第二声道信号的能量与所述下混信号的能量的第二差异值,所述第二差异值用于指示所述第二声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;所述根据所述差异值确定所述目标衰减因子,包括:根据所述第一差异值和所述第二差异值的比值,确定所述目标衰减因子。
可替换地,还可以直接根据所述第一差异值和所述第二差异值,确定所述目标衰减因子。
结合第一方面,在第一方面的某些实现方式中,在根据所述差异值确定所述目标衰减 因子之前,所述方法还包括:确定所述差异值大于预设阈值。
只有在第一声道信号、第二声道信号分别与下混信号的能量的差异值比较大时才会确定目标衰减因子,并根据目标衰减因子对初始混响增益参数进行调整,而在差值较小的情况下,可以不对初始混响增益参数进行调整,从而提高了编码效率。
而当多个声道信号与下混信号的能量的差异值小于预设阈值时,可以直接将多个声道信号的初始混响增益参数确定为该多个声道信号的目标混响增益参数。
结合第一方面,在第一方面的某些实现方式中,所述下混信号的能量是根据根据所述第一声道信号和第二声道信号的能量确定的。
通过第一声道信号和第二声道信号的能量能够计算下混信号的能量,而不用再通过下混信号本身计算,能够简化一定的计算过程。
结合第一方面,在第一方面的某些实现方式中,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述多个声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
当目标衰减因子包括多个衰减因子时,能够根据目标衰减因子对混响增益参数进行更灵活的调整。
结合第一方面,在第一方面的某些实现方式中,所述第一声道信号和第二声道信号所在的频段包含第一频段和第二频段,所述第一频段中的子带对应的衰减因子小于或者等于第二频段中的子带对应的衰减因子,其中,所述第一频段的频率小于所述第二频段的频率。
通过为高频和低频子带对应的混响增益参数设置不同的大小的衰减因子,能够对低频子带和高频子带对应的混响增益参数进行不同程度的调整,能够在进行混响处理时取得更好的处理效果。
第二方面,提供了一种多声道信号的解码方法,该方法包括:确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定所述第一声道信号和所述第二声道信号的标识信息,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;根据所述下混信号、所述初始混响增益参数以及所述标识信息,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
上述第一声道信号或者第二声道信号与下混信号的相关性可以根据第一声道信号或者第二声道信号的能量与下混信号的能量的差异来确定,也可以根据第一声道信号或者第二声道信号的幅度与下混信号的幅度的差异来确定。
在本申请中,能够根据声道信号与下混信号的相关性来确定需要调整初始混响增益参数的声道信号,使得解码端能够先对某些声道信号的初始混响增益参数进行调整后再对这些声道信号进行混响处理,能够提升混响处理后的声道信号的质量。
结合第二方面,在第二方面的某些实现方式中,所述根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定所述第一声道信号和所述第二声道信号的标识信息,包括:根据所述第一声道信号、所述第二声道信号的能量分别与所述下混信号的能量的相关性,确定所述第一声道信号和所述第二声道信号的标识信息。
通过声道信号以及下混信号的能量能够较为方便地衡量第一声道信号、第二声道信号 分别与下混信号的相关性,从而能够较为方便地确定需要调整初始混响增益参数的声道信号。
结合第二方面,在第二方面的某些实现方式中,所述第一声道信号、所述第二声道信号分别与所述下混信号的能量的相关性,确定所述第一声道信号和所述第二声道信号的标识信息,包括:确定第一差异值和第二差异值,所述第一差异值为所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和,所述第二差异值为所述第二声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;根据所述第一差异值和所述第二差异值确定所述第一声道信号和所述第二声道信号的标识信息。
应理解,上述第一声道信号、第二声道信号以及下混信号的能量值可以是经过归一化处理后的数值。
通过比较第一声道信号、第二声道信号分别与下混信号在多个频点的能量的差值,能够较为方便地确定第一声道信号、第二声道信号分别与下混信号的能量的差异,进而确定需要调整初始混响增益参数的声道信号,而不必比较第一声道信号、第二声道信号分别与下混信号在全部频带上的能量的差异。
结合第二方面,在第二方面的某些实现方式中,根据所述第一差异值和所述第二差异值确定所述第一声道信号和所述第二声道信号的标识信息,包括:将所述第一差异值和所述第二差异值中的最大差异值确定为目标差异值;根据目标差异值确定所述标识信息,所述标识信息具体用于指示所述目标差异值对应的声道信号,所述目标差异值对应的声道信号为需要调整初始混响增益参数的声道信号。
结合第二方面,在第二方面的某些实现方式中,所述方法还包括:根据所述第一差异值和所述第二差异值确定目标衰减因子,所述目标衰减因子用于对所述目标声道信号的初始混响增益参数进行调整;对所述目标衰减因子进行量化,并将量化后的目标衰减因子写入所述码流。
通过衰减因子能够实现根据声道信号与下混信号的相关性的大小来灵活调整声道信号的初始混响增益参数。
结合第二方面,在第二方面的某些实现方式中,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
当目标衰减因子包括多个衰减因子时,能够根据目标衰减因子对混响增益参数进行更灵活的调整。
结合第二方面,在第二方面的某些实现方式中,所述目标声道信号包含第一频段和第二频段,所述第一频段中的子带对应的衰减因子小于或者等于第二频段中的子带对应的衰减因子,其中,所述第一频段的频率小于所述第二频段的频率。
通过为高频和低频子带对应的混响增益参数设置不同的大小的衰减因子,能够对低频子带和高频子带对应的混响增益参数进行不同程度的调整,能够在进行混响处理时取得更好的处理效果。
结合第二方面,在第二方面的某些实现方式中,所述下混信号的能量是根据所述第一声道信号和所述第二声道信号的能量确定的。
通过多个声道信号的能量来估计或者推导下混信号的能量,能够节省一定的计算量。
第三方面,提供了一种多声道信号的解码方法,该方法包括:获取码流;根据所述码流确定多声道信号中的第一声道信号和第二声道信号的下混信号、所述第一声道信号和所述第二声道信号的初始混响增益参数以及所述第一声道信号和所述第二声道信号的标识信息,其中,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;根据所述标识信息确定所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号为目标声道信号;对所述目标声道信号的初始混响增益参数进行调整。
本申请中,能够通过标识信息确定需要调整初始混响增益参数的声道信号,并在对该声道信号进行混响处理之前调整该声道信号的初始混响增益参数,能够提升混响处理后的声道信号的质量。
结合第三方面,在第三方面的某些实现方式中,所述对所述目标声道信号的初始混响增益参数进行调整,包括:确定目标衰减因子;根据所述目标衰减因子对所述目标声道信号的初始混响增益参数进行调整,得到所述目标声道信号的目标混响增益参数。
通过衰减因子能够实现根据声道信号与下混信号的相关性的大小来灵活调整声道信号的初始混响增益参数。
结合第三方面,在第三方面的某些实现方式中,所述确定目标衰减因子,包括:将预设的衰减因子确定为所述目标衰减因子。
通过预设设置衰减因子,能够简化确定目标衰减因子的过程,进而提高解码的效率。
结合第三方面,在第三方面的某些实现方式中,所述确定目标衰减因子,包括:根据所述码流获取所述目标衰减因子。
当码流中包含目标衰减因子时,可以直接通过在码流中获取目标衰减因子也能够简化确定目标衰减因子的过程,能够提高解码的效率。
结合第三方面,在第三方面的某些实现方式中,所述确定目标衰减因子,包括:从所述码流获取所述第一声道信号和所述第二声道信号的声道间电平差;根据所述声道间电平差确定所述目标衰减因子,或者,根据所述声道间电平差以及所述下混信号,确定所述目标衰减因子。
通过根据声道间电平差、下混信号等能够更为灵活更准确地确定目标衰减因子,进而能够根据该衰减因子对声道信号的初始混响参数进行更准确的调整。
结合第三方面,在第三方面的某些实现方式中,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
当目标衰减因子包括多个衰减因子时,能够根据目标衰减因子对混响增益参数进行更灵活的调整。
结合第三方面,在第三方面的某些实现方式中,所述目标声道信号包含第一频段和第二频段,所述第一频段中的子带对应的衰减因子小于或者等于所述第二频段中的子带对应的衰减因子,其中,所述第一频段的频率小于所述第二频段的频率。
通过为高频和低频子带对应的混响增益参数设置不同的大小的衰减因子,能够对低频子带和高频子带对应的混响增益参数进行不同程度的调整,能够在进行混响处理时取得更好的处理效果。
第四方面,提供了一种编码器,所述编码器包括用于执行所述第一方面或其各种实现方式中的方法的模块或单元。
第五方面,提供了一种编码器,所述编码器包括用于执行所述第二方面或其各种实现方式中的方法的模块或单元。
第六方面,提供了一种解码器,所述编码器包括用于执行所述第三方面或其各种实现方式中的方法的模块或单元。
第七方面,提供了一种编码器,包括存储器和处理器,所述存储器用于存储程序,所述处理器用于执行程序,当所述程序被执行时,所述处理器执行第一方面或其各种实现方式中的方法。
第八方面,提供了一种编码器,包括存储器和处理器,所述存储器用于存储程序,所述处理器用于执行程序,当所述程序被执行时,所述处理器执行第二方面或其各种实现方式中的方法。
第九方面,提供了一种解码器,包括存储器和处理器,所述存储器用于存储程序,所述处理器用于执行程序,当所述程序被执行时,所述处理器执行第三方面或其各种实现方式中的方法。
第十方面,提供一种计算机可读介质,所述计算机可读介质存储用于设备执行的程序代码,所述程序代码包括用于执行第一方面或其各种实现方式中的方法的指令。
第十一方面,提供一种计算机可读介质,所述计算机可读介质存储用于设备执行的程序代码,所述程序代码包括用于执行第二方面或其各种实现方式中的方法的指令。
第十二方面,提供一种计算机可读介质,所述计算机可读介质存储用于设备执行的程序代码,所述程序代码包括用于执行第三方面或其各种实现方式中的方法的指令。
附图说明
图1是现有技术对左右声道信号进行编码的示意性流程图。
图2是现有技术对左右声道信号进行解码的示意性流程图。
图3是本申请实施例的多声道信号的编码方法的示意性流程图。
图4是本申请实施例的多声道信号的解码方法的示意性流程图。
图5是本申请实施例的多声道信号的编码方法的示意性流程图。
图6是本申请实施例的多声道信号的编码方法的示意性流程图。
图7是本申请实施例的多声道信号的解码方法的示意性流程图。
图8是本申请实施例的编码器的示意性框图。
图9是本申请实施例的编码器的示意性框图。
图10是本申请实施例的解码器的示意性框图。
图11是本申请实施例的编码器的示意性框图。
图12是本申请实施例的编码器的示意性框图。
图13是本申请实施例的解码器的示意性框图。
具体实施方式
下面将结合附图,对本申请中的技术方案进行描述。为了更好地理解本申请实施例的 多声道信号的编解码方法,下面先结合图1和图2对现有技术中对多声道信号进行编码和解码的方法进行简单的介绍。
图1示出了现有技术中对左右声道信号进行编码的过程。图1所示的编码过程具体包括:
110、对左声道信号(图中用L表示)和右声道信号(图中用R表示)进行空间参数分析以及下混处理。
具体地,步骤110具体包括:对左声道信号和右声道信号进行空间参数分析,获得左声道信号和右声道信号的空间参数;对左声道信号和右声道信号进行下混处理,得到下混信号(经过下混处理后得到的下混信号为单声道音频信号,通过下混处理将原来的两路声道音频信号变成了一路声道音频信号)。
空间参数(也可以称为空间感知参数)包含声道间相关性(Inter-channel Coherent,IC)、声道间电平差(Inter-channel Level Difference,ILD)、声道间时间差(Inter-channel Time Difference,ITD)以及声道间相位差(Inter-channel Phase Difference,IPD)等。
其中,IC描述了声道间的互相关或相干性,该参数决定了声场范围的感知,可以提高音频信号空间感和声响稳定性。ILD用于分辨立体声源的水平方向角度,描述了声道间的强度差别,该参数将影响整个频谱的频率成分。ITD和IPD为表示声源水平方位的空间参数,描述了声道间的时间和相位差别,该参数主要影响2kHz以下的频率成分。对于两路声道信号而言,ITD可以表示立体声的左右声道信号之间的时间延时,IPD可以表示立体声的左右声道信号在时间对齐后的波形相似性。ILD、ITD和IPD能够决定人耳对声源位置的感知,可以有效确定声场位置,对立体声信号的恢复具有重要作用。
120、对下混信号进行编码,得到比特流。
130、对空间参数进行编码,得到比特流。
140、将对下混信号及空间参数编码得到的比特流进行复用得到码流。
编码得到的码流可以存储或者传输给解码端设备。
图2示出了现有技术中对左右声道信号进行解码的过程。图2所示的解码过程具体包括:
210、对比特流解复用,分别得到下混信号编码得到的码流以及空间参数编码得到的码流。
220、解码码流,得到左声道信号和右声道信号的下混信号,以及左声道信号和右声道信号的空间参数。
上述空间参数包括左声道信号和右声道信号的IC。
230、根据前面帧的下混信号和空间参数,得到去相关信号。
根据解码的当前帧的下混信号和去相关信号,得到左右声道信号;
240、根据空间参数及左右声道信号,得到最终输出的左右声道信号(图2中分别用L’和R’表示)。
应理解,步骤240中的左声道信号和右声道信号(在图2中分别用L’和R’表示)是解码得到的,与编码端编码的左右声道信号相比可能会有一定的失真。
具体地,可以通过对下混信号进行滤波处理,然后利用声道间相关性参数对滤波后得到的下混信号进行修正得到去相关信号。
生成去相关信号的目的是为了在解码端增加最终生成的立体声信号的混响感,增加立体声信号的声场宽度,使得输出的音频信号的听觉更加圆润饱满。所谓混响感,实质上是原始音频信号通过不同的反射折射等延时后和原始音频信号叠加在一起进入人耳的一种效果。
现有技术在获取IC后,没有考虑到不同声道间信号的相关性对该IC进行自适应的调整,这样在根据原来获取的IC对声道信号进行混响处理时反而可能会导致较差的听觉效果。例如,当不同声道信号之间的相关性较低时,如果仍采用之前获取的IC对去相关信号进行修正,然后再利用去相关信号对不同声道信号进行相同的混响处理的话就会导致解码端最终输出的声道信号的质量较差。也就是说,由于不同声道信号之间的差异较大,如果仍采用之前较大的IC修正后的去相关信号对不同声道信号进行混响处理的话,不仅不会增加声道信号的混响效果,还可能会导致输出的声道信号失真。
因此,本申请实施例提出了一种多声道信号的编解码方法,该方法能够根据不同声道信号之间的相关性,对混响增益参数进行相应的调整,并利用调整后的混响增益参数对去相关信号进行修正,然后再用该去相关信号对不同的声道信号进行混响处理,这样,在对不同声道信号进行混响处理时就考虑到了不同声道信号之间的相关性,使得输出的声道信号的质量更好。
图3是本申请实施例的多声道信号的编码方法的示意性流程图。图3的方法可以由编码端设备或者编码器来执行,图3的方法包括:
310、确定多声道信号中的第一声道信号和第二声道信号的下混信号以及第一声道信号和第二声道信号的初始混响增益参数。
应理解,在本申请实施例中,对确定下混信号与确定初始混响增益参数的先后顺序不做限定,既可以是同时进行,也可以是依次进行。
上述初始混响增益参数可以是指对第一声道信号和第二声道信号进行空间参数分析后获得的混响增益参数。
具体地,可以通过对上述多个声道信号进行下混处理得到下混信号;通过对上述第一声道信号和第二声道信号进行空间参数分析来获取该第一声道信号和第二声道信号的空间参数,该空间参数中包含第一声道信号和第二声道信号的初始混响增益参数。
应理解,上述第一声道信号和第二声道信号可以对应相同的空间参数,相应地,第一声道信号和第二声道信号也可以对应相同的初始混响增益参数。也就是说,第一声道信号的空间参数与第二声道信号的空间参数可以相同,第一声道信号的初始混响增益参数与第二声道信号的初始混响增益参数也可以相同。
进一步地,假设第一声道信号和第二声道信号均包含10个子带,每个子带分别对应1个混响增益参数,那么,第一声道信号和第二声道信号的索引值相同的子带对应的混响增益参数可以是相同的。
另外,上述第一声道信号、第二声道信号以及下混信号可以是经过归一化处理之后得到的声道信号。
320、根据第一声道信号、第二声道信号分别与下混信号的相关性,以及初始混响增益参数,确定第一声道信号和第二声道信号的目标混响增益参数。
可选地,第一声道信号或者第二声道信号与下混信号的相关性可以根据第一声道信号 或者第二声道信号的能量与下混信号的能量的差异来确定,也可以根据第一声道信号或者第二声道信号的幅度与下混信号的幅度的差异来确定。
具体地,当第一声道信号的能量或者幅度与下混信号的能量或者幅度的差异较小时可以认为第一声道信号与下混信号的相关性较大,而当第一声道信号的能量或者幅度与下混信号的能量或者幅度的差异较大时可以认为第一声道信号与下混信号的相关性较小。
上述第一声道信号或者第二声道信号的能量与下混信号的能量差异具体可以是指第一声道信号或者第二声道信号的能量与下混信号的能量之间的差值,同样,上述第一声道信号和第二声道信号的幅度与下混信号的幅度的差异具体可以是指第一声道信号或者第二声道信号的幅度与下混信号的幅度的差值。
另外,上述第一声道信号或者第二声道信号与下混信号的相关性还可以是指第一声道信号或者第二声道信号的相位、周期与下混信号的相位、周期的差异等。
330、根据下混信号和目标混响增益参数,对第一声道信号和第二声道信号进行量化,
并将量化后的第一声道信号和第二声道信号写入码流。
应理解,当多声道信号的数目多于两路时,例如,当多声道信号包括第一声道信号、第二声道信号、第三声道信号以及第四声道信号时,可以对第一声道信号和第二声道信号采用图3的方法进行处理,对第三声道信号和第四声道信号也采用图3的方法进行处理。
本申请中,在确定声道信号的目标混响增益参数时,考虑到了声道信号与下混信号的相关性,这样能够在根据目标混响增益参数对声道信号进行混响处理时取得较好的处理效果,从而提升混响处理后的声道信号的质量。
可选地,作为一个实施例,根据第一声道信号、第二声道信号分别与下混信号的相关性,以及初始混响增益参数,确定第一声道信号和第二声道信号的目标混响增益参数,包括:根据第一声道信号、第二声道信号分别与下混信号的相关性,确定目标衰减因子;根据该目标衰减因子对上述初始混响增益参数进行调整,得到目标混响增益参数。
具体而言,上述根据第一声道信号、第二声道信号分别与下混信号的相关性,确定目标衰减因子既可以根据声道信号与下信号的相关性来计算目标衰减因子,也可以是考虑到声道信号与下混信号的相关性之后直接将预设的衰减因子确定为目标衰减因子。
通过衰减因子能够实现根据声道信号与下混信号的相关性的大小来灵活调整声道信号的初始混响增益参数。
例如,当上述第一声道信号和第二声道信号与下混信号的相关性较大时(此时也可以认为第一声道信号与第二声道信号比较相近),可以确定一个数值较小的目标衰减因子,而当上述第一声道信号和第二声道信号与下混信号的相关性较小时(此时也可以认为第一声道信号与第二声道信号相差较大),可以确定一个数值较大的目标衰减因子。
在一些实施例中,上述多个声道信号与下混信号的相关性可以是指上述多个声道信号的能量与下混信号的能量的差异,或者是指上述多个声道信号的幅度与下混信号的幅度的差异。上述多个声道信号的能量与下混信号的能量差异具体可以是多个声道信号的能量与下混信号的能量之间的差值,同样,上述多个声道信号的幅度与下混信号的幅度的差异具体可以是多个声道信号的幅度与下混信号的幅度的差值。另外,上述多个声道信号与下混信号的相关性还可以是多个声道信号的相位或者周期与下混信号的相位或者周期的差异等。
在一些实施例中,可以根据第一声道信号或者第二声道信号的能量与下混信号的能量的差异确定第一声道信号或者第二声道信号与下混信号的相关性,进而再确定目标衰减因子。
通过声道信号以及下混信号的能量能够较为方便地衡量第一声道信号、第二声道信号分别与下混信号的相关性,也就是说通过比较第一声道信号或者第二声道信号与下混信号的能量的差异,能够较为方便地确定目标衰减因子。
可选地,作为一个实施例,第一声道信号和第二声道信号均包含多个频点,根据第一声道信号、第二声道信号分别与下混信号的相关性,确定目标衰减因子,包括:确定第一声道信号和第二声道信号分别与下混信号在多个频点的能量的差异值;根据差异值确定目标衰减因子。
上述第一声道信号与下混信号在多个频点的能量的差异值可以是指第一声道信号和下混信号分别多个相同频点的能量的差异值。例如,第一声道信号包括三个频点(第一频点、第二频点和第三频点),那么,第一声道信号与下混信号在这三个频点的能量的差异值具体是指第一声道信号与下混信号在第一频点的差异值,第一声道信号与下混信号在第二频点的差异值,第一声道信号与下混信号在第三频点的差异值。
类似地,上述第二声道信号与下混信号在多个频点的能量的差异值可以是指第二声道信号和下混信号分别多个相同频点的能量的差异值。
可选地,上述第一声道信号与下混信号在多个频点的能量的差异值可以是指第一声道信号与下混信号分别在多个频点的能量的差值的绝对值的和,类似地,第二声道信号与下混信号在多个频点的能量的差异值可以是指第二声道信号与下混信号分别在多个频点的能量的差值的绝对值的和。
应理解,上述第一声道信号、第二声道信号以及下混信号的能量值可以是经过归一化处理后的数值。
通过比较第一声道信号、第二声道信号分别与下混信号在多个频点的能量的差异值,能够较为方便地确定第一声道信号、第二声道信号的能量分别与下混信号的能量的差异,进而确定衰减因子,而不必比较第一声道信号、第二声道信号分别与下混信号在全部频带上的能量的差异。
可选地,作为一个实施例,确定第一声道信号和所述第二声道信号的分别与下混信号的在多个频点的能量的差异值,包括:确定第一声道信号的能量与下混信号的能量的第一差异值,第一差异值用于指示第一声道信号与下混信号分别在多个频点的能量的差值的绝对值的和;确定第二声道信号的能量与下混信号的能量的第二差异值,第二差异值用于指示第一声道信号与下混信号分别在多个频点的能量的差值的绝对值的和;根据第一差异值和第二异差值确定目标衰减因子。
根据第一差异值和第二异差值确定目标衰减因子可以包括:根据第一差异值和第二差异值的比值确定目标衰减因子。
具体地,当上述第一声道信号为左声道信号,第二声道信号为右声道信号时,可以根据下列公式计算第一差异值和第二差异值。
Figure PCTCN2018077782-appb-000001
Figure PCTCN2018077782-appb-000002
其中,diff_l_h为第一差异值,diff_r_h为第二差异值,左声道信号和右声道信号的频段包括高频部分和低频部分,M1为高频部分的起始频点,M2为高频部分的结束频点,mag_l[k]为左声道信号在M1和M2之间的某个频点的能量或者幅度值,mag_r[k]为右声道信号在M1和M2之间的索引为k的频点的能量或者幅度值,mag_dmx[k]为下混信号在M1和M2之间的索引为k的频点的能量或幅度值,其中,mag_dmx[k]可以通过下混信号本身计算得到,也可以根据左右声道信号的能量或者幅度值计算得到的。
在根据第一差异值和第二差异值确定目标衰减因子时,可以直接将第一差异值与第二差异值的比值确定为目标衰减因子。例如,第一差异值为a,第二差异值为b,当a<b时,将a/b确定为目标衰减因子,当a>b时,将b/a确定为目标衰减因子。另外,在根据第一差异值和第二差异值确定了目标衰减因子后,可以再对目标衰减因子与之前帧的衰减因子进行一些平滑处理,利用平滑处理后的目标衰减因子再对上述多个声道信号的初始混响增益参数进行调整。
可选地,作为一个实施例,在根据上述差异值确定目标衰减因子之前,图3方法还包括:确定该差异值大于预设阈值。
应理解,这里的差异值大于预设阈值可以是指第一声道信号和第二声道信号分别与下混信号的能量的差异值均大于同一个预设阈值,也可以是指第一声道信号与下混信号的能量的差异大于预设的第一阈值,而第二声道信号与下混信号的能量的差异大于预设的第二阈值。
只有在第一声道信号、第二声道信号分别与下混信号的能量的差异值比较大时才会确定目标衰减因子,并根据目标衰减因子对初始混响增益参数进行调整,而在差值较小的情况下,可以不对初始混响增益参数进行调整,从而提高了编码效率。
例如,当第一声道信号与下混信号的能量的差异值大于第一声道信号能量的M(M位于0.5-1之间)倍时就可以认为第一声道与下混信号的能量的差异值大于预设阈值,此时,该预设阈值为第一声道信号能量的M倍。或者,当第一声道信号与下混信号的能量的差异值与第一声道信号的能量的比值大于M时也可以认为第一声道与下混信号的能量的差异值大于预设阈值。
而当多个声道信号与下混信号的能量的差异值小于预设阈值时,可以直接将多个声道信号的初始混响增益参数确定为该多个声道信号的目标混响增益参数。
可选地,作为一个实施例,下混信号的能量是根据所述第一声道信号和第二声道信号的能量确定的。
通过第一声道信号和第二声道信号的能量能够计算下混信号的能量,而不用再通过下混信号本身计算,能够简化一定的计算过程。
当然,在本申请实施例中,也可以直接根据下混信号本身来计算下混信号的能量。
可选地,作为一个实施例,上述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述多个声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
例如,上述第一声道信号和第二声道信号包含的子带的索引为0-9,第一声道信号和 第二声道信号均包含10个混响增益参数,每个子带对应一个混响增益参数,目标衰减因子包含5个衰减因子,每个衰减因子对应两个子带,或者目标衰减因子包含10个衰减因子,每个衰减因子对应一个子带。
另外,当目标衰减因子包括多个衰减因子时,能够根据目标衰减因子对混响增益参数进行更灵活的调整。例如,多个声道信号的索引为0-4的子带对应的混响增益参数需要进行较小的调整,而声道信号的索引为5-9对应的混响增益参数需要进行较大的调整,那么,可以为索引为0-4的子带对应的混响增益参数设置较小的衰减因子,而为索引为5-9的子带对应的混响增益参数设置较大的衰减因子。
可选地,作为一个实施例,上述第一声道信号和第二声道信号(第一声道信号和第二声道信号占用的频段相同)包含第一频段和第二频段,第一频段中的子带对应的衰减因子小于或者等于第二频段中的子带对应的衰减因子,其中,第一频段的频率小于第二频段的频率。
例如,上述第一声道信号和第二声道信号所在的频段包括低频部分和高频部分,目标衰减因子包括多个衰减因子,其中,低频部分对应至少一个衰减因子,高频部分对应至少一个衰减因子,低频部分对应的衰减因子小于高频部分对应的衰减因子。
通过为高频和低频子带对应的混响增益参数设置不同的大小的衰减因子,能够对低频子带和高频子带对应的混响增益参数进行不同程度的调整,能够在进行混响处理时取得更好的处理效果。
图4示出了本申请实施例的多声道信号的编码方法的示意性流程图。在图4中,声道信号包括左声道信号和右声道信号,对左声道信号和右声道信号进行编码的过程具体包括:
410、计算左声道信号和右声道信号的空间参数。
上述空间参数包含了左声道信号和右声道信号的初始混响增益参数以及其它空间参数。
420、对左声道信号(图中用L表示)和右声道信号(图中用R表示)进行下混处理,得到下混信号。
430、确定左声道信号和右声道信号的能量分别与下混信号的能量的差值;
具体地,可以将左右声道信号分为高频部分和低频部分,将左右声道信号与下混信号在高频部分的能量的差值确定为左右声道信号的能量与下混信号的能量的差值。
440、根据左右声道信号的能量分别与下混信号的能量的差值,对左声道信号和右声道信号的混响增益参数进行调整。
具体而言,编码端可以根据左右声道信号的能量分别与下混信号的能量的差值,确定目标衰减因子,根据目标衰减因子对左右声道信号的混响增益参数进行调整。
450、对下混信号、调整后的混响增益参数以及其它空间参数进行量化,得到码流。
图5示出了本申请实施例的多声道信号的解码方法的示意性流程图。在图5中,声道信号包括左声道信号和右声道信号,图5可以对图4中的编码方法编码生成的码流进行解码,图5的解码过程具体包括:
510、获取左声道信号和右声道信号的码流。
520、解码码流获取下混信号。
530、解码码流获取左声道信号和右声道信号的空间参数。
该空间参数中包括经过编码端调整后的混响增益参数,也就是说,编码端是对调整后的混响增益参数进行编码,这样,解码端在对码流进行解码后就得到了编码端调整后的混响增益参数。
步骤520与步骤530没有先后关系,可以同时进行。
540、对解码得到的空间参数进行后续处理(例如,平滑滤波)。
550、根据解码得到的下混信号和混响增益参数(该混响增益参数是经过编码端调整后的混响增益参数),得到去相关信号。
560、根据步骤540处理过的空间参数及下混信号,进行上混处理,得到左声道信号和右声道信号。
570、根据去相关信号分别对左声道信号和右声道信号进行混响处理。
在图5所示的方法中,对左声道信号和右声道信号进行混响处理时依据的混响增益参数已经根据左右声道信号与下混信号的相关性进行了调整,这样就能根据左右声道信号之间的差异来进行相应的混响处理,提高了混响处理后得到的声道信号的质量。
图3的编码方法是编码端确定是否需要对声道信号的初始混响增益参数进行调整,如果需要调整的话就在编码端对声道信号的初始混响增益参数进行调整,并对调整后的混响增益参数进行编码,使得解码端直接根据解码得到的混响增益参数执行混响处理即可。
事实上,编码端也可以只确定声道信号的初始混响增益参数是否需要调整,如果需要调整的话就向编码端发送相应的指示信息,解码端在接收到该指示信息后,由解码端完成对声道信号的初始混响增益参数进行调整。
图6是本申请实施例的多声道信号的编码方法的示意性流程图。图6的方法包括:
610、确定多声道信号中的第一声道信号和第二声道信号的下混信号以及第一声道信号和第二声道信号的初始混响增益参数。
具体地,可以通过第一声道信号和第二声道信号进行下混处理得到下混信号,通过对第一声道信号和第二声道信号进行空间参数分析来获取空间参数,其中空间参数中包含第一声道信号和第二声道信号的初始混响增益参数。
应理解,确定下混信号与确定初始混响增益参数既可以是同时进行,也可以是依次进行。
应理解,上述第一声道信号和第二声道信号可以对应相同的空间参数,具体地,第一声道信号和第二声道信号也对应相同的初始混响增益参数。也就是说,第一声道信号的空间参数与第二声道信号的空间参数相同,第一声道信号的初始混响增益参数与第二声道信号的初始混响增益参数相同。
进一步地,假设第一声道信号和第二声道信号均包含10个子带,每个子带分别对应1个混响增益参数,那么,第一声道信号和第二声道信号的索引值相同的子带对应的混响增益参数可以是相同的。
620、根据第一声道信号、第二声道信号分别与下混信号的相关性,确定第一声道信号和所述第二声道信号的标识信息,该标识信息用于指示第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号。
可选地,第一声道信号或者第二声道信号与下混信号的相关性可以根据第一声道信号 或者第二声道信号的能量与下混信号的能量的差异来确定,也可以根据第一声道信号或者第二声道信号的幅度与下混信号的幅度的差异来确定。
具体地,当第一声道信号的能量或者幅度与下混信号的能量或者幅度的差异较小时可以认为第一声道信号与下混信号的相关性较大,而当第一声道信号的能量或者幅度与下混信号的能量或者幅度的差异较大时可以认为第一声道信号与下混信号的相关性较小。
上述第一声道信号或者第二声道信号的能量与下混信号的能量差异具体可以是值第一声道信号或者第二声道信号的能量与下混信号的能量之间的差值,同样,上述第一声道信号和第二声道信号的幅度与下混信号的幅度的差异具体可以是指第一声道信号或者第二声道信号的幅度与下混信号的幅度的差值。
另外,上述第一声道信号或者第二声道信号与下混信号的相关性还可以是指第一声道信号或者第二声道信号的相位、周期与下混信号的相位、周期的差异等。
上述第一声道信号、第二声道信号以及下混信号可以是经过归一化处理之后得到的声道信号。
具体地,上述标识信息可以指示第一声道信号或者第二声道信号为需要调整初始混响增益参数的声道信号,也可以指示第一声道信号和第二声道信号为需要调整混响增益参数的声道信号,或者,也可以指示第一声道信号和第二声道信号均不需要调整混响增益参数。
在一些实施例中,标识信息可以通过标识位的取值来指示多个声道信号中需要调整初始混响增益参数的声道信号。例如,该标识信息的标识位占用两个比特,当标识位的取值为00时表示第一声道信号和第二声道信号的初始混响增益参数均不需要调整;当标识位的取值为01表示仅第一声道信号的初始混响增益参数需要调整;当标识位的取值为10时表示仅第二声道信号的初始混响增益参数需要调整;当标识位的取值为11时表示第一声道信号和第二声道信号的初始混响增益参数均需要调整。
在一些实施例中,根据第一声道信号、第二声道信号分别与下混信号的相关性,确定第一声道信号和第二声道信号的标识信息,包括:根据第一声道信号、第二声道信号的能量分别与下混信号的能量的相关性,确定第一声道信号和第二声道信号的标识信息。
通过声道信号以及下混信号的能量能够较为方便地衡量第一声道信号、第二声道信号分别与下混信号的相关性,从而能够较为方便地确定需要调整初始混响增益参数的声道信号。
在一些实施例中,下混信号的能量或者幅值可以根据第一声道信号和第二声道信号的能量来计算,从而简化一定的计算过程。或者,也可以直接根据下混信号本身来计算下混信号的能量。
630、根据下混信号、初始混响增益参数以及标识信息,对第一声道信号和第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
本申请中,通过判断声道信号与下混信号的能量的差值的大小与预设阈值的关系,能够在声道信号与下混信号的能量差异较大的情况下,将该声道信号确定为需要调整混响增益参数的声道信号,使得解码端能够先对该声道信号的初始混响增益参数进行调整后再对该声道信号进行混响处理,能够提升混响处理后的声道信号的质量。
可选地,作为一个实施例,根据第一声道信号、第二声道信号分别与下混信号的能量的相关性,确定第一声道信号和第二声道信号的标识信息,包括:确定第一差异值和第二 差异值,第一差异值为第一声道信号与下混信号分别在多个频点的能量的差值的绝对值的和,第二差异值为第二声道信号与下混信号分别在多个频点的能量的差值的绝对值的和;根据第一差异值和第二差异值确定第一声道信号和第二声道信号的标识信息。
通过比较第一声道信号、第二声道信号分别与下混信号在多个频点的能量的差值,能够较为方便地确定第一声道信号、第二声道信号分别与下混信号的能量的差异,进而确定需要调整初始混响增益参数的声道信号,而不必比较第一声道信号、第二声道信号分别与下混信号在全部频带上的能量的差异。
可选地,根据第一差异值和第二差异值确定第一声道信号和第二声道信号的标识信息,包括:将第一差异值和第二差异值中的最大差异值确定为目标差异值;根据目标差异值确定所述标识信息,所述标识信息具体用于指示所述目标差异值对应的目标声道信号,所述目标差异值对应的声道信号为需要调整初始混响增益参数的声道信号。
具体地,当第一声道信号与下混信号在多个频点的能量的差值的绝对值的和大于第二声道信号与下混信号在多个频点的能量的差值的绝对值的和时,可以将第一声道信号确定为需要调整初始混响增益参数的声道信号。
而当多第一声道信号和第二声道信号分别与下混信号在多个频点的能量的差值的绝对值的和均比较大(例如,均大于预设阈值)时,可以确定另一个标识信息,该标识信息指示第一声道信号和第二声道信号的初始混响增益参数均需要调整。
具体地,在一些实施例中,根据第一声道信号或者第二声道信号与下混信号分别在多个频点的能量的差值的绝对值的和,确定第一声道信号和第二声道信号的标识信息,包括:在第一声道信号与下混信号分别在多个频点的能量的差值的绝对值的和大于预设阈值的情况下,生成第一标识信息,第一标识信息用于指示第一声道信号的初始混响增益参数需要调整;在第二声道信号与下混信号分别在多个频点的能量的差值的绝对值的和大于预设阈值的情况下,生成第二标识信息,第二标识信息用于指示第二声道信号的初始混响增益参数需要调整。
通过判断声道信号与下混信号的能量的差异值的大小与预设阈值的关系,能够在声道信号与下混信号的能量差异较大的情况下,将该声道信号确定为需要调整混响增益参数的声道信号,使得解码端能够先对该声道信号的初始混响增益参数进行调整后再对该声道信号进行混响处理,能够提升混响处理后的声道信号的质量。
应理解,上述第一声道信号和第二声道信号的标识信息既可以是一个标识信息,也可以是两个标识信息。例如,当第一声道信号和第二声道信号的初始混响增益参数均需要调整时,第一声道信号和第二声道信号的标识信息可以是一个标识信息,该标识信息指示第一声道信号和第二声道信号的初始混响增益参数均需要调整;或者,第一声道信号和第二声道信号的标识信息为两个标识信息,分别为第一标识信息和第二标识信息,第一标识信息用于指示第一声道信号的初始混响增益参数需要调整,第二标识信息用于指示第二声道信号的初始混响增益参数需要调整。当某个声道信号没有对应的标识信息时说明该声道信号的初始混响增益参数不需要调整,也就是说,当上述标识信息只包含第一标识信息时,那么第一声道信号和第二声道信号中只有第一声道信号的初始混响增益参数需要调整。
可选地,在一些实施例中,当第一声道信号的初始混响增益参数需要调整时,图6的方法还包括:根据上述第一差异值和第二差异值确定目标衰减因子,该目标衰减因子用于 对目标声道信号的初始混响增益参数进行调整;对目标衰减因子进行量化,并将量化后的目标衰减因子写入码流。
通过衰减因子能够实现根据声道信号与下混信号的相关性的大小来灵活调整声道信号的初始混响增益参数。
应理解,上述第一差异值和第二差异值可以参照上文中的公式(1)和公式(2)进行计算。
在根据上述第一差异值和第二差异值确定目标衰减因子时可以根据第一差异值和第二差异值的比值来确定目标衰减因子。
在一些实施例中,上述目标衰减因子包括多个衰减因子,多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。例如,多声道信号包含多个子带,相邻的子带可以对应一个衰减因子。
当目标衰减因子包括多个衰减因子时,能够根据目标衰减因子对混响增益参数进行更灵活的调整。
在另一些实施例中,目标声道信号包含第一频段和第二频段,第一频段中的子带对应的衰减因子小于或者等于第二频段中的子带对应的衰减因子,其中,第一频段的频率小于第二频段的频率。
通过为高频和低频子带对应的混响增益参数设置不同的大小的衰减因子,能够对低频子带和高频子带对应的混响增益参数进行不同程度的调整,能够在进行混响处理时取得更好的处理效果。
例如,上述目标声道信号所在的频段包括低频部分和高频部分,目标衰减因子包括多个衰减因子,其中,低频部分对应至少一个衰减因子,高频部分对应至少一个衰减因子,低频部分对应的衰减因子小于高频部分对应的衰减因子。
在一些实施例中,下混信号的能量是根据所述第一声道信号和第二声道信号的能量确定的。
通过第一声道信号和第二声道信号的能量能够计算下混信号的能量,而不用再通过下混信号本身计算,能够简化一定的计算过程。
上文结合图6对本申请实施例的编码方法进行了详细的描述,下面结合图7对本申请实施例的解码方法进行描述,应理解,图7中的解码方法与图6中的编码方法是相对应的,为了简洁,下面适当省略重复的描述。
图7示出了本申请实施例的多声道信号的解码方法的示意性流程图。图7的方法可以由解码端设备或者解码器执行,图7的方法具体包括:
710、获取码流。
720、根据码流确定多声道信号中的第一声道信号和第二声道信号的下混信号、第一声道信号和第二声道信号的初始混响增益参数以及第一声道信号和第二声道信号的标识信息,其中,所述标识信息用于指示所述第一声道信号和第二声道信号中需要调整初始混响增益参数的声道信号。
730、根据标识信息确定第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号为目标声道信号。
740、对目标声道信号的初始混响增益参数进行调整。
本申请中,能够通过标识信息确定需要调整初始混响增益参数的声道信号,并在对该声道信号进行混响处理之前调整该声道信号的初始混响增益参数,能够提升混响处理后的声道信号的质量。
可选地,作为一个实施例,对目标声道信号的初始混响增益参数进行调整,包括:确定目标衰减因子;根据目标衰减因子对目标声道信号的初始混响增益参数进行调整,得到目标声道信号的目标混响增益参数。
通过衰减因子能够实现根据声道信号与下混信号的相关性的大小来灵活调整声道信号的初始混响增益参数。
解码端在确定衰减因子时,可以将预设的衰减因子确定为目标衰减因子。或者是解码端直接根据预设的衰减因子对目标声道信号的初始混响增益参数进行调整。
通过预先设置衰减因子,能够简化确定目标衰减因子的过程,进而提高解码的效率。
在一些实施例中,解码端可以从多个声道信号的码流中获取目标衰减因子,也就是说通过解码多个声道信号的码流来获取目标衰减因子,在这种情况下,解码端已经确定了目标衰减因子,并且将目标衰减因子编码得到码流传输到解码端,这样解码端不必再计算目标衰减因子,而是直接从码流中解码获得目标衰减因子。
当码流中包含目标衰减因子时,可以直接通过在码流中获取目标衰减因子也能够简化确定目标衰减因子的过程,能够提高解码的效率。
可选地,作为一个实施例,确定目标衰减因子具体包括:从码流获取第一声道信号和第二声道信号的声道间电平差;根据声道间电平差确定目标衰减因子,或者,根据声道间电平差和下混信号确定目标衰减因子。
通过根据声道间电平差、下混信号等能够更为灵活更准确地确定目标衰减因子,进而能够根据该衰减因子对声道信号的初始混响参数进行更准确的调整。
具体地,当声道间电平差较大时,可以认为第一声道信号与第二声道信号之间的差异较大,相关性较小,此时可以确定一个数值较大的衰减因子作为目标衰减因子。
另外,在根据下混信号确定目标衰减因子时,可以利用下混信号的周期性和谐波性来确定目标衰减因子。例如,当下混信号的周期性或者谐波性较好时,可以认为第一声道信号与第二声道信号之间的差异较小,相关性较大,此时可以确定一个数值较小的衰减因子作为目标衰减因子。
可选地,作为一个实施例,上述目标衰减因子包括多个衰减因子,其中,该多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。例如,第一声道信号和第二声道信号包含多个子带,相邻的多个子带可以对应一个衰减因子。
当目标衰减因子包括多个衰减因子时,能够根据目标衰减因子对混响增益参数进行更灵活的调整。
在另一些实施例中,目标声道信号包含第一频段和第二频段,第一频段中的子带对应的衰减因子小于或者等于第二频段中的子带对应的衰减因子,其中,第一频段的频率小于第二频段的频率。
通过为高频和低频子带对应的混响增益参数设置不同的大小的衰减因子,能够对低频子带和高频子带对应的混响增益参数进行不同程度的调整,能够在进行混响处理时取得更 好的处理效果。
例如,目标声道信号所在的频段包括低频部分和高频部分,目标衰减因子包括多个衰减因子,其中,低频部分对应至少一个衰减因子,高频部分对应至少一个衰减因子,低频部分对应的衰减因子小于高频部分对应的衰减因子。
上文结合图3至图7对本申请实施例的编解码方法进行了详细的描述,下面结合图8至图13对本申请实施例的编码器和解码器进行描述,应理解,图8至图13中的编码器和解码器能够实现本申请实施例的编解码方法中由编码器和解码器执行的步骤。为了简洁,下面适当省略重复的描述。
图8是本申请实施例的编码器的示意性框图。图8的编码器800包括:
处理单元810,用于确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;
所述处理单元810还用于根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,以及所述初始混响增益参数,确定所述第一声道信号和所述第二声道信号的目标混响增益参数;
编码单元820,用于根据所述下混信号和所述目标混响增益参数,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
上述编码器800可以对应于图3的多声道信号的编码方法,编码器800可以执行图3中的多声道信号的编码方法。
本申请中,在确定声道信号的目标混响增益参数时,考虑到了声道信号与下混信号的相关性,这样能够在根据目标混响增益参数对声道信号进行混响处理时取得较好的处理效果,从而提升混响处理后的声道信号的质量。
可选地,作为一个实施例,所述处理单元810具体用于:根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定目标衰减因子;根据所述目标衰减因子对所述初始混响增益参数进行调整,得到所述目标混响增益参数。
可选地,作为一个实施例,所述第一声道信号和所述第二声道信号均包含多个频点,所述处理单元810具体用于:确定所述第一声道信号和所述第二声道信号的能量分别与所述下混信号在所述多个频点的能量的差异值;根据所述差异值确定所述目标衰减因子。
可选地,作为一个实施例,所述处理单元810具体用于:确定所述第一声道信号的能量与所述下混信号的能量的第一差异值,所述第一差异值用于指示所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;确定所述第二声道信号的能量与所述下混信号的能量的第二差异值,所述第二差异值用于指示所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;根据所述第一差异值和所述第二差异值的比值,确定所述目标衰减因子。
可选地,作为一个实施例,在根据所述差异值确定所述目标衰减因子之前,所述处理单元810具体还用于:确定所述差异值大于预设阈值。
可选地,作为一个实施例,所述下混信号的能量是根据所述第一声道信号和第二声道信号的能量确定的。
可选地,作为一个实施例,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述多个声道信号的至少一个子带,并且任意一个子带仅对应 一个衰减因子。
图9是本申请实施例的编码器的示意性框图。图9的编码器900包括:
处理单元910,用于确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;
所述处理单元910还用于根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定所述第一声道信号和所述第二声道信号的标识信息,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;
编码单元920,用于根据所述下混信号、所述初始混响增益参数以及所述标识信息,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
在本申请中,能够根据声道信号与下混信号的相关性来确定需要调整初始混响增益参数的声道信号,使得解码端能够先对某些声道信号的初始混响增益参数进行调整后再对这些声道信号进行混响处理,能够提升混响处理后的声道信号的质量。
应理解,上述编码器900可以对应于图6的多声道信号的编码方法,编码器900可以执行图6中的多声道信号的编码方法。
可选地,作为一个实施例,所述处理单元910具体用于:根据所述第一声道信号、所述第二声道信号的能量分别与所述下混信号的能量的相关性,确定所述第一声道信号和所述第二声道信号的标识信息。
可选地,作为一个实施例,处理单元910具体用于:确定第一差异值和第二差异值,所述第一差异值为所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和,所述第二差异值为所述第二声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;根据所述第一差异值和所述第二差异值确定所述第一声道信号和所述第二声道信号的标识信息。
可选地,作为一个实施例,处理单元910具体用于:将所述第一差异值和所述第二差异值中的最大差异值确定为目标差异值;根据目标差异值确定所述标识信息,所述标识信息具体用于指示所述目标差异值对应的声道信号,所述目标差异值对应的声道信号为需要调整初始混响增益参数的声道信号。
可选地,作为一个实施例,所述处理单元910具体还用于:根据所述第一差异值和所述第二差异值确定目标衰减因子,所述目标衰减因子用于对所述目标声道信号的初始混响增益参数进行调整;对所述目标衰减因子进行量化,并将量化后的目标衰减因子写入所述码流。
可选地,作为一个实施例,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
可选地,作为一个实施例,所述下混信号的能量是根据所述第一声道信号和所述第二声道信号的能量确定的。
图10是本申请实施例的解码器的示意性框图。图10的解码器1000包括:
获取单元1010,用于获取码流;
处理单元1020,用于根据所述码流确定多声道信号中的第一声道信号和第二声道信 号的下混信号、所述第一声道信号和所述第二声道信号的初始混响增益参数以及所述第一声道信号和所述第二声道信号的标识信息,其中,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;
所述处理单元1020还用于根据所述标识信息确定所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号为目标声道信号;
所述处理单元1020还用于对所述目标声道信号的初始混响增益参数进行调整。
本申请中,能够通过标识信息确定需要调整初始混响增益参数的声道信号,并在对该声道信号进行混响处理之前调整该声道信号的初始混响增益参数,能够提升混响处理后的声道信号的质量。
应理解,上述解码器1000可以对应于图7的多声道信号的解码方法,解码器1000可以执行图7中的多声道信号的解码方法。
可选地,作为一个实施例,所述处理单元1020具体用于:确定目标衰减因子;根据所述目标衰减因子对所述目标声道信号的初始混响增益参数进行调整,得到所述目标声道信号的目标混响增益参数。
可选地,作为一个实施例,所述处理单元1020具体用于:将预设的衰减因子确定为所述目标衰减因子。
可选地,作为一个实施例,所述处理单元1020具体用于:根据所述码流获取所述目标衰减因子。
可选地,作为一个实施例,所述处理单元1020具体用于:从所述码流获取所述第一声道信号和所述第二声道信号的声道间电平差;根据所述声道间电平差确定所述目标衰减因子,或者,根据所述声道间电平差以及所述下混信号,确定所述目标衰减因子。
可选地,作为一个实施例,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
图11是本申请实施例的编码器的示意性框图。图11的编码器1100包括:
存储器1110,用于存储程序;
处理器1120,用于执行程序,当所述程序被执行时,所述处理器1120用于确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,以及所述初始混响增益参数,确定所述第一声道信号和所述第二声道信号的目标混响增益参数;根据所述下混信号和所述目标混响增益参数,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
上述编码器1100可以对应于图3的多声道信号的编码方法,编码器1100可以执行图3中的多声道信号的编码方法。
本申请中,在确定声道信号的目标混响增益参数时,考虑到了声道信号与下混信号的相关性,这样能够在根据目标混响增益参数对声道信号进行混响处理时取得更好的处理效果,从而提升混响处理后的声道信号的质量。
可选地,作为一个实施例,所述处理器1120具体用于:根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定目标衰减因子;根据所述目标衰减因 子对所述初始混响增益参数进行调整,得到所述目标混响增益参数。
可选地,作为一个实施例,所述第一声道信号和所述第二声道信号均包含多个频点,所述处理器1120具体用于:确定所述第一声道信号和所述第二声道信号的能量分别与所述下混信号在所述多个频点的能量的差异值;根据所述差异值确定所述目标衰减因子。
可选地,作为一个实施例,所述处理器1120具体用于:确定所述第一声道信号的能量与所述下混信号的能量的第一差异值,所述第一差异值用于指示所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;确定所述第二声道信号的能量与所述下混信号的能量的第二差异值,所述第二差异值用于指示所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;根据所述第一差异值和所述第二差异值的比值,确定所述目标衰减因子。
可选地,作为一个实施例,在根据所述差异值确定所述目标衰减因子之前,所述处理器1120具体还用于:确定所述差异值大于预设阈值。
可选地,作为一个实施例,所述下混信号的能量是根据所述第一声道信号和第二声道信号的能量确定的。
可选地,作为一个实施例,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述多个声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
图12是本申请实施例的编码器的示意性框图。图12的编码器1200包括:
存储器1210,用于存储程序;
处理器1220,用于执行程序,当所述程序被执行时,所述处理器1220用于确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定所述第一声道信号和所述第二声道信号的标识信息,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;根据所述下混信号、所述初始混响增益参数以及所述标识信息,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
在本申请中,能够根据声道信号与下混信号的相关性来确定需要调整初始混响增益参数的声道信号,使得解码端能够先对某些声道信号的初始混响增益参数进行调整后再对这些声道信号进行混响处理,能够提升混响处理后的声道信号的质量。
应理解,上述编码器1200可以对应于图6的多声道信号的编码方法,编码器1200可以执行图6中的多声道信号的编码方法。
可选地,作为一个实施例,所述处理器1220具体用于:根据所述第一声道信号、所述第二声道信号的能量分别与所述下混信号的能量的相关性,确定所述第一声道信号和所述第二声道信号的标识信息。
可选地,作为一个实施例,处理器1220具体用于:确定第一差异值和第二差异值,所述第一差异值为所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和,所述第二差异值为所述第二声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;根据所述第一差异值和所述第二差异值确定所述第一声道信号和所述第二声道信号的标识信息。
可选地,作为一个实施例,处理器1220具体用于:将所述第一差异值和所述第二差异值中的最大差异值确定为目标差异值;根据目标差异值确定所述标识信息,所述标识信息具体用于指示所述目标差异值对应的声道信号,所述目标差异值对应的声道信号为需要调整初始混响增益参数的声道信号。
可选地,作为一个实施例,所述处理器1220具体还用于:根据所述第一差异值和所述第二差异值确定目标衰减因子,所述目标衰减因子用于对所述目标声道信号的初始混响增益参数进行调整;对所述目标衰减因子进行量化,并将量化后的目标衰减因子写入所述码流。
可选地,作为一个实施例,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
可选地,作为一个实施例,所述下混信号的能量是根据所述第一声道信号和所述第二声道信号的能量确定的。
图13是本申请实施例的解码器的示意性框图。图13的解码器1300包括:
存储器1310,用于存储程序;
处理器1320,用于执行程序,当所述程序被执行时,所述处理器1320用于获取码流;根据所述码流确定多声道信号中的第一声道信号和第二声道信号的下混信号、所述第一声道信号和所述第二声道信号的初始混响增益参数以及所述第一声道信号和所述第二声道信号的标识信息,其中,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;根据所述标识信息确定所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号为目标声道信号;对所述多个声道信号的初始混响增益参数进行调整。
本申请中,能够通过标识信息确定需要调整初始混响增益参数的声道信号,并在对该声道信号进行混响处理之前调整该声道信号的初始混响增益参数,能够提升混响处理后的声道信号的质量。
应理解,上述解码器1300可以对应于图7的多声道信号的解码方法,解码器1300可以执行图7中的多声道信号的解码方法。
可选地,作为一个实施例,所述处理器1320具体用于:确定目标衰减因子;根据所述目标衰减因子对所述目标声道信号的初始混响增益参数进行调整,得到所述目标声道信号的目标混响增益参数。
可选地,作为一个实施例,所述处理器1320具体用于:将预设的衰减因子确定为所述目标衰减因子。
可选地,作为一个实施例,所述处理器1320具体用于:根据所述码流获取所述目标衰减因子。
可选地,作为一个实施例,所述处理器1320具体用于:从所述码流获取所述第一声道信号和所述第二声道信号的声道间电平差;根据所述声道间电平差确定所述目标衰减因子,或者,根据所述声道间电平差以及所述下混信号,确定所述目标衰减因子。
可选地,作为一个实施例,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应 一个衰减因子。
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。

Claims (40)

  1. 一种多声道信号的编码方法,其特征在于,包括:
    确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;
    根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,以及所述初始混响增益参数,确定所述第一声道信号和所述第二声道信号的目标混响增益参数;
    根据所述下混信号和所述目标混响增益参数,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
  2. 如权利要求1所述的方法,其特征在于,所述根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,以及所述初始混响增益参数,确定所述第一声道信号和所述第二声道信号的目标混响增益参数,包括:
    根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定目标衰减因子;
    根据所述目标衰减因子对所述初始混响增益参数进行调整,得到所述目标混响增益参数。
  3. 如权利要求2所述的方法,其特征在于,所述第一声道信号和所述第二声道信号均包含多个频点,所述根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定目标衰减因子,包括:
    确定所述第一声道信号和所述第二声道信号分别与所述下混信号在所述多个频点的能量的差异值;
    根据所述差异值确定所述目标衰减因子。
  4. 如权利要求3所述的方法,其特征在于,所述确定所述第一声道信号和所述第二声道信号分别与所述下混信号在所述多个频点的能量的差异值,包括:
    确定所述第一声道信号的能量与所述下混信号的能量的第一差异值,所述第一差异值用于指示所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;
    确定所述第二声道信号的能量与所述下混信号的能量的第二差异值,所述第二差异值用于指示所述第二声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;
    所述根据所述差异值确定所述目标衰减因子,包括:
    根据所述第一差异值和所述第二差异值的比值,确定所述目标衰减因子。
  5. 如权利要求3或4所述的方法,其特征在于,在根据所述差异值确定所述目标衰减因子之前,所述方法还包括:
    确定所述差异值大于预设阈值。
  6. 如权利要求3-5中任一项所述的方法,其特征在于,所述下混信号的能量是根据所述第一声道信号和第二声道信号的能量确定的。
  7. 如权利要求2-6中任一项所述的方法,其特征在于,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述多个声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
  8. 一种多声道信号的编码方法,其特征在于,包括:
    确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;
    根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定所述第一声道信号和所述第二声道信号的标识信息,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;
    根据所述下混信号、所述初始混响增益参数以及所述标识信息,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
  9. 如权利要求8所述的方法,其特征在于,所述根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定所述第一声道信号和所述第二声道信号的标识信息,包括:
    根据所述第一声道信号、所述第二声道信号的能量分别与所述下混信号的能量的相关性,确定所述第一声道信号和所述第二声道信号的标识信息。
  10. 如权利要求9所述的方法,其特征在于,所述根据所述第一声道信号、所述第二声道信号分别与所述下混信号的能量的相关性,确定所述第一声道信号和所述第二声道信号的标识信息,包括:
    确定第一差异值和第二差异值,所述第一差异值为所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和,所述第二差异值为所述第二声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;
    根据所述第一差异值和所述第二差异值确定所述第一声道信号和所述第二声道信号的标识信息。
  11. 如权利要求10所述的方法,其特征在于,根据所述第一差异值和所述第二差异值确定所述第一声道信号和所述第二声道信号的标识信息,包括:
    将所述第一差异值和所述第二差异值中的最大差异值确定为目标差异值;
    根据目标差异值确定所述标识信息,所述标识信息具体用于指示所述目标差异值对应的声道信号,所述目标差异值对应的声道信号为需要调整初始混响增益参数的声道信号。
  12. 如权利要求11所述的方法,其特征在于,所述方法还包括:
    根据所述第一差异值和所述第二差异值确定目标衰减因子,所述目标衰减因子用于对所述目标声道信号的初始混响增益参数进行调整;
    对所述目标衰减因子进行量化,并将量化后的目标衰减因子写入所述码流。
  13. 如权利要求12所述的方法,其特征在于,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
  14. 如权利要求9-13中任一项所述的方法,其特征在于,所述下混信号的能量是根据所述第一声道信号和所述第二声道信号的能量确定的。
  15. 一种多声道信号的解码方法,其特征在于,包括:
    获取码流;
    根据所述码流确定多声道信号中的第一声道信号和第二声道信号的下混信号、所述第一声道信号和所述第二声道信号的初始混响增益参数以及所述第一声道信号和所述第二 声道信号的标识信息,其中,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;
    根据所述标识信息确定所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号为目标声道信号;
    对所述目标声道信号的初始混响增益参数进行调整。
  16. 如权利要求15所述的方法,其特征在于,所述对所述目标声道信号的初始混响增益参数进行调整,包括:
    确定目标衰减因子;
    根据所述目标衰减因子对所述目标声道信号的初始混响增益参数进行调整,得到所述目标声道信号的目标混响增益参数。
  17. 如权利要求16所述的方法,其特征在于,所述确定目标衰减因子,包括:
    将预设的衰减因子确定为所述目标衰减因子。
  18. 如权利要求16所述的方法,其特征在于,所述确定目标衰减因子,包括:
    根据所述码流获取所述目标衰减因子。
  19. 如权利要求16所述的方法,其特征在于,所述确定目标衰减因子,包括:
    从所述码流获取所述第一声道信号和所述第二声道信号的声道间电平差;
    根据所述声道间电平差确定所述目标衰减因子,或者,
    根据所述声道间电平差以及所述下混信号,确定所述目标衰减因子。
  20. 如权利要求16-19中任一项所述的方法,其特征在于,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
  21. 一种编码器,其特征在于,包括:
    处理单元,用于确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;
    所述处理单元还用于根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,以及所述初始混响增益参数,确定所述第一声道信号和所述第二声道信号的目标混响增益参数;
    编码单元,用于根据所述下混信号和所述目标混响增益参数,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
  22. 如权利要求21所述的编码器,其特征在于,所述处理单元具体用于:
    根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定目标衰减因子;
    根据所述目标衰减因子对所述初始混响增益参数进行调整,得到所述目标混响增益参数。
  23. 如权利要求22所述的编码器,其特征在于,所述第一声道信号和所述第二声道信号均包含多个频点,所述处理单元具体用于:
    确定所述第一声道信号和所述第二声道信号的能量分别与所述下混信号在所述多个频点的能量的差异值;
    根据所述差异值确定所述目标衰减因子。
  24. 如权利要求23所述的编码器,其特征在于,所述处理单元具体用于:
    确定所述第一声道信号的能量与所述下混信号的能量的第一差异值,所述第一差异值用于指示所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;
    确定所述第二声道信号的能量与所述下混信号的能量的第二差异值,所述第二差异值用于指示所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;
    根据所述第一差异值和所述第二差异值的比值,确定所述目标衰减因子。
  25. 如权利要求23或24所述的编码器,其特征在于,在根据所述差异值确定所述目标衰减因子之前,所述处理单元具体还用于:
    确定所述差异值大于预设阈值。
  26. 如权利要求23-25中任一项所述的编码器,其特征在于,所述下混信号的能量是根据所述第一声道信号和第二声道信号的能量确定的。
  27. 如权利要求22-26中任一项所述的编码器,其特征在于,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述多个声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
  28. 一种编码器,其特征在于,包括:
    处理单元,用于确定多声道信号中的第一声道信号和第二声道信号的下混信号以及所述第一声道信号和所述第二声道信号的初始混响增益参数;
    所述处理单元还用于根据所述第一声道信号、所述第二声道信号分别与所述下混信号的相关性,确定所述第一声道信号和所述第二声道信号的标识信息,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;
    编码单元,用于根据所述下混信号、所述初始混响增益参数以及所述标识信息,对所述第一声道信号和所述第二声道信号进行量化,并将量化后的第一声道信号和第二声道信号写入码流。
  29. 如权利要求28所述的编码器,其特征在于,所述处理单元具体用于:
    根据所述第一声道信号、所述第二声道信号的能量分别与所述下混信号的能量的相关性,确定所述第一声道信号和所述第二声道信号的标识信息。
  30. 如权利要求29所述的编码器,其特征在于,处理单元具体用于:
    确定第一差异值和第二差异值,所述第一差异值为所述第一声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和,所述第二差异值为所述第二声道信号与所述下混信号分别在多个频点的能量的差值的绝对值的和;
    根据所述第一差异值和所述第二差异值确定所述第一声道信号和所述第二声道信号的标识信息。
  31. 如权利要求30所述的编码器,其特征在于,处理单元具体用于:
    将所述第一差异值和所述第二差异值中的最大差异值确定为目标差异值;
    根据目标差异值确定所述标识信息,所述标识信息具体用于指示所述目标差异值对应的声道信号,所述目标差异值对应的声道信号为需要调整初始混响增益参数的声道信号。
  32. 如权利要求31所述的编码器,其特征在于,所述处理单元还用于:
    根据所述第一差异值和所述第二差异值确定目标衰减因子,所述目标衰减因子用于对所述目标声道信号的初始混响增益参数进行调整;
    对所述目标衰减因子进行量化,并将量化后的目标衰减因子写入所述码流。
  33. 如权利要求32所述的编码器,其特征在于,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
  34. 如权利要求29-33中任一项所述的编码器,其特征在于,所述下混信号的能量是根据所述第一声道信号和所述第二声道信号的能量确定的。
  35. 一种解码器,其特征在于,包括:
    获取单元,用于获取码流;
    处理单元,用于根据所述码流确定多声道信号中的第一声道信号和第二声道信号的下混信号、所述第一声道信号和所述第二声道信号的初始混响增益参数以及所述第一声道信号和所述第二声道信号的标识信息,其中,所述标识信息用于指示所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号;
    所述处理单元还用于根据所述标识信息确定所述第一声道信号和所述第二声道信号中需要调整初始混响增益参数的声道信号为目标声道信号;
    所述处理单元还用于对所述目标声道信号的初始混响增益参数进行调整。
  36. 如权利要求35所述的解码器,其特征在于,所述处理单元具体用于:
    确定目标衰减因子;
    根据所述目标衰减因子对所述目标声道信号的初始混响增益参数进行调整,得到所述目标声道信号的目标混响增益参数。
  37. 如权利要求36所述的解码器,其特征在于,所述处理单元具体用于:
    将预设的衰减因子确定为所述目标衰减因子。
  38. 如权利要求36所述的解码器,其特征在于,所述处理单元具体用于:
    根据所述码流获取所述目标衰减因子。
  39. 如权利要求36所述的解码器,其特征在于,所述处理单元具体用于:
    从所述码流获取所述第一声道信号和所述第二声道信号的声道间电平差;
    根据所述声道间电平差确定所述目标衰减因子,或者,
    根据所述声道间电平差以及所述下混信号,确定所述目标衰减因子。
  40. 如权利要求36-39中任一项所述的解码器,其特征在于,所述目标衰减因子包括多个衰减因子,所述多个衰减因子中的每个衰减因子分别对应所述目标声道信号的至少一个子带,并且任意一个子带仅对应一个衰减因子。
PCT/CN2018/077782 2017-03-31 2018-03-01 多声道信号的编解码方法和编解码器 WO2018177066A1 (zh)

Priority Applications (11)

Application Number Priority Date Filing Date Title
EP21170071.1A EP3917171B1 (en) 2017-03-31 2018-03-01 Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder
JP2019553260A JP6804666B2 (ja) 2017-03-31 2018-03-01 マルチチャネル信号符号化方法、マルチチャネル信号復号化方法、符号器、及び復号器
EP24152513.8A EP4375994A2 (en) 2017-03-31 2018-03-01 Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder
EP18776186.1A EP3588497B1 (en) 2017-03-31 2018-03-01 Multi-channel signal encoding and decoding method and codec
BR112019020468A BR112019020468A2 (pt) 2017-03-31 2018-03-01 método de codificação de sinal multicanal, método de decodificação de sinal multicanal, codificador e decodificador
KR1020197029632A KR102281097B1 (ko) 2017-03-31 2018-03-01 멀티-채널 신호 인코딩 및 디코딩 방법 및 코덱
CN201880022744.XA CN110462733B (zh) 2017-03-31 2018-03-01 多声道信号的编解码方法和编解码器
ES18776186T ES2882626T3 (es) 2017-03-31 2018-03-01 Método de codificación y decodificación de señales multicanales y codec
US16/586,128 US11386907B2 (en) 2017-03-31 2019-09-27 Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder
US17/837,558 US11894001B2 (en) 2017-03-31 2022-06-10 Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder
US18/393,866 US20240135938A1 (en) 2017-03-31 2023-12-22 Multi-Channel Signal Encoding Method, Multi-Channel Signal Decoding Method, Encoder, and Decoder

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710205821.2A CN108665902B (zh) 2017-03-31 2017-03-31 多声道信号的编解码方法和编解码器
CN201710205821.2 2017-03-31

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US16/586,128 Continuation US11386907B2 (en) 2017-03-31 2019-09-27 Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder

Publications (1)

Publication Number Publication Date
WO2018177066A1 true WO2018177066A1 (zh) 2018-10-04

Family

ID=63674221

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/077782 WO2018177066A1 (zh) 2017-03-31 2018-03-01 多声道信号的编解码方法和编解码器

Country Status (8)

Country Link
US (3) US11386907B2 (zh)
EP (3) EP4375994A2 (zh)
JP (4) JP6804666B2 (zh)
KR (1) KR102281097B1 (zh)
CN (2) CN108665902B (zh)
BR (1) BR112019020468A2 (zh)
ES (1) ES2882626T3 (zh)
WO (1) WO2018177066A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113985780A (zh) * 2021-10-28 2022-01-28 中国人民解放军战略支援部队信息工程大学 多通道远程控制装置及方法、存储介质及电子设备

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108665902B (zh) 2017-03-31 2020-12-01 华为技术有限公司 多声道信号的编解码方法和编解码器
CN108694955B (zh) 2017-04-12 2020-11-17 华为技术有限公司 多声道信号的编解码方法和编解码器
CN111654745B (zh) * 2020-06-08 2022-10-14 海信视像科技股份有限公司 多声道的信号处理方法及显示设备

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101149925A (zh) * 2007-11-06 2008-03-26 武汉大学 一种用于参数立体声编码的空间参数选取方法
CN101410889A (zh) * 2005-08-02 2009-04-15 杜比实验室特许公司 对作为听觉事件的函数的空间音频编码参数进行控制
CN101673548A (zh) * 2008-09-08 2010-03-17 华为技术有限公司 参数立体声编码方法、装置和参数立体声解码方法、装置
US20100153119A1 (en) * 2006-12-08 2010-06-17 Electronics And Telecommunications Research Institute Apparatus and method for coding audio data based on input signal distribution characteristics of each channel
CN102307323A (zh) * 2009-04-20 2012-01-04 华为技术有限公司 对多声道信号的声道延迟参数进行修正的方法
CN103700372A (zh) * 2013-12-30 2014-04-02 北京大学 一种基于正交解相关技术的参数立体声编码、解码方法
CN105405445A (zh) * 2015-12-10 2016-03-16 北京大学 一种基于声道间传递函数的参数立体声编码、解码方法

Family Cites Families (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ITMI20031258A1 (it) 2003-06-20 2004-12-21 Nextec Srl Procedimento e macchina per impermeabilizzare semilavorati di calzature, capi ed accessori di abbigliamento, e semilavorati ottenuti con tale procedimento o macchina.
SE0400998D0 (sv) * 2004-04-16 2004-04-16 Cooding Technologies Sweden Ab Method for representing multi-channel audio signals
US7756713B2 (en) * 2004-07-02 2010-07-13 Panasonic Corporation Audio signal decoding device which decodes a downmix channel signal and audio signal encoding device which encodes audio channel signals together with spatial audio information
AU2006266655B2 (en) * 2005-06-30 2009-08-20 Lg Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
JP2007025290A (ja) * 2005-07-15 2007-02-01 Matsushita Electric Ind Co Ltd マルチチャンネル音響コーデックにおける残響を制御する装置
KR101277041B1 (ko) * 2005-09-01 2013-06-24 파나소닉 주식회사 멀티 채널 음향 신호 처리 장치 및 방법
KR20080094710A (ko) 2005-10-26 2008-10-23 엘지전자 주식회사 멀티채널 오디오 신호의 부호화 및 복호화 방법과 그 장치
ATE476732T1 (de) 2006-01-09 2010-08-15 Nokia Corp Steuerung der dekodierung binauraler audiosignale
WO2007089129A1 (en) * 2006-02-03 2007-08-09 Electronics And Telecommunications Research Institute Apparatus and method for visualization of multichannel audio signals
US7965848B2 (en) 2006-03-29 2011-06-21 Dolby International Ab Reduced number of channels decoding
US8027479B2 (en) 2006-06-02 2011-09-27 Coding Technologies Ab Binaural multi-channel decoder in the context of non-energy conserving upmix rules
CN101166377A (zh) * 2006-10-17 2008-04-23 施伟强 一种多语种环绕立体声的低码率编解码方案
KR20080066537A (ko) * 2007-01-12 2008-07-16 엘지전자 주식회사 부가정보를 가지는 오디오신호의 부호화/복호화 방법 및장치
CN101572088A (zh) * 2008-04-30 2009-11-04 北京工业大学 立体声编解码方法、编解码器及编解码系统
EP2144229A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Efficient use of phase information in audio encoding and decoding
KR101614160B1 (ko) 2008-07-16 2016-04-20 한국전자통신연구원 포스트 다운믹스 신호를 지원하는 다객체 오디오 부호화 장치 및 복호화 장치
CA2820199C (en) 2008-07-31 2017-02-28 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Signal generation for binaural signals
KR101342425B1 (ko) * 2008-12-19 2013-12-17 돌비 인터네셔널 에이비 다중-채널의 다운믹싱된 오디오 입력 신호에 리버브를 적용하기 위한 방법 및 다중-채널의 다운믹싱된 오디오 입력 신호에 리버브를 적용하도록 구성된 리버브레이터
US8219408B2 (en) 2008-12-29 2012-07-10 Motorola Mobility, Inc. Audio signal decoder and method for producing a scaled reconstructed audio signal
KR101137360B1 (ko) 2009-01-28 2012-04-19 엘지전자 주식회사 오디오 신호 처리 방법 및 장치
CN102171754B (zh) * 2009-07-31 2013-06-26 松下电器产业株式会社 编码装置以及解码装置
CA2721845C (en) * 2009-11-20 2017-07-11 Penta TMR Inc. Vertical feed mixer having cutout edge
MX2012005781A (es) * 2009-11-20 2012-11-06 Fraunhofer Ges Forschung Aparato para proporcionar una representacion de señal de mezcla ascendente con base en la representacion de señal de mezcla descendente, aparato para proporcionar un flujo de bits que representa una señal de audio multicanal, metodos, programas informaticos y flujo de bits que representan una señal de audio multicanal usando un parametro de combinacion lineal.
JP5333257B2 (ja) 2010-01-20 2013-11-06 富士通株式会社 符号化装置、符号化システムおよび符号化方法
CN102157151B (zh) * 2010-02-11 2012-10-03 华为技术有限公司 一种多声道信号编码方法、解码方法、装置和系统
JP5299327B2 (ja) 2010-03-17 2013-09-25 ソニー株式会社 音声処理装置、音声処理方法、およびプログラム
US9424852B2 (en) * 2011-02-02 2016-08-23 Telefonaktiebolaget Lm Ericsson (Publ) Determining the inter-channel time difference of a multi-channel audio signal
KR101842258B1 (ko) * 2011-09-14 2018-03-27 삼성전자주식회사 신호 처리 방법, 그에 따른 엔코딩 장치, 및 그에 따른 디코딩 장치
KR20150101999A (ko) 2012-11-09 2015-09-04 스토밍스위스 에스에이알엘 다채널 신호의 비선형 역부호화
JP6160072B2 (ja) * 2012-12-06 2017-07-12 富士通株式会社 オーディオ信号符号化装置および方法、オーディオ信号伝送システムおよび方法、オーディオ信号復号装置
US9715880B2 (en) * 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
EP2840811A1 (en) * 2013-07-22 2015-02-25 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method for processing an audio signal; signal processing unit, binaural renderer, audio encoder and audio decoder
WO2016123774A1 (zh) * 2015-02-05 2016-08-11 华为技术有限公司 编解码方法和编解码器
CN108665902B (zh) 2017-03-31 2020-12-01 华为技术有限公司 多声道信号的编解码方法和编解码器

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101410889A (zh) * 2005-08-02 2009-04-15 杜比实验室特许公司 对作为听觉事件的函数的空间音频编码参数进行控制
US20100153119A1 (en) * 2006-12-08 2010-06-17 Electronics And Telecommunications Research Institute Apparatus and method for coding audio data based on input signal distribution characteristics of each channel
CN101149925A (zh) * 2007-11-06 2008-03-26 武汉大学 一种用于参数立体声编码的空间参数选取方法
CN101673548A (zh) * 2008-09-08 2010-03-17 华为技术有限公司 参数立体声编码方法、装置和参数立体声解码方法、装置
CN102307323A (zh) * 2009-04-20 2012-01-04 华为技术有限公司 对多声道信号的声道延迟参数进行修正的方法
CN103700372A (zh) * 2013-12-30 2014-04-02 北京大学 一种基于正交解相关技术的参数立体声编码、解码方法
CN105405445A (zh) * 2015-12-10 2016-03-16 北京大学 一种基于声道间传递函数的参数立体声编码、解码方法

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113985780A (zh) * 2021-10-28 2022-01-28 中国人民解放军战略支援部队信息工程大学 多通道远程控制装置及方法、存储介质及电子设备
CN113985780B (zh) * 2021-10-28 2024-01-12 中国人民解放军战略支援部队信息工程大学 多通道远程控制装置及方法、存储介质及电子设备

Also Published As

Publication number Publication date
JP2024059683A (ja) 2024-05-01
US11894001B2 (en) 2024-02-06
JP2022084671A (ja) 2022-06-07
EP3917171B1 (en) 2024-04-24
US20240135938A1 (en) 2024-04-25
EP3588497B1 (en) 2021-05-12
JP6804666B2 (ja) 2020-12-23
JP2021047432A (ja) 2021-03-25
JP7035154B2 (ja) 2022-03-14
CN108665902A (zh) 2018-10-16
EP3917171A1 (en) 2021-12-01
KR20190122839A (ko) 2019-10-30
CN110462733A (zh) 2019-11-15
EP3588497A4 (en) 2020-01-15
CN108665902B (zh) 2020-12-01
US20200027466A1 (en) 2020-01-23
EP4375994A2 (en) 2024-05-29
US11386907B2 (en) 2022-07-12
JP7436541B2 (ja) 2024-02-21
JP2020512590A (ja) 2020-04-23
EP3588497A1 (en) 2020-01-01
BR112019020468A2 (pt) 2020-04-28
CN110462733B (zh) 2022-05-10
ES2882626T3 (es) 2021-12-02
US20220310104A1 (en) 2022-09-29
KR102281097B1 (ko) 2021-07-22

Similar Documents

Publication Publication Date Title
JP7106711B2 (ja) マルチチャネル信号符号化方法、マルチチャネル信号復号方法、エンコーダ、およびデコーダ
JP6641018B2 (ja) チャネル間時間差を推定する装置及び方法
JP7035154B2 (ja) マルチチャネル信号符号化方法、マルチチャネル信号復号化方法、符号器、及び復号器
US20230352034A1 (en) Encoding and decoding methods, and encoding and decoding apparatuses for stereo signal

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18776186

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2019553260

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20197029632

Country of ref document: KR

Kind code of ref document: A

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112019020468

Country of ref document: BR

ENP Entry into the national phase

Ref document number: 2018776186

Country of ref document: EP

Effective date: 20190923

ENP Entry into the national phase

Ref document number: 112019020468

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20190930