WO2016141731A1 - 确定声道间时间差参数的方法和装置 - Google Patents
确定声道间时间差参数的方法和装置 Download PDFInfo
- Publication number
- WO2016141731A1 WO2016141731A1 PCT/CN2015/095090 CN2015095090W WO2016141731A1 WO 2016141731 A1 WO2016141731 A1 WO 2016141731A1 CN 2015095090 W CN2015095090 W CN 2015095090W WO 2016141731 A1 WO2016141731 A1 WO 2016141731A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- channel
- search
- complexity
- time domain
- domain signal
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 123
- 238000012545 processing Methods 0.000 claims abstract description 130
- 230000008569 process Effects 0.000 claims description 52
- 238000005070 sampling Methods 0.000 claims description 28
- 238000005314 correlation function Methods 0.000 claims description 24
- 238000009499 grossing Methods 0.000 claims description 14
- 238000001514 detection method Methods 0.000 claims description 6
- 230000006870 function Effects 0.000 description 17
- 230000005236 sound signal Effects 0.000 description 15
- 238000013507 mapping Methods 0.000 description 14
- 238000010586 diagram Methods 0.000 description 7
- 230000004069 differentiation Effects 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 3
- 238000004891 communication Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000013139 quantization Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
Definitions
- the present invention relates to the field of audio processing and, more particularly, to a method and apparatus for determining inter-channel time difference parameters.
- stereo audio has the sense of orientation and distribution of each source, which can improve the clarity and intelligibility of information, and is therefore favored by people.
- a transmission technology for a stereo audio signal is known, and the encoding end converts a stereo signal into a mono audio signal and an Inter-Channel Time Difference (ITD) parameter, which are respectively encoded and transmitted.
- ITD Inter-Channel Time Difference
- the stereo signal is further restored according to parameters such as ITD, thereby enabling low-bit high-quality transmission of the stereo signal.
- the encoding end can determine the limit value T max of the ITD parameter at the sampling rate based on the sampling rate of the input audio signal, and thus, based on the input audio signal, the search range of [-T max , T max ] Within the specified step size, the calculation is performed to obtain the ITD parameters. Therefore, regardless of the channel quality, the above search range and search step are the same.
- the accuracy requirements of the ITD parameters are different. For example, if the channel quality is poor, the accuracy of the ITD parameters is low. At this time, if the above larger search range and smaller search are still used, The step size will cause waste of computing resources and seriously affect processing efficiency.
- Embodiments of the present invention provide a method and apparatus for determining a time difference parameter between channels, which can adapt the accuracy of the determined ITD parameter to the channel quality.
- a method for determining an inter-channel time difference parameter comprising: determining a target search complexity from at least two search complexity, wherein the at least two search complexity and at least two channels The quality values are corresponding one by one; according to the target search complexity, the first channel signal and the second channel signal are searched to determine the first channel and the second channel corresponding to the first channel Time difference ITD parameter between channels.
- determining the target search complexity from the at least two search complexity includes: acquiring an encoding parameter for a stereo signal, where the stereo signal is based on the first A channel signal and a signal of the second channel are generated, and the coding parameter is determined according to a current channel quality value, and the coding parameter includes any one of the following parameters: an encoding bit rate, a coding bit number, or used to indicate the The complexity control parameter of the search complexity; according to the coding parameter, the target search complexity is determined from at least two search complexity.
- the at least two search complexity are in one-to-one correspondence with the at least two search steps, and the at least two search complexity includes the first a search complexity and a second search complexity, the at least two search steps including a first search step and a second search step, wherein the first search step corresponding to the first search complexity is smaller than the second Searching for a second search step corresponding to the complexity, the first search complexity being higher than the second search complexity, and the signal of the first channel and the signal of the second channel according to the target search complexity
- Performing a search process includes: determining a target search step size corresponding to the target search complexity; and performing a search process on the signal of the first channel and the signal of the second channel according to the target search step.
- the at least two search complexity are in one-to-one correspondence with at least two search ranges, where the at least two search complexity includes a third search
- the complexity and the fourth search complexity the at least two search ranges include a first search range and a second search range, wherein the first search range corresponding to the third search complexity is greater than the fourth search complexity a second search range, the third search complexity is higher than the fourth search complexity, and the search processing is performed on the signal of the first channel and the signal of the second channel according to the target search complexity, including: Determining a target search range corresponding to the target search complexity; and searching for the first channel signal and the second channel signal on the target search range.
- the determining a target search range corresponding to the target search complexity includes: determining a time domain signal according to the first channel The time domain signal of the second channel determines a reference parameter corresponding to an acquisition order between the time domain signal of the first channel and the time domain signal of the second channel, wherein the first sound The time domain signal of the track and the time domain signal of the second channel correspond to the same time period; the target search range is determined according to the target search complexity, the reference parameter and the limit value T max , wherein the limit value T max is The target search range belongs to [-T max , 0], or the target search range belongs to [0, T max ], determined according to the sampling rate of the time domain signal of the first channel.
- determining the reference parameter according to the time domain signal of the first channel and the time domain signal of the second channel including: The time domain signal of the first channel and the time domain signal of the second channel are subjected to cross-correlation processing to determine a first cross-correlation processing value and a second cross-correlation processing value, wherein the first cross-correlation processing value is a maximum function value of a first-channel time domain signal relative to a cross-correlation function of the second-channel time-domain signal within a preset range, the second cross-correlation processing value being a time domain signal of the second channel a maximum function value of the cross-correlation function of the time domain signal of the first channel in the preset range; determining the size relationship between the first cross-correlation processing value and the second cross-correlation processing value Benchmark parameters.
- the reference parameter is an index value corresponding to a larger one of the first cross-correlation processing value and the second cross-correlation processing value. Or the opposite of the index value.
- determining the reference parameter according to the time domain signal of the first channel and the time domain signal of the second channel including: The time domain signal of the first channel and the time domain signal of the second channel perform peak detection processing to determine a first index value and a second index value, wherein the first index value is related to the first channel An index value corresponding to a maximum amplitude value of the time domain signal within a preset range, the second index value being an index value corresponding to a maximum amplitude value of the time domain signal of the second channel within the preset range; The reference parameter is determined according to a size relationship between the first index value and the second index value.
- the method further includes: performing smoothing processing on the first ITD parameter based on the second ITD parameter, where the first ITD parameter Is the ITD parameter of the first time period, the second ITD parameter is a smoothed value of the ITD parameter of the second time period, and the second time period is before the first time period.
- an apparatus for determining a time difference parameter between channels comprising: a determining unit, configured to determine a target search complexity from at least two search complexity, wherein the at least two search complexity correspond to at least two channel quality values one by one; and a processing unit configured to search according to the target The complexity is to perform a search process on the signal of the first channel and the signal of the second channel to determine a first inter-channel time difference ITD parameter corresponding to the first channel and the second channel.
- the determining unit is specifically configured to acquire an encoding parameter for the stereo signal, where the stereo signal is based on the signal of the first channel and the second channel Generating the signal, the encoding parameter is determined according to a current channel quality value, and the encoding parameter includes any one of the following parameters: an encoding bit rate, a number of encoding bits, or a complexity control parameter used to indicate the search complexity; The encoding parameter determines a target search complexity from at least two search complexity.
- the at least two search complexity are in one-to-one correspondence with the at least two search steps, and the at least two search complexity includes the first a search complexity and a second search complexity, the at least two search steps including a first search step and a second search step, wherein the first search step corresponding to the first search complexity is smaller than the second Searching for a second search step corresponding to the complexity, the first search complexity is higher than the second search complexity, and the processing unit is specifically configured to determine a target search step size corresponding to the target search complexity; Searching for the signal of the first channel and the signal of the second channel according to the target search step.
- the at least two search complexity are in one-to-one correspondence with at least two search ranges, where the at least two search complexity includes a third search
- the complexity and the fourth search complexity the at least two search ranges include a first search range and a second search range, wherein the first search range corresponding to the third search complexity is greater than the fourth search complexity a second search range, the third search complexity is higher than the fourth search complexity
- the processing unit is specifically configured to determine a target search range corresponding to the target search complexity; for using the target search range And performing a search process on the signal of the first channel and the signal of the second channel.
- the processing unit is configured to determine, according to the time domain signal of the first channel and the time domain signal of the second channel, a reference parameter, the reference parameter corresponding to an acquisition sequence between the time domain signal of the first channel and the time domain signal of the second channel, wherein the time domain signal of the first channel and the second channel
- the time domain signal corresponds to the same time period
- the target search range is determined according to the target search complexity, the reference parameter and the limit value T max , wherein the limit value T max is based on the time domain signal of the first channel
- the target search range is determined by the sampling rate, which belongs to [-T max , 0], or the target search range belongs to [0, T max ].
- the processing unit is specifically configured to perform mutual interaction between the time domain signal of the first channel and the time domain signal of the second channel.
- Correlation processing to determine a first cross-correlation processing value and a second cross-correlation processing value, wherein the first cross-correlation processing value is a time domain signal of the first channel relative to a time domain signal of the second channel a maximum function value of the cross-correlation function within a preset range, the second cross-correlation processing value being a cross-correlation function of the time domain signal of the second channel relative to the time domain signal of the first channel at the preset range a maximum function value; configured to determine the reference parameter according to a size relationship between the first cross-correlation processing value and the second cross-correlation processing value.
- the reference parameter is an index value corresponding to a larger one of the first cross-correlation processing value and the second cross-correlation processing value. Or the opposite of the index value.
- the processing unit is configured to perform peaking on the time domain signal of the first channel and the time domain signal of the second channel. Detecting to determine a first index value and a second index value, wherein the first index value is an index value corresponding to a maximum amplitude value of the first channel time domain signal within a preset range, the first The index value corresponding to the maximum amplitude value of the time domain signal of the second channel in the preset range; for determining a size relationship between the first index value and the second index value, Determine the baseline parameter.
- the processing unit is further configured to perform a smoothing process on the first ITD parameter based on the second ITD parameter, where the first ITD The parameter is an ITD parameter of a first time period, the second ITD parameter being a smoothed value of the ITD parameter of the second time period, the second time period being before the first time period.
- a method and apparatus for determining an inter-channel time difference parameter according to an embodiment of the present invention, determining a target search complexity corresponding to a current channel quality from at least two search complexity levels, and pairing the first according to the target search complexity
- the signal of the channel and the signal of the second channel are searched, so that the accuracy of the determined ITD parameter can be adapted to the channel quality, and thus, in the case of poor current channel quality, the target search complexity can be reduced.
- the complexity or amount of computation of the search process can support savings in computing resources and increase processing efficiency.
- FIG. 1 is a schematic flow chart of a method of determining an inter-channel time difference parameter according to an embodiment of the present invention.
- FIG. 2 is a schematic diagram of a search range determination process in accordance with an embodiment of the present invention.
- FIG. 3 is a schematic diagram of a process of determining a target search range according to another embodiment of the present invention.
- FIG. 4 is a schematic diagram of a process of determining a target search range according to still another embodiment of the present invention.
- FIG. 5 is a schematic block diagram of an apparatus for determining an inter-channel time difference parameter according to an embodiment of the present invention.
- FIG. 6 is a schematic structural diagram of an apparatus for determining an inter-channel time difference parameter according to an embodiment of the present invention.
- the execution body of the method 100 may be an encoding end device for transmitting an audio signal (also referred to as a transmitting device). As shown in FIG. 1, the method 100 includes:
- S110 Determine target search complexity from at least two search complexity, where the at least two search complexity corresponds to at least two channel quality values.
- the method 100 of determining an inter-channel time difference parameter of an embodiment of the present invention may be applied to an audio system having at least two channels in which, by at least two channels (ie, including the first channel and the a two-channel mono signal that synthesizes a stereo signal, for example, by a left sound
- the mono signal of the track i.e., an example of the first channel
- the mono signal from the right channel i.e., an example of the second channel
- a parametric stereo (PS) technique can be cited as a method for transmitting the stereo signal.
- the encoding end converts the stereo signal into a mono signal and a spatial sensing parameter, and respectively performs encoding, and the decoding end is obtained. After the mono audio, the stereo signal is further restored according to the spatial parameters.
- the inter-channel time difference (ITD) parameter is a spatial parameter indicating the horizontal orientation of the sound source and is an important component of the spatial parameter.
- the embodiment of the present invention mainly relates to the process of determining the ITD parameter.
- the process of encoding and decoding the stereo signal and the mono signal according to the ITD parameter is similar to the prior art, and a detailed description thereof is omitted herein to avoid redundancy.
- the audio system may also have three or more channels, and can pass The mono signal of any two channels is combined into a stereo signal.
- the processing procedure of applying the method 100 to an audio system having two channels ie, left channel and right channel
- left The channel is used as the first channel
- the right channel is used as the second channel.
- the method for acquiring the ITD parameters between the left and right channels is also different for different search complexity, so that the encoding end device may first determine the current search complexity before determining the ITD parameters.
- different search complexity corresponds to different ITD parameter acquisition manners (hereinafter, the specific relationship between the search complexity and the ITD parameter acquisition manner is described in detail), and the higher the search complexity, the obtained The higher the accuracy of the ITD parameters. Conversely, the lower the search complexity, the lower the accuracy of the obtained ITD parameters.
- the encoding end device can make the accuracy of the obtained ITD parameter correspond to the current channel quality by selecting the search complexity corresponding to the current channel quality (ie, the target search complexity).
- a plurality of (ie, at least two) channel qualities and a plurality of (ie, at least two) search complexity are directly recorded in a mapping relationship with each other in a mapping table (for ease of understanding) And the distinction is recorded as: mapping entry #1) and stored in the encoding device, so that the encoding device can directly search for the current channel quality in the mapping entry #1 after acquiring the current channel quality.
- Search complexity as a target search complexity.
- the search complexity can be divided into M levels (or, say, M kinds of search complexity, which are described as: M, M-1, ..., 1), and the M-level search complexity and M channels can be used.
- Quality for example, note: Q M , Q M-1 , Q M-2 , ..., Q 1 , where Q M >Q M-1 >Q M-2 >...>Q 1 ) one-to-one correspondence, ie :
- the search complexity corresponding to the channel quality Q M is M, that is, if the current channel quality is higher than or equal to the channel quality Q M , the determined target search complexity may be set to M.
- the search complexity corresponding to the channel quality Q M-1 is M-1, that is, if the current channel quality is higher than or equal to the channel quality Q M-1 and lower than the channel quality Q M , the determined target
- the search complexity can be set to M-1.
- the search complexity corresponding to the channel quality Q M-2 is M-2, that is, if the current channel quality is higher than or equal to the channel quality Q M-2 and lower than the channel quality Q M-1 , then determined
- the target search complexity can be set to M-2.
- the search complexity corresponding to the channel quality Q 2 is 2, that is, if the current channel quality is higher than or equal to the channel quality Q 2 and lower than the channel quality Q 3 , the determined target search complexity can be set. Is 2.
- the search complexity corresponding to the channel quality Q 1 is 1, that is, if the current channel quality is lower than the channel quality Q 2 , the determined target search complexity can be set to 1.
- the channel quality refers to the quality of a channel between an encoding end and a decoding end for transmitting an audio signal and an ITD parameter to be described later.
- determining the target search complexity from the at least two search complexity including:
- the encoding parameter is determined according to a current channel quality value, and the encoding parameter includes any one of the following parameters: an encoding bit rate, a coding bit number, or a complexity control parameter used to indicate the search complexity;
- the target search complexity is determined from at least two search complexity.
- the better the channel quality the higher the encoding bit rate, the larger the number of encoding bits.
- the worse the channel quality the lower the coding bit rate and the smaller the number of coded bits.
- a plurality of (ie, at least two) coding bit rates and a plurality of (ie, at least two) search complexityes may be recorded in a one-to-one correspondence relationship with each other in a mapping entry ( In order to facilitate understanding and distinguishing, it is recorded as: mapping entry #2) and stored in the encoding device, so that the encoding device can directly find and current in the mapping entry #2 after obtaining the current encoding bit rate.
- the encoding bit rate corresponds to the search complexity as the target search complexity.
- the method and process for the encoding end device to obtain the current encoding bit rate may be similar to the prior art, and a detailed description thereof is omitted in order to avoid redundancy.
- the search complexity can be divided into M levels (or, say, M kinds of search complexity, which are described as: M, M-1, ..., 1), and the M-level search complexity and M codes can be used.
- Bit rate (recorded as: B M , B M-1 , B M-2 , ..., B 1 , where B M >B M-1 >B M-2 >...>B 1 )
- B M >B M-1 >B M-2 >...>B 1 One-to-one correspondence, namely:
- the search complexity corresponding to the encoding bit rate B M is M, that is, if the current encoding bit rate is higher than or equal to the encoding bit rate B M , the determined target search complexity may be set to M.
- the search complexity corresponding to the encoding bit rate B M-1 is M-1, that is, if the current encoding bit rate is higher than or equal to the encoding bit rate B M-1 and lower than the encoding bit rate B M , then The determined target search complexity can be set to M-1.
- the encoding complexity corresponding to the encoding bit rate B M-2 is M-2, that is, if the current encoding bit rate is higher than or equal to the encoding bit rate B M-2 and lower than the encoding bit rate B M-1 , the determined target search complexity can be set to M-2.
- the encoding complexity corresponding to the encoding bit rate B 2 is 2, that is, if the current encoding bit rate is higher than or equal to the encoding bit rate B 2 and lower than the encoding bit rate B 3 , the determined target search is complicated.
- the degree can be set to 2.
- the encoding complexity corresponding to the encoding bit rate B 1 is 1, i.e., if the current encoding bit rate is lower than the encoding bit rate B 2 , the determined target search complexity can be set to 1.
- a plurality of (ie, at least two) coded bit numbers and a plurality of (ie, at least two) search complexity degrees may be recorded in a one-to-one correspondence relationship with each other in a mapping entry ( In order to facilitate understanding and distinguishing, it is recorded as: mapping entry #3) and stored in the encoding device, so that the encoding device can directly find and current in the mapping entry #3 after obtaining the current number of encoded bits.
- the number of encoded bits corresponds to the search complexity as the target search complexity.
- the method and the process for the encoding end device to obtain the current number of encoded bits may be similar to the prior art, and a detailed description thereof will be omitted in order to avoid redundancy.
- the search complexity can be divided into M levels (or, say, M kinds of search complexity, which are described as: M, M-1, ..., 1), and the M-level search complexity and M codes can be used.
- the number of bits (remembered as: C M , C M-1 , C M-2 , ..., C 1 , where C M >C M-1 >C M-2 >...>C 1 ) one-to-one correspondence, namely:
- the search complexity corresponding to the number of coded bits C M is M, that is, if the current number of coded bits is higher than or equal to the number of coded bits C M , the determined target search complexity may be set to M.
- the search complexity corresponding to the number of coded bits C M-1 is M-1, that is, if the current number of coded bits is higher than or equal to the number of coded bits C M-1 and lower than the number of coded bits C M , then
- the determined target search complexity can be set to M-1.
- the search complexity corresponding to the number of coded bits C M-2 is M-2, that is, if the current number of coded bits is higher than or equal to the number of coded bits C M-2 and lower than the number of coded bits C M-1 , the determined target search complexity can be set to M-2.
- the search complexity corresponding to the number of coded bits C 2 is 2, that is, if the current number of coded bits is higher than or equal to the number of coded bits C 2 and lower than the number of coded bits C 3 , the determined target search is complicated.
- the degree can be set to 2.
- the search complexity corresponding to the number of coded bits C 1 is 1, that is, if the current number of coded bits is lower than the number of coded bits C 2 , the determined target search complexity may be set to 1.
- different complexity control parameters may be configured for different channel qualities, so that different complexity control parameter values can be matched to different search complexity, and thus, multiple (ie, , at least two) complexity control parameter values and a plurality of (ie, at least two) search complexity one-to-one correspondence with each other are recorded in the mapping table item (for ease of understanding and differentiation, it is recorded as: mapping table item #4 ) and stored in the encoding device, so that the encoding device can After obtaining the current complexity control parameter value, the search complexity corresponding to the current complexity control parameter value is directly searched in the mapping table item #4 as the target search complexity.
- the complexity control parameter value can be written into the command line in advance, so that the encoding end device can read the current complexity control parameter value in the command line.
- the search complexity can be divided into M levels (or, say, M kinds of search complexity, which are described as: M, M-1, ..., 1), and the M-level search complexity and M complexes can be used.
- Degree control parameters (recorded as: N M , N M-1 , N M-2 , ..., N 1 , where N M >N M-1 >N M-2 >...>N 1 ) one-to-one correspondence, ie :
- the complexity of the complexity control parameter N M is M, that is, if the current complexity control parameter is higher than or equal to the complexity control parameter N M , the determined target search complexity may be set to M. .
- the complexity of the complexity control parameter N M-1 is M-1, that is, if the current complexity control parameter is higher than or equal to the complexity control parameter N M-1 and lower than the complexity control parameter. N M , then the determined target search complexity can be set to M-1.
- the complexity control parameter N M-2 corresponds to a search complexity of M-2, that is, if the current complexity control parameter is higher than or equal to the complexity control parameter N M-2 and lower than the complexity control parameter N M-1 , the determined target search complexity can be set to M-2.
- the complexity control parameter N 2 corresponds to a search complexity of 2, that is, if the current complexity control parameter is higher than or equal to the complexity control parameter N 2 and lower than the complexity control parameter N 3 , then determined.
- the target search complexity can be set to 2.
- the complexity control parameter N 1 corresponds to a search complexity of 1, that is, if the current complexity control parameter is lower than the complexity control parameter N 2 , the determined target search complexity may be set to 1.
- the coding bit rate, the number of coding bits or the complexity control parameter listed above as the coding parameters are merely exemplary descriptions, and the present invention is not limited thereto. Others can be determined by channel quality, or can reflect channel quality. The information or parameters are all within the scope of the present invention.
- the encoding end device may perform a search process according to the target search complex to acquire the ITD parameter.
- different search complexity may correspond to different search step sizes (ie, case 1), or different search complexity may correspond to different search ranges (ie, case 2), below, respectively
- the encoder determines the ITD parameters based on the target search complexity. The process is described in detail.
- the at least two search complexity are in one-to-one correspondence with at least two search steps, the at least two search complexity including a first search complexity and a second search complexity, the at least two search steps including the first search step a second search step size, wherein the first search step size corresponding to the first search complexity is less than the second search step size corresponding to the second search complexity, the first search complexity being higher than the first Second search complexity, and
- the first channel signal and the second channel signal are searched, including:
- the M kinds of search complexity (ie, M, M-1, . . . , 1) may be compared with M search steps (recorded as: L M , L M-1 , L M-2 ,...,L 1 , where L M ⁇ L M-1 ⁇ L M-2 ⁇ ... ⁇ L 1 ) one-to-one correspondence, namely:
- L M search step searches corresponding complexity is M, i.e., determined as described above, if the target of the search complexity is M, M can be the complexity of the search for the corresponding search step is set to L M Target search step size.
- the search complexity corresponding to the search step L M-1 is M-1, that is, if the target search complexity determined as described above is M-1, the search complexity M-1 may be corresponding.
- the search step size L M-1 is set to the target search step size.
- the search complexity corresponding to the search step L M-2 is M-2, that is, if the target search complexity determined as described above is M-2, the search complexity M-2 may be corresponding.
- the search step size L M-2 is set to the target search step size.
- search step L 2 corresponding to the search complexity is 2, i.e., determined as described above if the search complexity of the target is 2, may be the complexity of the search corresponding to 2 L 2 set search step Search for the step size for the target.
- a search step length L corresponding to the search complexity is 1, i.e., if the search is determined as described above is a certain complexity, the complexity of the search can be the one corresponding to the length L 1 is set search step Search for the step size for the target.
- M search step sizes i.e., L M , L M-1 , L M-2 , ..., L 1
- L M search step sizes can be determined according to the following formula. Specific value.
- K is a preset value, indicating the number of searches when the complexity is the lowest, Indicates the rounding operation.
- the left channel signal and the right channel signal may be searched according to the target search step length, Determine the ITD parameters.
- the above-mentioned searchable processing may be performed in the time domain (ie, mode 1) or in the frequency domain (ie, mode 2), and the present invention is not particularly limited. .
- the encoding end device can acquire an audio signal corresponding to the left channel by, for example, an audio input device such as a microphone corresponding to the left channel, and according to a preset sampling rate ⁇ (ie, the first channel
- ⁇ ie, the first channel
- An example of the sampling rate of the time domain signal is to sample the audio signal to generate a time domain signal of the left channel (that is, an example of the time domain signal of the first channel, hereinafter, for ease of understanding and distinction, Do the time domain signal #L).
- the process of acquiring the time domain signal #L may be similar to the prior art.
- detailed description thereof is omitted.
- the sampling rate of the time domain signal of the first channel is the same as the sampling rate of the time domain signal of the second channel. Therefore, similarly, the encoding end device may be, for example, opposite to the right channel.
- An audio input device such as a microphone acquires an audio signal corresponding to the right channel, and samples the audio signal according to the sampling rate ⁇ to generate a time domain signal of the right channel (ie, the second channel
- a time domain signal #R An example of the time domain signal is hereinafter described as a time domain signal #R) for ease of understanding and differentiation.
- the time domain signal #L and the time domain signal #R are time domain signals corresponding to the same time period (or time domain signals acquired in the same time period), for example, when The domain signal #L and the time domain signal #R may be time domain signals corresponding to the same frame (ie, 20 ms). In this case, the time domain signal #L and the time domain signal #R can be obtained corresponding to the one frame signal.
- An ITD parameter when the domain signal #L and the time domain signal #R may be time domain signals corresponding to the same frame (ie, 20 ms).
- the time domain signal #L and the time domain signal #R may also be time domain signals corresponding to the same subframe (ie, 10 ms or 5 ms, etc.) in the same frame.
- the time domain signal #R can obtain a plurality of ITD parameters corresponding to the one frame signal, for example, if the subframe corresponding to the time domain signal #L and the time domain signal #R is 10 ms, then the frame is passed (ie, , 20ms) signal can get two ITD parameters.
- the subframe corresponding to the time domain signal #L and the time domain signal #R is 5 ms
- four ITD parameters can be obtained by the one frame (ie, 20 ms) signal.
- the lengths of the time periods corresponding to the time domain signal #L and the time domain signal #R enumerated above are merely illustrative, and the present invention is not limited thereto, and the length of the time period may be arbitrarily changed as needed.
- the encoding end device may perform a search process on the time domain signal #L and the time domain signal #R according to the target search step size (ie, L t ) determined as described above, that is,
- the encoding end device may determine the cross-correlation function c n (i) of the time domain signal #L with respect to the time domain signal #R according to the following Equation 1, and determine the time domain signal #R relative to the time domain signal according to the following Equation 2 #L's cross-correlation function c p (i), ie:
- x R (j) represents the signal value of the time domain signal #R at the jth sampling point
- x L (j+i) represents the signal value of the time domain signal #L at the j+ith sampling point
- x L (j) represents the signal value of the time domain signal #L at the jth sampling point
- x R (j+i) represents the signal value of the time domain signal #R at the j+ith sampling point
- Length represents The total number of sampling points included in the time domain signal #R time domain signal #L, or the length of the time domain signal #R time domain signal #L, for example, may be the length of one frame (ie, 20 ms) or one sub The length of the frame (for example, 10ms or 5ms, etc.);
- T max represents a limit value of the ITD parameter (or the maximum value of the acquisition time difference between the left time domain signal #L and the time domain signal #R) may be determined according to the above sampling rate ⁇ , and the determination method thereof may be The prior art is similar, and detailed description is omitted here to avoid redundancy;
- Step 4 The encoding end device can calculate the time domain signal #L determined relative to the time domain signal when the search processing is performed on the time domain signal #R and the time domain signal #L with the target search step size (ie, L t ).
- the encoding side device can calculate the time domain signal #R determined with respect to the time domain signal #L when the search processing is performed on the time domain signal #R and the time domain signal #L with the target search step size (ie, L t ).
- the encoding end device can be versus The comparison is made and the ITD parameters are determined based on the comparison results.
- the encoding device can The corresponding index value is taken as the ITD parameter.
- the encoding device can The opposite of the corresponding index value is taken as the ITD parameter.
- T max represents a limit value of the ITD parameter (or the maximum value of the acquisition time difference between the time domain signal #L and the time domain signal #R) may be determined according to the above sampling rate ⁇ , and the determination method thereof may be There is a technical similarity, and a detailed description thereof will be omitted herein to avoid redundancy.
- the encoding end device may perform time-frequency transform processing on the time domain signal #L to obtain a frequency domain signal of the left channel (ie, an example of a frequency domain signal of the first channel, and below, for easy understanding and differentiation, recording frequency Domain signal #L).
- the time domain signal #R may be subjected to time-frequency transform processing to obtain a frequency domain signal of the right channel (ie, an example of the frequency domain signal of the second channel, hereinafter, for ease of understanding and distinction, the frequency domain signal #R is recorded. )
- a fast Fourier transform (FFT, Fast Fourier) may be employed.
- FFT Fast Fourier transform
- the Transformation technology performs time-frequency transform processing based on Equation 3 below.
- X(k) represents the frequency domain signal and FFT_LENGTH represents the time-frequency transform length.
- x(n) represents a time domain signal (ie, time domain signal #L or time domain signal #R), and Length represents the total number of sampling points included in the time domain signal.
- the encoding end device may perform a search process on the frequency domain signal #L and the frequency domain signal #R according to the target search step size (ie, L t ) determined as described above, that is,
- the encoding end device may divide the FFT_LENGTH frequency points of the frequency domain signal into N subband (for example, 1) subband according to the preset bandwidth A, where the information is included for the kth subband A k
- the frequency point is A k-1 ⁇ b ⁇ A k -1;
- Step c calculating a correlation function mag(j) of the frequency domain signal #L and the frequency domain signal #R according to the following Equation 4
- X L (b) represents the signal value of the frequency domain signal #L at the bth frequency point
- X R (b) represents the signal value of the frequency domain signal #R at the bth frequency point
- FFT_LENGTH represents the time frequency conversion length.
- T max represents a limit value of the ITD parameter (or the maximum value of the acquisition time difference between the left time domain signal #L and the time domain signal #R) may be determined according to the above sampling rate ⁇ , and the determination method thereof may be The prior art is similar, and detailed description is omitted here to avoid redundancy.
- the encoding end device can determine the value of the ITD parameter of the kth subband. That is, the index value corresponding to the maximum value of mag(j).
- the number of subbands corresponds to the value of the ITD parameter.
- the encoding end device may further perform quantization processing and the like on the ITD parameter value, and process the processed ITD parameter value and the mono signal (for example, the time domain signal #L, the time domain signal #R, and the frequency domain signal). #L or frequency domain signal #R) is sent to the decoding device (or the receiving device).
- the decoder device can recover the stereo audio signal based on the mono audio signal and the ITD parameter value.
- the at least two search complexity are in one-to-one correspondence with at least two search scopes, the at least two search complexity including a third search complexity and a fourth search complexity, the at least two search ranges including the first search range and the first search range a second search range, wherein the first search range corresponding to the third search complexity is greater than the second search range corresponding to the fourth search complexity, the third search complexity being higher than the fourth search complexity, and
- the first channel signal and the second channel signal are searched, including:
- the M kinds of search complexity ie, M, M-1, . . . , 1
- M search ranges (recorded as: F M , F M-1 , F M-2 ,...,F 1 , where F M >F M-1 >F M-2 >...>F 1 ) one-to-one correspondence, namely:
- the search range corresponding F. M search complexity is M, i.e., determined as described above, if the target of the search complexity is M, M can be the complexity of the search corresponding to the search range set as the target search M F. range.
- the search complexity corresponding to the search range F M-1 is M-1, that is, if the target search complexity determined as described above is M-1, the search complexity M-1 may be corresponding.
- the search range F M-1 is set as the target search range.
- the search complexity corresponding to the search range F M-2 is M-2, that is, if the target search complexity determined as described above is M-2, the search complexity M-2 may be corresponding.
- the search range F M-2 is set as the target search range.
- F 2 corresponding to the search range of the search complexity is 2, i.e., determined as described above if the search complexity of the target is 2, the search complexity may be 2 F 2 is set corresponding to the search range as a target Search range.
- Search complexity is 1, i.e., if the search is determined as described above is a certain complexity, the complexity of the search can be the one corresponding to a search range set as the target F. Search range.
- the search ranges F M , F M-1 , F M-2 , . . . , F 1 may all be search ranges in the time domain, or the above search range F M , F M-1 , F M-2 , ..., F 1 may also be search ranges in the frequency domain, and the present invention is not particularly limited.
- the search range F M on the frequency domain with the highest search complexity can be determined as [-T max , T max ].
- the target search range corresponding to the target search complexity is determined, including:
- the target search range Determining the target search range according to the target search complexity, the reference parameter and the limit value T max , wherein the limit value T max is determined according to a sampling rate of the time domain signal, the target search range belongs to [-T max , 0], or the target search range belongs to [0, T max ].
- the encoding end device can determine the reference parameter based on the time domain signal #L and the time domain signal #R.
- the reference parameter may correspond to the time domain signal #L and the time domain signal #R acquisition order (for example, the sequence of input to the audio input device), and then, corresponding to the determination process of the reference parameter, the corresponding parameter
- the relationship is described in detail.
- the reference parameter (ie, mode X) may be determined by performing cross-correlation processing on the time domain signal #L and the time domain signal #R, and may also search for the time domain signal #L and the time domain signal.
- the maximum value of #R is used to determine the reference parameter (ie, mode Y).
- the mode X and the mode Y will be described in detail.
- determining the reference parameter according to the time domain signal of the first channel and the time domain signal of the second channel including:
- Performing cross-correlation processing on the time domain signal of the first channel and the time domain signal of the second channel Determining a first cross-correlation processing value and a second cross-correlation processing value, wherein the first cross-correlation processing value is a cross-correlation function of a time domain signal of the first channel relative to a time domain signal of the second channel a maximum function value within a preset range, the second cross-correlation processing value being a maximum function of a cross-correlation function of the time domain signal of the second channel relative to the time domain signal of the first channel within the preset range value;
- the reference parameter is determined according to a size relationship between the first cross-correlation processing value and the second cross-correlation processing value.
- the encoding end device may determine the cross-correlation function c n (i) of the time domain signal #L with respect to the time domain signal #R according to the following Equation 5, that is,
- T max represents a limit value of the ITD parameter (or the maximum value of the acquisition time difference between the time domain signal #L and the time domain signal #R) may be determined according to the above sampling rate ⁇ , and the determination method thereof may be There is a technical similarity, and a detailed description thereof will be omitted herein to avoid redundancy.
- x R (j) represents the signal value of the time domain signal #R at the jth sampling point
- x L (j+i) represents the signal value of the time domain signal #L at the j+ith sampling point
- Length represents The total number of sampling points included in the time domain signal #R, or the length of the time domain signal #R, for example, may be the length of one frame (ie, 20 ms) or the length of one subframe (for example, 10 ms or 5 ms, etc.) ).
- the encoding end device can determine the maximum value of the cross correlation function c n (i)
- the encoding end device can determine the cross-correlation function c p (i) of the time domain signal #R with respect to the time domain signal #L according to Equation 6 below, namely:
- the encoding end device can determine the maximum value of the cross correlation function c p (i)
- the encoding end device may be configured according to versus The relationship between the parameters is determined by the following method X1 or mode X2.
- the encoding end device can determine that the time domain signal #L is acquired before the time domain signal #R, that is, the ITD parameter between the left and right channels is a positive number.
- the reference parameter T can be set to 1.
- the encoding end device can determine that the reference parameter is greater than 0, thereby determining that the search range is [0, T max ], that is, when the time domain signal #L is acquired before the time domain signal #R
- the ITD parameter is a positive number
- the search range is [0, T max ] (that is, an example in which the search range belongs to [0, T max ]).
- the encoding end device can determine that the time domain signal #L is acquired after the time domain signal #R, that is, the ITD parameter between the left and right channels is a negative number.
- the reference parameter T can be set to zero.
- the encoding end device can determine that the reference parameter is not greater than 0, thereby determining that the search range is [-T max , 0], that is, the time domain signal #L is after the time domain signal #R When acquired, the ITD parameter is a negative number and the search range is [-T max , 0] (ie, the search range belongs to an example of [-T max , 0]).
- the search scope is F 2 .
- the reference parameter is an inverse of an index value or an index value corresponding to a larger one of the first cross-correlation processing value and the second cross-correlation processing value.
- the encoding end device can determine that the time domain signal #L is acquired before the time domain signal #R, that is, the ITD parameter between the left and right channels is a positive number.
- the reference parameter T can be set to The corresponding index value.
- the encoding end device may further determine whether the reference parameter T is greater than or equal to T max /2, and determine a search range according to the determination result, for example, when T When ⁇ T max /2, the search range is [T max /2, T max ] (that is, an example in which the search range belongs to [0, T max ]). When T ⁇ T max /2, the search range is [0, T max /2] (that is, another example in which the search range belongs to [0, T max ]).
- the encoding end device can determine that the time domain signal #L is acquired after the time domain signal #R, that is, the ITD parameter between the left and right channels is a negative number.
- the reference parameter T can be set to The opposite of the corresponding index value.
- the encoding end device may further determine whether the reference parameter T is less than or equal to -T max /2, and determine a search range according to the determination result. For example, when T ⁇ -T max /2, the search range is [-T max , -T max /2] (that is, the search range belongs to an example of [-T max , 0]). When T>-T max /2, the search range is [-T max /2, 0] (that is, another example in which the search range belongs to [-T max , 0]).
- the complexity of the search comprises three or more, can be from the [-T max, -T max / 2 ], [- T max / 2,0], [0, T max / 2] and [
- determining the reference parameter according to the time domain signal of the first channel and the time domain signal of the second channel including:
- the first index value is the first sound value
- the second index value being an index corresponding to a maximum amplitude value of the time domain signal of the second channel within the preset range value
- the reference parameter is determined according to a size relationship between the first index value and the second index value.
- the encoding end device can detect the amplitude value of the time domain signal #L (represented as: L(j)) maximum value max(L(j)), j ⁇ [0, Length- 1], and record the index value p left corresponding to the max(L(j)), where Length represents the total number of sampling points included in the time domain signal #L.
- the encoding end device can detect the amplitude value (represented as: R(j)) maximum value max(R(j)), j ⁇ [0, Length-1] of the time domain signal #R, and record the max (R) (j)) The corresponding index value p right , where Length represents the total number of sample points included in the time domain signal #R.
- the encoding end device can determine the size relationship between p left and p right .
- the encoding end device can determine that the time domain signal #L is acquired before the time domain signal #R, that is, the ITD parameter between the left and right channels is a positive number.
- the reference parameter T can be set to 1.
- the encoding end device can determine that the reference parameter is greater than 0, thereby determining that the search range is [0, T max ], that is, when the time domain signal #L is acquired before the time domain signal #R
- the ITD parameter is a positive number
- the search range is [0, T max ] (that is, an example in which the search range belongs to [0, T max ]).
- the encoding end device may determine that the time domain signal #L is acquired after the time domain signal #R, that is, the ITD parameter between the left and right channels is a negative number. In this case, The reference parameter T is set to zero.
- the encoding end device can determine that the reference parameter is not greater than 0, thereby determining that the search range is [-T max , 0], that is, the time domain signal #L is after the time domain signal #R When acquired, the ITD parameter is a negative number and the search range is [-T max , 0] (ie, the search range belongs to an example of [-T max , 0]).
- the search scope is F 2 .
- the encoding end device may perform time-frequency transform processing on the time domain signal #L to obtain a frequency domain signal of the left channel (ie, an example of a frequency domain signal of the first channel, hereinafter, for ease of understanding and differentiation, the frequency domain is recorded.
- Signal #L The time domain signal #R may be subjected to time-frequency transform processing to obtain a frequency domain signal of the right channel (ie, an example of the frequency domain signal of the second channel, hereinafter, for ease of understanding and distinction, the frequency domain signal #R is recorded. )
- a fast Fourier transform (FFT) technique may be employed to perform time-frequency transform processing based on Equation 7 below.
- X(k) represents the frequency domain signal and FFT_LENGTH represents the time-frequency transform length.
- x(n) means The time domain signal (ie, time domain signal #L or time domain signal #R), Length represents the total number of sample points included in the time domain signal.
- the encoding end device can perform search processing on the frequency domain signal #L and the frequency domain signal #R determined as described above within the search range determined as described above to determine the ITD between the left channel and the right channel.
- Parameters for example, can be enumerated as follows:
- the encoding end device may divide the FFT_LENGTH frequency points of the frequency domain signal into N subband (for example, 1) subband according to the preset bandwidth A, where the frequency included in the kth subband A k is included.
- the point is A k-1 ⁇ b ⁇ A k -1,
- the correlation function mag(j) of the frequency domain signal #L is calculated according to the following Equation 8.
- X L (b) represents the signal value of the frequency domain signal #L at the bth frequency point
- X R (b) represents the signal value of the frequency domain signal #R at the bth frequency point
- FFT_LENGTH represents the time frequency conversion length.
- the range of values of j is the search range determined as described above. For ease of understanding and explanation, the search range is denoted as [a, b].
- the ITD parameter value of the kth subband is That is, the index value corresponding to the maximum value of mag(j).
- one or more (corresponding to the number of sub-bands determined as described above) between the left channel and the right channel can be obtained as the ITD parameter value.
- the encoding end device may further perform quantization processing or the like on the ITD parameter value, and send the processed ITD parameter value and the mono signal obtained by, for example, downmixing the signals of the left and right channels to the decoding end device. (or, the receiving device).
- the decoder device can recover the stereo audio signal based on the mono audio signal and the ITD parameter value.
- the method further includes:
- the first ITD parameter is an ITD parameter of a first time period
- the second ITD parameter is a smoothed value of an ITD parameter of a second time period
- the second The time period is before the first time period
- the encoding end device may further smooth the ITD parameter value as described above, as an example and not a limitation, the encoding end device. This smoothing can be performed according to Equation 5 below:
- T sm (k) w 1 *T sm [-1] (k)+w 2 *T(k) Equation 5
- T sm (k) represents the smoothed ITD parameter value corresponding to the kth frame or the kth subframe
- T sm [-1] represents the k-1th frame or the k-1th subframe corresponding to
- T(k) represents the unsmoothed ITD parameter value corresponding to the kth frame or the kth subframe
- w 1 and w 2 are smoothing factors
- T sm [-1] can be a preset value.
- the foregoing smoothing process may be performed by the encoding end device, or may be performed by the decoding end device, and the present invention is not particularly limited, that is, the encoding end.
- the device may also directly send the ITD parameter value obtained as described above to the decoding end device without performing the smoothing process described above, and perform smoothing processing on the ITD parameter value by the decoding end device, and perform smoothing processing by the decoding end device.
- the method and process may be similar to the method and process of smoothing performed by the above-mentioned decoding device. Here, in order to avoid redundancy, detailed description thereof will be omitted.
- a method for determining an inter-channel time difference parameter by determining a target search complexity corresponding to a current channel quality from at least two search complexity levels, and pairing the first channel according to the target search complexity
- the signal and the signal of the second channel are searched, so that the accuracy of the determined ITD parameter can be adapted to the channel quality, so that in the case of poor current channel quality, the search complexity can be reduced by the target search complexity.
- the complexity or amount of computation can support savings in computing resources and increase processing efficiency.
- FIG. 5 shows a schematic block diagram of an apparatus 200 for determining an inter-channel time difference parameter in accordance with an embodiment of the present invention. As shown in FIG. 5, the apparatus 200 includes:
- a determining unit 210 configured to determine a target search complexity from at least two search complexity, wherein the at least two search complexity correspond to at least two channel quality values one by one;
- the processing unit 220 is configured to perform a search process on the signal of the first channel and the signal of the second channel according to the target search complexity to determine a first corresponding to the first channel and the second channel Inter-channel time difference ITD parameter.
- the determining unit 210 is specifically configured to acquire an encoding parameter for the stereo signal, where the stereo signal is generated based on the signal of the first channel and the signal of the second channel, where the encoding parameter is based on a current channel.
- the encoding parameter includes any one of the following parameters: an encoding bit rate, a number of encoding bits, or a complexity control parameter used to indicate the search complexity; and is used to learn from at least two search complexity according to the encoding parameter. To determine the target search complexity.
- the at least two search complexity are in one-to-one correspondence with at least two search steps, the at least two search complexity including a first search complexity and a second search complexity, the at least two search steps including a first search step size and a second search step size, wherein the first search step size corresponding to the first search complexity is less than the second search step size corresponding to the second search complexity, the first search complexity Higher than the second search complexity, and the processing unit 220 is specifically configured to determine a target search step size corresponding to the target search complexity; and a signal for the first channel according to the target search step size and The signal of the second channel is subjected to a search process.
- the at least two search complexity are in one-to-one correspondence with the at least two search ranges, wherein the first search range corresponding to the third search complexity is greater than the second corresponding to the fourth search complexity a search range, the third search complexity is higher than the fourth search complexity, and the processing unit 220 is specifically configured to determine a target search range corresponding to the target search complexity; for using the target search range, The signal of the first channel and the signal of the second channel perform a search process.
- the processing unit 220 is configured to determine a reference parameter according to the time domain signal of the first channel and the time domain signal of the second channel, where the reference parameter corresponds to a time domain signal of the first channel. And an acquisition sequence between the time domain signals of the second channel, wherein the time domain signal of the first channel and the time domain signal of the second channel correspond to the same time period; and the search complexity is used according to the target
- the reference parameter and the limit value T max are determined, wherein the limit value T max is determined according to a sampling rate of the time domain signal of the first channel, and the target search range belongs to [-T max , 0], or the target search range belongs to [0, T max ].
- the processing unit 220 is configured to perform cross-correlation processing on the time domain signal of the first channel and the time domain signal of the second channel to determine a first cross-correlation processing value and a second cross-correlation processing.
- a value wherein the first cross-correlation processing value is a maximum function value of a cross-correlation function of the time domain signal of the first channel relative to a time domain signal of the second channel within a preset range, the second mutual The correlation processing value is a maximum function value of the cross-correlation function of the time domain signal of the second channel relative to the time domain signal of the first channel within the preset range; and is used for processing the value according to the first cross correlation and The size relationship between the second cross-correlation processing values determines the reference parameter.
- the reference parameter is an index value corresponding to a larger one of the first cross-correlation processing value and the second cross-correlation processing value or an inverse of the index value.
- the processing unit 220 is configured to perform peak detection processing on the time domain signal of the first channel and the time domain signal of the second channel to determine a first index value and a second index value, where
- the first index value is an index value corresponding to a maximum amplitude value of the first channel time domain signal within a preset range
- the second index value is a time domain signal with the second channel at the pre
- the index value corresponding to the maximum amplitude value in the range is used; and the reference parameter is determined according to the size relationship between the first index value and the second index value.
- the processing unit 220 is further configured to perform smoothing processing on the first ITD parameter based on the second ITD parameter, where the first ITD parameter is an ITD parameter of a first time period, and the second ITD parameter is a second A smoothed value of the ITD parameter of the time period, the second time period being before the first time period.
- the apparatus 200 for determining the inter-channel time difference parameter according to the embodiment of the present invention may correspond to the encoding end device in the method of the embodiment of the present invention, and
- the units and modules in the apparatus 200 for determining the inter-channel time difference parameter and the other operations and/or functions described above are respectively implemented in order to implement the corresponding processes of the method 100 in FIG. 1 , and are not described herein again for brevity.
- An apparatus for determining an inter-channel time difference parameter determining a target search complexity corresponding to a current channel quality from at least two search complexity levels, and pairing the first channel according to the target search complexity
- the signal and the signal of the second channel are searched, so that the accuracy of the determined ITD parameter can be adapted to the channel quality, so that in the case of poor current channel quality, the search complexity can be reduced by the target search complexity.
- the complexity or amount of computation can support savings in computing resources and increase processing efficiency.
- FIGS. 1 through 4 a method of determining an inter-channel time difference parameter according to an embodiment of the present invention is described in detail with reference to FIGS. 1 through 4.
- a method for determining an inter-channel time difference parameter according to an embodiment of the present invention will be described in detail with reference to FIG. device.
- FIG. 6 shows an illustration of an apparatus 300 for determining an inter-channel time difference parameter in accordance with an embodiment of the present invention.
- the device 300 can include:
- processor 320 connected to the bus
- the processor 320 calls, by using the bus 310, a program stored in the memory 330, for determining a target search complexity from at least two search complexity, wherein the at least two search complexity and at least two One channel quality value corresponds to one;
- the processor 320 is specifically configured to acquire an encoding parameter for a stereo signal, where the stereo signal is generated based on the signal of the first channel and the signal of the second channel, where the encoding parameter is based on a current channel.
- the encoding parameter includes any one of the following parameters: an encoding bit rate, a number of encoding bits, or a complexity control parameter used to indicate the search complexity;
- the at least two search complexity are in one-to-one correspondence with at least two search steps, the at least two search complexity including a first search complexity and a second search complexity, the at least two search steps including a first search step size and a second search step size, wherein the first search step size corresponding to the first search complexity is less than the second search step size corresponding to the second search complexity, the first search complexity Higher than the second search complexity, and
- the processor 320 is specifically configured to determine a target search step size corresponding to the target search complexity
- the at least two search complexity are in one-to-one correspondence with at least two search scopes, where the at least two search complexity includes a third search complexity and a fourth search complexity, where the at least two search ranges include the first a search range and a second search range, wherein the first search range corresponding to the third search complexity is greater than the second search range corresponding to the fourth search complexity, the third search complexity being higher than the fourth search Complexity, and
- the processor 320 is specifically configured to determine a target search range corresponding to the target search complexity
- the processor 320 is configured to determine a reference parameter according to the time domain signal of the first channel and the time domain signal of the second channel, where the reference parameter corresponds to a time domain signal of the first channel. And an acquisition sequence between the time domain signals of the second channel, wherein the time domain signal of the first channel and the time domain signal of the second channel correspond to the same time period;
- the target search range determining, according to the target search complexity, the reference parameter and the limit value T max , the target search range, wherein the limit value T max is determined according to a sampling rate of the time domain signal of the first channel, the target The search range belongs to [-T max , 0], or the target search range belongs to [0, T max ].
- the processor 320 is configured to perform cross-correlation processing on the time domain signal of the first channel and the time domain signal of the second channel to determine a first cross correlation processing value and a second cross correlation processing.
- a value wherein the first cross-correlation processing value is a maximum function value of a cross-correlation function of the time domain signal of the first channel relative to a time domain signal of the second channel within a preset range, the second mutual The correlation processing value is a maximum function value of the cross-correlation function of the time domain signal of the second channel relative to the time domain signal of the first channel within the preset range;
- the reference parameter is an index value corresponding to a larger one of the first cross-correlation processing value and the second cross-correlation processing value or an inverse of the index value.
- the processor 320 is configured to perform peak detection processing on the time domain signal of the first channel and the time domain signal of the second channel to determine a first index value and a second index value, where
- the first index value is an index value corresponding to a maximum amplitude value of the first channel time domain signal within a preset range
- the second index value is a time domain signal with the second channel at the pre Set the index value corresponding to the maximum amplitude value in the range
- the processor 320 is further configured to perform smoothing processing on the first ITD parameter based on the second ITD parameter, where the first ITD parameter is an ITD parameter of a first time period, and the second ITD parameter is a second A smoothed value of the ITD parameter of the time period, the second time period being before the first time period.
- bus 310 the various components of the device 300 are coupled together by a bus 310, wherein the bus 310 includes a power bus, a control bus, and a status signal in addition to the data bus. bus.
- bus 310 includes a power bus, a control bus, and a status signal in addition to the data bus.
- bus 310 various buses are labeled as bus 310 in the figure.
- the processor 320 can implement or perform the steps and logic blocks disclosed in the method embodiments of the present invention.
- Processor 320 can be a microprocessor or the processor can be any conventional processor, decoder or the like.
- the steps of the method disclosed in the embodiments of the present invention may be directly implemented by the hardware processor, or may be performed by a combination of hardware and software modules in the decoding processor.
- the software module can be located in a conventional storage medium such as random access memory, flash memory, read only memory, programmable read only memory or electrically erasable programmable memory, registers, and the like.
- the storage medium is located in the memory 330, and the processor reads the information in the memory 330 and performs the steps of the above method in combination with its hardware.
- the processor 320 may be a central processing unit (“CPU"), and the processor 320 may also be other general-purpose processors, digital signal processors (DSPs). , an application specific integrated circuit (ASIC), an off-the-shelf programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware component, and the like.
- the general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
- the memory 330 can include read only memory and random access memory and provides instructions and data to the processor 320. A portion of the memory 330 may also include a non-volatile random access memory. For example, the memory 330 can also store information of the device type.
- each step of the foregoing method may be completed by an integrated logic circuit of hardware in the processor 320 or an instruction in a form of software.
- the steps of the method disclosed in the embodiments of the present invention may be directly implemented as a hardware processor, or may be performed by a combination of hardware and software modules in the processor.
- the software module can be located in a conventional storage medium such as random access memory, flash memory, read only memory, programmable read only memory or electrically erasable programmable memory, registers, and the like.
- the apparatus 300 for determining the inter-channel time difference parameter according to the embodiment of the present invention may correspond to the encoding end device in the method of the embodiment of the present invention, and
- the units and modules in the apparatus 300 for determining the inter-channel time difference parameter and the other operations and/or functions described above are respectively implemented in order to implement the corresponding processes of the method 100 in FIG. 1 , and are not described herein again for brevity.
- An apparatus for determining an inter-channel time difference parameter determining a target search complexity corresponding to a current channel quality from at least two search complexity levels, and pairing the first channel according to the target search complexity
- the signal and the signal of the second channel are searched, so that the accuracy of the determined ITD parameter can be adapted to the channel quality, and thus, the current channel quality is poor.
- the complexity of the search processing can be reduced by the target search complexity, and the computational resource can be saved and the processing efficiency can be improved.
- the size of the sequence numbers of the above processes does not mean the order of execution, and the order of execution of each process should be determined by its function and internal logic, and should not be taken to the embodiments of the present invention.
- the implementation process constitutes any limitation.
- the disclosed systems, devices, and methods may be implemented in other manners.
- the device embodiments described above are merely illustrative.
- the division of the unit is only a logical function division.
- there may be another division manner for example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored or not executed.
- the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
- the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
- each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
- the functions may be stored in a computer readable storage medium if implemented in the form of a software functional unit and sold or used as a standalone product. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product stored in a storage medium. A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
- the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. .
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Investigating Or Analyzing Materials By The Use Of Ultrasonic Waves (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
Description
Claims (18)
- 一种确定声道间时间差参数的方法,其特征在于,所述方法包括:从至少两个搜索复杂度中,确定目标搜索复杂度,其中,所述至少两个搜索复杂度与至少两个信道质量值一一相对应;根据所述目标搜索复杂度,对第一声道的信号及第二声道的信号进行搜索处理,以确定与所述第一声道及所述第二声道相对应的第一声道间时间差ITD参数。
- 根据权利要求1所述的方法,其特征在于,所述从至少两个搜索复杂度中,确定目标搜索复杂度,包括:获取针对立体声信号的编码参数,所述立体声信号是基于所述第一声道的信号及所述第二声道的信号生成的,所述编码参数是根据当前的信道质量值确定的,所述编码参数包括以下任一参数:编码比特率、编码比特数或用于指示所述搜索复杂度的复杂度控制参数;根据所述编码参数,从至少两个搜索复杂度中,确定目标搜索复杂度。
- 根据权利要求1或2所述的方法,其特征在于,所述至少两个搜索复杂度与至少两个搜索步长一一对应,所述至少两个搜索复杂度包括第一搜索复杂度和第二搜索复杂度,所述至少两个搜索步长包括第一搜索步长和第二搜索步长,其中,与第一搜索复杂度相对应的第一搜索步长小于与第二搜索复杂度相对应的第二搜索步长,所述第一搜索复杂度高于所述第二搜索复杂度,以及所述根据所述目标搜索复杂度,对第一声道的信号及第二声道的信号进行搜索处理,包括:确定与所述目标搜索复杂度相对应的目标搜索步长;根据所述目标搜索步长,对所述第一声道的信号及所述第二声道的信号进行搜索处理。
- 根据权利要求1或2所述的方法,其特征在于,所述至少两个搜索复杂度与至少两个搜索范围一一对应,所述至少两个搜索复杂度包括第三搜索复杂度和第四搜索复杂度,所述至少两个搜索范围包括第一搜索范围和第二搜索范围,其中,与第三搜索复杂度相对应的第一搜索范围大于与第四搜索复杂度相对应的第二搜索范围,所述第三搜索复杂度高于所述第四搜索复杂度,以及所述根据所述目标搜索复杂度,对第一声道的信号及第二声道的信号进行搜索处理,包括:确定与所述目标搜索复杂度相对应的目标搜索范围;在所述目标搜索范围上,对所述第一声道的信号及所述第二声道的信号进行搜索处理。
- 根据权利要求4所述的方法,其特征在于,所述确定与所述目标搜索复杂度相对应的目标搜索范围,包括:根据所述第一声道的时域信号及所述第二声道的时域信号,确定基准参数,所述基准参数对应于所述第一声道的时域信号与所述第二声道的时域信号之间的获取顺序,其中,所述第一声道的时域信号及所述第二声道的时域信号对应于同一时段;根据所述目标搜索复杂度、所述基准参数和极限值Tmax,确定所述目标搜索范围,其中,所述极限值Tmax是根据所述第一声道的时域信号的采样率确定的,所述目标搜索范围属于[-Tmax,0],或所述目标搜索范围属于[0,Tmax]。
- 根据权利要求5所述的方法,其特征在于,所述根据第一声道的时域信号及第二声道的时域信号,确定基准参数,包括:对所述第一声道的时域信号及所述第二声道的时域信号进行互相关处理,以确定第一互相关处理值及第二互相关处理值,其中,所述第一互相关处理值是所述第一声道的时域信号相对于所述第二声道的时域信号的互相关函数在预设范围内的最大函数值,所述第二互相关处理值是所述第二声道的时域信号相对于所述第一声道的时域信号的互相关函数在所述预设范围内的最大函数值;根据所述第一互相关处理值及所述第二互相关处理值之间的大小关系,确定所述基准参数。
- 根据权利要求6所述的方法,其特征在于,所述基准参数是所述第一互相关处理值及所述第二互相关处理值中较大一方所对应的索引值或者所述索引值的相反数。
- 根据权利要求5所述的方法,其特征在于,所述根据第一声道的时域信号及第二声道的时域信号,确定基准参数,包括:对所述第一声道的时域信号及所述第二声道的时域信号进行峰值检测处理,以确定第一索引值及第二索引值,其中,所述第一索引值是与所述第 一声道的时域信号在预设范围内的最大幅度值相对应的索引值,所述第二索引值是与所述第二声道的时域信号在所述预设范围内的最大幅度值相对应的索引值;根据所述第一索引值与所述第二索引值之间的大小关系,确定所述基准参数。
- 根据权利要求1至8中任一项所述的方法,其特征在于,所述方法还包括:基于第二ITD参数,对所述第一ITD参数进行平滑处理,其中,所述第一ITD参数是第一时段的ITD参数,所述第二ITD参数是第二时段的ITD参数的平滑值,所述第二时段处于所述第一时段之前。
- 一种确定声道间时间差参数的装置,其特征在于,所述装置包括:确定单元,用于从至少两个搜索复杂度中,确定目标搜索复杂度,其中,所述至少两个搜索复杂度与至少两个信道质量值一一相对应;处理单元,用于根据所述目标搜索复杂度,对第一声道的信号及第二声道的信号进行搜索处理,以确定与所述第一声道及所述第二声道相对应的第一声道间时间差ITD参数。
- 根据权利要求10所述的装置,其特征在于,所述确定单元具体用于获取针对立体声信号的编码参数,所述立体声信号是基于所述第一声道的信号及所述第二声道的信号生成的,所述编码参数是根据当前的信道质量值确定的,所述编码参数包括以下任一参数:编码比特率、编码比特数或用于指示所述搜索复杂度的复杂度控制参数;用于根据所述编码参数,从至少两个搜索复杂度中,确定目标搜索复杂度。
- 根据权利要求10或11所述的装置,其特征在于,所述至少两个搜索复杂度与至少两个搜索步长一一对应,所述至少两个搜索复杂度包括第一搜索复杂度和第二搜索复杂度,所述至少两个搜索步长包括第一搜索步长和第二搜索步长,其中,与第一搜索复杂度相对应的第一搜索步长小于与第二搜索复杂度相对应的第二搜索步长,所述第一搜索复杂度高于所述第二搜索复杂度,以及所述处理单元具体用于确定与所述目标搜索复杂度相对应的目标搜索步长;用于根据所述目标搜索步长,对所述第一声道的信号及所述第二声道的信号进行搜索处理。
- 根据权利要求10或11所述的装置,其特征在于,所述至少两个搜索复杂度与至少两个搜索范围之间一一对应,其中,与第三搜索复杂度相对应的第一搜索范围大于与第四搜索复杂度相对应的第二搜索范围,所述第三搜索复杂度高于所述第四搜索复杂度,以及所述处理单元具体用于确定与所述目标搜索复杂度相对应的目标搜索范围;用于在所述目标搜索范围上,对所述第一声道的信号及所述第二声道的信号进行搜索处理。
- 根据权利要求13所述的装置,其特征在于,所述处理单元具体用于根据所述第一声道的时域信号及所述第二声道的时域信号,确定基准参数,所述基准参数对应于所述第一声道的时域信号与所述第二声道的时域信号之间的获取顺序,其中,所述第一声道的时域信号及所述第二声道的时域信号对应于同一时段;根据所述目标搜索复杂度、所述基准参数和极限值Tmax,确定所述目标搜索范围,其中,所述极限值Tmax是根据所述第一声道的时域信号的采样率确定的,所述目标搜索范围属于[-Tmax,0],或所述目标搜索范围属于[0,Tmax]。
- 根据权利要求14所述的装置,其特征在于,所述处理单元具体用于对所述第一声道的时域信号及所述第二声道的时域信号进行互相关处理,以确定第一互相关处理值及第二互相关处理值,其中,所述第一互相关处理值是所述第一声道的时域信号相对于所述第二声道的时域信号的互相关函数在预设范围内的最大函数值,所述第二互相关处理值是所述第二声道的时域信号相对于所述第一声道的时域信号的互相关函数在所述预设范围内的最大函数值;用于根据所述第一互相关处理值及所述第二互相关处理值之间的大小关系,确定所述基准参数。
- 根据权利要求15所述的装置,其特征在于,所述基准参数是所述第一互相关处理值及所述第二互相关处理值中较大一方所对应的索引值或者所述索引值的相反数。
- 根据权利要求14所述的装置,其特征在于,所述处理单元具体用 于对所述第一声道的时域信号及所述第二声道的时域信号进行峰值检测处理,以确定第一索引值及第二索引值,其中,所述第一索引值是与所述第一声道的时域信号在预设范围内的最大幅度值相对应的索引值,所述第二索引值是与所述第二声道的时域信号在所述预设范围内的最大幅度值相对应的索引值;用于根据所述第一索引值与所述第二索引值之间的大小关系,确定所述基准参数。
- 根据权利要求10至17中任一项所述的装置,其特征在于,所述处理单元还用于基于第二ITD参数,对所述第一ITD参数进行平滑处理,其中,所述第一ITD参数是第一时段的ITD参数,所述第二ITD参数是第二时段的ITD参数的平滑值,所述第二时段处于所述第一时段之前。
Priority Applications (10)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MX2017011466A MX2017011466A (es) | 2015-03-09 | 2015-11-20 | Metodo y aparato para determinar parametro de diferencia de tiempo inter-canal. |
KR1020177025506A KR20170116132A (ko) | 2015-03-09 | 2015-11-20 | 채널 간 시차 파라미터를 결정하는 방법 및 장치 |
CA2977843A CA2977843A1 (en) | 2015-03-09 | 2015-11-20 | Method and apparatus for determining inter-channel time difference parameter |
JP2017547578A JP2018508047A (ja) | 2015-03-09 | 2015-11-20 | チャネル間時間差パラメータを決定するための方法および装置 |
SG11201706997PA SG11201706997PA (en) | 2015-03-09 | 2015-11-20 | Method and apparatus for determining inter-channel time difference parameter |
EP15884409.2A EP3255632B1 (en) | 2015-03-09 | 2015-11-20 | Method and apparatus for determining time difference parameter among sound channels |
BR112017018819-8A BR112017018819A2 (zh) | 2015-03-09 | 2015-11-20 | Method and apparatus for determining the time difference between the channel parameters |
RU2017134756A RU2682026C1 (ru) | 2015-03-09 | 2015-11-20 | Способ и устройство для определения параметра межканальной разности времени |
AU2015385489A AU2015385489B2 (en) | 2015-03-09 | 2015-11-20 | Method and apparatus for determining inter-channel time difference parameter |
US15/696,716 US10388288B2 (en) | 2015-03-09 | 2017-09-06 | Method and apparatus for determining inter-channel time difference parameter |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510103379.3 | 2015-03-09 | ||
CN201510103379.3A CN106033672B (zh) | 2015-03-09 | 2015-03-09 | 确定声道间时间差参数的方法和装置 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/696,716 Continuation US10388288B2 (en) | 2015-03-09 | 2017-09-06 | Method and apparatus for determining inter-channel time difference parameter |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016141731A1 true WO2016141731A1 (zh) | 2016-09-15 |
Family
ID=56879889
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2015/095090 WO2016141731A1 (zh) | 2015-03-09 | 2015-11-20 | 确定声道间时间差参数的方法和装置 |
Country Status (12)
Country | Link |
---|---|
US (1) | US10388288B2 (zh) |
EP (1) | EP3255632B1 (zh) |
JP (1) | JP2018508047A (zh) |
KR (1) | KR20170116132A (zh) |
CN (1) | CN106033672B (zh) |
AU (1) | AU2015385489B2 (zh) |
BR (1) | BR112017018819A2 (zh) |
CA (1) | CA2977843A1 (zh) |
MX (1) | MX2017011466A (zh) |
RU (1) | RU2682026C1 (zh) |
SG (1) | SG11201706997PA (zh) |
WO (1) | WO2016141731A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3252756A4 (en) * | 2015-03-09 | 2017-12-13 | Huawei Technologies Co., Ltd. | Method and device for determining inter-channel time difference parameter |
TWI666630B (zh) * | 2017-06-29 | 2019-07-21 | 大陸商華為技術有限公司 | 時延估計方法及裝置 |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20210072736A (ko) * | 2018-10-08 | 2021-06-17 | 돌비 레버러토리즈 라이쎈싱 코오포레이션 | 인코딩 및 디코딩 동작을 단순화하기 위해 상이한 포맷으로 캡처된 오디오 신호들을 축소된 수의 포맷으로 변환하는 것 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1273663A (zh) * | 1998-05-26 | 2000-11-15 | 皇家菲利浦电子有限公司 | 具有改进的语音编码器的传输系统 |
CN1288557A (zh) * | 1998-01-21 | 2001-03-21 | 诺基亚移动电话有限公司 | 解码方法和包括自适应后置滤波器的系统 |
CN1820306A (zh) * | 2003-05-01 | 2006-08-16 | 诺基亚有限公司 | 可变比特率宽带语音编码中增益量化的方法和装置 |
CN101073109A (zh) * | 2004-09-30 | 2007-11-14 | 艾利森电话股份有限公司 | 用于编解码器选择中自适应门限的方法及装置 |
US8077893B2 (en) * | 2007-05-31 | 2011-12-13 | Ecole Polytechnique Federale De Lausanne | Distributed audio coding for wireless hearing aids |
US8948891B2 (en) * | 2009-08-12 | 2015-02-03 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding/decoding multi-channel audio signal by using semantic information |
Family Cites Families (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0669811A (ja) * | 1992-08-21 | 1994-03-11 | Oki Electric Ind Co Ltd | 符号化回路及び復号化回路 |
WO2003107591A1 (en) * | 2002-06-14 | 2003-12-24 | Nokia Corporation | Enhanced error concealment for spatial audio |
GB2453117B (en) | 2007-09-25 | 2012-05-23 | Motorola Mobility Inc | Apparatus and method for encoding a multi channel audio signal |
US20100290629A1 (en) * | 2007-12-21 | 2010-11-18 | Panasonic Corporation | Stereo signal converter, stereo signal inverter, and method therefor |
KR20100009981A (ko) * | 2008-07-21 | 2010-01-29 | 성균관대학교산학협력단 | 첫번째 다중 경로 성분에서의 동기화를 통한 초광대역 무선 통신 수신기에서의 동기화 방법 및 이를 이용한 초광대역 무선 통신 수신기 |
WO2010037427A1 (en) * | 2008-10-03 | 2010-04-08 | Nokia Corporation | Apparatus for binaural audio coding |
CN101408615B (zh) * | 2008-11-26 | 2011-11-30 | 武汉大学 | 双耳时间差itd临界感知特性的测量方法及其装置 |
CN101533641B (zh) * | 2009-04-20 | 2011-07-20 | 华为技术有限公司 | 对多声道信号的声道延迟参数进行修正的方法和装置 |
CN102307323B (zh) * | 2009-04-20 | 2013-12-18 | 华为技术有限公司 | 对多声道信号的声道延迟参数进行修正的方法 |
CN102422347B (zh) * | 2009-05-20 | 2013-07-03 | 松下电器产业株式会社 | 编码装置、解码装置及编码和解码方法 |
US8463414B2 (en) * | 2010-08-09 | 2013-06-11 | Motorola Mobility Llc | Method and apparatus for estimating a parameter for low bit rate stereo transmission |
US9424852B2 (en) * | 2011-02-02 | 2016-08-23 | Telefonaktiebolaget Lm Ericsson (Publ) | Determining the inter-channel time difference of a multi-channel audio signal |
EP3182409B1 (en) * | 2011-02-03 | 2018-03-14 | Telefonaktiebolaget LM Ericsson (publ) | Determining the inter-channel time difference of a multi-channel audio signal |
JP5947971B2 (ja) * | 2012-04-05 | 2016-07-06 | 華為技術有限公司Huawei Technologies Co.,Ltd. | マルチチャネルオーディオ信号の符号化パラメータを決定する方法及びマルチチャネルオーディオエンコーダ |
WO2013149671A1 (en) * | 2012-04-05 | 2013-10-10 | Huawei Technologies Co., Ltd. | Multi-channel audio encoder and method for encoding a multi-channel audio signal |
CN103534753B (zh) * | 2012-04-05 | 2015-05-27 | 华为技术有限公司 | 用于信道间差估计的方法和空间音频编码装置 |
US9659569B2 (en) * | 2013-04-26 | 2017-05-23 | Nokia Technologies Oy | Audio signal encoder |
CN106033671B (zh) * | 2015-03-09 | 2020-11-06 | 华为技术有限公司 | 确定声道间时间差参数的方法和装置 |
-
2015
- 2015-03-09 CN CN201510103379.3A patent/CN106033672B/zh active Active
- 2015-11-20 CA CA2977843A patent/CA2977843A1/en not_active Abandoned
- 2015-11-20 MX MX2017011466A patent/MX2017011466A/es unknown
- 2015-11-20 AU AU2015385489A patent/AU2015385489B2/en active Active
- 2015-11-20 BR BR112017018819-8A patent/BR112017018819A2/zh not_active Application Discontinuation
- 2015-11-20 EP EP15884409.2A patent/EP3255632B1/en active Active
- 2015-11-20 JP JP2017547578A patent/JP2018508047A/ja active Pending
- 2015-11-20 KR KR1020177025506A patent/KR20170116132A/ko active IP Right Grant
- 2015-11-20 WO PCT/CN2015/095090 patent/WO2016141731A1/zh active Application Filing
- 2015-11-20 RU RU2017134756A patent/RU2682026C1/ru active
- 2015-11-20 SG SG11201706997PA patent/SG11201706997PA/en unknown
-
2017
- 2017-09-06 US US15/696,716 patent/US10388288B2/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1288557A (zh) * | 1998-01-21 | 2001-03-21 | 诺基亚移动电话有限公司 | 解码方法和包括自适应后置滤波器的系统 |
CN1273663A (zh) * | 1998-05-26 | 2000-11-15 | 皇家菲利浦电子有限公司 | 具有改进的语音编码器的传输系统 |
CN1820306A (zh) * | 2003-05-01 | 2006-08-16 | 诺基亚有限公司 | 可变比特率宽带语音编码中增益量化的方法和装置 |
CN101073109A (zh) * | 2004-09-30 | 2007-11-14 | 艾利森电话股份有限公司 | 用于编解码器选择中自适应门限的方法及装置 |
US8077893B2 (en) * | 2007-05-31 | 2011-12-13 | Ecole Polytechnique Federale De Lausanne | Distributed audio coding for wireless hearing aids |
US8948891B2 (en) * | 2009-08-12 | 2015-02-03 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding/decoding multi-channel audio signal by using semantic information |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3252756A4 (en) * | 2015-03-09 | 2017-12-13 | Huawei Technologies Co., Ltd. | Method and device for determining inter-channel time difference parameter |
US10210873B2 (en) | 2015-03-09 | 2019-02-19 | Huawei Technologies Co., Ltd. | Method and apparatus for determining inter-channel time difference parameter |
TWI666630B (zh) * | 2017-06-29 | 2019-07-21 | 大陸商華為技術有限公司 | 時延估計方法及裝置 |
US11304019B2 (en) | 2017-06-29 | 2022-04-12 | Huawei Technologies Co., Ltd. | Delay estimation method and apparatus |
US11950079B2 (en) | 2017-06-29 | 2024-04-02 | Huawei Technologies Co., Ltd. | Delay estimation method and apparatus |
Also Published As
Publication number | Publication date |
---|---|
BR112017018819A2 (zh) | 2018-04-24 |
EP3255632A4 (en) | 2017-12-13 |
AU2015385489A1 (en) | 2017-09-28 |
MX2017011466A (es) | 2018-01-11 |
EP3255632B1 (en) | 2020-01-08 |
RU2682026C1 (ru) | 2019-03-14 |
KR20170116132A (ko) | 2017-10-18 |
US10388288B2 (en) | 2019-08-20 |
CA2977843A1 (en) | 2016-09-15 |
AU2015385489B2 (en) | 2019-04-04 |
CN106033672B (zh) | 2021-04-09 |
US20170365265A1 (en) | 2017-12-21 |
SG11201706997PA (en) | 2017-09-28 |
CN106033672A (zh) | 2016-10-19 |
JP2018508047A (ja) | 2018-03-22 |
EP3255632A1 (en) | 2017-12-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7443423B2 (ja) | マルチチャネル信号の符号化方法およびエンコーダ | |
US9479886B2 (en) | Scalable downmix design with feedback for object-based surround codec | |
CN107146627B (zh) | 对更高阶高保真度立体声响复制表示进行压缩和解压缩的方法和装置 | |
CN112735447B (zh) | 压缩和解压缩高阶高保真度立体声响复制信号表示的方法及装置 | |
WO2016141732A1 (zh) | 确定声道间时间差参数的方法和装置 | |
JP2024036349A (ja) | 遅延推定方法および遅延推定装置 | |
JP7301154B2 (ja) | 音声データの処理方法並びにその、装置、電子機器及びコンピュータプログラム | |
CN111316353A (zh) | 确定空间音频参数编码和相关联的解码 | |
KR101756838B1 (ko) | 다채널 오디오 신호를 다운 믹스하는 방법 및 장치 | |
WO2016141731A1 (zh) | 确定声道间时间差参数的方法和装置 | |
JP2023510556A (ja) | オーディオ符号化および復号方法ならびにオーディオ符号化および復号デバイス | |
CN113948098A (zh) | 一种立体声音频信号时延估计方法及装置 | |
WO2019106221A1 (en) | Processing of spatial audio parameters | |
WO2017202680A1 (en) | Method and apparatus for voice or sound activity detection for spatial audio | |
CN108877815B (zh) | 一种立体声信号处理方法及装置 | |
RU2648632C2 (ru) | Классификатор многоканального звукового сигнала | |
WO2017193550A1 (zh) | 多声道信号的编码方法和编码器 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15884409 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2977843 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 11201706997P Country of ref document: SG |
|
REEP | Request for entry into the european phase |
Ref document number: 2015884409 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2017/011466 Country of ref document: MX |
|
ENP | Entry into the national phase |
Ref document number: 2017547578 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 20177025506 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112017018819 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 2015385489 Country of ref document: AU Date of ref document: 20151120 Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2017134756 Country of ref document: RU |
|
ENP | Entry into the national phase |
Ref document number: 112017018819 Country of ref document: BR Kind code of ref document: A2 Effective date: 20170901 |