WO2016133366A1 - Procédé de traitement de signal multiplex et appareil de traitement de signal multiplex permettant de mettre en œuvre ce procédé - Google Patents
Procédé de traitement de signal multiplex et appareil de traitement de signal multiplex permettant de mettre en œuvre ce procédé Download PDFInfo
- Publication number
- WO2016133366A1 WO2016133366A1 PCT/KR2016/001613 KR2016001613W WO2016133366A1 WO 2016133366 A1 WO2016133366 A1 WO 2016133366A1 KR 2016001613 W KR2016001613 W KR 2016001613W WO 2016133366 A1 WO2016133366 A1 WO 2016133366A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- channel
- signal
- downmix
- output signal
- decorrelator
- Prior art date
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 17
- 238000012545 processing Methods 0.000 title claims description 49
- 238000000034 method Methods 0.000 claims abstract description 73
- 239000011159 matrix material Substances 0.000 claims description 83
- 230000008569 process Effects 0.000 claims description 41
- 239000013598 vector Substances 0.000 description 34
- 238000005070 sampling Methods 0.000 description 22
- 230000002123 temporal effect Effects 0.000 description 20
- 238000007493 shaping process Methods 0.000 description 18
- 238000010586 diagram Methods 0.000 description 15
- 230000015572 biosynthetic process Effects 0.000 description 12
- 238000003786 synthesis reaction Methods 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 4
- 230000005236 sound signal Effects 0.000 description 4
- 230000003595 spectral effect Effects 0.000 description 4
- 101100018996 Caenorhabditis elegans lfe-2 gene Proteins 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 230000021615 conjugation Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000007480 spreading Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
Definitions
- the present invention relates to a multi-channel signal processing method and a multi-channel signal processing apparatus for performing the method, and more particularly to a method and apparatus that can be compressed without deterioration of sound quality even if the number of channels of the multi-channel signal increases.
- MPS MPEG Surround
- MPS is a codec for coding multichannel signals such as 5.1 channel and 7.1 channel.
- MPS multi-channel signals can be compressed and transmitted at a high compression rate.
- the encoding / decoding process has a limitation of backward compatibility.
- the bitstream of the multi-channel signal generated through the MPS is required to be backward compatible to be reproduced in mono or stereo format through the existing codec.
- the decoder may then recover the multi-channel signal from the bitstream using the additional information received from the encoder. In this case, the decoder may restore the multi-channel signal with additional information for upmixing.
- the present invention provides a method and apparatus for processing a multichannel signal through an N-N / 2-N structure.
- Multi-channel signal processing method comprises the steps of identifying the downmix signal of the N / 2 channel derived from the input signal of the N channel; And generating an N-channel output signal from the identified N / 2 channel downmix signal using a plurality of OTT boxes, wherein the number of the plurality of OTT boxes includes no LFE channel in the output signal.
- the number of channels of the downmix signal may be equal to N / 2.
- Each of the plurality of OTT boxes may generate an output signal of two channels using an uncorrelated signal generated from a decorrelator corresponding to each of the plurality of OTT boxes and a downmix signal of one channel. .
- the decorrelator When the number N of channels of the output signal exceeds a preset channel number M, the decorrelator includes a first decorrelator corresponding to a channel of M or less and a second decorrelator corresponding to more than M channels; The second decorrelator may reuse a filter set of the first decorrelator.
- An OTT box whose output is an LFE channel among the plurality of OTT boxes may generate two channels of downmix signals without using an uncorrelated signal.
- Each of the plurality of OTT boxes may generate two channel output signals using the residual signal and one channel downmix signal instead of the uncorrelated signal when the transmitted residual signal exists.
- the generating of the N-channel output signal may include generating an N-channel output signal using a pre decorrelator matrix M1 and a mix matrix M2.
- Each of the plurality of OTT boxes may generate an output signal of N channels using a channel level difference (CLD).
- CLD channel level difference
- the number N of channels of the output signal may be an even number from 10 to 32.
- a method of processing a multichannel signal including: decoding a downmix signal of an N / 2 channel encoded according to a first coding scheme; And generating an output signal of the N channel from the downmix signal of the N / 2 channel according to a second coding scheme, wherein the second coding scheme, when the output signal does not include an LFE channel,
- One number of one-to-two (OTT) boxes equal to N / 2, which is the number of channels of the downmix signal, may be used.
- the multi-channel signal processing apparatus includes a process for executing a multi-channel signal processing method, wherein the process identifies a downmix signal of the N / 2 channel derived from the input signal of the N channel and And generating an N-channel output signal from the identified N / 2 channel downmix signal using a plurality of OTT boxes, wherein the number of the plurality of OTT boxes is equal to the downmix when the LFE channel is not present in the output signal. It may be equal to N / 2 which is the number of channels of the signal.
- Each of the plurality of OTT boxes may generate an output signal of two channels using an uncorrelated signal generated from a decorrelator corresponding to each of the plurality of OTT boxes and a downmix signal of one channel. .
- the decorrelator When the number N of channels of the output signal exceeds a preset channel number M, the decorrelator includes a first decorrelator corresponding to a channel of M or less and a second decorrelator corresponding to more than M channels; The second decorrelator may reuse a filter set of the first decorrelator.
- An OTT box whose output is an LFE channel among the plurality of OTT boxes may generate two channels of downmix signals without using an uncorrelated signal.
- Each of the plurality of OTT boxes may generate two channel output signals using the residual signal and one channel downmix signal instead of the uncorrelated signal when the transmitted residual signal exists.
- the process may generate an output signal of the N channel using a pre decorrelator matrix M1 and a mix matrix M2.
- Each of the plurality of OTT boxes may generate an output signal of N channels using a channel level difference (CLD).
- CLD channel level difference
- the number N of channels of the output signal may be an even number from 10 to 32.
- the multi-channel signal processing apparatus includes a process for executing a multi-channel signal processing method, wherein the process decodes the downmix signal of the N / 2 channel encoded according to the first coding scheme and And generating an output signal of the N channel from the downmix signal of the N / 2 channel according to a second coding scheme, wherein the second coding scheme, when the output signal does not include an LFE channel,
- One number of one-to-two (OTT) boxes equal to the number of channels N / 2 may be used.
- FIG. 1 is a diagram illustrating an encoding apparatus and a decoding apparatus, according to an embodiment.
- FIG. 2 is a diagram illustrating detailed components of an encoding apparatus according to an embodiment.
- FIG. 3 is a diagram illustrating detailed components of an encoding apparatus according to another embodiment.
- FIG. 4 is a diagram for describing an operation of a first encoding unit, according to an exemplary embodiment.
- FIG. 5 is a diagram illustrating detailed components of a decoding apparatus according to an embodiment.
- FIG. 6 is a diagram illustrating detailed components of a decoding apparatus according to another exemplary embodiment.
- FIG. 7 is a diagram for describing an operation of a second decoding unit, according to an exemplary embodiment.
- FIG. 8 is a diagram illustrating a spatial audio processing procedure for an N-N / 2-N structure according to an embodiment.
- FIG. 9 illustrates a tree structure for performing spatial audio processing for the N-N / 2-N structure according to an embodiment.
- FIG. 10 illustrates a process of generating an output signal of 24 channels from a 12-channel downmix according to an embodiment.
- FIG. 11 illustrates an OTT box of the process of FIG. 10, according to an exemplary embodiment.
- FIG. 12 illustrates a process of FIG. 11 according to an MPS standard according to an embodiment.
- an N / 2 channel downmix signal is generated from an N channel input signal through an MPS encoder, and an N / 2 output signal is generated using an N / 2 channel downmix signal through an MPS decoder.
- the N / 2 channel represents more channels than the number of channels defined in the existing MPS standard.
- the MPS decoder according to an embodiment of the present invention may satisfy the extended MPS standard for the MPEG-H 3D AUDIO standard.
- the encoding apparatus and the decoding apparatus correspond to the multichannel signal processing apparatus.
- FIG. 1 is a diagram illustrating an encoding apparatus and a decoding apparatus, according to an embodiment.
- the encoding apparatus 100 may generate an N / 2 channel downmix signal by downmixing an N channel input signal. Then, the decoding apparatus 101 may generate an output signal of the N channel by using the downmix signal of the N / 2 channel.
- N may be 10 or more.
- FIG. 2 is a diagram illustrating detailed components of an encoding apparatus according to an embodiment.
- the encoding apparatus may include a first encoding unit 201, a sampling rate converter 202, and a second encoding unit 203.
- the first encoding unit 201 is defined as an MPS encoder.
- the second encoding unit 203 is defined as a USAC (Unified Speech and Audio Codec) encoder. That is, an N / 2 channel downmix signal may be generated by downmixing an input signal of N channels.
- the sampling rate converter 202 may convert the sampling rate for the downmix signal of the N / 2 channel.
- the sampling rate converter 202 may downsample the bit rate based on the bitrate allocated to the USAC encoder, which is the second encoder 203. If a sufficiently high bitrate is allocated to the USAC encoder, which is the second encoding unit 203, the sampling rate converter 202 may be bypassed.
- the second encoding unit 203 may encode the core band of the downmix signal of the N / 2 channel whose sampling rate is converted. Then, the downmix signal of the N / 2 channel encoded through the second encoder 203 may be generated.
- the encoded downmix signal of the N / 2 channel may be a signal of the M channel (M is equal to or smaller than N / 2).
- M is equal to or smaller than N / 2.
- the core band means a low frequency band in which the frequency band is not extended.
- the number of channels of the downmix signal output through the MPS encoder corresponding to the first encoding unit 201 is limited to one channel, two channels, and 5.1 channels.
- the first encoding unit 201 may exceed the number of channels of the downmix signal defined in the MPS standard. That is, the first encoding unit 201 may generate an N / 2 channel downmix signal by downmixing an input signal of N channels.
- the N / 2 channel may be 1, 2, 5.1, or 5.1 or more.
- FIG. 3 is a diagram illustrating detailed components of an encoding apparatus according to another embodiment.
- FIG. 3 is the same as the component described in FIG. 2, but shows an embodiment in which the order is changed.
- FIG. 2 illustrates an embodiment in which a sampling rate converter 202 exists between the first encoder 201 and the second encoder 203.
- FIG. 3 illustrates an embodiment in which the first encoding unit 302 and the second encoding unit 303 are disposed after the sampling rate converter 301.
- FIG. 4 is a diagram for describing an operation of a first encoding unit, according to an exemplary embodiment.
- the first encoding unit 401 may include a plurality of TTO boxes 402.
- each of the plurality of TTO boxes 402 may downmix two input signals and output one downmix signal. That is, the first encoder 401 may include N / 2 TTO boxes 402 to downmix the input signals of the N channels input as shown in FIG. 4 to generate the downmix signals of the N / 2 channels. Can be.
- the downmix signal generated by the first encoder 401 may be one channel, two channels, or 5.1 channels.
- the first encoding unit 401 may generate an N / 2 channel downmix signal from the N channel input signal according to the MPS.
- the N / 2 channel may be a channel of 5.1 channels or more as well as 1 channel, 2 channels or 5.1 channels.
- the first encoding unit 401 needs to consider an additional syntax to control the MPS.
- the first encoding unit 401 may define an additional syntax for controlling the MPS by using a coding mode using an arbitrary tree.
- FIG. 5 is a diagram illustrating detailed components of a decoding apparatus according to an embodiment.
- the decoding apparatus may include a first decoding unit 501, a sampling rate converter 502, and a second decoding unit 503.
- the first decoding unit 501 may reconstruct the downmix signal of the N / 2 channel by decoding the encoded downmix signal of the N / 2 channel.
- the first decoding unit 501 may be defined as a USAC decoder.
- the sampling rate converter 502 may convert the sampling rate of the downmix signal of the N / 2 channel. In this case, the sampling rate converter 502 may convert the sampling rate of the audio signal converted by the encoding apparatus to the original sampling rate. In other words, when the sampling rate conversion is performed in FIG. 2 or FIG. 3, the sampling rate conversion unit 502 operates. If the sampling rate conversion is not performed in FIG. 2 or FIG. 3, the sampling rate conversion unit 502 may be bypassed without operation.
- the second decoding unit 503 may generate an N-channel output signal by upmixing the N / 2 channel downmix signal output from the sampling rate converter 502.
- the downmix signal input to the conventional MPS decoder is limited to one channel, two channels, and 5.1 channels.
- the downmix signal input to the second decoding unit 503 may be extended to N / 2 channels as well as 1 channel, 2 channels, and 5.1 channels.
- the second decoding unit 503 may generate the N-channel output signal by upmixing the N / 2 channel downmix signal.
- N since the N / 2 channel downmix signal input to the second decoding unit 503 means at least 5.1 channel or more, N may be 10.2 or more channels.
- FIG. 6 is a diagram illustrating detailed components of a decoding apparatus according to another exemplary embodiment.
- FIG. 6 may process an audio signal in the order of the first decoding unit 601, the second decoding unit 602, and the sampling rate converter 603.
- the first decoding unit 601 may restore the downmix signal of the N / 2 channel.
- the second decoding unit 602 may generate the output signal of the N channel by upmixing the downmix signal of the N / 2 channel.
- the sampling rate converter 603 may convert the sampling rate of the output signal of the N channel generated through the second decoder 602.
- FIG. 7 is a diagram for describing an operation of a second decoding unit, according to an exemplary embodiment.
- the second decoding unit 701 described with reference to FIGS. 5 and 6 may generate an output signal of the N channel by upmixing the downmix signal of the N / 2 channel.
- the second decoding unit 701 may include a plurality of OTT boxes 702.
- the OTT box 702 may generate two channels of output signals in stereo form by upmixing one channel of downmix signals.
- the second decoding unit 701 generates N / 2 OTT boxes 702 in order for the second decoding unit 701 to upmix the N / 2 channel downmix signal to generate the N channel output signal. It may include.
- the number of channels of the downmix signal input to the second decoding unit 701 and processed may be one channel, two channels, or 5.1 channels.
- the second decoding unit 701 may generate an output signal of the N channel according to the MPS from the downmix signal of the N / 2 channel.
- N may be 10.2 or more.
- the second decoding unit 701 needs to consider additional syntax to control the MPS.
- the second decoding unit 701 may define an additional syntax for controlling the MPS by using a coding mode using an arbitrary tree.
- the MPS decoder illustrated in FIGS. 8 to 12 is related to the second decoding unit 503 of FIG. 5 and the second decoding unit 602 of FIG. 6.
- FIG. 8 illustrates a process of processing a multichannel signal according to an N-N / 2-N configuration.
- FIG. 8 shows an N-N / 2-N structure in which the structure defined in MPEG SURROUND is changed.
- MPEG SURROUND spatial synthesis may be performed in a decoder as shown in Table 1. Spatial synthesis can transform the input signals from the time domain into a non-uniform subband domain through a hybrid Quadrature Mirror Filter (QMF) analysis bank.
- QMF Quadrature Mirror Filter
- the decoder then operates in the hybrid subband.
- the decoder may generate an output signal from the input signals by performing spatial synthesis based on the spatial parameters passed by the encoder.
- the decoder can then use the hybrid QMF synthesis bank to inverse the output signals from the hybrid subband to the time domain.
- MPEG SURROUND defines a 5-1-5 structure, a 5-2-5 structure, a 7-2-7 structure, and a 7-5-7 structure, but the present invention proposes an N-N / 2-N structure.
- the decoder may generate the N-channel output signal by upmixing the N / 2 channel downmix signal.
- the number of N channels in the N-N / 2-N structure of the present invention is not limited. That is, the N-N / 2-N structure may support not only a channel structure supported by the MPS but also a channel structure of a multichannel signal not supported by the MPS.
- N / 2 means the number of channels of the downmix signal derived through the MPS.
- NumInCh means the number of channels of the downmix signal
- NumOutCh means the number of channels of the output signal.
- NumInCh which is the number of channels of the downmix signal, is N / 2.
- NumInCh is N / 2 and NumOutCh is N.
- the downmix signals X 0 to X NumInch ⁇ 1 and the residual signals res of the N / 2 channels form an input vector X.
- NumInCh is N / 2
- X 0 to X NumInCh ⁇ 1 represent downmix signals of N / 2 channels.
- N the number of one-to-two (OTT) boxes is N / 2
- N the number of channels of the output signal, must be even to process the downmix signal of the N / 2 channel.
- N may be from 10 to 32.
- the decorrelators, uncorrelated signals, and residual signals labeled from 1 to M correspond to different OTT boxes.
- the reconstruction process for the multi-channel signal to which the N-N / 2-N structure is applied can be visualized in a tree structure.
- the input vector X to be multiplied by means a vector including the downmix signal of the N / 2 channel.
- the number of decorrelators generating the uncorrelated signal may be N / 2 at the maximum. However, if N, the channel number of the output signal, exceeds 20, the filters of the decorrelator can be reused.
- N which is the number of channels of the output signal in the N-N / 2-N structure, needs to be less than twice the limited specific number (ex. N ⁇ 20). If the LFE channel is included in the output signal, the N channel needs to be configured with a smaller number of channels (eg, N ⁇ 24) than more than twice the specific number in consideration of the number of LFE channels.
- the output result of the decorrelators may be replaced with the residual signal for a specific frequency region depending on the bitstream. If the LFE channel is one of the outputs of the OTT box, no decorrelator is used for the OTT box based on the upmix.
- the decorrelators labeled M (ex. NumInCh-NumLfe) from 1, the output result of the decorrelator (uncorrelated signal), and residual signals correspond to different OTT boxes.
- d 1 ⁇ d M means uncorrelated signal which is the output result of the decorrelator (D 1 ⁇ D M )
- res 1 ⁇ res M means the residual signal which is the output result of the decorrelator (D 1 ⁇ D M ) do.
- the decorrelators D1 to DM correspond to different OTT boxes, respectively.
- vectors and matrices used in the NN / 2-N structure are defined.
- Input signals to decorators in N-2 / NN structures are vectors Is defined as
- Equation 1 Of elements in To May be input directly to the matrix M2 without being input to the N / 2 decorrelators corresponding to the N / 2 OTT boxes. so, To May be defined as a direct signal. And vector Of elements in To Signals other than To ) May be input to the N / 2 decorrelators corresponding to the N / 2 OTT boxes.
- vector Is composed of a direct signal, d 1 to d M which are decorrelated signals output from decorrelators, and res 1 to res M which are residual signals output from decorrelators. vector May be determined by Equation 2 below.
- Is Means a set of all k satisfying And, Signal Fall decorator When input to, it means the uncorrelated signal output from the decorator. Especially, Is the OTT box is OTTx and the residual signal is In the case of means the signal output from the decorator.
- the subbands of the output signal can be defined dependently for all time slots n and all hybrid subbands k.
- Output signal Can be determined by Equation 3 through the vector w and the matrix M2 .
- Equation 4 Denotes a matrix M2 composed of NumOutCh rows and NumInCh-NumLfe columns. Is Can be defined by Equation 4 below.
- the hybrid synthesis filter bank is a combination of the QMF synthesis bank through the Nyquist synthesis banks, Can be transformed from the hybrid subband domain to the time domain through a hybrid synthesis filterbank.
- vectors Is the same as described above, but the vector May be divided into two vectors as shown in Equations 6 and 7 below.
- Is Means a set of all k satisfying Also, decorator Input signal to Is entered, Decorator Means the uncorrelated signal output from.
- a spreading signal is generated through the decorrelator for spatial synthesis.
- the generated spread signal may be mixed with the direct signal.
- the temporal envelope of the spread signal does not match the envelope of the direct signal.
- subband domain time processing is used to shape the envelope of each spreading signal portion of the output signal to match the temporal shape of the downmix signal transmitted from the encoder.
- processing may be implemented with envelope estimation, such as envelope ratio calculation for direct and spread signals or shaping of the upper spectral portion of the spread signal.
- the temporal energy envelope of the portion corresponding to the direct signal and the portion corresponding to the spread signal in the output signal generated through upmixing can be estimated.
- the shaping factor may be calculated as the ratio between the temporal energy envelope for the portion corresponding to the direct signal and the portion corresponding to the spread signal.
- STP May be signaled as. if, If, the spread signal portion of the output signal generated through upmixing can be processed via STP.
- the downmix of the spatial upmix is approximated with the transmitted original downmix signal ( approximation).
- the direct downmix signal for (NumInCh-NumLfe) may be defined by Equation 8 below.
- the envelopes of the downmix broadband envelopes and the spread signal portion of each upmix channel can be estimated using Equation 9 below using normalized direct energy.
- Means a bandpass factor Denotes a spectral flattering factor.
- the scale factor for the NN / 2-N structure Can be defined.
- the scale factor is then applied to the spread signal portion of the output signal, thereby mapping the temporal envelope of the output signal to substantially the temporal envelope of the downmix signal.
- the spread signal portion processed by the scale factor in each channel of the output signals of the N channels may be mixed with the direct signal portion.
- it may be signaled whether the extension signal portion has been processed in the scale factor for each channel of the output signal. ( ) Indicates that the extension signal portion was processed with the scale factor.)
- GES can recover the broadband envelope of the synthesized output signal.
- GES includes a modified upmixing process after flattening and reshaping the envelope for the direct signal portion for each channel of the output signal.
- additional information of a parametric broadband envelope included in the bitstream may be used.
- the additional information includes the envelope ratio of the envelope of the original input signal and the envelope of the downmix signal.
- the envelope ratio at the decoder may be applied to the direct signal portion of each time slot included in the frame for each channel of the output signal.
- the GES does not alter the spread signal portion for each channel of the output signal.
- the extension signal and the direct signal of the output signal may be respectively synthesized using the post mixing matrix M2 modified in the hybrid subband domain according to Equation 11 below.
- Equation 11 the direct signal portion for the output signal y provides the direct signal and the residual signal, and the extension signal portion for the output signal y provides the extension signal. In total, only the direct signal can be processed by the GES.
- the result of processing the GES may be determined according to Equation 12 below.
- the GES can extract an envelope for a particular channel of the upmixed output signal from the downmix signal by the downmix signal and decoder that performs spatial synthesis except the LFE channel depending on the tree structure.
- Output signal in NN / 2-N structure May be defined as shown in Table 3 below.
- the input signal in the NN / 2-N structure May be defined as shown in Table 4 below.
- downmix signals in NN / 2-N structures May be defined as shown in Table 5 below.
- the matrix M1 (defined for all time slots n and all hybrid subbands k) ) And the matrix M2 ( ) Will be described. These matrices are defined for a given parameter time slot and given processing band m based on the parameter time slot and the CLD, ICC and CPC parameters valid for the processing band. And Interpolated version of.
- Matrix M1 may be expressed as a free matrix.
- the size of the matrix M1 depends on the number of channels of the downmix signal input to the matrix M1 and the number of decorrelators used in the decoder.
- the elements of the matrix M1 may be derived from the CLD and / or CPC parameters.
- M1 may be defined by Equation 13 below.
- Matrix for Matrix M1 May be defined as follows.
- OTT box matrix May be defined differently according to the channel structure.
- all channels of an input signal may be input in pairs by 2 channels to the OTT box. So, for the NN / 2-N structure, the number of OTT boxes is N / 2.
- the matrix I is a vector containing the input signal It depends on the number of OTT boxes equal to its column size.
- Lfe upmixes based on OTT boxes are not considered in the NN / 2-N architecture since no decorrelator is needed.
- matrix All elements of may be either 1 or 0.
- Equation 15 In the NN / 2-N structure May be defined by Equation 15 below.
- OTT boxes in the NN / 2-N architecture represent a parallel processing satge, not a cascade. Therefore, all OTT boxes in the NN / 2-N structure are not connected to any other OTT boxes. So, matrix is unit matrix And unit matrix It can be configured as. In this case, the unit matrix May be a unit matrix of size N * N.
- Calibration factor matrix It can be applied to the downmix signal or an externally supplied downmix signal.
- Matrix in NN / 2-N structure May be defined by Equation 16 below.
- Means a unit matrix indicating NumInch * NumInCh size Denotes a zero matrix representing NumInch * NumInCh size.
- the number of channels of the downmix signal may be more than five.
- the inverse matrix H is a vector of input signals for all parameter sets and processing bands. It may be a unit matrix having the same size as the number of columns of.
- matrix M2 Defines how to combine the direct and uncorrelated signals to regenerate the multi-channel output signal. May be defined by Equation 19 below.
- the element of can be calculated from the equivalent model of the OTT box.
- the OTT box includes a decorrelator and a mixing section.
- the mono input signal input to the OTT box is transmitted to the decorrelator and the mixing unit, respectively.
- the mixing unit may generate a stereo output signal using a mono input signal, an uncorrelated signal output through the decorrelator, and the CLD and ICC parameters.
- the CLD controls localization in the stereo field
- the ICC controls the stereo wideness of the output signal.
- Equation 21 the result output from any OTT box can be defined by Equation 21 below.
- OTT box Labeling as ( ), Time slot for OTT box And parameter bands Denotes an element of an arbitrary matrix.
- the post gain matrix may be defined as in Equation 22 below.
- CLD and ICC may be defined by Equation 24 below.
- decorrelators may be performed by a reverberation filter in the QMF subband domain.
- Reverberation filters exhibit different filter characteristics based on which hybrid subband currently corresponds to all hybrid subbands.
- the reverberation filter is an IIR grating filter.
- the IIR grating filters have different filter coefficients for different decorrelators to produce mutually uncorrelated orthogonal signals.
- the uncorrelated process carried out by the decorator is carried out in several processes.
- the output of matrix M1 Is entered into the set of all-pass uncorrelated filters.
- the filtered signals can then be energy shaped.
- energy shaping is shaping the spectral or temporal envelope to match uncorrelated signals more closely to the input signal.
- the uncorrelated filter consists of a plurality of all-pass (IIR) regions preceded by a fixed frequency-dependent delay.
- the frequency axis may be divided into different regions so as to correspond to the QMF division frequency.
- the length of the delay and the length of the filter coefficient vectors are the same.
- the filter coefficients of the decorrelator with fractional delay due to additional phase rotation depend on the hybrid subband index.
- the filters of the decorrelators have different filter coefficients to ensure orthogonality between the uncorrelated signals output from the decorrelators.
- N / 2 decorrelators are required.
- the number of decorrelators may be limited to ten.
- the decorators are more than 10 OTT boxes according to 10 basis modulo operations. It can be reused corresponding to the number of.
- the N / 2 decorrelators are indexed by 10 units. That is, the 0th decorator and the 10th decorator Have the same index.
- the decorrelator may include a first decorrelator corresponding to a channel less than or equal to M and a second decorrelator corresponding to more than M channels. Can be.
- the second decorrelator may reuse the filter set of the first decorrelator.
- N-N / 2-N structure For the N-N / 2-N structure, it may be implemented by the syntax of Table 7.
- bsTreeConfig may be implemented by Table 8.
- bsTreeConfig may be implemented by Table 8. According to Table 8, when bsTreeConfig is 7, the configuration of the decoding apparatus of the N-N / 2-N structure according to an embodiment of the present invention.
- the number of OTT boxes numOttBoxes is equal to the number of channels NumInCh of the downmix signal. And the number of TTT boxes is zero.
- bsNumInCh which is the number of channels of the downmix signal in the N-N / 2-N structure, may be implemented as shown in Table 10 below.
- NumInCh refers to the number of channels of the downmix signal input to the decoding apparatus of the NN / 2-N structure
- NumOutCh refers to the number of channels of the output signal to which the downmix signal is upmixed.
- N LFE which is the number of LFE channels among the output signals may be implemented as shown in Table 11 below.
- NumLfe means the number of LFE channels (N LFE ) in the NN / 2-N structure.
- the channel order of the output signal may be implemented as shown in Table 12 according to the number of channels of the output signal and the number of LFE channels.
- And audioChannelLayout represents the layout of the loudspeaker Loudspeaker at the time of actual reproduction.
- the channel order of the LFE channel is determined by (i) a condition processed with a channel other than the LFE channel using an OTT box, and (ii) a condition located last in the channel list. Can be determined to satisfy.
- LFE channels are L, Lv, R, Rv, Ls, Lss It is located last in Rs, Rss, C, LFE, Cvr, and LFE2.
- FIG. 9 illustrates a tree structure for performing spatial audio processing for the N-N / 2-N structure according to an embodiment.
- the N-N / 2-N structure shown in FIG. 8 may be represented in a tree form as shown in FIG. 9.
- all OTT boxes can regenerate two channels of output signals based on CLD, ICC, residual signal and input signal.
- OTT boxes and their corresponding CLD, ICC, residual and input signals may be numbered in the order in which they appear in the bitstream.
- the decoder which is a multichannel signal processing apparatus, may generate N-channel output signals from N / 2-channel downmix signals using N / 2 OTT boxes.
- N / 2 OTT boxes are not implemented through a plurality of layers. That is, the OTT boxes may perform upmixing in parallel for each channel of the downmix signal of the N / 2 channel. In other words, one OTT box is not connected to another OTT box.
- the left tree structure of FIG. 9 shows an N-N / 2-N tree structure when no LFE channel is applied, and the right tree structure shows an N-N / 2-N tree structure when an LFE channel is applied.
- All OTT boxes shown in FIG. 9 may remix two channels of output signals by upmixing one channel of downmix signals (M).
- the N / 2 OTT boxes may generate the output signal of the N channel using the residual signal res and the downmix signal M.
- the OTT box in which the LFE channel is output among the N / 2 OTT boxes may use only the downmix signal except the residual signal.
- the OTT box in which the LFE channel is not output among the N / 2 OTT boxes upmixes the downmix signal using CLD and ICC, but the LFE channel is The output OTT box can upmix the downmix signal using only the CLD.
- the OTT box in which the LFE channel is not output among the N / 2 OTT boxes generates an uncorrelated signal through the decorrelator, but the OTT in which the LFE channel is output.
- the box does not perform uncorrelated processes and therefore does not generate uncorrelated signals.
- FIG. 10 illustrates a process of generating an output signal of 24 channels from a 12-channel downmix according to an embodiment.
- an N / 2 channel downmix signal may be generated from an N channel input signal through MPS encoding.
- the N-channel output signal may be generated from the downmix signal of the N / 2 channel through MPS decoding.
- the channels of the downmix signal output through the encoder are 1 channel, 2 channels, and 5.1 channels.
- the present invention is not limited thereto.
- additional syntax definition is required to support the number of channels of downmix signals that are not defined in the existing MPS standard.
- BsTreeConfig defines the decoding process of input and output signals.
- BsTreeConfig 0 a process of generating a downmix signal of one channel from an input signal of six channels (5.1 channels) and an output signal of six channels (5.1 channels) from a downmix signal of one channel is defined.
- the decoder needs five OTT boxes, and channel level difference (CLD) may be applied to each OTT box.
- CLD channel level difference
- the CLD input to the OTT box may be defined up to defaultCLD [0 ⁇ 5] according to the position of the OTT box, and the CLD corresponding to the OTT box is enabled. That is, if CLD is enabled, CLD may be input to the OTT box.
- ottModeLfe also means that the LFE channel is output from the OTT box.
- the present invention can process an input signal having a channel different from the channel defined in the existing MPS standard by using the reserved bit in the MPS standard. For example, when N, the channel number of the input signal, is 24, and the channel number of the downmix signal is 12, it may be defined as shown in Table 13.
- FIG. 10 shows a decoder implemented according to Table 13. Referring to FIG. 10, a process of generating an output signal of 24 channels including two LFE channels from a 12-channel downmix signal x 0- x 11 is illustrated.
- 12 channels of downmix signals (x0-x11) and 12 signals of residual signals (res 1 -res 11 ) are input, but will be described below except for the residual signal. do.
- the decoder of FIG. 10 may input a downmix signal of 12 channels to the decorrelator 1007 to generate an uncorrelated signal.
- the vector v 1003 of FIG. 10 may be derived by applying the matrix M1 1002 to the vector x 1001.
- the vector v 1003 may be determined according to Equation 25 below.
- Equation 25 corresponds to (1).
- x Mo to x M11 may be mapped to v M0 to v M11 .
- the uncorrelated signal may be derived equal to the number of downmix signals.
- the vector w 1004 may be determined according to Equation 26 below.
- Equation 26 corresponds to Equation 2.
- the decorrelator 1007 operates when there is no residual signal. That is, if there is no residual signal, an uncorrelated signal may be generated.
- D () is used when the decorrelator generates an uncorrelated signal.
- the vector y 1006 may be derived by applying the matrix M2 1005 to the vector w 1004 according to Equation 27.
- Equation 28 R1 for deriving the matrix M1 1002
- Equation 29 R2 for deriving the mates M2 1005
- H LL , H LR , H RL , and H RR in Equation 29 may be derived from CLD and ICC corresponding to each OTT box.
- the present invention proposes an OTT-based MPS (MPEG Surround) decoder having a parallel structure that generates N-channel output signals from N / 2 channel downmix signals according to newly defined bsTreeConfig information.
- MPS MPEG Surround
- FIG. 11 illustrates an OTT box of the process of FIG. 10, according to an exemplary embodiment.
- each OTT box generates two channels of signals using a downmix signal of one channel and an uncorrelated signal generated through the decorrelator (D).
- D decorrelator
- defaultCld [0] to defaultCld [9] corresponding to the CLD, and OttModelfe [0] and OttModelfe [1] corresponding to the LFE channel may be input.
- the LFE channel may be included in the output signal.
- OttModelfe [0] and OttModelfe [1] are then enabled.
- FIG. 12 illustrates a process of FIG. 11 according to an MPS standard according to an embodiment.
- FIG. 12 a case in which 12 channels of downmix signals M 0 to M 11 are input to each OTT box is illustrated. Then, the output signal y of 24 channels is generated.
- CLD and ICC are also input to each OTT box.
- the residual signal is illustrated in FIG. 12 as being input to the OTT box, if there is no residual signal, an uncorrelated signal generated through the decorrelator from the downmix signal may be input to the OTT box instead of the residual signal.
- Multi-channel audio signal processing method comprises the steps of identifying the downmix signal and the residual signal of the N / 2 channel generated from the input signal of the N channel; Applying a downmix signal and a residual signal of the N / 2 channel to a first matrix; Outputs a first signal input to the N / 2 decorrelators corresponding to N / 2 OTT boxes and a second signal transmitted to the second matrix without being input to the N / 2 decorrelators through the first matrix Making; Outputting uncorrelated signals from a first signal through the N / 2 decorrelators; Applying the uncorrelated signal and the second signal to a second matrix; And generating an output signal of the N channel through the second matrix.
- N / 2 decorrelators may correspond to the N / 2 OTT boxes.
- the index of the decorrelator may be repeatedly reused according to the reference value.
- the decorrelator may use N / 2, except for the number of LFE channels, and the LFE channel may not use the decorrelator of the OTT box. .
- the second matrix may be input with a vector including the second signal, the uncorrelated signal derived from the decorrelator, and the residual signal derived from the decorrelator. have.
- the second matrix is a spread comprising a vector corresponding to a direct signal consisting of the second signal and a residual signal derived from the decorrelator and an uncorrelated signal derived from the decorrelator.
- a vector corresponding to the signal may be input.
- the generating of the N-channel output signal includes, when subband domain time processing (STP) is used, applying a scale factor based on a spread signal and a direct signal to a spread signal portion of the output signal to temporal envelope of the output signal.
- STP subband domain time processing
- the generating of the N-channel output signal may flatten and reshape the envelope of the direct signal portion for each channel of the N-channel output signal when guided envelope shaping (GES) is used.
- GES guided envelope shaping
- the size of the first matrix may be determined according to the number of channels of the downmix signal applying the first matrix and the number of decorrelators, and the elements of the first matrix may be determined by the CLD parameter or the CPC parameter.
- a method of processing a multichannel audio signal including: identifying a downmix signal of an N / 2 channel and a residual signal of the N / 2 channel; Inputting an N / 2 channel downmix signal and an N / 2 channel residual signal to the N / 2 OTT boxes to generate an N channel output signal, wherein the N / 2 OTT boxes are not connected to each other;
- the OTT box which is arranged in parallel without any other and outputs the LFE channel among the N / 2 OTT boxes receives (1) only the downmix signal except the residual signal, and (2) the CLD parameter among the CLD parameter and the ICC parameter. (3) Do not output uncorrelated signal through decorator.
- An apparatus for processing a multichannel signal includes a processor for performing a multichannel signal processing method, and the multichannel signal processing method includes downmixing an N / 2 channel generated from an input signal of N channels. Identifying a signal and a residual signal; Applying a downmix signal and a residual signal of the N / 2 channel to a first matrix; Outputs a first signal input to the N / 2 decorrelators corresponding to N / 2 OTT boxes and a second signal transmitted to the second matrix without being input to the N / 2 decorrelators through the first matrix Making; Outputting uncorrelated signals from a first signal through the N / 2 decorrelators; Applying the uncorrelated signal and the second signal to a second matrix; And generating an output signal of the N channel through the second matrix.
- N / 2 decorrelators may correspond to the N / 2 OTT boxes.
- the index of the decorrelator may be repeatedly reused according to the reference value.
- the decorrelator may use N / 2, except for the number of LFE channels, and the LFE channel may not use the decorrelator of the OTT box. .
- the second matrix may be input with a vector including the second signal, the uncorrelated signal derived from the decorrelator, and the residual signal derived from the decorrelator. have.
- the second matrix is a spread comprising a vector corresponding to a direct signal consisting of the second signal and a residual signal derived from the decorrelator and an uncorrelated signal derived from the decorrelator.
- a vector corresponding to the signal may be input.
- the generating of the N-channel output signal includes, when subband domain time processing (STP) is used, applying a scale factor based on a spread signal and a direct signal to a spread signal portion of the output signal to temporal envelope of the output signal.
- STP subband domain time processing
- the generating of the N-channel output signal may flatten and reshape the envelope of the direct signal portion for each channel of the N-channel output signal when guided envelope shaping (GES) is used.
- GES guided envelope shaping
- the size of the first matrix may be determined according to the number of channels of the downmix signal applying the first matrix and the number of decorrelators, and the elements of the first matrix may be determined by the CLD parameter or the CPC parameter.
- an apparatus for processing a multichannel signal includes a processor for performing a method for processing a multichannel signal, and the method for processing a multichannel signal includes: an N / 2 channel downmix signal and an N / 2 channel; Identifying a residual signal; Inputting an N / 2 channel downmix signal and an N / 2 channel residual signal to the N / 2 OTT boxes to generate an N channel output signal,
- the N / 2 OTT boxes are arranged in parallel without being connected to each other, and an OTT box that outputs an LFE channel among the N / 2 OTT boxes receives (1) only a downmix signal except a residual signal, (2) It uses CLD parameter among CLD parameter and ICC parameter. (3) Does not output uncorrelated signal through decorator.
- the apparatus described above may be implemented as a hardware component, a software component, and / or a combination of hardware components and software components.
- the devices and components described in the embodiments may be, for example, processors, controllers, arithmetic logic units (ALUs), digital signal processors, microcomputers, field programmable arrays (FPAs), It may be implemented using one or more general purpose or special purpose computers, such as a programmable logic unit (PLU), microprocessor, or any other device capable of executing and responding to instructions.
- the processing device may execute an operating system (OS) and one or more software applications running on the operating system.
- the processing device may also access, store, manipulate, process, and generate data in response to the execution of the software.
- OS operating system
- the processing device may also access, store, manipulate, process, and generate data in response to the execution of the software.
- processing device includes a plurality of processing elements and / or a plurality of types of processing elements. It can be seen that it may include.
- the processing device may include a plurality of processors or one processor and one controller.
- other processing configurations are possible, such as parallel processors.
- the software may include a computer program, code, instructions, or a combination of one or more of the above, and configure the processing device to operate as desired, or process it independently or collectively. You can command the device.
- Software and / or data may be any type of machine, component, physical device, virtual equipment, computer storage medium or device in order to be interpreted by or to provide instructions or data to the processing device. Or may be permanently or temporarily embodied in a signal wave to be transmitted.
- the software may be distributed over networked computer systems so that they may be stored or executed in a distributed manner.
- Software and data may be stored on one or more computer readable recording media.
- the method according to the embodiment may be embodied in the form of program instructions that can be executed by various computer means and recorded in a computer readable medium.
- the computer readable medium may include program instructions, data files, data structures, etc. alone or in combination.
- the program instructions recorded on the media may be those specially designed and constructed for the purposes of the embodiments, or they may be of the kind well-known and available to those having skill in the computer software arts.
- Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tape, optical media such as CD-ROMs, DVDs, and magnetic disks, such as floppy disks.
- Examples of program instructions include not only machine code generated by a compiler, but also high-level language code that can be executed by a computer using an interpreter or the like.
- the hardware device described above may be configured to operate as one or more software boxes to perform the operations of the embodiments, and vice versa.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
L'invention concerne un procédé de codage de signal multiplex, un appareil de codage permettant de mettre en œuvre le procédé de codage, un procédé de traitement de signal multiplex et un appareil de décodage permettant de mettre en œuvre un procédé de décodage. Le procédé de décodage comprend les étapes consistant : à identifier un signal somme d'un canal N/2 qui a été obtenu à partir d'un signal d'entrée d'un canal N; et à générer un signal de sortie du canal N à partir du signal somme du canal N/2 identifié, au moyen d'une pluralité de dispositifs OTT. Le nombre de dispositifs OTT peut être égal à N/2, qui est le nombre de canaux du signal somme, s'il n'y a pas de canal LFE dans le signal de sortie.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/551,734 US10225675B2 (en) | 2015-02-17 | 2016-02-17 | Multichannel signal processing method, and multichannel signal processing apparatus for performing the method |
US16/290,469 US10638243B2 (en) | 2015-02-17 | 2019-03-01 | Multichannel signal processing method, and multichannel signal processing apparatus for performing the method |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR20150024464 | 2015-02-17 | ||
KR10-2015-0024464 | 2015-02-17 | ||
KR10-2016-0018462 | 2016-02-17 | ||
KR1020160018462A KR20160101692A (ko) | 2015-02-17 | 2016-02-17 | 다채널 신호 처리 방법 및 상기 방법을 수행하는 다채널 신호 처리 장치 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/551,734 A-371-Of-International US10225675B2 (en) | 2015-02-17 | 2016-02-17 | Multichannel signal processing method, and multichannel signal processing apparatus for performing the method |
US16/290,469 Continuation US10638243B2 (en) | 2015-02-17 | 2019-03-01 | Multichannel signal processing method, and multichannel signal processing apparatus for performing the method |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016133366A1 true WO2016133366A1 (fr) | 2016-08-25 |
Family
ID=56689064
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2016/001613 WO2016133366A1 (fr) | 2015-02-17 | 2016-02-17 | Procédé de traitement de signal multiplex et appareil de traitement de signal multiplex permettant de mettre en œuvre ce procédé |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2016133366A1 (fr) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20070094422A (ko) * | 2006-01-11 | 2007-09-20 | 삼성전자주식회사 | 멀티 채널 복호화 및 부호화 방법 및 장치 |
US20070233293A1 (en) * | 2006-03-29 | 2007-10-04 | Lars Villemoes | Reduced Number of Channels Decoding |
US20120321090A1 (en) * | 2005-11-21 | 2012-12-20 | Samsung Electronics Co., Ltd. | System, medium, and method of encoding/decoding multi-channel audio signals |
KR20150009474A (ko) * | 2013-07-15 | 2015-01-26 | 한국전자통신연구원 | 다채널 신호를 위한 인코더 및 인코딩 방법, 다채널 신호를 위한 디코더 및 디코딩 방법 |
-
2016
- 2016-02-17 WO PCT/KR2016/001613 patent/WO2016133366A1/fr active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120321090A1 (en) * | 2005-11-21 | 2012-12-20 | Samsung Electronics Co., Ltd. | System, medium, and method of encoding/decoding multi-channel audio signals |
KR20070094422A (ko) * | 2006-01-11 | 2007-09-20 | 삼성전자주식회사 | 멀티 채널 복호화 및 부호화 방법 및 장치 |
US20070233293A1 (en) * | 2006-03-29 | 2007-10-04 | Lars Villemoes | Reduced Number of Channels Decoding |
KR20150009474A (ko) * | 2013-07-15 | 2015-01-26 | 한국전자통신연구원 | 다채널 신호를 위한 인코더 및 인코딩 방법, 다채널 신호를 위한 디코더 및 디코딩 방법 |
Non-Patent Citations (1)
Title |
---|
DOLBY METADATA GUIDE, 2005, Retrieved from the Internet <URL:http://www.dolby.com/us/en/technologies/dolby-metadata.html> * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2010107269A2 (fr) | Appareil et méthode de codage/décodage d'un signal multicanaux | |
WO2016024847A1 (fr) | Procédé et dispositif de génération et de lecture de signal audio | |
WO2015142073A1 (fr) | Méthode et appareil de traitement de signal audio | |
WO2015099429A1 (fr) | Procédé de traitement de signaux audio, dispositif de paramétrage pour celui-ci et dispositif de traitement de signaux audio | |
WO2015152665A1 (fr) | Procédé et dispositif de traitement de signal audio | |
WO2010087614A2 (fr) | Procédé de codage et de décodage d'un signal audio et son appareil | |
WO2010050740A2 (fr) | Appareil et procédé de codage/décodage d’un signal multicanal | |
WO2015147530A1 (fr) | Procédé et appareil de rendu de signal acoustique, et support d'enregistrement lisible par ordinateur | |
WO2016204581A1 (fr) | Procédé et dispositif de traitement de canaux internes pour une conversion de format de faible complexité | |
WO2017222140A1 (fr) | Procédés et dispositifs de codage et de décodage comprenant un filtre en boucle à base de cnn | |
WO2014175669A1 (fr) | Procédé de traitement de signaux audio pour permettre une localisation d'image sonore | |
WO2010062123A2 (fr) | Codec vocal/audio unifié (usac) pour le traitement d’une séquence de fenêtres sur la base d’une commutation de mode | |
WO2015060652A1 (fr) | Procédé et appareil conçus pour le traitement d'un signal audio | |
WO2010008229A1 (fr) | Appareil de codage et de décodage audio multi-objet prenant en charge un signal post-sous-mixage | |
WO2019107868A1 (fr) | Appareil et procédé de sortie de signal audio, et appareil d'affichage l'utilisant | |
WO2020242260A1 (fr) | Procédé et dispositif de compression d'image basée sur l'apprentissage machine utilisant un contexte global | |
WO2020013514A1 (fr) | Procédé et appareil de traitement de signal vidéo | |
WO2020032632A1 (fr) | Procédé de codage/décodage d'images et dispositif associé | |
WO2016204579A1 (fr) | Procédé et dispositif de traitement de canaux internes pour une conversion de format de faible complexité | |
WO2022158912A1 (fr) | Dispositif d'annulation de signaux d'écho et de bruit intégré basé sur des canaux multiples utilisant un réseau neuronal profond | |
WO2021118107A1 (fr) | Appareil de sortie audio et procédé de commande de celui-ci | |
WO2019035622A1 (fr) | Procédé et appareil de traitement de signal audio à l'aide d'un signal ambiophonique | |
WO2022158913A1 (fr) | Dispositif d'annulation intégré de signal de bruit et d'écho utilisant un réseau neuronal profond ayant une structure parallèle | |
WO2019031652A1 (fr) | Procédé de lecture audio tridimensionnelle et appareil de lecture | |
WO2016204583A1 (fr) | Procédé et dispositif de traitement de canaux internes réduisant la complexité de la conversion de format |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16752696 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16752696 Country of ref document: EP Kind code of ref document: A1 |