EP3023984A1 - Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal - Google Patents
- Publication number
- EP3023984A1 (application EP14826617.4A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- channel signal
- lfe
- signal
- downmixed
- encoding
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
Definitions
- Exemplary embodiments relate to an encoder and encoding method for a multi-channel signal, and a decoder and decoding method for a multi-channel signal, and more particularly to a codec for efficiently processing a multi-channel signal including a plurality of channel signals.
- An aspect of the present invention is to provide an apparatus and method of encoding or decoding a multi-channel signal including a low-frequency effects (LFE) channel signal.
- LFE low-frequency effects
- Another aspect of the present invention is to provide an apparatus and method of performing two-stage encoding/decoding or one-stage encoding/decoding employing a time delay.
- a method of encoding a multi-channel signal including outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are included in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; encoding the first downmixed signal and the second downmixed signal together; and generating a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- the outputting of the first cue may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode
- the outputting of the second cue may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode
- the first spatial cue and the second spatial cue may include a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- CLD channel level difference
- a method of encoding a multi-channel signal including outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first LFE channel signal which are included in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; encoding the first downmixed signal; encoding the second downmixed signal separately from the first downmixed signal; and generating a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- the outputting of the first cue may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode
- the outputting of the second cue may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode
- the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- a method of encoding a multi-channel signal including outputting a downmixed signal and a spatial cue by encoding a first LFE channel signal and a second LFE channel signal which are included in a multi-channel signal; encoding the downmixed signal; and generating a bitstream including the encoded downmixed signal and the spatial cue.
- the outputting may output the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- a method of encoding a multi-channel signal including applying a time delay to a first LFE channel signal included in a multi-channel signal; applying the time delay to a second LFE channel signal included in the multi-channel signal; encoding the first LFE channel signal to which the time delay is applied; encoding the second LFE channel signal to which the time delay is applied; and generating a bitstream including the encoded first LFE channel signal and the encoded second LFE channel signal.
- the time delay may include a time delay which occurs in encoding a normal channel signal included in the multi-channel signal.
- a method of encoding a multi-channel signal including applying a time delay to a normal channel signal included in a multi-channel signal; encoding the normal channel signal to which the time delay is applied; outputting a downmixed signal and a spatial cue by encoding an LFE channel signal included in the multi-channel signal; and encoding the encoded LFE channel signal, wherein the time delay includes a time delay which occurs in encoding the LFE channel signal.
- the outputting may output the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
- a method of decoding a multi-channel signal including generating a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream; outputting a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- the outputting of the first normal channel signal and the first LFE channel signal may output the first normal channel signal and the first LFE channel signal from the first downmixed signal by applying a first spatial cue to parametric coding
- the outputting of the second normal channel signal and the second LFE channel signal may output the second normal channel signal and the second LFE channel signal from the second downmixed signal by applying a second spatial cue to parametric coding
- the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- a method of decoding a multi-channel signal including generating a first downmixed signal by decoding an encoded result extracted from a bitstream; generating a second downmixed signal by decoding another encoded result extracted from the bitstream; outputting a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- the outputting of the first normal channel signal and the first LFE channel signal may output the first normal channel signal and the first LFE channel signal using parametric coding based on a first spatial cue for the first downmixed signal
- the outputting of the second normal channel signal and the second LFE channel signal may output the second normal channel signal and the second LFE channel signal using parametric coding based on a second spatial cue for the second downmixed signal
- the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- a method of decoding a multi-channel signal including generating a downmixed signal by decoding an encoded result extracted from a bitstream; and outputting a first LFE channel signal and a second LFE channel signal by decoding the downmixed signal.
- the outputting may output the first LFE channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- a method of decoding a multi-channel signal including outputting a first LFE channel signal by decoding an encoded result extracted from a bitstream; outputting a second LFE channel signal by decoding another encoded result extracted from the bitstream; applying a time delay to the first LFE channel signal; and applying the time delay to the second LFE channel signal.
- the time delay may include a time delay which occurs in decoding a normal channel signal.
- a method of decoding a multi-channel signal including decoding a normal channel signal from a bitstream; applying a time delay to the decoded normal channel signal; decoding an LFE channel signal from the bitstream; and decoding the decoded LFE channel signal.
- the time delay may include a time delay which occurs in decoding the LFE channel signal.
- the decoding of the LFE channel signal may output a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
- an encoder for a multi-channel signal including a first encoding unit to output a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are included in a multi-channel signal and to output a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; a second encoding unit to encode the first downmixed signal and the second downmixed signal together; and a bitstream formatter to generate a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- the first encoding unit may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode and may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- an encoder for a multi-channel signal including a first encoding unit to output a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first LFE channel signal which are included in a multi-channel signal and to output a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; a second encoding unit to encode the first downmixed signal and to encode the second downmixed signal separately from the first downmixed signal; and a bitstream formatter to generate a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- the first encoding unit may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode and may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- an encoder for a multi-channel signal including a first encoding unit to output a downmixed signal and a spatial cue by encoding a first LFE channel signal and a second LFE channel signal which are included in a multi-channel signal; a second encoding unit to encode the downmixed signal; and a bitstream formatter to generate a bitstream including the encoded downmixed signal and the spatial cue.
- the first encoding unit may output the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- an encoder for a multi-channel signal including a delay unit to apply a time delay to a first LFE channel signal included in a multi-channel signal and to apply the time delay to a second LFE channel signal included in the multi-channel signal; a second encoding unit to encode the first LFE channel signal to which the time delay is applied and to encode the second LFE channel signal to which the time delay is applied; and a bitstream formatter to generate a bitstream including the encoded first LFE channel signal and the encoded second LFE channel signal.
- the time delay may include a time delay which occurs in encoding a normal channel signal included in the multi-channel signal.
- an encoder for a multi-channel signal including a delay unit to apply a time delay to a normal channel signal included in a multi-channel signal; a first encoding unit to encode the normal channel signal to which the time delay is applied; a second encoding unit to output a downmixed signal and a spatial cue by encoding an LFE channel signal included in the multi-channel signal; and a third encoding unit to encode the encoded LFE channel signal, wherein the time delay includes a time delay which occurs in encoding the LFE channel signal.
- the second encoding unit may output the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
- a decoder for a multi-channel signal including a first decoding unit to generate a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream; and a second decoding unit to output a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal and to output a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- the second decoding unit may output the first normal channel signal and the first LFE channel signal from the first downmixed signal by applying a first spatial cue to parametric coding and may output the second normal channel signal and the second LFE channel signal from the second downmixed signal by applying a second spatial cue to parametric coding, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- a decoder for a multi-channel signal including a first decoding unit to generate a first downmixed signal by decoding an encoded result extracted from a bitstream and to generate a second downmixed signal by decoding another encoded result extracted from the bitstream; and a second decoding unit to output a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and to output a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- the second decoding unit may output the first normal channel signal and the first LFE channel signal using parametric coding based on a first spatial cue for the first downmixed signal and may output the second normal channel signal and the second LFE channel signal using parametric coding based on a second spatial cue for the second downmixed signal, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- a decoder for a multi-channel signal including a first decoding unit to generate a downmixed signal by decoding an encoded result extracted from a bitstream; and a second decoding unit to output a first LFE channel signal and a second LFE channel signal by decoding the downmixed signal.
- the second decoding unit may output the first LFE channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- a decoder for a multi-channel signal including a first decoding unit to output a first LFE channel signal by decoding an encoded result extracted from a bitstream and to output a second LFE channel signal by decoding another encoded result extracted from the bitstream; and a delay unit to apply a time delay to the first LFE channel signal and to apply the time delay to the second LFE channel signal.
- the time delay may include a time delay which occurs in decoding a normal channel signal.
- a decoder for a multi-channel signal including a first decoding unit to decode a normal channel signal from a bitstream; a delay unit to apply a time delay to the decoded normal channel signal; a second decoding unit to decode an LFE channel signal from the bitstream; and a third decoding unit to decode the decoded LFE channel signal.
- the time delay may include a time delay which occurs in decoding the LFE channel signal.
- the second decoding unit may output a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
- a multi-channel signal including a low-frequency effects (LFE) channel signal in addition to a normal channel signal may be effectively encoded or decoded.
- synchronized multi-channel signals may be output by employing two-stage encoding/decoding or one-stage encoding/decoding employing a time delay.
- FIG. 1 illustrates an encoder and a decoder according to an embodiment.
- the encoder 101 may encode a multi-channel signal including a plurality of channel signals to generate a bitstream.
- the decoder 102 may decode the multi-channel signal from the bitstream received from the encoder 101 or stored in a medium of the encoder 101.
- the multi-channel signal may include a low-frequency effects (LFE) channel signal.
- an LFE channel signal refers to a channel signal for low-frequency effects (LFE) of a selective and limited sound range.
- a low sound range may refer to a low-frequency range from 20 to 120 Hz.
- An LFE channel signal may be used to supplement low-frequency information on a main channel signal by transmitting additional low-frequency information.
- FIG. 2 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a first embodiment.
- FIGS. 2 to 5 illustrate processes of encoding a multi-channel signal including two LFE channel signals
- FIGS. 6 to 9 illustrate processes of decoding encoded results of FIGS. 2 to 5 .
- the encoder may include a first encoding unit 201, a first encoding unit 202, a second encoding unit 203 and a bitstream formatter 204.
- the first encoding units 201 and 202 may perform the same operations.
- the first encoding unit 201 may generate a downmixed signal dmx 1 using an LFE channel signal Lfe 1 and a normal channel signal x i .
- a normal channel signal may refer to a channel signal which does not exhibit low-frequency effects.
- the first encoding unit 202 may generate a downmixed signal dmx 2 using an LFE channel signal Lfe 2 and a normal channel signal x i+1 . i represents an index of a normal channel signal. That is, the encoder of FIG. 2 may encode a multi-channel signal including a normal channel signal coupled to an LFE channel signal.
- the first encoding units 201 and 202 may perform parametric coding to output spatial cues and the downmixed signals.
- the first encoding units 201 and 202 perform parametric coding using the LFE channel signals.
- a channel level difference (CLD) as a spatial cue may be extracted from an LFE band. Accordingly, a spatial cue output through parametric coding using an LFE channel signal may contain a relatively smaller amount of data than a spatial cue output through generally used parametric coding.
- the spatial cues output from the first encoding units 201 and 202 are bit1 and bit2, respectively.
- the second encoding unit 203 may encode the downmixed signal dmx 1 output from the first encoding unit 201 and the downmixed signal dmx 2 output from the first encoding unit 202.
- the downmixed signals dmx 1 and dmx 2 may be input as a stereo signal to the second encoding unit 203.
- the second encoding unit 203 may be an Advanced Audio Coding (AAC) encoder, an MP3 encoder, or the like.
- AAC Advanced Audio Coding
- the second encoding unit 203 outputs bit3 as an encoded result, which is input to the bitstream formatter 204.
- the bitstream formatter 204 may convert bit3 into a bitstream.
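The LFE-mode parametric coding described for the first encoding units 201 and 202 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function names `band_energy` and `lfe_mode_tto`, the 0.5 * (a + b) downmix rule, and the sign convention of the CLD (normal channel over LFE channel, in dB) are all hypothetical. Only the idea of taking a single CLD from the LFE band (20 to 120 Hz per the text) comes from the description.

```python
import math

LFE_BAND_HZ = (20.0, 120.0)  # low-frequency range quoted in the description


def band_energy(signal, sample_rate, band=LFE_BAND_HZ):
    """Sum of squared DFT magnitudes over the bins whose frequency lies in `band`."""
    n = len(signal)
    lo, hi = band
    energy = 0.0
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if lo <= freq <= hi:
            re = sum(x * math.cos(2 * math.pi * k * t / n) for t, x in enumerate(signal))
            im = -sum(x * math.sin(2 * math.pi * k * t / n) for t, x in enumerate(signal))
            energy += re * re + im * im
    return energy


def lfe_mode_tto(normal, lfe, sample_rate):
    """Hypothetical two-to-one box in LFE mode: returns (mono downmix, CLD in dB).

    Assumption: the CLD is the LFE-band energy ratio normal/LFE in dB, and the
    downmix is a plain average of the two inputs.
    """
    eps = 1e-12  # avoids log(0) for silent inputs
    cld_db = 10.0 * math.log10(
        (band_energy(normal, sample_rate) + eps) / (band_energy(lfe, sample_rate) + eps)
    )
    downmix = [0.5 * (a + b) for a, b in zip(normal, lfe)]
    return downmix, cld_db
```

Because only one level difference is computed for one narrow band, the cue is a single number per frame, which is consistent with the smaller amount of side information the description attributes to LFE-mode coding.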
- FIG. 3 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a second embodiment.
- the encoder of FIG. 3 may include a first encoding unit 301, a second encoding unit 302, a first encoding unit 303, a second encoding unit 304 and a bitstream formatter 305.
- the first encoding units 301 and 303 of FIG. 3 may operate in the same manner as the first encoding units 201 and 202 of FIG. 2 . That is, the first encoding units 301 and 303 may perform parametric coding using an LFE channel signal to extract a CLD as a spatial cue from an LFE band.
- the first encoding unit 301 may generate a downmixed signal dmx 1 using an LFE channel signal Lfe 1 and a normal channel signal x i .
- the first encoding unit 303 may generate a downmixed signal dmx 2 using an LFE channel signal Lfe 2 and a normal channel signal x i+1 .
- the downmixed signal dmx 1 resulting from encoding by the first encoding unit 301 is input as a mono signal to the second encoding unit 302.
- the second encoding unit 302 may output bit3 using the downmixed signal dmx 1 .
- the downmixed signal dmx 2 resulting from encoding by the first encoding unit 303 is input as a mono signal to the second encoding unit 304.
- the second encoding unit 304 may output bit4 using the downmixed signal dmx 2 .
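The description does not specify how the bitstream formatter 305 lays out bit1 to bit4. As a sketch under that caveat, the following uses a hypothetical length-prefixed layout (4-byte big-endian length before each payload) to show how a formatter and the matching deformatter could round-trip the spatial cues and the separately coded downmixes. The function names and the layout are assumptions for illustration only.

```python
import struct


def format_bitstream(payloads):
    """Concatenate payloads, each preceded by a 4-byte big-endian length field."""
    out = bytearray()
    for chunk in payloads:
        out += struct.pack(">I", len(chunk))
        out += chunk
    return bytes(out)


def deformat_bitstream(stream):
    """Inverse of format_bitstream: recover the payloads in their original order."""
    payloads = []
    pos = 0
    while pos < len(stream):
        (length,) = struct.unpack_from(">I", stream, pos)
        pos += 4
        payloads.append(stream[pos:pos + length])
        pos += length
    return payloads
```

With this layout, the decoder side can hand the first two payloads to the parametric stage as spatial cues and the last two to the core decoders, mirroring the order in which the encoder wrote them.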
- FIG. 4 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a third embodiment.
- the encoder may include a first encoding unit 401, a second encoding unit 402 and a bitstream formatter 403.
- LFE channel signals Lfe 1 and Lfe 2 may be coupled to each other and input to the first encoding unit 401.
- the first encoding unit 401 may output a downmixed signal dmx 3 as a mono signal using the LFE channel signals Lfe 1 and Lfe 2 .
- bit1 means a spatial cue derived by the first encoding unit 401 through parametric coding.
- the downmixed signal dmx 3 may be input to the second encoding unit 402.
- the second encoding unit 402 may code an LFE band in the downmixed signal dmx 3 .
- Unified Speech and Audio Coding (USAC) and Advanced Audio Coding (AAC) may each have a separate coding mode for coding an LFE band.
- the second encoding unit 402 may use a coding mode provided by the USAC or AAC.
- Bit2 output from the second encoding unit 402 and bit1 output from the first encoding unit 401 may be output as a bitstream through the bitstream formatter 403.
- FIG. 5 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a fourth embodiment.
- the encoder may include a delay unit 501, a second encoding unit 502, a delay unit 503, a second encoding unit 504 and a bitstream formatter 505.
- FIG. 5 illustrates a process of encoding an LFE channel signal using the second encoding units 502 and 504, not via the aforementioned first encoding units.
- the second encoding units 502 and 504 may perform parametric coding on an LFE band. Input signals for the second encoding units 502 and 504 may need to be delayed corresponding to the presence of a first encoding unit.
- the additional encoding stage applied to the normal channel signals may cause a time delay. Accordingly, a bitstream of the LFE channel signals synchronized with the normal channel signals may be generated only when this time delay is taken into account.
- the delay units 501 and 503 may apply a time delay τ_enc, which may occur in real encoding, to the LFE channel signals Lfe 1 and Lfe 2 . Subsequently, time-delayed LFE channel signals Lfe 1 (n-τ_enc) and Lfe 2 (n-τ_enc) may be input to the second encoding units 502 and 504, respectively. Bit1 and bit2, the encoded results of the second encoding units 502 and 504, may be output as a bitstream via the bitstream formatter 505.
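The operation of the delay units 501 and 503 amounts to shifting each LFE channel signal by a fixed number of samples before it reaches the core encoder. A minimal sketch, assuming the delay is an integer number of samples and the frame length is kept fixed (the name `apply_delay` and the zero-fill at the start are assumptions, not taken from the patent):

```python
def apply_delay(signal, delay_samples):
    """Return the signal shifted by `delay_samples`, i.e. y(n) = x(n - d).

    The first d output samples are zero-filled and the output keeps the
    length of the input, so the last d input samples fall off the end.
    """
    if delay_samples < 0:
        raise ValueError("delay must be non-negative")
    if delay_samples == 0:
        return list(signal)
    return ([0.0] * delay_samples + list(signal))[:len(signal)]
```

In a streaming encoder the samples that fall off the end would normally be carried over to the next frame; that bookkeeping is omitted here for brevity.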
- FIG. 6 illustrates a decoder which decodes an encoded result of FIG. 2 .
- the decoder may include a bitstream deformatter 601, a first decoding unit 602, a second decoding unit 603 and a second decoding unit 604.
- FIG. 6 may operate in an inverse manner to FIG. 2 .
- a bitstream input to the bitstream deformatter 601 may be the bitstream generated in FIG. 2 .
- the bitstream deformatter 601 may output bit1, bit2 and bit3 from the bitstream. Bit1, bit2 and bit3 are the same as those mentioned in FIG. 2 .
- Bit3 may be input to the first decoding unit 602.
- the first decoding unit 602 may generate downmixed signals dmx 1 and dmx 2 using bit3.
- the second decoding unit 603 may perform parametric coding on bit1 as a spatial cue and the downmixed signal dmx 1 to output a normal channel signal x i and an LFE channel signal Lfe 1 .
- the second decoding unit 604 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx 2 to output a normal channel signal x i+1 and an LFE channel signal Lfe 2 .
- FIG. 7 illustrates a decoder which decodes an encoded result of FIG. 3 .
- the decoder may include a bitstream deformatter 701, a first decoding unit 702, a second decoding unit 703, a first decoding unit 704 and a second decoding unit 705.
- FIG. 7 may operate in an inverse manner to FIG. 3 .
- a bitstream input to the bitstream deformatter 701 may be the bitstream generated in FIG. 3 .
- the bitstream deformatter 701 may output bit1, bit2, bit3 and bit4 from the bitstream. Bit1, bit2, bit3 and bit4 are the same as those mentioned in FIG. 3 .
- Bit3 may be input to the first decoding unit 702, and bit4 may be input to the first decoding unit 704.
- the first decoding unit 702 may generate a downmixed signal dmx 1 using bit3.
- the first decoding unit 704 may generate a downmixed signal dmx 2 using bit4.
- the second decoding unit 703 may perform parametric coding on bit1 as a spatial cue and the downmixed signal dmx 1 to output a normal channel signal x i and an LFE channel signal Lfe 1 .
- the second decoding unit 705 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx 2 to output a normal channel signal x i+1 and an LFE channel signal Lfe 2 .
- FIG. 8 illustrates a decoder which decodes an encoded result of FIG. 4 .
- the decoder may include a bitstream deformatter 801, a first decoding unit 802 and a second decoding unit 803.
- FIG. 8 may operate in an inverse manner to FIG. 4 .
- a bitstream input to the bitstream deformatter 801 may be the bitstream generated in FIG. 4 .
- the bitstream deformatter 801 may output bit1 and bit2 from the bitstream. Bit1 and bit2 are the same as those mentioned in FIG. 4 .
- Bit1 may be input to the first decoding unit 802, and bit2 may be input to the second decoding unit 803.
- the first decoding unit 802 may generate a downmixed signal dmx 3 using bit1.
- the second decoding unit 803 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx 3 to output LFE channel signals Lfe 1 and Lfe 2 .
- the first decoding unit 802 and the second decoding unit 803 may perform parametric coding on an LFE band of the input downmixed signal dmx 3 .
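The description states that the two LFE channel signals are recovered from the mono downmix and a CLD, but it does not give the upmix rule. The sketch below uses a common energy-preserving gain split as a stand-in: with r = 10^(CLD/10) as the power ratio of the first output to the second, the gains are sqrt(r / (1 + r)) and sqrt(1 / (1 + r)). The function names and this particular rule are assumptions for illustration.

```python
import math


def cld_gains(cld_db):
    """Map a CLD in dB to two gains whose squares sum to 1 and whose
    squared ratio equals the power ratio encoded by the CLD."""
    ratio = 10.0 ** (cld_db / 10.0)  # power of output 1 over power of output 2
    g1 = math.sqrt(ratio / (1.0 + ratio))
    g2 = math.sqrt(1.0 / (1.0 + ratio))
    return g1, g2


def upmix(downmix, cld_db):
    """Hypothetical one-to-two upmix: scale the mono downmix by the two CLD gains."""
    g1, g2 = cld_gains(cld_db)
    return [g1 * x for x in downmix], [g2 * x for x in downmix]
```

A CLD of 0 dB splits the downmix equally between the two outputs, while a large positive CLD sends almost all of the energy to the first output.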
- FIG. 9 illustrates a decoder which decodes an encoded result of FIG. 5 .
- the decoder may include a bitstream deformatter 901, a first decoding unit 902, a delay unit 903, a first decoding unit 904 and a delay unit 905.
- FIG. 9 may operate in an inverse manner to FIG. 5 .
- a bitstream input to the bitstream deformatter 901 may be the bitstream generated in FIG. 5 .
- the bitstream deformatter 901 may output bit1 and bit2 from the bitstream. Bit1 and bit2 are the same as those mentioned in FIG. 5 .
- Bit1 may be input to the first decoding unit 902, and bit2 may be input to the first decoding unit 904.
- the first decoding unit 902 may generate an LFE channel signal Lfe 1 (n-τ_enc) using bit1.
- the first decoding unit 904 may generate an LFE channel signal Lfe 2 (n-τ_enc) using bit2.
- the delay unit 903 may apply a time delay to the LFE channel signal Lfe 1 (n-τ_enc) to output Lfe 1 (n-τ_enc-τ_dec).
- the delay unit 905 may apply a time delay to the LFE channel signal Lfe 2 (n-τ_enc) to output Lfe 2 (n-τ_enc-τ_dec).
- the delay units 903 and 905 may apply a time delay τ_dec occurring in one-time decoding so that signals subjected to one-time decoding synchronize with those subjected to two-time decoding.
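The synchronization argument can be checked with a latency-only model. In the sketch below every processing stage is reduced to a pure delay, which is a deliberate simplification: the normal channel path passes through a parametric encoder, a core codec and a parametric decoder, while the LFE path passes only through the core codec plus the two compensating delay units. All function names and the example delay values are hypothetical.

```python
def delay(signal, d):
    """Shift a signal by d samples, zero-filling the start and keeping its length."""
    return ([0.0] * d + list(signal))[:len(signal)]


def normal_path(x, parametric_enc, core, parametric_dec):
    """Two-stage path: parametric encoding, core codec, parametric decoding."""
    for d in (parametric_enc, core, parametric_dec):
        x = delay(x, d)
    return x


def lfe_path(x, core, enc_compensation=0, dec_compensation=0):
    """One-stage path: optional encoder-side delay, core codec, optional decoder-side delay."""
    for d in (enc_compensation, core, dec_compensation):
        x = delay(x, d)
    return x


def arrival(signal):
    """Index at which a unit impulse shows up in the output."""
    return signal.index(1.0)
```

Feeding a unit impulse through both paths shows the effect: without compensation the LFE impulse arrives early, and setting the two compensating delays equal to the parametric encoder and decoder latencies makes both impulses arrive at the same sample index.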
- FIG. 10 illustrates a process of encoding a multi-channel signal using the encoder of FIG. 2 .
- FIG. 10 illustrates an encoder for a multi-channel signal which adopts the encoder Type1 illustrated in FIG. 2 .
- Two To Ones (TTOs) 1001, 1002, 1004 and 1005 may encode an input signal according to a parametric coding mode for an MPEG Surround stereo signal. That is, the TTOs may correspond to the first encoding units of FIG. 2 , and USAC encoders may correspond to the second encoding unit of FIG. 2 .
- TTOs 1001 and 1002 may perform parametric coding according to a normal mode, and TTOs 1004 and 1005 may perform parametric coding according to an LFE mode.
- in the normal mode, a CLD, Inter-Channel Coherence (ICC) and Interchannel Phase Difference (IPD) may be extracted as spatial cues by analyzing a normal channel signal x i .
- in the LFE mode, a CLD may be extracted from an LFE band of an input LFE channel signal.
- FIG. 10 illustrates an encoding process when N multi-channel signals are input.
- the N multi-channel signals may be subjected to parametric coding via the TTOs into M downmixed signals dmx 1 to dmx M .
- the M downmixed signals may be input in a stereo form and encoded through USAC core coding.
- LFE channel signals Lfe 1 and Lfe 2 may be coupled to normal channel signals to be input to the TTO 1004 and the TTO 1005.
- normal channel signals of the multi-channel signals may be coupled and downmixed by two channels, and a downmixed result may be subjected to stereo coding by the USAC encoders.
- two normal channel signals x 2M-1 and x 2M may be respectively coupled to the LFE channel signals Lfe 1 and Lfe 2 and input to the TTOs(Lfe).
- Although FIG. 10 shows that the encoder Type 1 of FIG. 2 is adopted, the encoder Type 2 illustrated in FIG. 3 may be applied instead of the encoder Type 1.
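- The channel coupling described for FIG. 10 can be sketched as a small planning function: the last normal channels are each coupled with one LFE channel for the LFE mode, and the remaining normal channels are coupled two at a time for the normal mode. The function name `plan_tto_inputs` and the tuple layout are hypothetical and serve only to make the routing explicit.

```python
def plan_tto_inputs(normal, lfe):
    """Return (mode, first, second) tuples describing each TTO input pair."""
    if len(lfe) > len(normal):
        raise ValueError("need one normal channel per LFE channel")
    split = len(normal) - len(lfe)
    plain, partners = normal[:split], normal[split:]
    if len(plain) % 2:
        raise ValueError("remaining normal channels must pair up evenly")
    # Normal mode: consecutive normal channels are coupled two at a time.
    plan = [("normal", plain[i], plain[i + 1]) for i in range(0, len(plain), 2)]
    # LFE mode: each trailing normal channel is coupled with one LFE channel.
    plan += [("lfe", x, l) for x, l in zip(partners, lfe)]
    return plan
```

- With four normal channels and two LFE channels, the plan holds one normal-mode pair (x1, x2) and two LFE-mode pairs (x3, Lfe1) and (x4, Lfe2).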
- FIG. 11 illustrates a decoder which decodes an encoded result of FIG. 10 .
- FIG. 11 illustrates a decoder for a multi-channel signal which adopts the decoder Type 1 illustrated in FIG. 6 .
- One To Twos (OTTs) 1103, 1104, 1106 and 1107 may decode an input signal according to a parametric coding mode for an MPEG Surround stereo signal. That is, the OTTs may correspond to the second decoding units of FIG. 6 , and USAC decoders may correspond to the first decoding unit.
- OTTs 1103 and 1104 may perform parametric coding according to a normal mode, and the OTTs 1106 and 1107 may perform parametric coding according to an LFE mode.
- the encoded result may be decoded to output N multi-channel audio signals.
- M downmixed signals may be output from a bitstream via the USAC decoders.
- the M downmixed signals may be input to the respective OTTs to output stereo signals.
- the OTTs 1103 and 1104 may output two normal channel signals, and the OTTs 1106 and 1107 may output normal channel signals coupled to LFE channel signals.
- FIG. 12 illustrates a process of encoding a multi-channel signal when encoding bits are sufficient in FIG. 10 .
- normal channel signals x 1 to x 2M-2 may be encoded by USAC encoders 1203 and 1206.
- a delay time ⁇ enc occurring in encoding may be applied to the normal channel signals x 1 to x 2M-2 via delay units 1201, 1202, 1204 and 1205. Accordingly, time-delayed results may be encoded by the USAC encoders 1203 and 1206.
- the time delay ⁇ enc occurs in TTOs 1207 and 1208 and may include time delays due to quadrature mirror filter (QMF) analysis, hybrid analysis and QMF synthesis.
- a time delay occurring by QMF synthesis may be excluded when calculating ⁇ enc .
- FIG. 13 illustrates a decoder which decodes an encoded result of FIG. 12 .
- FIG. 13 may perform an inverse process to FIG. 12 .
- a normal channel signal is decoded by USAC decoders 1302 and 1305 and output via delay units 1303, 1304, 1306 and 1307.
- a result derived by a bitstream deformatter 1301 may be decoded by a USAC decoder 1308 to generate downmixed signals, and the downmixed signals may be respectively input to OTTs 1309 and 1310 to output normal channel signals x 2M-1 and x 2M respectively coupled to LFE channel signals Lfe 1 and Lfe 2 .
- the delay units 1303, 1304, 1306 and 1307 may apply a time delay ⁇ dec to the results output from the USAC decoders 1302 and 1305, which are decoded only once.
- ⁇ dec includes QMF analysis, hybrid analysis, QMF synthesis and filtering delays and is different from ⁇ enc .
- when output signals from the USAC decoders 1302 and 1305 are QMF signals, a time delay occurring by QMF analysis may be excluded when determining ⁇ dec .
- a filtering delay refers to a time delay which occurs due to a filtering operation by the OTTs 1309 and 1310, irrespective of QMF conversion.
- a filtering delay may be a time delay occurring in a decorrelator operation by the OTTs 1309 and 1310.
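- The delay bookkeeping described for FIGS. 12 and 13 can be sketched as a sum over named contributions, with an option to leave out a contribution that does not apply. All sample counts below are placeholders chosen only for illustration; the source gives no numeric values, and the names `total_delay`, `ENC` and `DEC` are hypothetical.

```python
def total_delay(components, skip=()):
    """Sum named delay contributions (in samples), leaving out any in `skip`."""
    return sum(value for name, value in components.items() if name not in skip)


# Placeholder sample counts, NOT taken from the source or from any standard.
ENC = {"qmf_analysis": 100, "hybrid_analysis": 50, "qmf_synthesis": 80}
DEC = {"qmf_analysis": 100, "hybrid_analysis": 50, "qmf_synthesis": 80,
       "filtering": 20}

delay_enc = total_delay(ENC)
# Encoder-side case in which the QMF synthesis delay is excluded.
delay_enc_no_synthesis = total_delay(ENC, skip=("qmf_synthesis",))
delay_dec = total_delay(DEC)
# Decoder-side case in which the core decoder already outputs QMF signals.
delay_dec_qmf_input = total_delay(DEC, skip=("qmf_analysis",))
```

- Keeping the contributions named makes it explicit which delay is dropped in each special case and why the encoder-side and decoder-side totals differ.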
- FIG. 14 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 4 .
- the encoder Type 3 illustrated in FIG. 4 may be used.
- Normal channel signals x 1 to x 2M of a multi-channel signal may be coupled by two and input to TTOs 1401 and 1402.
- the TTOs 1401 and 1402 may perform parametric coding on the normal channel signals coupled by two to output downmixed signals dmx 1 and dmx 2 along with spatial cues.
- the output downmixed signals dmx 1 and dmx 2 may be input in a stereo form to a USAC encoder 1403.
- LFE channel signals Lfe 1 and Lfe 2 included in the multi-channel signal may be coupled by two and input to a TTO 1404.
- the TTO 1404 may perform parametric coding using the two LFE channel signals Lfe 1 and Lfe 2 to output a downmixed signal dmx 3 in a mono form.
- a USAC encoder 1405 may encode the downmixed signal dmx 3 in the LFE mode.
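- The LFE-mode step of FIG. 14 can be sketched as a function that takes two LFE signals and returns a mono downmix together with a single CLD. This is a simplified illustration under stated assumptions: the inputs are taken to be already limited to the LFE band, so time-domain energy stands in for band energy; the downmix is a plain average; and the CLD is defined as 10*log10(E1/E2). The name `tto_lfe`, the averaging rule and the `eps` guard are hypothetical and are not taken from the source.

```python
import math


def tto_lfe(lfe1, lfe2, eps=1e-12):
    """Downmix two equally long LFE signals to mono; return (dmx, cld_db)."""
    if len(lfe1) != len(lfe2):
        raise ValueError("signals must have the same length")
    # Assumed downmix rule: sample-wise average of the two inputs.
    dmx = [0.5 * (a + b) for a, b in zip(lfe1, lfe2)]
    e1 = sum(a * a for a in lfe1)
    e2 = sum(b * b for b in lfe2)
    # eps keeps the ratio finite when one of the inputs is silent.
    cld_db = 10.0 * math.log10((e1 + eps) / (e2 + eps))
    return dmx, cld_db
```

- Only the CLD is produced here, which matches the statement that the LFE mode extracts a CLD from the LFE band; ICC and IPD are left out on purpose.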
- FIG. 15 illustrates another example of encoding a multi-channel signal using the encoder of FIG. 4 .
- a normal channel signal may be encoded by USAC encoders 1503 and 1506, instead of being subjected to parametric coding by TTOs 1401 and 1402 in FIG. 14 .
- delay units 1501, 1502, 1504 and 1505 may apply a time delay occurring by the TTO 1507 to the normal channel signal.
- LFE channel signals Lfe 1 and Lfe 2 may be encoded by the TTO 1507 in the LFE mode to output a downmixed signal dmx 3 , and the downmixed signal dmx 3 may be encoded by the USAC encoder.
- FIGS. 14 and 15 illustrate the encoders, and corresponding decoders may operate according to inverse processes.
- a normal channel signal may be output from a bitstream obtained in FIG. 14 via a USAC decoder and an OTT.
- LFE channel signals Lfe 1 and Lfe 2 may be output from the bitstream obtained in FIG. 14 via a USAC decoder and an OTT.
- a normal channel signal may be output from a bitstream obtained in FIG. 15 via a USAC decoder and a delay unit.
- LFE channel signals Lfe 1 and Lfe 2 may be output from the bitstream obtained in FIG. 15 via a USAC decoder and an OTT.
- FIG. 16 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 5 .
- FIG. 16 illustrates an encoder for a multi-channel signal which adopts the encoder Type 4 illustrated in FIG. 5 .
- Normal channel signals may be converted into downmixed signals through TTOs 1601 and 1602, and the converted downmixed signals may be output as a bitstream through a USAC encoder 1603.
- delay units 1604 and 1606 may apply a time delay ⁇ enc occurring in the TTOs 1601 and 1602 to LFE channel signals, and time-delayed results may be encoded respectively by USAC encoders 1605 and 1607 according to the LFE mode. That is, since the LFE channel signals are subjected to encoding once, unlike the normal channel signals subjected to encoding two times, the time delay ⁇ enc occurring in encoding by the TTOs may need to be applied to the LFE channel signals.
- FIG. 17 illustrates an example of decoding a multi-channel signal using the decoder of FIG. 9 .
- the decoder Type 4 illustrated in FIG. 9 is used.
- normal channel signals may be output through a USAC decoder 1702 and OTTs 1703 and 1704.
- LFE channel signals may be output through USAC decoders 1705 and 1707 and delay units 1706 and 1708.
- the delay units 1706 and 1708 may need to apply a time delay ⁇ dec occurring in decoding by the OTTs 1703 and 1704 to the LFE channel signals. Accordingly, the normal channel signals and the LFE channel signals output from the decoder may be synchronized with each other.
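- The synchronization argument of FIGS. 16 and 17 can be checked with simple latency arithmetic: the normal channels pass through a parametric stage and the core codec on each side, whereas the LFE channels pass through the core codec only, so the delay units must make up the difference. The delays below are placeholder sample counts and the names are hypothetical; the sketch also assumes the core codec adds the same latency to both paths.

```python
def path_latency(stage_delays):
    """Total latency of a signal path, given the delay of each stage."""
    return sum(stage_delays)


# Placeholder delays in samples; none of these values come from the source.
DELAY_ENC, DELAY_DEC, CORE_ENC, CORE_DEC = 7, 5, 11, 11

# Normal channels: parametric encoding, core codec, parametric decoding.
normal_path = path_latency([DELAY_ENC, CORE_ENC, CORE_DEC, DELAY_DEC])
# LFE channels without delay units: core codec only.
lfe_unaligned = path_latency([CORE_ENC, CORE_DEC])
# LFE channels with a delay unit on the encoder side and the decoder side.
lfe_aligned = path_latency([DELAY_ENC, CORE_ENC, CORE_DEC, DELAY_DEC])

offset_without_delay_units = normal_path - lfe_unaligned
```

- The offset without delay units equals the sum of the encoder-side and decoder-side parametric delays, which is the amount the two pairs of delay units add back.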
- FIG. 18 illustrates an encoder which encodes a multi-channel signal when the multi-channel signal includes an odd number of LFE channel signals according to an embodiment.
- FIGS. 2 to 17 illustrate an even number of LFE channel signals
- FIG. 18 illustrates an odd number of LFE channel signals.
- one LFE channel signal Lfe 2n+1 may be input to a delay unit 1801, and a time delay ⁇ may be applied to the LFE channel signal.
- the time-delayed LFE channel signal may be encoded by a second encoding unit 1802 in an LFE mode to output bit1. That is, an odd number of LFE channel signals may be processed by the encoder Type 4 of FIG. 5 or the decoder Type 4 of FIG. 9 .
- a normal channel signal is encoded by a first encoding unit and the second encoding unit, and thus a delay unit 1801 may need to apply a time delay occurring due to the first encoding unit to the LFE channel signal for synchronization with the normal channel signal.
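- One way to picture the odd-count case of FIG. 18 is to group the LFE channels into pairs and route the single leftover channel to the delay-and-encode path. The function name `split_lfe_channels` is hypothetical, and handling the paired channels with one of the schemes of FIGS. 2 to 17 is an assumption made for this sketch.

```python
def split_lfe_channels(lfe):
    """Group LFE channels into pairs; an odd leftover goes to the delay path."""
    # Stop at len(lfe) - 1 so an unpaired final channel is never indexed.
    paired = [(lfe[i], lfe[i + 1]) for i in range(0, len(lfe) - 1, 2)]
    leftover = lfe[-1] if len(lfe) % 2 else None
    return paired, leftover
```

- For three LFE channels this returns one pair and the third channel as the leftover; for an even count the leftover is `None`.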
- FIG. 19 illustrates an encoder which encodes a normal audio signal, not an LFE channel signal, according to an embodiment.
- normal channel signals x 1 and x 2 may be subjected to parametric coding by a first encoding unit 1901 to be converted into a downmixed signal dmx 1 along with a spatial cue bit1.
- normal channel signals x 3 and x 4 may be subjected to parametric coding by a first encoding unit 1902 to be converted into a downmixed signal dmx 2 along with a spatial cue bit2.
- parametric coding applied to the normal channel signals may extract not only a CLD but also an ICC and IPD as spatial cues.
- the downmixed signal dmx 1 and dmx 2 may be input in a stereo form to a second encoding unit 1903 and encoded to output bit3.
- Bit3 may be converted into a bitstream by a bitstream formatter 1904.
- FIG. 20 illustrates a decoder which decodes an encoded result of FIG. 19 .
- spatial cues bit1 and bit2 and encoded bit3 may be output by a bitstream deformatter 2001 from the bitstream generated in FIG. 19 .
- a first decoding unit 2002 may decode bit3 to output downmixed signals dmx 1 and dmx 2 .
- a second decoding unit 2003 may decode a downmixed signal dmx 1 to output normal channel signals x 1 and x 2 .
- a second decoding unit 2004 may decode a downmixed signal dmx 2 to output normal channel signals x 3 and x 4 .
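- The roles of the bitstream formatter and deformatter in FIGS. 19 and 20 can be sketched with a toy container. The source does not specify a bitstream layout, so the length-prefixed format below and the names `format_bitstream` and `deformat_bitstream` are purely hypothetical; the sketch only shows that payloads such as the spatial cues and the encoded downmix can be packed together and recovered in order.

```python
import struct


def format_bitstream(*payloads):
    """Concatenate byte payloads, each preceded by a 4-byte big-endian length."""
    return b"".join(struct.pack(">I", len(p)) + p for p in payloads)


def deformat_bitstream(stream):
    """Inverse of format_bitstream: return the payloads in their original order."""
    payloads, pos = [], 0
    while pos < len(stream):
        if pos + 4 > len(stream):
            raise ValueError("truncated length field")
        (size,) = struct.unpack_from(">I", stream, pos)
        pos += 4
        if pos + size > len(stream):
            raise ValueError("truncated payload")
        payloads.append(stream[pos:pos + size])
        pos += size
    return payloads
```

- A deformatter built this way returns exactly the payloads the formatter was given, which is the only property the decoder of FIG. 20 relies on in this sketch.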
- FIG. 21 illustrates an encoding process and a decoding process according to an embodiment.
- the foregoing first encoding units may correspond to TTOs 2101 and 2102 of FIG. 21 .
- the foregoing second encoding units may correspond to a USAC encoder 2103.
- the foregoing first decoding units may correspond to a USAC decoder 2104.
- the foregoing second decoding units may correspond to OTTs 2105 and 2106.
- the USAC decoder 2104 may output two downmixed signals from a bitstream.
- the OTTs 2105 and 2106 may output (i) four normal channel signals or (ii) results of coupling one normal channel signal and one LFE channel signal from the downmixed signals.
- FIG. 22 illustrates a USAC encoder and a USAC decoder according to a first embodiment.
- a USAC encoder may include TTOs 2203 and 2204 to configure an extended USAC encoder 2201.
- a USAC decoder may include OTTs 2211 and 2212 to configure an extended USAC decoder 2202.
- the non-frequency-extended core band is decoded by a core decoder 2208, and a decoded result may be input to and subjected to frequency extension by an SBR 2209, thereby reconstructing an original signal.
- a result of frequency extension by the SBR 2209 may be subjected to parametric coding by an OTT 2210 to generate two downmixed signals, and the downmixed signals may be subjected to parametric coding by the OTTs 2211 and 2212 to output (i) four normal channel signals or (ii) one normal channel signal and one LFE channel signal.
- FIG. 23 illustrates a USAC encoder and a USAC decoder according to a second embodiment.
- positions of an SBR 2305 and a TTO 2306 in an extended USAC encoder 2301 and positions of an OTT 2309 and an SBR 2310 in an extended USAC decoder 2302 are changed from those in FIG. 22 .
- Other components may be equivalent to those in FIG. 22 .
- the apparatuses described herein may be implemented using hardware components, software components, and/or combinations of hardware components and software components.
- the units and components illustrated in the embodiments may be implemented using one or more general-purpose or special purpose computers, such as, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable array (FPA), a programmable logic unit (PLU), a microprocessor or any other device capable of responding to and executing instructions.
- a processing device may run an operating system (OS) and one or more software applications that run on the OS. The processing device also may access, store, manipulate, process, and create data in response to execution of the software.
- a processing device may include multiple processing elements and multiple types of processing elements.
- a processing device may include multiple processors or a processor and a controller.
- different processing configurations are possible, such as parallel processors.
- the software may include a computer program, a piece of code, an instruction, or one or more combinations thereof, to independently or collectively instruct or configure the processing device to operate as desired.
- Software and/or data may be embodied permanently or temporarily in any type of machine, component, physical or virtual equipment, computer storage medium or device, or in a propagated signal wave in order to provide instructions or data to the processing device or to be interpreted by the processing device.
- the software may also be distributed over network coupled computer systems so that the software is stored and executed in a distributed fashion.
- the software and data may be stored by one or more non-transitory computer readable recording mediums.
- the methods according to the embodiments may be realized as program instructions implemented by various computers and be recorded in non-transitory computer-readable media.
- the media may also include, alone or in combination with the program instructions, data files, data structures, and the like.
- the program instructions recorded in the media may be designed and configured specially for the embodiments or be known and available to those skilled in computer software.
- Examples of the non-transitory computer readable recording medium may include magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD ROM disks and DVDs; magneto-optical media such as floptical disks; and hardware devices that are specially configured to store and perform program instructions, such as read-only memory (ROM), random access memory (RAM), flash memory, and the like.
- Examples of program instructions include both machine codes, such as produced by a compiler, and higher level language codes that may be executed by the computer using an interpreter.
- the described hardware devices may be configured to act as one or more software modules in order to perform the operations of the above-described exemplary embodiments, or vice versa.
Abstract
Description
- Exemplary embodiments relate to an encoder and encoding method for a multi-channel signal, and a decoder and decoding method for a multi-channel signal, and more particularly to a codec for efficiently processing a multi-channel signal including a plurality of channel signals.
- With growing demand for ultrahigh-quality audiovisual (AV) media, novel technology for compression/transmission of AV media is needed. For ultrahigh-quality audio content, audio quality and accurate representation of a multi-channel sound field are more important than backward compatibility. For instance, a 22.2 channel audio signal, which reproduces the sound field of ultrahigh-quality audio, requires a high-quality multi-channel audio coding technique that represents the unique sound quality and sound-field effects of the content as they are, rather than compression/transmission techniques designed for backward compatibility.
- Thus, new codec structures are needed for encoding/decoding known 5.1 or 7.1 channel or greater multi-channel signals.
- An aspect of the present invention is to provide an apparatus and method of encoding or decoding a multi-channel signal including a low-frequency effects (LFE) channel signal.
- Another aspect of the present invention is to provide an apparatus and method of performing two-stage encoding/decoding or one-stage encoding/decoding employing a time delay.
- According to a first embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are included in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; encoding the first downmixed signal and the second downmixed signal together; and generating a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- The outputting of the first cue may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode, the outputting of the second cue may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a second embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first LFE channel signal which are included in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; encoding the first downmixed signal; encoding the second downmixed signal separately from the first downmixed signal; and generating a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- The outputting of the first cue may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode, the outputting of the second cue may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a third embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including outputting a downmixed signal and a spatial cue by encoding a first LFE channel signal and a second LFE channel signal which are included in a multi-channel signal; encoding the downmixed signal; and generating a bitstream including the encoded downmixed signal and the spatial cue.
- The outputting may output the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a fourth embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including applying a time delay to a first LFE channel signal included in a multi-channel signal; applying the time delay to a second LFE channel signal included in the multi-channel signal; encoding the first LFE channel signal to which the time delay is applied; encoding the second LFE channel signal to which the time delay is applied; and generating a bitstream including the encoded first LFE channel signal and the encoded second LFE channel signal.
- The time delay may include a time delay which occurs in encoding a normal channel signal included in the multi-channel signal.
- According to a fifth embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including applying a time delay to a normal channel signal included in a multi-channel signal; encoding the normal channel signal to which the time delay is applied; outputting a downmixed signal and a spatial cue by encoding an LFE channel signal included in the multi-channel signal; and encoding the encoded LFE channel signal, wherein the time delay includes a time delay which occurs in encoding the LFE channel signal.
- The outputting may output the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
- According to a first embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including generating a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream; outputting a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- The outputting of the first normal channel signal and the first LFE channel signal may output the first normal channel signal and the first LFE channel signal from the first downmixed signal by applying a first spatial cue to parametric coding, the outputting of the second normal channel signal and the second LFE channel signal may output the second normal channel signal and the second LFE channel signal from the second downmixed signal by applying a second spatial cue to parametric coding, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a second embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including generating a first downmixed signal by decoding an encoded result extracted from a bitstream; generating a second downmixed signal by decoding another encoded result extracted from the bitstream; outputting a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- The outputting of the first normal channel signal and the first LFE channel signal may output the first normal channel signal and the first LFE channel signal using parametric coding based on a first spatial cue for the first downmixed signal, the outputting of the second normal channel signal and the second LFE channel signal may output the second normal channel signal and the second LFE channel signal using parametric coding based on a second spatial cue for the second downmixed signal, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a third embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including generating a downmixed signal by decoding an encoded result extracted from a bitstream; and outputting a first LFE channel signal and a second LFE channel signal by decoding the downmixed signal.
- The outputting may output the first LFE channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a fourth embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including outputting a first LFE channel signal by decoding an encoded result extracted from a bitstream; outputting a second LFE channel signal by decoding another encoded result extracted from the bitstream; applying a time delay to the first LFE channel signal; and applying the time delay to the second LFE channel signal.
- The time delay may include a time delay which occurs in decoding a normal channel signal.
- According to a fifth embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including decoding a normal channel signal from a bitstream; applying a time delay to the decoded normal channel signal; decoding an LFE channel signal from the bitstream; and decoding the decoded LFE channel signal.
- The time delay may include a time delay which occurs in decoding the LFE channel signal.
- The decoding of the LFE channel signal may output a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
- According to a first embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a first encoding unit to output a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are included in a multi-channel signal and to output a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; a second encoding unit to encode the first downmixed signal and the second downmixed signal together; and a bitstream formatter to generate a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- The first encoding unit may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode and may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a second embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a first encoding unit to output a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first LFE channel signal which are included in a multi-channel signal and to output a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; a second encoding unit to encode the first downmixed signal and to encode the second downmixed signal separately from the first downmixed signal; and a bitstream formatter to generate a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- The first encoding unit may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode and may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a third embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a first encoding unit to output a downmixed signal and a spatial cue by encoding a first LFE channel signal and a second LFE channel signal which are included in a multi-channel signal; a second encoding unit to encode the downmixed signal; and a bitstream formatter to generate a bitstream including the encoded downmixed signal and the spatial cue.
- The first encoding unit may output the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a fourth embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a delay unit to apply a time delay to a first LFE channel signal included in a multi-channel signal and to apply the time delay to a second LFE channel signal included in the multi-channel signal; a second encoding unit to encode the first LFE channel signal to which the time delay is applied and to encode the second LFE channel signal to which the time delay is applied; and a bitstream formatter to generate a bitstream including the encoded first LFE channel signal and the encoded second LFE channel signal.
- The time delay may include a time delay which occurs in encoding a normal channel signal included in the multi-channel signal.
- According to a fifth embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a delay unit to apply a time delay to a normal channel signal included in a multi-channel signal; a first encoding unit to encode the normal channel signal to which the time delay is applied; a second encoding unit to output a downmixed signal and a spatial cue by encoding an LFE channel signal included in the multi-channel signal; and a third encoding unit to encode the encoded LFE channel signal, wherein the time delay includes a time delay which occurs in encoding the LFE channel signal.
- The second encoding unit may output the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
- According to a first embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to generate a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream; and a second decoding unit to output a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal and to output a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- The second decoding unit may output the first normal channel signal and the first LFE channel signal from the first downmixed signal by applying a first spatial cue to parametric coding and may output the second normal channel signal and the second LFE channel signal from the second downmixed signal by applying a second spatial cue to parametric coding, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a second embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to generate a first downmixed signal by decoding an encoded result extracted from a bitstream and to generate a second downmixed signal by decoding another encoded result extracted from the bitstream; and a second decoding unit to output a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal and to output a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- The second decoding unit may output the first normal channel signal and the first LFE channel signal using parametric coding based on a first spatial cue for the first downmixed signal and may output the second normal channel signal and the second LFE channel signal using parametric coding based on a second spatial cue for the second downmixed signal, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a third embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to generate a downmixed signal by decoding an encoded result extracted from a bitstream; and a second decoding unit to output a first LFE channel signal and a second LFE channel signal by decoding the downmixed signal.
- The second decoding unit may output the first LFE channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- According to a fourth embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to output a first LFE channel signal by decoding an encoded result extracted from a bitstream and to output a second LFE channel signal by decoding another encoded result extracted from the bitstream; and a delay unit to apply a time delay to the first LFE channel signal and to apply the time delay to the second LFE channel signal.
- The time delay may include a time delay which occurs in decoding a normal channel signal.
- According to a fifth embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to decode a normal channel signal from a bitstream; a delay unit to apply a time delay to the decoded normal channel signal; a second decoding unit to decode an LFE channel signal from the bitstream; and a third decoding unit to decode the decoded LFE channel signal.
- The time delay may include a time delay which occurs in decoding the LFE channel signal.
- The second decoding unit may output a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
- According to an aspect of the present invention, a multi-channel signal including a low-frequency effects (LFE) channel signal in addition to a normal channel signal may be effectively encoded or decoded.
- According to another aspect of the present invention, synchronized multi-channel signals may be output by employing two-stage encoding/decoding or one-stage encoding/decoding employing a time delay.
- FIG. 1 illustrates an encoder and a decoder according to an embodiment.
- FIG. 2 illustrates an encoder which encodes a multi-channel signal including a low-frequency effects (LFE) channel signal according to a first embodiment.
- FIG. 3 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a second embodiment.
- FIG. 4 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a third embodiment.
- FIG. 5 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a fourth embodiment.
- FIG. 6 illustrates a decoder which decodes an encoded result of FIG. 2.
- FIG. 7 illustrates a decoder which decodes an encoded result of FIG. 3.
- FIG. 8 illustrates a decoder which decodes an encoded result of FIG. 4.
- FIG. 9 illustrates a decoder which decodes an encoded result of FIG. 5.
- FIG. 10 illustrates a process of encoding a multi-channel signal using the encoder of FIG. 2.
- FIG. 11 illustrates a decoder which decodes an encoded result of FIG. 10.
- FIG. 12 illustrates a process of encoding a multi-channel signal when encoding bits are sufficient in FIG. 10.
- FIG. 13 illustrates a decoder which decodes an encoded result of FIG. 12.
- FIG. 14 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 4.
- FIG. 15 illustrates another example of encoding a multi-channel signal using the encoder of FIG. 4.
- FIG. 16 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 5.
- FIG. 17 illustrates an example of decoding a multi-channel signal using the decoder of FIG. 9.
- FIG. 18 illustrates an encoder which encodes a multi-channel signal when the multi-channel signal includes an odd number of LFE channel signals according to an embodiment.
- FIG. 19 illustrates an encoder which encodes a normal audio signal, not an LFE channel signal, according to an embodiment.
- FIG. 20 illustrates a decoder which decodes an encoded result of FIG. 19.
- FIG. 21 illustrates an encoding process and a decoding process according to an embodiment.
- FIG. 22 illustrates a USAC encoder and a USAC decoder according to a first embodiment.
- FIG. 23 illustrates a USAC encoder and a USAC decoder according to a second embodiment.
- Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.
- FIG. 1 illustrates an encoder and a decoder according to an embodiment.
- Referring to FIG. 1, the encoder 101 and the decoder 102 are shown. The encoder 101 may encode a multi-channel signal including a plurality of channel signals to generate a bitstream. The decoder 102 may decode the multi-channel signal from the bitstream received from the encoder 101 or stored in a medium of the encoder 101.
- Here, according to one embodiment, the multi-channel signal may include a low-frequency effects (LFE) channel signal. An LFE channel signal refers to a channel signal that carries low-frequency effects within a selective and limited sound range. Here, the low sound range may refer to a low-frequency range from 20 to 120 Hz. An LFE channel signal may be used to supplement the low-frequency information of a main channel signal by transmitting additional low-frequency information.
- Hereinafter, processes of encoding or decoding a multi-channel signal including an LFE channel signal will be described in detail.
- FIG. 2 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a first embodiment.
- FIGS. 2 to 5 illustrate processes of encoding a multi-channel signal including two LFE channel signals, and FIGS. 6 to 9 illustrate processes of decoding encoded results of FIGS. 2 to 5.
- Referring to FIG. 2, the encoder may include a first encoding unit 201, a first encoding unit 202, a second encoding unit 203 and a bitstream formatter 204. Here, the first encoding units 201 and 202 [...]
- In detail, the first encoding unit 201 may generate a downmixed signal dmx1 using an LFE channel signal Lfe1 and a normal channel signal x_i. Here, a normal channel signal may refer to a channel signal which does not exhibit low-frequency effects. The first encoding unit 202 may generate a downmixed signal dmx2 using an LFE channel signal Lfe2 and a normal channel signal x_{i+1}, where i represents an index of a normal channel signal. That is, the encoder of FIG. 2 may encode a multi-channel signal including a normal channel signal coupled to an LFE channel signal.
- Here, the first encoding units 201 and 202 [...]
- When parametric coding is performed using an LFE channel signal, a channel level difference (CLD) may be extracted as a spatial cue from an LFE band. Accordingly, a spatial cue output through parametric coding using an LFE channel signal may contain a relatively smaller amount of data than a spatial cue output through generally used parametric coding. Here, the spatial cues output from the first encoding units 201 and 202 [...]
- The second encoding unit 203 may encode the downmixed signal dmx1 output from the first encoding unit 201 and the downmixed signal dmx2 output from the first encoding unit 202. The downmixed signals dmx1 and dmx2 may be input as a stereo signal to the second encoding unit 203. For instance, the second encoding unit 203 may be an Advanced Audio Coding (AAC) encoder, an MP3 encoder, or the like. The second encoding unit 203 outputs bit3 as an encoded result, which is input to the bitstream formatter 204. The bitstream formatter 204 may convert bit3 into a bitstream.
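The signal flow described for FIG. 2 can be sketched roughly in Python as follows. This is only an illustration under stated assumptions: the function names `tto_lfe` and `encode_type1`, the averaging downmix, and the energy ratio computed over the whole signal (rather than over the LFE band only) are hypothetical simplifications, and the stereo pairing merely stands in for a real core coder such as AAC.

```python
import math


def tto_lfe(normal, lfe):
    """Toy two-to-one box: return a mono downmix and one CLD value in dB.

    Assumption: the downmix is a plain average and the CLD is a full-signal
    energy ratio; the text only says a CLD is extracted from an LFE band.
    """
    eps = 1e-12  # keeps the ratio finite when one input is silent
    downmix = [(a + b) / 2.0 for a, b in zip(normal, lfe)]
    e_normal = sum(a * a for a in normal) + eps
    e_lfe = sum(b * b for b in lfe) + eps
    cld_db = 10.0 * math.log10(e_normal / e_lfe)
    return downmix, cld_db


def encode_type1(x_i, lfe1, x_i1, lfe2):
    """Mimic FIG. 2: two parametric stages feeding one stereo core stage."""
    dmx1, bit1 = tto_lfe(x_i, lfe1)    # first encoding unit 201
    dmx2, bit2 = tto_lfe(x_i1, lfe2)   # first encoding unit 202
    # Stand-in for second encoding unit 203: dmx1 and dmx2 travel as a stereo pair.
    bit3 = list(zip(dmx1, dmx2))
    return {"bit1": bit1, "bit2": bit2, "bit3": bit3}
```

In this sketch the two spatial cues are single numbers, which reflects the point that LFE-mode cues may be much smaller than the cues of general parametric coding.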
- FIG. 3 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a second embodiment.
- The encoder of FIG. 3 may include a first encoding unit 301, a second encoding unit 302, a first encoding unit 303, a second encoding unit 304 and a bitstream formatter 305.
- The first encoding units 301 and 303 of FIG. 3 may operate in the same manner as the first encoding units 201 and 202 of FIG. 2. That is, the first encoding units 301 and 303 [...] The first encoding unit 301 may generate a downmixed signal dmx1 using an LFE channel signal Lfe1 and a normal channel signal x_i. The first encoding unit 303 may generate a downmixed signal dmx2 using an LFE channel signal Lfe2 and a normal channel signal x_{i+1}.
- The downmixed signal dmx1 resulting from encoding by the first encoding unit 301 is input as a mono signal to the second encoding unit 302. The second encoding unit 302 may output bit3 using the downmixed signal dmx1. The downmixed signal dmx2 resulting from encoding by the first encoding unit 303 is input as a mono signal to the second encoding unit 304. The second encoding unit 304 may output bit4 using the downmixed signal dmx2.
- FIG. 4 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a third embodiment.
- Referring to FIG. 4, the encoder may include a first encoding unit 401, a second encoding unit 402 and a bitstream formatter 403. LFE channel signals Lfe1 and Lfe2 may be coupled to each other and input to the first encoding unit 401. The first encoding unit 401 may output a downmixed signal dmx3 as a mono signal using the LFE channel signals Lfe1 and Lfe2. Here, bit1 denotes a spatial cue derived by the first encoding unit 401 through parametric coding.
- The downmixed signal dmx3 may be input to the second encoding unit 402. Here, the second encoding unit 402 may code an LFE band in the downmixed signal dmx3. Unified Speech and Audio Coding (USAC) and Advanced Audio Coding (AAC) may have a separate coding mode for coding an LFE band. The second encoding unit 402 may use a coding mode provided by USAC or AAC. Bit2 output from the second encoding unit 402 and bit1 output from the first encoding unit 401 may be output as a bitstream through the bitstream formatter 403.
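To make the idea of a spatial cue taken from the LFE band more concrete, the following Python sketch downmixes two LFE channels into a mono signal and measures a single CLD from the 20 to 120 Hz range mentioned earlier. The names `band_energy` and `encode_type3`, the averaging downmix and the naive DFT are assumptions chosen for readability; they are not the parametric coding actually used by MPEG Surround or USAC.

```python
import cmath
import math


def band_energy(signal, sample_rate, f_lo, f_hi):
    """Sum of squared DFT magnitudes for bins whose frequency lies in [f_lo, f_hi]."""
    n = len(signal)
    energy = 0.0
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if f_lo <= freq <= f_hi:
            coeff = sum(s * cmath.exp(-2j * math.pi * k * t / n)
                        for t, s in enumerate(signal))
            energy += abs(coeff) ** 2
    return energy


def encode_type3(lfe1, lfe2, sample_rate, f_lo=20.0, f_hi=120.0):
    """Return (dmx3, bit1): a mono downmix and a CLD in dB measured in the LFE band."""
    eps = 1e-12  # keeps the ratio finite when a channel is silent
    dmx3 = [(a + b) / 2.0 for a, b in zip(lfe1, lfe2)]
    e1 = band_energy(lfe1, sample_rate, f_lo, f_hi) + eps
    e2 = band_energy(lfe2, sample_rate, f_lo, f_hi) + eps
    bit1 = 10.0 * math.log10(e1 / e2)
    return dmx3, bit1
```

For two 60 Hz tones whose amplitudes differ by a factor of two, the sketch yields a CLD of about 6 dB, and the mono downmix would then be passed to the LFE coding mode of the core coder.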
- FIG. 5 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a fourth embodiment.
- Referring to FIG. 5, the encoder may include a delay unit 501, a second encoding unit 502, a delay unit 503, a second encoding unit 504 and a bitstream formatter 505. FIG. 5 illustrates a process of encoding an LFE channel signal using the second encoding units 502 and 504 [...]
- In FIG. 5, the second encoding units 502 and 504 [...]
- Thus, the delay units 501 and 503 [...] the second encoding units 502 and 504 [...] the bitstream formatter 505.
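The role of the delay units in the fourth embodiment can be illustrated with a short Python sketch. It assumes the delay is an integer number of samples `tau` and that delayed samples are filled with zeros; the names `apply_delay` and `encode_type4_front_end` are hypothetical, and the value of `tau` would in practice be the delay that arises in encoding the normal channel signals.

```python
def apply_delay(signal, tau):
    """Delay `signal` by `tau` samples: pad zeros at the front, keep the original length."""
    if tau < 0:
        raise ValueError("tau must be non-negative")
    return ([0.0] * tau + list(signal))[:len(signal)]


def encode_type4_front_end(lfe1, lfe2, tau):
    """Apply the same delay to both LFE channels before they reach the LFE-mode coders."""
    return apply_delay(lfe1, tau), apply_delay(lfe2, tau)
```

Applying the same `tau` to both LFE channels keeps them aligned with each other as well as with the normal channels that passed through an extra encoding stage.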
- FIG. 6 illustrates a decoder which decodes an encoded result of FIG. 2.
- Referring to FIG. 6, the decoder may include a bitstream deformatter 601, a first decoding unit 602, a second decoding unit 603 and a second decoding unit 604. The decoder of FIG. 6 may operate in an inverse manner to the encoder of FIG. 2.
- A bitstream input to the bitstream deformatter 601 may be the bitstream generated in FIG. 2. The bitstream deformatter 601 may output bit1, bit2 and bit3 from the bitstream. Bit1, bit2 and bit3 are the same as those mentioned in FIG. 2.
- Bit3 may be input to the first decoding unit 602. The first decoding unit 602 may generate downmixed signals dmx1 and dmx2 using bit3. The second decoding unit 603 may perform parametric coding on bit1 as a spatial cue and the downmixed signal dmx1 to output a normal channel signal x_i and an LFE channel signal Lfe1. Likewise, the second decoding unit 604 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx2 to output a normal channel signal x_{i+1} and an LFE channel signal Lfe2.
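A minimal Python sketch of the decoding flow of FIG. 6 might look as follows. It assumes that each spatial cue is a single CLD value in dB, that the core-decoded stereo signal is available as a list of sample pairs, and that upmixing is a pure gain split without decorrelation; the names `ott` and `decode_type1` are hypothetical and the gain formula is a common textbook simplification rather than the patent's own method.

```python
import math


def ott(downmix, cld_db):
    """Toy one-to-two box: split a mono downmix into two channels using a CLD in dB."""
    ratio = 10.0 ** (cld_db / 10.0)               # energy ratio first/second
    g1 = math.sqrt(2.0 * ratio / (1.0 + ratio))   # gain of the first output
    g2 = math.sqrt(2.0 / (1.0 + ratio))           # gain of the second output
    return [g1 * s for s in downmix], [g2 * s for s in downmix]


def decode_type1(bit1, bit2, bit3):
    """Mimic FIG. 6: split the stereo core output, then upmix each downmix."""
    dmx1 = [pair[0] for pair in bit3]   # stand-in for first decoding unit 602
    dmx2 = [pair[1] for pair in bit3]
    x_i, lfe1 = ott(dmx1, bit1)         # second decoding unit 603
    x_i1, lfe2 = ott(dmx2, bit2)        # second decoding unit 604
    return x_i, lfe1, x_i1, lfe2
```

With a CLD of 0 dB both outputs equal the downmix, and with a CLD of 10 dB the first output carries ten times the energy of the second.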
- FIG. 7 illustrates a decoder which decodes an encoded result of FIG. 3.
- Referring to FIG. 7, the decoder may include a bitstream deformatter 701, a first decoding unit 702, a second decoding unit 703, a first decoding unit 704 and a second decoding unit 705. The decoder of FIG. 7 may operate in an inverse manner to the encoder of FIG. 3.
- A bitstream input to the bitstream deformatter 701 may be the bitstream generated in FIG. 3. The bitstream deformatter 701 may output bit1, bit2, bit3 and bit4 from the bitstream. Bit1, bit2, bit3 and bit4 are the same as those mentioned in FIG. 3.
- Bit3 may be input to the first decoding unit 702, and bit4 may be input to the first decoding unit 704. The first decoding unit 702 may generate a downmixed signal dmx1 using bit3. The first decoding unit 704 may generate a downmixed signal dmx2 using bit4.
- Subsequently, the second decoding unit 703 may perform parametric coding on bit1 as a spatial cue and the downmixed signal dmx1 to output a normal channel signal x_i and an LFE channel signal Lfe1. Likewise, the second decoding unit 705 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx2 to output a normal channel signal x_{i+1} and an LFE channel signal Lfe2.
- FIG. 8 illustrates a decoder which decodes an encoded result of FIG. 4.
- Referring to FIG. 8, the decoder may include a bitstream deformatter 801, a first decoding unit 802 and a second decoding unit 803. The decoder of FIG. 8 may operate in an inverse manner to the encoder of FIG. 4.
- A bitstream input to the bitstream deformatter 801 may be the bitstream generated in FIG. 4. The bitstream deformatter 801 may output bit1 and bit2 from the bitstream. Bit1 and bit2 are the same as those mentioned in FIG. 4.
- Bit2 may be input to the first decoding unit 802, and bit1 may be input to the second decoding unit 803. The first decoding unit 802 may generate a downmixed signal dmx3 using bit2. The second decoding unit 803 may perform parametric coding on bit1 as a spatial cue and the downmixed signal dmx3 to output LFE channel signals Lfe1 and Lfe2. In FIG. 8, the first decoding unit 802 and the second decoding unit 803 may perform parametric coding on an LFE band of the input downmixed signal dmx3.
- FIG. 9 illustrates a decoder which decodes an encoded result of FIG. 5.
- Referring to FIG. 9, the decoder may include a bitstream deformatter 901, a first decoding unit 902, a delay unit 903, a first decoding unit 904 and a delay unit 905. The decoder of FIG. 9 may operate in an inverse manner to the encoder of FIG. 5.
- A bitstream input to the bitstream deformatter 901 may be the bitstream generated in FIG. 5. The bitstream deformatter 901 may output bit1 and bit2 from the bitstream. Bit1 and bit2 are the same as those mentioned in FIG. 5.
- Bit1 may be input to the first decoding unit 902, and bit2 may be input to the first decoding unit 904. The first decoding unit 902 may generate an LFE channel signal Lfe1(n-τ_enc) using bit1, and the first decoding unit 904 may generate an LFE channel signal Lfe2(n-τ_enc) using bit2.
- The delay unit 903 may apply a time delay to the LFE channel signal Lfe1(n-τ_enc) to output Lfe1(n-τ_enc-τ_dec). Likewise, the delay unit 905 may apply a time delay to the LFE channel signal Lfe2(n-τ_enc) to output Lfe2(n-τ_enc-τ_dec).
- That is, unlike in FIGS. 6 to 8, since a decoding process is carried out once in FIG. 9, the delay units 903 and 905 may apply a time delay τ_dec occurring in one-time decoding so that signals subjected to one-time decoding synchronize with those subjected to two-time decoding.
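The accumulated delay Lfe(n-τ_enc-τ_dec) on the one-stage path can be checked with a small Python sketch. It assumes integer sample delays with zero fill and treats the LFE-mode coder and decoder as transparent; the names `delay` and `lfe_one_stage_path` are hypothetical.

```python
def delay(signal, tau):
    """Delay by tau samples, zero-filling the start and keeping the length."""
    return ([0.0] * tau + list(signal))[:len(signal)]


def lfe_one_stage_path(lfe, tau_enc, tau_dec):
    """Follow one LFE channel through the encoder-side and decoder-side delay units."""
    encoder_side = delay(lfe, tau_enc)      # Lfe(n - tau_enc), as in FIG. 5
    return delay(encoder_side, tau_dec)     # Lfe(n - tau_enc - tau_dec), as in FIG. 9
```

For example, with `tau_enc = 1` and `tau_dec = 2` the first input sample appears at index 3 of the output, which is where a signal that passed through two-stage coding with the same stage delays would also appear.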
- FIG. 10 illustrates a process of encoding a multi-channel signal using the encoder of FIG. 2.
- FIG. 10 illustrates an encoder for a multi-channel signal which adopts the encoder Type 1 illustrated in FIG. 2. In FIG. 10, Two To Ones (TTOs) 1001, 1002, 1004 and 1005 may encode an input signal according to a parametric coding mode for an MPEG Surround stereo signal. That is, the TTOs may correspond to the first encoding units of FIG. 2, and USAC encoders may correspond to the second encoding unit of FIG. 2.
- In FIG. 10, the TTOs [...]
- FIG. 10 illustrates an encoding process when N multi-channel signals are input. In detail, in a first operation, the N multi-channel signals may be subjected to parametric coding via the TTOs into M downmixed signals dmx1 to dmxM. In a second operation, the M downmixed signals may be input in a stereo form and encoded through USAC core coding. In FIG. 10, LFE channel signals Lfe1 and Lfe2 may be coupled to normal channel signals to be input to the TTO 1004 and the TTO 1005.
- That is, referring to FIG. 10, normal channel signals of the multi-channel signals may be coupled in pairs and downmixed, and a downmixed result may be subjected to stereo coding by the USAC encoders. Among the normal channel signals of the multi-channel signals, two normal channel signals x_{2M-1} and x_{2M} may be respectively coupled to the LFE channel signals Lfe1 and Lfe2 and input to the TTOs(Lfe).
- Although FIG. 10 shows that the encoder Type 1 of FIG. 2 is adopted, the encoder Type 2 illustrated in FIG. 3 may be applied instead of the encoder Type 1.
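The channel pairing described for FIG. 10 can be sketched in Python as follows. The function name `pair_channels` and the use of channel labels instead of sample data are assumptions for illustration; the sketch only shows which inputs each TTO would receive, with the last normal channels coupled one-to-one to the LFE channels.

```python
def pair_channels(normal, lfe):
    """Build TTO input pairs: normal channels two by two, the last ones coupled to LFE channels."""
    if len(lfe) > len(normal):
        raise ValueError("each LFE channel needs a normal channel to couple with")
    split = len(normal) - len(lfe)
    plain = normal[:split]
    if len(plain) % 2 != 0:
        raise ValueError("normal channels left for plain pairing must be even in number")
    pairs = [(plain[k], plain[k + 1]) for k in range(0, len(plain), 2)]
    pairs += list(zip(normal[split:], lfe))  # these pairs would go to the LFE-mode TTOs
    return pairs
```

Each returned pair corresponds to one downmixed signal, and the downmixed signals would then be grouped in stereo form for the core coder.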
- FIG. 11 illustrates a decoder which decodes an encoded result of FIG. 10.
- FIG. 11 illustrates a decoder for a multi-channel signal which adopts the decoder Type 1 illustrated in FIG. 6. In FIG. 11, One To Twos (OTTs) 1103, 1104, 1106 and 1107 may decode an input signal according to a parametric coding mode for an MPEG Surround stereo signal. That is, the OTTs may correspond to the second decoding units of FIG. 6, and USAC decoders may correspond to the first decoding unit.
- In FIG. 11, the OTTs [...] In FIG. 11, the encoded result may be decoded to output N multi-channel audio signals.
- In detail, in a first operation, M downmixed signals may be output from a bitstream via the USAC decoders. In a second operation, the M downmixed signals may be input to the respective OTTs to output stereo signals. The OTTs [...]
- FIG. 12 illustrates a process of encoding a multi-channel signal when encoding bits are sufficient in FIG. 10.
- When encoding bits for normal channel signals included in a multi-channel signal are sufficient, the encoding process of FIG. 12 may be performed. That is, normal channel signals x_1 to x_{2M-2} may be encoded by USAC encoders [...] delay units [...] USAC encoders [...]
- Here, the time delay τ_enc occurs in OTTs [...] USAC encoder 1209 is a QMF signal, a time delay occurring by QMF synthesis may be excluded when calculating τ_enc.
- FIG. 13 illustrates a decoder which decodes an encoded result of FIG. 12.
- The decoder of FIG. 13 may perform an inverse process to FIG. 12. Referring to FIG. 13, a normal channel signal is decoded by USAC decoders [...] delay units [...] bitstream deformatter 1301 may be decoded by a USAC decoder 1308 to generate downmixed signals, and the downmixed signals may be respectively input to OTTs [...]
- Here, since the LFE channel signals are subjected to decoding two times, the delay units [...] USAC decoders [...] OTTs [...]
- FIG. 14 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 4.
- In FIG. 14, the encoder Type 3 illustrated in FIG. 4 may be used. Normal channel signals x_1 to x_{2M} of a multi-channel signal may be coupled in pairs and input to TTOs 1401 and 1402. The TTOs 1401 and 1402 [...] USAC encoder 1403.
- Meanwhile, LFE channel signals Lfe1 and Lfe2 included in the multi-channel signal may be coupled as a pair and input to a TTO 1404. The TTO 1404 may perform parametric coding using the two LFE channel signals Lfe1 and Lfe2 to output a downmixed signal dmx3 in a mono form. Subsequently, a USAC encoder 1405 may encode the downmixed signal dmx3 in the LFE mode.
- FIG. 15 illustrates another example of encoding a multi-channel signal using the encoder of FIG. 4.
- In FIG. 15, the encoder Type 3 illustrated in FIG. 4 may be used. Here, in FIG. 15, a normal channel signal may be encoded by USAC encoders rather than through TTOs, unlike in FIG. 14. As illustrated, since an LFE channel signal is subjected to encoding two times through a TTO 1507 and a USAC encoder 1508, delay units may apply a time delay occurring in the TTO 1507 to the normal channel signal.
- Meanwhile, LFE channel signals Lfe1 and Lfe2 may be encoded by the TTO 1507 in the LFE mode to output a downmixed signal dmx3, and the downmixed signal dmx3 may be encoded by the USAC encoder 1508.
- While FIGS. 14 and 15 illustrate the encoders, corresponding decoders may operate according to inverse processes. In detail, a normal channel signal may be output from a bitstream obtained in FIG. 14 via a USAC decoder and an OTT. Also, LFE channel signals Lfe1 and Lfe2 may be output from the bitstream obtained in FIG. 14 via a USAC decoder and an OTT.
- In addition, a normal channel signal may be output from a bitstream obtained in FIG. 15 via a USAC decoder and a delay unit. Also, LFE channel signals Lfe1 and Lfe2 may be output from the bitstream obtained in FIG. 15 via a USAC decoder and an OTT.
- FIG. 16 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 5.
- FIG. 16 illustrates an encoder for a multi-channel signal which adopts the encoder Type 4 illustrated in FIG. 5.
- Normal channel signals may be converted into downmixed signals through TTOs [...] USAC encoder 1603.
- Meanwhile, delay units [...] TTOs [...] USAC encoders [...]
- FIG. 17 illustrates an example of decoding a multi-channel signal using the decoder of FIG. 9.
- In FIG. 17, the decoder Type 4 illustrated in FIG. 9 is used. Referring to FIG. 17, normal channel signals may be output through a USAC decoder 1702 and a TTO [...] USAC decoders [...] delay units [...]
- Since the LFE channel signals are subjected to decoding once, the delay units [...]
- FIG. 18 illustrates an encoder which encodes a multi-channel signal when the multi-channel signal includes an odd number of LFE channel signals according to an embodiment.
- While FIGS. 2 to 17 illustrate an even number of LFE channel signals, FIG. 18 illustrates an odd number of LFE channel signals.
- Referring to FIG. 18, one LFE channel signal Lfe_{2n+1} may be input to a delay unit 1801, and a time delay τ may be applied to the LFE channel signal. The time-delayed LFE channel signal may be encoded by a second encoding unit 1802 in an LFE mode to output bit1. That is, an odd number of LFE channel signals may be processed by the encoder Type 4 of FIG. 5 or the decoder Type 4 of FIG. 9.
- Although not shown in FIG. 18, unlike the LFE channel signal, a normal channel signal is encoded by a first encoding unit and the second encoding unit, and thus the delay unit 1801 may need to apply a time delay occurring due to the first encoding unit to the LFE channel signal for synchronization with the normal channel signal.
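One way to picture the odd-count case is to separate the LFE channels into pairs plus one remaining channel, which would then take the delay-and-encode path of FIG. 18. The Python sketch below works on channel labels only; the name `route_lfe_channels` is hypothetical, and how the paired channels are subsequently handled is an assumption, since the text only describes the path of the single remaining channel.

```python
def route_lfe_channels(lfe):
    """Split LFE channels into pairs plus, for an odd count, one leftover channel."""
    even_count = len(lfe) - (len(lfe) % 2)
    pairs = [(lfe[k], lfe[k + 1]) for k in range(0, even_count, 2)]
    leftover = lfe[-1] if len(lfe) % 2 else None  # would go through delay unit 1801
    return pairs, leftover
```

With three LFE channels, for instance, the first two form a pair and the third is returned as the leftover channel to be delayed and encoded in the LFE mode.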
- FIG. 19 illustrates an encoder which encodes a normal audio signal, not an LFE channel signal, according to an embodiment.
- Referring to FIG. 19, normal channel signals x_1 and x_2 may be subjected to parametric coding by a first encoding unit 1901 to be converted into a downmixed signal dmx1 along with a spatial cue bit1. Likewise, normal channel signals x_3 and x_4 may be subjected to parametric coding by a first encoding unit 1902 to be converted into a downmixed signal dmx2 along with a spatial cue bit2.
- As described above, parametric coding applied to the normal channel signals may extract not only a CLD but also an ICC and an IPD as spatial cues. The downmixed signals dmx1 and dmx2 may be input in a stereo form to a second encoding unit 1903 and encoded to output bit3. Bit3 may be converted into a bitstream by a bitstream formatter 1904.
- FIG. 20 illustrates a decoder which decodes an encoded result of FIG. 19.
- Referring to FIG. 20, spatial cues bit1 and bit2 and encoded bit3 may be output by a bitstream deformatter 2001 from the bitstream generated in FIG. 19.
- A first decoding unit 2002 may decode bit3 to output downmixed signals dmx1 and dmx2. A second decoding unit 2003 may decode the downmixed signal dmx1 to output normal channel signals x_1 and x_2. Likewise, a second decoding unit 2004 may decode the downmixed signal dmx2 to output normal channel signals x_3 and x_4.
- FIG. 21 illustrates an encoding process and a decoding process according to an embodiment.
- The foregoing first encoding units may correspond to TTOs 2101 and 2102 of FIG. 21, and the foregoing second encoding units may correspond to a USAC encoder 2103. Also, the foregoing first decoding units may correspond to a USAC decoder 2104, and the foregoing second decoding units may correspond to OTTs [...]
- (i) Four normal channel signals or (ii) results of coupling one normal channel signal and one LFE channel signal may be input to the TTOs 2101 and 2102. The TTOs 2101 and 2102 [...] USAC encoder 2103 may encode the downmixed signal.
- On the contrary, the USAC decoder 2104 may output two downmixed signals from a bitstream. The OTTs [...]
- FIG. 22 illustrates a USAC encoder and a USAC decoder according to a first embodiment.
- The foregoing embodiments illustrate configurations in which a USAC encoder is separate from a TTO or a USAC decoder is separate from an OTT. Alternatively, as in FIG. 22, a USAC encoder may include TTOs [...] extended USAC encoder 2201. Likewise, a USAC decoder may include OTTs [...] extended USAC decoder 2202.
- (i) Four normal channel signals or (ii) one normal channel signal and one LFE channel signal may be subjected to parametric coding by the TTOs 2203 and 2204 and output as downmixed signals. The downmixed signals output from the TTOs 2203 and 2204 may be input to a TTO 2205 and be subjected to parametric coding one more time. A result of parametric coding is subjected to frequency extension by a spectral band replication (SBR) unit 2206, and a non-frequency-extended core band may be encoded by a core encoder 2207.
- In a bitstream generated by the extended USAC encoder 2201, the non-frequency-extended core band is decoded by a core decoder 2208, and a decoded result may be input to and subjected to frequency extension by an SBR 2209, thereby reconstructing an original signal. Subsequently, a result of frequency extension by the SBR 2209 may be subjected to parametric coding by an OTT 2210 to generate two downmixed signals, and the downmixed signals may be subjected to parametric coding by the OTTs [...]
- FIG. 23 illustrates a USAC encoder and a USAC decoder according to a second embodiment.
- In FIG. 23, positions of an SBR 2305 and a TTO 2306 in an extended USAC encoder 2301 and positions of an OTT 2309 and an SBR 2310 in an extended USAC decoder 2302 are changed from those in FIG. 22. Other components may be equivalent to those in FIG. 22.
- The apparatuses described herein may be implemented using hardware components, software components, and/or combinations of hardware components and software components. For instance, the units and components illustrated in the embodiments may be implemented using one or more general-purpose or special purpose computers, such as, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable array (FPA), a programmable logic unit (PLU), a microprocessor or any other device capable of responding to and executing instructions. A processing device may run an operating system (OS) and one or more software applications that run on the OS. The processing device also may access, store, manipulate, process, and create data in response to execution of the software. For purpose of simplicity, the description of a processing device is used as singular; however, one skilled in the art will appreciate that a processing device may include multiple processing elements and multiple types of processing elements. For example, a processing device may include multiple processors or a processor and a controller. In addition, different processing configurations are possible, such as parallel processors.
- The software may include a computer program, a piece of code, an instruction, or one or more combinations thereof, to independently or collectively instruct or configure the processing device to operate as desired. Software and/or data may be embodied permanently or temporarily in any type of machine, component, physical or virtual equipment, computer storage medium or device, or in a propagated signal wave in order to provide instructions or data to the processing device or to be interpreted by the processing device. The software may also be distributed over network coupled computer systems so that the software is stored and executed in a distributed fashion. The software and data may be stored by one or more non-transitory computer readable recording mediums.
- The methods according to the embodiments may be realized as program instructions implemented by various computers and be recorded in non-transitory computer-readable media. The media may also include, alone or in combination with the program instructions, data files, data structures, and the like. The program instructions recorded in the media may be designed and configured specially for the embodiments or be known and available to those skilled in computer software. Examples of the non-transitory computer readable recording medium may include magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD ROM disks and DVDs; magneto-optical media such as floptical disks; and hardware devices that are specially configured to store and perform program instructions, such as read-only memory (ROM), random access memory (RAM), flash memory, and the like. Examples of program instructions include both machine codes, such as produced by a compiler, and higher level language codes that may be executed by the computer using an interpreter. The described hardware devices may be configured to act as one or more software modules in order to perform the operations of the above-described exemplary embodiments, or vice versa.
- While a few exemplary embodiments have been shown and described with reference to the accompanying drawings, it will be apparent to those skilled in the art that various modifications and variations can be made from the foregoing descriptions. For example, adequate effects may be achieved even if the foregoing processes and methods are carried out in different order than described above, and/or the aforementioned elements, such as systems, structures, devices, or circuits, are combined or coupled in different forms and modes than as described above or be substituted or switched with other components or equivalents. Thus, other implementations, alternative embodiments and equivalents to the claimed subject matter are construed as being within the appended claims.
Claims (20)
- A method of encoding a multi-channel signal, the method comprising: outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are comprised in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are comprised in the multi-channel signal; encoding the first downmixed signal and the second downmixed signal together; and generating a bitstream comprising the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- The method of claim 1, wherein the outputting of the first cue outputs the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode, the outputting of the second cue outputs the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue comprise a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- A method of encoding a multi-channel signal, the method comprising: outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are comprised in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are comprised in the multi-channel signal; encoding the first downmixed signal; encoding the second downmixed signal separately from the first downmixed signal; and generating a bitstream comprising the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
- The method of claim 3, wherein the outputting of the first cue outputs the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode, the outputting of the second cue outputs the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue comprise a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- A method of encoding a multi-channel signal, the method comprising: outputting a downmixed signal and a spatial cue by encoding a first low-frequency effects (LFE) channel signal and a second LFE channel signal which are comprised in a multi-channel signal; encoding the downmixed signal; and generating a bitstream comprising the encoded downmixed signal and the spatial cue.
- The method of claim 5, wherein the outputting outputs the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue comprises a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- A method of encoding a multi-channel signal, the method comprising: applying a time delay to a first low-frequency effects (LFE) channel signal comprised in a multi-channel signal; applying the time delay to a second LFE channel signal comprised in the multi-channel signal; encoding the first LFE channel signal to which the time delay is applied; encoding the second LFE channel signal to which the time delay is applied; and generating a bitstream comprising the encoded first LFE channel signal and the encoded second LFE channel signal.
- The method of claim 7, wherein the time delay comprises a time delay which occurs in encoding a normal channel signal comprised in the multi-channel signal.
- A method of encoding a multi-channel signal, the method comprising: applying a time delay to a normal channel signal comprised in a multi-channel signal; encoding the normal channel signal to which the time delay is applied; outputting a downmixed signal and a spatial cue by encoding a low-frequency effects (LFE) channel signal comprised in the multi-channel signal; and encoding the encoded LFE channel signal, wherein the time delay comprises a time delay which occurs in encoding the LFE channel signal.
- The method of claim 9, wherein the outputting outputs the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
- A method of decoding a multi-channel signal, the method comprising: generating a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream; outputting a first normal channel signal and a first low-frequency effects (LFE) channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- The method of claim 11, wherein the outputting of the first normal channel signal and the first LFE channel signal outputs the first normal channel signal and the first LFE channel signal from the first downmixed signal by applying a first spatial cue to parametric coding, the outputting of the second normal channel signal and the second LFE channel signal outputs the second normal channel signal and the second LFE channel signal from the second downmixed signal by applying a second spatial cue to parametric coding, and the first spatial cue and the second spatial cue comprise a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- A method of decoding a multi-channel signal, the method comprising: generating a first downmixed signal by decoding an encoded result extracted from a bitstream; generating a second downmixed signal by decoding another encoded result extracted from the bitstream; outputting a first normal channel signal and a first low-frequency effects (LFE) channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
- The method of claim 13, wherein the outputting of the first normal channel signal and the first LFE channel signal outputs the first normal channel signal and the first LFE channel signal using parametric coding based on a first spatial cue for the first downmixed signal, the outputting of the second normal channel signal and the second LFE channel signal outputs the second normal channel signal and the second LFE channel signal using parametric coding based on a second spatial cue for the second downmixed signal, and the first spatial cue and the second spatial cue comprise a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- A method of decoding a multi-channel signal, the method comprising: generating a downmixed signal by decoding an encoded result extracted from a bitstream; and outputting a first low-frequency effects (LFE) channel signal and a second LFE channel signal by decoding the downmixed signal.
- The method of claim 15, wherein the outputting outputs the first LFE channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue comprises a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
- A method of decoding a multi-channel signal, the method comprising: outputting a first low-frequency effects (LFE) channel signal by decoding an encoded result extracted from a bitstream; outputting a second LFE channel signal by decoding another encoded result extracted from the bitstream; applying a time delay to the first LFE channel signal; and applying the time delay to the second LFE channel signal.
- The method of claim 17, wherein the time delay comprises a time delay which occurs in decoding a normal channel signal.
- A method of decoding a multi-channel signal, the method comprising: decoding a normal channel signal from a bitstream; applying a time delay to the decoded normal channel signal; decoding a low-frequency effects (LFE) channel signal from the bitstream; and decoding the decoded LFE channel signal, wherein the time delay comprises a time delay which occurs in decoding the LFE channel signal.
- The method of claim 19, wherein the decoding of the LFE channel signal outputs a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
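The claims above recite two recurring operations: downmixing a channel pair (for example a normal channel and an LFE channel) into one signal plus a channel level difference (CLD) spatial cue, and applying a time delay so that separately coded channels stay aligned. The following is a minimal illustrative sketch of those two ideas, not the claimed implementation. It assumes a single full-band CLD computed on time-domain samples, whereas the claims derive the CLD from an LFE band, and practical parametric coders typically work per frequency band. All function names (`cld_db`, `downmix_pair`, `upmix_pair`, `apply_delay`) are hypothetical and chosen only for illustration.

```python
import math


def cld_db(ch_a, ch_b, eps=1e-12):
    """Channel level difference in dB between two equal-length sample blocks.

    Assumption: one full-band value per block; eps avoids log(0) and
    division by zero for silent channels.
    """
    energy_a = sum(x * x for x in ch_a)
    energy_b = sum(x * x for x in ch_b)
    return 10.0 * math.log10((energy_a + eps) / (energy_b + eps))


def downmix_pair(normal, lfe):
    """Encoder side: sum the pair into one downmixed signal and return
    the CLD of (normal, lfe) as the spatial cue."""
    downmixed = [n + l for n, l in zip(normal, lfe)]
    return downmixed, cld_db(normal, lfe)


def upmix_pair(downmixed, cld):
    """Decoder side: split the downmixed signal into two channels using
    gains derived from the CLD. The squared gains sum to 1, so the two
    outputs together carry the energy of the downmixed signal."""
    ratio = 10.0 ** (cld / 10.0)  # energy ratio normal / LFE
    gain_normal = math.sqrt(ratio / (1.0 + ratio))
    gain_lfe = math.sqrt(1.0 / (1.0 + ratio))
    normal = [gain_normal * s for s in downmixed]
    lfe = [gain_lfe * s for s in downmixed]
    return normal, lfe


def apply_delay(signal, delay):
    """Delay a block by `delay` samples: prepend zeros and truncate so the
    block keeps its original length. A non-positive delay returns a copy."""
    if delay <= 0:
        return list(signal)
    return ([0.0] * delay + list(signal))[:len(signal)]
```

In this sketch the decoder recovers only the level relationship between the two channels, not their original waveforms, which is the usual trade-off of cue-based coding. `apply_delay` stands in for the compensation recited in claims 7 to 10 and 17 to 20, where one signal path is delayed by the latency incurred in coding the other.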
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR20130083073 | 2013-07-15 | ||
KR20130123416 | 2013-10-16 | ||
PCT/KR2014/006406 WO2015009040A1 (en) | 2013-07-15 | 2014-07-15 | Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal |
Publications (2)
Publication Number | Publication Date |
---|---|
EP3023984A1 (en) | 2016-05-25 |
EP3023984A4 (en) | 2017-03-08 |
Family
ID=52572683
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP14826617.4A Ceased EP3023984A4 (en) | 2013-07-15 | 2014-07-15 | Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP3023984A4 (en) |
KR (1) | KR20150009474A (en) |
WO (1) | WO2015009040A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180035230A1 (en) * | 2015-02-17 | 2018-02-01 | Electronics And Telecommunications Research Institute | Multichannel signal processing method, and multichannel signal processing apparatus for performing same |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016133366A1 (en) * | 2015-02-17 | 2016-08-25 | 한국전자통신연구원 | Multichannel signal processing method, and multichannel signal processing apparatus for performing same |
WO2018128457A2 (en) * | 2017-01-05 | 2018-07-12 | 엘지전자 주식회사 | Method for performing channel-coding of information on basis of polar code |
CN115410584A (en) * | 2021-05-28 | 2022-11-29 | 华为技术有限公司 | Method and apparatus for encoding multi-channel audio signal |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7644003B2 (en) * | 2001-05-04 | 2010-01-05 | Agere Systems Inc. | Cue-based audio coding/decoding |
US7805313B2 (en) * | 2004-03-04 | 2010-09-28 | Agere Systems Inc. | Frequency-based coding of channels in parametric multi-channel coding systems |
KR101271069B1 (en) * | 2005-03-30 | 2013-06-04 | 돌비 인터네셔널 에이비 | Multi-channel audio encoder and decoder, and method of encoding and decoding |
US7751572B2 (en) * | 2005-04-15 | 2010-07-06 | Dolby International Ab | Adaptive residual audio coding |
KR100891685B1 (en) * | 2005-08-30 | 2009-04-03 | 엘지전자 주식회사 | Apparatus for encoding and decoding audio signal and method thereof |
EP2372701B1 (en) * | 2006-10-16 | 2013-12-11 | Dolby International AB | Enhanced coding and parameter representation of multichannel downmixed object coding |
US20120093323A1 (en) * | 2010-10-14 | 2012-04-19 | Samsung Electronics Co., Ltd. | Audio system and method of down mixing audio signals using the same |
2014
- 2014-07-15 WO PCT/KR2014/006406 patent/WO2015009040A1/en active Application Filing
- 2014-07-15 EP EP14826617.4A patent/EP3023984A4/en not_active Ceased
- 2014-07-15 KR KR20140089269A patent/KR20150009474A/en not_active IP Right Cessation
Non-Patent Citations (1)
Title |
---|
See references of WO2015009040A1 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180035230A1 (en) * | 2015-02-17 | 2018-02-01 | Electronics And Telecommunications Research Institute | Multichannel signal processing method, and multichannel signal processing apparatus for performing same |
US10225675B2 (en) * | 2015-02-17 | 2019-03-05 | Electronics And Telecommunications Research Institute | Multichannel signal processing method, and multichannel signal processing apparatus for performing the method |
US10638243B2 (en) | 2015-02-17 | 2020-04-28 | Electronics And Telecommunications Research Institute | Multichannel signal processing method, and multichannel signal processing apparatus for performing the method |
Also Published As
Publication number | Publication date |
---|---|
WO2015009040A1 (en) | 2015-01-22 |
EP3023984A4 (en) | 2017-03-08 |
KR20150009474A (en) | 2015-01-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100888474B1 (en) | Apparatus and method for encoding/decoding multichannel audio signal | |
EP1851997B1 (en) | Near-transparent or transparent multi-channel encoder/decoder scheme | |
US11056122B2 (en) | Encoder and encoding method for multi-channel signal, and decoder and decoding method for multi-channel signal | |
JP7413418B2 (en) | Audio decoder for interleaving signals | |
RU2643644C2 (en) | Coding and decoding of audio signals | |
CN109887516B (en) | Method for decoding audio scene, audio decoder and medium | |
RU2696952C2 (en) | Audio coder and decoder | |
KR101756838B1 (en) | Method and apparatus for down-mixing multi channel audio signals | |
EP3023984A1 (en) | Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal | |
KR20080066538A (en) | Apparatus and method for encoding/decoding multi-channel signal | |
JP6303435B2 (en) | Audio encoding apparatus, audio encoding method, audio encoding program, and audio decoding apparatus | |
JP6051621B2 (en) | Audio encoding apparatus, audio encoding method, audio encoding computer program, and audio decoding apparatus | |
KR20080035448A (en) | Method and apparatus for encoding/decoding multi channel audio signal | |
JP6299202B2 (en) | Audio encoding apparatus, audio encoding method, audio encoding program, and audio decoding apparatus | |
KR20150011783A (en) | Decoding method for multi-channel audio signal using reverberation signal and decoder | |
KR20140122990A (en) | Apparatus and method for encoding/decoding multichannel audio signal | |
KR20150009426A (en) | Method and apparatus for processing audio signal to down mix and channel convert multichannel audio signal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
| 17P | Request for examination filed | Effective date: 20160215 |
| AK | Designated contracting states | Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
| AX | Request for extension of the european patent | Extension state: BA ME |
| DAX | Request for extension of the european patent (deleted) | |
| A4 | Supplementary search report drawn up and despatched | Effective date: 20170203 |
| RIC1 | Information provided on ipc code assigned before grant | Ipc: G10L 19/008 20130101AFI20170130BHEP |
| STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: EXAMINATION IS IN PROGRESS |
| 17Q | First examination report despatched | Effective date: 20200901 |
| STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: EXAMINATION IS IN PROGRESS |
| STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: EXAMINATION IS IN PROGRESS |
| REG | Reference to a national code | Ref country code: DE Ref legal event code: R003 |
| STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
| 18R | Application refused | Effective date: 20230629 |