EP3023984A1 - Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal - Google Patents

Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal Download PDF

Info

Publication number
EP3023984A1
EP3023984A1 EP14826617.4A EP14826617A EP3023984A1 EP 3023984 A1 EP3023984 A1 EP 3023984A1 EP 14826617 A EP14826617 A EP 14826617A EP 3023984 A1 EP3023984 A1 EP 3023984A1
Authority
EP
European Patent Office
Prior art keywords
channel signal
lfe
signal
downmixed
encoding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP14826617.4A
Other languages
German (de)
French (fr)
Other versions
EP3023984A4 (en
Inventor
Seung Kwon Beack
Jong Mo Sung
Tae Jin Lee
Kyeong Ok Kang
Jeong Il Seo
Jin Woong Kim
Yong Ju Lee
Dae Young Jang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electronics and Telecommunications Research Institute ETRI
Original Assignee
Electronics and Telecommunications Research Institute ETRI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electronics and Telecommunications Research Institute ETRI filed Critical Electronics and Telecommunications Research Institute ETRI
Publication of EP3023984A1 publication Critical patent/EP3023984A1/en
Publication of EP3023984A4 publication Critical patent/EP3023984A4/en
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

Definitions

  • Exemplary embodiments relate to an encoder and encoding method for a multi-channel signal, and a decoder and decoding method for a multi-channel signal, and more particularly to a codec for efficiently processing a multi-channel signal including a plurality of channel signals.
  • An aspect of the present invention is to provide an apparatus and method of encoding or decoding a multi-channel signal including a low-frequency effects (LFE) channel signal.
  • LFE low-frequency effects
  • Another aspect of the present invention is to provide an apparatus and method of performing two-stage encoding/decoding or one-stage encoding/decoding employing a time delay.
  • a method of encoding a multi-channel signal including outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are included in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; encoding the first downmixed signal the second downmixed signal together; and generating a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  • LFE low-frequency effects
  • the outputting of the first cue may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode
  • the outputting of the second cue may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode
  • the first spatial cue and the second spatial cue may include a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • CLD channel level difference
  • a method of encoding a multi-channel signal including outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first LFE channel signal which are included in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; encoding the first downmixed signal; encoding the second downmixed signal separately from the first downmixed signal; and generating a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  • the outputting of the first cue may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode
  • the outputting of the second cue may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode
  • the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • a method of encoding a multi-channel signal including outputting a downmixed signal and a spatial cue by encoding a first LFE channel signal and a second LFE channel signal which are included in a multi-channel signal; encoding the downmixed signal; and generating a bitstream including the encoded downmixed signal and the spatial cue.
  • the outputting may output the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • a method of encoding a multi-channel signal including applying a time delay to a first LFE channel signal included in a multi-channel signal; applying the time delay to a second LFE channel signal included in the multi-channel signal; encoding the first LEF channel signal to which the time delay is applied; encoding the second LEF channel signal to which the time delay is applied; and generating a bitstream including the encoded first LEF channel signal and the encoded second LEF channel signal.
  • the time delay may include a time delay which occurs in encoding a normal channel signal included in the multi-channel signal.
  • a method of encoding a multi-channel signal including applying a time delay a normal channel signal included in a multi-channel signal; encoding the normal channel signal to which the time delay is applied; outputting a downmixed signal and a spatial cue by encoding an LFE channel signal included in the multi-channel signal; and encoding the encoded LFE channel signal, wherein the time delay includes a time delay which occurs in encoding the LFE channel signal.
  • the outputting may output the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
  • a method of decoding a multi-channel signal including generating a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream; outputting a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  • the outputting of the first normal channel signal and the first LFE channel signal may output the first normal channel signal and the first LEF channel signal from the first downmixed signal by applying a first spatial cue to parametric coding
  • the outputting of the second normal channel signal and the second LFE channel signal may output the second normal channel signal and the second LEF channel signal from the second downmixed signal by applying a second spatial cue to parametric coding
  • the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • a method of decoding a multi-channel signal including generating a first downmixed signal by decoding an encoded result extracted from a bitstream; generating a second downmixed signal by decoding another encoded result extracted from the bitstream; outputting a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  • the outputting of the first normal channel signal and the first LFE channel signal may output the first normal channel signal and the first LEF channel signal using parametric coding based on a first spatial cue for the first downmixed signal
  • the outputting of the second normal channel signal and the second LFE channel signal may output the second normal channel signal and the second LEF channel signal using parametric coding based on a second spatial cue for the second downmixed signal
  • the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • a method of decoding a multi-channel signal including generating a downmixed signal by decoding an encoded result extracted from a bitstream; and outputting a first LFE channel signal and a second LFE channel signal by decoding the downmixed signal.
  • the outputting may output the first LEF channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • a method of decoding a multi-channel signal including outputting a first LFE channel signal by decoding an encoded result extracted from a bitstream; outputting a second LFE channel signal by decoding another encoded result extracted from the bitstream; applying a time delay to the first LEF channel signal; and applying the time delay to the second LFE channel signal.
  • the time delay may include a time delay which occurs in decoding a normal channel signal.
  • a method of decoding a multi-channel signal including decoding a normal channel signal from a bitstream; applying a time delay to the decoded normal channel signal; decoding an LFE channel signal from the bitstream; and decoding the decoded LFE channel signal.
  • the time delay may include a time delay which occurs in decoding the LFE channel signal.
  • the decoding of the LFE channel signal may output a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
  • an encoder for a multi-channel signal including a first encoding unit to output a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are included in a multi-channel signal and to output a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; a second encoding unit to encode the first downmixed signal and the second downmixed signal together; and a bitstream formatter to generate a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  • LFE low-frequency effects
  • the first encoding unit may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode and may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • CLD channel level difference
  • an encoder for a multi-channel signal including a first encoding unit to output a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first LFE channel signal which are included in a multi-channel signal and to output a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; a second encoding unit to encode the first downmixed signal; encoding the second downmixed signal separately from the first downmixed signal; and a bitstream formatter to generate a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  • the first encoding unit may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode and may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • an encoder for a multi-channel signal including a first encoding unit to output a downmixed signal and a spatial cue by encoding a first LFE channel signal and a second LFE channel signal which are included in a multi-channel signal; a second encoding unit to encode the downmixed signal; and a bitstream formatter to generate a bitstream including the encoded downmixed signal and the spatial cue.
  • the first encoding unit may output the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • an encoder for a multi-channel signal including a delay unit to apply a time delay to a first LFE channel signal included in a multi-channel signal and to applying the time delay to a second LFE channel signal included in the multi-channel signal; a second encoding unit to encode the first LEF channel signal to which the time delay is applied and to encode the second LEF channel signal to which the time delay is applied; and a bitstream formatter to generate a bitstream including the encoded first LEF channel signal and the encoded second LEF channel signal.
  • the time delay may include a time delay which occurs in encoding a normal channel signal included in the multi-channel signal.
  • an encoder for a multi-channel signal including a delay unit to apply a time delay a normal channel signal included in a multi-channel signal; a first encoding unit to encode the normal channel signal to which the time delay is applied; a second encoding unit to output a downmixed signal and a spatial cue by encoding an LFE channel signal included in the multi-channel signal; and a third encoding unit to encode the encoded LFE channel signal, wherein the time delay includes a time delay which occurs in encoding the LFE channel signal.
  • the second encoding unit may output the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
  • a decoder for a multi-channel signal including a first decoding unit to generate a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream; and a second decoding unit to output a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal and to output a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  • the second decoding unit may output the first normal channel signal and the first LEF channel signal from the first downmixed signal by applying a first spatial cue to parametric coding and may output the second normal channel signal and the second LEF channel signal from the second downmixed signal by applying a second spatial cue to parametric coding, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • a decoder for a multi-channel signal including a first decoding unit to generate a first downmixed signal by decoding an encoded result extracted from a bitstream and to generate a second downmixed signal by decoding another encoded result extracted from the bitstream; and a second decoding unit to output a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and to output a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  • the second decoding unit may output the first normal channel signal and the first LEF channel signal using parametric coding based on a first spatial cue for the first downmixed signal and may output the second normal channel signal and the second LEF channel signal using parametric coding based on a second spatial cue for the second downmixed signal, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • a decoder for a multi-channel signal including a first decoding unit to generate a downmixed signal by decoding an encoded result extracted from a bitstream; and a second decoding unit to output a first LFE channel signal and a second LFE channel signal by decoding the downmixed signal.
  • the second decoding unit may output the first LEF channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • a decoder for a multi-channel signal including a first decoding unit to output a first LFE channel signal by decoding an encoded result extracted from a bitstream and to output a second LFE channel signal by decoding another encoded result extracted from the bitstream; and a delay unit to apply a time delay to the first LEF channel signal and to apply the time delay to the second LFE channel signal.
  • the time delay may include a time delay which occurs in decoding a normal channel signal.
  • a decoder for a multi-channel signal including a first decoding unit to decode a normal channel signal from a bitstream; a delay unit to apply a time delay to the decoded normal channel signal; a second decoding unit to decode an LFE channel signal from the bitstream; and a third decoding unit to decode the decoded LFE channel signal.
  • the time delay may include a time delay which occurs in decoding the LFE channel signal.
  • the second decoding unit may output a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
  • a multi-channel signal including a low-frequency effects (LFE) channel signal in addition to a normal channel signal may be effectively encoded or decoded.
  • LFE low-frequency effects
  • synchronized multi-channel signals may be output by employing two-stage encoding/decoding or one-stage encoding/decoding employing a time delay.
  • FIG. 1 illustrates an encoder and a decoder according to an embodiment.
  • the encoder 101 may encode a multi-channel signal including a plurality of channel signals to generate a bitstream.
  • the decoder 101 may decode the multi-channel signal from the bitstream received from the encoder 101 or stored in a medium of the encoder 101.
  • the multi-channel signal may include a low-frequency effects (LFE) channel signal.
  • LFE channel signal refers to a channel signal for low-frequency effects (LFE) of a selective and limited sound range.
  • a low sound range may refer to a low-frequency range from 20 to 120 Hz.
  • An LFE channel signal may be used to supplement low-frequency information on a main channel signal by transmitting additional low-frequency information.
  • FIG. 2 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a first embodiment.
  • FIGS. 2 to 5 illustrate processes of encoding a multi-channel signal including two LFE channel signals
  • FIGS. 6 to 9 illustrate processes of decoding encoded results of FIGS. 2 to 5 .
  • the encoder may include a first encoding unit 201, a first encoding unit 202, a second encoding unit 203 and a bitstream formatter 204.
  • the first encoding units 201 and 202 may perform the same operations.
  • the first encoding unit 201 may generate a downmixed signal dmx 1 using an LFE channel signal Lfe 1 and a normal channel signal x i .
  • a normal channel signal may refer to a channel signal which does not exhibit low-frequency effects.
  • the first encoding unit 202 may generate a downmixed signal dmx 2 using an LFE channel signal Lfe 2 and a normal channel signal x i+1 . i represents an index of a normal channel signal. That is, the encoder of FIG. 2 may encode a multi-channel signal including a normal channel signal coupled to an LFE channel signal.
  • the first encoding units 201 and 202 may perform parametric coding to output spatial cues and the downmixed signals.
  • the first encoding units 201 and 202 perform parametric coding using the LFE channel signals.
  • a channel level difference (CLD) as a spatial cue may be extracted from an LFE band. Accordingly, a spatial cue output through parametric coding using an LFE channel signal may output a relatively smaller amount of data than a spatial cue output through generally used parametric coding.
  • the spatial cues output from the first encoding units 201 and 202 are bit1 and bit2, respectively.
  • the second encoding unit 203 may encode the downmixed signal dmx 1 output from the first encoding unit 201 and the downmixed signal dmx 2 output from the first encoding unit 202.
  • the downmixed signals dmx 1 and dmx 2 may be input as a stereo signal to the second encoding unit 203.
  • the second encoding unit 203 may be an Advanced Audio Codec (AAC), MP3, or the like.
  • AAC Advanced Audio Codec
  • the second encoding unit 203 outputs bit3 as an encoded result, which is input to the bitstream formatter 204.
  • the bitstream formatter 204 may convert bit3 into a bitstream.
  • FIG. 3 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a second embodiment.
  • the encoder of FIG. 3 may include a first encoding unit 301, a second encoding unit 302, a first encoding unit 303, a second encoding unit 304 and a bitstream formatter 305.
  • the first encoding units 301 and 303 of FIG. 3 may operate in the same manner as the first encoding units 201 and 202 of FIG. 2 . That is, the first encoding units 301 and 303 may perform parametric coding using an LFE channel signal to extract a CLD as a spatial cue from an LFE band.
  • the first encoding unit 301 may generate a downmixed signal dmx 1 using an LFE channel signal Lfe 1 and a normal channel signal x i .
  • the first encoding unit 303 may generate a downmixed signal dmx 2 using an LFE channel signal Lfe 2 and a normal channel signal x i+1 .
  • the downmixed signal dmxi resulting from encoding by the first encoding unit 301 is input as a mono signal to the second encoding unit 302.
  • the second encoding unit 302 may output bit3 using the downmixed signal dmx 1 .
  • the downmixed signal dmx 2 resulting from encoding by the first encoding unit 303 is input as a mono signal to the second encoding unit 304.
  • the second encoding unit 304 may output bit4 using the downmixed signal dmx 2 .
  • FIG. 4 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a third embodiment.
  • the encoder may include a first encoding unit 401, a second encoding unit 402 and a bitstream formatter 403.
  • LFE channel signals Lfe 1 and Lfe 2 may be coupled to each other and input to the first encoding unit 401.
  • the first encoding unit 401 may output a downmixed signal dmx 3 as a mono signal using the LFE channel signals Lfe 1 and Lfe 2 .
  • bit1 means a spatial cue derived by the first encoding unit 401 through parametric coding.
  • the downmixed signal dmx 3 may be input to the second encoding unit 402.
  • the second encoding unit 402 may code an LFE band in the downmixed signal dmx 3 .
  • a Unified Speech and Audio Codec (USAC) and an Advanced Audio Codec (AAC) may have a separate coding mode for coding an LFE band.
  • the second encoding unit 402 may use a coding mode provided by the USAC or AAC.
  • Bit2 output from the second encoding unit 402 and bit1 output from the first encoding unit 401 may be output as a bitstream through the bitstream formatter 403.
  • FIG. 5 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a fourth embodiment.
  • the encoder may include a delay unit 501, a second encoding unit 502, a delay unit 503, a second encoding unit 504 and a bitstream formatter 505.
  • FIG. 5 illustrates a process of encoding an LFE channel signal using the second encoding units 502 and 504, not via the aforementioned first encoding units.
  • the second encoding units 502 and 504 may perform parametric coding on an LFE band. Input signals for the second encoding units 502 and 504 may need to be delayed corresponding to the presence of a first encoding unit.
  • Input signals for the second encoding units 502 and 504 may need to be delayed corresponding to the presence of a first encoding unit.
  • one more encoding process of the normal channel signals may cause a time delay. Accordingly, a bitstream of the LFE channel signals synchronized with the normal channel signals may be generated only when the time delay is considered.
  • the delay units 501 and 503 may apply a time delay ⁇ enc , which may occur in real encoding, to the LFE channel signals Lfe 1 and Lfe 2 . Subsequently, time-delayed LFE channel signals Lfe 1 (n- ⁇ enc ) and Lfe 2 (n- ⁇ enc ) may be input to the second encoding units 502 and 504, respectively. Bit1 and bit2, encoded results by the second encoding units 502 and 504 may be output as a bitstream via the bitstream formatter 505.
  • FIG. 6 illustrates a decoder which decodes an encoded result of FIG. 2 .
  • the decoder may include a bitstream deformatter 601, a first decoding unit 602, a second decoding unit 603 and a second decoding unit 604.
  • FIG. 6 may operate in an inverse manner to FIG. 2 .
  • a bitstream input to the bitstream deformatter 601 may be the bitstream generated in FIG. 2 .
  • the bitstream deformatter 601 may output bit1, bit2 and bit3 from the bitstream. Bit1, bit2 and bit3 are the same as those mentioned in FIG. 2 .
  • Bit3 may be input to the first decoding unit 602.
  • the first decoding unit 602 may generate downmixed signals dmx 1 and dmx 2 using bit3.
  • the second decoding unit 603 may perform parametric coding on bit1 as a spatial cue and the downmixed signal dmx 1 to output a normal channel signal x i and an LFE channel signal Lfe 1 .
  • the second decoding unit 604 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx 2 to output a normal channel signal x i+1 and an LFE channel signal Lfe 2 .
  • FIG. 7 illustrates a decoder which decodes an encoded result of FIG. 3 .
  • the decoder may include a bitstream deformatter 701, a first decoding unit 702, a second decoding unit 703, a first decoding unit 704 and a second decoding unit 705.
  • FIG. 7 may operate in an inverse manner to FIG. 3 .
  • a bitstream input to the bitstream deformatter 701 may be the bitstream generated in FIG. 3 .
  • the bitstream deformatter 701 may output bit1, bit2, bit3 and bit4 from the bitstream. Bit1, bit2, bit3 and bit4 are the same as those mentioned in FIG. 3 .
  • Bit3 may be input to the first decoding unit 702, and bit4 may be input to the first decoding unit 704.
  • the first decoding unit 702 may generate a downmixed signal dmx 1 using bit3.
  • the first decoding unit 704 may generate a downmixed signal dmx 2 using bit4.
  • the second decoding unit 703 may perform parametric coding on bit1 as a spatial cue and the downmixed signal dmx 1 to output a normal channel signal x i and an LFE channel signal Lfe 1 .
  • the second decoding unit 703 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx 2 to output a normal channel signal x i+1 and an LFE channel signal Lfe 2 .
  • FIG. 8 illustrates a decoder which decodes an encoded result of FIG. 4 .
  • the decoder may include a bitstream deformatter 801, a first decoding unit 802 and a second decoding unit 803.
  • FIG. 8 may operate in an inverse manner to FIG. 4 .
  • a bitstream input to the bitstream deformatter 801 may be the bitstream generated in FIG. 4 .
  • the bitstream deformatter 801 may output bit1 and bit2 from the bitstream. Bit1 and bit2 are the same as those mentioned in FIG. 4 .
  • Bit1 may be input to the first decoding unit 802, and bit2 may be input to the second decoding unit 803.
  • the first decoding unit 802 may generate a downmixed signal dmx 3 using bit3.
  • the second decoding unit 803 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx 3 to output LFE channel signals Lfe 1 and Lfe 2 .
  • the first decoding unit 802 and the second decoding unit 803 may perform parametric coding on an LFE band of the input downmixed signal dmx 3 .
  • FIG. 9 illustrates a decoder which decodes an encoded result of FIG. 5 .
  • the decoder may include a bitstream deformatter 901, a first decoding unit 902, a delay unit 903, a first decoding unit 904 and a delay unit 905.
  • FIG. 9 may operate in an inverse manner to FIG. 5 .
  • a bitstream input to the bitstream deformatter 901 may be the bitstream generated in FIG. 5 .
  • the bitstream deformatter 901 may output bit1 and bit2 from the bitstream. Bit1 and bit2 are the same as those mentioned in FIG. 5 .
  • Bit1 may be input to the first decoding unit 902, and bit2 may be input to the first decoding unit 904.
  • the first decoding unit 902 may generate an LFE channel signal Lfe 1 (n- ⁇ enc ) using bit1
  • the second decoding unit 904 may generate an LFE channel signal Lfe 2 (n- ⁇ enc ) using bit2.
  • the delay unit 903 may apply a time delay to the LFE channel signal Lfe 1 (n- ⁇ enc ) to output Lfe 1 (n- ⁇ enc - ⁇ dec ).
  • the delay unit 905 may apply a time delay to the LFE channel signal Lfe 2 (n- ⁇ enc ) to output Lfe 2 (n- ⁇ enc - ⁇ dec ).
  • the delay units 903 and 905 may apply a time delay ⁇ dec occurring in one-time decoding so that signals subjected to one-time decoding synchronize with those subjected to two-time decoding.
  • FIG. 10 illustrates a process of encoding a multi-channel signal using the encoder of FIG. 2 .
  • FIG. 10 illustrates an encoder for a multi-channel signal which adopts the encoder Type1 illustrated in FIG. 2 .
  • Two To Ones (TTOs) 1001, 1002, 1004 and 1005 may encode an input signal according to a parametric coding mode for an MPEG Surround stereo signal. That is, the TTOs may correspond to the first encoding units of FIG. 2 , and USAC encoders may correspond to the second encoding unit of FIG. 2 .
  • TTOs 1001 and 1002 may perform parametric coding according to a normal mode
  • TTOs 1004 and 1005 may perform parametric coding according to an LFE mode.
  • a CLD, Inter-Channel Coherence (ICC) and Interchannel Phase Difference (IPD) as spatial cues may be extracted by analyzing a normal channel signal x i .
  • IPD Interchannel Phase Difference
  • a CLD may be extracted from an LFE band of an input LFE channel signal.
  • FIG. 10 illustrates an encoding process when N multi-channel signals are input.
  • the N multi-channel signals may be subjected to parametric coding via the TTOs into M downmixed signals dmx 1 to dmx M .
  • the M downmixed signals may be input in a stereo form and encoded through USAC core coding.
  • LFE channel signals Lfe 1 and Lfe 2 may be coupled to normal channel signals to be input to the TTO 1004 and TTO1005.
  • normal channel signals of the multi-channel signals may be coupled and downmixed by two channels, and a downmixed result may be subjected to stereo coding by the USAC encoders.
  • two normal channel signals x 2M-1 and x 2M may be respectively coupled to the LFE channel signals Lfe 1 and Lfe 2 and input to the TTOs(Lfe).
  • FIG. 10 shows that the encoder Type 1 of FIG. 2 is adopted, the encoder Type 2 illustrated in FIG. 3 may be applied, instead of the encoder Type 1.
  • FIG. 11 illustrates a decoder which decodes an encoded result of FIG. 10 .
  • FIG. 11 illustrates a decoder for a multi-channel signal which adopts the decoder Type 1 illustrated in FIG. 6 .
  • One To Twos (OTTs) 1103, 1104, 1106 and 1107 may decode an input signal according to a parametric coding mode for an MPEG Surround stereo signal. That is, the OTTs may correspond to the second decoding units of FIG. 6 , and USAC decoders may correspond to the first decoding unit.
  • OTTs One To Twos
  • USAC decoders may correspond to the first decoding unit.
  • OTTs 1103 and 1104 may perform parametric coding according to a normal mode, and the OTTs 1106 and 1107 may perform parametric coding according to an LFE mode.
  • the encoded result may be decoded to output N multi-channel audio signals.
  • M downmixed signals may be output from a bitstream via the USAC decoders.
  • the M downmixed signals may be input to the respective OTTs to output stereo signals.
  • the OTTs 1103 and 1104 may output two normal channel signals, and the OTTs 1106 and 1107 may output normal channel signals coupled to LEF channel signals.
  • FIG. 12 illustrates a process of encoding a multi-channel signal when encoding bits are sufficient in FIG. 10 .
  • normal channel signals x 1 to x 2M-2 may be encoded by USAC encoders 1203 and 1206.
  • a delay time ⁇ enc occurring in encoding via delay units 1201, 1201, 1204 and 1205, may be applied to the normal channel signals x 1 to x 2M-2 . Accordingly, time-delayed results may be encoded by the USAC encoders 1203 and 1206.
  • the time delay ⁇ enc occurs in OTTs 1207 and 1208 and may include time delays due to quadrature mirror filter (QMF) analysis, hybrid analysis and QMF synthesis.
  • QMF quadrature mirror filter
  • a time delay occurring by QMF synthesis may be excluded when calculating ⁇ enc .
  • FIG. 13 illustrates a decoder which decodes an encoded result of FIG. 12 .
  • FIG. 13 may perform an inverse process to FIG. 12 .
  • a normal channel signal is decoded by USAC decoders 1302 and 1305 and output via delay units 1303, 1304, 1306 and 1307.
  • a result derived by a bitstream deformatter 1301 maybe decoded by a USAC decoder 1308 to generate downmixed signals, and the downmixed signals may be respectively input to OTTs 1309 and 1310 to output normal channel signals x 2M-1 and x 2M respectively coupled to LFE channel signals Lfe 1 and Lfe 2 .
  • the delay units 1303, 1304, 1306 and 1307 may apply a time delay ⁇ dec occurring in one-time decoding to one-time to output results from the USAC decoders 1302 and 1305.
  • ⁇ dec includes QMF analysis, hybrid analysis, QMF synthesis and filtering delays and is different from ⁇ ene .
  • output signals from the USAC decoders 1302 and 1305 are QMF signals, a time delay occurring by QMF analysis may be excluded when determining ⁇ dec .
  • a filtering delay refers to a time delay which occurs due to a filtering operation by the OTTs 1309 and 1310, irrespective of QMF conversion.
  • a filtering delay may be a time delay occurring in a decorrelator operation by the OTTs 1309 and 1310.
  • FIG. 14 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 4 .
  • the encoder Type 3 illustrated in FIG. 4 may be used.
  • Normal channel signals x 1 to x 2M of a multi-channel signal may be coupled by two and input to TTOs 1401 and 1402.
  • the TTOs 1401 and 1402 may perform parametric coding on the normal channel signals coupled by two to output downmixed signals dmx 1 and dmx 2 along with spatial cues.
  • the output downmixed signals dmx 1 and dmx 2 may be input in a stereo form to a USAC encoder 1403.
  • LFE channel signals Lfe 1 and Lfe 2 included in the multi-channel signal may be coupled by two and input to a TTO 140.
  • the TTO 1404 may perform parametric coding using the two LFE channel signals Lfe 1 and Lfe 2 to output a downmixed signal dmx 3 in a mono form.
  • a USAC encoder 1405 may encode the downmixed signal dmx 3 in the LFE mode.
  • FIG. 15 illustrates another example of encoding a multi-channel signal using the encoder of FIG. 4 .
  • a normal channel signal may be encoded by USAC encoders 1503 and 1506, instead of being subjected to parametric coding by TTOs 1401 and 1402 in FIG. 14 .
  • delay units 1501, 1502, 1504 and 1505 may apply a time delay occurring by the TTO 1507 to the normal channel signal.
  • LFE channel signals Lfe 1 and Lfe 2 may be encoded by the TTO 1507 in the LFE mode to output a downmixed signal dmx 3 , and the downmixed signal dmx 3 may be encoded by the USAC encoder.
  • FIGS. 14 and 15 illustrate the encoders
  • corresponding decoders may operate according to inverse processes.
  • a normal channel signal may be output from a bitstream obtained in FIG. 14 via a USAC decoder and an OTT.
  • LFE channel signals Lfe 1 and Lfe 2 may be output from the bitstream obtained in FIG. 14 via a USAC decoder and an OTT.
  • a normal channel signal may be output from a bitstream obtained in FIG. 15 via a USAC decoder and a delay unit.
  • LFE channel signals Lfe 1 and Lfe 2 may be output from the bitstream obtained in FIG. 15 via a USAC decoder and an OTT.
  • FIG. 16 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 5 .
  • FIG. 16 illustrates an encoder for a multi-channel signal which adopts the encoder Type 4 illustrated in FIG. 5 .
  • Normal channel signals may be converted into downmixed signals through TTOs 1601 and 1602, and the converted downmixed signals may be output as a bitstream through a USAC encoder 1603.
  • delay units 1604 and 1606 may apply a time delay ⁇ enc occurring in the TTOs 1601 and 1602 to LFE channel signals, and time-delayed results may be encoded respectively by USAC encoders 1605 and 1607 according to the LFE mode. That is, since the LFE channel signals are subjected to encoding once, unlike the normal channel signals subjected to encoding two times, the time delay ⁇ enc occurring in encoding by the TTOs may need to be applied to the LFE channel signals.
  • FIG. 17 illustrates an example of decoding a multi-channel signal using the decoder of FIG. 9 .
  • the decoder Type 4 illustrated in FIG. 9 is used.
  • normal channel signals may be output through a USAC decoder 1702 and a TTO 1703 and 1704.
  • LFE channel signals may be output through USAC decoders 1705 and 1707 and delay units 1706 and 1708.
  • the delay units 1706 and 1708 may need to apply a time delay ⁇ dec occurring in decoding by the TTOs 1703 and 1704 to the LFE channel signals. Accordingly, the normal channel signals and the LFE channel signals output from the decoder may be synchronized with each other.
  • FIG. 18 illustrates an encoder which encodes a multi-channel signal when the multi-channel signal includes an odd number of LFE channel signals according to an embodiment.
  • FIGS. 2 to 17 illustrate an even number of LFE channel signals
  • FIG. 18 illustrates an odd number of LFE channel signals.
  • one LEF channel signal Lfe 2n+1 may be input to a delay unit 1801, and a time delay ⁇ may be applied to the LEF channel signal.
  • the time-delayed LFE channel signal may be encoded by a second encoding unit 1802 in an LFE mode to output bit1. That is, an odd number of LFE channel signals may be processed by the encoder Type 4 of FIG. 5 or the decoder Type 4 of FIG. 9 .
  • a normal channel signal is encoded by a first encoding unit and the second encoding unit, and thus a delay unit 1801 may need to apply a time delay occurring due to the first encoding unit to the LFE channel signal for synchronization with the normal channel signal.
  • FIG. 19 illustrates an encoder which encodes a normal audio signal, not an LFE channel signal, according to an embodiment.
  • normal channel signals x 1 and x 2 may be subjected to parametric coding by a first encoding unit 1901 to be converted into a downmixed signal dmx 1 along with a spatial cue bit1.
  • normal channel signals x 3 and x 4 may be subjected to parametric coding by a first encoding unit 1902 to be converted into a downmixed signal dmx 2 along with a spatial cue bit2.
  • parametric coding applied to the normal channel signals may extract not only a CLD but also an ICC and IPD as spatial cues.
  • the downmixed signal dmx 1 and dmx 2 may be input in a stereo form to a second encoding unit 1903 and encoded to output bit3.
  • Bit3 may be converted into a bistream by a bitstream formatter 1904.
  • FIG. 20 illustrates a decoder which decodes an encoded result of FIG. 19 .
  • spatial cues bit1 and bit2 and encoded bit3 may be output by a bitstream deformatter 2001 from the bitstream generated in FIG. 19 .
  • a first decoding unit 2002 may decode bit3 to output downmixed signals dmx 1 and dmx 2 .
  • a second decoding unit 2003 may decode a downmixed signal dmx 1 to output normal channel signals x 1 and x 2 .
  • a second decoding unit 2004 may decode a downmixed signal dmx 2 to output normal channel signals x 1 and x 2 .
  • FIG. 21 illustrates an encoding process and a decoding process according to an embodiment.
  • the foregoing first encoding units may correspond to TTOs 2101 and 2102 of FIG. 21
  • the foregoing second encoding units may correspond to a USAC encoder 2103.
  • the foregoing first decoding units may correspond to a USAC decoder 2104
  • the foregoing second encoding units may correspond to OTTs 2105 and 2106.
  • the USAC decoder 2104 may output two downmixed signals from a bitstream.
  • the OTTs 2105 and 2106 may output (i) four normal channel signals or (ii) results of coupling one normal channel signal and one LFE channel signal from the downmixed signals.
  • FIG. 22 illustrates a USAC encoder and a USAC decoder according to a first embodiment.
  • a USAC encoder may include TTOs 2203 and 2204 to configure an extended USAC encoder 2201.
  • a USAC decoder may include OTTs 2211 and 2212 to configure an extended USAC decoder 2202.
  • the non-frequency-extended core band is decoded by a core decoder 2208, and a decoded result may be input to and subjected to frequency extension by an SBR 2209, thereby reconstructing an original signal.
  • a result of frequency extension by the SBR 2209 may be subjected to parametric coding by an OTT 2210 to generate two downmixed signals, and the downmixed signals may be subjected to parametric coding by the OTTs 2211 and 2212 to output (i) four normal channel signals or (ii) one normal channel signal and one LFE channel signal.
  • FIG. 23 illustrates a USAC encoder and a USAC decoder according to a second embodiment.
  • positions of an SBR 2305 and a TTO 2306 in an extended USAC encoder 2301 and positions of an OTT 2309 and an SBR 2310 in an extended USAC decoder 2302 are changed from those in FIG. 22 .
  • Other components may be equivalent to those in FIG. 22 .
  • the apparatuses described herein may be implemented using hardware components, software components, and/or combinations of hardware components and software components.
  • the units and components illustrated in the embodiments may be implemented using one or more general-purpose or special purpose computers, such as, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable array (FPA), a programmable logic unit (PLU), a microprocessor or any other device capable of responding to and executing instructions.
  • a processing device may run an operating system (OS) and one or more software applications that run on the OS. The processing device also may access, store, manipulate, process, and create data in response to execution of the software.
  • OS operating system
  • the processing device also may access, store, manipulate, process, and create data in response to execution of the software.
  • a processing device may include multiple processing elements and multiple types of processing elements.
  • a processing device may include multiple processors or a processor and a controller.
  • different processing configurations are possible, such as parallel processors.
  • the software may include a computer program, a piece of code, an instruction, or one or more combinations thereof, to independently or collectively instruct or configure the processing device to operate as desired.
  • Software and/or data may be embodied permanently or temporarily in any type of machine, component, physical or virtual equipment, computer storage medium or device, or in a propagated signal wave in order to provide instructions or data to the processing device or to be interpreted by the processing device.
  • the software may also be distributed over network coupled computer systems so that the software is stored and executed in a distributed fashion.
  • the software and data may be stored by one or more non-transitory computer readable recording mediums.
  • the methods according to the embodiments may be realized as program instructions implemented by various computers and be recorded in non-transitory computer-readable media.
  • the media may also include, alone or in combination with the program instructions, data files, data structures, and the like.
  • the program instructions recorded in the media may be designed and configured specially for the embodiments or be known and available to those skilled in computer software.
  • Examples of the non-transitory computer readable recording medium may include magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD ROM disks and DVDs; magneto-optical media such as floptical disks; and hardware devices that are specially configured to store and perform program instructions, such as read-only memory (ROM), random access memory (RAM), flash memory, and the like.
  • Examples of program instructions include both machine codes, such as produced by a compiler, and higher level language codes that may be executed by the computer using an interpreter.
  • the described hardware devices may be configured to act as one or more software modules in order to perform the operations of the above-described exemplary embodiments, or vice versa.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Disclosed are an encoder and encoding method for a multi-channel signal, and a decoder and decoding method for a multi-channel signal. The present invention may code a normal channel signal and a low-frequency effects (LFE) channel signal included in a multi-channel signal through a two-stage coding process or a one-stage coding process employing a time delay, thereby efficiently coding a multi-channel signal.

Description

    Technical Field
  • Exemplary embodiments relate to an encoder and encoding method for a multi-channel signal, and a decoder and decoding method for a multi-channel signal, and more particularly to a codec for efficiently processing a multi-channel signal including a plurality of channel signals.
  • Background Art
  • With demands for ultrahigh quality of audiovisual (AV) media require, novel technology for compression/transmission of AV media is needed. For superhigh audio content, audio quality and accurate representation of a sound field of multi-channels are important rather than backward comparability. For instance, a 22.2 channel audio signal, which is for reproducing a sound field of an ultrahigh-quality audio, requires a high-quality multi-channel audio coding technique which enables representation of unique sound quality and effects of a sound field of content as it is, rather than compression/transmission techniques for backward compatibility.
  • Thus, new codec structures are needed for encoding/decoding known 5.1 or 7.1 channel or greater multi-channel signals.
  • Disclosure of Invention Technical problems
  • An aspect of the present invention is to provide an apparatus and method of encoding or decoding a multi-channel signal including a low-frequency effects (LFE) channel signal.
  • Another aspect of the present invention is to provide an apparatus and method of performing two-stage encoding/decoding or one-stage encoding/decoding employing a time delay.
  • Technical solutions
  • According to a first embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are included in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; encoding the first downmixed signal the second downmixed signal together; and generating a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  • The outputting of the first cue may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode, the outputting of the second cue may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a second embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first LFE channel signal which are included in a multi-channel signal; outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; encoding the first downmixed signal; encoding the second downmixed signal separately from the first downmixed signal; and generating a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  • The outputting of the first cue may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode, the outputting of the second cue may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a third embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including outputting a downmixed signal and a spatial cue by encoding a first LFE channel signal and a second LFE channel signal which are included in a multi-channel signal; encoding the downmixed signal; and generating a bitstream including the encoded downmixed signal and the spatial cue.
  • The outputting may output the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a fourth embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including applying a time delay to a first LFE channel signal included in a multi-channel signal; applying the time delay to a second LFE channel signal included in the multi-channel signal; encoding the first LEF channel signal to which the time delay is applied; encoding the second LEF channel signal to which the time delay is applied; and generating a bitstream including the encoded first LEF channel signal and the encoded second LEF channel signal.
  • The time delay may include a time delay which occurs in encoding a normal channel signal included in the multi-channel signal.
  • According to a fifth embodiment of the present invention, there is provided a method of encoding a multi-channel signal, the method including applying a time delay a normal channel signal included in a multi-channel signal; encoding the normal channel signal to which the time delay is applied; outputting a downmixed signal and a spatial cue by encoding an LFE channel signal included in the multi-channel signal; and encoding the encoded LFE channel signal, wherein the time delay includes a time delay which occurs in encoding the LFE channel signal.
  • The outputting may output the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
  • According to a first embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including generating a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream; outputting a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  • The outputting of the first normal channel signal and the first LFE channel signal may output the first normal channel signal and the first LEF channel signal from the first downmixed signal by applying a first spatial cue to parametric coding, the outputting of the second normal channel signal and the second LFE channel signal may output the second normal channel signal and the second LEF channel signal from the second downmixed signal by applying a second spatial cue to parametric coding, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a second embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including generating a first downmixed signal by decoding an encoded result extracted from a bitstream; generating a second downmixed signal by decoding another encoded result extracted from the bitstream; outputting a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  • The outputting of the first normal channel signal and the first LFE channel signal may output the first normal channel signal and the first LEF channel signal using parametric coding based on a first spatial cue for the first downmixed signal, the outputting of the second normal channel signal and the second LFE channel signal may output the second normal channel signal and the second LEF channel signal using parametric coding based on a second spatial cue for the second downmixed signal, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a third embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including generating a downmixed signal by decoding an encoded result extracted from a bitstream; and outputting a first LFE channel signal and a second LFE channel signal by decoding the downmixed signal.
  • The outputting may output the first LEF channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a fourth embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including outputting a first LFE channel signal by decoding an encoded result extracted from a bitstream; outputting a second LFE channel signal by decoding another encoded result extracted from the bitstream; applying a time delay to the first LEF channel signal; and applying the time delay to the second LFE channel signal.
  • The time delay may include a time delay which occurs in decoding a normal channel signal.
  • According to a fifth embodiment of the present invention, there is provided a method of decoding a multi-channel signal, the method including decoding a normal channel signal from a bitstream; applying a time delay to the decoded normal channel signal; decoding an LFE channel signal from the bitstream; and decoding the decoded LFE channel signal.
  • The time delay may include a time delay which occurs in decoding the LFE channel signal.
  • The decoding of the LFE channel signal may output a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
  • According to a first embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a first encoding unit to output a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are included in a multi-channel signal and to output a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; a second encoding unit to encode the first downmixed signal and the second downmixed signal together; and a bitstream formatter to generate a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  • The first encoding unit may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode and may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a second embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a first encoding unit to output a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first LFE channel signal which are included in a multi-channel signal and to output a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are included in the multi-channel signal; a second encoding unit to encode the first downmixed signal; encoding the second downmixed signal separately from the first downmixed signal; and a bitstream formatter to generate a bitstream including the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  • The first encoding unit may output the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode and may output the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a third embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a first encoding unit to output a downmixed signal and a spatial cue by encoding a first LFE channel signal and a second LFE channel signal which are included in a multi-channel signal; a second encoding unit to encode the downmixed signal; and a bitstream formatter to generate a bitstream including the encoded downmixed signal and the spatial cue.
  • The first encoding unit may output the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a fourth embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a delay unit to apply a time delay to a first LFE channel signal included in a multi-channel signal and to applying the time delay to a second LFE channel signal included in the multi-channel signal; a second encoding unit to encode the first LEF channel signal to which the time delay is applied and to encode the second LEF channel signal to which the time delay is applied; and a bitstream formatter to generate a bitstream including the encoded first LEF channel signal and the encoded second LEF channel signal.
  • The time delay may include a time delay which occurs in encoding a normal channel signal included in the multi-channel signal.
  • According to a fifth embodiment of the present invention, there is provided an encoder for a multi-channel signal, the encoder including a delay unit to apply a time delay a normal channel signal included in a multi-channel signal; a first encoding unit to encode the normal channel signal to which the time delay is applied; a second encoding unit to output a downmixed signal and a spatial cue by encoding an LFE channel signal included in the multi-channel signal; and a third encoding unit to encode the encoded LFE channel signal, wherein the time delay includes a time delay which occurs in encoding the LFE channel signal.
  • The second encoding unit may output the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
  • According to a first embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to generate a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream; and a second decoding unit to output a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal and to output a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  • The second decoding unit may output the first normal channel signal and the first LEF channel signal from the first downmixed signal by applying a first spatial cue to parametric coding and may output the second normal channel signal and the second LEF channel signal from the second downmixed signal by applying a second spatial cue to parametric coding, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a second embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to generate a first downmixed signal by decoding an encoded result extracted from a bitstream and to generate a second downmixed signal by decoding another encoded result extracted from the bitstream; and a second decoding unit to output a first normal channel signal and a first LFE channel signal by decoding the first downmixed signal; and to output a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  • The second decoding unit may output the first normal channel signal and the first LEF channel signal using parametric coding based on a first spatial cue for the first downmixed signal and may output the second normal channel signal and the second LEF channel signal using parametric coding based on a second spatial cue for the second downmixed signal, and the first spatial cue and the second spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a third embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to generate a downmixed signal by decoding an encoded result extracted from a bitstream; and a second decoding unit to output a first LFE channel signal and a second LFE channel signal by decoding the downmixed signal.
  • The second decoding unit may output the first LEF channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue may include a CLD output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  • According to a fourth embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to output a first LFE channel signal by decoding an encoded result extracted from a bitstream and to output a second LFE channel signal by decoding another encoded result extracted from the bitstream; and a delay unit to apply a time delay to the first LEF channel signal and to apply the time delay to the second LFE channel signal.
  • The time delay may include a time delay which occurs in decoding a normal channel signal.
  • According to a fifth embodiment of the present invention, there is provided a decoder for a multi-channel signal, the decoder including a first decoding unit to decode a normal channel signal from a bitstream; a delay unit to apply a time delay to the decoded normal channel signal; a second decoding unit to decode an LFE channel signal from the bitstream; and a third decoding unit to decode the decoded LFE channel signal.
  • The time delay may include a time delay which occurs in decoding the LFE channel signal.
  • The second decoding unit may output a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
  • Effects of Invention
  • According to an aspect of the present invention, a multi-channel signal including a low-frequency effects (LFE) channel signal in addition to a normal channel signal may be effectively encoded or decoded.
  • According to another aspect of the present invention, synchronized multi-channel signals may be output by employing two-stage encoding/decoding or one-stage encoding/decoding employing a time delay.
  • Brief Description of Drawings
    • FIG. 1 illustrates an encoder and a decoder according to an embodiment.
    • FIG. 2 illustrates an encoder which encodes a multi-channel signal including a low-frequency effects (LFE) channel signal according to a first embodiment.
    • FIG. 3 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a second embodiment.
    • FIG. 4 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a third embodiment.
    • FIG. 5 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a fourth embodiment.
    • FIG. 6 illustrates a decoder which decodes an encoded result of FIG. 2.
    • FIG. 7 illustrates a decoder which decodes an encoded result of FIG. 3.
    • FIG. 8 illustrates a decoder which decodes an encoded result of FIG. 4.
    • FIG. 9 illustrates a decoder which decodes an encoded result of FIG. 5.
    • FIG. 10 illustrates a process of encoding a multi-channel signal using the encoder of FIG. 2.
    • FIG. 11 illustrates a decoder which decodes an encoded result of FIG. 10.
    • FIG. 12 illustrates a process of encoding a multi-channel signal when encoding bits are sufficient in FIG. 10.
    • FIG. 13 illustrates a decoder which decodes an encoded result of FIG. 12.
    • FIG. 14 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 4.
    • FIG. 15 illustrates another example of encoding a multi-channel signal using the encoder of FIG. 4.
    • FIG. 16 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 5.
    • FIG. 17 illustrates an example of decoding a multi-channel signal using the decoder of FIG. 9.
    • FIG. 18 illustrates an encoder which encodes a multi-channel signal when the multi-channel signal includes an odd number of LFE channel signals according to an embodiment.
    • FIG. 19 illustrates an encoder which encodes a normal audio signal, not an LFE channel signal, according to an embodiment.
    • FIG. 20 illustrates a decoder which decodes an encoded result of FIG. 19.
    • FIG. 21 illustrates an encoding process and a decoding process according to an embodiment.
    • FIG. 22 illustrates a USAC encoder and a USAC decoder according to a first embodiment.
    • FIG. 23 illustrates a USAC encoder and a USAC decoder according to a first embodiment.
    DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS
  • Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.
  • FIG. 1 illustrates an encoder and a decoder according to an embodiment.
  • Referring to FIG. 1, the encoder 101 and the decoder 102 are shown. The encoder 101 may encode a multi-channel signal including a plurality of channel signals to generate a bitstream. The decoder 101 may decode the multi-channel signal from the bitstream received from the encoder 101 or stored in a medium of the encoder 101.
  • Here, according to one embodiment, the multi-channel signal may include a low-frequency effects (LFE) channel signal. Here, an LFE channel signal refers to a channel signal for low-frequency effects (LFE) of a selective and limited sound range. Here, a low sound range may refer to a low-frequency range from 20 to 120 Hz. An LFE channel signal may be used to supplement low-frequency information on a main channel signal by transmitting additional low-frequency information.
  • Hereinafter, processes of encoding or decoding a multi-channel signal including an LFE channel signal will be described in detail.
  • FIG. 2 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a first embodiment.
  • FIGS. 2 to 5 illustrate processes of encoding a multi-channel signal including two LFE channel signals, and FIGS. 6 to 9 illustrate processes of decoding encoded results of FIGS. 2 to 5.
  • Referring to FIG. 2, the encoder may include a first encoding unit 201, a first encoding unit 202, a second encoding unit 203 and a bitstream formatter 204. Here, the first encoding units 201 and 202 may perform the same operations.
  • In detail, the first encoding unit 201 may generate a downmixed signal dmx1 using an LFE channel signal Lfe1 and a normal channel signal xi. Here, a normal channel signal may refer to a channel signal which does not exhibit low-frequency effects. The first encoding unit 202 may generate a downmixed signal dmx2 using an LFE channel signal Lfe2 and a normal channel signal xi+1. i represents an index of a normal channel signal. That is, the encoder of FIG. 2 may encode a multi-channel signal including a normal channel signal coupled to an LFE channel signal.
  • Here, the first encoding units 201 and 202 may perform parametric coding to output spatial cues and the downmixed signals. Here, the first encoding units 201 and 202 perform parametric coding using the LFE channel signals.
  • When parametric coding is performed using an LFE channel signal, a channel level difference (CLD) as a spatial cue may be extracted from an LFE band. Accordingly, a spatial cue output through parametric coding using an LFE channel signal may output a relatively smaller amount of data than a spatial cue output through generally used parametric coding. Here, the spatial cues output from the first encoding units 201 and 202 are bit1 and bit2, respectively.
  • The second encoding unit 203 may encode the downmixed signal dmx1 output from the first encoding unit 201 and the downmixed signal dmx2 output from the first encoding unit 202. The downmixed signals dmx1 and dmx2 may be input as a stereo signal to the second encoding unit 203. For instance, the second encoding unit 203 may be an Advanced Audio Codec (AAC), MP3, or the like. The second encoding unit 203 outputs bit3 as an encoded result, which is input to the bitstream formatter 204. The bitstream formatter 204 may convert bit3 into a bitstream.
  • FIG. 3 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a second embodiment.
  • The encoder of FIG. 3 may include a first encoding unit 301, a second encoding unit 302, a first encoding unit 303, a second encoding unit 304 and a bitstream formatter 305.
  • The first encoding units 301 and 303 of FIG. 3 may operate in the same manner as the first encoding units 201 and 202 of FIG. 2. That is, the first encoding units 301 and 303 may perform parametric coding using an LFE channel signal to extract a CLD as a spatial cue from an LFE band. In detail, the first encoding unit 301 may generate a downmixed signal dmx1 using an LFE channel signal Lfe1 and a normal channel signal xi. The first encoding unit 303 may generate a downmixed signal dmx2 using an LFE channel signal Lfe2 and a normal channel signal xi+1.
  • The downmixed signal dmxi resulting from encoding by the first encoding unit 301 is input as a mono signal to the second encoding unit 302. The second encoding unit 302 may output bit3 using the downmixed signal dmx1. The downmixed signal dmx2 resulting from encoding by the first encoding unit 303 is input as a mono signal to the second encoding unit 304. The second encoding unit 304 may output bit4 using the downmixed signal dmx2.
  • FIG. 4 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a third embodiment.
  • Referring to FIG. 4, the encoder may include a first encoding unit 401, a second encoding unit 402 and a bitstream formatter 403. LFE channel signals Lfe1 and Lfe2 may be coupled to each other and input to the first encoding unit 401. The first encoding unit 401 may output a downmixed signal dmx3 as a mono signal using the LFE channel signals Lfe1 and Lfe2. Here, bit1 means a spatial cue derived by the first encoding unit 401 through parametric coding.
  • The downmixed signal dmx3 may be input to the second encoding unit 402. Here, the second encoding unit 402 may code an LFE band in the downmixed signal dmx3. A Unified Speech and Audio Codec (USAC) and an Advanced Audio Codec (AAC) may have a separate coding mode for coding an LFE band. The second encoding unit 402 may use a coding mode provided by the USAC or AAC. Bit2 output from the second encoding unit 402 and bit1 output from the first encoding unit 401 may be output as a bitstream through the bitstream formatter 403.
  • FIG. 5 illustrates an encoder which encodes a multi-channel signal including an LFE channel signal according to a fourth embodiment.
  • Referring to FIG. 5, the encoder may include a delay unit 501, a second encoding unit 502, a delay unit 503, a second encoding unit 504 and a bitstream formatter 505. FIG. 5 illustrates a process of encoding an LFE channel signal using the second encoding units 502 and 504, not via the aforementioned first encoding units.
  • In FIG. 5, the second encoding units 502 and 504 may perform parametric coding on an LFE band. Input signals for the second encoding units 502 and 504 may need to be delayed corresponding to the presence of a first encoding unit. When normal channel signals are subjected to encoding two times and LFE channel signals are subjected to encoding once, one more encoding process of the normal channel signals may cause a time delay. Accordingly, a bitstream of the LFE channel signals synchronized with the normal channel signals may be generated only when the time delay is considered.
  • Thus, the delay units 501 and 503 may apply a time delay τenc, which may occur in real encoding, to the LFE channel signals Lfe1 and Lfe2. Subsequently, time-delayed LFE channel signals Lfe1(n-τenc) and Lfe2(n-τenc) may be input to the second encoding units 502 and 504, respectively. Bit1 and bit2, encoded results by the second encoding units 502 and 504 may be output as a bitstream via the bitstream formatter 505.
  • FIG. 6 illustrates a decoder which decodes an encoded result of FIG. 2.
  • Referring to FIG. 6, the decoder may include a bitstream deformatter 601, a first decoding unit 602, a second decoding unit 603 and a second decoding unit 604. FIG. 6 may operate in an inverse manner to FIG. 2.
  • A bitstream input to the bitstream deformatter 601 may be the bitstream generated in FIG. 2. The bitstream deformatter 601 may output bit1, bit2 and bit3 from the bitstream. Bit1, bit2 and bit3 are the same as those mentioned in FIG. 2.
  • Bit3 may be input to the first decoding unit 602. The first decoding unit 602 may generate downmixed signals dmx1 and dmx2 using bit3. The second decoding unit 603 may perform parametric coding on bit1 as a spatial cue and the downmixed signal dmx1 to output a normal channel signal xi and an LFE channel signal Lfe1. Likewise, the second decoding unit 604 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx2 to output a normal channel signal xi+1 and an LFE channel signal Lfe2.
  • FIG. 7 illustrates a decoder which decodes an encoded result of FIG. 3.
  • Referring to FIG. 7, the decoder may include a bitstream deformatter 701, a first decoding unit 702, a second decoding unit 703, a first decoding unit 704 and a second decoding unit 705. FIG. 7 may operate in an inverse manner to FIG. 3.
  • A bitstream input to the bitstream deformatter 701 may be the bitstream generated in FIG. 3. The bitstream deformatter 701 may output bit1, bit2, bit3 and bit4 from the bitstream. Bit1, bit2, bit3 and bit4 are the same as those mentioned in FIG. 3.
  • Bit3 may be input to the first decoding unit 702, and bit4 may be input to the first decoding unit 704. The first decoding unit 702 may generate a downmixed signal dmx1 using bit3. The first decoding unit 704 may generate a downmixed signal dmx2 using bit4.
  • Subsequently, the second decoding unit 703 may perform parametric coding on bit1 as a spatial cue and the downmixed signal dmx1 to output a normal channel signal xi and an LFE channel signal Lfe1. Likewise, the second decoding unit 703 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx2 to output a normal channel signal xi+1 and an LFE channel signal Lfe2.
  • FIG. 8 illustrates a decoder which decodes an encoded result of FIG. 4.
  • Referring to FIG. 8, the decoder may include a bitstream deformatter 801, a first decoding unit 802 and a second decoding unit 803. FIG. 8 may operate in an inverse manner to FIG. 4.
  • A bitstream input to the bitstream deformatter 801 may be the bitstream generated in FIG. 4. The bitstream deformatter 801 may output bit1 and bit2 from the bitstream. Bit1 and bit2 are the same as those mentioned in FIG. 4.
  • Bit1 may be input to the first decoding unit 802, and bit2 may be input to the second decoding unit 803. The first decoding unit 802 may generate a downmixed signal dmx3 using bit3. The second decoding unit 803 may perform parametric coding on bit2 as a spatial cue and the downmixed signal dmx3 to output LFE channel signals Lfe1 and Lfe2. In FIG. 8, the first decoding unit 802 and the second decoding unit 803 may perform parametric coding on an LFE band of the input downmixed signal dmx3.
  • FIG. 9 illustrates a decoder which decodes an encoded result of FIG. 5.
  • Referring to FIG. 9, the decoder may include a bitstream deformatter 901, a first decoding unit 902, a delay unit 903, a first decoding unit 904 and a delay unit 905. FIG. 9 may operate in an inverse manner to FIG. 5.
  • A bitstream input to the bitstream deformatter 901 may be the bitstream generated in FIG. 5. The bitstream deformatter 901 may output bit1 and bit2 from the bitstream. Bit1 and bit2 are the same as those mentioned in FIG. 5.
  • Bit1 may be input to the first decoding unit 902, and bit2 may be input to the first decoding unit 904. The first decoding unit 902 may generate an LFE channel signal Lfe1(n-τenc) using bit1, and the second decoding unit 904 may generate an LFE channel signal Lfe2(n-τenc) using bit2.
  • The delay unit 903 may apply a time delay to the LFE channel signal Lfe1(n-τenc) to output Lfe1(n-τencdec). Likewise, the delay unit 905 may apply a time delay to the LFE channel signal Lfe2(n-τenc) to output Lfe2(n-τencdec).
  • That is, unlike in FIGS. 6 to 8, since a decoding process is carried out once in FIG. 9, the delay units 903 and 905 may apply a time delay τdec occurring in one-time decoding so that signals subjected to one-time decoding synchronize with those subjected to two-time decoding.
  • FIG. 10 illustrates a process of encoding a multi-channel signal using the encoder of FIG. 2.
  • FIG. 10 illustrates an encoder for a multi-channel signal which adopts the encoder Type1 illustrated in FIG. 2. In FIG. 10, Two To Ones (TTOs) 1001, 1002, 1004 and 1005 may encode an input signal according to a parametric coding mode for an MPEG Surround stereo signal. That is, the TTOs may correspond to the first encoding units of FIG. 2, and USAC encoders may correspond to the second encoding unit of FIG. 2.
  • In FIG. 10, TTOs 1001 and 1002 may perform parametric coding according to a normal mode, and TTOs 1004 and 1005 may perform parametric coding according to an LFE mode. When parametric coding is performed according to the normal mode, a CLD, Inter-Channel Coherence (ICC) and Interchannel Phase Difference (IPD) as spatial cues may be extracted by analyzing a normal channel signal xi. When parametric coding is performed according to the LFE mode, only a CLD may be extracted from an LFE band of an input LFE channel signal.
  • FIG. 10 illustrates an encoding process when N multi-channel signals are input. In detail, in a first operation, the N multi-channel signals may be subjected to parametric coding via the TTOs into M downmixed signals dmx1 to dmxM. In a second operation, the M downmixed signals may be input in a stereo form and encoded through USAC core coding. In FIG. 10, LFE channel signals Lfe1 and Lfe2 may be coupled to normal channel signals to be input to the TTO 1004 and TTO1005.
  • That is, referring to FIG. 10, normal channel signals of the multi-channel signals may be coupled and downmixed by two channels, and a downmixed result may be subjected to stereo coding by the USAC encoders. Among the normal channel signals of the multi-channel signals, two normal channel signals x2M-1 and x2M may be respectively coupled to the LFE channel signals Lfe1 and Lfe2 and input to the TTOs(Lfe).
  • Although FIG. 10 shows that the encoder Type 1 of FIG. 2 is adopted, the encoder Type 2 illustrated in FIG. 3 may be applied, instead of the encoder Type 1.
  • FIG. 11 illustrates a decoder which decodes an encoded result of FIG. 10.
  • FIG. 11 illustrates a decoder for a multi-channel signal which adopts the decoder Type 1 illustrated in FIG. 6. In FIG. 11, One To Twos (OTTs) 1103, 1104, 1106 and 1107 may decode an input signal according to a parametric coding mode for an MPEG Surround stereo signal. That is, the OTTs may correspond to the second decoding units of FIG. 6, and USAC decoders may correspond to the first decoding unit.
  • In FIG. 11, OTTs 1103 and 1104 may perform parametric coding according to a normal mode, and the OTTs 1106 and 1107 may perform parametric coding according to an LFE mode. In FIG. 11, the encoded result may be decoded to output N multi-channel audio signals.
  • In detail, in a first operation, M downmixed signals may be output from a bitstream via the USAC decoders. In a second operation, the M downmixed signals may be input to the respective OTTs to output stereo signals. The OTTs 1103 and 1104 may output two normal channel signals, and the OTTs 1106 and 1107 may output normal channel signals coupled to LEF channel signals.
  • FIG. 12 illustrates a process of encoding a multi-channel signal when encoding bits are sufficient in FIG. 10.
  • When encoding bits for normal channel signals included in a multi-channel signal are sufficient, the encoding process of FIG. 12 may be performed. That is, normal channel signals x1 to x2M-2 may be encoded by USAC encoders 1203 and 1206. Here, a delay time τenc, occurring in encoding via delay units 1201, 1201, 1204 and 1205, may be applied to the normal channel signals x1 to x2M-2. Accordingly, time-delayed results may be encoded by the USAC encoders 1203 and 1206.
  • Here, the time delay τenc occurs in OTTs 1207 and 1208 and may include time delays due to quadrature mirror filter (QMF) analysis, hybrid analysis and QMF synthesis. When a signal input to a USAC Encoder 1209 is a QMF signal, a time delay occurring by QMF synthesis may be excluded when calculating τenc.
  • FIG. 13 illustrates a decoder which decodes an encoded result of FIG. 12.
  • FIG. 13 may perform an inverse process to FIG. 12. Referring to FIG. 13, a normal channel signal is decoded by USAC decoders 1302 and 1305 and output via delay units 1303, 1304, 1306 and 1307. A result derived by a bitstream deformatter 1301 maybe decoded by a USAC decoder 1308 to generate downmixed signals, and the downmixed signals may be respectively input to OTTs 1309 and 1310 to output normal channel signals x2M-1 and x2M respectively coupled to LFE channel signals Lfe1 and Lfe2.
  • Here, since the LFE channel signals are subjected to decoding two times, the delay units 1303, 1304, 1306 and 1307 may apply a time delay τdec occurring in one-time decoding to one-time to output results from the USAC decoders 1302 and 1305. Here, τdec includes QMF analysis, hybrid analysis, QMF synthesis and filtering delays and is different from τene. When output signals from the USAC decoders 1302 and 1305 are QMF signals, a time delay occurring by QMF analysis may be excluded when determining τdec. A filtering delay refers to a time delay which occurs due to a filtering operation by the OTTs 1309 and 1310, irrespective of QMF conversion. For example, a filtering delay may be a time delay occurring in a decorrelator operation by the OTTs 1309 and 1310.
  • FIG. 14 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 4.
  • In FIG. 14, the encoder Type 3 illustrated in FIG. 4 may be used. Normal channel signals x1 to x2M of a multi-channel signal may be coupled by two and input to TTOs 1401 and 1402. The TTOs 1401 and 1402 may perform parametric coding on the normal channel signals coupled by two to output downmixed signals dmx1 and dmx2 along with spatial cues. The output downmixed signals dmx1 and dmx2 may be input in a stereo form to a USAC encoder 1403.
  • Meanwhile, LFE channel signals Lfe1 and Lfe2 included in the multi-channel signal may be coupled by two and input to a TTO 140. The TTO 1404 may perform parametric coding using the two LFE channel signals Lfe1 and Lfe2 to output a downmixed signal dmx3 in a mono form. Subsequently, a USAC encoder 1405 may encode the downmixed signal dmx3 in the LFE mode.
  • FIG. 15 illustrates another example of encoding a multi-channel signal using the encoder of FIG. 4.
  • In FIG. 15, the encoder Type 3 illustrated in FIG. 4 may be used. Here, in Fig. 15, a normal channel signal may be encoded by USAC encoders 1503 and 1506, instead of being subjected to parametric coding by TTOs 1401 and 1402 in FIG. 14. As illustrated below, since an LFE channel signal is subjected to encoding two times through a TTO 1507 and a USAC encoder 1508, delay units 1501, 1502, 1504 and 1505 may apply a time delay occurring by the TTO 1507 to the normal channel signal.
  • Meanwhile, LFE channel signals Lfe1 and Lfe2 may be encoded by the TTO 1507 in the LFE mode to output a downmixed signal dmx3, and the downmixed signal dmx3 may be encoded by the USAC encoder.
  • While FIGS. 14 and 15 illustrate the encoders, corresponding decoders may operate according to inverse processes. In detail, a normal channel signal may be output from a bitstream obtained in FIG. 14 via a USAC decoder and an OTT. Also, LFE channel signals Lfe1 and Lfe2 may be output from the bitstream obtained in FIG. 14 via a USAC decoder and an OTT.
  • In addition, a normal channel signal may be output from a bitstream obtained in FIG. 15 via a USAC decoder and a delay unit. Also, LFE channel signals Lfe1 and Lfe2 may be output from the bitstream obtained in FIG. 15 via a USAC decoder and an OTT.
  • FIG. 16 illustrates an example of encoding a multi-channel signal using the encoder of FIG. 5.
  • FIG. 16 illustrates an encoder for a multi-channel signal which adopts the encoder Type 4 illustrated in FIG. 5.
  • Normal channel signals may be converted into downmixed signals through TTOs 1601 and 1602, and the converted downmixed signals may be output as a bitstream through a USAC encoder 1603.
  • Meanwhile, delay units 1604 and 1606 may apply a time delay τenc occurring in the TTOs 1601 and 1602 to LFE channel signals, and time-delayed results may be encoded respectively by USAC encoders 1605 and 1607 according to the LFE mode. That is, since the LFE channel signals are subjected to encoding once, unlike the normal channel signals subjected to encoding two times, the time delay τenc occurring in encoding by the TTOs may need to be applied to the LFE channel signals.
  • FIG. 17 illustrates an example of decoding a multi-channel signal using the decoder of FIG. 9.
  • In FIG. 17, the decoder Type 4 illustrated in FIG. 9 is used. Referring to FIG. 17, normal channel signals may be output through a USAC decoder 1702 and a TTO 1703 and 1704. LFE channel signals may be output through USAC decoders 1705 and 1707 and delay units 1706 and 1708.
  • Since the LFE channel signals are subjected to decoding once, the delay units 1706 and 1708 may need to apply a time delay τdec occurring in decoding by the TTOs 1703 and 1704 to the LFE channel signals. Accordingly, the normal channel signals and the LFE channel signals output from the decoder may be synchronized with each other.
  • FIG. 18 illustrates an encoder which encodes a multi-channel signal when the multi-channel signal includes an odd number of LFE channel signals according to an embodiment.
  • While FIGS. 2 to 17 illustrate an even number of LFE channel signals, FIG. 18 illustrates an odd number of LFE channel signals.
  • Referring to FIG. 18, one LEF channel signal Lfe 2n+1 may be input to a delay unit 1801, and a time delay τ may be applied to the LEF channel signal. The time-delayed LFE channel signal may be encoded by a second encoding unit 1802 in an LFE mode to output bit1. That is, an odd number of LFE channel signals may be processed by the encoder Type 4 of FIG. 5 or the decoder Type 4 of FIG. 9.
  • Although not shown in FIG. 18, unlike the LFE channel signal, a normal channel signal is encoded by a first encoding unit and the second encoding unit, and thus a delay unit 1801 may need to apply a time delay occurring due to the first encoding unit to the LFE channel signal for synchronization with the normal channel signal.
  • FIG. 19 illustrates an encoder which encodes a normal audio signal, not an LFE channel signal, according to an embodiment.
  • Referring to FIG. 19, normal channel signals x1 and x2 may be subjected to parametric coding by a first encoding unit 1901 to be converted into a downmixed signal dmx1 along with a spatial cue bit1. Likewise, normal channel signals x3 and x4 may be subjected to parametric coding by a first encoding unit 1902 to be converted into a downmixed signal dmx2 along with a spatial cue bit2.
  • As described above, parametric coding applied to the normal channel signals may extract not only a CLD but also an ICC and IPD as spatial cues. The downmixed signal dmx1 and dmx2 may be input in a stereo form to a second encoding unit 1903 and encoded to output bit3. Bit3 may be converted into a bistream by a bitstream formatter 1904.
  • FIG. 20 illustrates a decoder which decodes an encoded result of FIG. 19.
  • Referring to FIG. 20, spatial cues bit1 and bit2 and encoded bit3 may be output by a bitstream deformatter 2001 from the bitstream generated in FIG. 19.
  • A first decoding unit 2002 may decode bit3 to output downmixed signals dmx1 and dmx2. A second decoding unit 2003 may decode a downmixed signal dmx1 to output normal channel signals x1 and x2. Likewise, a second decoding unit 2004 may decode a downmixed signal dmx2 to output normal channel signals x1 and x2.
  • FIG. 21 illustrates an encoding process and a decoding process according to an embodiment.
  • The foregoing first encoding units may correspond to TTOs 2101 and 2102 of FIG. 21, and the foregoing second encoding units may correspond to a USAC encoder 2103. Also, the foregoing first decoding units may correspond to a USAC decoder 2104, and the foregoing second encoding units may correspond to OTTs 2105 and 2106.
    1. (i) Four normal channel signals or (ii) results of coupling one normal channel signal and one LFE channel signal may be input to the TTOs 2101 and 2102. The TTOs 2101 and 2102 may generate a downmixed signal along with a spatial cue through parametric coding. The USAC encoder 2103 may encode the downmixed signal.
  • On the contrary, the USAC decoder 2104 may output two downmixed signals from a bitstream. The OTTs 2105 and 2106 may output (i) four normal channel signals or (ii) results of coupling one normal channel signal and one LFE channel signal from the downmixed signals.
  • FIG. 22 illustrates a USAC encoder and a USAC decoder according to a first embodiment.
  • The foregoing embodiments illustrate configurations in which a USAC encoder is separate from a TTO or a USAC decoder is separate from an OTT. Alternatively, as in FIG. 22, a USAC encoder may include TTOs 2203 and 2204 to configure an extended USAC encoder 2201. Likewise, a USAC decoder may include OTTs 2211 and 2212 to configure an extended USAC decoder 2202.
    1. (i) Four normal channel signals or (ii) one normal channel signal and one LFE channel signal may be subjected to parametric coding by the TTOs 2203 and 2204 and output as downmixed signals. The downmixed signals output from the TTOs 2203 and 2204 may be input in a stereo form to a TTO 2205 and be subjected to parametric coding one more time. A result of parametric coding is subjected to in frequency extension by a spectral band replication (SBR) 2206, and a non-frequency-extended core band may be encoded by a core encoder 2207.
  • In a bitstream generated by the extended USAC encoder 2201, the non-frequency-extended core band is decoded by a core decoder 2208, and a decoded result may be input to and subjected to frequency extension by an SBR 2209, thereby reconstructing an original signal. Subsequently, a result of frequency extension by the SBR 2209 may be subjected to parametric coding by an OTT 2210 to generate two downmixed signals, and the downmixed signals may be subjected to parametric coding by the OTTs 2211 and 2212 to output (i) four normal channel signals or (ii) one normal channel signal and one LFE channel signal.
  • FIG. 23 illustrates a USAC encoder and a USAC decoder according to a second embodiment.
  • In FIG. 23, positions of an SBR 2305 and a TTO 2306 in an extended USAC encoder 2301 and positions of an OTT 2309 and an SBR 2310 in an extended USAC decoder 2302 are changed from those in FIG. 22. Other components may be equivalent to those in FIG. 22.
  • The apparatuses described herein may be implemented using hardware components, software components, and/or combinations of hardware components and software components. For instance, the units and components illustrated in the embodiments may be implemented using one or more general-purpose or special purpose computers, such as, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable array (FPA), a programmable logic unit (PLU), a microprocessor or any other device capable of responding to and executing instructions. A processing device may run an operating system (OS) and one or more software applications that run on the OS. The processing device also may access, store, manipulate, process, and create data in response to execution of the software. For purpose of simplicity, the description of a processing device is used as singular; however, one skilled in the art will appreciated that a processing device may include multiple processing elements and multiple types of processing elements. For example, a processing device may include multiple processors or a processor and a controller. In addition, different processing configurations are possible, such as parallel processors.
  • The software may include a computer program, a piece of code, an instruction, or one or more combinations thereof, to independently or collectively instruct or configure the processing device to operate as desired. Software and/or data may be embodied permanently or temporarily in any type of machine, component, physical or virtual equipment, computer storage medium or device, or in a propagated signal wave in order to provide instructions or data to the processing device or to be interpreted by the processing device. The software may also be distributed over network coupled computer systems so that the software is stored and executed in a distributed fashion. The software and data may be stored by one or more non-transitory computer readable recording mediums.
  • The methods according to the embodiments may be realized as program instructions implemented by various computers and be recorded in non-transitory computer-readable media. The media may also include, alone or in combination with the program instructions, data files, data structures, and the like. The program instructions recorded in the media may be designed and configured specially for the embodiments or be known and available to those skilled in computer software. Examples of the non-transitory computer readable recording medium may include magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD ROM disks and DVDs; magneto-optical media such as floptical disks; and hardware devices that are specially configured to store and perform program instructions, such as read-only memory (ROM), random access memory (RAM), flash memory, and the like. Examples of program instructions include both machine codes, such as produced by a compiler, and higher level language codes that may be executed by the computer using an interpreter. The described hardware devices may be configured to act as one or more software modules in order to perform the operations of the above-described exemplary embodiments, or vice versa.
  • While a few exemplary embodiments have been shown and described with reference to the accompanying drawings, it will be apparent to those skilled in the art that various modifications and variations can be made from the foregoing descriptions. For example, adequate effects may be achieved even if the foregoing processes and methods are carried out in different order than described above, and/or the aforementioned elements, such as systems, structures, devices, or circuits, are combined or coupled in different forms and modes than as described above or be substituted or switched with other components or equivalents. Thus, other implementations, alternative embodiments and equivalents to the claimed subject matter are construed as being within the appended claims.

Claims (20)

  1. A method of encoding a multi-channel signal, the method comprising:
    outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are comprised in a multi-channel signal;
    outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are comprised in the multi-channel signal;
    encoding the first downmixed signal the second downmixed signal together; and
    generating a bitstream comprising the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  2. The method of claim 1, wherein the outputting of the first cue outputs the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode, the outputting of the second cue outputs the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue comprise a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  3. A method of encoding a multi-channel signal, the method comprising:
    outputting a first downmixed signal and a first spatial cue by encoding a first normal channel signal and a first low-frequency effects (LFE) channel signal which are comprised in a multi-channel signal;
    outputting a second downmixed signal and a second spatial cue by encoding a second normal channel signal and a second LFE channel signal which are comprised in the multi-channel signal;
    encoding the first downmixed signal;
    encoding the second downmixed signal separately from the first downmixed signal; and
    generating a bitstream comprising the encoded first downmixed signal, the encoded second downmixed signal, the first spatial cue and the second spatial cue.
  4. The method of claim 3, wherein the outputting of the first cue outputs the first downmixed signal and the first spatial cue by applying parametric coding to the first normal channel signal and the first LFE channel signal in an LFE mode, the outputting of the second cue outputs the second downmixed signal and the second spatial cue by applying parametric coding to the second normal channel signal and the second LFE channel signal in the LFE mode, and the first spatial cue and the second spatial cue comprise a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  5. A method of encoding a multi-channel signal, the method comprising:
    outputting a downmixed signal and a spatial cue by encoding a first low-frequency effects (LFE) channel signal and a second LFE channel signal which are comprised in a multi-channel signal;
    encoding the downmixed signal; and
    generating a bitstream comprising the encoded downmixed signal and the spatial cue.
  6. The method of claim 5, wherein the outputting outputs the downmixed signal and the spatial cue by applying parametric coding to the first LFE channel signal and the second LFE channel signal in an LFE mode, and the spatial cue comprises a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  7. A method of encoding a multi-channel signal, the method comprising:
    applying a time delay to a first low-frequency effects (LFE) channel signal comprised in a multi-channel signal;
    applying the time delay to a second LFE channel signal comprised in the multi-channel signal;
    encoding the first LEF channel signal to which the time delay is applied;
    encoding the second LEF channel signal to which the time delay is applied; and
    generating a bitstream comprising the encoded first LEF channel signal and the encoded second LEF channel signal.
  8. The method of claim 7, wherein the time delay comprises a time delay which occurs in encoding a normal channel signal comprised in the multi-channel signal.
  9. A method of encoding a multi-channel signal, the method comprising:
    applying a time delay a normal channel signal comprised in a multi-channel signal;
    encoding the normal channel signal to which the time delay is applied;
    outputting a downmixed signal and a spatial cue by encoding a low-frequency effects (LFE) channel signal comprised in the multi-channel signal; and
    encoding the encoded LFE channel signal,
    wherein the time delay comprises a time delay which occurs in encoding the LFE channel signal.
  10. The method of claim 9, wherein the outputting outputs the downmixed signal and the spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
  11. A method of decoding a multi-channel signal, the method comprising:
    generating a first downmixed signal and a second downmixed signal by decoding an encoded result extracted from a bitstream;
    outputting a first normal channel signal and a first low-frequency effects (LFE) channel signal by decoding the first downmixed signal; and
    outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  12. The method of claim 11, wherein the outputting of the first normal channel signal and the first LFE channel signal outputs the first normal channel signal and the first LEF channel signal from the first downmixed signal by applying a first spatial cue to parametric coding, the outputting of the second normal channel signal and the second LFE channel signal outputs the second normal channel signal and the second LEF channel signal from the second downmixed signal by applying a second spatial cue to parametric coding, and the first spatial cue and the second spatial cue comprise a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  13. A method of decoding a multi-channel signal, the method comprising:
    generating a first downmixed signal by decoding an encoded result extracted from a bitstream;
    generating a second downmixed signal by decoding another encoded result extracted from the bitstream;
    outputting a first normal channel signal and a first low-frequency effects (LFE) channel signal by decoding the first downmixed signal; and
    outputting a second normal channel signal and a second LFE channel signal by decoding the second downmixed signal.
  14. The method of claim 13, wherein the outputting of the first normal channel signal and the first LFE channel signal outputs the first normal channel signal and the first LEF channel signal using parametric coding based on a first spatial cue for the first downmixed signal, the outputting of the second normal channel signal and the second LFE channel signal outputs the second normal channel signal and the second LEF channel signal using parametric coding based on a second spatial cue for the second downmixed signal, and the first spatial cue and the second spatial cue comprise a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  15. A method of decoding a multi-channel signal, the method comprising:
    generating a downmixed signal by decoding an encoded result extracted from a bitstream; and
    outputting a first low-frequency effects (LFE) channel signal and a second LFE channel signal by decoding the downmixed signal.
  16. The method of claim 15, wherein the outputting outputs the first LEF channel signal and the second LFE channel signal by applying parametric coding based on a spatial cue to the downmixed signal, and the spatial cue comprises a channel level difference (CLD) output from an LFE band of the first LFE channel signal or the second LFE channel signal.
  17. A method of decoding a multi-channel signal, the method comprising:
    outputting a first low-frequency effects (LFE) channel signal by decoding an encoded result extracted from a bitstream;
    outputting a second LFE channel signal by decoding another encoded result extracted from the bitstream;
    applying a time delay to the first LEF channel signal; and
    applying the time delay to the second LFE channel signal.
  18. The method of claim 17, wherein the time delay comprises a time delay which occurs in decoding a normal channel signal.
  19. A method of decoding a multi-channel signal, the method comprising:
    decoding a normal channel signal from a bitstream;
    applying a time delay to the decoded normal channel signal;
    decoding a low-frequency effects (LFE) channel signal from the bitstream; and
    decoding the decoded LFE channel signal,
    wherein the time delay comprises a time delay which occurs in decoding the LFE channel signal.
  20. The method of claim 19, wherein the decoding of the LFE channel signal outputs a downmixed signal and a spatial cue by conducting parametric coding on the LFE channel signal in an LFE mode.
EP14826617.4A 2013-07-15 2014-07-15 Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal Ceased EP3023984A4 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
KR20130083073 2013-07-15
KR20130123416 2013-10-16
PCT/KR2014/006406 WO2015009040A1 (en) 2013-07-15 2014-07-15 Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal

Publications (2)

Publication Number Publication Date
EP3023984A1 true EP3023984A1 (en) 2016-05-25
EP3023984A4 EP3023984A4 (en) 2017-03-08

Family

ID=52572683

Family Applications (1)

Application Number Title Priority Date Filing Date
EP14826617.4A Ceased EP3023984A4 (en) 2013-07-15 2014-07-15 Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal

Country Status (3)

Country Link
EP (1) EP3023984A4 (en)
KR (1) KR20150009474A (en)
WO (1) WO2015009040A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180035230A1 (en) * 2015-02-17 2018-02-01 Electronics And Telecommunications Research Institute Multichannel signal processing method, and multichannel signal processing apparatus for performing same

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016133366A1 (en) * 2015-02-17 2016-08-25 한국전자통신연구원 Multichannel signal processing method, and multichannel signal processing apparatus for performing same
WO2018128457A2 (en) * 2017-01-05 2018-07-12 엘지전자 주식회사 Method for performing channel-coding of information on basis of polar code
CN115410584A (en) * 2021-05-28 2022-11-29 华为技术有限公司 Method and apparatus for encoding multi-channel audio signal

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7644003B2 (en) * 2001-05-04 2010-01-05 Agere Systems Inc. Cue-based audio coding/decoding
US7805313B2 (en) * 2004-03-04 2010-09-28 Agere Systems Inc. Frequency-based coding of channels in parametric multi-channel coding systems
KR101271069B1 (en) * 2005-03-30 2013-06-04 돌비 인터네셔널 에이비 Multi-channel audio encoder and decoder, and method of encoding and decoding
US7751572B2 (en) * 2005-04-15 2010-07-06 Dolby International Ab Adaptive residual audio coding
KR100891685B1 (en) * 2005-08-30 2009-04-03 엘지전자 주식회사 Apparatus for encoding and decoding audio signal and method thereof
EP2372701B1 (en) * 2006-10-16 2013-12-11 Dolby International AB Enhanced coding and parameter representation of multichannel downmixed object coding
US20120093323A1 (en) * 2010-10-14 2012-04-19 Samsung Electronics Co., Ltd. Audio system and method of down mixing audio signals using the same

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2015009040A1 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180035230A1 (en) * 2015-02-17 2018-02-01 Electronics And Telecommunications Research Institute Multichannel signal processing method, and multichannel signal processing apparatus for performing same
US10225675B2 (en) * 2015-02-17 2019-03-05 Electronics And Telecommunications Research Institute Multichannel signal processing method, and multichannel signal processing apparatus for performing the method
US10638243B2 (en) 2015-02-17 2020-04-28 Electronics And Telecommunications Research Institute Multichannel signal processing method, and multichannel signal processing apparatus for performing the method

Also Published As

Publication number Publication date
WO2015009040A1 (en) 2015-01-22
EP3023984A4 (en) 2017-03-08
KR20150009474A (en) 2015-01-26

Similar Documents

Publication Publication Date Title
KR100888474B1 (en) Apparatus and method for encoding/decoding multichannel audio signal
EP1851997B1 (en) Near-transparent or transparent multi-channel encoder/decoder scheme
US11056122B2 (en) Encoder and encoding method for multi-channel signal, and decoder and decoding method for multi-channel signal
JP7413418B2 (en) Audio decoder for interleaving signals
RU2643644C2 (en) Coding and decoding of audio signals
CN109887516B (en) Method for decoding audio scene, audio decoder and medium
RU2696952C2 (en) Audio coder and decoder
KR101756838B1 (en) Method and apparatus for down-mixing multi channel audio signals
EP3023984A1 (en) Encoder and encoding method for multichannel signal, and decoder and decoding method for multichannel signal
KR20080066538A (en) Apparatus and method for encoding/decoding multi-channel signal
JP6303435B2 (en) Audio encoding apparatus, audio encoding method, audio encoding program, and audio decoding apparatus
JP6051621B2 (en) Audio encoding apparatus, audio encoding method, audio encoding computer program, and audio decoding apparatus
KR20080035448A (en) Method and apparatus for encoding/decoding multi channel audio signal
JP6299202B2 (en) Audio encoding apparatus, audio encoding method, audio encoding program, and audio decoding apparatus
KR20150011783A (en) Decoding method for multi-channel audio signal using reverberation signal and decoder
KR20140122990A (en) Apparatus and method for encoding/decoding multichannel audio signal
KR20150009426A (en) Method and apparatus for processing audio signal to down mix and channel convert multichannel audio signal

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20160215

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20170203

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 19/008 20130101AFI20170130BHEP

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20200901

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

REG Reference to a national code

Ref country code: DE

Ref legal event code: R003

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20230629