US20070055510A1 - Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding - Google Patents

Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding Download PDF

Info

Publication number
US20070055510A1
US20070055510A1 US11/323,965 US32396505A US2007055510A1 US 20070055510 A1 US20070055510 A1 US 20070055510A1 US 32396505 A US32396505 A US 32396505A US 2007055510 A1 US2007055510 A1 US 2007055510A1
Authority
US
United States
Prior art keywords
signal
channel
parametric data
parametric
deriving
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/323,965
Other languages
English (en)
Inventor
Johannes Hilpert
Christof Faller
Karsten Linzmeier
Ralph Sperschneider
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Original Assignee
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=36873210&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=US20070055510(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV filed Critical Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Priority to US11/323,965 priority Critical patent/US20070055510A1/en
Priority to EP18180076.4A priority patent/EP3404656B1/de
Priority to ES06743182.5T priority patent/ES2690278T3/es
Priority to ES18180076T priority patent/ES2952871T3/es
Priority to BRPI0616019-0A priority patent/BRPI0616019B1/pt
Priority to HUE18180076A priority patent/HUE064455T2/hu
Priority to PL06743182T priority patent/PL1908056T3/pl
Priority to RU2008106225/09A priority patent/RU2382418C2/ru
Priority to DK18180076.4T priority patent/DK3404656T3/da
Priority to PCT/EP2006/005971 priority patent/WO2007009548A1/en
Priority to PL18180076.4T priority patent/PL3404656T3/pl
Priority to FIEP18180076.4T priority patent/FI3404656T3/fi
Priority to EP23214132.5A priority patent/EP4307124A3/de
Priority to PT181800764T priority patent/PT3404656T/pt
Priority to EP06743182.5A priority patent/EP1908056B1/de
Priority to EP23214134.1A priority patent/EP4307126A3/de
Priority to PT06743182T priority patent/PT1908056T/pt
Priority to JP2008521820A priority patent/JP5265358B2/ja
Priority to EP23214133.3A priority patent/EP4307125A3/de
Priority to MX2008000828A priority patent/MX2008000828A/es
Priority to KR1020087002860A priority patent/KR100946688B1/ko
Priority to CA2614384A priority patent/CA2614384C/en
Priority to CN2006800259749A priority patent/CN101223578B/zh
Priority to AU2006272127A priority patent/AU2006272127B2/en
Priority to EP23180543.3A priority patent/EP4235440A3/de
Priority to MYPI20062999A priority patent/MY149198A/en
Priority to TW095125971A priority patent/TWI339028B/zh
Priority to US11/458,646 priority patent/US8180061B2/en
Assigned to FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V. reassignment FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: FALLER, CHRISTOF, HILPERT, JOHANNES, LINZMEIER, KARSTEN, SPERSCHNEIDER, RALPH
Publication of US20070055510A1 publication Critical patent/US20070055510A1/en
Priority to IL188425A priority patent/IL188425A0/en
Priority to NO20080850A priority patent/NO342863B1/no
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/02Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0806Multiuser, multiprocessor or multiprocessing cache systems
    • G06F12/0815Cache consistency protocols

Definitions

  • the present invention relates to multi-channel audio coding and transmission, and in particular to techniques to encode multi-channel audio in a manner that is fully backwards compatible with stereo devices and formats, allowing for an efficient coding of multi-channel audio.
  • MPEG Moving Pictures Experts Group
  • ISO International Organization for Standardization
  • the left down-mix signal (Lt) consists of the left-front signal (Lf), the centre signal (C) multiplied by a factor q, the left-surround signal (Ls) phase rotated by 90 degrees (j′) and scaled by a factor a, and the right-surround signal (Rs) which is also phase rotated by 90 degrees and scaled by a factor b.
  • the right down-mix signal (Rt) is generated similarly.
  • Typical down-mix factors are 0.707 for q and a, and 0.408 for b.
  • MPEG Surround audio coding An example for a coding method adding helper information, also called side information, is MPEG Surround audio coding.
  • This efficient way for parametric multi-channel audio coding is for example described in “The Reference Model Architecture for MPEG Spatial Audio Coding”, Herre, J., Purnhagen, H., Breebaart, J., Faller, C., Disch, S., Kjoerling, K., Schuijers, E., Hilpert, J., Myburg, F., Proc. 118th AES Convention, Barcelona, Spain, 2005 and in “Text of Working Draft for Spatial Audio Coding (SAC)”, ISO/IEC JTC1/SC29/WG11 (MPEG), Document N7136, Busan, Korea, 2005.
  • SAC Spatial Audio Coding
  • FIG. 6 A schematic overview of an encoder used in spatial audio coding is shown in FIG. 6 .
  • the encoder splits incoming signals 10 (input 1 , . . . input N) in separate time-frequency tiles by means of Quadrature Mirror Filters 12 (QMF). Groups of the resulting frequency tiles (bands) are referred to as “parameter bands”.
  • a number of spatial parameters 14 are determined by a parameter estimator 16 that describes the properties of the spatial image, e.g. level differences between pairs of channels (CLD), cross correlation between pairs of channels (ICC) or information on signal envelopes (CPC).
  • CLD level differences between pairs of channels
  • ICC cross correlation between pairs of channels
  • CPC information on signal envelopes
  • These parameters are subsequently quantized, encoded and compiled jointly into a bit-stream of spatial data. Depending on the operation mode, this bit-stream can cover a wide range of bit-rates, starting from a few kBit/s for good quality multi-channel audio up to tenth
  • the encoder also generates a mono or stereo down-mix from the multi-channel input signal.
  • the user has the choice of a conventional (ITU-style) stereo down-mix or of a down-mix that is compatible with matrixed-surround systems.
  • the stereo down-mix is transferred to the time-domain by means of QMF synthesis banks 18 .
  • the resulting down-mix can be transmitted to a decoder, accompanied by the spatial parameters or the spatial parameter bit-stream 14 .
  • the down-mix is also encoded before transmission (using a conventional mono or stereo core coder), while the bit-streams of the core coder and the spatial parameters might additionally be combined (multiplexed) to form a single output bit-stream.
  • a decoder as sketched in FIG. 7 , in principle performs the reverse process of the encoder.
  • An input-stream is split into a core coder bit-stream and a parameter bit-stream. This is not shown in FIG. 7 .
  • the decoded down-mix 20 is processed by a QMF analysis bank 22 to derive parameter bands that are the same as those applied in the encoder.
  • a spatial synthesis stage 24 reconstructs the multi-channel signal by means of control data 26 (i.e., the transmitted spatial parameters).
  • the QMF-domain signals are transferred to the time domain by means of a QMF synthesis bank 27 that derives the final multi-channel output signals 28 .
  • FIG. 8 shows a simple example of a QMF analysis, as it is performed within the prior art encoder in FIG. 6 and the prior art decoder in FIG. 7 .
  • An audio sample 30 sampled in the time domain and having four sample values is input into a filter bank 32 .
  • the filter bank 32 derives three output samples 34 a, 34 b and 34 c having four sample values each.
  • the filter bank 32 derives the output samples 34 a to 34 c such that the samples within the output signals do only comprise information on discrete frequency ranges of the underlying audio signal 30 .
  • FIG. 8 shows a simple example of a QMF analysis, as it is performed within the prior art encoder in FIG. 6 and the prior art decoder in FIG. 7 .
  • An audio sample 30 sampled in the time domain and having four sample values is input into a filter bank 32 .
  • the filter bank 32 derives three output samples 34 a, 34 b and 34 c having four sample values each.
  • the filter bank 32 derives the output samples 34
  • the sample 34 a has information on the frequency interval ranging from f 0 to f 1
  • the sample 34 b has information of the frequency interval [f 1 , f 2 ]
  • the sample 34 c has information on the frequency interval [f 2 , f 3 ].
  • the frequency intervals in FIG. 8 do not overlap, in a more general case the frequency intervals of the output samples coming out of a filter bank may very well have a frequency overlap.
  • a prior art encoder can, as already described above, deliver either an ITU-style down-mix or a matrixed-surround compatible down-mix, when a two-channel down-mix is desired.
  • a matrixed-surround compatible down-mix using for example the matrixing approach given in Equation 1), one possibility would be that the encoder generates a matrixed-surround compatible down-mix directly.
  • FIG. 9 shows an alternative approach to generate a matrixed-surround compatible down-mix using a down-mix post processing unit 30 working on a regular stereo down-mix 32 .
  • the matrixed-surround processor 30 modifies the regular stereo down-mix 32 to make it matrixed-surround compatible guided by the spatial parameters 14 extracted by the parameter extraction stage 16 .
  • a matrixed-surround compatible down-mix 34 is transferred to the time domain by a QMF synthesis using the QMF synthesis bank 18 .
  • Deriving the matrixed-surround compatible signal by post-processing a regular stereo down-mix has the advantage that the matrixed-surround compatibility processing can be fully reversed at a decoder side if the spatial parameters are available.
  • Matrixed-surround methods are very efficient (since no additional parameters are required) at the price of a very limited multi-channel reconstruction quality.
  • Parametric multi-channel approaches require a higher bit-rate due to the side information, which becomes a problem when a limit is set as a maximum acceptable bitrate for the parametric representation.
  • the encoded parameters require a comparatively high amount of bit-rate, the only possible way to stay within such a bit-rate limit is to decrease the quality of an encoded down-mix channel by increasing the compression of the channel.
  • the result is a general loss in audio quality, which may be unacceptably high.
  • bit-rate that has to be spent when applying the parametric method may be too high in case of certain application scenarios, the audio quality delivered by the methods without transmission of side-information might not be sufficient.
  • the US Patent Application 2005157883 is showing an apparatus for constructing a multi-channel audio signal using an input signal and parametric side information, the input signal including the first input channel and the second input channel derived from an original multi-channel signal, and the parametric side information describing interrelations between channels of the multi-channel original signal.
  • this object is achieved by a multi-channel audio decoder for processing an audio signal and for processing first parametric data describing a first portion of a multi-channel signal, wherein for a second portion of the multi-channel signal no parametric data or second parametric data is processed, the second parametric data requiring less information units than the first parametric data when describing an identical portion of the multi-channel signal, comprising: a processor for deriving an intermediate signal from the audio signal, using a first deriving rule for deriving a first portion of the intermediate signal, the first portion of the intermediate signal corresponding to the first portion of the multi-channel audio signal, wherein the first deriving rule is depending on the first parametric data; and using a second deriving rule for deriving a second portion of the intermediate signal, the second deriving rule using no parametric data or the second parametric data.
  • this object is achieved by a multi-channel encoder for generating a parametric representation describing spatial properties of a multi-channel audio signal
  • the multi-channel encoder comprising: a parameter generator for generating spatial parameters; an output interface for generating the parametric representation, wherein the parameter generator or the output interface is adapted to generate the parametric representation such that the parametric representation includes first parametric data for a first portion of the multi-channel signal and wherein for a second portion of the multi-channel signal no parametric data or second parametric data is included in the parametric representation, the second parametric data requiring less information units than the first parametric data when describing an identical portion of the multi-channel signal.
  • this object is achieved by a method for processing an audio signal and for processing first parametric data describing a first portion of a multi-channel signal, wherein for a second portion of the multi-channel signal no parametric data or second parametric data is processed, the second parametric data requiring less information units than the first parametric data when describing an identical portion of the multi-channel signal, the method comprising: deriving an intermediate signal from the down-mix signal using a first deriving rule depending on the first parametric data for deriving a first portion of the intermediate signal, the first portion of the intermediate signal corresponding to the first portion of the multi-channel audio signal; and deriving a second portion of the intermediate signal using a second deriving rule, the second deriving rule using the second parametric data or no parametric data.
  • this object is achieved by a method for generating a parametric representation describing spatial properties of a multi-channel audio signal, the method comprising: generating spatial parameters; and generating the parametric representation such that the parametric representation includes first parametric data for a first portion of the multi-channel signal and wherein for a second portion of the multi-channel signal no parametric data or second parametric data is included in the parametric representation, the second parametric data requiring less information units than the first parametric data when describing an identical portion of the multi-channel signal.
  • this object is achieved by a parametric representation describing spatial properties of a multi-channel audio signal, the parametric representation including first parametric data for a first portion of the multi-channel signal and wherein the parametric representation is including no parametric data or second parametric data for a second portion of the multi-channel signal, the second parametric data requiring less information units than the first parametric data for an identical portion of the multi-channel signal.
  • this object is achieved by a computer program having a program code for performing, when running on a computer, a method for processing an audio signal and for processing first parametric data describing a first portion of a multi-channel signal, wherein for a second portion of the multi-channel signal no parametric data or second parametric data is processed, the second parametric data requiring less information units than the first parametric data when describing an identical portion of the multi-channel signal, the method comprising: deriving an intermediate signal from the down-mix signal using a first deriving rule depending on the first parametric data for deriving a first portion of the intermediate signal, the first portion of the intermediate signal corresponding to the first portion of the multi-channel audio signal; and deriving a second portion of the intermediate signal using a second deriving rule, the second deriving rule using the second parametric data or no parametric data.
  • this object is achieved by a computer program having a program code for performing, when running on a computer, a method for generating a parametric representation describing spatial properties of a multi-channel audio signal, the method comprising: generating spatial parameters; and generating the parametric representation such that the parametric representation includes first parametric data for a first portion of the multi-channel signal and wherein for a second portion of the multi-channel signal no parametric data or second parametric data is included in the parametric representation, the second parametric data requiring less information units than the first parametric data when describing an identical portion of the multi-channel signal.
  • a transcoder for generating a parametric representation of a multi-channel audio signal using spatial parameters describing the spatial properties of the multi-channel audio signal, comprising: a parameter generator to generate the parametric representation such that the parametric representation includes first parametric data being derived from the spatial parameters for a first portion of the multi-channel signal and wherein for a second portion of the multi-channel signal no parametric data or second parametric data is included in the parametric representation, the second parametric data requiring less information units than the first parametric data when describing an identical portion of the multi-channel signal.
  • the present invention is based on the finding that a multi-channel audio signal can be efficiently represented by a parametric representation, when a first deriving rule is used for deriving first parametric data of the parametric representation describing a first portion of the multi-channel signal, and when for a second portion of the multi-channel signal second parametric data or no parametric data is included in the parametric representation, whereas the second parametric data is requiring less information units than the first parametric data when describing an identical portion of the multi-channel signal.
  • a first portion of the multi channel signal is represented by first parameters allowing for a reconstruction of the multi channel signal with higher quality and a second portion can be represented by second parameters allowing for a reconstruction with slightly lower quality.
  • the bit-rate consumed by the first parametric data is consequently higher than the bit rate consumed by the second parametric data when both parametric data is to describe the same portion of a multi-channel signal.
  • the first parameters require more bit rate per signal portion than the second parameters.
  • the purpose of the invention is to bridge the gap between both prior art worlds by gradually improving the sound of the up-mix signal while raising the bit-rate consumed by the side-information starting from 0 up to the bit-rates of the parametric methods. That is, the present invention aims at bridging the gap in bit-rates and perceptual quality between fully parametric methods and matrixed-surround methods. More specifically, it provides a method of flexibly choosing an “operating point” somewhere between matrixed-surround (no side-information, limited audio quality) and fully parametric reconstruction (full side-information rate required, good quality). This operating point can be chosen dynamically (i.e. varying in time) and in response to the permissible side-information rate, as it is dictated by the individual application.
  • the demanded bit-rate can be varied within a broad range. Representing major parts of a multi-channel signal by the spatial audio parameters will consume a comparatively high bit-rate at the benefit of a good perceptual quality. Since for the second portion of the multi-channel audio signal a parameter deriving rule is chosen that results in parameters consuming less bit-rate, the resulting total bit-rate can be decreased by increasing the size of the second portion of the multi-channel signal. In a preferred embodiment of the present invention, no parametric data at all is transmitted for the second portion of the multi-channel signal, which is of course most bit-saving. Therefore, by dynamically shifting the size of the first portion with respect to the size of the second portion, the bit-rate (or the perceptual quality) can be dynamically adjusted to the needs.
  • a down-mix signal is derived in a matrix compatible way. Therefore, the first portion of the multi-channel audio signal can be reproduced with high perceptual quality using the spatial audio parameters and the second portion of the multi-channel signal can be reproduced using matrix-based solutions. This allows for a high-quality reproduction of parts of the signals requiring higher quality. At the same time, the overall bit-rate is decreased by relying on a matrix-based reproduction for signal parts less vital for the quality of a reproduced signal.
  • the inventive concept is applied on the decoder side within a QMF representation of a received down-mix signal.
  • the up-mixing process can principally be sub-divided into three steps:
  • Both, the pre-de-correlator matrix as well as the mixed-matrix are two-dimensional matrices with the dimensions “number of time slots” on the one hand and “number of parameter bands” on the other hand.
  • the elements of these matrices are filled up with values that are derived from the parameters read from the spatial bit-stream, i.e. by the first parametric data.
  • the first parametric data is only received for a first portion of the multi-channel signal, only that portion of a reconstruction of a multi-channel signal can be derived using the first parametric data submitted.
  • the matrix elements for deriving the second part of the reconstruction of the multi-channel signal are, according to the present invention, derived using matrix compatible coding schemes. These matrix elements can therefore either be derived based only on knowledge achieved from the down-mix signal or be replaced by pre-defined values.
  • a multi-channel audio decoder recognizes by the amount of the transmitted first parametric data, which part of the matrix or which part of the multi-channel audio signal is to be processed by the rule depending on the spatial parameters and which part is to be processed by the matrix based solution.
  • an audio encoder creates window information, indicating which parts of a multi-channel signal are being processed by the matrix based solution or by the spatial audio compatible approach.
  • the window information is included in the parametric representation of a multi-channel signal.
  • An inventive decoder therefore, is able to receive and to process the window information created to apply the appropriate up-mixing rules on the portions of the multi-channel audio signal indicated by the window information.
  • the inventive concept is applied in the QMF domain during the signal processing, i.e. in a domain where the signals are represented by multiple representations each representation holding information on a certain frequency band.
  • the side-information free method (matrix based approach) is applied only to the higher frequency parts while applying (explicit) parametric information (i.e. the first encoding and decoding rule) for a proper reproduction of the low-frequency parts.
  • This is advantageous due to the property of the human hearing to notice small deviations of two similar signals (e.g. phase deviations) a lot easier for low frequencies than for high frequencies.
  • a great benefit of the present invention is that a backwards compatibility of a spatial audio encoding and decoding scheme with matrix based solutions is achieved without having to introduce additional hard- or software when the encoding and decoding rules of the spatial audio coders are chosen appropriately.
  • the coding scheme according to the present invention is furthermore extremely flexible, as it allows a seamless adjustment of the bit-rate or the quality, i.e. a smooth transition between full matrix based coding to full spatial audio coding of a given signal. That is, the coding scheme applied can be adjusted to the actual needs, either with respect to the required bit-rate or with respect to the desired quality.
  • FIG. 1 shows an inventive encoder
  • FIG. 2 shows an example of a parameter bit-stream created by the inventive concept
  • FIG. 2 a shows an inventive transcoder
  • FIG. 3 shows an inventive decoder
  • FIG. 4 shows an example of a spatial audio decoder implementing the inventive concept
  • FIG. 5 illustrates the use of the different coding schemes on a decoder side
  • FIG. 6 shows a prior art encoder
  • FIG. 7 shows a prior art decoder
  • FIG. 8 shows a block diagram of a filterbank
  • FIG. 9 shows a further example of a prior art encoder.
  • FIG. 1 shows an inventive multi-channel encoder.
  • the multi-channel encoder 100 is having a parameter generator 102 and an output interface 104 .
  • a multi-channel audio signal 106 is input into the encoder 100 , where a first portion 108 and a second portion 110 of the multi-channel signal 106 are processed.
  • the parameter generator 102 receives the first portion 108 and the second portion 110 and derives spatial parameters describing spatial properties of the multi-channel signal 106 .
  • the spatial parameters are transferred to the output interface 104 that derives a parametric representation 112 of the multi-channel signal 106 such that the parametric representation 112 includes first parametric data for a first portion 108 of the multi-channel signal and wherein for a second portion 110 of the multi-channel signal 106 second parametric data requiring less information than the first parametric data or no parametric data is included in the parametric representation 112 .
  • the parameter generator 102 can apply two different parameter deriving rules on the first portion 108 and on the second portion 110 that result in different parameter sets that are then transferred to the output interface 104 that combines the different parameter sets into the parametric representation 112 .
  • a special and preferred case is that for the second portion 110 no parameters are included in the parametric representation (and therefore not derived by the parameter generator 102 ) since on a decoder side the decoder derives the required decoding parameters by some heuristic rules.
  • the parameter generator 102 derives a full set of spatial audio parameters as well for the first portion 108 as for the second portion 110 .
  • the output interface 104 would have to process the spatial parameters such that the second parametric data require less bits than the first parametric data.
  • the output interface 104 could add an additional window signal to the parametric representation 112 that shall signal to a decoder, how the multi-channel signal 106 was split into the first portion 108 and into the second portion 110 during the encoding.
  • the multi-channel encoder 100 may additionally have a portion decider for deciding, which part of the multi-channel signal 106 is used as the first portion 108 and which part is used as the second portion 110 , the decision being based on a quality criterion.
  • the quality criterion can be derived with respect to a resulting total bit-rate of the parametric representation 112 or with respect to quality aspects, taking into account the perceptual quality of a reproduction of the multi-channel signal 106 based on the parametric representation 112 .
  • bit-rate consumed by the parametric representation can thus be varied in time, assuring that the quality criterion is met at any time during the encoding while allowing for an overall reduction of the required bit-rate compared to prior art methods.
  • FIG. 2 shows an example of a parametric representation 112 created by an inventive encoder.
  • FIG. 2 shows a parameter bit-stream, i.e. a parametric representation for two consecutive frames.
  • the parameter bit-stream is having a representation of a high-quality frame 120 and a representation of a lower quality frame 122 .
  • the decision was taken that the first portion 108 , which is being represented by parametric data has to be big compared to the second portion, which may for example be the case if the audio scene to encode is rather complex.
  • the 2 is furthermore created under the assumption that a preferred embodiment of an inventive encoder is used that does not derive any parametric data for the second portion 110 of the multi-channel signal 106 .
  • 28 spatial parameters ICC and ICLD are included in the parametric representation to describe the high-quality frame 120 .
  • the 28 spatial parameters describe the lower frequency bands of a QMF representation of the multi-channel signal.
  • the lower quality frame 122 comprises only 21 spatial parameter sets having ICC and ICLD parameters as this was found to be sufficient for the desired perceptual quality.
  • FIG. 2 a shows an inventive transcoder 150 .
  • the inventive transcoder receives as an input an input bit stream 152 having a full set of spatial parameters describing a first frame 154 and a second frame 156 of a multi-channel audio signal.
  • the transcoder 150 generates a bit stream 158 holding a parametric representation representing the spatial properties of the multi-channel audio signal.
  • the transcoder 150 derives the parametric representation such that for the first frame the number of parameters 160 is only slightly decreased.
  • the number of parameters 162 describing the second frame corresponding to the input parameters 156 are strongly decreased, which reduces the amount of bit rate needed by the resulting parametric representation significantly.
  • Such an inventive transcoder 150 can therefore be used to post-process an already existing bit stream of spatial parameters to derive an inventive parametric representation requiring less bit rate during transmission or less storage space when stored on a computer-readable medium. It should be noted here that it is of course also possible to implement a transcoder for transcoding in the other direction, i.e. using the parametric representation to generate spatial parameters.
  • the inventive transcoder 150 can be implemented in various different ways, as for example by reducing the amount of parameters with a given rule or by additionally receiving the multi-channel audio signal to analyze the reduction of bit rate possible without disturbing the perceptual quality beyond an acceptable limit.
  • FIG. 3 shows an inventive multi-channel audio decoder 200 having a processor 202 .
  • the processor is receiving as an input a down-mix signal 204 derived from a multi-channel audio signal, first parametric data 206 describing a first portion of the multi-channel signal and, for a second portion of the multi-channel signal, optional second parametric data 208 requiring less bits than the first parametric data 206 .
  • the processor 202 is deriving an intermediate signal 210 from the down-mix signal 204 using a first deriving rule for deriving a high-quality portion 212 of the intermediate signal, wherein the high-quality portion 212 of the intermediate signal 212 is corresponding to the first portion of the multi-channel audio signal.
  • the processor 202 is using a second deriving rule for a second portion 214 of the intermediate signal 210 , wherein the second deriving rule is using the second parametric data or no parametric data and wherein the first deriving rule is depending on the first parametric data 206 .
  • the intermediate signal 210 derived by the processor 202 is built from a combination of the high-quality portion 212 and of the second portion 214 .
  • the multi-channel audio decoder 200 may derive by itself, which portions of the down-mix signal 204 are to be processed with the first parametric data 206 by applying some appropriate rules, for example counting the number of spatial parameters included in the first parametric data 206 .
  • the processor 202 may be signalled the fractions of the high-quality portion 212 and of the second portion 214 within the down-mix signal 204 by some additional window information which is derived on an encoder side and that is additionally transmitted to the multi-channel audio decoder 200 .
  • the second parametric data 208 is omitted and the processor 202 derives the second deriving rule from information already contained in the down-mix signal 204 .
  • FIG. 4 shows a further embodiment of the present invention that combines the inventive feature of matrix compatibility in a spatial audio decoder.
  • the multi-channel audio decoder 600 comprises a pre-de-correlator 601 , a de-correlator 602 and a mix-matrix 603 .
  • the multi-channel audio decoder 600 is a flexible device allowing to operate in different modi depending on the configuration of input signals 605 input into the pre-de-correlator 601 .
  • the pre-de-correlator 601 derives intermediate signals 607 that serve as input for the de-correlator 602 and that are partially transmitted unaltered to form, together with decorrelated signals calculated by the de-correlator 602 , input signals 608 .
  • the input signals 608 are the signals input into the mix-matrix 603 that derives output channel configurations 610 a or 610 b, depending on the input channel configuration 605 .
  • a down-mix signal and an optional residual signal is supplied to the pre-de-correlator 601 , that derives four intermediate signals (e 1 to e 4 ) that are used as an input of the de-correlator, which derives four de-correlated signals (d 1 to d 4 ) that form the input parameters 608 together with a directly transmitted signal m derived from the input signal.
  • the de-correlator 602 may be operative to simply forward the residual signal instead of deriving a de-correlated signal. This may also be done in a frequency selective manner for certain frequency bands only.
  • the input signals 605 comprise a left channel, a right channel and optionally a residual signal.
  • the pre-de-correlator matrix 601 derives a left, a right and a center channel and in addition two intermediate channels (e 1 , e 2 ).
  • the input signals to the mix-matrix 603 are formed by the left channel, the right channel, the centre channel, and two de-correlated signals (d 1 and d 2 ).
  • the pre-de-correlator matrix may derive an additional intermediate signal (e 5 ) that is used as an input for a de-correlator (D 5 ) whose output is a combination of the de-correlated signal (d 5 ) derived from the signal (e 5 ) and the de-correlated signals (d 1 and d 2 ).
  • an additional de-correlation can be guaranteed between the centre channel and the left and the right channel.
  • the inventive audio decoder 600 implements the inventive concept in the 2-to-5 configuration.
  • the transmitted parametric representation is used in the pre-de-correlation matrix 601 and in the mix-matrix 603 .
  • the inventive concept can be implemented in different ways as shown in more detail in FIG. 5 .
  • FIG. 5 shows the pre-de-correlator, implemented as predecorrelator-matrix 601 and the mix-matrix 603 in a principle sketch, wherein the other components of the multi-channel audio decoder 600 are omitted.
  • the matrix used to perform the pre-de-correlation and the mixing has columns that represent time slots, i.e. the individual time samples of a signal and rows that represent the different parameter bands, i.e. each row is associated with one parameter band of an audio signal.
  • the matrix elements of the matrices 601 and 603 are only partly derived from transmitted parametric data, wherein the remaining matrix elements are derived by the decoder, based for example on knowledge of the down-mix signal.
  • FIG. 5 shows one example where below a given frequency border line 622 the elements of the pre-de-correlator matrix 601 and the mix-matrix 603 are derived from parameters 620 that are read from the bit-stream, i.e. based on information transmitted from the encoder. Above the frequency borderline 622 the matrix elements are derived in the decoder based on knowledge of the down-mix signal only.
  • the border frequency (or in general: the amount of matrix elements derived from transmitted data) can be freely adapted according to the quality and/or bit-rate constraints that have to be met for the particular application scenario.
  • a side-information free up-mix process may be performed with the same structure that has been outlined in the MPEG Spatial Audio Coding Reference Model 0.
  • This invention may consist in describing a method for side-information free up-mix, but preferably provides a method for seamless and advantageous combination of such concepts with methods for side-information assisted up-mix.
  • the elements of the matrices M 1 ( 601 ) and M 2 ( 603 ) are preferably not derived from data transmitted in a bit-stream but by different means without the help of side-information, e.g. by applying heuristic rules based only on knowledge achieved from the down-mix signal.
  • the present invention is by no means limited to this splitting of the multi-channel signal into a first portion and a second portion as it may also be advantageous or appropriate to describe higher frequency parts of the signal with better accuracy. This may especially be the case when in the lower frequency region only little energy is contained in the signal since most of the energy is contained in a high-frequency domain of the audio signal. Due to masking effects the low-frequency part will be mostly dominated by the high frequency parts then and it may be advantageous to provide the possibility for a high-quality reproduction of the high-frequency part of the signal.
  • the inventive methods can be implemented in hardware or in software.
  • the implementation can be performed using a digital storage medium, in particular a disk, DVD or a CD having electronically readable control signals stored thereon, which cooperate with a programmable computer system such that the inventive methods are performed.
  • the present invention is, therefore, a computer program product with a program code stored on a machine readable carrier, the program code being operative for performing the inventive methods when the computer program product runs on a computer.
  • the inventive methods are, therefore, a computer program having a program code for performing at least one of the inventive methods when the computer program runs on a computer.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Mathematical Physics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Algebra (AREA)
  • General Engineering & Computer Science (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereo-Broadcasting Methods (AREA)
US11/323,965 2005-07-19 2005-12-29 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding Abandoned US20070055510A1 (en)

Priority Applications (30)

Application Number Priority Date Filing Date Title
US11/323,965 US20070055510A1 (en) 2005-07-19 2005-12-29 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
PT06743182T PT1908056T (pt) 2005-07-19 2006-06-21 Conceito para conciliar a codificação paramétrica de áudio multicanal e a codificação de matriz de surround multicanal
EP06743182.5A EP1908056B1 (de) 2005-07-19 2006-06-21 Konzept zur überbrückung der bresche zwischen parametrischer mehrkanal-audiocodierung und matrix-surround-mehrkanalcodierung
AU2006272127A AU2006272127B2 (en) 2005-07-19 2006-06-21 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
ES18180076T ES2952871T3 (es) 2005-07-19 2006-06-21 Concepto para puentear el espacio entre codificación parámetrica de audio multicanal y codificación multicanal envolvente matricial
BRPI0616019-0A BRPI0616019B1 (pt) 2005-07-19 2006-06-21 conceito para superar a lacuna entre codificação paramétrica e codificação matrixed-surround de áudio multicanais
HUE18180076A HUE064455T2 (hu) 2005-07-19 2006-06-21 Koncepció a parametrikus többcsatornás audió kódolás és a mátrixolt térhatású többcsatornás kódolás közötti rés áthidalására
PL06743182T PL1908056T3 (pl) 2005-07-19 2006-06-21 Koncepcja wypełnienia luki między parametrycznym wielokanałowym kodowaniem audio i wielokanałowym kodowaniem matrix-surround
JP2008521820A JP5265358B2 (ja) 2005-07-19 2006-06-21 パラメトリックマルチチャネルオーディオ符号化とマトリックスサラウンドマルチチャネル符号化との間のギャップを埋めるための概念
DK18180076.4T DK3404656T3 (da) 2005-07-19 2006-06-21 Koncept til at bygge bro mellem parametrisk multikanalaudiokodning og matrix-surround-multikanalkodning
PCT/EP2006/005971 WO2007009548A1 (en) 2005-07-19 2006-06-21 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
PL18180076.4T PL3404656T3 (pl) 2005-07-19 2006-06-21 Koncepcja wypełnienia luki między parametrycznym wielokanałowym kodowaniem audio i wielokanałowym kodowaniem matrix-surround
FIEP18180076.4T FI3404656T3 (fi) 2005-07-19 2006-06-21 Periaate parametrisen monikanava-audiokoodauksen ja matriisiympäröidyn monikanavakoodauksen välisen välin ohittamiseksi
EP23214132.5A EP4307124A3 (de) 2005-07-19 2006-06-21 Konzept zur überbrückung der lücke zwischen parametrischer mehrkanal-audiocodierung und matrix-surround-mehrkanal-codierung
PT181800764T PT3404656T (pt) 2005-07-19 2006-06-21 Conceito para conciliar a codificação paramétrica de áudio multicanal e a codificação de matriz de surround multicanal
EP18180076.4A EP3404656B1 (de) 2005-07-19 2006-06-21 Konzept zur überbrückung der lücke zwischen parametrischer mehrkanal-audiocodierung und matrix-surround-mehrkanal-codierung
EP23214134.1A EP4307126A3 (de) 2005-07-19 2006-06-21 Konzept zur überbrückung der lücke zwischen parametrischer mehrkanal-audiocodierung und matrix-surround-mehrkanal-codierung
EP23180543.3A EP4235440A3 (de) 2005-07-19 2006-06-21 Konzept zur überbrückung der lücke zwischen parametrischer mehrkanal-audiocodierung und matrix-surround-mehrkanal-codierung
RU2008106225/09A RU2382418C2 (ru) 2005-07-19 2006-06-21 Способ совмещения параметрического многоканального аудиокодирования с матричным многоканальным кодированием объемного звучания
EP23214133.3A EP4307125A3 (de) 2005-07-19 2006-06-21 Konzept zur überbrückung der lücke zwischen parametrischer mehrkanal-audiocodierung und matrix-surround-mehrkanal-codierung
MX2008000828A MX2008000828A (es) 2005-07-19 2006-06-21 Concepto para puentear el espacio entre codificacion parametrica de audio multicanal y codificacion multicanal de borde de matriz.
KR1020087002860A KR100946688B1 (ko) 2005-07-19 2006-06-21 멀티 채널 오디오 디코더, 멀티 채널 인코더, 오디오 신호 처리 방법 및 상기 처리 방법을 수행하는 프로그램을 기록한 기록매체
CA2614384A CA2614384C (en) 2005-07-19 2006-06-21 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
CN2006800259749A CN101223578B (zh) 2005-07-19 2006-06-21 多通道音频的编码和解码
ES06743182.5T ES2690278T3 (es) 2005-07-19 2006-06-21 Concepto para puentear el espacio entre codificación parámetrica de audio multicanal y codificación multicanal envolvente matricial
MYPI20062999A MY149198A (en) 2005-07-19 2006-06-23 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
TW095125971A TWI339028B (en) 2005-07-19 2006-07-17 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
US11/458,646 US8180061B2 (en) 2005-07-19 2006-07-19 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
IL188425A IL188425A0 (en) 2005-07-19 2007-12-26 Concept for bridging the gap between parameteric multi-channel audio coding and matrixed-surround multi-channel coding
NO20080850A NO342863B1 (no) 2005-07-19 2008-02-18 Konsept for kopling av gapet mellom parametrisk flerkanals audiokoding og matrise-surround flerkanalkoding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US70100105P 2005-07-19 2005-07-19
US11/323,965 US20070055510A1 (en) 2005-07-19 2005-12-29 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US11/458,646 Continuation US8180061B2 (en) 2005-07-19 2006-07-19 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
US13/727,532 Division US8990456B2 (en) 2005-12-30 2012-12-26 Method and apparatus for memory write performance optimization in architectures with out-of-order read/request-for-ownership response

Publications (1)

Publication Number Publication Date
US20070055510A1 true US20070055510A1 (en) 2007-03-08

Family

ID=36873210

Family Applications (2)

Application Number Title Priority Date Filing Date
US11/323,965 Abandoned US20070055510A1 (en) 2005-07-19 2005-12-29 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
US11/458,646 Active 2030-05-31 US8180061B2 (en) 2005-07-19 2006-07-19 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding

Family Applications After (1)

Application Number Title Priority Date Filing Date
US11/458,646 Active 2030-05-31 US8180061B2 (en) 2005-07-19 2006-07-19 Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding

Country Status (21)

Country Link
US (2) US20070055510A1 (de)
EP (6) EP4307124A3 (de)
JP (1) JP5265358B2 (de)
KR (1) KR100946688B1 (de)
CN (1) CN101223578B (de)
AU (1) AU2006272127B2 (de)
BR (1) BRPI0616019B1 (de)
CA (1) CA2614384C (de)
DK (1) DK3404656T3 (de)
ES (2) ES2952871T3 (de)
FI (1) FI3404656T3 (de)
HU (1) HUE064455T2 (de)
IL (1) IL188425A0 (de)
MX (1) MX2008000828A (de)
MY (1) MY149198A (de)
NO (1) NO342863B1 (de)
PL (2) PL1908056T3 (de)
PT (2) PT1908056T (de)
RU (1) RU2382418C2 (de)
TW (1) TWI339028B (de)
WO (1) WO2007009548A1 (de)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070223749A1 (en) * 2006-03-06 2007-09-27 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
US20080126104A1 (en) * 2004-08-25 2008-05-29 Dolby Laboratories Licensing Corporation Multichannel Decorrelation In Spatial Audio Coding
US20080199026A1 (en) * 2006-12-07 2008-08-21 Lg Electronics, Inc. Method and an Apparatus for Decoding an Audio Signal
US20080201153A1 (en) * 2005-07-19 2008-08-21 Koninklijke Philips Electronics, N.V. Generation of Multi-Channel Audio Signals
US20090164227A1 (en) * 2006-03-30 2009-06-25 Lg Electronics Inc. Apparatus for Processing Media Signal and Method Thereof
US20090240503A1 (en) * 2005-10-07 2009-09-24 Shuji Miyasaka Acoustic signal processing apparatus and acoustic signal processing method
US7873424B1 (en) * 2006-04-13 2011-01-18 Honda Motor Co., Ltd. System and method for optimizing digital audio playback
US20110013790A1 (en) * 2006-10-16 2011-01-20 Johannes Hilpert Apparatus and Method for Multi-Channel Parameter Transformation
US20110178808A1 (en) * 2005-09-14 2011-07-21 Lg Electronics, Inc. Method and Apparatus for Decoding an Audio Signal
US20120063604A1 (en) * 2005-03-30 2012-03-15 Koninklijke Philips Electronics N.V. Scalable multi-channel audio coding
US8867753B2 (en) 2009-01-28 2014-10-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V.. Apparatus, method and computer program for upmixing a downmix audio signal
US9704493B2 (en) 2013-05-24 2017-07-11 Dolby International Ab Audio encoder and decoder
RU2810027C2 (ru) * 2013-05-24 2023-12-21 Долби Интернэшнл Аб Аудиокодер и аудиодекодер
US11887609B2 (en) 2016-01-22 2024-01-30 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for estimating an inter-channel time difference

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007027050A1 (en) 2005-08-30 2007-03-08 Lg Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
KR101218776B1 (ko) 2006-01-11 2013-01-18 삼성전자주식회사 다운믹스된 신호로부터 멀티채널 신호 생성방법 및 그 기록매체
BRPI0715559B1 (pt) * 2006-10-16 2021-12-07 Dolby International Ab Codificação aprimorada e representação de parâmetros de codificação de objeto de downmix multicanal
JP5355387B2 (ja) * 2007-03-30 2013-11-27 パナソニック株式会社 符号化装置および符号化方法
KR101464977B1 (ko) * 2007-10-01 2014-11-25 삼성전자주식회사 메모리 관리 방법, 및 멀티 채널 데이터의 복호화 방법 및장치
JP4992979B2 (ja) 2007-11-06 2012-08-08 富士通株式会社 多地点間音声通話装置
WO2009068085A1 (en) * 2007-11-27 2009-06-04 Nokia Corporation An encoder
KR20100095586A (ko) * 2008-01-01 2010-08-31 엘지전자 주식회사 신호 처리 방법 및 장치
JP5202090B2 (ja) * 2008-05-07 2013-06-05 アルパイン株式会社 サラウンド生成装置
KR101414412B1 (ko) * 2008-05-09 2014-07-01 노키아 코포레이션 오디오 신호의 인코딩 장치, 오디오 신호의 디코딩 장치, 오디오 신호의 인코딩 방법, 스케일러블 인코딩 오디오 신호의 디코딩 방법, 인코더, 디코더, 전자기기 및 컴퓨터 판독가능한 기록 매체
RU2515704C2 (ru) * 2008-07-11 2014-05-20 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Аудиокодер и аудиодекодер для кодирования и декодирования отсчетов аудиосигнала
EP2175670A1 (de) * 2008-10-07 2010-04-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Binaurale Aufbereitung eines Mehrkanal-Audiosignals
CN102257562B (zh) 2008-12-19 2013-09-11 杜比国际公司 用空间线索参数对多通道音频信号应用混响的方法和装置
CA2746524C (en) * 2009-04-08 2015-03-03 Matthias Neusinger Apparatus, method and computer program for upmixing a downmix audio signal using a phase value smoothing
TWI444989B (zh) * 2010-01-22 2014-07-11 Dolby Lab Licensing Corp 針對改良多通道上混使用多通道解相關之技術
JP5604933B2 (ja) * 2010-03-30 2014-10-15 富士通株式会社 ダウンミクス装置およびダウンミクス方法
JP5533502B2 (ja) * 2010-09-28 2014-06-25 富士通株式会社 オーディオ符号化装置、オーディオ符号化方法及びオーディオ符号化用コンピュータプログラム
CN102802112B (zh) * 2011-05-24 2014-08-13 鸿富锦精密工业(深圳)有限公司 具有音频文件格式转换功能的电子装置
US9183842B2 (en) * 2011-11-08 2015-11-10 Vixs Systems Inc. Transcoder with dynamic audio channel changing
US9479886B2 (en) 2012-07-20 2016-10-25 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec
US9761229B2 (en) 2012-07-20 2017-09-12 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for audio object clustering
EP2922053B1 (de) * 2012-11-15 2019-08-28 NTT Docomo, Inc. Audiocodierungsvorrichtung, audiocodierungsverfahren, audiocodierungsprogramm, audiodecodierungsvorrichtung, audiodecodierungsverfahren und audiodecodierungsprogramm
WO2014108738A1 (en) 2013-01-08 2014-07-17 Nokia Corporation Audio signal multi-channel parameter encoder
ES2924427T3 (es) * 2013-01-29 2022-10-06 Fraunhofer Ges Forschung Decodificador para generar una señal de audio mejorada en frecuencia, procedimiento de decodificación, codificador para generar una señal codificada y procedimiento de codificación que utiliza información lateral de selección compacta
US9715880B2 (en) * 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
WO2014171791A1 (ko) 2013-04-19 2014-10-23 한국전자통신연구원 다채널 오디오 신호 처리 장치 및 방법
EP3005351A4 (de) * 2013-05-28 2017-02-01 Nokia Technologies OY Audiosignalcodierer
EP2830053A1 (de) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Mehrkanaliger Audiodecodierer, mehrkanaliger Audiocodierer, Verfahren und Computerprogramm mit restsignalbasierter Anpassung einer Beteiligung eines dekorrelierten Signals
US9319819B2 (en) * 2013-07-25 2016-04-19 Etri Binaural rendering method and apparatus for decoding multi channel audio
JP6392353B2 (ja) 2013-09-12 2018-09-19 ドルビー・インターナショナル・アーベー マルチチャネル・オーディオ・コンテンツの符号化
KR101841380B1 (ko) 2014-01-13 2018-03-22 노키아 테크놀로지스 오와이 다중-채널 오디오 신호 분류기
WO2015173422A1 (de) * 2014-05-15 2015-11-19 Stormingswiss Sàrl Verfahren und vorrichtung zur residualfreien erzeugung eines upmix aus einem downmix
KR102144332B1 (ko) * 2014-07-01 2020-08-13 한국전자통신연구원 다채널 오디오 신호 처리 방법 및 장치
DE102016214923B4 (de) 2016-08-11 2023-08-17 Continental Reifen Deutschland Gmbh Schwefelvernetzbare Kautschukmischung und deren Verwendung
US11363377B2 (en) * 2017-10-16 2022-06-14 Sony Europe B.V. Audio processing
KR20200116968A (ko) * 2018-02-01 2020-10-13 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 하이브리드 인코더/디코더 공간 분석을 사용한 오디오 장면 인코더, 오디오 장면 디코더 및 관련 방법들
EP3984028B1 (de) 2019-06-14 2024-04-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Parameterkodierung und -dekodierung

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4799260A (en) * 1985-03-07 1989-01-17 Dolby Laboratories Licensing Corporation Variable matrix decoder
KR960012475B1 (ko) * 1994-01-18 1996-09-20 대우전자 주식회사 디지탈 오디오 부호화장치의 채널별 비트 할당 장치
US5912976A (en) 1996-11-07 1999-06-15 Srs Labs, Inc. Multi-channel audio enhancement system for use in recording and playback and methods for providing same
DE19900961A1 (de) 1999-01-13 2000-07-20 Thomson Brandt Gmbh Verfahren und Vorrichtung zur Wiedergabe von Mehrkanaltonsignalen
TW510143B (en) * 1999-12-03 2002-11-11 Dolby Lab Licensing Corp Method for deriving at least three audio signals from two input audio signals
JP2001339311A (ja) * 2000-05-26 2001-12-07 Yamaha Corp オーディオ信号圧縮回路および伸長回路
US7280664B2 (en) * 2000-08-31 2007-10-09 Dolby Laboratories Licensing Corporation Method for apparatus for audio matrix decoding
JP2002311994A (ja) * 2001-04-18 2002-10-25 Matsushita Electric Ind Co Ltd ステレオオーディオ信号符号化方法及び装置
US7006636B2 (en) 2002-05-24 2006-02-28 Agere Systems Inc. Coherence-based audio coding and synthesis
US7644003B2 (en) 2001-05-04 2010-01-05 Agere Systems Inc. Cue-based audio coding/decoding
US7292901B2 (en) 2002-06-24 2007-11-06 Agere Systems Inc. Hybrid multi-channel/cue coding/decoding of audio signals
SE0202159D0 (sv) * 2001-07-10 2002-07-09 Coding Technologies Sweden Ab Efficientand scalable parametric stereo coding for low bitrate applications
ATE332003T1 (de) 2002-04-22 2006-07-15 Koninkl Philips Electronics Nv Parametrische beschreibung von mehrkanal-audio
ATE377339T1 (de) 2002-07-12 2007-11-15 Koninkl Philips Electronics Nv Audio-kodierung
AU2003281128A1 (en) 2002-07-16 2004-02-02 Koninklijke Philips Electronics N.V. Audio coding
JP2004252068A (ja) * 2003-02-19 2004-09-09 Matsushita Electric Ind Co Ltd デジタルオーディオ信号の符号化装置及び方法
WO2004086817A2 (en) 2003-03-24 2004-10-07 Koninklijke Philips Electronics N.V. Coding of main and side signal representing a multichannel signal
US7318035B2 (en) * 2003-05-08 2008-01-08 Dolby Laboratories Licensing Corporation Audio coding systems and methods using spectral component coupling and spectral component regeneration
US7394903B2 (en) 2004-01-20 2008-07-01 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
EP1749296B1 (de) * 2004-05-28 2010-07-14 Nokia Corporation Mehrkanalige audio-erweiterung
US7751572B2 (en) * 2005-04-15 2010-07-06 Dolby International Ab Adaptive residual audio coding
JP5006315B2 (ja) * 2005-06-30 2012-08-22 エルジー エレクトロニクス インコーポレイティド オーディオ信号のエンコーディング及びデコーディング方法及び装置

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080126104A1 (en) * 2004-08-25 2008-05-29 Dolby Laboratories Licensing Corporation Multichannel Decorrelation In Spatial Audio Coding
US8015018B2 (en) * 2004-08-25 2011-09-06 Dolby Laboratories Licensing Corporation Multichannel decorrelation in spatial audio coding
US20120063604A1 (en) * 2005-03-30 2012-03-15 Koninklijke Philips Electronics N.V. Scalable multi-channel audio coding
US8352280B2 (en) * 2005-03-30 2013-01-08 Francois Philippus Myburg Scalable multi-channel audio coding
US20080201153A1 (en) * 2005-07-19 2008-08-21 Koninklijke Philips Electronics, N.V. Generation of Multi-Channel Audio Signals
US8160888B2 (en) * 2005-07-19 2012-04-17 Koninklijke Philips Electronics N.V Generation of multi-channel audio signals
US20110178808A1 (en) * 2005-09-14 2011-07-21 Lg Electronics, Inc. Method and Apparatus for Decoding an Audio Signal
US9747905B2 (en) 2005-09-14 2017-08-29 Lg Electronics Inc. Method and apparatus for decoding an audio signal
US20110196687A1 (en) * 2005-09-14 2011-08-11 Lg Electronics, Inc. Method and Apparatus for Decoding an Audio Signal
US20110182431A1 (en) * 2005-09-14 2011-07-28 Lg Electronics, Inc. Method and Apparatus for Decoding an Audio Signal
US20090240503A1 (en) * 2005-10-07 2009-09-24 Shuji Miyasaka Acoustic signal processing apparatus and acoustic signal processing method
US8073703B2 (en) * 2005-10-07 2011-12-06 Panasonic Corporation Acoustic signal processing apparatus and acoustic signal processing method
US8620011B2 (en) * 2006-03-06 2013-12-31 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
US20070223749A1 (en) * 2006-03-06 2007-09-27 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
US20090164227A1 (en) * 2006-03-30 2009-06-25 Lg Electronics Inc. Apparatus for Processing Media Signal and Method Thereof
US8626515B2 (en) * 2006-03-30 2014-01-07 Lg Electronics Inc. Apparatus for processing media signal and method thereof
US7873424B1 (en) * 2006-04-13 2011-01-18 Honda Motor Co., Ltd. System and method for optimizing digital audio playback
US20110013790A1 (en) * 2006-10-16 2011-01-20 Johannes Hilpert Apparatus and Method for Multi-Channel Parameter Transformation
US8687829B2 (en) * 2006-10-16 2014-04-01 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for multi-channel parameter transformation
US8311227B2 (en) 2006-12-07 2012-11-13 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US20080199026A1 (en) * 2006-12-07 2008-08-21 Lg Electronics, Inc. Method and an Apparatus for Decoding an Audio Signal
US8428267B2 (en) 2006-12-07 2013-04-23 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8488797B2 (en) 2006-12-07 2013-07-16 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US20080205671A1 (en) * 2006-12-07 2008-08-28 Lg Electronics, Inc. Method and an Apparatus for Decoding an Audio Signal
US20080205670A1 (en) * 2006-12-07 2008-08-28 Lg Electronics, Inc. Method and an Apparatus for Decoding an Audio Signal
US20080205657A1 (en) * 2006-12-07 2008-08-28 Lg Electronics, Inc. Method and an Apparatus for Decoding an Audio Signal
US8340325B2 (en) 2006-12-07 2012-12-25 Lg Electronics Inc. Method and an apparatus for decoding an audio signal
US8867753B2 (en) 2009-01-28 2014-10-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V.. Apparatus, method and computer program for upmixing a downmix audio signal
US9704493B2 (en) 2013-05-24 2017-07-11 Dolby International Ab Audio encoder and decoder
US9940939B2 (en) 2013-05-24 2018-04-10 Dolby International Ab Audio encoder and decoder
US10418038B2 (en) 2013-05-24 2019-09-17 Dolby International Ab Audio encoder and decoder
US10714104B2 (en) 2013-05-24 2020-07-14 Dolby International Ab Audio encoder and decoder
US11024320B2 (en) 2013-05-24 2021-06-01 Dolby International Ab Audio encoder and decoder
US11594233B2 (en) 2013-05-24 2023-02-28 Dolby International Ab Audio encoder and decoder
RU2810027C2 (ru) * 2013-05-24 2023-12-21 Долби Интернэшнл Аб Аудиокодер и аудиодекодер
US11887609B2 (en) 2016-01-22 2024-01-30 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for estimating an inter-channel time difference

Also Published As

Publication number Publication date
EP1908056A1 (de) 2008-04-09
EP4307124A2 (de) 2024-01-17
PT3404656T (pt) 2023-10-09
RU2382418C2 (ru) 2010-02-20
EP4307125A2 (de) 2024-01-17
NO20080850L (no) 2008-04-17
TW200723712A (en) 2007-06-16
PT1908056T (pt) 2018-11-07
CA2614384A1 (en) 2007-01-25
WO2007009548A1 (en) 2007-01-25
TWI339028B (en) 2011-03-11
CN101223578A (zh) 2008-07-16
NO342863B1 (no) 2018-08-20
EP4307126A2 (de) 2024-01-17
AU2006272127A1 (en) 2007-01-25
RU2008106225A (ru) 2009-08-27
EP3404656A1 (de) 2018-11-21
MY149198A (en) 2013-07-31
EP4235440A2 (de) 2023-08-30
CA2614384C (en) 2012-07-24
KR100946688B1 (ko) 2010-03-12
BRPI0616019A2 (pt) 2011-06-07
ES2690278T3 (es) 2018-11-20
FI3404656T3 (fi) 2023-09-25
EP4307126A3 (de) 2024-03-27
AU2006272127B2 (en) 2010-02-04
PL1908056T3 (pl) 2019-01-31
IL188425A0 (en) 2008-11-03
US20070019813A1 (en) 2007-01-25
JP2009501948A (ja) 2009-01-22
EP4307124A3 (de) 2024-03-27
MX2008000828A (es) 2008-03-19
CN101223578B (zh) 2011-12-14
EP4235440A3 (de) 2023-10-25
EP1908056B1 (de) 2018-08-01
EP3404656B1 (de) 2023-06-28
HUE064455T2 (hu) 2024-03-28
ES2952871T3 (es) 2023-11-06
JP5265358B2 (ja) 2013-08-14
US8180061B2 (en) 2012-05-15
PL3404656T3 (pl) 2024-06-17
BRPI0616019B1 (pt) 2019-11-19
DK3404656T3 (da) 2023-09-25
EP4307125A3 (de) 2024-03-27
KR20080032146A (ko) 2008-04-14

Similar Documents

Publication Publication Date Title
US8180061B2 (en) Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
US8654985B2 (en) Stereo compatible multi-channel audio coding
JP4601669B2 (ja) マルチチャネル信号またはパラメータデータセットを生成する装置および方法
RU2382419C2 (ru) Многоканальный кодер
US9966080B2 (en) Audio object encoding and decoding
RU2576476C2 (ru) Декодер аудиосигнала, кодер аудиосигнала, способ формирования представления сигнала повышающего микширования, способ формирования представления сигнала понижающего микширования, компьютерная программа и бистрим, использующий значение общего параметра межобъектной корреляции
JP5930441B2 (ja) マルチチャネルオーディオ信号の適応ダウン及びアップミキシングを実行するための方法及び装置
CN107077861B (zh) 音频编码器和解码器
JP6248186B2 (ja) オーディオ・エンコードおよびデコード方法、対応するコンピュータ可読媒体ならびに対応するオーディオ・エンコーダおよびデコーダ
CN113614827B (zh) 用于预测性译码中的低成本错误恢复的方法和设备
WO2024052499A1 (en) Decoder and decoding method for discontinuous transmission of parametrically coded independent streams with metadata
WO2024051954A1 (en) Encoder and encoding method for discontinuous transmission of parametrically coded independent streams with metadata
CN113614827A (zh) 用于预测性译码中的低成本错误恢复的方法和设备

Legal Events

Date Code Title Description
AS Assignment

Owner name: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:HILPERT, JOHANNES;FALLER, CHRISTOF;LINZMEIER, KARSTEN;AND OTHERS;REEL/FRAME:018621/0470;SIGNING DATES FROM 20060326 TO 20060327

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION