WO2006111294A1 - Energy dependent quantization for efficient coding of spatial audio parameters - Google Patents

Energy dependent quantization for efficient coding of spatial audio parameters Download PDF

Info

Publication number
WO2006111294A1
WO2006111294A1 PCT/EP2006/003284 EP2006003284W WO2006111294A1 WO 2006111294 A1 WO2006111294 A1 WO 2006111294A1 EP 2006003284 W EP2006003284 W EP 2006003284W WO 2006111294 A1 WO2006111294 A1 WO 2006111294A1
Authority
WO
WIPO (PCT)
Prior art keywords
parameter
channel
channels
pair
measure
Prior art date
Application number
PCT/EP2006/003284
Other languages
French (fr)
Inventor
Jonas Roeden
Jonas Engdegard
Heiko Purnhagen
Jeroen Breebaart
Erik Schuijers
Steven Van De Par
Johannes Hilpert
Juergen Herre
Original Assignee
Coding Technologies Ab
Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V.
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Coding Technologies Ab, Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V., Koninklijke Philips Electronics N.V. filed Critical Coding Technologies Ab
Priority to JP2007537308A priority Critical patent/JP4521032B2/en
Priority to PL06724214T priority patent/PL1754222T3/en
Priority to EP06724214A priority patent/EP1754222B1/en
Priority to DE602006000239T priority patent/DE602006000239T2/en
Priority to CN2006800005085A priority patent/CN1993733B/en
Priority to BRPI0605857-4A priority patent/BRPI0605857A/en
Priority to TW095113078A priority patent/TWI327306B/en
Priority to MYPI20061770A priority patent/MY141427A/en
Priority to US11/406,631 priority patent/US8054981B2/en
Publication of WO2006111294A1 publication Critical patent/WO2006111294A1/en
Priority to HK07103451A priority patent/HK1095993A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/03Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction

Definitions

  • the present invention relates to quantization of spatial audio parameters and in particular to a concept to allow for a more efficient compression without significantly reducing the perceptual quality of an audio signal reconstructed using the quantized spatial audio parameters .
  • a multi-channel encoding device generally receives - as input - at least two channels, and outputs one or more carrier channels and parametric data.
  • the parametric data is derived such that, in a decoder, an approximation of the original multi-channel signal can be calculated.
  • the carrier channel (channels) will include subband samples, spectral coefficients, time domain samples, etc., which provide a com- paratively fine representation of the underlying signal, while the parametric data do not include such samples of spectral coefficients but include control parameters for con- trolling a certain reconstruction algorithm instead.
  • Such a reconstruction could comprise weighting by multiplication, time shifting, frequency shifting, phase shifting, etc.
  • the parametric data includes only a comparatively coarse rep- resentation of the signal or the associated channel.
  • BCC binaural cue coding
  • ICLD Inter-Channel Level Difference
  • ICTD Inter-Channel Time Difference
  • ICLD and ICTD parameters represent the most impor- tant sound source localization parameters
  • a spatial representation using these parameters can be enhanced by introducing additional parameters.
  • a related technique, called “parametric stereo” describes the parametric coding of a two-channel stereo signal based on a transmitted mono signal plus parameter side information.
  • 3 types of spatial parameters referred to as inter- channel intensity difference (IIDs), inter-channel phase differences (IPDs), and inter-channel coherence (IC) are introduced.
  • IIDs inter- channel intensity difference
  • IPDs inter-channel phase differences
  • IC inter-channel coherence
  • the extension of the spatial parameter set with a coherence parameter (correlation parameter) enables a pa- rametrization of the perceived spatial "diffuseness" or spatial "compactness” of the sound stage.
  • Parametric stereo is described in more detail in: "Parametric Coding of stereo audio", J. Breebaart, S. van de Par, A. Kohlrausch, E. Schuijers (2005) Eurasip, J. Applied Signal Proc. 9, pages 1305-1322)", in "High-Quality Parametric Spatial Audio Coding at Low Bitrates", J. Breebaart, S. van de Par, A. Kohlrausch, E. Schuijers, AES 116 th Convention,. Preprint 6072, Berlin, May 2004, and in “Low Complexity Parametric Stereo Coding", E. Schuijers, J. Breebaart, H. Purnhagen, J. Engdegard, AES 116 th Convention, Preprint 6073, Berlin, May 2004.
  • a repre- sentation of the level differences (also called intensity differences ICLD or energy differences IID) between audio channels is a vital part of a parametric representation of a stereophonic/multi-channel audio signal.
  • level differences also called intensity differences ICLD or energy differences IID
  • Such information and other spatial parameters are transmitted from the encoder to the decoder for each time/frequency slot. In the view of coding efficiency, it is therefore of high interest to represent these parameters as compactly as possible while preserving audio quality.
  • the level differences are represented relative to a so-called "reference channel” and are quantized on a uniform scale in units of dB relative to a reference channel. This does not optimally exploit the fact that channels with low level with respect to the reference channel are subject to a significant masking effect when listened to by human listeners. In the extreme case of a channel having no signal at all f the bandwidth used by parameters describing this particular channel is completely wasted. In the more common case, where one channel is much fainter than another channel, that is a listener can hardly hear the faint channel during the playback, a less precise reproduction of the faint channel would also lead to the same perceptual quality of the listener, as the faint signal is mainly masked by the stronger signal.
  • the 5- channel configuration is having a left rear channel 101 (A, having a signal a(t)), a left front channel 102 (B, having a signal b(t)), a center channel 103 (C, having a signal c(t)), a right front channel 104 (D, having a signal d(t)) and a right back channel 105 (E, having a signal e(t)).
  • Intensity relations between single channels or channel pairs are marked with arrows.
  • the intensity distribution between the front left channel 102 and the front right channel 104 is marked ri (110)
  • the intensity distribution between the left back channel and the right back channel is marked TA (112)
  • the intensity distribution between the combination of the left front channel 102 and the right front channel 104 and the center channel 103 is marked r2 (114) and the intensity distribution between the combination of the back channels and the combination of the front channels is marked r 3 (116) .
  • Fig. 10a illustrates a multi channel parameterization for a five channel speaker set-up where the different audio chan- nels are indicated by 101 to 105; a(t) 101 represents signal of the left surround channel, b(t) 102 represents the . signal of the left front channel, c(t) 103 represents the signal of the center channel, d(t) 104 represents the signal of the right front channel, e(t) 105 represents the signal of the right surround channel.
  • the speaker set-up is divided into a front part and a back part.
  • the energy distribution between the entire front channel set-up (102, 103 and 104) and the back channels (101 and 105) are illustrated by the arrow in Fig. 10a and indicated by the r 3 parameter.
  • the energy dis- tribution between the center channel 103 and the left front 102 and right front 103 channels are indicated by r 2 .
  • the energy distribution between the left surround channel 101 and the right surround channel 105 is illustrated by r ⁇ .
  • the energy distribution between the left front channel 102 and the right front channel 104 is given by rj. Since ri to r 4 are parameterizations of different regions it is also clear that beside energy distribution also other essential region properties can be parameterized, as for example the correlation between the regions. Additionally for each parameter ri to r 4 a local energy can be calculated. For example the local energy of r 4 is the summed energy of channel A 101 and E 105.
  • E[.] is the expected value as defined by
  • Fig. 10b shows a multi-channel audio decoder built by hierarchically ordering parametric stereo modules, as for example described in WO 2004/008805 Al.
  • the audio channels 101 to 105 are reproduced step by step from a single monophonic down-mix signal 120 (M) and corresponding side information by a first two-channel decoder 122, a second two-channel decoder 124, a third two- channel decoder 126, and a fourth two-channel decoder 128.
  • M monophonic down-mix signal
  • the first two-channel decoder decomposes the monophonic down-mix sig- nal 120 into two signals fed into the second and the third two-channel decoders 124 and 126.
  • the channel fed into the third two-channel decoder 126 is a combined channel, being combined from the left back channel 101 and the right back channel 105.
  • the channel fed into the second two-channel decoder 124 is a combination of the center channel 103 and a combined channel which is again being a combination of the front left channel 102 and of the front right channel 104.
  • the left back channel 101, the right back channel 105, the center channel 103, and a combined channel, being a combination of the front left channel 102 and the front right channel 104 are reconstructed, using the transmitted spatial parameters, that are comprising a level parameter for use by each of the two-channel decoders 122, 124, and 126.
  • the fourth two-channel decoder 128 derives the front left channel 102 and the front right channel 104, using a level information transmitted as side information for the fourth two-channel decoder 128.
  • the desired energy for each single output channel follows from various different parametric stereo modules between the input signal and each output signal.
  • the energy of a specific output channel can depend on the IID/ICLD parameters of multiple parametric stereo modules.
  • a non-uniform quantization of HD parameters can be applied within each parametric stereo module to produce HD values, which are then used by a decoder as part of the side information.
  • each leaf has its own corresponding IID/ICLD parameter, which indicates the energy distribution from its input toward output channels.
  • the IID/ICLD parameter of leaf "r 3 " may indicate that 90 % of the incoming energy should be sent to leaf r 2 , while the remaining energy (10 %) should be sent to leaf r 4 . This process is repeated for each leaf in the tree.
  • each energy distribution parameter is represented with limited accuracy, the deviation between the desired and the actual energy of each output channel A to E depends on the quantization errors in the IID/ICLD parameters, as well as on the energy distribution (and hence propagation of quantization errors) .
  • the same quantization table is used for a certain parameter type, e.g. ICC or HD, within all parameterization stages n to r 4 , the IID/ICLD quantization is performed optimal only locally. This means that for each parameterization stage r 1 to r 4 , the error in output energy of the (local) output channels is maxi- mum for the weakest output channel in prior art implementations.
  • the quantization of level parameters IID or ICLD
  • other parameters such as ICC, phase differences or time differences describing the spatial perception of a multi-channel audio signal
  • bandwidth may be wasted for spatial parameters describing channels that are mainly masked due to low energy within the channel.
  • this ob- ject is achieved by a parameter quantizer for quantizing an input parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising: a quantization rule generator for generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and a value quantizer for deriving a quantized parameter from the input parameter, using the generated quantization rule.
  • a parameter dequantizer for dequantiz- ing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single chan- nel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising: a dequantization rule generator for generating a de- quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and a value dequantizer for deriving the parameter from the quantized parameter, using the generated dequantization rule.
  • this object is achieved by a method of quantizing an input parameter, wherein the input parameter is a measure for a charac- teristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising: generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and deriving a quantized parameter from the input parameter using the generated quantization rule.
  • this object is achieved by a method of dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising: generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and deriving the parameter from the quantized parameter using the generated dequantization rule.
  • this object is achieved by a representation of a multi-channel signal having a quantized parameter being a quantized representation of a parameter being a measure for a characteristic of a single channel or a pair of channels, wherein the parameter is a measure for a characteristic of the single channel or the pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, wherein the quantized parameter is derived using a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi- channel signal.
  • this object is achieved by a machine-readable storage medium having stored thereon a representation of a multi-channel signal as described above.
  • a transmitter or audio recorder having a parameter quantizer for quantizing an input parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising: a quantization rule generator for generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and a value quantizer for deriving a quantized parameter from the input parameter, using the generated quantization rule.
  • this object is achieved by a receiver or audio player having a parameter dequantizer for dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising: a dequantization rule generator for generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and a value dequantizer for deriving the parameter from the quantized parameter, using the generated dequantization rule.
  • this object is achieved by a method of transmitting or audio recording, the method comprising a method of quantizing an in- put parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising: generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and deriving a quantized parameter from the input parameter using the generated quantization rule.
  • this object is achieved by a method of receiving or audio playing, the method having a method of dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising: generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and deriving the parameter from the quantized parameter using the generated dequantization rule.
  • this object is achieved by a transmission system having a trans- mitter and a receiver, the transmitter having a parameter quantizer for quantizing an input parameter; and the receiver having a parameter dequantizer for dequantizing a quantized parameter.
  • this object is achieved by a method of transmitting and receiving, the method including a transmitting method having a method of quantizing an input parameter; and the method including a method of receiving including a method of dequantizing a quantized.
  • this object is achieved by a computer program for performing, when running on a computer, one of the above methods.
  • the present invention is based on the finding that parameters being a measure for a characteristic of a single channel or of a pair of channels with respect to another single channel or of a pair of channels of a multi-channel signal can be quantized more efficiently using a quantization rule that is generated based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal.
  • the inventive concept has the major advantage that a quantization rule is either generated or an appropriate quantiza- tion rule is selected from a group of available quantization rules, depending on the energy of the signal to be described. Therefore, a psycho-acoustic model can be applied to a quantizer during encoding or a dequantizer during decoding, to use a quantization rule adapted to the needs of the actual signal. Especially, when a channel contains very little energy compared to other channels within the multi-channel signal, the quantization can be much more coarse than for signals having high energies. This is due to the fact that the high energy signals mask the low energy signals during play- back, i.e. a listener will hardly recognize any details of the low energy signal and thus the low energy signal can be deteriorated more through coarse quantization without the listener being able to recognize the falsification because of the high masking of the low energy signal.
  • a parameter quantizer for quantizing parameters is having a quantization rule generator for generating a quantization rule and a value quantizer for deriving quantized parameters from input parameters using the generated quantization rule.
  • the quantizer selector re- ceives as an input the total energy of the multi-channel audio signal to be coded and the local energy of the channel or the pair of channels whose spatial parameters are to be quantized. Knowing the total energy and the local energy, the quantizer selector can decide, which quantization rule to use, i.e. select coarser quantization rules for channels or channel pairs having comparatively low local energy.
  • the quantizer selector could also derive an algorithmic rule to modify an existing quantization rule or to calculate a completely new quantization rule depending on the local and the total energy.
  • One possibility would for example be to calculate a general scale factor to be applied to a signal before a linear quantizer or a. non-linear quantizer to achieve the goal of reducing the size of the side information to be transmitted.
  • a multi channel signal is encoded in a pairwise manner, i.e. by using a hierarchical structure that is having several 2-to-l down- mixers ordered in a tree-like structure, each downmixer gen- erating a mono channel out of two channels input into the downmixer.
  • energy dependent quantization can now be implemented not only locally, i.e. at each 2-to-l downmixer having the information available at the input of the 2-to-l downmixer only, but based on the global knowledge on the sum of the signal energies. This enhances the perceptual quality of a perceptual signal significantly.
  • an inventive parameter quantizer is incorporated in a parameter encoder before a differential encoder and a Huffman encoder, both of which are used for further encoding the quantized pa- rameters to derive a parameter bit stream.
  • Such an inventive encoder has the great advantage that in addition to decreasing the size of code words needed to describe the quantized parameters, a coarser quantization will automatically increase the abundance of identical code words fed into the differential encoder and the Huffman encoder, which allows for a better compression of the quantized parameters, further reducing the size of the side information.
  • an inven- tive parameter quantizer is having a quantizer factor function generator and a parameter multiplier.
  • the quantizer factor function generator receives the total and the local energy as input and derives a single sealer value from the input quantities.
  • the parameter multiplier receives the parame- ters and the derived quantizer factor f to divide the parameters by the quantizer factor prior to transferring the modified parameters to the quantizer that applies a fixed quantization rule to the modified parameters .
  • a variation of this embodiment is to have a parameter multiplier after the quantizer and hence use the derived quantizer factor f to divide the resulting index out of the quantizer. The result of this then needs to be rounded into an integer index again.
  • Fig. 1 shows a block diagram of an inventive parameter quantizer
  • Figs. 2a to c show several possible quantization rules to be applied
  • Fig. 3 shows a parameter encoder having an inventive parameter quantizer
  • Figs. 4a, 4b show an alternative embodiment of a parameter encoder having an inventive parameter quantizer
  • Fig. 5 shows examples of scale factor functions
  • Fig. 6 shows a non-linear quantization rule
  • Fig. 7 shows an inventive parameter dequantizer
  • Fig. 8 shows a parameter decompressor having an inventive parameter dequantizer
  • Fig. 9a shows an embodiment of an inventive parameter dequantizer
  • Fig. 9b shows a further embodiment of an inventive parameter dequantizer
  • Fig. 9c shows an example for implementing energy dependent dequantization
  • Fig. 9d shows a further example for implementing energy dependent dequantization.
  • Fig. 9e shows examples of quantization and dequantization of parameters
  • Fig. 10a shows a representation of a 5-channel multi- channel audio signal
  • Fig. 10b shows a hierarchical parametric multi-channel decoder according to prior art.
  • Fig. 1 shows an inventive parameter quantizer 199 having a quantizer 200 and a quantizer selector 202.
  • the quantizer selector 202 receives the local energy of the channel or the pair of channels underlying the parameters to be encoded and the total energy of the multi-channel audio signal. Based on both energy informations, the quantizer selector 202 gener- ates a quantization rule that is used by the quantizer 200 to derive a quantized parameter 204 from a parameter 206 input into the quantizer 200.
  • the quantizer selector 202 serves as a quantization rule generator.
  • the input parameters to the quantizer selector 202 are the total energy of the original multi-channel signal and the local energy for the channel described by the parameter to be . quantized.
  • the ratio between the local energy and the total energy gives a measure that can be used to decide which quantizer to use.
  • this ratio q Relative Local energy
  • the selected quantizer is then used to quantize the parameter 206 with the quantizer .
  • the present invention teaches that a coarser quantization of IID/ICLD parameters (and the like) can be used if a pa- rametrization stage is lower in energy compared to the total energy, i.e. when the relative Local energy q is small.
  • the present invention utilizes the psycho-acoustic relation that it is more important to parameterize the dominant/high energy signals with high accuracy than the audio signal with less significance/low energy. To make this even clearer; reference is again made to Fig. 10a.
  • the surround channels can be quantized with less accuracy since the surround channels have much less energy.
  • the additional quantization error introduced from the coarser quantization cannot be perceived since the front channels have much higher energy and hence the quantization error of r 4 (and the resulting energy errors for surround channels A and E) is masked by channels B, D, and/or C.
  • the surround channels A and E only have some faint noise and the front channels B, C, and D have full amplitude signals.
  • a 16 bit PCM original signal would indicate an energy difference of more than 80 dB. Therefore, parameter r 4 could be quantized arbitrarily coarse without introducing any audible differences due to (coarse) quantization.
  • Figs. 2a to 2c show three possible quantization rules introducing different levels of quantization errors. All figures show the original parameter on their x-axis and the integer values assigned to the parameters on their y-axis. Furthermore the Figs. 2a to 2c show dashed lines which correspond to indices for each quantization step and hence can be used for transmission or storage. The transmitted indices can then be used on the decoder side, for example in combination with a lookup-table, for de-quantization.
  • the finest quantization is indicated in Fig. 2a by the quantization curve 230 that maps discrete parameter intervals of the x-axis to 13 integer values. Intermediate quantization is achieved by the quantization curve 232 in Fig. 2b, whereas the coarsest quantization is achieved by the quantization curve 234 of Fig. 2c. It is obvious that the quantization error introduced is biggest in the example shown in Fig. 2c and smallest in the example shown in Fig. 2a.
  • Figs. 2a to c illustrate three different linear quantization rules, where the x-axis describes the input value and the y-axis gives the corresponding quantized value.
  • Figs. 2a to 2c all have the same scale on the x-axis and y- axis and hence, Fig. 2a has the finest quantization of the three and thus the smallest quantization, error. Fig. 2c has the coarsest quantization and thus the largest quantization error. It would also yield the lowest bit rate after differential coding and Huffman coding since it has the smallest amount of quantization steps.
  • a possible quantization rule generation could be based on the relative Local energy q between the local energy and the total energy, as introduced above.
  • a possible range of q-values with corresponding selections of quantization rules is summarized, as an example, within the following table:
  • Fig. 3 shows an inventive parameter compressor having an inventive parameter quantizer 199, a differential encoder 220, and a Huffman encoder 222.
  • the inventive parameter encoder of Fig. 3 extends the parameter quantizer of Fig. 1 by using the quantized parameters as input for the differential encoder 220 that differentially encodes the quantized parameters 204 to derive differentially encoded quantized parame- ters that are then input into the Huffman encoder 222 that applies a Huffman coding scheme to the differentially encoded quantized parameters deriving a parameter bitstream element 224 of a final parameter bit stream as output.
  • Fig. 4a shows a further embodiment of an inventive parameter encoder using an inventive parameter quantizer 250, a differential encoder 252, and a Huffman encoder 254.
  • the parameter quantizer 250 is having a quantizer factor generator 256, a parameter sealer 258, and a quantizer 260.
  • the quantizer factor generator 256 together with the parameter sealer 258 serve as a quantization rule genera- tor.
  • the quantizer function generator 256 receives as input the total energy of the multi-channel audio signal and the local energy of the channel or the channel pair for the parameter to be quantized.
  • the quantizer factor generator 256 generates a scale factor 262 (f) based on the local energy and the total energy. In a preferred embodiment this is done on a basis of a ratio between the local energy and the total energy resulting in a relative local energy q, as follows:
  • This ratio q can be used within the quantizer factor generator 256 to calculate the quantizer factor f (262) that is used as input for the parameter sealer 258 that additionally receives the parameter to be quantized.
  • the parameter sealer 258 applies a scaling to the input parameter that could for example be a division of the parameter by the quantizer factor 262.
  • the scaling of the parameter is equivalent to selecting different quantization rules.
  • the scaled parameter is then input into a quantizer 260 that ap- plies a fixed quantization rule within this embodiment of the present invention.
  • the further processing of the quantized parameter is equal to the processing of Fig. 3, the parameter is differentially encoded and afterwards Huffman encoded to finally yield a parameter bit stream element.
  • Fig. 4b shows a further embodiment of an inventive parameter encoder 270 which is similar to the inventive parameter encoder 250 shown in Fig. 4a. Therefore, only the differences to parameter encoder 250 shall be explained shortly within the following paragraph.
  • the inventive parameter encoder 270 is not having a parameter sealer (parameter sealer 258 of parameter encoder 250) .
  • the parameter quantizer 270 is having a compression device 272 instead. That means the quantizer factor generator 256 together with the compression device 258 serve as a quantization rule generator in this case.
  • the compression device 272 is connected to the quantizer 260 and to the quantizer factor generator 256.
  • the compression unit 272 receives as an input a quantized parameter that is quantized by the quantizer 260 according using a fixed quantization scheme.
  • the compression unit uses the quantized parameter as input and scales the quantized parameter using the scale factor 262. This saves bit rate by decreasing the possible number of quantized parameters to be transmitted to the delta coder 252. This compression can for example be achieved by a division of the quantized parameter index by the scaling factor 262.
  • Fig. 5 shows as an example four different possible functions 300, 302, 303, and 304 that can be .used to derive the scale factor f.
  • the first factor function 300 is a constant function and thus has no energy dependency.
  • the factor functions 302, and 304 show two possibilities to implement factor functions, wherein the factor function 302 is the less aggressive one and would therefore increase the introduced quantization error less than using factor function 304. On the other hand, factor function 302 would save less bit rate than factor function 304.
  • Factor function 303 shows a fourth possibility to derive the quantizer factor from the energy quota q, whereas the factor function 303 is step-like in form and therefore assigns intervals of the energy quota q to the same quantizer factor.
  • Fig. 6 exemplifies a non-uniform quantizer where the input on the x-axis in dB is quantized according to the function 310 to result in the output y in dB that is drawn on the y-axis.
  • a non-uniform quantizer function can be used to quantize spatial parameters as well. This is of special interest when the reference channel within a BCC-coding scheme is chosen to be the strongest channel within a multi-channel signal .
  • the non-uniform quantizer as shown in Fig. 6 exemplifies a quantizer function 310 that would suit the needs then., since the quantization steps increase as the energy level becomes smaller compared to the referenced channel. This is a particularly attractive property since the energy level quantizing errors can be larger for channels with less energy than for the strongest channels.
  • Fig. 7 shows an inventive parameter dequantizer 500 having a dequantizer 502 and a dequantizer selector 504.
  • the dequantizer selector 504 receives the total energy of the multichannel audio signal and the local energy of the channel or channel pairs together with a quantized parameter 505 that is to be dequantized. Based on the received energy information, the dequantizer selector 504 derives a dequantization rule that is used by the dequantizer 502 to dequantize the quantized parameter 505. Hence, in this case the dequantizer selector 504 serves as a dequantization rule generator.
  • the dequantizer selector 504 may operate in different ways.
  • a first possibility is that the dequantizer selector 504 derives the quantization rule directly and transfers the derived quantization rule to the dequantizer 502.
  • Another possibility is that the dequantizer selec- tor 504 meets a dequantization rule decision, which is transferred to the dequantizer 502 that can use the dequantization rule decision to select the appropriate dequantization rule from a number of quantization rules that are for example stored in the dequantizer 502.
  • Fig. 8 shows an inventive parameter decoder having a parameter dequantizer 500, a differential decoder 510, and a Huffman decoder . 512.
  • the Huffman decoder 512 receives a parameter bit stream element 513 and in association therewith, the dequantizer selector 504 receives the local energy of a channel or a pair of channels described by the parameter bit stream element 513 and the total energy of the multi-channel audio signal.
  • the parameter bit stream element 513 is produced by an inventive parameter encoder, as shown in Fig. 3. Therefore, the parameter bit stream element 513 is Huffman decoded by the Huffman decoder 512 and differentially decoded by a differential decoder 510 before being supplied to the dequantizer 502. After the decoding by the Huffman decoder 512 and the differential decoder 510, the dequantization is performed by the inventive parameter dequantizer 500, as already described in the description of the inventive parameter of Fig. 7.
  • Fig. 8 illustrates a decoder using an energy dependent dequantizer 500, the decoder corresponding to an inventive encoder.
  • the parameter bit stream element is Huff- man decoded and differentially decoded into indices.
  • the correct dequantizer is chosen in the dequantizer selector 504 using the same rule and function as was used in the encoder with the total energy and local energy as input.
  • the selected dequantizer is then used to dequantize (using the dequantizer 502) the indices into dequantized parameters.
  • Fig. 9a shows a further embodiment of an inventive parameter decoder, having an inventive energy dependent dequan- tizer 520, a Huffman decoder 512, and a differential decoder 510.
  • the parameter dequantizer 520 comprises a quantizer factor generator 522, a dequantizer 524, and a parameter sealer 526.
  • the dequantizer factor generator 522 together with the parameter sealer 526 serve as a de- quantization rule generator.
  • the quantized parameter is dequantized by the dequantizer 524, wherein the dequantizer 524 is using a dequantization rule matching a quantization rule used to generate the quantized parameter.
  • the quantizer factor generator 522 derives a scale factor 528 (f) from a ratio of the local energy and the total energy of the multi-channel audio signal.
  • the parameter sca- ler 526 then applies the scale factor 528 to the dequantized parameter by a multiplication of the scale factor with the dequantized parameter.
  • the decom- pressed dequantized parameters are available at an output of the inventive parameter decoder.
  • Fig. 9b shows a further embodiment of an inventive parameter decoder 530, similar to the inventive parameter decoder 520. Therefore, only the differences to the parameter decoder 520 shall be elaborated on in the following paragraph.
  • the inventive parameter decoder 530 is having a decompressor 532, the decompressor 532 achieving the same functional result as the parameter sealer 526 in the inventive parameter decoder 520.
  • the decompressor 532 receives as an input the quantized parameters and as further input the scale factor 528 from the factor generator 522. That means the factor generator 522 together with the decompressor 532 serve as a dequantization rule generator in this case.
  • the quantized pa- rameter is scaled by the decompressor 532 before the so derived scaled quantized parameter is input into the dequan- tizer 524.
  • the dequantizer 524 then dequantizes the scaled quantized parameter to derive the dequantized parameter using a fixed dequantization rule. This decompression can for exam- pie be achieved by a multiplication of the quantized parameter index by the scale factor 528.
  • the scaling by the parameter sealer 258 and the parameter sealer 526 during the encoding and decoding is de- scribed to be a division during the encoding and a multiplication during the decoding, any other type of scaling that has the same effect as using a different quantization rule can be applied to the parameters during the encoding or decoding.
  • a decoder may either decide autonomously which dequantization rule to use using the total energy and the local energy. Alternatively, it could be signalled by some additional side information to the decoder, which de- quantization rule is the appropriate one to dequantize the parameters.
  • Figs. 9c and 9d Two possible ways of imple- menting energy dependent dequantization for the reconstruction of a multi-channel signal from a transferred monophonic signal M using additionally transmitted spatial parameters (CLD, ICC) are shown in Figs. 9c and 9d.
  • CLD, ICC transmitted spatial parameters
  • Fig. 9c shows the situation where the parameters CLD are derived such that it is assumed that a parameter CLD 0 describes the energy distribution between channels that are combined using a number of channels of the original signal.
  • CLD 0 describes the energy relation between two channels, wherein a first channel is a combination 1002 of a front-left, a front-right, a center and a low-frequency-enhancement channel.
  • the second channel is a combination of a back-left and a back-right channel.
  • the parameter CLD 0 describes the energy distribution between all rear channels and all front channels . It is therefore evident when CLD 0 indicates that only little energies contained in the rear channels, the parameters describing the spatial properties between the back-left and the back-right channel may be quantized stronger, since the addi- tionally-introduced distortion by the coarse quantization is hardly audible when all channels are played back simultaneously.
  • An inventive parameter dequantizer as shown in Fig. 9b is, for example, calculating a scale factor 528 to implement the dequantization by multiplying a parameter to be dequantized with a parameter index before the actual dequantization is performed. Therefore, if a parameter CLD 0 is transmitted, one may, when using the decoder of Fig. 9b for example, calculate the finally-used CLD parameters of other hierarchical steps according to the following formula.
  • DEQ describes the application of a fixed dequantization table to a parameter given to the pro- cedure DEQ. That means, a transmitted parameter IDX CLD (0,L) can be dequantized directly, indicated by the following expression:
  • the relative local energy of the back channels is accordingly:
  • CLD 1 can now be computed, taking into account the overall energy contained in the combination signal 1002:
  • the term "facFunc” describes a function giving a real value independency of the relative local energy FC.
  • formula 4 describes that before dequantization, the transmitted parameter index IDX CLD (1,1,m) is multiplied with a scale factor (facFunc) to derive an intermediate quantized parameter. Since the intermediate quantized parameter is not necessarily integer-valued, the intermediate quantized parameter must be rounded to derive IdxCLDEdQ, which is then dequantized into the final parameter used by the following operation:
  • Dequantization is performed by a standard dequantization table, such as, for example, the following:
  • the derived parameter CLD 1 describes an energy relation between a channel being a combination of a front-left and a front-right channel and a channel being a combination of a center and a low-frequency-enhancement channel, as can be seen from the channel decomposition in the second hierarchical step 1004.
  • a relative local energy F describing an energy contained in the front channels, front-left and front- right, can be computed according to the following formula:
  • parameter CLD 3 describing an energy relation between the front-left and the front-right channel can now be derived in an energy-dependent way according to the following formulas :
  • parameter CAD 4 describing an energy relation between the center and the low-frequency- enhancement channel can now be derived using no factor function:
  • Fig. 9d shows another possibility of defining a hierarchic for the derivation of the spatial parameters.
  • Fig. 9e shows the manipulations during encoding and decoding, further pointing out the concept of the invention.
  • Table 9d shows the manipulation of the quantization index on the quantizer side in a left column 1100, and the reconstruction of the transmitted parameter on the quantizer side in a column 1102.
  • the transmitted parameter is given in column 1104.
  • Two examples for a combination of channels having relatively low energy are shown. This is indicated by the common scale factor 4.5, which is significantly bigger than 1 (see Fig. 4) .
  • the quantization index IDX is divided by the scale factor after the quantiza- tion at the quantizer size. Afterwards, the result has to be rounded to an integer value to be differentially and Huffman encoded (see Fig. 4a) . Therefore, both example indexes 10 and 9 result in a transmitted index IDXtransm of 2.
  • the dequantizer multiplies the transmitted index by the scale factor to derive a rekonstructed index IDXrek used for de- quantization.
  • IDXrek used for de- quantization.
  • an additional error of 1 arises due to the rounding of the divided index on the quantizer size.
  • the division of the scale factor at the quantizer side yields an integer valued index IDXtransm to be transmitted, no additional error is introduced.
  • the relation of the local and the total energy upon which the decision which de-/quantization rule to use is based is described to be a logarithmic measure within the previous para- graphs. This of course not the only possible measure that can be used to realize the inventive concept. Any other measure describing an energy difference between the local energy or the total energy, as for example the plain difference, can be used to make the decision.
  • Another important feature with the present invention is that in combination with a two channel decoder (PS) design that distributes the incoming energy into the two output channels typically controlled by e.g. CLD like parameter (meaning that the incoming energy equals the sum of the energies for the two output channels) , is that the difference in energy, Relative Local Energy between the total energy and the local energy for each two channel decoders (122, 124, 126, and 128) is defined by the CLD parameters. This means that there is no need to actually measure the total energy and the local energy since the difference in energy in dB that is typically used to calculate the scale factor is defined by the CLD parameters .
  • PS two channel decoder
  • the inventive methods can be implemented in hardware or in software.
  • the implementation can be performed using a digital storage medium, in particular a disk, DVD or a CD having electronically readable control signals stored thereon, which cooperate with a programmable computer system such that the inventive methods are performed.
  • the present invention is, therefore, a computer program product with a program code stored on a machine-readable carrier, the program code being operative for performing the inventive methods when the computer program product runs on a computer.
  • the inventive methods are, therefore, a computer program having a program code for performing at least one of the inventive methods when the computer program runs on a computer.

Abstract

Parameters being a measure for a characteristic of a channel or of a pair of channels, wherein the parameter is a measure for a characteristic of the channel or of the pair of channels with respect to another channel of a multi-channel signal can be quantized more efficiently using a quantization rule that is generated based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal. With generation of the quantization rule taking into account a psycho acoustic approach, the size of an encoded representation of the multi-channel signal can be decreased by coarser quantization without significantly disturbing the perceptual quality of the multi-channel signal when reconstructed from the encoded representation.

Description

ENERGY DEPENDENT QUANTIZATION FOR EFFICIENT CODING OF SPATIAL
AUDIO PARAMETERS
Field of the invention
The present invention relates to quantization of spatial audio parameters and in particular to a concept to allow for a more efficient compression without significantly reducing the perceptual quality of an audio signal reconstructed using the quantized spatial audio parameters .
Background of the invention and prior art
Recently, multi-channel audio reproduction techniques are becoming more and more important. In the view of an efficient transmission of multi-channel audio signals having 5 or more separate audio channels, several ways of compressing a stereo or multi-channel signal have been developed. Recent ap- proaches for the parametric coding of multi-channel audio signals (parametric stereo (PS) , "Binaural Cue Coding" (BCC) etc.) represent a multi-channel audio signal by means of a down-mix signal (could be monophonic or comprise several channels) and parametric side information, also referred to as "spatial cues", characterizing its perceived spatial sound stage.
A multi-channel encoding device generally receives - as input - at least two channels, and outputs one or more carrier channels and parametric data. The parametric data is derived such that, in a decoder, an approximation of the original multi-channel signal can be calculated. Normally, the carrier channel (channels) will include subband samples, spectral coefficients, time domain samples, etc., which provide a com- paratively fine representation of the underlying signal, while the parametric data do not include such samples of spectral coefficients but include control parameters for con- trolling a certain reconstruction algorithm instead. Such a reconstruction could comprise weighting by multiplication, time shifting, frequency shifting, phase shifting, etc. Thus, the parametric data includes only a comparatively coarse rep- resentation of the signal or the associated channel.
The binaural cue coding (BCC) technique is described in a number of publications, as in "Binaural Cue Coding applied to Stereo and Multi-Channel Audio Compression", C. Faller, F. Baumgarte, AES convention paper 5574, May 2002, Munich, in the 2 ICASSP publications "Estimation of auditory spatial cues for binaural cue coding", and "Binaural cue coding: a normal and efficient representation of spatial audio", both authored by C. Faller, and F. Baumgarte, Orlando, FL, May 2002.
In BCC encoding, a number of audio input channels are converted to a spectral representation using a DFT (Discrete Fourier Transform) based transform with overlapping windows. The resulting uniform spectrum is then divided into non- overlapping partitions. Each partition has a bandwidth proportional to the equivalent rectangular bandwidth (ERB) . Then, spatial parameters called ICLD (Inter-Channel Level Difference) and ICTD (Inter-Channel Time Difference) are es- timated for each partition. The ICLD parameter describes a level difference between two channels and the ICTD parameter describes the time difference (phase shift) between two signals of different channels. The level differences and the time differences are normally given for each channel with re- spect to a reference channel. After the derivation of these parameters, the parameters are quantized and finally encoded for transmission.
Although ICLD and ICTD parameters represent the most impor- tant sound source localization parameters, a spatial representation using these parameters can be enhanced by introducing additional parameters. A related technique, called "parametric stereo" describes the parametric coding of a two-channel stereo signal based on a transmitted mono signal plus parameter side information. There, 3 types of spatial parameters, referred to as inter- channel intensity difference (IIDs), inter-channel phase differences (IPDs), and inter-channel coherence (IC) are introduced. The extension of the spatial parameter set with a coherence parameter (correlation parameter) enables a pa- rametrization of the perceived spatial "diffuseness" or spatial "compactness" of the sound stage. Parametric stereo is described in more detail in: "Parametric Coding of stereo audio", J. Breebaart, S. van de Par, A. Kohlrausch, E. Schuijers (2005) Eurasip, J. Applied Signal Proc. 9, pages 1305-1322)", in "High-Quality Parametric Spatial Audio Coding at Low Bitrates", J. Breebaart, S. van de Par, A. Kohlrausch, E. Schuijers, AES 116th Convention,. Preprint 6072, Berlin, May 2004, and in "Low Complexity Parametric Stereo Coding", E. Schuijers, J. Breebaart, H. Purnhagen, J. Engdegard, AES 116th Convention, Preprint 6073, Berlin, May 2004.
The international publication WO 2004/008805 Al teaches, how a multi-channel audio signal can be advantageously compressed by combining several parametric stereo modules, thus realiz- ing a hierarchical structure to derive a representation of the original multi-channel audio signal comprising a down-mix signal and parametric side information.
Within the BCC and parametric stereo (PS) approach, a repre- sentation of the level differences (also called intensity differences ICLD or energy differences IID) between audio channels is a vital part of a parametric representation of a stereophonic/multi-channel audio signal. Such information and other spatial parameters are transmitted from the encoder to the decoder for each time/frequency slot. In the view of coding efficiency, it is therefore of high interest to represent these parameters as compactly as possible while preserving audio quality.
In BCC coding, the level differences are represented relative to a so-called "reference channel" and are quantized on a uniform scale in units of dB relative to a reference channel. This does not optimally exploit the fact that channels with low level with respect to the reference channel are subject to a significant masking effect when listened to by human listeners. In the extreme case of a channel having no signal at allf the bandwidth used by parameters describing this particular channel is completely wasted. In the more common case, where one channel is much fainter than another channel, that is a listener can hardly hear the faint channel during the playback, a less precise reproduction of the faint channel would also lead to the same perceptual quality of the listener, as the faint signal is mainly masked by the stronger signal.
To explain the situation and the problems arising, when encoding a multi-channel signal, reference is made to Fig. 10a where a commonly used 5-channel signal is illustrated. The 5- channel configuration is having a left rear channel 101 (A, having a signal a(t)), a left front channel 102 (B, having a signal b(t)), a center channel 103 (C, having a signal c(t)), a right front channel 104 (D, having a signal d(t)) and a right back channel 105 (E, having a signal e(t)). Intensity relations between single channels or channel pairs are marked with arrows. Hence, the intensity distribution between the front left channel 102 and the front right channel 104 is marked ri (110) , the intensity distribution between the left back channel and the right back channel is marked TA (112) . The intensity distribution between the combination of the left front channel 102 and the right front channel 104 and the center channel 103 is marked r2 (114) and the intensity distribution between the combination of the back channels and the combination of the front channels is marked r3 (116) . When, for example, a simple monologue is recorded, most of the energy would be contained in the center channel 103. In this example, especially the back channels will contain only little (or 0) energy. Therefore, parameters describing the properties of the back channels are merely wasted in this example, since mainly the center channel 102 or the front channels will be active during the play back.
Based on Fig. 10a, ways of computing the energy distribution between channels or channel combinations are described within the following paragraph.
Fig. 10a illustrates a multi channel parameterization for a five channel speaker set-up where the different audio chan- nels are indicated by 101 to 105; a(t) 101 represents signal of the left surround channel, b(t) 102 represents the. signal of the left front channel, c(t) 103 represents the signal of the center channel, d(t) 104 represents the signal of the right front channel, e(t) 105 represents the signal of the right surround channel. The speaker set-up is divided into a front part and a back part. The energy distribution between the entire front channel set-up (102, 103 and 104) and the back channels (101 and 105) are illustrated by the arrow in Fig. 10a and indicated by the r3 parameter. The energy dis- tribution between the center channel 103 and the left front 102 and right front 103 channels are indicated by r2. The energy distribution between the left surround channel 101 and the right surround channel 105 is illustrated by r^. Finally, the energy distribution between the left front channel 102 and the right front channel 104 is given by rj. Since ri to r4 are parameterizations of different regions it is also clear that beside energy distribution also other essential region properties can be parameterized, as for example the correlation between the regions. Additionally for each parameter ri to r4 a local energy can be calculated. For example the local energy of r4 is the summed energy of channel A 101 and E 105.
Figure imgf000006_0001
Where E[.] is the expected value as defined by
Figure imgf000007_0001
Fig. 10b shows a multi-channel audio decoder built by hierarchically ordering parametric stereo modules, as for example described in WO 2004/008805 Al. Here, the audio channels 101 to 105, as introduced in Fig. 10a, are reproduced step by step from a single monophonic down-mix signal 120 (M) and corresponding side information by a first two-channel decoder 122, a second two-channel decoder 124, a third two- channel decoder 126, and a fourth two-channel decoder 128. As can be seen, in the treelike structure in Fig. 10b, the first two-channel decoder decomposes the monophonic down-mix sig- nal 120 into two signals fed into the second and the third two-channel decoders 124 and 126. Therein, the channel fed into the third two-channel decoder 126 is a combined channel, being combined from the left back channel 101 and the right back channel 105. The channel fed into the second two-channel decoder 124 is a combination of the center channel 103 and a combined channel which is again being a combination of the front left channel 102 and of the front right channel 104.
Thus, after the second step of the hierarchical decoding, the left back channel 101, the right back channel 105, the center channel 103, and a combined channel, being a combination of the front left channel 102 and the front right channel 104 are reconstructed, using the transmitted spatial parameters, that are comprising a level parameter for use by each of the two-channel decoders 122, 124, and 126.
In the third step of the hierarchical decoding, the fourth two-channel decoder 128 derives the front left channel 102 and the front right channel 104, using a level information transmitted as side information for the fourth two-channel decoder 128. Using a prior art hierarchical decoder as shown in Fig. 10b, the desired energy for each single output channel follows from various different parametric stereo modules between the input signal and each output signal. In other words, the energy of a specific output channel can depend on the IID/ICLD parameters of multiple parametric stereo modules. In such a treelike structure of connected parametric stereo modules, also a non-uniform quantization of HD parameters can be applied within each parametric stereo module to produce HD values, which are then used by a decoder as part of the side information. This would exploit the benefits of non-uniform HD quantization locally (i.e. within each parametric stereo module individually) , nonetheless it is sub- optimum because quantization in each module ("leafs") is carried out independently of the energies/level of other audio channels that may be high in relative level and, therefore, produce masking.
This is possible, since "leaf" modules are not aware of the global level distribution at a higher tree level (e.g. the "root" module) . Each leaf has its own corresponding IID/ICLD parameter, which indicates the energy distribution from its input toward output channels. For example, the IID/ICLD parameter of leaf "r3" (processed by the first two-channel decoder 122) may indicate that 90 % of the incoming energy should be sent to leaf r2, while the remaining energy (10 %) should be sent to leaf r4. This process is repeated for each leaf in the tree. Since each energy distribution parameter is represented with limited accuracy, the deviation between the desired and the actual energy of each output channel A to E depends on the quantization errors in the IID/ICLD parameters, as well as on the energy distribution (and hence propagation of quantization errors) . In other words, as the same quantization table is used for a certain parameter type, e.g. ICC or HD, within all parameterization stages n to r4, the IID/ICLD quantization is performed optimal only locally. This means that for each parameterization stage r1 to r4, the error in output energy of the (local) output channels is maxi- mum for the weakest output channel in prior art implementations.
As detailed in the previous paragraphs, the quantization of level parameters (IID or ICLD) or other parameters such as ICC, phase differences or time differences describing the spatial perception of a multi-channel audio signal is still sub-optimal, since bandwidth may be wasted for spatial parameters describing channels that are mainly masked due to low energy within the channel.
Summary of the invention
It is the object of the present invention to provide an improved concept for quantization of spatial parameters of a multi-channel audio signal .
According to a first aspect of the present invention this ob- ject is achieved by a parameter quantizer for quantizing an input parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising: a quantization rule generator for generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and a value quantizer for deriving a quantized parameter from the input parameter, using the generated quantization rule.
According to a second aspect of the present invention this object is achieved by a parameter dequantizer for dequantiz- ing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single chan- nel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising: a dequantization rule generator for generating a de- quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and a value dequantizer for deriving the parameter from the quantized parameter, using the generated dequantization rule.
According to a third aspect of the present invention this object is achieved by a method of quantizing an input parameter, wherein the input parameter is a measure for a charac- teristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising: generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and deriving a quantized parameter from the input parameter using the generated quantization rule.
According to a fourth aspect of the present invention this object is achieved by a method of dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising: generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and deriving the parameter from the quantized parameter using the generated dequantization rule.
According to a fifth aspect of the present invention this object is achieved by a representation of a multi-channel signal having a quantized parameter being a quantized representation of a parameter being a measure for a characteristic of a single channel or a pair of channels, wherein the parameter is a measure for a characteristic of the single channel or the pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, wherein the quantized parameter is derived using a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi- channel signal.
According to a sixth aspect of the present invention this object is achieved by a machine-readable storage medium having stored thereon a representation of a multi-channel signal as described above.
According to a seventh aspect of the present invention this object is achieved by a. transmitter or audio recorder having a parameter quantizer for quantizing an input parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising: a quantization rule generator for generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and a value quantizer for deriving a quantized parameter from the input parameter, using the generated quantization rule.
According to an eighth aspect of the present invention this object is achieved by a receiver or audio player having a parameter dequantizer for dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising: a dequantization rule generator for generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and a value dequantizer for deriving the parameter from the quantized parameter, using the generated dequantization rule. According to a ninth aspect of the present invention this object is achieved by a method of transmitting or audio recording, the method comprising a method of quantizing an in- put parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising: generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and deriving a quantized parameter from the input parameter using the generated quantization rule.
According to a tenth aspect of the present invention this object is achieved by a method of receiving or audio playing, the method having a method of dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising: generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and deriving the parameter from the quantized parameter using the generated dequantization rule.
According to an eleventh aspect of the present invention this object is achieved by a transmission system having a trans- mitter and a receiver, the transmitter having a parameter quantizer for quantizing an input parameter; and the receiver having a parameter dequantizer for dequantizing a quantized parameter.
According to a twelfth aspect of the present invention this object is achieved by a method of transmitting and receiving, the method including a transmitting method having a method of quantizing an input parameter; and the method including a method of receiving including a method of dequantizing a quantized.
According to a thirteenth aspect of the present invention this object is achieved by a computer program for performing, when running on a computer, one of the above methods.
The present invention is based on the finding that parameters being a measure for a characteristic of a single channel or of a pair of channels with respect to another single channel or of a pair of channels of a multi-channel signal can be quantized more efficiently using a quantization rule that is generated based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal.
The inventive concept has the major advantage that a quantization rule is either generated or an appropriate quantiza- tion rule is selected from a group of available quantization rules, depending on the energy of the signal to be described. Therefore, a psycho-acoustic model can be applied to a quantizer during encoding or a dequantizer during decoding, to use a quantization rule adapted to the needs of the actual signal. Especially, when a channel contains very little energy compared to other channels within the multi-channel signal, the quantization can be much more coarse than for signals having high energies. This is due to the fact that the high energy signals mask the low energy signals during play- back, i.e. a listener will hardly recognize any details of the low energy signal and thus the low energy signal can be deteriorated more through coarse quantization without the listener being able to recognize the falsification because of the high masking of the low energy signal.
In one embodiment of the present invention, a parameter quantizer for quantizing parameters is having a quantization rule generator for generating a quantization rule and a value quantizer for deriving quantized parameters from input parameters using the generated quantization rule. To generate an appropriate quantization rule, the quantizer selector re- ceives as an input the total energy of the multi-channel audio signal to be coded and the local energy of the channel or the pair of channels whose spatial parameters are to be quantized. Knowing the total energy and the local energy, the quantizer selector can decide, which quantization rule to use, i.e. select coarser quantization rules for channels or channel pairs having comparatively low local energy. Alternatively, the quantizer selector could also derive an algorithmic rule to modify an existing quantization rule or to calculate a completely new quantization rule depending on the local and the total energy. One possibility would for example be to calculate a general scale factor to be applied to a signal before a linear quantizer or a. non-linear quantizer to achieve the goal of reducing the size of the side information to be transmitted.
In a further embodiment of the present invention a multi channel signal is encoded in a pairwise manner, i.e. by using a hierarchical structure that is having several 2-to-l down- mixers ordered in a tree-like structure, each downmixer gen- erating a mono channel out of two channels input into the downmixer. Following the inventive concept, energy dependent quantization can now be implemented not only locally, i.e. at each 2-to-l downmixer having the information available at the input of the 2-to-l downmixer only, but based on the global knowledge on the sum of the signal energies. This enhances the perceptual quality of a perceptual signal significantly.
It is evident that following the inventive concept, the side information size can be decreased while the quality of the encoded multi-channel audio signal is hardly affected. In a further embodiment of the present invention, an inventive parameter quantizer is incorporated in a parameter encoder before a differential encoder and a Huffman encoder, both of which are used for further encoding the quantized pa- rameters to derive a parameter bit stream. Such an inventive encoder has the great advantage that in addition to decreasing the size of code words needed to describe the quantized parameters, a coarser quantization will automatically increase the abundance of identical code words fed into the differential encoder and the Huffman encoder, which allows for a better compression of the quantized parameters, further reducing the size of the side information.
In a further embodiment of the present invention, an inven- tive parameter quantizer is having a quantizer factor function generator and a parameter multiplier. The quantizer factor function generator receives the total and the local energy as input and derives a single sealer value from the input quantities. The parameter multiplier receives the parame- ters and the derived quantizer factor f to divide the parameters by the quantizer factor prior to transferring the modified parameters to the quantizer that applies a fixed quantization rule to the modified parameters .
A variation of this embodiment is to have a parameter multiplier after the quantizer and hence use the derived quantizer factor f to divide the resulting index out of the quantizer. The result of this then needs to be rounded into an integer index again.
Application of a scaling factor to the parameters has the same effect as choosing different quantization rules, since for example division by a big factor compresses the input parameter space such that effectively only a smaller part of a already existing quantization rule would be effective. This solution has the advantage that on the decoder and the encoder side additional memory can be saved because there is only one quantization rule to be stored or to be processed since the scaling is done by a simple multiplication requiring only limited additional hard- or software. An additional advantage is that by applying a quantizer factor, the quan- tizer factor can be derived using any possible functional dependence- Therefore, a quantizer or dequantizer sensitivity can be adjusted continuously within the whole possible input parameter space rather than selecting predefined quantization rules out of a given sample.
Brief description of the drawings
Preferred embodiments of the present invention are subse- quently described by referring to the enclosed drawings, wherein:
Fig. 1 shows a block diagram of an inventive parameter quantizer;
Figs. 2a to c show several possible quantization rules to be applied;
Fig. 3 shows a parameter encoder having an inventive parameter quantizer;
Figs. 4a, 4b show an alternative embodiment of a parameter encoder having an inventive parameter quantizer;
Fig. 5 shows examples of scale factor functions;
Fig. 6 shows a non-linear quantization rule;
Fig. 7 shows an inventive parameter dequantizer; Fig. 8 shows a parameter decompressor having an inventive parameter dequantizer;
Fig. 9a shows an embodiment of an inventive parameter dequantizer;
Fig. 9b shows a further embodiment of an inventive parameter dequantizer;
Fig. 9c shows an example for implementing energy dependent dequantization;
Fig. 9d shows a further example for implementing energy dependent dequantization.
Fig. 9e shows examples of quantization and dequantization of parameters;
Fig. 10a shows a representation of a 5-channel multi- channel audio signal; and
Fig. 10b shows a hierarchical parametric multi-channel decoder according to prior art.
Detailed description of preferred embodiments
Fig. 1 shows an inventive parameter quantizer 199 having a quantizer 200 and a quantizer selector 202. The quantizer selector 202 receives the local energy of the channel or the pair of channels underlying the parameters to be encoded and the total energy of the multi-channel audio signal. Based on both energy informations, the quantizer selector 202 gener- ates a quantization rule that is used by the quantizer 200 to derive a quantized parameter 204 from a parameter 206 input into the quantizer 200. Hence, in this case the quantizer selector 202 serves as a quantization rule generator.
The input parameters to the quantizer selector 202 are the total energy of the original multi-channel signal and the local energy for the channel described by the parameter to be . quantized. In a preferred embodiment of the present invention the ratio between the local energy and the total energy gives a measure that can be used to decide which quantizer to use. As an example this ratio q (Relative Local energy) can be calculated in dB, using the following equation:
Figure imgf000018_0001
The selected quantizer is then used to quantize the parameter 206 with the quantizer .
The present invention teaches that a coarser quantization of IID/ICLD parameters (and the like) can be used if a pa- rametrization stage is lower in energy compared to the total energy, i.e. when the relative Local energy q is small. The present invention utilizes the psycho-acoustic relation that it is more important to parameterize the dominant/high energy signals with high accuracy than the audio signal with less significance/low energy. To make this even clearer; reference is again made to Fig. 10a. When within an audio scene in the original multi-channel signal the energy/signal is primarily present in the front image, meaning the left front channel 102, the center channel 103 and the right front chan- nel 104, the surround channels can be quantized with less accuracy since the surround channels have much less energy. The additional quantization error introduced from the coarser quantization cannot be perceived since the front channels have much higher energy and hence the quantization error of r4 (and the resulting energy errors for surround channels A and E) is masked by channels B, D, and/or C. In the most extreme example, the surround channels A and E only have some faint noise and the front channels B, C, and D have full amplitude signals. In such a case, a 16 bit PCM original signal would indicate an energy difference of more than 80 dB. Therefore, parameter r4 could be quantized arbitrarily coarse without introducing any audible differences due to (coarse) quantization.
Figs. 2a to 2c show three possible quantization rules introducing different levels of quantization errors. All figures show the original parameter on their x-axis and the integer values assigned to the parameters on their y-axis. Furthermore the Figs. 2a to 2c show dashed lines which correspond to indices for each quantization step and hence can be used for transmission or storage. The transmitted indices can then be used on the decoder side, for example in combination with a lookup-table, for de-quantization.
The finest quantization is indicated in Fig. 2a by the quantization curve 230 that maps discrete parameter intervals of the x-axis to 13 integer values. Intermediate quantization is achieved by the quantization curve 232 in Fig. 2b, whereas the coarsest quantization is achieved by the quantization curve 234 of Fig. 2c. It is obvious that the quantization error introduced is biggest in the example shown in Fig. 2c and smallest in the example shown in Fig. 2a.
These three quantization rules are examples of quantization rules that may be selected by the quantizer selector 202. In other words, Figs. 2a to c illustrate three different linear quantization rules, where the x-axis describes the input value and the y-axis gives the corresponding quantized value.
Figs. 2a to 2c all have the same scale on the x-axis and y- axis and hence, Fig. 2a has the finest quantization of the three and thus the smallest quantization, error. Fig. 2c has the coarsest quantization and thus the largest quantization error. It would also yield the lowest bit rate after differential coding and Huffman coding since it has the smallest amount of quantization steps.
As an example, a possible quantization rule generation could be based on the relative Local energy q between the local energy and the total energy, as introduced above. A possible range of q-values with corresponding selections of quantization rules is summarized, as an example, within the following table:
Figure imgf000020_0001
Fig. 3 shows an inventive parameter compressor having an inventive parameter quantizer 199, a differential encoder 220, and a Huffman encoder 222. The inventive parameter encoder of Fig. 3 extends the parameter quantizer of Fig. 1 by using the quantized parameters as input for the differential encoder 220 that differentially encodes the quantized parameters 204 to derive differentially encoded quantized parame- ters that are then input into the Huffman encoder 222 that applies a Huffman coding scheme to the differentially encoded quantized parameters deriving a parameter bitstream element 224 of a final parameter bit stream as output.
The combination of an inventive parameter quantizer with a differential encoder and a Huffman encoder is particularly attractive since coarser quantization results in a higher abundance of equal symbols (quantized parameters) . The combination of the differential encoder 220 and the Huffman en- coder 222 will evidently provide an encoded representation of the quantized parameters (parameter bitstream element 224) that is more compact, when the maximum number of possible input symbols is decreased by a coarser quantization. Fig. 4a shows a further embodiment of an inventive parameter encoder using an inventive parameter quantizer 250, a differential encoder 252, and a Huffman encoder 254.
The parameter quantizer 250 is having a quantizer factor generator 256, a parameter sealer 258, and a quantizer 260. In this case the quantizer factor generator 256 together with the parameter sealer 258 serve as a quantization rule genera- tor.
The quantizer function generator 256 receives as input the total energy of the multi-channel audio signal and the local energy of the channel or the channel pair for the parameter to be quantized. The quantizer factor generator 256 generates a scale factor 262 (f) based on the local energy and the total energy. In a preferred embodiment this is done on a basis of a ratio between the local energy and the total energy resulting in a relative local energy q, as follows:
Figure imgf000021_0001
This ratio q can be used within the quantizer factor generator 256 to calculate the quantizer factor f (262) that is used as input for the parameter sealer 258 that additionally receives the parameter to be quantized.
The parameter sealer 258 applies a scaling to the input parameter that could for example be a division of the parameter by the quantizer factor 262. The scaling of the parameter is equivalent to selecting different quantization rules. The scaled parameter is then input into a quantizer 260 that ap- plies a fixed quantization rule within this embodiment of the present invention. The further processing of the quantized parameter is equal to the processing of Fig. 3, the parameter is differentially encoded and afterwards Huffman encoded to finally yield a parameter bit stream element.
Applying a scaling factor to the parameters has the advantage that the quantization rule could be adapted to the needs in a continuous way, since an analytical function deriving the quantization factor 262 can basically have any form.
Fig. 4b shows a further embodiment of an inventive parameter encoder 270 which is similar to the inventive parameter encoder 250 shown in Fig. 4a. Therefore, only the differences to parameter encoder 250 shall be explained shortly within the following paragraph.
The inventive parameter encoder 270 is not having a parameter sealer (parameter sealer 258 of parameter encoder 250) . To achieve an energy dependency of quantization, the parameter quantizer 270 is having a compression device 272 instead. That means the quantizer factor generator 256 together with the compression device 258 serve as a quantization rule generator in this case. The compression device 272 is connected to the quantizer 260 and to the quantizer factor generator 256. The compression unit 272 receives as an input a quantized parameter that is quantized by the quantizer 260 according using a fixed quantization scheme. To implement the energy dependence, the compression unit uses the quantized parameter as input and scales the quantized parameter using the scale factor 262. This saves bit rate by decreasing the possible number of quantized parameters to be transmitted to the delta coder 252. This compression can for example be achieved by a division of the quantized parameter index by the scaling factor 262.
Possible functions to derive the scale factor 262 from the relative Local energy ratio q are shown in Fig. 5. Fig. 5 shows as an example four different possible functions 300, 302, 303, and 304 that can be .used to derive the scale factor f. The first factor function 300 is a constant function and thus has no energy dependency.
The factor functions 302, and 304 show two possibilities to implement factor functions, wherein the factor function 302 is the less aggressive one and would therefore increase the introduced quantization error less than using factor function 304. On the other hand, factor function 302 would save less bit rate than factor function 304. Factor function 303 shows a fourth possibility to derive the quantizer factor from the energy quota q, whereas the factor function 303 is step-like in form and therefore assigns intervals of the energy quota q to the same quantizer factor.
Fig. 6 exemplifies a non-uniform quantizer where the input on the x-axis in dB is quantized according to the function 310 to result in the output y in dB that is drawn on the y-axis. Such a non-uniform quantizer function can be used to quantize spatial parameters as well. This is of special interest when the reference channel within a BCC-coding scheme is chosen to be the strongest channel within a multi-channel signal . The non-uniform quantizer as shown in Fig. 6 exemplifies a quantizer function 310 that would suit the needs then., since the quantization steps increase as the energy level becomes smaller compared to the referenced channel. This is a particularly attractive property since the energy level quantizing errors can be larger for channels with less energy than for the strongest channels.
Fig. 7 shows an inventive parameter dequantizer 500 having a dequantizer 502 and a dequantizer selector 504. The dequantizer selector 504 receives the total energy of the multichannel audio signal and the local energy of the channel or channel pairs together with a quantized parameter 505 that is to be dequantized. Based on the received energy information, the dequantizer selector 504 derives a dequantization rule that is used by the dequantizer 502 to dequantize the quantized parameter 505. Hence, in this case the dequantizer selector 504 serves as a dequantization rule generator.
It may be noted that the dequantizer selector 504 may operate in different ways. A first possibility is that the dequantizer selector 504 derives the quantization rule directly and transfers the derived quantization rule to the dequantizer 502. Another possibility is that the dequantizer selec- tor 504 meets a dequantization rule decision, which is transferred to the dequantizer 502 that can use the dequantization rule decision to select the appropriate dequantization rule from a number of quantization rules that are for example stored in the dequantizer 502.
Fig. 8 shows an inventive parameter decoder having a parameter dequantizer 500, a differential decoder 510, and a Huffman decoder.512.
The Huffman decoder 512 receives a parameter bit stream element 513 and in association therewith, the dequantizer selector 504 receives the local energy of a channel or a pair of channels described by the parameter bit stream element 513 and the total energy of the multi-channel audio signal. The parameter bit stream element 513 is produced by an inventive parameter encoder, as shown in Fig. 3. Therefore, the parameter bit stream element 513 is Huffman decoded by the Huffman decoder 512 and differentially decoded by a differential decoder 510 before being supplied to the dequantizer 502. After the decoding by the Huffman decoder 512 and the differential decoder 510, the dequantization is performed by the inventive parameter dequantizer 500, as already described in the description of the inventive parameter of Fig. 7.
In other words, Fig. 8 illustrates a decoder using an energy dependent dequantizer 500, the decoder corresponding to an inventive encoder. The parameter bit stream element is Huff- man decoded and differentially decoded into indices. The correct dequantizer is chosen in the dequantizer selector 504 using the same rule and function as was used in the encoder with the total energy and local energy as input. The selected dequantizer is then used to dequantize (using the dequantizer 502) the indices into dequantized parameters.
Fig. 9a shows a further embodiment of an inventive parameter decoder, having an inventive energy dependent dequan- tizer 520, a Huffman decoder 512, and a differential decoder 510. The parameter dequantizer 520 comprises a quantizer factor generator 522, a dequantizer 524, and a parameter sealer 526. In this case the dequantizer factor generator 522 together with the parameter sealer 526 serve as a de- quantization rule generator.
After decoding the parameter bit stream element 513 by the Huffman decoder and the differential decoder, the quantized parameter is dequantized by the dequantizer 524, wherein the dequantizer 524 is using a dequantization rule matching a quantization rule used to generate the quantized parameter. The quantizer factor generator 522 derives a scale factor 528 (f) from a ratio of the local energy and the total energy of the multi-channel audio signal. The parameter sca- ler 526 then applies the scale factor 528 to the dequantized parameter by a multiplication of the scale factor with the dequantized parameter.
After the scaling by the parameter sealer 526, the decom- pressed dequantized parameters are available at an output of the inventive parameter decoder.
Fig. 9b shows a further embodiment of an inventive parameter decoder 530, similar to the inventive parameter decoder 520. Therefore, only the differences to the parameter decoder 520 shall be elaborated on in the following paragraph. The inventive parameter decoder 530 is having a decompressor 532, the decompressor 532 achieving the same functional result as the parameter sealer 526 in the inventive parameter decoder 520. The decompressor 532 receives as an input the quantized parameters and as further input the scale factor 528 from the factor generator 522. That means the factor generator 522 together with the decompressor 532 serve as a dequantization rule generator in this case. To implement the energy weighted dequantizing functionality, the quantized pa- rameter is scaled by the decompressor 532 before the so derived scaled quantized parameter is input into the dequan- tizer 524. The dequantizer 524 then dequantizes the scaled quantized parameter to derive the dequantized parameter using a fixed dequantization rule. This decompression can for exam- pie be achieved by a multiplication of the quantized parameter index by the scale factor 528.
Although the scaling by the parameter sealer 258 and the parameter sealer 526 during the encoding and decoding is de- scribed to be a division during the encoding and a multiplication during the decoding, any other type of scaling that has the same effect as using a different quantization rule can be applied to the parameters during the encoding or decoding.
In the case of a stacked parameterization (hierarchical de- or encoding) as exemplified for example in Fig. 10b, it should be noted that since the decoder can decode the energy distribution from the roots (the down-mix channel) out to the leafs, there is a well-defined local energy in each pa- rametrization ri to r4 (two channel decoders 122, 124, 126, and 128), which can be used as the local energy on the decoder side. Additionally, if an encoder also quantizes from root to leaf, exactly the same local energy can be used on the encoder as local energy for the quantizer selector and the quantizer factor function. In other words, a decoder may either decide autonomously which dequantization rule to use using the total energy and the local energy. Alternatively, it could be signalled by some additional side information to the decoder, which de- quantization rule is the appropriate one to dequantize the parameters.
Although described within different embodiments of the present invention, the application of a scale factor and the se- lection of an appropriate dequantization rule can also be combined within one embodiment of an inventive encoder or decoder.
To give a more detailed example, two possible ways of imple- menting energy dependent dequantization for the reconstruction of a multi-channel signal from a transferred monophonic signal M using additionally transmitted spatial parameters (CLD, ICC) are shown in Figs. 9c and 9d. Before discussing the Figs . , it may be noted that the tree-like structure shown in the Figs, is only important for the reconstruction of the spatial parameters, wherein the actual ab-mix for generation of the individual channels of a multi-channel signal is normally performed within a single step.
Fig. 9c shows the situation where the parameters CLD are derived such that it is assumed that a parameter CLD0 describes the energy distribution between channels that are combined using a number of channels of the original signal.
In the first hierarchic up-mix position 1000-, CLD0 describes the energy relation between two channels, wherein a first channel is a combination 1002 of a front-left, a front-right, a center and a low-frequency-enhancement channel. The second channel is a combination of a back-left and a back-right channel. In other words, the parameter CLD0 describes the energy distribution between all rear channels and all front channels . It is therefore evident when CLD0 indicates that only little energies contained in the rear channels, the parameters describing the spatial properties between the back-left and the back-right channel may be quantized stronger, since the addi- tionally-introduced distortion by the coarse quantization is hardly audible when all channels are played back simultaneously.
An inventive parameter dequantizer, as shown in Fig. 9b is, for example, calculating a scale factor 528 to implement the dequantization by multiplying a parameter to be dequantized with a parameter index before the actual dequantization is performed. Therefore, if a parameter CLD0 is transmitted, one may, when using the decoder of Fig. 9b for example, calculate the finally-used CLD parameters of other hierarchical steps according to the following formula.
In the following, the term "DEQ" describes the application of a fixed dequantization table to a parameter given to the pro- cedure DEQ. That means, a transmitted parameter IDX CLD (0,L) can be dequantized directly, indicated by the following expression:
Figure imgf000028_0001
Since the CLD parameter describes an energy distribution between two channels and the channels are combinations of channels as indicated in Fig. 9c, one may now derive the relative local energy FC according to:
Figure imgf000028_0002
The relative local energy of the back channels is accordingly:
Figure imgf000029_0001
Given the above and the inventive concept, CLD1 can now be computed, taking into account the overall energy contained in the combination signal 1002:
Figure imgf000029_0002
In the formula given above, the term "facFunc" describes a function giving a real value independency of the relative local energy FC. In other words, formula 4 describes that before dequantization, the transmitted parameter index IDX CLD (1,1,m) is multiplied with a scale factor (facFunc) to derive an intermediate quantized parameter. Since the intermediate quantized parameter is not necessarily integer-valued, the intermediate quantized parameter must be rounded to derive IdxCLDEdQ, which is then dequantized into the final parameter used by the following operation:
Figure imgf000029_0003
Dequantization is performed by a standard dequantization table, such as, for example, the following:
Figure imgf000029_0004
The derived parameter CLD1 describes an energy relation between a channel being a combination of a front-left and a front-right channel and a channel being a combination of a center and a low-frequency-enhancement channel, as can be seen from the channel decomposition in the second hierarchical step 1004. Such, a relative local energy F, describing an energy contained in the front channels, front-left and front- right, can be computed according to the following formula:
Figure imgf000030_0001
Previously, a relative local energy S describing the energy of the back channels has been derived such that an intermediate quantized parameter IDX CLD EDQ can be calculated for the hierarchical box 1006 according to the following formulas:
Figure imgf000030_0002
Since, as previously described, a relative local energy de- scribing the energy of the front-channels only (F5151) is now available, parameter CLD3 describing an energy relation between the front-left and the front-right channel can now be derived in an energy-dependent way according to the following formulas :
Figure imgf000030_0003
In one possible implementation, parameter CAD4 describing an energy relation between the center and the low-frequency- enhancement channel can now be derived using no factor function:
Figure imgf000031_0003
In alternative embodiments, it is, of course, also feasible to implement energy-dependency also in the derivation of the- parameter CLD4.
Fig. 9d shows another possibility of defining a hierarchic for the derivation of the spatial parameters.
In analogy to the description of Fig. 9c, the individual CLD- parameters may be derived according to the following formulas:
Figure imgf000031_0004
Figure imgf000031_0001
Figure imgf000031_0006
Figure imgf000031_0002
Figure imgf000031_0005
Figure imgf000032_0001
It may be noted that different factor functions may be used to implement the inventive concept as, for example, one of the functions shown in Fig. 5.
Generally, as already mentioned above, it is the inventive concept to apply an energy-dependent quantization in the sense that parameters (CLD) of parts of the signal that con- tain relatively low energy compared to other signal parts, are quantized in a coarser way. That is, the factor function has to be such that for low energy components, the factor applied is large.
To illustrate this in more detail, one example is given in Fig. 9e, which shows the manipulations during encoding and decoding, further pointing out the concept of the invention. Reference is further made to the previously-introduced quantization table to calculate the examples shown.
Table 9d shows the manipulation of the quantization index on the quantizer side in a left column 1100, and the reconstruction of the transmitted parameter on the quantizer side in a column 1102. The transmitted parameter is given in column 1104. Two examples for a combination of channels having relatively low energy are shown. This is indicated by the common scale factor 4.5, which is significantly bigger than 1 (see Fig. 4) . According to the inventive concept, the quantization index IDX is divided by the scale factor after the quantiza- tion at the quantizer size. Afterwards, the result has to be rounded to an integer value to be differentially and Huffman encoded (see Fig. 4a) . Therefore, both example indexes 10 and 9 result in a transmitted index IDXtransm of 2.
The dequantizer multiplies the transmitted index by the scale factor to derive a rekonstructed index IDXrek used for de- quantization. As can be seen in the first example of an index 10 on the quantizer size, an additional error of 1 arises due to the rounding of the divided index on the quantizer size. On the other hand, when, by chance, the division of the scale factor at the quantizer side yields an integer valued index IDXtransm to be transmitted, no additional error is introduced.
Evidently, the danger of introducing additional errors rises with rising scale factor f. This means that the probability of adding additional errors to low energy signals is rather high. When signals described by the CLD parameter in question have comparatively equal energy, the CLD value will be close to unity and such will be the scale factor (see, for example Fig. 5) . This means, when the channels for which the parameters are encoded in an energy-dependent manner share roughly the same energy, no additional errors are normally introduced in the quantization. This is, of course, most appropriate, since when every channel has about the same energy within a multi-channel signal, every single channel is audible during simultaneous playback and, therefore, an error introduced would be clearly audible to the audience.
It is evidently an enormous advantage of the present inven- tion that errors are only accepted for channels having comparatively low energy. For those channels, on the other hand, by dividing the indices of the associated parameters by some large numbers brings the index values of those channels closer to zero, on the average. This can be exploited per- fectly by the following differential encoding and Huffman encoding procedure to efficiently decrease the bit rate consumed for the transmitted parameters of a multi-channel signal.
The relation of the local and the total energy upon which the decision which de-/quantization rule to use is based, is described to be a logarithmic measure within the previous para- graphs. This of course not the only possible measure that can be used to realize the inventive concept. Any other measure describing an energy difference between the local energy or the total energy, as for example the plain difference, can be used to make the decision.
Another important feature with the present invention is that in combination with a two channel decoder (PS) design that distributes the incoming energy into the two output channels typically controlled by e.g. CLD like parameter (meaning that the incoming energy equals the sum of the energies for the two output channels) , is that the difference in energy, Relative Local Energy between the total energy and the local energy for each two channel decoders (122, 124, 126, and 128) is defined by the CLD parameters. This means that there is no need to actually measure the total energy and the local energy since the difference in energy in dB that is typically used to calculate the scale factor is defined by the CLD parameters .
Depending on certain, implementation requirements of the inventive methods, the inventive methods can be implemented in hardware or in software. The implementation can be performed using a digital storage medium, in particular a disk, DVD or a CD having electronically readable control signals stored thereon, which cooperate with a programmable computer system such that the inventive methods are performed. Generally, the present invention is, therefore, a computer program product with a program code stored on a machine-readable carrier, the program code being operative for performing the inventive methods when the computer program product runs on a computer. In other words, the inventive methods are, therefore, a computer program having a program code for performing at least one of the inventive methods when the computer program runs on a computer. While the foregoing has been particularly shown and described with reference to particular embodiments thereof, it will be understood by those skilled in the art that various other changes in the form and details may be made without departing from the spirit and scope thereof. It is to be understood that various changes may be made in adapting to different embodiments without departing from the broader concepts disclosed herein and comprehended by the claims that follow.

Claims

CIAIMS
1. Parameter quantizer for quantizing an input parameter, wherein the input parameter is a measure for a charac- teristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising:
a quantization rule generator for generating a quantiza- tion rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and
a value quantizer for deriving a quantized parameter from the input parameter, using the generated quantization rule.
2. Parameter quantizer according to claim 1, in which the quantization rule generator is operative to generate the quantization rule such that a quantization is coarser for a channel or a channel pair having a low energy measure than for a channel or a channel pair having a high energy measure.
3. Parameter quantizer according to claim 1, in which the quantization rule generator is operative to choose one quantization rule from two or more predetermined quantization rules.
4. Parameter quantizer according to claim 1, in which the quantization rule generator is operative to calculate a new quantization rule based on a relation of the energy measure of the channel or the pair of channels and the energy measure of the multi-channel signal.
5. Parameter quantizer according to claim 4, in which the quantization rule generator is operative such that the calculation of the quantization rule comprises a calculation of a scale factor.
6. Parameter quantizer according to claim 5, further com- prising a parameter sealer for modifying the input parameter using the scale factor.
7. Parameter quantizer according to claim 6, in which the parameter sealer is operative to modify the input pa- rameter such that the modification includes a division of the input parameter by the scale factor.
8. Parameter quantizer in accordance with claim 5, further comprising a compression device, in which
the parameter quantizer is operative to derive an intermediate quantized parameter using a predetermined quantization rule; and
in which the compression device is operative to derive the quantized parameter using the intermediate quantized parameter and the scale factor.
9. Parameter quantizer according to claim 1, in which the quantization rule generator is operative to generate a quantization rule such that an application of the quantization rule to the input parameter comprises an assignment of the same quantized parameter to all input parameters within a given input parameter range.
10. Parameter quantizer according to claim 1, in which the input parameter is a spatial parameter, describing a spatial perception of the multi-channel audio signal, and in which the input parameter is chosen from the fol- lowing list of parameters:
inter-channel correlation/coherence (ICC) , inter-channel level/intensity difference (ICLD or HD) , inter-channel phase difference (IPD), and inter-channel time difference (ICTD) .
11. Parameter quantizer according to claim 1, further comprising a differential encoder and a Huffman encoder,
wherein the differential encoder is operative to derive a differentially encoded representation of the quantized parameter; and
wherein the Huffman encoder is operative to derive a Huffman encoded representation of the differentially encoded representation.
12. Parameter dequantizer for dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising:
a dequantization rule generator for generating a dequan- tization rule based on a relation of an energy measure of the single channel or the pair of channels and an energy measure derived from channels of the multi-channel signal; and
a value dequantizer for deriving the parameter from the quantized parameter, using the generated dequantization rule.
13. Parameter dequantizer according to claim 12, in which the dequantization rule generator is operative to use an energy measure derived from channels of the multichannel signal which is derived from a combination of channels not having the channel or the pair of channels.
14. Parameter dequantizer according to claim 12, in which the dequantization rule generator is operative to generate the dequantization rule such that a dequantization is coarser for a channel or a pair of channels having a low energy measure than for a channel or a pair of channels having a high energy measure.
15. Parameter dequantizer according to claim 12, in which the dequantization rule generator is operative to choose one dequantization rule from two or more fixed dequantization rules stored in a memory.
16. Parameter dequantizer according to claim 12, in which the dequantization rule generator is operative to calculate the new dequantization rule based on a relation of the energy measure of the channel or the pair of channels and the energy measure derived from channels of the multi-channel signal.
17. Parameter dequantizer according to claim 12, in which the dequantization rule generator is operative such that the calculation of the dequantization rule comprises a calculation of a scale factor.
18. Parameter dequantizer according to claim 17, in which the dequantization rule generator further comprises a parameter sealer for modifying the parameter using the scale factor.
19. Parameter dequantizer according to claim 17, in which the parameter sealer is operative to modify the parameter such that the modification includes a multiplication of the parameter by the scale factor.
20. Parameter dequantizer according to claim 17, in which the dequantization rule generator further comprises a decompressor for deriving an intermediate quantized parameter from the quantized parameter using the scale factor; and
in which the value dequantizer is operative to derive the parameter from the intermediate quantized parameter using a fixed dequantization rule.
21. Parameter dequantizer according to claim 20, in which the decompressor is operative to derive the intermediate quantized parameter by multiplication of the scale factor and the quantized parameter.
22. Parameter dequantizer according to claim 20, .in which the dequantization rule generator further comprises a rounder to derive an integer valued intermediate quantized parameter from the intermediate quantized parameter; and
in which the value dequantizer is operative to derive the parameter from the integer valued intermediate quantized parameter using a fixed dequantization rule.
23. Parameter dequantizer according to claim 12, in which the quantized parameter is a measure for an energy relation between a combination of a left-front channel and a right-front channel and a combination of a center- channel and a low-frequency-enhancement-channel;
wherein the energy measure is an energy measure for a pair of channels having a first channel combined from the front-left and the front-right channel and having a second channel combined from the center-channel and the low-frequency-enhancement-channel; and wherein the energy measure derived from channels of the multi-channel signal is an energy measure derived from a combination of a back-left and a back-right channel.
24. Parameter dequantizer according to claim 12, in which the quantized parameter is a measure for an energy relation between a back-left and a back-right channel;
wherein the energy measure is an energy measure for a pair of channels having the back-left and the back-right channel; and
wherein the energy measure derived from channels of the multi-channel signal is an energy measure derived from a combination of a left-front, a right-front, a center and a low-frequency-enhancement channel .
25. Parameter dequantizer according to claim 12, in which the quantized parameter is a measure for an energy rela- tion between a front-left and a front-right channel;
wherein the energy measure is a measure for a pair of channels having the front-left and the front-right channel; and
wherein the energy measure derived from channels of the multi-channel signal is an energy measure derived from a combination of a center and a low-frequency-enhancement channel.
26. Parameter dequantizer according to claim 12, in which the quantized parameter is a measure for an energy relation between a combination of left-front and a left-back channel and a combination of a right-front and a right- back channel; wherein the energy measure is an energy measure for a pair of channels having a first channel combined from the left- front and the left-back channel and having a second channel combined from the right-front and the right-back channel; and
wherein the energy measure derived from channels of the multi-channel signal is an energy measure derived from a combination of a center and a low-frequency-enhancement channel .
27. Parameter dequantizer according to claim 12, in which the quantized parameter is a measure for an energy relation between a left-front and a left-back channel; wherein
the energy measure is an energy measure for a pair of channels having the left-front and the left-back channel; and
wherein the energy measure derived from channels of the multi-channel signal is an energy measure derived from a combination of a right-front and a right-back channel.
28. Parameter dequantizer according to claim 12, in which the quantized parameter is a measure for an energy relation between a right-front and a right-back channel; wherein
the energy measure is an energy measure for a pair of channels having the right-front and the right-back channel; and
wherein the energy measure derived from channels of the multi-channel signal is an energy measure derived from a combination of a left-front and a left-back channel.
29. Parameter dequantizer according to claim 12, in which the dequantization rule generator is operative to generate a dequantization rule such that an application of the dequantization rule to the quantized parameter com- prises an assignment of the quantized parameter to a parameter.
30. Parameter dequantizer according to claim 12, further comprising a differential decoder and a Huffman decoder,
wherein the Huffman decoder is operative to derive a Huffman decoded representation of a received Huffman encoded representation; and
wherein the differential decoder is operative to derive the quantized parameter from the Huffman decoded representation.
31. Parameter dequantizer according to claim 12, in which the parameter is a spatial parameter, describing a spatial perception of the multi-channel audio signal, and in which the input parameter is chosen from the following list of parameters:
inter-channel correlation/coherence (ICC) , inter-channel level/intensity difference (ICLD or IID) , inter-channel phase difference (IPD), and inter-channel time difference (ICTD) .
32. Method of quantizing an input parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multichannel signal, the method comprising: generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and
deriving a quantized parameter from the input parameter using the generated quantization rule.
33. Method of dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising:
generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and
deriving the parameter from the quantized parameter us- ing the generated dequantization rule.
34. Representation of a multi-channel signal having a quantized parameter being a quantized representation of a parameter being a measure for a characteristic of a sin- gle channel or a pair of channels, wherein the parameter is a measure for a characteristic of the single channel or the pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, wherein the quantized parameter is derived using a quan- tization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal.
35. Machine-readable storage medium having stored thereon a Representation of a multi-channel signal having a quantized parameter being a quantized representation of a parameter being a measure for a characteristic of a sin- gle channel or a pair of channels, wherein the parameter is a measure for a characteristic of the single channel or the pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, wherein the quantized parameter is derived using a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal.
36. Transmitter or audio recorder having a parameter quantizer for quantizing an input parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi- channel signal, comprising:
a quantization rule generator for generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy meas- ure of the multi-channel signal; and
a value quantizer for deriving a quantized parameter from the input parameter, using the generated quantization rule.
37. Receiver or audio player, having a parameter dequantizer for dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising:
a dequantization rule generator for generating a dequan- tization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and a value dequantizer for deriving the parameter from the quantized parameter, using the generated dequantization rule.
38. Method of transmitting or audio recording, the method comprising a method of quantizing an input parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method comprising:
generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and
deriving a quantized parameter from the input parameter using the generated quantization rule.
39. Method of receiving or audio playing, the method having a method of dequantizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, the method compris- ing:
generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and
deriving the parameter from the quantized parameter using the generated dequantization rule,
40. Transmission system having a transmitter and a receiver,
the transmitter having a parameter quantizer for quantizing an input parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising:
a quantization rule generator for generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and
a value quantizer for deriving a quantized parameter from the input parameter, using the generated quantization rule; and
the receiver having a parameter dequantizer for dequan- tizing a quantized parameter to derive a parameter, wherein the parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of a multi-channel signal, comprising:
a dequantization rule generator for generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and
a value dequantizer for deriving the parameter from the quantized parameter, using the generated de- quantization rule.
41. Method of transmitting and receiving, the method including a transmitting method having a method of quantizing an input parameter, wherein the input parameter is a measure for a characteristic of a single channel or a pair of channels with respect to another single channel or a pair of channels of . a multi-channel signal, the method comprising:
generating a quantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and
deriving a quantized parameter from the input pa- rameter using the generated quantization rule; and
a receiving method having a method of dequantizing a quantized parameter to derive a parameter being a measure for a characteristic of a channel or a pair of chan- nels, wherein the parameter is a measure for a characteristic of the channel or the pair of channels with respect to another channel of a multi-channel signal, the method comprising:
generating a dequantization rule based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal; and
deriving the parameter from the quantized parameter using the generated dequantization rule.
42. Computer program for performing, when running on a computer, a method in accordance with any of method claims 32, 33, 38, 39, or 41.
43. Multi-channel decoder for generating a reconstruction of a multi-channel signal:
a parameter dequantizer according to claim 12; and an up-mixer for up-mixing the reconstruction of the multi-channel signal from a transmitted downmixed signal using parameters dequantized by the parameter dequan- tizer.
44. Multi-channel encoder for generating an encoded representation of a multi-channel signal, comprising:
a parameter quantizer according to claim 1; and
a down-mixer for generating a down-mix signal from the multi-channel signal using parameters quantized by the quantizer, wherein this down-mix signal has fewer channels than the multi-channel signal.
PCT/EP2006/003284 2005-04-19 2006-04-10 Energy dependent quantization for efficient coding of spatial audio parameters WO2006111294A1 (en)

Priority Applications (10)

Application Number Priority Date Filing Date Title
JP2007537308A JP4521032B2 (en) 2005-04-19 2006-04-10 Energy-adaptive quantization for efficient coding of spatial speech parameters
PL06724214T PL1754222T3 (en) 2005-04-19 2006-04-10 Energy dependent quantization for efficient coding of spatial audio parameters
EP06724214A EP1754222B1 (en) 2005-04-19 2006-04-10 Energy dependent quantization for efficient coding of spatial audio parameters
DE602006000239T DE602006000239T2 (en) 2005-04-19 2006-04-10 ENERGY DEPENDENT QUANTIZATION FOR EFFICIENT CODING OF SPATIAL AUDIOPARAMETERS
CN2006800005085A CN1993733B (en) 2005-04-19 2006-04-10 Parameter quantizer and de-quantizer, parameter quantization and de-quantization of spatial audio frequency
BRPI0605857-4A BRPI0605857A (en) 2005-04-19 2006-04-10 energy-dependent quantization for efficient coding of spatial audio parameters
TW095113078A TWI327306B (en) 2005-04-19 2006-04-12 Parameter quantizer for quantizing parameter and method thereof,parameter dequantizer for dequantizing parameter and method thereof,and the application apparatus and method thereof
MYPI20061770A MY141427A (en) 2005-04-19 2006-04-18 Energy dependent quantization for efficient coding of spatial audio parameters
US11/406,631 US8054981B2 (en) 2005-04-19 2006-04-19 Energy dependent quantization for efficient coding of spatial audio parameters
HK07103451A HK1095993A1 (en) 2005-04-19 2007-03-30 Energy dependent quantization for efficient codingof spatial audio parameters

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US67294305P 2005-04-19 2005-04-19
US60/672,943 2005-04-19

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US11/406,631 Continuation US8054981B2 (en) 2005-04-19 2006-04-19 Energy dependent quantization for efficient coding of spatial audio parameters

Publications (1)

Publication Number Publication Date
WO2006111294A1 true WO2006111294A1 (en) 2006-10-26

Family

ID=36581679

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2006/003284 WO2006111294A1 (en) 2005-04-19 2006-04-10 Energy dependent quantization for efficient coding of spatial audio parameters

Country Status (15)

Country Link
US (1) US8054981B2 (en)
EP (1) EP1754222B1 (en)
JP (1) JP4521032B2 (en)
KR (1) KR100878371B1 (en)
CN (1) CN1993733B (en)
AT (1) ATE378675T1 (en)
BR (1) BRPI0605857A (en)
DE (1) DE602006000239T2 (en)
ES (1) ES2297825T3 (en)
HK (1) HK1095993A1 (en)
MY (1) MY141427A (en)
PL (1) PL1754222T3 (en)
RU (1) RU2376655C2 (en)
TW (1) TWI327306B (en)
WO (1) WO2006111294A1 (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1943642A1 (en) * 2005-09-27 2008-07-16 LG Electronics, Inc. Method and apparatus for encoding/decoding multi-channel audio signal
WO2009125046A1 (en) * 2008-04-11 2009-10-15 Nokia Corporation Processing of signals
WO2013179084A1 (en) * 2012-05-29 2013-12-05 Nokia Corporation Stereo audio signal encoder
WO2014161994A2 (en) * 2013-04-05 2014-10-09 Dolby International Ab Advanced quantizer
WO2014210284A1 (en) * 2013-06-27 2014-12-31 Dolby Laboratories Licensing Corporation Bitstream syntax for spatial voice coding
JP2015146641A (en) * 2006-12-07 2015-08-13 エルジー エレクトロニクス インコーポレイティド Method of decoding audio signal, and apparatus therefor
US9319703B2 (en) 2012-10-08 2016-04-19 Qualcomm Incorporated Hypothetical reference decoder parameter syntax structure
US9350970B2 (en) 2012-12-14 2016-05-24 Qualcomm Incorporated Disparity vector derivation
US9672837B2 (en) 2013-09-12 2017-06-06 Dolby International Ab Non-uniform parameter quantization for advanced coupling
US9715880B2 (en) 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
GB2574239A (en) * 2018-05-31 2019-12-04 Nokia Technologies Oy Signalling of spatial audio parameters
GB2595883A (en) * 2020-06-09 2021-12-15 Nokia Technologies Oy Spatial audio parameter encoding and associated decoding

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006104017A1 (en) * 2005-03-25 2006-10-05 Matsushita Electric Industrial Co., Ltd. Sound encoding device and sound encoding method
JP2009532712A (en) * 2006-03-30 2009-09-10 エルジー エレクトロニクス インコーポレイティド Media signal processing method and apparatus
US8352249B2 (en) * 2007-11-01 2013-01-08 Panasonic Corporation Encoding device, decoding device, and method thereof
KR101614160B1 (en) 2008-07-16 2016-04-20 한국전자통신연구원 Apparatus for encoding and decoding multi-object audio supporting post downmix signal
US8352279B2 (en) 2008-09-06 2013-01-08 Huawei Technologies Co., Ltd. Efficient temporal envelope coding approach by prediction between low band signal and high band signal
US20100324915A1 (en) * 2009-06-23 2010-12-23 Electronic And Telecommunications Research Institute Encoding and decoding apparatuses for high quality multi-channel audio codec
KR101646650B1 (en) * 2009-10-15 2016-08-08 오렌지 Optimized low-throughput parametric coding/decoding
JP2012023540A (en) * 2010-07-14 2012-02-02 Asahi Kasei Electronics Co Ltd Multi-bit delta-sigma modulator and ad converter
KR20120038311A (en) * 2010-10-13 2012-04-23 삼성전자주식회사 Apparatus and method for encoding and decoding spatial parameter
PL2740222T3 (en) 2011-08-04 2015-08-31 Dolby Int Ab Improved fm stereo radio receiver by using parametric stereo
EP2702587B1 (en) 2012-04-05 2015-04-01 Huawei Technologies Co., Ltd. Method for inter-channel difference estimation and spatial audio coding device
US9821908B2 (en) * 2013-06-07 2017-11-21 Bell Helicopter Textron Inc. System and method for assisting in rotor speed control
EP3011562A2 (en) * 2013-06-17 2016-04-27 Dolby Laboratories Licensing Corporation Multi-stage quantization of parameter vectors from disparate signal dimensions
JP6235725B2 (en) * 2014-01-13 2017-11-22 ノキア テクノロジーズ オサケユイチア Multi-channel audio signal classifier
US10163446B2 (en) * 2014-10-01 2018-12-25 Dolby International Ab Audio encoder and decoder
UA120372C2 (en) * 2014-10-02 2019-11-25 Долбі Інтернешнл Аб Decoding method and decoder for dialog enhancement
FR3048808A1 (en) * 2016-03-10 2017-09-15 Orange OPTIMIZED ENCODING AND DECODING OF SPATIALIZATION INFORMATION FOR PARAMETRIC CODING AND DECODING OF A MULTICANAL AUDIO SIGNAL
EP3539126B1 (en) * 2016-11-08 2020-09-30 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for downmixing or upmixing a multichannel signal using phase compensation
JP7213364B2 (en) * 2018-10-31 2023-01-26 ノキア テクノロジーズ オーユー Coding of Spatial Audio Parameters and Determination of Corresponding Decoding
GB2582749A (en) * 2019-03-28 2020-10-07 Nokia Technologies Oy Determination of the significance of spatial audio parameters and associated encoding

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004008805A1 (en) * 2002-07-12 2004-01-22 Koninklijke Philips Electronics N.V. Audio coding
WO2004072956A1 (en) * 2003-02-11 2004-08-26 Koninklijke Philips Electronics N.V. Audio coding

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
HU216669B (en) * 1990-09-19 1999-08-30 Koninklijke Philips Electronics N.V. Information carrier with main file and control file, method and apparatus for recording said files, as well as apparatus for reading said files
KR970011727B1 (en) * 1994-11-09 1997-07-14 Daewoo Electronics Co Ltd Apparatus for encoding of the audio signal
SE0202159D0 (en) * 2001-07-10 2002-07-09 Coding Technologies Sweden Ab Efficientand scalable parametric stereo coding for low bitrate applications
AU2002343151A1 (en) * 2001-11-23 2003-06-10 Koninklijke Philips Electronics N.V. Perceptual noise substitution
JP4296753B2 (en) * 2002-05-20 2009-07-15 ソニー株式会社 Acoustic signal encoding method and apparatus, acoustic signal decoding method and apparatus, program, and recording medium
JP2004309921A (en) * 2003-04-09 2004-11-04 Sony Corp Device, method, and program for encoding
US20060281092A1 (en) * 2003-07-24 2006-12-14 Tanja Wille Method for the reverse transcription and/or amplification of nucleic acids

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004008805A1 (en) * 2002-07-12 2004-01-22 Koninklijke Philips Electronics N.V. Audio coding
WO2004072956A1 (en) * 2003-02-11 2004-08-26 Koninklijke Philips Electronics N.V. Audio coding

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
PURNHAGEN H: "Low complexity parametric stereo coding in mpeg-4", PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DIGITAL AUDIO EFFECTS, NAPLES (ITALY), 5 October 2004 (2004-10-05), pages 163 - 168, XP002364489 *

Cited By (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1943642A1 (en) * 2005-09-27 2008-07-16 LG Electronics, Inc. Method and apparatus for encoding/decoding multi-channel audio signal
EP1943642A4 (en) * 2005-09-27 2009-07-01 Lg Electronics Inc Method and apparatus for encoding/decoding multi-channel audio signal
JP2015146641A (en) * 2006-12-07 2015-08-13 エルジー エレクトロニクス インコーポレイティド Method of decoding audio signal, and apparatus therefor
WO2009125046A1 (en) * 2008-04-11 2009-10-15 Nokia Corporation Processing of signals
CN104509130B (en) * 2012-05-29 2017-03-29 诺基亚技术有限公司 Stereo audio signal encoder
US9799339B2 (en) 2012-05-29 2017-10-24 Nokia Technologies Oy Stereo audio signal encoder
CN104509130A (en) * 2012-05-29 2015-04-08 诺基亚公司 Stereo audio signal encoder
WO2013179084A1 (en) * 2012-05-29 2013-12-05 Nokia Corporation Stereo audio signal encoder
EP2856776A4 (en) * 2012-05-29 2016-02-17 Nokia Technologies Oy Stereo audio signal encoder
US9380317B2 (en) 2012-10-08 2016-06-28 Qualcomm Incorporated Identification of operation points applicable to nested SEI message in video coding
US9319703B2 (en) 2012-10-08 2016-04-19 Qualcomm Incorporated Hypothetical reference decoder parameter syntax structure
US9544566B2 (en) 2012-12-14 2017-01-10 Qualcomm Incorporated Disparity vector derivation
US9350970B2 (en) 2012-12-14 2016-05-24 Qualcomm Incorporated Disparity vector derivation
US9715880B2 (en) 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
US10643626B2 (en) 2013-02-21 2020-05-05 Dolby International Ab Methods for parametric multi-channel encoding
US11817108B2 (en) 2013-02-21 2023-11-14 Dolby International Ab Methods for parametric multi-channel encoding
US11488611B2 (en) 2013-02-21 2022-11-01 Dolby International Ab Methods for parametric multi-channel encoding
US10930291B2 (en) 2013-02-21 2021-02-23 Dolby International Ab Methods for parametric multi-channel encoding
US10360919B2 (en) 2013-02-21 2019-07-23 Dolby International Ab Methods for parametric multi-channel encoding
US10311884B2 (en) 2013-04-05 2019-06-04 Dolby International Ab Advanced quantizer
WO2014161994A3 (en) * 2013-04-05 2014-11-27 Dolby International Ab Advanced quantizer
WO2014161994A2 (en) * 2013-04-05 2014-10-09 Dolby International Ab Advanced quantizer
RU2640722C2 (en) * 2013-04-05 2018-01-11 Долби Интернешнл Аб Improved quantizer
US9940942B2 (en) 2013-04-05 2018-04-10 Dolby International Ab Advanced quantizer
CN105144288A (en) * 2013-04-05 2015-12-09 杜比国际公司 Advanced quantizer
KR101754094B1 (en) 2013-04-05 2017-07-05 돌비 인터네셔널 에이비 Advanced quantizer
KR20190097312A (en) * 2013-04-05 2019-08-20 돌비 인터네셔널 에이비 Advanced quantizer
KR20170078869A (en) * 2013-04-05 2017-07-07 돌비 인터네셔널 에이비 Advanced quantizer
KR102069493B1 (en) 2013-04-05 2020-01-28 돌비 인터네셔널 에이비 Advanced quantizer
KR102072365B1 (en) 2013-04-05 2020-02-03 돌비 인터네셔널 에이비 Advanced quantizer
US9530422B2 (en) 2013-06-27 2016-12-27 Dolby Laboratories Licensing Corporation Bitstream syntax for spatial voice coding
WO2014210284A1 (en) * 2013-06-27 2014-12-31 Dolby Laboratories Licensing Corporation Bitstream syntax for spatial voice coding
US9672837B2 (en) 2013-09-12 2017-06-06 Dolby International Ab Non-uniform parameter quantization for advanced coupling
US10694424B2 (en) 2013-09-12 2020-06-23 Dolby International Ab Non-uniform parameter quantization for advanced coupling
US11297533B2 (en) 2013-09-12 2022-04-05 Dolby International Ab Method and apparatus for audio decoding based on dequantization of quantized parameters
US10383003B2 (en) 2013-09-12 2019-08-13 Dolby International Ab Non-uniform parameter quantization for advanced coupling
US10057808B2 (en) 2013-09-12 2018-08-21 Dolby International Ab Non-uniform parameter quantization for advanced coupling
US11838798B2 (en) 2013-09-12 2023-12-05 Dolby International Ab Method and apparatus for audio decoding based on dequantization of quantized parameters
GB2574239A (en) * 2018-05-31 2019-12-04 Nokia Technologies Oy Signalling of spatial audio parameters
GB2595883A (en) * 2020-06-09 2021-12-15 Nokia Technologies Oy Spatial audio parameter encoding and associated decoding

Also Published As

Publication number Publication date
CN1993733A (en) 2007-07-04
KR100878371B1 (en) 2009-01-15
HK1095993A1 (en) 2007-05-25
TWI327306B (en) 2010-07-11
EP1754222A1 (en) 2007-02-21
ATE378675T1 (en) 2007-11-15
TW200703238A (en) 2007-01-16
DE602006000239D1 (en) 2007-12-27
KR20070062502A (en) 2007-06-15
US8054981B2 (en) 2011-11-08
CN1993733B (en) 2010-12-08
MY141427A (en) 2010-04-30
ES2297825T3 (en) 2008-05-01
JP4521032B2 (en) 2010-08-11
US20070016416A1 (en) 2007-01-18
BRPI0605857A (en) 2007-12-18
RU2007106874A (en) 2008-08-27
PL1754222T3 (en) 2008-04-30
DE602006000239T2 (en) 2008-09-18
JP2008517339A (en) 2008-05-22
RU2376655C2 (en) 2009-12-20
EP1754222B1 (en) 2007-11-14

Similar Documents

Publication Publication Date Title
EP1754222B1 (en) Energy dependent quantization for efficient coding of spatial audio parameters
US8654985B2 (en) Stereo compatible multi-channel audio coding
US8553895B2 (en) Device and method for generating an encoded stereo signal of an audio piece or audio datastream
US9565509B2 (en) Enhanced coding and parameter representation of multichannel downmixed object coding
KR100913987B1 (en) Multi-channel synthesizer and method for generating a multi-channel output signal
EP2028648B1 (en) Multi-channel audio encoding and decoding
US8081764B2 (en) Audio decoder
CN103765509B (en) Code device and method, decoding device and method
CN110890101B (en) Method and apparatus for decoding based on speech enhancement metadata
US20090112606A1 (en) Channel extension coding for multi-channel source
EP2261897A1 (en) Quantization and inverse quantization for audio
EP1934973A1 (en) Temporal and spatial shaping of multi-channel audio signals
US20070271095A1 (en) Audio Encoder
CA2583146A1 (en) Diffuse sound envelope shaping for binaural cue coding schemes and the like
EP1706865A1 (en) Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
RU2628195C2 (en) Decoder and method of parametric generalized concept of the spatial coding of digital audio objects for multi-channel mixing decreasing cases/step-up mixing
IL184340A (en) Compact side information for parametric coding of spatial audio

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 11406631

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2006724214

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 3658/KOLNP/2006

Country of ref document: IN

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 200680000508.5

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 11406631

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 1020077003513

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2006724214

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2007106874

Country of ref document: RU

WWE Wipo information: entry into national phase

Ref document number: 2007537308

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Ref document number: DE

WWG Wipo information: grant in national office

Ref document number: 2006724214

Country of ref document: EP

ENP Entry into the national phase

Ref document number: PI0605857

Country of ref document: BR