ES2314706T3 - Method and device for generating multichannel signal or set of parameter data. - Google Patents

Method and device for generating multichannel signal or set of parameter data. Download PDF

Info

Publication number
ES2314706T3
ES2314706T3 ES05782843T ES05782843T ES2314706T3 ES 2314706 T3 ES2314706 T3 ES 2314706T3 ES 05782843 T ES05782843 T ES 05782843T ES 05782843 T ES05782843 T ES 05782843T ES 2314706 T3 ES2314706 T3 ES 2314706T3
Authority
ES
Spain
Prior art keywords
data
configuration
parameter
indication
meaning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
ES05782843T
Other languages
Spanish (es)
Inventor
Christian Ertel
Stefan Geyersberger
Jurgen Herre
Johannes Hilpert
Ralph Sperschneider
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Original Assignee
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to DE102004043521A priority Critical patent/DE102004043521A1/en
Priority to DE102004043521 priority
Application filed by Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV filed Critical Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Application granted granted Critical
Publication of ES2314706T3 publication Critical patent/ES2314706T3/en
Application status is Active legal-status Critical
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing

Abstract

Device for generating a multi-channel signal using input data that includes transmission channel data representing M transmission channels and parameter data to obtain K output channels, in which the M transmission channels and the parameter data together represent N original channels, where M is less than N and greater than or equal to 1 and where K is greater than M, in which the input data comprises a parameter configuration indication (41), comprising: reconstruction means (24) multichannel designed to generate the K output channels from the transmission channel data and the parameter data; and configuration means (26) for configuring the multi-channel reconstruction means, in which the configuration means are designed to read the input data to interpret (30) the parameter configuration indication, when the parameter configuration indication has a first meaning, extract (31) configuration information contained in the input data and make (34) a configuration adjustment of the multi-channel reconstruction means, and when the parameter configuration indication has a second meaning that differs from the first meaning , configure (34) the multichannel reconstruction means using information about an encoding algorithm (23) with which the data of the transmission channel has been decoded from an encoded version thereof in such a way that the configuration setting of The multi-channel reconstruction media is identical to a configuration setting of the lgorithm (23) coding or depends on a configuration setting of the coding algorithm (23).

Description

Method and device to generate signal multichannel or parameter data set.

The present invention relates to techniques of parametric multichannel processing and in particular to encoders / decoders to generate and / or read a syntax flexible data and to associate parameter data with the data of the downmix and / or transmission channels.

In addition to the two stereo channels, a recommended multichannel surround representation includes a center channel C and two surround channels, that is, the left surround channel Ls and the right surround channel Rs, and additionally, if applicable, a subwoofer channel ( subwoofer ) also referred to as the LFE channel (LFE = Low Frequency Enhancement ). This reference sound format is also referred to as stereo 3/2 (plus LFE) and recently also as 5.1 multichannel, which means there are three front channels and two surround channels. In general, 5 or 6 transmission channels are required. In a playback environment, at least five speakers are required in the respective five different positions to obtain an optimum called the optimum sound point at a certain distance from the five correctly placed speakers. However, with respect to its placement, the subwoofer can be used relatively freely.

There are several techniques to reduce the amount of data required to transmit a multichannel audio signal. Such techniques are also called joint stereo techniques. For this purpose, reference is made to Figure 5. Figure 5 shows a joint stereo device 60. This device can be a device that implements, for example, the stereo intensity technique (IS technique, Intensity Stereo ) or the binaural indication coding technique (BCC technique, Binaural Cue Coding ). Such a device generally receives at least two channels (CH1, CH2, ... CHn) as input signals and emits at least a single carrier channel (downmix) and parametric data, ie one or more sets of parameters. The parametric data is defined in such a way that an approximation of each original channel (CH1, CH2, ... CHn) can be calculated in a decoder.

Normally, the carrier channel will include Subband samples, spectral coefficients or domain samples  of time, etc., which provide a representation comparatively accurate of the underlying signal, while the Parametric data and / or parameter sets do not include any sample or spectral coefficient of this type. Instead, the Parametric data includes control parameters to control a determined reconstruction algorithm, such as weighting by multiplication, displacement in time, displacement in frequency, ... Thus, parametric data includes only one comparatively sufficient representation of the signal or channel associated. Expressed in numbers, the amount of data required by a carrier channel (which are compressed, that is they are encoded by AAC medium, for example) is in the range of 60 to 70 kbit / s, while the amount of data required by the information Parametric side is of the order of 1.5 kbit / s for a channel. A example for parametric data are the scale factors known, stereo intensity information or parameters Binaural indication, as will be described later.

The stereo intensity coding technique is described in the preliminary document AES 3799 entitled " Intensity stereo coding " J. Herre, KH Brandenburg, D. Lederer, February 1994, Amsterdam. In general, the concept of stereo intensity is based on a main axis transform that will be applied to data from the two stereo audio channels. If most of the data points are placed around the first main axis, a coding gain can be achieved by rotating both signals a certain angle before coding. However, this does not always apply to real stereo reproduction techniques. The reconstructed signals for the left and right channels consist of weighted or scaled versions differently from the same transmitted signal. However, the reconstructed signals differ in amplitude, but are identical with respect to their phase information. The energy time envelopes of both original audio channels, however, are maintained by means of a selective scale adjustment operation that normally operates selectively in frequency. This corresponds to the perception of human sound at high frequencies where the dominant spatial indications are determined by the energy envelopes.

In addition, in practical implementations, the signal transmitted, that is, the carrier channel, is formed by the sum signal of the left channel and the right channel instead of rotate both components. In addition, this processing, that is, the generation of stereo intensity parameters to perform the scaling operation, is performed selectively in frequency, that is, independently of each other for each band of scale factor, that is, for each frequency division of encoder Preferably, both channels are combined to form a combined or "carrier" channel. In addition to the channel combined, the stereo intensity information is determined that depends on the energy of the first channel, the energy of the second channel and the energy of the combined or sum channel.

The BCC technique is described in the article of the AES 5574 convention entitled " Binaural cue coding applied to stereo and multi-channel audio compression ", C. Faller, F. Baumgarte, May 2002, Munich. In BCC encoding, several audio input channels are converted to a spectral representation using a DFT-based transform with overlapping windows. The resulting spectrum is divided into non-overlapping divisions. Each division has a bandwidth proportional to an equivalent right angle bandwidth (ERB, Equivalent Right-Angled Bandwidth ). The so-called level differences between channels (ICLD, Inter-Channel Level Difference ) are calculated as well as the so-called time differences between channels (ICTD, Inter-Channel Time Difference ) for each division, that is, for each band and for each frame k, that is, a block of time samples. The ICLD and ICDT parameters are quantified and encoded to obtain a BCC bit stream. Level differences between channels and time differences between channels are given for each channel with respect to a reference channel. In particular, the parameters are calculated according to predetermined formulas depending on the particular divisions of the signal to be processed.

On the decoder side, the decoder receives a mono signal and the BCC bit stream, that is, a first set of parameters for time differences between channels and a second set of parameters for level differences between channels per frame. The mono signal is transformed to the domain of the frequency and is introduced in a synthesis block that also receives decoded ICLD and ICTD values. In the synthesis block or reconstruction block, the BCC parameters (ICLD and ICTD) are used to perform a signal weighting operation monkey to rebuild the multichannel signal, which then, after a frequency / time conversion, represents a reconstruction of the original multichannel audio signal.

In the case of BCC, the stereo module 60 set operates to issue the lateral channel information, of such so that the parametric channel data are ICDL parameters and Quantified and encoded ICTD, where one of the channels originals can be used as a reference channel for channel side information coding. Normally the channel Carrier is formed by the sum of the original channels participants.

Of course, the prior art only provides a mono representation for a decoder that only it can decode the carrier channel, but it cannot generate Parameter data to generate one or more approximations of More than one input channel.

The audio coding technique referred to as the BCC technique is further described in US patent applications US 2003/0219130 A1, 2003/0026441 A1 and 2003/0035553 A1 and is used in European patent application EP 1 414 273 A1. In addition, see additionally " Binaural Cue Coding. Part. II: Schemes and Applications ", C. Faller and F. Baumgarte, IEEE: Transactions on Audio and Speech Proc., Volume 11, number 6, November 1993. In addition, see also C. Faller and F. Baumgarte " Binaural Cue Coding applied to Stereo and Multi-Channel Audio compression ", preliminary document, 112th Convention of the Audio Engineering Society (AES), May 2002, and J. Herre, C. Faller, C Ertel, J. Hilpert, A. Hoelzer, C. Spenger, " MP3 Surround: Efficient and Compatible Coding of Multi-Channel Audio ", 116th AES Convention, Berlin, 2004, preliminary document 6049. A BCC scheme will be represented below. Typical general for multichannel audio coding in more detail with respect to Figures 6 to 8. Figure 6 shows a general BCC coding scheme for coding / transmission of multichannel audio signals. The multichannel audio input signal is input to an input 110 of a 112 BCC encoder and is performed on the same "downmix" in a so-called downmix block 114, that is, converted to a single sum channel. In the present example, the signal at input 110 is a 5-channel surround signal that has a front left channel and a front right channel, a left surround channel and a right surround channel and a central channel. Normally, the downmix block generates a sum signal by simply adding these five channels into a mono signal. Other downmix schemes are known in the art, all resulting in the generation, using a multi-channel input signal, of a downmix signal that has a single channel or that has a number of downmix channels which, in any case, is less than the number of original input channels. In the present example, a downmix operation would already be achieved if four carrier channels were generated from the five input channels. The only output channel and / or the number of output channels is broadcast on a sum signal line 115.

The lateral information obtained by a block 116 of BCC analysis is emitted on a line 117 of lateral information. In the BCC analysis block, level differences between channels (ICDL), time differences between channels (ICTD) or correlation values between channels (ICC values, Inter-Channel Correlation ) can be calculated. Thus, there are three different sets of parameters, namely the differences in level between channels (ICLD), time differences between channels (ICTD) and correlation values between channels (ICC), for reconstruction in block 122 of BCC synthesis.

The sum signal and lateral information with the parameter sets are normally transmitted to a 120 BCC decoder in a quantified and encoded format. He BCC decoder divides the transmitted sum signal (and decoded, in the case of an encrypted transmission) in a number of subbands and makes adjustments to scale, delays and additional processing to generate the subbands of the various channels to be rebuilt. This processing is done in such that the ICLD, ICTD and ICC parameters (indications) of a Multichannel signal reconstructed at output 121 are similar to respective indications for the original multichannel signal in the input 110 to the 112 BCC encoder. For this purpose, the decoder 120 BCC includes a block 122 of BCC synthesis and a block 123 for processing side information.

The following will illustrate the internal structure of the BCC synthesis block 122 with respect to figure 7. The signal of sum over line 115 is entered into a conversion block of time / frequency normally implemented as a bank 125 of FB filters. At the output of block 125, there is a number N of signals of subband or, in an extreme case, a block of coefficients spectral, if the audio filter bank 125 performs a transform that generates N spectral coefficients from N Time domain samples.

BCC synthesis block 122 also includes a delay phase 126, a level modification phase 127, a phase 128 correlation processing and a phase 129 IFB that represents a reverse filter bank. At the exit of the phase 129, the reconstructed multichannel audio signal that has, for example, five channels in the case of 5 surround system channels can be broadcast on a set of speakers 124, as illustrated in figure 6.

Figure 7 further illustrates that the signal of input s (n) is converted to the frequency domain or filter bank domain by means of element 125. The signal emitted by element 125 is multiplied, so that it they get several versions of the same signal, as indicated by the node 130. The number of versions of the original signal is equal to number of output channels in the output signal that is going to rebuild If each version of the original signal is subjected to a determined delay, d_ {1}, d_ {2}, ... d_ {i}, d_ {N} in the node 130, the result is the situation at the exit of the blocks 126, which includes the versions of the same signal, but with Different delays The delay parameters are calculated by the block 123 for processing lateral information in Figure 6 and they derive from time differences between channels such as were determined by block 116 of BCC analysis.

The same applies to the parameters of multiplication a_ {1}, a_ {2}, ... a_ {i}, a_ {N}, which also they are calculated by information processing block 123 lateral based on level differences between channels determined by block 116 of BCC analysis.

The ICC parameters are calculated by block 116 BCC analysis and are used to control the functionality of the block 128, such that correlation values are obtained determined between delayed and manipulated signals in level in the output of block 128. It should be noted that the order of the phases 126, 127, 128 may be different from that shown in the figure 7.

It should also be noted that, in a processing by blocks of the audio signal, the BCC analysis is also Performs in blocks. In addition, the BCC analysis is also performed by frequency, that is, selectively in frequency. This means that, for each spectral band, there is an ICLD parameter, a ICTD parameter and an ICC parameter for each block. Parameters ICTD for at least one block for at least one channel through all bands therefore represent the set of parameters ICTD. The same applies to the ICLD parameter set that represents all ICLD parameters for at least one block for all frequency bands for rebuilding at least an output channel The same applies, in turn, to the set of ICC parameters that again includes several ICC parameters individual for at least one block for various bands for the rebuild at least one output channel based on the channel input or sum channel.

Next, reference is made to Figure 8 which shows a situation from which the BCC parameters determination. Normally the parameters ICLD, ICTD and ICC can be defined between any pair of channels. Normally, a determination of the ICLD parameters is performed ICTD between a reference channel and each of the other channels input, such that there is a set of parameters different for each of the input channels except the channel of reference. This is also illustrated in Figure 8A.

However, ICC parameters can be defined. differently In general, ICC parameters can be generated in the encoder between any pair of channels, as well as schematically illustrated in figure 8B. In this case, a decoder would perform an ICC synthesis, so that it you get approximately the same result that was present in the original signal between any pair of channels. However, it has there was the suggestion of calculating only ICC parameters between two more intense channels at any time, that is, for each time frame. This scheme is represented in Figure 8C, which shows an example in which, in a moment, it is calculated and transmitted an ICC parameter between channels 1 and 2 and in which, in another moment, an ICC parameter is calculated between channels 1 and 5. Then, the decoder synthesizes the correlation between channels between the two more intense channels in the decoder and also executes normally heuristic rules to synthesize coherence between channels for the remaining channel pairs.

With respect to the calculation of, for example, the multiplication parameters a_ {1}, ... a_ {N} based on the transmitted ICLD parameters, reference is made to the article of the AES convention cited 5574. The ICLD parameters represent a Power distribution in an original multichannel signal. Without loss of generality, Figure 8A shows that there are four ICLD parameters that represent the energy difference between all other channels and the front left channel. In block 123 of lateral information processing, the multiplication parameters
a_ {1}, ... a_ {N} are derived from the ICLD parameters, such that the total energy of all reconstructed output channels is the same energy as that present for the transmitted sum signal or is at least proportional to this energy. One way to determine these parameters is a two-phase process in which, in a first phase, the multiplication factor for the left front channel is set to one, while the multiplication factors for the other channels in Figure 8C are adjust to the transmitted ICLD values. Then, in a second phase, the energy of all five channels is calculated and compared with the energy of the transmitted sum signal. Then, all channels are scaled down, namely, using a scale factor that is the same for all channels, where the scale factor is selected such that the total energy of all output channels rebuilt afterwards The scale adjustment is equal to the total energy of the transmitted sum signal and / or the transmitted sum signals.

Regarding the measure of coherence between ICC channels transmitted from the BCC encoder to the decoder BCC as an additional set of parameters, it should be noted that consistency manipulation could be done by modification of multiplication factors, such as multiplying the weighting factors of all subbands by random numbers that have values between 20 log 10 - 6 and 20 log 10 6. The pseudorandom sequence is selected normally in such a way that the variance for all bands criticism is approximately equal and that the average value within Each critical band is zero. The same sequence is used for spectral coefficients of each different frame or block. So, at Audio scene width is controlled by modifications of the variances of the pseudorandom sequence. A larger variance generates a wider listening width. Variance modification can be performed on individual bands that have a band width review. This allows the simultaneous existence of several objects in an audition scene, where each object has a width of different hearing A distribution of adequate amplitude for the pseudorandom sequence is a uniform distribution on a scale logarithmic, as represented in the patent publication American 2002/0219130 A1.

In order to transmit the five channels in a compatible manner, for example, in a bit stream format that is also suitable for a normal stereo decoder, the so-called matrix technique described in " MUSICAM Surround: A universal multi- use" can be used. channel coding system compatible with ISO / IEC 11172-3 ", G. Theile and G. Stoll, preliminary AES document, October 1992, San Francisco.

In addition, see the multichannel coding techniques described in the publication " Improved MPEG 2 Audio multi-channel encoding ", B. Grill, J. Herre, KH Brandenburg, E. Eberlein, J. Koller, J. Miller, preliminary document AES 3865 , February 1994, Amsterdam, where a compatibility matrix is used to obtain the downmix channels from the original input channels.

In summary, it can be said that the BCC technique allows efficient and backward compatible coding of multichannel audio material, as also described, for example in the specialist publication by E. Shuijer, J. Breebaart, H. Purnhagen, J Engdeg å rd entitled " Low-Complexity Parametric Stereo Coding ", 119th AES Convention, Berlin, 2004, preliminary document 6073. In this context, I should also mention the MPEG-4 standard and particularly the expansion of parametric audio techniques, where This part of the standard is also known by the designation ISO / IEC 14496-3: 2001 / FDAM 2 (parametric audio). In this regard, the syntax in table 8.9 of the MPEG-4 standard entitled "ps_data () syntax" should be mentioned in particular. In this example, the "enable_icc" and "enable_ipdopd" syntax elements must be mentioned, where these syntax elements are used to activate and deactivate a transmission of an ICC parameter and a phase corresponding to time differences between channels. The syntax elements "icc_data ()""ipd_data()" and "opd_data ()" must also be mentioned.

In summary, it should be noted in general that such multichannel parametric techniques are used using one or several transmitted carrier channels, where M channels are formed transmitted from N original channels to rebuild from again the N output channels or a K number of output channels, where K is less than or equal to the number of original channels N.

As can be seen from Figure 6, the BCC analysis is a typical separate preprocessing to generate parameter data for one part and one or more transmission channels (downmix channels) on the other hand from a signal multichannel that has N original channels. Normally these downmix channels are then compressed for example by means of a Typical stereo / mono MP3 or AAC encoder, although this is not shown  in figure 6, such that, on the output side, there is a bit stream representing the transmission channel data in compressed form and there is another additional bit stream that represents Parameter data The BCC analysis thus occurs separately from the actual audio coding of the downmix channels and / or the sum signal 115 of figure 6.

The decoder side is similar. A decoder that has multichannel capability will first decode the bit stream that includes the compressed downmix signal depending on the encoding algorithm used and again provide one or more transmission channels on the output side, that is, usually as a sequence in the PCM data time (PCM = Pulse Code Modulation ). Then, the BCC synthesis will take place as a separate separate and isolated postprocessing that signals in a self-sufficient way with the flow of parameter data and data is provided to generate, on the output side, several output channels, preferably equal to the number of the original input channels of the audio decoded downmix signal.

Thus, it is an advantage of BCC technology that has a different filter bank for analysis purposes BCC and a different filter bank for the purposes of the BCC synthesis, for example, in such a way that it is separated from Filter bank of the audio encoder / decoder, in order of not having to make any intermediate solution regarding to audio compression on the one hand and reconstruction multichannel on the other hand. Generally speaking, the Audio compression is therefore performed separately from the multichannel parameter processing to be equipped optimally for both fields of application.

However, this concept has the disadvantage that a complete signaling has to be transmitted both for Multi-channel reconstruction as for audio decoding. This is particularly disadvantageous when, as will normally be the case, both the audio decoder and the means of multichannel reconstruction perform the same or similar stages and therefore require equal and / or mutual configuration settings Dependents Due to the completely separate concept, the data signaling are therefore transmitted twice giving as resulted in an artificial "expansion" of the amount of data, which is ultimately due to the fact that the separate concept between audio coding / decoding and multichannel analysis / synthesis.

On the other hand, a complete "link" of the multi-channel reconstruction with audio decoding would considerably restrict flexibility, because in that case the really important objective of the separation of both stages processing can perform each stage of processing An optimal way would have to be abandoned. Thus, losses would arise of considerable quality, particularly in the case of several phases successive coding / decoding, also referred to as "tandem" coding. If there is a full link of the BCC data with encoded audio data, has to be performed a multi-channel reconstruction with each decoding to Perform a multichannel synthesis again when recoded. Since it is the nature of each parametric technique that has losses, losses will be accumulated by synthesis analyzes of repeated analyzes in such a way that, with each phase of Encoder / decoder, the perceptible signal quality of Audio decreases further.

In this case, decoding / encoding of audio data without simultaneous analysis / synthesis processing of the parameter data would only be possible if each audio codec in the tandem chain worked identically, that is, had the same rate of sampling, block length, feed length, window size, transformed ... that is to say, in general, had the same configuration, and if, in addition, the respective block boundaries were also maintained. However, such a concept would considerably restrict the flexibility of the whole concept. Particularly with regard to the fact that parametric multichannel techniques are intended to complement existing stereo data, for example by means of additional parameter data, this limitation is of all the most painful. Since existing stereo data can originate from many different encoders that use all different block lengths or that do not even operate in the frequency domain, but in the time domain, etc., such limitation would take the concept of Supplementation later ad absurdum from the beginning.

It is the object of the present invention provide a flexible and efficient concept to generate a signal from  multichannel audio or a set of parameter data from reconstruction.

This object is achieved by a device to generate a multichannel signal according to claim 1, a method for generating a multichannel signal according to claim 14, a device to generate an output of parameter data according to claim 15, a method for generating data output from parameters according to claim 18, a device for generating an output of parameter data according to claim 19, a method to generate an output of parameter data according to the claim 20, or a computer program product according to the claim 21.

The present invention is based on the discovery that efficacy can be obtained on the one hand and flexibility on the other hand having the data flow, which can include transmission channel data and parameter data, contain a parameter setting indication that has been inserted on the encoder side and evaluated on the side of the decoder This indication indicates whether the means of Multi-channel reconstruction are configured from the data input, that is, from the data transmitted from the encoder to decoder, or if reconstruction means Multichannel are configured by an indication for an algorithm encoding with which channel data has been decoded from coded transmission The means of multi-channel reconstruction they have a configuration setting identical to a setting of audio decoder settings to decode the transmission channel data encoded or at least dependent on this setting.

If a detector detects the first situation, it is say the parameter setting indication has a first meaning, the decoder will look for configuration information additional input data received, to configure appropriately the multi-channel reconstruction means, to use the information then to make an adjustment of Multi-channel reconstruction media configuration. Such adjustment configuration could be, for example, block length, feedrate, sample rate, bank control data filters, the so-called granular information (how many BCC blocks are there in a frame), channel configurations (for example a output 5.1 whenever there is "mp3"), information on what data of parameters are mandatory in a case adjusted to scale (for example, ICLD) and which not (ICTD), etc.

However, if the decoder determines that the parameter setting indication has a second different meaning of the first meaning, the means of multichannel reconstruction will choose the configuration setting in multichannel reconstruction media depending on the information about the audio coding algorithm in which the coding / decoding of the channel data of transmission, that is, downmix channels.

In contrast to the concept separated from the Parameter data on the one hand and compressed downmix data  on the other hand, the device of the invention to generate a Multichannel audio signal commits a "steal", so to speak, for the configuration of multichannel reconstruction media, in the audio data really separated completely and self-sufficient and / or in an upstream audio decoder that it operates in a self-sufficient way, to configure itself.

The concept of the invention is particularly potent in a preferred embodiment of the present invention when different audio coding algorithms are considered. In this case, a large amount of explicit signaling information to get a synchronous operation, that is, an operation in which Multi-channel reconstruction media operate synchronously with the audio decoder, namely feed lengths corresponding, etc., for each coding algorithm different, such that the reconstruction algorithm Really independent multichannel works synchronously with The audio decoding algorithm.

According to the invention, the indication of parameter configuration, for which a single bit is sufficient,  signals to a decoder that, for the purpose of its configuration, it looks for what audio encoder is waters down. Following this, the decoder will receive information on which audio encoder is currently waters above a number of different audio encoders. When You have received this information, you will preferably enter a table configuration deposited in the multichannel decoder with this audio coding algorithm identification for retrieve the predefined configuration information there for each one of the possible audio coding algorithms for make at least one configuration setting of the media multi-channel reconstruction This achieves data rate savings. significant compared to the case in which the configuration it is explicitly signaled in the data stream, in which there is no therefore consideration among the means of reconstruction multichannel and audio decoder, in which there is no "theft" of the invention of audio decoder data neither by means of multi-channel reconstruction.

On the other hand, the concept of the invention still provides the high flexibility inherent in signaling  explicit configuration information, because, due to the parameter setting indication, for which it is enough a single bit in the data stream, there is the possibility of actually transmitting all the configuration information in the data flow, if necessary or, as a mixed form, of transmit at least part of the configuration information of parameters in the data flow and take another piece of information necessary for a certain set of information.

In a preferred embodiment of the present invention, the data transmitted from the encoder to decoder also include an indication signaling of then to a decoder if the settings of configuration at all compared to the settings of configuration already existing or previously signaled, or if you must continue as before, or if, in reaction to a certain adjustment of the continuation indication, the configuration indication of parameters are read to determine if there should be an alignment of the means of multi-channel reconstruction with respect to audio decoder, or if information at least partially explicit regarding the configuration is contained in the transmission data

Embodiments will be explained below. Preferences of the present invention in more detail with respect to the attached drawings, in which:

Figure 1 is a circuit diagram of blocks of a device of the invention to generate a set of parameter data that can be used on the side of the encoder;

Figure 2 is a circuit diagram of blocks of a device to generate a multichannel audio signal  used on the decoder side;

Figure 3 is a main flow chart of the operation of the configuration means of figure 2 in a preferred embodiment of the present invention;

Figure 4a is a schematic representation of data flows for synchronous operation between the audio decoder and reconstruction media multichannel;

Figure 4b is a schematic representation. of data flows for asynchronous operation between the audio decoder and reconstruction media multichannel;

Figure 4c is a preferred embodiment of the device to generate a multichannel audio signal in the form of syntax;

Figure 5 is a general representation of a multi-channel encoder;

Figure 6 is a block diagram schematic of a BCC encoder / decoder path BCC;

Figure 7 is a circuit diagram of blocks of the BCC synthesis block of Figure 6; Y

Figures 8A to 8C are a representation of Typical scenarios for calculating parameter sets ICLD, ICTD and ICC.

\ vskip1.000000 \ baselineskip

Figure 1 shows a circuit diagram of blocks of a device of the invention to generate a set of parameter data, where the parameter data set it can be output on an output 10 of the device shown in the Figure 1. The parameter data set contains data from parameters that, together with the transmission channel data, do not illustrated in Figure 1, but will be discussed later, represent N original channels, where the channel data of transmission will normally include M transmission channels, where the number M of transmission channels is less than the number M of original channels and is greater than or equal to 1.

The device shown in Figure 1, which is will host on the encoder side, include 11 means of parameters  multichannel designed to perform, for example, a BCC analysis in a stereo intensity analysis or the like. In this case, the 11 channels of multichannel parameters will receive N original channels in an entry 12. Alternatively, however, means 11 of Multichannel parameters can also be designed as means of transcoder to generate the parameter data at the output of media 11 using raw parameter data existing ones fed into an input 13 of raw parameters. If the parameter data is simple BCC data as provide by any means of BCC analysis, the processing of means 11 of multichannel parameters will simply consist of a function of copying data from input 13 into an output of means 11. However, means 11 of parameters Multichannel can also be designed to change syntax of the raw parameter data flow to add, by example signaling data or to write sets of parameters that can be decoded or skipped at least partially independent of each other from the parameter data without process existing.

The device shown in Figure 1 includes also signaling means 14 for determining and associating a PKH parameter setting indication with data from parameters at the output of the media 11. In particular, the media signaling are designed to determine the indication of parameter settings, so that you have a first meaning when the configuration information contained in the parameter data set will be used for a multi-channel reconstruction Alternatively, means 14 of signaling will determine the configuration indication of parameters, in such a way that it has a second meaning when configuration data that is based on an algorithm of coding to be used and / or used to encode the transmission channel data will be used for a multi-channel reconstruction

Finally, the device of the invention of the Figure 1 includes means 15 for writing configuration data designed to associate configuration information with data from parameters and the parameter setting indication for finally get the parameter data set at the output 10. Thus, the parameter data set 10 includes the data of media parameters 11 multi-channel parameters, the indication of parameter configuration PKH of signaling means 14 and, if applicable, configuration data of the media 15 of write configuration data. In the data set of parameters, these elements of the data set are arranged according to a given syntax and usually multiplex in time, as symbolically represented by an element called generally as combination means 16 in Figure 1.

In a preferred embodiment of the present invention, the signaling means 14 are coupled to the means 15 writing configuration data through line 17 of control to activate the data writing means 15 of configuration only when the configuration indication of parameters has the first meaning, that is, when, in a multichannel reconstruction, no information of configuration present in the decoder in any way but when there is explicit signaling, that is, when it is present additional configuration information in the data set of parameters In the other case, in which the indication of Parameter configuration has the second meaning, the 15 means of writing configuration data are not activated for enter data in the parameter data set in the output 10, because such data would not be read by a decoder and / or would not be required by the decoder, as will be commented later. In the case of a mixed solution, instead of signal everything in the data stream, only one is signaled part of the configuration, while the rest is taken, by example, from the configuration table in the decoder.

The signaling means 14 includes a control entry 18, whereby media 14 is informed signaling if the parameter setting indication It will have the first or second meaning. As will be commented with respect to figures 4a and 4b, in the so-called operation "synchronous", it is preferred to choose the configuration indication of parameters in such a way that it has the second meaning for obtain information about the coding algorithm of such way on the decoder side and to make adjustments to configuration in multi-channel reconstruction media on the side of the decoder depending on the same. In operation asynchronous, however, control input 18 will trigger the signaling means such that they determine the first meaning for the parameter setting indication, which will be interpreted by a decoder in such a way that there are configuration information in the data itself, and the algorithm audio coding on which the channel data is based on Transmission will not be used.

It should be noted that the data set of parameters and / or parameter data output do not have to be in a rigid way with each other. Thus, the configuration indication, configuration data and parameter data do not have to transmitted together necessarily in a stream or packet, but they can also be provided to the decoder separately between yes.

The following description will present the called "synchronous" operation with respect to the figure 4th. For illustration purposes, Figure 4a illustrates the data of parameters such as a sequence of frames 40, where the sequence of frames 40 is preceded by a header 41 in which is the parameter setting indication generated by the media 14 signaling, and in which if applicable, there is information of additional configuration generated by the writing means 15 of configuration data Parameter data at the output of the means 11 are housed in frames 1, 2, 3, 4, which is the reason why which are called payload data in figure 4a.

The continuation indication FSH, which is mentioned both in Figure 1 at the output of the signaling means 14 and is also mentioned additionally for the header 41 in Figure 4a, causes a decoder to maintain, that is, continue, an adjustment of configuration previously communicated to it, when it has a certain meaning, while, when the FSH continuation indication has another meaning, there is a decision based on the parameter configuration indication of whether configuration settings will be made in the multi-channel reconstruction means based on the configuration information in the data stream or based on configuration data retrieved by an indication for the audio coding algorithm on the decoding side
dor.

Figure 4a further represents a sequence 42 of transmission data blocks coded in association in the time, which also have four frames, frame 1, frame 2, frame 3, frame 4. The association in time of the parameter data with The encoded transmission channel data is illustrated by arrows vertical in figure 4a. Thus, a block of channel data from encoded transmission will always relate to a block of input data and / or, when overlapping windows are used, when less the progress of how much data in a block will be determined they process again compared to the previous block and, in synchronous operation, they will be synchronous with the block length and / or the progress in which the parameter data is obtained. This guarantees that the connection between the reconstruction parameters by one part and the transmission channel data on the other hand is not lose

This will be explained by means of a brief example. Assuming a 5-channel input signal, this input signal 5-channel will have five different audio channels that include time samples from time x to time y, respectively. In phase 114 of downmix of Figure 6, it is generated then at least one transmission channel that will be synchronous with the multichannel input data. A part of the channel data of transmission from time x to time and will correspond by both to a part of the respective multichannel input data from time x to time y. In addition, the means 116 of BCC analysis of Figure 6 generates, for example, data from parameters, again exactly for the time section of the transmission channel data from time x to time y, such that, on the decoder side, they can be generated again respective output channel data from time x up to time and from the transmission channel data from time x to time y and parameter data from time x to time y.

Operation is automatically achieved synchronous when the frames with which they are generated and write the Parameter data are equal to the frames with which the audio encoder to compress the one or more channels of transmission. So, if the frames of both the parameter data as the encoded transmission channel data (40 and 42 in the Figure 4a) always refer to the same part of time, a multi-channel reconstruction device can always process easily corresponding data to an audio frame and process a plot of parameters at the same time.

In synchronous operation, the frame length of the audio encoder used for data transmission downmix is therefore equal to the frame length used by the multichannel parametric scheme. Similarly, too there is of course the possibility that there is an entire relationship between the frame lengths of the parameter data and the Transmission channel data encoded. In this case, the lateral information for parametric multichannel coding can be multiplexed in the encoded bit stream of the signal audio downmix in such a way that a stream of a single bit In the case of "feedback" of stereo data already existing, there would still be two different data streams. Without However, there would be a ratio of 1: 1 and / or m: 1 or m: n between the two frame sequences. Plot sweeps would never move each. Thus, there is an unambiguous association between the plots of audio data and lateral information data frames corresponding parametric. This mode can be favorable for various applications

According to the invention, the indication of parameter settings would have the first meaning in such case. This means that there would not be or only part of the configuration information in header 41, because it provides the means of multi-channel reconstruction itself information about the underlying audio encoder and depending of the same, they choose their configuration setting, that is, by example the number of time samples for the advance or the block length, etc.

In constaste, Figure 4b shows a asynchronous operation. There is asynchronous operation when the transmission channel data 42 'does not have, for example, a frame structure, but only occur as a flow of PCM samples. Alternatively, such asynchronous situation also would arise when the audio encoder had a structure of irregular plot or simply a frame structure with a frame length and / or a different frame scan than the scan of 40 parameter data frame. Here, the scheme of parametric multichannel coding and the means of audio coding / decoding are therefore considered as isolated and separate processing phases that do not depend on yes. This is particularly advantageous in the case of so-called tandem coding scenarios in which there are several phases successive coding / decoding. If the data of parameters are fixedly coupled to the audio data tablets, a multichannel synthesis and a subsequent multichannel analysis simultaneously in each coding / decoding. Since these operations are many losses, losses would accumulate gradually, which would give as a result an increasing deterioration of printing multichannel

In a tandem chain of this type, the setting of the parameter setting indication to the second meaning and structure of configuration information in the data flow allow adjustment of media configuration of multichannel reconstruction in the decoder independently of the underlying audio encoder. So, downmix data they can be decoded / encoded in any way without having provided that you perform a multichannel synthesis or multichannel analysis at the same time. Entering configuration information in the data flow and preferably in the data flow of parameters according to the parameter data syntax allows, by so to speak, determine an absolute association of data from parameters with time samples of the channel data of decoded transmission, that is, an association that is self-sufficient and does not occur in relation to a processing rule frame encoder, as in synchronous operation.

In asynchronous operation, it is prevented by both the deterioration of multichannel sound characteristics, because a multichannel analysis / synthesis is not always performed. Therefore, the frame size for encoding / decoding parametric multichannel does not necessarily have to be connected with the frame size of the audio encoder.

The device of Figure 1 can be implemented both as an encoder and as called "forward transcoder". In the first case, the multichannel parameter means calculate the data of parameters In the second case, they receive the parameter data already in a certain way and provide the data output of parameters of the invention with the configuration indication of parameters and associated configuration data. Thus, the forward transcoder generates the data output of parameters of the invention from any output of data.

The inverse of this measure is done through a called "back transcoder" which, from the parameter data output of the invention, generates some output in which the configuration indication is no longer contained parameters, in which, however, they are completely contained also the configuration data, such that it is not no use of an audio coding algorithm is necessary in the Multi-channel reconstruction for configuration.

According to the invention, the transcoder towards back is designed as a device to generate an output of parameter data that, together with the transmission channel data which include M transmission channels, represents N channels originals, where M is less than N and greater than or equal to 1, using input data, in which the input data comprises a parameter setting indication (41) having a first meaning that configuration information is contained for multichannel reconstruction means in the input data, or It has a second meaning that the means of reconstruction multichannel will use configuration information that depends of a coding algorithm (23) with which they have decoded the transmission channel data from a coded version thereof. Contains writing media for write configuration data where the writing media are designed to first read the input data to interpret (30) the parameter setting indication, and to retrieve information about an encoding algorithm (23) with which the transmission channel data has been decoded from a coded version of them and to issue them as data configuration, when the configuration indication of Parameters has the second meaning.

Next, a diagram of block circuit of a device to generate a signal from multichannel audio according to a preferred embodiment of the present invention, with respect to figure 2. To generate the signal of multichannel audio, input data that includes data is used of the transmission channel representing the M channels of transmission and which also include parameter data 21 for Get K output channels. The M transmission channels and the Parameter data together represent N original channels, where M is less than N and is greater than or equal to 1, and where K is greater than M. In addition, the input data includes an indication of PKH parameter configuration, as already discussed, while the transmission channel data 20 is a decoded version of the transmission channel data 22 encoded according to a coding algorithm In the embodiment shown in the figure 2, the decoding algorithm is performed by a audio decoder 23 which has an algorithm operation coding, for example, according to the concept of MP3 or according to MPEG-2 (AAC) or according to any other concept of coding.

The device to be used on the side of the decoder shown in Figure 2 includes means 24 of Multi-channel reconstruction designed to generate the K channels of output on an output 25 from the channel data 20 of transmission and parameter data 21.

In addition, the device of the invention shown in figure 2 includes configuration means 26 designed for configure the multi-channel reconstruction means 24 by signaling a configuration setting via a signaling line 27. As the input data, the configuration means 26 receive preferably the parameter data 21 for reading and processing correspondingly the parameter setting indication, FSH continuation indication and possibly data from present configuration. In addition, the configuration media include an input 28 signaling algorithm of coding to obtain information about the algorithm of audio coding on which the channel data is based on decoded transmission, that is, the coding algorithm executed by audio encoder 23. The information can be obtained in different ways, for example from a observation of decoded transmission channel data, if can be seen from them with what coding algorithm have been encoded / decoded. Alternatively, the audio decoder 23 can communicate its identity by itself to configuration means 26. Still alternately, the configuration means 26 can also parse syntactically the transmission channel data 22 encoded to determine an indication from the transmission channel data encoded according to what coding algorithm coding there is It took place. Such "coding algorithm signature" will be normally contained in each output data stream of a encoder

Next, an implementation will be described preferred configuration media based on a diagram of blocks with respect to figure 3a. The means 26 of settings are designed to read the indication of PKH parameter configuration from the input data e interpret it, as illustrated in block 30. If the indication of Parameter configuration has a first meaning, the means configuration will continue reading the parameter data flow to extract configuration information (or at least part of the configuration information) in the parameter data flow, as illustrated in block 31. Yes, however, step 30 determines that the PKH parameter setting indication It has the second meaning, the configuration media will get information about an encoding algorithm on which they are based decoded transmission channel data, at the stage 32

If there are several basically possible coding algorithms for which the device of the invention is designed to generate the multichannel signal, step 32 is followed by a subsequent stage 33 in which the multichannel reconstruction means determines (33) a configuration setting. based on existing information on the decoder side. This can be done, for example, in the form of a query table (LUT, Look-Up Table ). If, at the end of step 32, an audio encoder identification indication is obtained, the query table is entered in step 33 using the audio encoder identification indication, where the audio encoder identification indication It is used as an index. Associated in the index are various configuration settings found, such as block length, sampling rate, feed rate, etc., associated with such audio encoder.

Then a configuration setting is applied to the multi-channel reconstruction means at stage 34. Without However, if the first meaning of the configuration indication of parameters is chosen in step 30, the same setting of configuration is done based on configuration information contained in the parameter data flow, as represented by the date of connection between block 31 and block 34 in the figure 3.

The scheme of the invention is flexible because supports configuration information signaling methods both explicit and implicit. This is what the PKH parameter setting indication, which is inserted preferably as a label and, in the best case, requires only a single bit to indicate the signaling of the configuration information itself. The multichannel decoder Parametric can later evaluate this label. If the availability of configuration information available explicitly signaled with this tag, this is used configuration information On the other hand, if the implicit signaling by the tag, the decoder will use information about the audio or voice coding method used and will apply the configuration information based on signaling coding method. For this purpose, the parametric multichannel decoder and / or the means of multichannel reconstruction preferably have a table of query that contains the standard configuration information for a certain number of audio or voice encoders. But nevertheless, there are also other possibilities of a query table that you can, for example, include wired solutions, etc. In general, the decoder can provide the configuration information with default information present in itself depending on the encoder identification information present currently.

This concept is particularly advantageous. because a complete configuration of the scheme can be achieved parameters with a minimum of additional effort, where, in the case extreme, a single bit will be enough, which forms a contrast with the situation that all the configuration information would have to be written explicitly in the data stream itself with a considerably greater effort with respect to bits.

According to the invention, signaling can commute alternately. This allows data manipulation. simple multichannel, even if the representation of data from transmission channel changes, for example when the channel data transmission are decoded and encoded again subsequently, that is, when there is a coding situation in tandem.

The concept of the invention therefore allows the signaling bit saving in the case of operation synchronous on the one hand and switching to asynchronous operation by other part, if necessary, that is, a savings implementation efficient bit and, on the other hand, flexible manipulation, which will be particular interest in relation to the "complementation" of existing stereo data for multichannel representation.

Next, an implementation will be given to example mode of the device of the invention to generate a multichannel audio signal with the example of a pseudocode of syntax, with respect to figure 4c. First, the value of the "useSameBccConfig" variable. Here, the variable It serves as an indication of continuation. Thus, there is only one then to interpret the configuration indication of parameters when this variable, that is, the indication of configuration, has a value equal to, for example, 1. However, if the configuration indication is not equal to 1, that is, it has the other meaning, a transmitted configuration is used previously. If there is still no configuration in the media Multi-channel reconstruction, you have to wait until you get just the first configuration information and / or adjustment of setting. The following will examine the configuration indication of parameters. The variable "codecToBccConfigAlignment" serves as an indication of PKH parameter configuration. If this variable is equal to 1, that is, if it has the second meaning, the decoder will not use any configuration information additional, but will determine the configuration information based on the encoder identification, such as MP3, CoderX or CoderY, as can be seen from the lines that start with "case" in figure 4c. It should be noted that, to As an example, the syntax shown in Figure 4c supports Only MP3, CoderX and CoderY. However, it can be added any other name / identification code.

\ newpage

When it has been determined, for example MP3 as Encoder information, the variable bccConfigID is set by example, to MP3_V1, which is the configuration for an MP3 encoder underlying with the syntax version V1. Subsequently, the decoder is configured with a set of parameters determined based on this BCC configuration identification. Thus, for example, a block length of 576 samples is activated as configuration setting. Thus, frames that have This block length. Configuration settings alternative / additional may be the sampling rate, etc., without However, if the parameter setting indication (codecToBccConfigAlignment) has the first meaning, that is, for example the value 0, the decoder will receive explicitly configuration information from the data stream, that is, you will receive a bccConfigID other than the data stream, that is, from the input data The next procedure is then the same as the one just described. However, this case, no decoder identification is used to decode the transmission channel data encoded for the purpose of Multi-channel reconstruction media configuration.

Thus, the bccConfigID can be used for the purpose of decoding the transmission channel data in the case of a MP3 audio decoder to configure reconstruction media multichannel On the other hand, there can also be any other bccConfigID of configuration information in the data stream and can be evaluated, regardless of whether the audio encoder underlying or is an MP3 encoder or not. The same applies to others. predefined configuration settings, such as CoderX and CoderY, and to an additional free configuration in which the information of Configuration (bccConfigID) is set to individual. In realizations preferred, there is additional configuration information in the flow of data that, in turn, signals to the decoder that it should use a mix of pre-defined configuration information present in the decoder and configuration information explicitly transmitted.

Unlike the described embodiments above, the present invention can also be applied to other multichannel signals that are not audio signals, such as parametrically encoded video signals, etc.

Depending on the circumstances, the method of the invention of generation and / or decoding can be implemented in hardware or software. The implementation can be done in a digital storage medium, in particular a floppy disk or a CD that has control signals that can be read electronically, which can act in conjunction with a system programmable computer, in such a way that the method is executed. Thus, in general, the invention also consists of a product of computer program that has a program code to perform the method stored on a machine-readable media, when the Computer program product runs on a computer. In in other words, the invention can therefore be realized as a computer program that has a program code to perform the method, when the computer program runs on a computer.

Claims (21)

1. Device to generate a multichannel signal using input data that includes channel data from transmission representing M transmission channels and data from parameters to obtain K output channels, in which the M transmission channels and parameter data represent together N original channels, where M is less than N and greater than or equal to 1 and where K is greater than M, in which the input data comprises an indication (41) of parameter settings, which understands:
means (24) of multi-channel reconstruction designed to generate the K output channels from the data  of transmission channel and parameter data; Y
configuration means (26) to configure the multichannel reconstruction means, in which the means of settings are designed to
read the data input to interpret (30) the configuration indication of parameters,
when the parameter setting indication has a first meaning, extract (31) configuration information contained in input data and make (34) a configuration setting of the means of multi-channel reconstruction, and
when the parameter setting indication has a second meaning that differs from the first meaning, configure (34) the multichannel reconstruction means using information on a coding algorithm (23) with which the decodes have been decoded transmission channel data from an encoded version of them in such a way that the configuration setting of the Multi-channel reconstruction means is identical to a setting of configuration of the coding algorithm (23) or depends on a algorithm configuration setting (23) of coding.
2. Device according to claim 1, in the that the transmission channel data comprises a data flow of transmission channel that has a channel data syntax of transmission,
in which the parameter data comprises a parameter data flow that has a data syntax of parameters, in which the transmission channel data syntax differs from the parameter data syntax, and
in which the configuration indication of parameters is inserted into the parameter data according to this syntax,
in which the configuration means (26) are designed to read the parameter data according to the syntax of parameter data and to extract (30) the indication of Parameter settings
3. Device according to claim 1 or 2, in which means (24) of multi-channel reconstruction are designed to perform block processing, in which data from transmission channel are a sequence of samples and in which the Configuration setting includes a block length or number of advance of samples that are processed again by the media (24) of multi-channel reconstruction by processing a block.
4. Device according to claim 3, in the that the transmission channel data are time samples from to minus a transmission channel, and the reconstruction means (24) multichannel comprise a bank of filters to convert a block of time samples of the transmission channel data to a frequency domain representation.
5. Device according to one of the claims above, in which the parameter data comprises a block sequence of parameter values, in which a block of parameter values is associated with a part of time of at least one transmission channel, in which the means (24) of multi-channel reconstruction are designed in such a way that the configuration setting causes the parameter value block and the associated time portion of the at least one transmission channel are used to generate the K output channels.
6. Device according to one of the claims above, in which the coding algorithm (23) is one of between a plurality of various coding algorithms, and
in which the configuration means (26) comprise query table means that include an index and a set of configuration information associated with the index for an encoding algorithm, which respectively comprises the configuration setting for coding algorithms,
in which the configuration means (26) are designed to determine the index for the query table at from the information about the coding algorithm and to determine (33) from it the information of configuration for multichannel reconstruction media.
7. Device according to one of the claims above, in which the input data includes information configuration for multichannel reconstruction means (24) in the case of a parameter setting indication that it has the first meaning, and they comprise only part of or none configuration information for reconstruction media multichannel in the case of the configuration indication of parameters that have the second meaning.
8. Device according to one of the claims above, in which the configuration means (26) are designed to extract only part of the configuration information required from the input data when the indication of Parameter configuration has the second meaning, and for use a remaining part of configuration information from the preset configuration information known to Multichannel reconstruction means.
9. Device according to one of the claims above, in which the configuration means (26) are designed to obtain information about the algorithm of coding through a connection line through which configuration media can be connected to a decoder which generates the transmission channel data from the data of encrypted transmission channel, or to obtain the information about the coding algorithm reading the channel data of transmission or encrypted transmission channel data, when the parameter setting indication has the second meaning.
10. Device according to one of the previous claims, wherein the input data further comprise an indication (41) of continuation and
in which the configuration means (26) are designed to read and interpret (29) the continuation indication  to make a fixed setting setting or previously signaled from multichannel reconstruction media in case the continuation indication has a first meaning, and to configure (30) the reconstruction means multichannel based on parameter setting indication  only if the continuation indication has a Second meaning that differs from the first meaning.
11. Device according to claim 10, in which the continuation indication is associated with the data of parameters according to a parameter data syntax and is a tag in the parameter data flow.
12. Device according to one of the previous claims, wherein the indication of Parameter configuration is associated with data from parameters according to a parameter data syntax and is a tag in the parameter data flow.
13. Device according to claim 11 or 12, in which the continuation indication or indication of Parameter configuration each includes a single bit.
14. Method to generate a multichannel signal using input data that includes channel data from transmission representing M transmission channels and data from parameters to obtain K output channels, in which the M transmission channels and parameter data represent together N original channels, where M is less than N and greater than or equal to 1, and where K is greater than M, in which the input data comprises an indication (41) of parameter settings, which understands:
rebuild (24) the K output channels to from the transmission channel data and the data from parameters according to a reconstruction algorithm;
configure (26) the reconstruction algorithm through the following sub-stages:
read the data input to interpret (30) the configuration indication of parameters;
when the parameter setting indication has a first meaning, extract (31) configuration information contained in input data and make (34) a configuration setting of the reconstruction algorithm and
when the parameter setting indication has a second meaning that differs from the first meaning, effect (34) the setting of the reconstruction algorithm using information about an encoding algorithm (22) with which have decoded the data of the transmission channel from a coded version thereof, such that the setting of configuration is identical to a configuration setting of the coding algorithm (23) or depends on an adjustment of Configuration of the coding algorithm (23).
15. Device for generating data output of parameters that, together with the transmission channel data that include M transmission channels, represents N original channels, where M is less than N and is greater than or equal to 1, which comprises:
means (11) of multi-channel parameters for provide the parameter data;
signaling means (14) for determining a parameter setting indication, in which the indication parameter setting has a first meaning when it goes to use configuration information contained in the output of parameter data for multichannel reconstruction media, and in which the parameter setting indication has a second meaning when configuration data will be used for a multi-channel reconstruction that are based on an algorithm of coding to be used for coding or decoding of the M transmission channels; Y
means (15) of writing data from configuration to issue the encoding information for get the output of parameter data.
16. Device according to claim 15, in which means (15) of writing configuration data are designed to insert a continuation indication in the parameter data set,
in which the continuation indication causes use a fixed setting setting previously signaled in a multi-channel reconstruction how much a first meaning, and makes a configuration of a multi-channel reconstruction using the indication of parameter setting when continuation indication It has a second meaning that differs from the first meaning.
17. Device according to claim 15 or 16, in which the means of writing configuration data are designed to associate any or only part of the information in necessary configuration with the parameter data set when the parameter setting indication has the second meaning (17).
18. Method to generate data output from parameters that, together with the transmission channel data that include M transmission channels, represents N original channels, where M is less than N and is greater than or equal to 1, which comprises:
provide (11) the parameter data;
determine (14) a configuration indication of parameters, in which the configuration indication of parameters has a first meaning when it is to be used configuration information contained in the data output of parameters for a multichannel reconstruction algorithm, and in the that the parameter setting indication has a second meaning when configuration data will be used for a multichannel reconstruction that is based on an algorithm of coding to be used for coding or decoding of the M transmission channels; Y
issue (15) the configuration information for get the output of parameter data.
19. Device for generating data output of parameters that, together with the transmission channel data that include M transmission channels, represents N original channels, where M is less than N and is greater than or equal to 1, using data from input, in which the input data comprises an indication (41) parameter configuration that has a first meaning that the configuration information for media of Multi-channel reconstruction is contained in the input data, or It has a second meaning that the means of reconstruction multichannel will use configuration information that depends of a coding algorithm (23) with which they have decoded the transmission channel data, which comprises:
writing media to write data from configuration, in which the writing media are designed for
read the data input to interpret (30) the configuration indication of parameters, and
when the parameter setting indication has the second meaning, retrieve and issue as the data information of configuration on a coding algorithm (23) with which They have decoded the transmission channel data.
20. Method to generate data output from parameters that, together with the transmission channel data that include M transmission channels, represents N original channels, where M is less than N and is greater than or equal to 1, using data from input, in which the input data comprises an indication (41) parameter configuration that has a first meaning that the configuration information for rebuild media  multichannel is contained in the input data, or has a second meaning that multichannel reconstruction means they will use configuration information that depends on a coding algorithm (23) with which the decodes have been decoded transmission channel data, comprising:
read the input data to interpret (30) the parameter setting indication;
when the setting indication of parameters has the second meaning, retrieve information about  an encoding algorithm (23) with which they have been decoded the transmission channel data, and broadcast the data of configuration recovered.
21. Software product you have a program code to perform the method according to the claim 14, claim 18 or claim 20, in which the computer program runs on a computer.
ES05782843T 2004-09-08 2005-08-10 Method and device for generating multichannel signal or set of parameter data. Active ES2314706T3 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
DE102004043521A DE102004043521A1 (en) 2004-09-08 2004-09-08 Device and method for generating a multi-channel signal or a parameter data set
DE102004043521 2004-09-08

Publications (1)

Publication Number Publication Date
ES2314706T3 true ES2314706T3 (en) 2009-03-16

Family

ID=35502612

Family Applications (1)

Application Number Title Priority Date Filing Date
ES05782843T Active ES2314706T3 (en) 2004-09-08 2005-08-10 Method and device for generating multichannel signal or set of parameter data.

Country Status (18)

Country Link
US (1) US8731204B2 (en)
EP (1) EP1687809B1 (en)
JP (1) JP4601669B2 (en)
KR (1) KR100857920B1 (en)
CN (1) CN101014999B (en)
AT (1) AT409938T (en)
AU (1) AU2005281966B2 (en)
BR (1) BRPI0515651B1 (en)
CA (1) CA2579114C (en)
DE (2) DE102004043521A1 (en)
ES (1) ES2314706T3 (en)
HK (1) HK1093595A1 (en)
IL (1) IL181743D0 (en)
MX (1) MX2007002854A (en)
NO (1) NO338932B1 (en)
PT (1) PT1687809E (en)
RU (1) RU2355046C2 (en)
WO (1) WO2006027079A1 (en)

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100740807B1 (en) 2004-12-31 2007-07-19 한국전자통신연구원 Method for obtaining spatial cues in Spatial Audio Coding
EP1691348A1 (en) * 2005-02-14 2006-08-16 Ecole Polytechnique Federale De Lausanne Parametric joint-coding of audio sources
US7788107B2 (en) 2005-08-30 2010-08-31 Lg Electronics Inc. Method for decoding an audio signal
JP4568363B2 (en) 2005-08-30 2010-10-27 エルジー エレクトロニクス インコーポレイティド Audio signal decoding method and apparatus
JP4859925B2 (en) 2005-08-30 2012-01-25 エルジー エレクトロニクス インコーポレイティド Audio signal decoding method and apparatus
EP1853092B1 (en) * 2006-05-04 2011-10-05 LG Electronics, Inc. Enhancing stereo audio with remix capability
ES2380059T3 (en) 2006-07-07 2012-05-08 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for combining multiple audio sources encoded parametrically
KR101438387B1 (en) * 2006-07-12 2014-09-05 삼성전자주식회사 Method and apparatus for encoding and decoding extension data for surround
JP5451394B2 (en) 2006-09-29 2014-03-26 韓國電子通信研究院Electronics and Telecommunications Research Institute Apparatus and method for encoding and decoding multi-object audio signal composed of various channels
US9418667B2 (en) 2006-10-12 2016-08-16 Lg Electronics Inc. Apparatus for processing a mix signal and method thereof
CN101169866B (en) 2006-10-26 2010-09-01 朱明程 Self-reconfigurable on-chip multimedia processing system and its self-reconfiguration realization method
EP2238589B1 (en) * 2007-12-09 2017-10-25 LG Electronics Inc. A method and an apparatus for processing a signal
CN100574431C (en) 2007-12-10 2009-12-23 上海华为技术有限公司 Method and network element equipment for implementing video service in wireless communication system
US8654988B2 (en) 2008-05-05 2014-02-18 Qualcomm Incorporated Synchronization of signals for multiple data sinks
EP2124486A1 (en) * 2008-05-13 2009-11-25 Clemens Par Angle-dependent operating device or method for generating a pseudo-stereophonic audio signal
WO2010008198A2 (en) * 2008-07-15 2010-01-21 Lg Electronics Inc. A method and an apparatus for processing an audio signal
EP2146342A1 (en) 2008-07-15 2010-01-20 LG Electronics Inc. A method and an apparatus for processing an audio signal
KR101499785B1 (en) 2008-10-23 2015-03-09 삼성전자주식회사 Method and apparatus of processing audio for mobile device
EP2323130A1 (en) * 2009-11-12 2011-05-18 Koninklijke Philips Electronics N.V. Parametric encoding and decoding
PL2625688T3 (en) * 2010-10-06 2015-05-29 Fraunhofer Ges Forschung Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac)
EP2676267B1 (en) * 2011-02-14 2017-07-19 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoding and decoding of pulse positions of tracks of an audio signal
US8600692B2 (en) * 2011-03-17 2013-12-03 Sysacom Automatically configurable sensing device
JP5820487B2 (en) * 2011-03-18 2015-11-24 フラウンホーファーゲゼルシャフトツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. Frame element positioning in a bitstream frame representing audio content
ES2654792T3 (en) * 2012-08-03 2018-02-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Procedure and decoder for multi-instance spatial audio object coding that employs a parametric concept for down-mix / up-channel multi-channel mixing cases
CN103686179B (en) * 2012-09-26 2019-05-07 中兴通讯股份有限公司 Use the coding of parameter set, coding/decoding method and device, electronic equipment
CN105247613B (en) 2013-04-05 2019-01-18 杜比国际公司 audio processing system
CN103336747B (en) * 2013-07-05 2015-09-09 哈尔滨工业大学 The input of cpci bus digital quantity and the configurable driver of output switch parameter and driving method under vxworks operating system
EP2840811A1 (en) * 2013-07-22 2015-02-25 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method for processing an audio signal; signal processing unit, binaural renderer, audio encoder and audio decoder
CN103412833A (en) * 2013-08-30 2013-11-27 哈尔滨工业大学 CPCI (Compact Peripheral Component Interconnect) bus scanning ADC (Analog-to-Digital Converter) function module driver under VxWorks operating system and control method thereof
EP2863386A1 (en) 2013-10-18 2015-04-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder
CN103744805B (en) * 2014-01-03 2016-04-27 哈尔滨工业大学 Cpci bus switching value and analog output module hardware structure and the configurable driving method of sequential under VxWorks

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5329000A (en) * 1991-10-31 1994-07-12 Becton, Dickinson And Company Purification of DNA with silicon tetrahydrazide
DE4236989C2 (en) * 1992-11-02 1994-11-17 Fraunhofer Ges Forschung Method for transmitting and / or storing digital signals of multiple channels
EP0631458B1 (en) 1993-06-22 2001-11-07 Deutsche Thomson-Brandt Gmbh Method for obtaining a multi-channel decoder matrix
DE69428939D1 (en) * 1993-06-22 2001-12-13 Thomson Brandt Gmbh A method for maintaining a Mehrkanaldekodiermatrix
JP2001523937A (en) 1997-11-14 2001-11-27 エックスディー・ラブ・アール・アンド・ディー インコーポレーテッド Stereophonic surround sound decoding circuit after amplification
KR100335609B1 (en) * 1997-11-20 2002-04-23 삼성전자 주식회사 Scalable audio encoding/decoding method and apparatus
KR100335611B1 (en) * 1997-11-20 2002-04-23 삼성전자 주식회사 Scalable stereo audio encoding/decoding method and apparatus
JPH11330980A (en) 1998-05-13 1999-11-30 Matsushita Electric Ind Co Ltd Decoding device and method and recording medium recording decoding procedure
US6452941B1 (en) * 1998-09-16 2002-09-17 Telefonaktiebolaget Lm Ericsson (Publ) Method and system for alternating transmission of codec mode information
DE19900961A1 (en) 1999-01-13 2000-07-20 Thomson Brandt Gmbh A method and apparatus for reproducing multi-channel audio
US6539357B1 (en) * 1999-04-29 2003-03-25 Agere Systems Inc. Technique for parametric coding of a signal containing information
TW533746B (en) 2001-02-23 2003-05-21 Formosa Ind Computing Inc Surrounding sound effect system with automatic detection and multiple channels
US7116787B2 (en) * 2001-05-04 2006-10-03 Agere Systems Inc. Perceptual synthesis of auditory scenes
US20030035553A1 (en) * 2001-08-10 2003-02-20 Frank Baumgarte Backwards-compatible perceptual coding of spatial cues
TW569551B (en) 2001-09-25 2004-01-01 Roger Wallace Dressler Method and apparatus for multichannel logic matrix decoding
US8498422B2 (en) * 2002-04-22 2013-07-30 Koninklijke Philips N.V. Parametric multi-channel audio representation
DE60326782D1 (en) * 2002-04-22 2009-04-30 Koninkl Philips Electronics Nv Decoding device with decorrelation unit
US7006636B2 (en) * 2002-05-24 2006-02-28 Agere Systems Inc. Coherence-based audio coding and synthesis
US7292901B2 (en) * 2002-06-24 2007-11-06 Agere Systems Inc. Hybrid multi-channel/cue coding/decoding of audio signals
US20040037433A1 (en) 2002-08-21 2004-02-26 Heng-Chien Chen Multi-channel wireless professional audio system
EP1414273A1 (en) * 2002-10-22 2004-04-28 Philips Electronics N.V. Embedded data signaling
JP2005352396A (en) * 2004-06-14 2005-12-22 Matsushita Electric Ind Co Ltd Sound signal encoding device and sound signal decoding device
US8204261B2 (en) * 2004-10-20 2012-06-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Diffuse sound shaping for BCC schemes and the like
US7751572B2 (en) * 2005-04-15 2010-07-06 Dolby International Ab Adaptive residual audio coding
ES2407820T3 (en) * 2006-02-23 2013-06-14 Lg Electronics Inc. Method and apparatus for processing an audio signal

Also Published As

Publication number Publication date
EP1687809B1 (en) 2008-10-01
BRPI0515651A (en) 2008-07-29
AU2005281966A1 (en) 2006-03-16
BRPI0515651B1 (en) 2019-07-02
DE102004043521A1 (en) 2006-03-23
US20070206690A1 (en) 2007-09-06
US8731204B2 (en) 2014-05-20
DE502005005522D1 (en) 2008-11-13
RU2355046C2 (en) 2009-05-10
MX2007002854A (en) 2007-05-08
EP1687809A1 (en) 2006-08-09
NO20071132L (en) 2007-04-03
IL181743D0 (en) 2007-07-04
KR100857920B1 (en) 2008-09-10
AT409938T (en) 2008-10-15
HK1093595A1 (en) 2009-02-13
CN101014999A (en) 2007-08-08
CN101014999B (en) 2011-04-27
NO338932B1 (en) 2016-10-31
CA2579114A1 (en) 2006-03-16
PT1687809E (en) 2009-01-14
JP4601669B2 (en) 2010-12-22
AU2005281966B2 (en) 2008-07-17
JP2008512708A (en) 2008-04-24
RU2007112943A (en) 2008-10-20
WO2006027079A1 (en) 2006-03-16
KR20070065314A (en) 2007-06-22
CA2579114C (en) 2011-05-10

Similar Documents

Publication Publication Date Title
JP5209637B2 (en) Audio processing method and apparatus
TWI443647B (en) Methods and apparatuses for encoding and decoding object-based audio signals
TWI307248B (en) Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing
US10433091B2 (en) Compatible multi-channel coding-decoding
US9972330B2 (en) Audio decoder for audio channel reconstruction
AU2007300812B2 (en) Methods and apparatuses for encoding and decoding object-based audio signals
RU2381570C2 (en) Stereophonic compatible multichannel sound encoding
Breebaart et al. MPEG spatial audio coding/MPEG surround: Overview and current status
KR100955361B1 (en) Adaptive residual audio coding
AU2007312598B2 (en) Enhanced coding and parameter representation of multichannel downmixed object coding
EP1927266B1 (en) Audio coding
JP5189979B2 (en) Control of spatial audio coding parameters as a function of auditory events
JP5102213B2 (en) Temporal and spatial shaping of multichannel audio signals
ES2281795T3 (en) Synthesis of audio signal.
EP1829424B1 (en) Temporal envelope shaping of decorrelated signals
KR100924577B1 (en) Parametric Joint-Coding of Audio Sources
JP5191886B2 (en) Reconfiguration of channels with side information
JP4664371B2 (en) Individual channel time envelope shaping for binaural cue coding method etc.
US8180061B2 (en) Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding
US8116459B2 (en) Enhanced method for signal shaping in multi-channel audio reconstruction
JP4887307B2 (en) Near-transparent or transparent multi-channel encoder / decoder configuration
CA2887939C (en) An apparatus for encoding an audio signal having a plurality of channels
EP2535892B1 (en) Audio signal decoder, method for decoding an audio signal and computer program using cascaded audio object processing stages
ES2513265T3 (en) Procedure and apparatus for processing a media signal
JP5134623B2 (en) Concept for synthesizing multiple parametrically encoded sound sources