EP2304721B1 - Spatial synthesis of multichannel audio signals - Google Patents

Spatial synthesis of multichannel audio signals Download PDF

Info

Publication number
EP2304721B1
EP2304721B1 EP09794018A EP09794018A EP2304721B1 EP 2304721 B1 EP2304721 B1 EP 2304721B1 EP 09794018 A EP09794018 A EP 09794018A EP 09794018 A EP09794018 A EP 09794018A EP 2304721 B1 EP2304721 B1 EP 2304721B1
Authority
EP
European Patent Office
Prior art keywords
signal
synthesis
coefficients
decorrelated
spatialization
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
EP09794018A
Other languages
German (de)
French (fr)
Other versions
EP2304721A1 (en
Inventor
Florent Jaillet
David Virette
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Orange SA
Original Assignee
France Telecom SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom SA filed Critical France Telecom SA
Publication of EP2304721A1 publication Critical patent/EP2304721A1/en
Application granted granted Critical
Publication of EP2304721B1 publication Critical patent/EP2304721B1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/02Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems

Definitions

  • the present invention relates to the field of coding / decoding multichannel digital audio signals.
  • the present invention relates to the parametric encoding / decoding of multichannel audio signals.
  • This type of coding / decoding is based on the extraction of spatialization parameters so that at decoding, the spatial perception of the listener can be reconstituted.
  • Such a coding technique is known under the name of "Binaural Cue Coding" in English (BCC) which aims on the one hand to extract and then code the indices of auditory spatialization and on the other hand to code a monophonic or stereophonic signal from a mastering of the original multi-channel signal.
  • This parametric approach is a low rate coding.
  • the main advantage of this coding approach is to allow a better compression rate than conventional multi-channel digital audio compression methods while ensuring the backward compatibility of the compressed format obtained with the existing coding formats and broadcasting systems.
  • the invention relates more particularly to the spatial decoding of a sound scene 3 D from a reduced number of transmitted channels.
  • the figure 1 describes such an encoding / decoding system in which the encoder 100 constructs a sum signal ("downmix") S s by matrixing at 110 the channels of the original multichannel signal S and delivers via an extraction module parameters 120, a reduced set of parameters P which characterize the spatial content of the original multi-channel signal.
  • a sum signal (“downmix") S s by matrixing at 110 the channels of the original multichannel signal S and delivers via an extraction module parameters 120, a reduced set of parameters P which characterize the spatial content of the original multi-channel signal.
  • the multichannel signal is reconstructed (S ') by a synthesis module 160 which takes into account both the sum signal and the transmitted parameters P.
  • the sum signal has a reduced number of channels. These channels can be encoded by a conventional audio encoder before transmission or storage. Typically, the sum signal has two channels and is compatible with conventional stereo broadcasting. Before transmission or storage, this sum signal can thus be encoded by any conventional stereo encoder. The signal thus coded is then compatible with the devices comprising the corresponding decoder which reconstruct the sum signal while ignoring the spatial data.
  • the figure 2 illustrates a first example of a coding structure or code tree using TTO blocks (TTO 0 , TTO 1 , TTO 2 , TTO 3 and TTO 4 ) to obtain a monophonic signal S from a multi-channel signal 5.1 with 6 channels (L, R, C, LFE, Ls and Rs).
  • the figure 3 illustrates a second example of an encoding structure using both TTO blocks and TTT blocks to obtain a stereo signal Sl and Sr from the 5.1 signal.
  • the decoding of the monophonic or stereophonic signals thus received is effected by using a decoding tree symmetrical to those represented in FIGS. Figures 2 and 3 .
  • the decoding can be seen as a succession of reconstruction step.
  • the first decoding step consists in reconstructing the signals corresponding to the input signals of the block TTO 0 from the sum signal S and the spatial parameters extracted by the block TTO 0
  • the next step consists in reconstructing the signals corresponding to the input signals of the block TTO 1 from the signal reconstructed in the previous step and the spatial parameters extracted by the block TTO 1
  • the decoding then proceeds in a similar manner until the reconstruction of all the channels encoded multi-channel signal.
  • the decoder constructs a matrix making it possible to go directly from the monophonic sum signal to the 6 reconstructed channels by combining the smaller size matrices of the different blocks TTO and TTT.
  • This technique consists, as shown with reference to the figure 4 , to perform a decorrelation step at 410 by filtering the sum signal to obtain a decorrelated signal d.
  • the sum signal and the decorrelated signal thus obtained are then processed by a synthesis module 420 via a synthesis matrix M, as a function of the spatial parameters R and I to create the two signals I and R respecting the specified spatial parameters.
  • the parameters R and I are respectively the energy ratio between the channels of the multi-channel signal and an inter-channel correlation index of the multi-channel signal channels.
  • the waveform of the reconstructed signal is not controlled since it totally depends on the decorrelation experienced by the signal s.
  • the signals of these different channels which have similar waveforms will interact in the rendering zone by creating constructive and destructive interferences which will make it possible to reconstruct the desired sound field.
  • each TTO block decoder involved in the decoding tree uses a different decorrelation filter, the deformation of the waveform will not be the same for the different channels.
  • the reconstructed channels then no longer have, as in the original signal, near waveforms and the interferences that allowed the reconstruction of the sound field during the restitution, are then no longer as in the original signal. This leads, on the one hand, to a bad spatial reconstruction of the sound stage, and on the other hand to the creation of audible artifacts, the differences in waveforms leading to the creation of perceptible noisy components.
  • the present invention improves the situation.
  • the method according to the invention thus makes it possible to handle the cases where a spatialization parameter situated in a predetermined value range causes such a situation.
  • such a quantitative function may be a decorrelated signal energy function.
  • the spatialization parameters are a parameter (R) of energy ratio between the channels of the multi-channel signal and a interchannel correlation parameter (I) of the multi-channel signal, a range of values being the range in which the intcrcanal correlation parameter is negative.
  • the invention applies more particularly to multi-channel signals having negative interchannel correlations.
  • a different quantitative function is chosen by value range of the spatialization parameters.
  • It relates to a decoder comprising a synthesis device as described above.
  • the invention also relates to multimedia equipment comprising a decoder as described above.
  • such equipment may be for example a mobile phone, an electronic organizer or digital content player, a computer, a set-top box ("set-top box").
  • the invention is directed to a computer program comprising code instructions for implementing the steps of the method as described above, when these instructions are executed by a processor.
  • the figure 5 illustrates an embodiment of the invention. It illustrates a synthesis device for the decoding of a TTO block (TTO -1 ).
  • This device comprises a decorrelation module 510, able to carry out a step of decorrelation of the received signal s which is a sum signal obtained by coding by a matrix of multichannel signals.
  • This decorrelation step is for example that described in the MPEG Surround standard mentioned above.
  • This decorrelated signal d and the sum signal s are taken into account in a synthesis module 520 using a matrix M Minq whose coefficients depend on spatialization parameters R and I received and producing output signals I and r.
  • is a function of R and I and is chosen according to one embodiment of the invention so as to limit the quantity of the decorrelated signal d introduced into the reconstructed signals regardless of the correlation values I, including for values negative.
  • the choice of the value ⁇ can be formalized by introducing a quantitative function q relating to the quantity of decorrelated signal taken into account in the matrixing for the reconstruction of the signals.
  • the quantitative function q is such that the increase in absolute value of the coefficients of the synthesis matrix applied to the decorrelated signal increases the value of the function q applied to these same coefficients.
  • the quantitative function q is a function of energy of the decorrelated signal.
  • the values of ⁇ guaranteeing a satisfactory reconstruction according to the embodiment of the invention described here are chosen so as to minimize the total energy of the decorrelated signal d in the reconstructed signals.
  • the figure 5 represents a synthesis device for decoding a TTO block, here called TTO -1 comprising a decorrelation module 510 of the sum signal, a synthesis module 520 able to apply a synthesis matrix to the decorrelated signal and to the sum signal.
  • the coefficients of this synthesis matrix are determined according to a criterion of minimization of a quantitative function q relative to the amount of decorrelated signal as described above.
  • the figure 5 also illustrates the steps of the spatial synthesis method according to the invention in which from a sum signal, at least two output signals 1 and r are obtained.
  • the sum signal comes from a parametric encoding by mastering a multi-channel signal also providing spatialization parameters.
  • This method is such that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion for minimizing a quantitative function, relating to the quantity of decorrelated signal taken into account in the step of applying the synthesis matrix.
  • the spatialization parameters are parameters designating the energy ratio R between the channels of the original multi-channel signal and an inter-channel correlation measurement of this same signal.
  • parameters derived from parametric coding can also be chosen. These parameters may for example be parameters designating the phase shift between the channels of the multi-channel signal, or time envelope parameters of the audio channels.
  • the figure 6 illustrates another embodiment of the invention in which, according to a value range of at least one of the spatialization parameters received, here the interchannel correlation parameter I, a different synthesis matrix is chosen.
  • the example shown in figure 6 shows two types of synthesis matrix.
  • the first synthesis matrix M is for example that described in the state of the art in the MPEG Surround standard.
  • the corresponding synthesis module is illustrated in 630. This synthesis matrix is applied here to the sum signal and the decorrelated signal d when the parameter I is positive.
  • the synthesis matrix M Minq is that described with reference to the figure 5 .
  • the corresponding synthesis module is represented in 620.
  • the method implemented by this embodiment makes it possible to efficiently process multi-channel signals that exhibit negative interchannel correlations.
  • This type of multi-channel signal is for example a surround-type signal. Indeed, this type of signal has channels in phase opposition.
  • This characteristic element of the signals from surround sound is illustrated in the articles by M. Gerzon entitled “Hierarchical System of Surround Sound Transmission for HDTV “or” Ambisonic Decoders for HDTV ".
  • synthesis matrices can be provided for ranges of different values of the spatialization parameters.
  • Compatibility with existing systems within a certain operating range is then maintained.
  • An improvement in the quality of the synthesis in a particular range of spatialization parameter value is then provided in this embodiment.
  • This type of device TTO -1 as represented in figure 5 or in figure 6 is for example integrated in a digital signal decoder. Such a type of decoder is for example illustrated with reference to the figure 7 .
  • the decoder shown in this figure is typically provided for decoding 5.1 type multi-channel signals.
  • this decoder comprises a plurality of devices TTO -1 (TTO 0 -1 , TTO 1 -1 , TTO 2 -1 , TTO 3 -1 , TTO 4 -1 ) according to the invention for, from a signal S received, obtain a multi-channel signal with 6 channels (L, R, C, LFE, Ls, Rs).
  • the decoding module 730 comprising this plurality of synthesis devices may, of course, be differently configured depending on the coding tree that has been used for the original multi-channel signal.
  • the decoder as represented in figure 7 comprises a QMF analysis module (for "quadrature Mirror Filter” in English) capable of performing a transformation of the sum (or downmix) signal S from the encoder into a frequency signal per subband.
  • the signal per frequency band is then supplied to the input of the decoding module 730.
  • the processed signals enter the QMF synthesis module 720 able to perform an inverse transformation and to bring back the multi-channel signal obtained. in the time domain.
  • QMF analysis and QMF synthesis modules may for example be those as described in the MPEG Surround standard.
  • the decoder as represented in figure 7 receives from the encoder spatialization parameters P which are derived from the parametric encoding of the original multi-channel signal.
  • these parameters may be energy ratio parameters between the channels, correlation measurement between the channels or else phase shift between the channels or finally time envelope.
  • This decoder 700 can be integrated into a multimedia equipment type set-top box or "set-top box", computer or mobile phone, digital content player, personal electronic organizer, etc ...
  • the figure 8 represents an example of such multimedia equipment which comprises in particular an input module E adapted to receive multi-channel audio signals compressed either by a communication network for example or by means of a multi-channel sound recording.
  • These multi-channel signals have been compressed by a parametric coding method which, by mastering the original signal, generates a sum signal S and spatialization parameters P.
  • This coding may in an alternative mode be provided in the multimedia equipment.
  • This equipment comprises one or more synthesis devices according to the invention, represented here physically by a processor PROC cooperating with a memory block BM comprising a memory storage and / or working MEM.
  • the memory block may advantageously comprise a computer program comprising code instructions for implementing the steps of the method in the sense of the invention, when these instructions are executed by the processor PROC, and in particular a decorrelation step of a signal.
  • a computer program comprising code instructions for implementing the steps of the method in the sense of the invention, when these instructions are executed by the processor PROC, and in particular a decorrelation step of a signal.
  • the synthesis matrix is such that, for at least one value range of at least one spatialization parameter, its coefficients are determined according to a criterion of minimization of a quantitative function, relative to the quantity of decorrelated signal taken into account in the step of applying the synthesis matrix.
  • the description of the figure 5 takes the steps of an algorithm of such a computer program.
  • the computer program can also be stored on a memory medium readable by a reader of the device or downloadable in the memory space of the equipment.
  • the memory block thus comprises the coefficients of the synthesis matrix as defined above.
  • This memory block may comprise in another embodiment of the invention as described with reference to FIG. figure 6 , defining coefficients several synthesis matrices which are applied to the sum signal and to the decorrelated signal as a function of the range of values of the spatialization parameters received.
  • the processor of the equipment may also include instructions for implementing the steps of analysis and synthesis of the decoder as described with reference to the figure 7 .
  • the multimedia equipment as illustrated also comprises an output S for delivering the reconstructed multi-channel signal S 'either by speaker-type reproduction means or by communication means capable of transmitting this multi-channel signal.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Algebra (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

A method and associated device are provided for spatial synthesis of a sum signal to obtain at least two output signals, the sum signal as well as the spatialization parameters being output from a parametric coding by matrixing of an original multi-channel signal. The method comprises: decorrelation of the sum signal to obtain a decorrelated signal; applying a synthesis matrix, whose coefficients depend on the spatialization parameters, to the decorrelated signal and to the sum signal to obtain said output signals, wherein for at least one range of value of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion of minimizing a quantitative function, relating to the quantity of decorrelated signal in each of the output signals obtained by applying the synthesis matrix.

Description

La présente invention se rapporte au domaine du codage/décodage de signaux audio numériques multicanaux.The present invention relates to the field of coding / decoding multichannel digital audio signals.

Plus particulièrement, la présente invention se rapporte au codage/décodage paramétrique de signaux audio multicanaux.More particularly, the present invention relates to the parametric encoding / decoding of multichannel audio signals.

Ce type de codage/décodage se base sur l'extraction de paramètres de spatialisation pour qu'au décodage, la perception spatiale de l'auditeur puisse être reconstituée.This type of coding / decoding is based on the extraction of spatialization parameters so that at decoding, the spatial perception of the listener can be reconstituted.

Une telle technique de codage est connu sous le nom de "Binaural Cue Coding" en anglais (BCC) qui vise d'une part à extraire puis à coder les indices de spatialisation auditive et d'autre part à coder un signal monophonique ou stéréophonique issu d'un matriçage du signal multi-canal original.Such a coding technique is known under the name of "Binaural Cue Coding" in English (BCC) which aims on the one hand to extract and then code the indices of auditory spatialization and on the other hand to code a monophonic or stereophonic signal from a mastering of the original multi-channel signal.

Cette approche paramétrique est un codage à bas débit. Le principal intérêt de cette approche de codage est de permettre un taux de compression meilleur que les méthodes classiques de compression de signaux audionumériques multicanaux tout en assurant la rétrocompatibilité du format compressé obtenu avec les formats de codage et les systèmes de diffusions déjà existants.This parametric approach is a low rate coding. The main advantage of this coding approach is to allow a better compression rate than conventional multi-channel digital audio compression methods while ensuring the backward compatibility of the compressed format obtained with the existing coding formats and broadcasting systems.

Ainsi, l'invention porte plus particulièrement sur le décodage spatial d'une scène sonore 3 D à partir d'un nombre réduit de canaux transmis.Thus, the invention relates more particularly to the spatial decoding of a sound scene 3 D from a reduced number of transmitted channels.

La norme MPEG Surround décrite dans le document de la norme MPEG ISO/IEC 23003-1:2007 et dans le document de " Breebaart, J. and Hotho, G. and Koppens, J. and Schuijers, E. and Oomen, W. and van de Par, S.," intitulé "Background, concept, and architecture for the recent MPEG surround standard on multichannel audio compression" dans Journal of the Audio Engineering Society 55-5 (2007) 331-351 , décrit une structure spécifique de codage/décodage du signal audio multi-canal.The MPEG Surround standard described in the MPEG document ISO / IEC 23003-1: 2007 and in the document " Breebaart, J. and Hotho, G. and Koppens, J. and Schuijers, E. and Oomen, W. and van de Par, S., "entitled" Background, concept, and architecture for the recent MPEG surround standard on multichannel audio Compression "in Journal of the Audio Engineering Society 55-5 (2007) 331-351 , describes a specific coding / decoding structure of the multi-channel audio signal.

La figure 1 décrit un tel système de codage/décodage dans lequel l'encodeur 100 construit un signal somme ("downmix" en anglais) Ss par matriçage en 110 des canaux du signal multi-canal original S et fournit via un module d'extraction de paramètres 120, un ensemble réduit de paramètres P qui caractérisent le contenu spatial du signal multi-canal original.The figure 1 describes such an encoding / decoding system in which the encoder 100 constructs a sum signal ("downmix") S s by matrixing at 110 the channels of the original multichannel signal S and delivers via an extraction module parameters 120, a reduced set of parameters P which characterize the spatial content of the original multi-channel signal.

Au décodeur 150, le signal multicanal est reconstruit (S') par un module de synthèse 160 qui prend en compte à la fois le signal somme et les paramètres P transmis.At the decoder 150, the multichannel signal is reconstructed (S ') by a synthesis module 160 which takes into account both the sum signal and the transmitted parameters P.

Le signal somme comporte un nombre réduit de canaux. Ces canaux peuvent être codés par un codeur audio classique avant transmission ou stockage. Typiquement, le signal somme comporte deux canaux et est compatible avec une diffusion stéréo classique. Avant transmission ou stockage, ce signal somme peut ainsi être codé par n'importe quel codeur stéréo classique. Le signal ainsi codé est alors compatible avec les dispositifs comportant le décodeur correspondant qui reconstruisent le signal somme en ignorant les données spatiales.The sum signal has a reduced number of channels. These channels can be encoded by a conventional audio encoder before transmission or storage. Typically, the sum signal has two channels and is compatible with conventional stereo broadcasting. Before transmission or storage, this sum signal can thus be encoded by any conventional stereo encoder. The signal thus coded is then compatible with the devices comprising the corresponding decoder which reconstruct the sum signal while ignoring the spatial data.

La norme MPEG Surround a retenu une structure spécifique pour la représentation des données spatiales : le codeur s'appuie sur une structure arborescente de codage construite à partir d'un nombre réduit de blocs élémentaires de codage permettant chacun d'extraire des paramètres spatiaux sur un nombre réduit de canaux. Il existe deux types de bloc élémentaires de codage :

  • les blocs TTO (pour "Two To One" en anglais) qui permettent d'extraire les paramètres spatiaux entre deux canaux et de construire un signal somme monophonique à partir de ces deux canaux,
  • les blocs TTT (pour "Three To Two" en anglais) qui permettent d'extraire les paramètres spatiaux entre trois canaux et de construire un signal somme contenant deux canaux à partir de ces trois canaux.
The MPEG Surround standard has adopted a specific structure for the representation of spatial data: the encoder relies on a coding tree structure constructed from a reduced number of elementary coding blocks, each of which makes it possible to extract spatial parameters on a single space. reduced number of channels. There are two basic block types of coding:
  • the TTO blocks (for "Two To One" in English) which make it possible to extract the spatial parameters between two channels and to construct a monophonic sum signal from these two channels,
  • the TTT blocks (for "Three To Two" in English) which make it possible to extract the spatial parameters between three channels and to construct a sum signal containing two channels from these three channels.

La figure 2 illustre un premier exemple d'une structure de codage ou arbre de codage utilisant des blocs TTO (TTO0, TTO1, TTO2, TTO3 et TTO4) pour obtenir un signal monophonique S à partir d'un signal multi-canal 5.1 comportant 6 canaux (L, R, C, LFE, Ls et Rs).The figure 2 illustrates a first example of a coding structure or code tree using TTO blocks (TTO 0 , TTO 1 , TTO 2 , TTO 3 and TTO 4 ) to obtain a monophonic signal S from a multi-channel signal 5.1 with 6 channels (L, R, C, LFE, Ls and Rs).

La figure 3 illustre un deuxième exemple de structure de codage utilisant à la fois des blocs TTO et des blocs TTT pour obtenir un signal stéréophonique Sl et Sr à partir du signal 5.1.The figure 3 illustrates a second example of an encoding structure using both TTO blocks and TTT blocks to obtain a stereo signal Sl and Sr from the 5.1 signal.

Le décodage des signaux monophoniques ou stéréophoniques ainsi reçus s'effectue en utilisant un arbre de décodage symétrique à ceux représentés aux figures 2 et 3.The decoding of the monophonic or stereophonic signals thus received is effected by using a decoding tree symmetrical to those represented in FIGS. Figures 2 and 3 .

Ainsi, pour le décodage d'un signal encodé selon l'arbre de la figure 2, le décodage peut être vu comme une succession d'étape de reconstruction.Thus, for the decoding of a signal encoded according to the tree of the figure 2 , the decoding can be seen as a succession of reconstruction step.

Dans ce cas la première étape de décodage consiste à reconstruire les signaux correspondant aux signaux d'entrée du bloc TTO0 à partir du signal somme S et des paramètres spatiaux extraits par le bloc TTO0, l'étape suivante consiste alors à reconstruire les signaux correspondant aux signaux d'entrée du bloc TTO1 à partir du signal reconstruit à l'étape précédente et des paramètres spatiaux extraits par le bloc TTO1, le décodage se poursuit ensuite de manière similaire jusqu'à la reconstruction de l'ensemble des canaux du signal multi-canal codé. En pratique, le décodeur construit une matrice permettant de passer directement du signal somme monophonique aux 6 canaux reconstruits par combinaison des matrices de taille inférieure des différents blocs TTO et TTT.In this case, the first decoding step consists in reconstructing the signals corresponding to the input signals of the block TTO 0 from the sum signal S and the spatial parameters extracted by the block TTO 0 , the next step then consists in reconstructing the signals corresponding to the input signals of the block TTO 1 from the signal reconstructed in the previous step and the spatial parameters extracted by the block TTO 1 , the decoding then proceeds in a similar manner until the reconstruction of all the channels encoded multi-channel signal. In practice, the decoder constructs a matrix making it possible to go directly from the monophonic sum signal to the 6 reconstructed channels by combining the smaller size matrices of the different blocks TTO and TTT.

La technique retenue dans la norme MPEG Surround pour le décodage des blocs TTO impose cependant une limitation très pénalisante pour le codage de signaux multicanaux comportant des canaux en opposition de phase.The technique used in the MPEG Surround standard for decoding TTO blocks, however, imposes a very disadvantageous limitation for the coding of multichannel signals comprising channels in phase opposition.

Cette technique de décodage est plus précisément décrite dans la demande de brevet intitulée "signal synthesizing" publiée sous le numéro WO 03/090206 A1 le 30 octobre 2003 (Applicant : Koninklijke Philips Electronics N.V., Inventor : Dirk J. Breebaart).This decoding technique is more precisely described in the patent application entitled "signal synthesizing" published under the number WO 03/090206 A1 October 30, 2003 (Applicant: Koninklijke Philips Electronics NV, Inventor: Dirk J. Breebaart).

Cette technique consiste, comme représenté en référence à la figure 4, à effectuer une étape de décorrélation en 410 par filtrage du signal somme s pour obtenir un signal décorrélé d. Le signal somme et le signal décorrélé ainsi obtenu sont ensuite traités par un module de synthèse 420 via une matrice de synthèse M, en fonction des paramètres spatiaux R et I pour créer les deux signaux l et r respectant les paramètres spatiaux spécifiés. Les paramètres R et I sont ici respectivement le rapport d'énergie entre les canaux du signal multi-canal et un indice de corrélation intercanal des canaux du signal multi-canal.This technique consists, as shown with reference to the figure 4 , to perform a decorrelation step at 410 by filtering the sum signal to obtain a decorrelated signal d. The sum signal and the decorrelated signal thus obtained are then processed by a synthesis module 420 via a synthesis matrix M, as a function of the spatial parameters R and I to create the two signals I and R respecting the specified spatial parameters. The parameters R and I are respectively the energy ratio between the channels of the multi-channel signal and an inter-channel correlation index of the multi-channel signal channels.

Le matriçage des signaux s et d se fait suivant les relations suivantes : l r = λ 1 cos α + β λ 1 sin α + β λ 2 cos - α + β λ 2 sin - α + β s d

Figure imgb0001
avec λ 1 = R 1 + R ,
Figure imgb0002
λ 2 = 1 1 + R ,
Figure imgb0003
α = 1 2 arccos I
Figure imgb0004
et β = arctan λ 2 - λ 1 λ 2 + λ 1 tan α .
Figure imgb0005
The matriculation of the signals s and d is done according to the following relations: l r = λ 1 cos α + β λ 1 sin α + β λ 2 cos - α + β λ 2 sin - α + β s d
Figure imgb0001
with λ 1 = R 1 + R ,
Figure imgb0002
λ 2 = 1 1 + R ,
Figure imgb0003
α = 1 2 arccos I
Figure imgb0004
and β = arctan λ 2 - λ 1 λ 2 + λ 1 tan α .
Figure imgb0005

Or ce matriçage présente la limitation mentionnée ci-dessus et qui rend cette méthode inadaptée au codage des signaux audio multicanaux présentant des corrélations intercanal négatives.However, this matrixing has the limitation mentioned above and makes this method unsuitable for coding multichannel audio signals with negative inter-channel correlations.

En particulier, une telle technique n'est pas adaptée pour le décodage des signaux ambiophoniques qui comportent des oppositions de phase entre canaux.In particular, such a technique is not suitable for decoding the surround signals which comprise phase oppositions between channels.

En effet, lorsque la corrélation intercanal I est négative, et en particulier lorsqu'elle est proche de -1, la proportion de signal décorrélé utilisée pour synthétiser les signaux l et r devient très importante, dépassant dans certains cas de figure nettement la quantité de signal somme s utilisée. Dans le cas le plus problématique, on peut constater que pour une différence intercanal de niveau de 0dB, c'est à dire pour R=1, lorsque la corrélation intercanal I tend vers -1, la matrice de mixage tend vers la matrice suivante : 0 2 2 0 - 2 2 .

Figure imgb0006
Indeed, when the inter-channel correlation I is negative, and in particular when it is close to -1, the proportion of decorrelated signal used to synthesize the signals I and R becomes very important, in some cases clearly exceeding the quantity of the signal. signal sum used. In the most problematic case, it can be seen that for an interchannel level difference of 0 dB, ie for R = 1, when the interchannel correlation I tends to -1, the mixing matrix tends to the following matrix: 0 2 2 0 - 2 2 .
Figure imgb0006

Cette matrice correspond à des signaux reconstruits l = 2 2 d

Figure imgb0007
et r = - 2 2 d
Figure imgb0008
qui ne font pas intervenir le signal somme dans leur expression, mais utilisent uniquement le signal décorrélé. Ainsi, la forme d'onde du signal reconstruit n'est pas contrôlée puisqu'elle dépend totalement de la décorrélation subie par le signal s.This matrix corresponds to reconstructed signals l = 2 2 d
Figure imgb0007
and r = - 2 2 d
Figure imgb0008
which do not involve the sum signal in their expression, but only use the decorrelated signal. Thus, the waveform of the reconstructed signal is not controlled since it totally depends on the decorrelation experienced by the signal s.

Le problème de reconstruction illustré dans l'exemple précédent dans un cas extrême se présente également pour d'autres valeurs de R et I, et est d'autant plus marqué que I est proche de -1. Ainsi, la forme d'onde des canaux reconstruits n'est dans ces cas pas aussi proche qu'elle pourrait l'être des signaux originaux, ce qui limite inutilement la qualité des signaux reconstruits.The reconstruction problem illustrated in the previous example in an extreme case is also present for other values of R and I, and is even more so. marked that I is close to -1. Thus, the waveform of the reconstructed channels is in these cases not as close as it could be the original signals, which unnecessarily limits the quality of the reconstructed signals.

L'effet de cette limitation est encore plus marqué lorsque le signal présente plusieurs canaux ayant des corrélations intercanal proches de -1. Dans ce cas, plus de deux canaux ont des formes d'onde proches, mais certains d'entre eux sont en opposition de phase.The effect of this limitation is even more pronounced when the signal has several channels with interchanal correlations close to -1. In this case, more than two channels have near waveforms, but some of them are out of phase.

Lors de la restitution du signal multi-canal original, les signaux de ces différents canaux qui ont des formes d'ondes proches vont interagir dans la zone de restitution en créant des interférences constructives et destructives qui vont permettre de reconstruire le champ sonore souhaité.During the rendering of the original multi-channel signal, the signals of these different channels which have similar waveforms will interact in the rendering zone by creating constructive and destructive interferences which will make it possible to reconstruct the desired sound field.

Après décodage, la forme d'onde des canaux sera fortement déformée en raison du problème évoqué précédemment.After decoding, the waveform of the channels will be strongly deformed due to the problem mentioned above.

De plus comme chaque décodeur de bloc TTO intervenant dans l'arbre de décodage, utilise un filtre de décorrélation différent, la déformation de la forme d'onde ne sera pas la même pour les différents canaux.In addition, since each TTO block decoder involved in the decoding tree uses a different decorrelation filter, the deformation of the waveform will not be the same for the different channels.

Les canaux reconstruits n'ont alors plus, comme dans le signal original, des formes d'onde proches et les interférences qui permettaient la reconstruction du champ sonore lors de la restitution, ne se font alors plus comme dans le signal original. Ceci aboutit d'une part à une mauvaise reconstruction spatiale de la scène sonore, et d'autre part à la création d'artefacts audibles, les différences de forme d'onde entraînant la création de composantes bruitées perceptibles.The reconstructed channels then no longer have, as in the original signal, near waveforms and the interferences that allowed the reconstruction of the sound field during the restitution, are then no longer as in the original signal. This leads, on the one hand, to a bad spatial reconstruction of the sound stage, and on the other hand to the creation of audible artifacts, the differences in waveforms leading to the creation of perceptible noisy components.

La présente invention vient améliorer la situation.The present invention improves the situation.

A cet effet, la présente invention propose un procédé de synthèse spatiale d'un signal somme pour obtenir au moins deux signaux de sortie, le signal somme ainsi que des paramètres de spatialisation étant issus d'un codage paramétrique par matriçage d'un signal multi-canal original. Le procédé comporte les étapes de:

  • décorrélation du signal somme pour obtenir un signal décorrélé;
  • application d'une matrice de synthèse dont les coefficients dépendent des paramètres de spatialisation, au signal décorrélé et au signal somme pour obtenir lesdits signaux de sortie,
caractérisé en ce que pour au moins une plage de valeur d'au moins un paramètre de spatialisation, les coefficients de la matrice de synthèse sont déterminés selon un critère de minimisation d'une fonction quantitative (q), relative à la quantité de signal décorrélé dans chacun des signaux de sortie obtenus par l'étape d'application de la matrice de synthèse, la fonction quantitative étant telle que l'augmentation en valeur absolue des coefficients de la matrice de synthèse appliqués au signal décorrélé fait augmenter la valeur de ladite fonction appliquée à ces mêmes coefficients.For this purpose, the present invention proposes a method of spatial synthesis of a sum signal to obtain at least two output signals, the sum signal as well as spatialization parameters resulting from parametric encoding by mastering a multi signal. - original channel. The method comprises the steps of:
  • decorrelation of the sum signal to obtain a decorrelated signal;
  • applying a synthesis matrix whose coefficients depend on the spatialization parameters, the decorrelated signal and the sum signal to obtain said output signals,
characterized in that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion of minimization of a quantitative function (q), relative to the quantity of decorrelated signal in each of the output signals obtained by the step of applying the synthesis matrix, the quantitative function being such that the increase in absolute value of the coefficients of the synthesis matrix applied to the decorrelated signal increases the value of said function applied to these same coefficients.

Ainsi, la prise en compte de la quantité de signal décorrélé dans chacun des signaux et donc dans l'étape de synthèse du signal, permet de s'affranchir du cas de figure mentionné précédemment où seul le signal décorrélé intervient dans le matriçage de synthèse. Le procédé selon l'invention permet ainsi de traiter les cas où un paramètre de spatialisation situé dans une plage de valeur prédéterminée entraine une telle situation.Thus, taking into account the amount of decorrelated signal in each of the signals and therefore in the signal synthesis step, makes it possible to dispense with the case mentioned above where only the decorrelated signal is involved in the synthesis matrix. The method according to the invention thus makes it possible to handle the cases where a spatialization parameter situated in a predetermined value range causes such a situation.

La minimisation une telle fonction quantitative permet de définir des coefficients de la matrice de synthèse qui permettent d'assurer un bon respect de la forme d'onde du signal d'entrée dans les signaux de sortie.Minimizing such a quantitative function makes it possible to define coefficients of the synthesis matrix that make it possible to ensure good compliance with the waveform of the input signal in the output signals.

Plus particulièrement et de façon simple, une telle fonction quantitative peut-être une fonction d'énergie du signal décorrélé.More particularly and simply, such a quantitative function may be a decorrelated signal energy function.

Cette fonction respecte bien les caractéristiques mentionnée précédemment.This function meets the characteristics mentioned above.

D'une façon plus générale, la fonction quantitative est du type: q x y = x p + y p 1 p

Figure imgb0009
avec p entier supérieur ou égal à 1.In a more general way, the quantitative function is of the type: q x there = x p + there p 1 p
Figure imgb0009
with p integer greater than or equal to 1.

Dans un mode particulier de réalisation, les paramètres de spatialisation sont un paramètre (R) de rapport d'énergie entre les canaux du signal multi-canal et un paramètre (I) de corrélation intercanal du signal multi-canal, une plage de valeur étant la plage dans laquelle le paramètre de corrélation intcrcanal est négatif.In a particular embodiment, the spatialization parameters are a parameter (R) of energy ratio between the channels of the multi-channel signal and a interchannel correlation parameter (I) of the multi-channel signal, a range of values being the range in which the intcrcanal correlation parameter is negative.

Ainsi, l'invention s'applique plus particulièrement pour les signaux multi-canaux présentant des corrélations intercanal négatives.Thus, the invention applies more particularly to multi-channel signals having negative interchannel correlations.

Elle peut donc être mise en oeuvre uniquement pour les valeurs du paramètre de corrélation intercanal négative ou pour toute valeur de ce paramètre.It can therefore be implemented only for the values of the negative inter-channel correlation parameter or for any value of this parameter.

Dans un autre mode de réalisation, une fonction quantitative différente est choisie par plage de valeur des paramètres de spatialisation.In another embodiment, a different quantitative function is chosen by value range of the spatialization parameters.

Il est alors possible de moduler l'importance relative que l'on veut donner aux différentes matrices de synthèse. Il est ainsi possible de donner un poids important à une matrice telle que définie dans l'état de l'art, pour une plage particulière de paramètres et inversement de donner un poids important à la matrice de synthèse au sens de l'invention pour une autre plage de paramètre. Ainsi, on peut conserver une compatibilité avec les systèmes existants dans une certaine plage de fonctionnement et améliorer la qualité du système dans une plage particulière. De plus, la possibilité d'utiliser plusieurs matrices de synthèse obtenues selon différents critères permet d'optimiser la qualité globale du système pour toute la plage de fonctionnement.It is then possible to modulate the relative importance that we want to give to the different synthesis matrices. It is thus possible to give a significant weight to a matrix as defined in the state of the art, for a particular range of parameters and conversely to give a significant weight to the synthesis matrix within the meaning of the invention for a other parameter range. Thus, one can maintain compatibility with existing systems within a certain operating range and improve the quality of the system in a particular range. In addition, the possibility of using several synthesis matrices obtained according to different criteria makes it possible to optimize the overall quality of the system for the entire operating range.

L'invention se rapporte également à un dispositif de synthèse spatiale d'un signal somme générant au moins deux signaux de sortie, le signal somme ainsi que des paramètres de spatialisation étant issus d'un dispositif de codage paramétrique mettant en oeuvre un matriçage d'un signal multi-canal original. Le dispositif comportant:

  • des moyens de décorrélation (510) du signal somme pour obtenir un signal décorrélé;
  • des moyens d'application (520) d'une matrice de synthèse (M Minq) dont les coefficients dépendent des paramètres de spatialisation, au signal décorrélé et au signal somme pour obtenir lesdits signaux de sortie,
caractérisé en ce que pour au moins une plage de valeur d'au moins un paramètre de spatialisation, les coefficients de la matrice de synthèse sont déterminés selon un critère de minimisation d'une fonction quantitative, relative à la quantité de signal décorrélé dans chacun des signaux de sortie obtenus par les moyens d'application de la matrice de synthèse la fonction quantitative étant telle que l'augmentation en valeur absolue des coefficients de la matrice de synthèse appliqués au signal décorrelé fait augmenter la valeur de ladite fonction appliquée à ces mêmes coefficients.The invention also relates to a device for spatially synthesizing a sum signal generating at least two output signals, the sum signal as well as spatialization parameters coming from a parametric coding device implementing a matrixing of an original multichannel signal. The device comprising:
  • decorrelation means (510) of the sum signal for obtaining a decorrelated signal;
  • application means (520) of a synthesis matrix (M Minq) whose coefficients depend on the spatialization parameters, the decorrelated signal and the sum signal to obtain said output signals,
characterized in that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion of minimization of a quantitative function, relative to the quantity of signal decorrelated in each of the output signals obtained by the application means of the synthesis matrix, the quantitative function being such that the increase in absolute value of the coefficients of the synthesis matrix applied to the uncorrelated signal increases the value of said function applied to these same coefficients.

Elle se rapporte à un décodeur comportant un dispositif de synthèse tel que décrit ci-dessus.It relates to a decoder comprising a synthesis device as described above.

L'invention vise aussi un équipement multimédia comportant un décodeur tel que décrit ci-dessus.The invention also relates to multimedia equipment comprising a decoder as described above.

De façon non limitative, un tel équipement peut-être par exemple un téléphone mobile, un agenda électronique ou lecteur de contenu numérique, un ordinateur, un décodeur de salon ("set-top box").Without limitation, such equipment may be for example a mobile phone, an electronic organizer or digital content player, a computer, a set-top box ("set-top box").

Enfin, l'invention vise un programme informatique comportant des instructions de code pour la mise en oeuvre des étapes du procédé tel que décrit ci-dessus, lorsque ces instructions sont exécutées par un processeur.Finally, the invention is directed to a computer program comprising code instructions for implementing the steps of the method as described above, when these instructions are executed by a processor.

D'autres caractéristiques et avantages de l'invention apparaîtront plus clairement à la lecture de la description suivante, donnée uniquement à titre d'exemple non limitatif, et faite en référence aux dessins annexés, sur lesquels:

  • la figure 1 illustre un système de codage/décodage paramétrique classique de l'état de l'art tel que décrit précédemment;
  • les figures 2 et 3 illustrent des exemples d'arbres de codage tels que décrits précédemment, selon la norme MPEG Surround dans le cas d'un signal multi-canal de type 5.1;
  • la figure 4 illustre un système de décodage de l'état de l'art d'un bloc TTO tel que décrit précédemment;
  • la figure 5 illustre un dispositif de synthèse selon l'invention pour le décodage d'un bloc TTO;
  • la figure 6 illustre un dispositif de synthèse pour le décodage d'un bloc TTO selon un mode particulier de réalisation;
  • la figure 7 illustre un décodeur selon l'invention dans le cas de signaux multicanaux de type 5.1; et
  • la figure 8 illustre un exemple d'équipement multimédia comportant au moins un dispositif de synthèse selon l'invention.
Other features and advantages of the invention will appear more clearly on reading the following description, given solely by way of nonlimiting example, and with reference to the appended drawings, in which:
  • the figure 1 illustrates a conventional parametric coding / decoding system of the state of the art as described above;
  • the Figures 2 and 3 illustrate examples of encoding trees as described above, according to the MPEG Surround standard in the case of a 5.1 type multi-channel signal;
  • the figure 4 illustrates a decoding system of the state of the art of a TTO block as described above;
  • the figure 5 illustrates a synthesis device according to the invention for decoding a TTO block;
  • the figure 6 illustrates a synthesis device for decoding a TTO block according to a particular embodiment;
  • the figure 7 illustrates a decoder according to the invention in the case of multichannel signals of type 5.1; and
  • the figure 8 illustrates an example of multimedia equipment comprising at least one synthesis device according to the invention.

La figure 5 illustre un mode de réalisation de l'invention. Elle illustre un dispositif de synthèse pour le décodage d'un bloc TTO (TTO-1). Ce dispositif comprend un module de décorrélation 510, apte à effectuer une étape de décorrélation du signal reçu s qui est un signal somme obtenu au codage par un matriçage de signaux multicanaux.The figure 5 illustrates an embodiment of the invention. It illustrates a synthesis device for the decoding of a TTO block (TTO -1 ). This device comprises a decorrelation module 510, able to carry out a step of decorrelation of the received signal s which is a sum signal obtained by coding by a matrix of multichannel signals.

Cette étape de décorrélation est par exemple celle décrite dans la norme MPEG Surround citée précédemment.This decorrelation step is for example that described in the MPEG Surround standard mentioned above.

Ce signal décorrélé d et le signal somme s sont pris en compte dans un module de synthèse 520 utilisant une matrice M Minq dont les coefficients dépendent de paramètres de spatialisation R et I reçus et produisant des signaux de sortie l et r.This decorrelated signal d and the sum signal s are taken into account in a synthesis module 520 using a matrix M Minq whose coefficients depend on spatialization parameters R and I received and producing output signals I and r.

Plus précisément, les signaux l et r sont générés par le matriçage suivant: l r = h 11 h 12 h 21 h 22 s d

Figure imgb0010
en respectant les conditions suivantes :

  • l'énergie totale est conservée, c'est-à-dire: h 11 2 + h 12 2 + h 21 2 + h 22 2 = 1
    Figure imgb0011
  • le rapport d'énergie entre l et r vaut R, c'est-à-dire: h 11 2 + h 12 2 = R h 21 2 + h 22 2
    Figure imgb0012
  • l'intercorrélation normalisée entre l et r vaut I, c'est-à-dire: h 11 h 21 + h 12 h 22 h 11 2 + h 12 2 h 21 2 + h 22 2 = I
    Figure imgb0013
More precisely, the signals l and r are generated by the following matrixing: l r = h 11 h 12 h 21 h 22 s d
Figure imgb0010
respecting the following conditions:
  • the total energy is conserved, that is to say: h 11 2 + h 12 2 + h 21 2 + h 22 2 = 1
    Figure imgb0011
  • the energy ratio between l and r is R, that is to say: h 11 2 + h 12 2 = R h 21 2 + h 22 2
    Figure imgb0012
  • the standardized cross-correlation between l and r is equal to I, that is to say: h 11 h 21 + h 12 h 22 h 11 2 + h 12 2 h 21 2 + h 22 2 = I
    Figure imgb0013

En utilisant les deux premières conditions, on a h 11 2 + h 12 2 = R R + 1 et h 21 2 + h 22 2 = 1 R + 1

Figure imgb0014
Using the first two conditions, we have h 11 2 + h 12 2 = R R + 1 and h 21 2 + h 22 2 = 1 R + 1
Figure imgb0014

Les solutions peuvent donc s'écrire sous la forme : h 11 = R R + 1 cos a , h 12 = R R + 1 sin a , h 21 = 1 R + 1 cos b , h 22 = 1 R + 1 sin b

Figure imgb0015
The solutions can be written in the form: h 11 = R R + 1 cos at , h 12 = R R + 1 sin at , h 21 = 1 R + 1 cos b , h 22 = 1 R + 1 sin b
Figure imgb0015

La troisième condition, s'écrit alors : cos a cos b + sin a sin b = I

Figure imgb0016
c'est-à-dire cos(a-b) = I.The third condition is then written: cos at cos b + sin at sin b = I
Figure imgb0016
that is, cos ( ab ) = I.

On voit donc que les matrices solutions du problème sont l'ensemble des matrices paramétrées par β ∈ [0, 2π) de la forme : h 11 h 12 h 21 h 22 = R R + 1 0 0 1 R + 1 cos β + α sin β + α cos β - α sin β - α

Figure imgb0017
avec α = ± arccos I 2 .
Figure imgb0018
We thus see that the solution matrices of the problem are the set of matrices parameterized by β ∈ [0, 2π) of the form: h 11 h 12 h 21 h 22 = R R + 1 0 0 1 R + 1 cos β + α sin β + α cos β - α sin β - α
Figure imgb0017
with α = ± arccos I 2 .
Figure imgb0018

Ainsi, deux valeurs de a sont possibles. La valeur de β est fonction de R et I et est choisi selon un mode de réalisation de l'invention de manière à limiter la quantité du signal décorrélé d introduite dans les signaux reconstruits quelque soit les valeurs de corrélation I, y compris pour des valeurs négatives.Thus, two values of a are possible. The value of β is a function of R and I and is chosen according to one embodiment of the invention so as to limit the quantity of the decorrelated signal d introduced into the reconstructed signals regardless of the correlation values I, including for values negative.

Ainsi, le choix de la valeur β peut être formalisé en introduisant une fonction quantitative q relative à la quantité de signal décorrélé pris en compte dans le matriçage pour la reconstruction des signaux.Thus, the choice of the value β can be formalized by introducing a quantitative function q relating to the quantity of decorrelated signal taken into account in the matrixing for the reconstruction of the signals.

D'une façon générale, la fonction quantitative q est telle que l'augmentation en valeur absolue des coefficients de la matrice de synthèse appliqués au signal décorrélé, fait augmenter la valeur de la fonction q appliquée à ces mêmes coefficients.In general, the quantitative function q is such that the increase in absolute value of the coefficients of the synthesis matrix applied to the decorrelated signal increases the value of the function q applied to these same coefficients.

Ainsi, cette fonction quantitative q est telle qu'elle vérifie les conditions suivantes :

  • pour tous réels x, x', y si |x'| ≥ |x| alors q (x', y) ≥ q(x, y)
  • et symétriquement pour tous réels x, y, y' si |y'|≥|y| alors q(x, y') ≥ q(x, y).
Thus, this quantitative function q is such that it satisfies the following conditions:
  • for all real x, x ', y if | x ' | ≥ | x | then q ( x ' , y ) ≥ q ( x, y )
  • and symmetrically for all real x, y, y 'si | y ' | ≥ | y | then q ( x, y ' ) ≥ q ( x, y ).

Pour I et R fixés, la valeur de β est alors choisie par minimisation de la fonction : f β = q h 12 h 22 = q R R + 1 sin β + α , 1 R + 1 sin β - α

Figure imgb0019
For I and R fixed, the value of β is then chosen by minimizing the function: f β = q h 12 h 22 = q R R + 1 sin β + α , 1 R + 1 sin β - α
Figure imgb0019

De nombreuses fonctions quantitative respectant les conditions décrites ci-dessus peuvent être choisies et permettront d'effectuer un choix satisfaisants de β.Numerous quantitative functions complying with the conditions described above can be chosen and will make it possible to make a satisfactory choice of β.

Ainsi, la fonction q peut-être par exemple de type: q x y = x p + y p 1 p

Figure imgb0020

avec p un entier supérieur ou égal à 1.Thus, the function q can for example be of type: q x there = x p + there p 1 p
Figure imgb0020

with p an integer greater than or equal to 1.

Dans un mode particulier de réalisation, la fonction quantitative q est une fonction d'énergie du signal décorrélé.In a particular embodiment, the quantitative function q is a function of energy of the decorrelated signal.

La fonction q est donc telle que: q x y = x 2 + y 2

Figure imgb0021
The function q is such that: q x there = x 2 + there 2
Figure imgb0021

Ainsi, les valeurs de β garantissant une reconstruction satisfaisante selon le mode de réalisation de l'invention décrit ici sont choisies de façon à minimiser l'énergie totale du signal décorrélé d dans les signaux reconstruits.Thus, the values of β guaranteeing a satisfactory reconstruction according to the embodiment of the invention described here are chosen so as to minimize the total energy of the decorrelated signal d in the reconstructed signals.

On cherche alors β minimisant : h 12 2 + h 22 2 = R R + 1 sin 2 β + α + 1 R + 1 sin 2 β - α

Figure imgb0022
c'est-à-dire h 12 2 + h 22 2 = 1 2 R R + 1 1 - cos 2 β + 2 α + 1 R + 1 1 - cos 2 β - 2 α
Figure imgb0023
ce qui revient à maximiser : g β = R R + 1 cos 2 β + 2 α + 1 R + 1 cos 2 β - 2 α
Figure imgb0024
We then seek β minimizing: h 12 2 + h 22 2 = R R + 1 sin 2 β + α + 1 R + 1 sin 2 β - α
Figure imgb0022
that is to say h 12 2 + h 22 2 = 1 2 R R + 1 1 - cos 2 β + 2 α + 1 R + 1 1 - cos 2 β - 2 α
Figure imgb0023
which amounts to maximizing: boy Wut β = R R + 1 cos 2 β + 2 α + 1 R + 1 cos 2 β - 2 α
Figure imgb0024

La dérivée de g est : β = - 2 R R + 1 sin 2 β + 2 α + 1 R + 1 sin 2 β - 2 α

Figure imgb0025
β = - 2 R - 1 R + 1 sin 2 α cos 2 β + R + 1 R + 1 cos 2 α sin 2 β
Figure imgb0026
The derivative of g is: boy Wut β = - 2 R R + 1 sin 2 β + 2 α + 1 R + 1 sin 2 β - 2 α
Figure imgb0025
boy Wut β = - 2 R - 1 R + 1 sin 2 α cos 2 β + R + 1 R + 1 cos 2 α sin 2 β
Figure imgb0026

Elle s'annule lorsque: tan 2 β = 1 - R R + 1 tan 2 α

Figure imgb0027
It vanishes when: tan 2 β = 1 - R R + 1 tan 2 α
Figure imgb0027

La valeur de β retenue est donc choisie parmi les valeurs vérifiant β = 1 2 arctan 1 - R R + 1 tan 2 α mod π 2

Figure imgb0028
et correspondant bien à une valeur de maximum de g.The value of β retained is therefore chosen from the values verifying β = 1 2 arctan 1 - R R + 1 tan 2 α mod π 2
Figure imgb0028
and corresponding to a value of maximum of g.

Ainsi, la figure 5 représente un dispositif de synthèse pour le décodage d'un bloc TTO, appelé ici TTO-1 comportant un module de décorrélation 510 du signal somme, un module de synthèse 520 apte à appliquer une matrice de synthèse au signal décorrélé et au signal somme. Les coefficients de cette matrice de synthèse sont déterminés selon un critère de minimisation d'une fonction quantitative q relative à la quantité de signal décorrélé tel que décrit ci-dessus.So, the figure 5 represents a synthesis device for decoding a TTO block, here called TTO -1 comprising a decorrelation module 510 of the sum signal, a synthesis module 520 able to apply a synthesis matrix to the decorrelated signal and to the sum signal. The coefficients of this synthesis matrix are determined according to a criterion of minimization of a quantitative function q relative to the amount of decorrelated signal as described above.

La figure 5, illustre également les étapes du procédé de synthèse spatiale selon l'invention dans lequel à partir d'un signal somme s, on obtient au moins deux signaux de sortie 1 et r. Le signal somme est issu d'un codage paramétrique par matriçage d'un signal multi-canal fournissant également des paramètres de spatialisation.The figure 5 also illustrates the steps of the spatial synthesis method according to the invention in which from a sum signal, at least two output signals 1 and r are obtained. The sum signal comes from a parametric encoding by mastering a multi-channel signal also providing spatialization parameters.

Le procédé mis en oeuvre par le dispositif de synthèse comporte les étapes de:

  • décorrélation (Décorr.) du signal somme pour obtenir un signal décorrélé d;
  • application (Synth.) d'une matrice de synthèse (M Minq) dont les coefficients dépendent des paramètres de spatialisation (I, R), au signal décorrélé (d) et au signal somme (s) pour obtenir lesdits signaux de sortie.
The method implemented by the synthesis device comprises the steps of:
  • decorrelation (decorr.) of the sum signal to obtain a decorrelated signal d;
  • application (Synth.) of a synthesis matrix (M Minq) whose coefficients depend on the spatialization parameters (I, R), the decorrelated signal (d) and the sum signal (s) to obtain the said output signals.

Ce procédé est tel que pour au moins une plage de valeur d'au moins un paramètre de spatialisation, les coefficients de la matrice de synthèse sont déterminés selon un critère de minimisation d'une fonction quantitative, relative à la quantité de signal décorrélé pris en compte dans l'étape d'application de la matrice de synthèse.This method is such that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion for minimizing a quantitative function, relating to the quantity of decorrelated signal taken into account in the step of applying the synthesis matrix.

Dans le mode de réalisation décrit précédemment en référence à la figure 5, les paramètres de spatialisation sont des paramètres désignant le rapport d'énergie R entre les canaux du signal multi-canal original et une mesure de corrélation intercanal de ce même signal.In the embodiment described above with reference to the figure 5 , the spatialization parameters are parameters designating the energy ratio R between the channels of the original multi-channel signal and an inter-channel correlation measurement of this same signal.

D'autres paramètres de spatialisation issus du codage paramétrique peuvent également être choisis. Ces paramètres peuvent par exemple être des paramètres désignant le déphasage entre les canaux du signal multi-canal, ou des paramètres d'enveloppe temporelle des canaux audio.Other spatialization parameters derived from parametric coding can also be chosen. These parameters may for example be parameters designating the phase shift between the channels of the multi-channel signal, or time envelope parameters of the audio channels.

La figure 6 illustre un autre mode de réalisation de l'invention dans lequel, en fonction d'une plage de valeur d'au moins un des paramètres de spatialisation reçus, ici le paramètre de corrélation intercanal I, une matrice de synthèse différente est choisi.The figure 6 illustrates another embodiment of the invention in which, according to a value range of at least one of the spatialization parameters received, here the interchannel correlation parameter I, a different synthesis matrix is chosen.

L'exemple illustré à la figure 6 montre deux types de matrice de synthèse.The example shown in figure 6 shows two types of synthesis matrix.

La première matrice de synthèse M est par exemple celle décrite dans l'état de l'art dans la norme MPEG Surround. Le module de synthèse correspondant est illustré en 630. Cette matrice de synthèse est appliquée ici au signal somme s et au signal décorrélé d lorsque le paramètre I est positif.The first synthesis matrix M is for example that described in the state of the art in the MPEG Surround standard. The corresponding synthesis module is illustrated in 630. This synthesis matrix is applied here to the sum signal and the decorrelated signal d when the parameter I is positive.

Lorsque le paramètre I est négatif, la matrice de synthèse M Minq est celle décrite en référence à la figure 5. Le module de synthèse correspondant est représenté en 620.When the parameter I is negative, the synthesis matrix M Minq is that described with reference to the figure 5 . The corresponding synthesis module is represented in 620.

Ainsi, le procédé mis en oeuvre par ce mode de réalisation permet de traiter efficacement des signaux multi-canaux qui présentent des corrélations intercanal négatives.Thus, the method implemented by this embodiment makes it possible to efficiently process multi-channel signals that exhibit negative interchannel correlations.

Ce type de signal multi-canal est par exemple un signal de type ambiophonique. En effet, ce type de signal présente des canaux en opposition de phase. Cet élément caractéristique des signaux issus d'une prise de son ambiophonique est illustré dans les articles de M. Gerzon intitulés "Hierarchical System of Surround Sound Transmission for HDTV" ou "Ambisonic Decoders for HDTV".This type of multi-channel signal is for example a surround-type signal. Indeed, this type of signal has channels in phase opposition. This characteristic element of the signals from surround sound is illustrated in the articles by M. Gerzon entitled "Hierarchical System of Surround Sound Transmission for HDTV "or" Ambisonic Decoders for HDTV ".

Dans une variante de réalisation, plusieurs matrices de synthèse peuvent être prévues pour des plages de valeurs différentes des paramètres de spatialisation.In an alternative embodiment, several synthesis matrices can be provided for ranges of different values of the spatialization parameters.

Ainsi, on peut moduler l'importance relative que l'on veut donner aux différentes matrices de synthèse en fonction des valeurs de paramètres reçus.Thus, it is possible to modulate the relative importance that we want to give to the different synthesis matrices as a function of the received parameter values.

Par exemple, il est ainsi possible de donner un poids important à une matrice M telle que décrite dans l'état de l'art pour une plage particulière de paramètres et inversement de donner un poids important à la matrice de synthèse MMinq au sens de l'invention pour une autre plage de paramètre.For example, it is thus possible to give a significant weight to a matrix M as described in the state of the art for a particular range of parameters and conversely to give a significant weight to the synthesis matrix MMinq within the meaning of invention for another parameter range.

La compatibilité avec les systèmes existants dans une certaine plage de fonctionnement est alors conservée. Une amélioration de la qualité de la synthèse dans une plage particulière de valeur de paramètres de spatialisation est alors apportée dans ce mode de réalisation.Compatibility with existing systems within a certain operating range is then maintained. An improvement in the quality of the synthesis in a particular range of spatialization parameter value is then provided in this embodiment.

De plus, la possibilité d'utiliser plusieurs matrices de synthèse obtenues selon différents critères permet d'optimiser la qualité globale de la synthèse pour toute la plage de fonctionnement.In addition, the possibility of using several synthesis matrices obtained according to different criteria makes it possible to optimize the overall quality of the synthesis for the entire operating range.

On peut par exemple utiliser différentes matrices de synthèse selon que la valeur d'au moins un paramètre de spatialisation est faible ou au contraire importante.For example, it is possible to use different synthesis matrices depending on whether the value of at least one spatialization parameter is small or, on the contrary, important.

Ainsi dans cette variante du mode de réalisation, on utilisera deux matrices de synthèse telles que pour des valeurs positives de l'indice de corrélation I, on utilisera la matrice M telle que décrit dans l'état de l'art, et pour des valeurs négative de l'indice de corrélation I, on utilisera la matrice MMinq.Thus in this variant of the embodiment, use will be made of two synthetic matrices such that for positive values of the correlation index I, the matrix M as described in the state of the art, and for values negative of the correlation index I, we will use the matrix MMinq.

On pourra aussi définir différentes plages de fonctionnement comme par exemple:

  • pour I > 0, on utilise une matrice Minter = M
  • pour 0≥ I>-0.25, on utilisera une interpolation des deux matrices Minter = α M + (1- α) MMinq
  • pour -0.25≥ I>-1, on utilisera la matrice Minter = MMinq
It will also be possible to define different operating ranges, for example:
  • for I> 0, we use a matrix Minter = M
  • for 0≥ I> -0.25, we will use an interpolation of the two matrices Minter = α M + (1- α) MMinq
  • for -0.25≥ I> -1, we will use the matrix Minter = MMinq

Ce type de dispositif TTO-1 tel que représenté en figure 5 ou en figure 6 est par exemple intégré dans un décodeur de signal numérique. Un tel type de décodeur est par exemple illustré en référence à la figure 7.This type of device TTO -1 as represented in figure 5 or in figure 6 is for example integrated in a digital signal decoder. Such a type of decoder is for example illustrated with reference to the figure 7 .

Le décodeur représenté sur cette figure est typiquement prévu pour décoder des signaux multi-canaux de type 5.1. Ainsi, ce décodeur comporte une pluralité de dispositif TTO-1 (TTO0 -1, TTO1 -1, TTO2 -1, TTO3 -1, TTO4 -1) selon l'invention pour, à partir d'un signal S reçu, obtenir un signal multi-canal comportant 6 canaux (L, R, C, LFE, Ls, Rs).The decoder shown in this figure is typically provided for decoding 5.1 type multi-channel signals. Thus, this decoder comprises a plurality of devices TTO -1 (TTO 0 -1 , TTO 1 -1 , TTO 2 -1 , TTO 3 -1 , TTO 4 -1 ) according to the invention for, from a signal S received, obtain a multi-channel signal with 6 channels (L, R, C, LFE, Ls, Rs).

Le module de décodage 730 comportant cette pluralité de dispositif de synthèse peut, bien évidemment, être configuré de façon différente selon l'arbre de codage qui a été utilisé pour le signal multi-canal original.The decoding module 730 comprising this plurality of synthesis devices may, of course, be differently configured depending on the coding tree that has been used for the original multi-channel signal.

Le décodeur tel que représenté en figure 7 comporte un module d'analyse QMF (pour "quadrature Mirror Filter" en anglais) apte à effectuer une transformation du signal temporel somme (ou downmix) S issu du codeur en un signal fréquentiel par sous-bande. Le signal par bande de fréquence est alors fourni en entrée du module de décodage 730. A la sortie du module de décodage, les signaux traités entrent dans le module de synthèse QMF 720 apte à effectuer une transformation inverse et ramener le signal multi-canal obtenu dans le domaine temporel.The decoder as represented in figure 7 comprises a QMF analysis module (for "quadrature Mirror Filter" in English) capable of performing a transformation of the sum (or downmix) signal S from the encoder into a frequency signal per subband. The signal per frequency band is then supplied to the input of the decoding module 730. At the output of the decoding module, the processed signals enter the QMF synthesis module 720 able to perform an inverse transformation and to bring back the multi-channel signal obtained. in the time domain.

Ces modules d'analyse QMF et de synthèse QMF peuvent par exemple être ceux tels que décrits dans la norme MPEG Surround.These QMF analysis and QMF synthesis modules may for example be those as described in the MPEG Surround standard.

Le décodeur tel que représenté en figure 7 reçoit en provenance du codeur des paramètres P de spatialisation qui sont issus du codage paramétrique du signal multi-canal original.The decoder as represented in figure 7 receives from the encoder spatialization parameters P which are derived from the parametric encoding of the original multi-channel signal.

Typiquement, ces paramètres peuvent être des paramètres de rapport d'énergie entre les canaux, de mesure de corrélation entre les canaux ou encore de déphasage entre les canaux ou enfin d'enveloppe temporelle.Typically, these parameters may be energy ratio parameters between the channels, correlation measurement between the channels or else phase shift between the channels or finally time envelope.

Ce décodeur 700 peut être intégré à un équipement multimédia de type décodeur de salon ou "set-top box", ordinateur ou encore téléphone mobile, lecteur de contenu numérique, agenda électronique personnel, etc...This decoder 700 can be integrated into a multimedia equipment type set-top box or "set-top box", computer or mobile phone, digital content player, personal electronic organizer, etc ...

La figure 8 représente un exemple d'un tel équipement multimédia qui comporte notamment un module d'entrée E apte à recevoir des signaux sonores multi-canaux compressés soit par un réseau de communication par exemple ou par le biais d'une prise de son multi-canal.The figure 8 represents an example of such multimedia equipment which comprises in particular an input module E adapted to receive multi-channel audio signals compressed either by a communication network for example or by means of a multi-channel sound recording.

Ces signaux multi-canaux ont été compressés par une méthode de codage paramétrique qui par matriçage du signal original génère un signal somme S et des paramètres de spatialisation P. Ce codage peut dans un mode alternatif être prévu dans l'équipement multimédia.These multi-channel signals have been compressed by a parametric coding method which, by mastering the original signal, generates a sum signal S and spatialization parameters P. This coding may in an alternative mode be provided in the multimedia equipment.

Cet équipement comporte un ou plusieurs dispositifs de synthèse selon l'invention représentés matériellement ici par un processeur PROC coopérant avec un bloc mémoire BM comportant une mémoire de stockage et/ou de travail MEM.This equipment comprises one or more synthesis devices according to the invention, represented here physically by a processor PROC cooperating with a memory block BM comprising a memory storage and / or working MEM.

Le bloc mémoire peut avantageusement comporter un programme informatique comportant des instructions de code pour la mise en oeuvre des étapes du procédé au sens de l'invention, lorsque ces instructions sont exécutées par le processeur PROC, et notamment une étape de décorrélation d'un signal somme reçu pour obtenir un signal décorrélé et une étape d'application d'une matrice de synthèse dont les coefficients dépendent des paramètres de spatialisation, au signal décorrélé et au signal somme pour obtenir au moins deux signaux de sortie. La matrice de synthèse est telle que , pour au moins une plage de valeur d'au moins un paramètre de spatialisation, ses coefficients sont déterminés selon un critère de minimisation d'une fonction quantitative, relative à la quantité de signal décorrélé pris en compte dans l'étape d'application de la matrice de synthèse.The memory block may advantageously comprise a computer program comprising code instructions for implementing the steps of the method in the sense of the invention, when these instructions are executed by the processor PROC, and in particular a decorrelation step of a signal. sum received to obtain a decorrelated signal and a step of applying a synthesis matrix whose coefficients depend on the spatialization parameters, the decorrelated signal and the sum signal to obtain at least two output signals. The synthesis matrix is such that, for at least one value range of at least one spatialization parameter, its coefficients are determined according to a criterion of minimization of a quantitative function, relative to the quantity of decorrelated signal taken into account in the step of applying the synthesis matrix.

Typiquement, la description de la figure 5 reprend les étapes d'un algorithme d'un tel programme informatique. Le programme informatique peut également être stocké sur un support mémoire lisible par un lecteur du dispositif ou téléchargeable dans l'espace mémoire de l'équipement.Typically, the description of the figure 5 takes the steps of an algorithm of such a computer program. The computer program can also be stored on a memory medium readable by a reader of the device or downloadable in the memory space of the equipment.

Le bloc mémoire comporte ainsi les coefficients de la matrice de synthèse telle que définie ci-dessus.The memory block thus comprises the coefficients of the synthesis matrix as defined above.

Ce bloc mémoire peut comporter dans un autre mode de réalisation de l'invention tel que décrit en référence à la figure 6, des coefficients définissants plusieurs matrices de synthèse qui sont appliqués au signal somme et au signal décorrélé en fonction de plage de valeurs des paramètres de spatialisation reçus.This memory block may comprise in another embodiment of the invention as described with reference to FIG. figure 6 , defining coefficients several synthesis matrices which are applied to the sum signal and to the decorrelated signal as a function of the range of values of the spatialization parameters received.

De même le processeur de l'équipement peut également comporter des instructions pour la mise en oeuvre des étapes d'analyse et de synthèse du décodeur tel que décrit en référence à la figure 7.Similarly, the processor of the equipment may also include instructions for implementing the steps of analysis and synthesis of the decoder as described with reference to the figure 7 .

L'équipement multimédia tel qu'illustré comporte également une sortie S pour délivrer le signal multi-canal reconstruit S' soit par des moyens de restitution de type haut-parleur soit par des moyens de communication apte à transmettre ce signal multi-canal.The multimedia equipment as illustrated also comprises an output S for delivering the reconstructed multi-channel signal S 'either by speaker-type reproduction means or by communication means capable of transmitting this multi-channel signal.

Claims (9)

  1. Method for spatially synthesizing a sum signal to obtain at least two output digital audio signals, the sum signal together with spatialization parameters being output by a parametric coding by matrixing of an original multi-channel digital audio signal, the method comprising the steps of:
    - decorrelation (Decorr.) of the sum signal (s) to obtain a decorrelated signal (d);
    - application (Synth.) of a synthesis matrix (M Minq) whose coefficients depend on the spatialization parameters (R, I), to the decorrelated signal and to the sum signal so as to obtain said output signals,
    characterized in that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion for minimizing a quantitative function (q), relating to the quantity of decorrelated signal in each of the output signals obtained by the step of applying the synthesis matrix, the quantitative function being such that the increase in absolute value of the coefficients of the synthesis matrix that are applied to the decorrelated signal increases the value of said function applied to these same coefficients.
  2. Method according to Claim 1, characterized in that the quantitative function is an energy function of the decorrelated signal.
  3. Method according to Claim 1, characterized in that the quantitative function is of the type:
    q x y = x p + y p 1 p
    Figure imgb0030
    with p an integer greater than or equal to 1.
  4. Method according to Claim 1, characterized in that the spatialization parameters are a parameter (R) of energy ratio between the channels of the multi-channel signal and a parameter (I) of interchannel correlation of the multi-channel signal, a value range being the range in which the interchannel correlation parameter is negative.
  5. Method according to Claim 1, characterized in that a different quantitative function is chosen per value range of the spatialization parameters.
  6. Device for spatially synthesizing a sum signal generating at least two output digital audio signals, the sum signal together with spatialization parameters being output by a parametric coding device implementing a matrixing of an original multi-channel digital audio signal, the device comprising:
    - means (510) for decorrelating the sum signal to obtain a decorrelated signal;
    - means (520) for applying a synthesis matrix (M Minq) whose coefficients depend on the spatialization parameters, to the decorrelated signal and to the sum signal so as to obtain said output signals,
    characterized in that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion for minimizing a quantitative function, relating to the quantity of decorrelated signal in each of the output signals obtained by the means for applying the synthesis matrix, the quantitative function being such that the increase in absolute value of the coefficients of the synthesis matrix that are applied to the decorrelated signal increases the value of said function applied to these same coefficients.
  7. Digital audio signal decoder comprising at least one synthesis device according to Claim 6.
  8. Multimedia appliance comprising a decoder according to Claim 7.
  9. Computer program comprising code instructions for the implementation of the steps of the method according to one of Claims 1 to 5, when these instructions are executed by a processor.
EP09794018A 2008-06-26 2009-06-16 Spatial synthesis of multichannel audio signals Active EP2304721B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0854282 2008-06-26
PCT/FR2009/051146 WO2010004155A1 (en) 2008-06-26 2009-06-16 Spatial synthesis of multichannel audio signals

Publications (2)

Publication Number Publication Date
EP2304721A1 EP2304721A1 (en) 2011-04-06
EP2304721B1 true EP2304721B1 (en) 2012-05-09

Family

ID=40328191

Family Applications (1)

Application Number Title Priority Date Filing Date
EP09794018A Active EP2304721B1 (en) 2008-06-26 2009-06-16 Spatial synthesis of multichannel audio signals

Country Status (7)

Country Link
US (1) US8583424B2 (en)
EP (1) EP2304721B1 (en)
JP (1) JP5366104B2 (en)
CN (1) CN102077276B (en)
AT (1) ATE557386T1 (en)
ES (1) ES2387867T3 (en)
WO (1) WO2010004155A1 (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2362375A1 (en) * 2010-02-26 2011-08-31 Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. Apparatus and method for modifying an audio signal using harmonic locking
EP2369861B1 (en) * 2010-03-25 2016-07-27 Nxp B.V. Multi-channel audio signal processing
KR101756838B1 (en) 2010-10-13 2017-07-11 삼성전자주식회사 Method and apparatus for down-mixing multi channel audio signals
KR101842257B1 (en) * 2011-09-14 2018-05-15 삼성전자주식회사 Method for signal processing, encoding apparatus thereof, and decoding apparatus thereof
CN104885150B (en) * 2012-08-03 2019-06-28 弗劳恩霍夫应用研究促进协会 The decoder and method of the universal space audio object coding parameter concept of situation are mixed/above mixed for multichannel contracting
EP2717263B1 (en) 2012-10-05 2016-11-02 Nokia Technologies Oy Method, apparatus, and computer program product for categorical spatial analysis-synthesis on the spectrum of a multichannel audio signal
EP2830334A1 (en) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals
CA2919080C (en) * 2013-07-22 2018-06-05 Sascha Disch Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals
TWI671734B (en) 2013-09-12 2019-09-11 瑞典商杜比國際公司 Decoding method, encoding method, decoding device, and encoding device in multichannel audio system comprising three audio channels, computer program product comprising a non-transitory computer-readable medium with instructions for performing decoding m
WO2016108655A1 (en) 2014-12-31 2016-07-07 한국전자통신연구원 Method for encoding multi-channel audio signal and encoding device for performing encoding method, and method for decoding multi-channel audio signal and decoding device for performing decoding method
KR20160081844A (en) * 2014-12-31 2016-07-08 한국전자통신연구원 Encoding method and encoder for multi-channel audio signal, and decoding method and decoder for multi-channel audio signal
FR3048808A1 (en) * 2016-03-10 2017-09-15 Orange OPTIMIZED ENCODING AND DECODING OF SPATIALIZATION INFORMATION FOR PARAMETRIC CODING AND DECODING OF A MULTICANAL AUDIO SIGNAL
CN111407268B (en) * 2020-03-27 2021-05-14 华南理工大学 Multichannel electroencephalogram signal compression method based on correlation function

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5956674A (en) * 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
US5835375A (en) * 1996-01-02 1998-11-10 Ati Technologies Inc. Integrated MPEG audio decoder and signal processor
DE19632734A1 (en) * 1996-08-14 1998-02-19 Thomson Brandt Gmbh Method and device for generating a multi-tone signal from a mono signal
US6199039B1 (en) * 1998-08-03 2001-03-06 National Science Council Synthesis subband filter in MPEG-II audio decoding
US7006636B2 (en) * 2002-05-24 2006-02-28 Agere Systems Inc. Coherence-based audio coding and synthesis
DE60311794T2 (en) 2002-04-22 2007-10-31 Koninklijke Philips Electronics N.V. SIGNAL SYNTHESIS
TWI315828B (en) * 2002-08-07 2009-10-11 Dolby Lab Licensing Corp Audio channel spatial translation
KR100923297B1 (en) * 2002-12-14 2009-10-23 삼성전자주식회사 Method for encoding stereo audio, apparatus thereof, method for decoding audio stream and apparatus thereof

Also Published As

Publication number Publication date
CN102077276A (en) 2011-05-25
US20110106543A1 (en) 2011-05-05
US8583424B2 (en) 2013-11-12
EP2304721A1 (en) 2011-04-06
WO2010004155A1 (en) 2010-01-14
ATE557386T1 (en) 2012-05-15
ES2387867T3 (en) 2012-10-03
CN102077276B (en) 2014-04-09
JP2011525999A (en) 2011-09-29
JP5366104B2 (en) 2013-12-11

Similar Documents

Publication Publication Date Title
EP2304721B1 (en) Spatial synthesis of multichannel audio signals
EP2374123B1 (en) Improved encoding of multichannel digital audio signals
EP2374124B1 (en) Advanced encoding of multi-channel digital audio signals
EP2005420B1 (en) Device and method for encoding by principal component analysis a multichannel audio signal
EP2002424B1 (en) Device and method for scalable encoding of a multichannel audio signal based on a principal component analysis
EP1600042B1 (en) Method for the treatment of compressed sound data for spatialization
EP2539892B1 (en) Multichannel audio stream compression
EP2042001B1 (en) Binaural spatialization of compression-encoded sound data
FR2966634A1 (en) ENHANCED STEREO PARAMETRIC ENCODING / DECODING FOR PHASE OPPOSITION CHANNELS
EP2319037B1 (en) Reconstruction of multi-channel audio data
WO2007101958A2 (en) Optimization of binaural sound spatialization based on multichannel encoding
FR2875351A1 (en) METHOD OF PROCESSING DATA BY PASSING BETWEEN DOMAINS DIFFERENT FROM SUB-BANDS
EP2489039A1 (en) Optimized low-throughput parametric coding/decoding
WO2017103418A1 (en) Adaptive channel-reduction processing for encoding a multi-channel audio signal
EP2168121A1 (en) Quantification after linear conversion combining audio signals of a sound scene, and related encoder
EP2126905B1 (en) Methods and devices for audio signals encoding and decoding, encoded audio signal
WO2011073600A1 (en) Parametric stereo encoding/decoding having downmix optimisation
EP4042418B1 (en) Determining corrections to be applied to a multichannel audio signal, associated coding and decoding
EP1362344A1 (en) Multichannel signal spectral reconstruction method and device
WO2011045549A1 (en) Optimized parametric stereo decoding
WO2023232823A1 (en) Title: spatialized audio encoding with configuration of a decorrelation processing operation
WO2009081002A1 (en) Processing of a 3d audio stream as a function of a level of presence of spatial components

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20110125

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA RS

17Q First examination report despatched

Effective date: 20110622

DAX Request for extension of the european patent (deleted)
GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: AT

Ref legal event code: REF

Ref document number: 557386

Country of ref document: AT

Kind code of ref document: T

Effective date: 20120515

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: LANGUAGE OF EP DOCUMENT: FRENCH

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 602009006883

Country of ref document: DE

Effective date: 20120705

REG Reference to a national code

Ref country code: NL

Ref legal event code: VDEP

Effective date: 20120509

REG Reference to a national code

Ref country code: ES

Ref legal event code: FG2A

Ref document number: 2387867

Country of ref document: ES

Kind code of ref document: T3

Effective date: 20121003

REG Reference to a national code

Ref country code: LT

Ref legal event code: MG4D

Effective date: 20120509

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120909

Ref country code: NO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120809

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK05

Ref document number: 557386

Country of ref document: AT

Kind code of ref document: T

Effective date: 20120509

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120910

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120810

BERE Be: lapsed

Owner name: FRANCE TELECOM

Effective date: 20120630

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120630

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

REG Reference to a national code

Ref country code: IE

Ref legal event code: MM4A

26N No opposition filed

Effective date: 20130212

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120630

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120616

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 602009006883

Country of ref document: DE

Effective date: 20130212

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120809

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20130630

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20130630

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120509

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120616

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20090616

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 8

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 9

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 10

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: ES

Payment date: 20230703

Year of fee payment: 15

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20240521

Year of fee payment: 16

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20240521

Year of fee payment: 16

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20240522

Year of fee payment: 16

Ref country code: FR

Payment date: 20240522

Year of fee payment: 16