EP2304721B1 - Spatial synthesis of multichannel audio signals - Google Patents
Spatial synthesis of multichannel audio signals Download PDFInfo
- Publication number
- EP2304721B1 EP2304721B1 EP09794018A EP09794018A EP2304721B1 EP 2304721 B1 EP2304721 B1 EP 2304721B1 EP 09794018 A EP09794018 A EP 09794018A EP 09794018 A EP09794018 A EP 09794018A EP 2304721 B1 EP2304721 B1 EP 2304721B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- synthesis
- coefficients
- decorrelated
- spatialization
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 71
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 71
- 230000005236 sound signal Effects 0.000 title claims description 11
- 239000011159 matrix material Substances 0.000 claims abstract description 51
- 238000000034 method Methods 0.000 claims abstract description 26
- 238000004590 computer program Methods 0.000 claims description 5
- 230000002194 synthesizing effect Effects 0.000 claims description 4
- 230000006870 function Effects 0.000 description 29
- 229920000535 Tan II Polymers 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 230000006835 compression Effects 0.000 description 3
- 238000007906 compression Methods 0.000 description 3
- 241001080024 Telles Species 0.000 description 2
- 230000003416 augmentation Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000010363 phase shift Effects 0.000 description 2
- 238000009877 rendering Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000001066 destructive effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/02—Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
Definitions
- the present invention relates to the field of coding / decoding multichannel digital audio signals.
- the present invention relates to the parametric encoding / decoding of multichannel audio signals.
- This type of coding / decoding is based on the extraction of spatialization parameters so that at decoding, the spatial perception of the listener can be reconstituted.
- Such a coding technique is known under the name of "Binaural Cue Coding" in English (BCC) which aims on the one hand to extract and then code the indices of auditory spatialization and on the other hand to code a monophonic or stereophonic signal from a mastering of the original multi-channel signal.
- This parametric approach is a low rate coding.
- the main advantage of this coding approach is to allow a better compression rate than conventional multi-channel digital audio compression methods while ensuring the backward compatibility of the compressed format obtained with the existing coding formats and broadcasting systems.
- the invention relates more particularly to the spatial decoding of a sound scene 3 D from a reduced number of transmitted channels.
- the figure 1 describes such an encoding / decoding system in which the encoder 100 constructs a sum signal ("downmix") S s by matrixing at 110 the channels of the original multichannel signal S and delivers via an extraction module parameters 120, a reduced set of parameters P which characterize the spatial content of the original multi-channel signal.
- a sum signal (“downmix") S s by matrixing at 110 the channels of the original multichannel signal S and delivers via an extraction module parameters 120, a reduced set of parameters P which characterize the spatial content of the original multi-channel signal.
- the multichannel signal is reconstructed (S ') by a synthesis module 160 which takes into account both the sum signal and the transmitted parameters P.
- the sum signal has a reduced number of channels. These channels can be encoded by a conventional audio encoder before transmission or storage. Typically, the sum signal has two channels and is compatible with conventional stereo broadcasting. Before transmission or storage, this sum signal can thus be encoded by any conventional stereo encoder. The signal thus coded is then compatible with the devices comprising the corresponding decoder which reconstruct the sum signal while ignoring the spatial data.
- the figure 2 illustrates a first example of a coding structure or code tree using TTO blocks (TTO 0 , TTO 1 , TTO 2 , TTO 3 and TTO 4 ) to obtain a monophonic signal S from a multi-channel signal 5.1 with 6 channels (L, R, C, LFE, Ls and Rs).
- the figure 3 illustrates a second example of an encoding structure using both TTO blocks and TTT blocks to obtain a stereo signal Sl and Sr from the 5.1 signal.
- the decoding of the monophonic or stereophonic signals thus received is effected by using a decoding tree symmetrical to those represented in FIGS. Figures 2 and 3 .
- the decoding can be seen as a succession of reconstruction step.
- the first decoding step consists in reconstructing the signals corresponding to the input signals of the block TTO 0 from the sum signal S and the spatial parameters extracted by the block TTO 0
- the next step consists in reconstructing the signals corresponding to the input signals of the block TTO 1 from the signal reconstructed in the previous step and the spatial parameters extracted by the block TTO 1
- the decoding then proceeds in a similar manner until the reconstruction of all the channels encoded multi-channel signal.
- the decoder constructs a matrix making it possible to go directly from the monophonic sum signal to the 6 reconstructed channels by combining the smaller size matrices of the different blocks TTO and TTT.
- This technique consists, as shown with reference to the figure 4 , to perform a decorrelation step at 410 by filtering the sum signal to obtain a decorrelated signal d.
- the sum signal and the decorrelated signal thus obtained are then processed by a synthesis module 420 via a synthesis matrix M, as a function of the spatial parameters R and I to create the two signals I and R respecting the specified spatial parameters.
- the parameters R and I are respectively the energy ratio between the channels of the multi-channel signal and an inter-channel correlation index of the multi-channel signal channels.
- the waveform of the reconstructed signal is not controlled since it totally depends on the decorrelation experienced by the signal s.
- the signals of these different channels which have similar waveforms will interact in the rendering zone by creating constructive and destructive interferences which will make it possible to reconstruct the desired sound field.
- each TTO block decoder involved in the decoding tree uses a different decorrelation filter, the deformation of the waveform will not be the same for the different channels.
- the reconstructed channels then no longer have, as in the original signal, near waveforms and the interferences that allowed the reconstruction of the sound field during the restitution, are then no longer as in the original signal. This leads, on the one hand, to a bad spatial reconstruction of the sound stage, and on the other hand to the creation of audible artifacts, the differences in waveforms leading to the creation of perceptible noisy components.
- the present invention improves the situation.
- the method according to the invention thus makes it possible to handle the cases where a spatialization parameter situated in a predetermined value range causes such a situation.
- such a quantitative function may be a decorrelated signal energy function.
- the spatialization parameters are a parameter (R) of energy ratio between the channels of the multi-channel signal and a interchannel correlation parameter (I) of the multi-channel signal, a range of values being the range in which the intcrcanal correlation parameter is negative.
- the invention applies more particularly to multi-channel signals having negative interchannel correlations.
- a different quantitative function is chosen by value range of the spatialization parameters.
- It relates to a decoder comprising a synthesis device as described above.
- the invention also relates to multimedia equipment comprising a decoder as described above.
- such equipment may be for example a mobile phone, an electronic organizer or digital content player, a computer, a set-top box ("set-top box").
- the invention is directed to a computer program comprising code instructions for implementing the steps of the method as described above, when these instructions are executed by a processor.
- the figure 5 illustrates an embodiment of the invention. It illustrates a synthesis device for the decoding of a TTO block (TTO -1 ).
- This device comprises a decorrelation module 510, able to carry out a step of decorrelation of the received signal s which is a sum signal obtained by coding by a matrix of multichannel signals.
- This decorrelation step is for example that described in the MPEG Surround standard mentioned above.
- This decorrelated signal d and the sum signal s are taken into account in a synthesis module 520 using a matrix M Minq whose coefficients depend on spatialization parameters R and I received and producing output signals I and r.
- ⁇ is a function of R and I and is chosen according to one embodiment of the invention so as to limit the quantity of the decorrelated signal d introduced into the reconstructed signals regardless of the correlation values I, including for values negative.
- the choice of the value ⁇ can be formalized by introducing a quantitative function q relating to the quantity of decorrelated signal taken into account in the matrixing for the reconstruction of the signals.
- the quantitative function q is such that the increase in absolute value of the coefficients of the synthesis matrix applied to the decorrelated signal increases the value of the function q applied to these same coefficients.
- the quantitative function q is a function of energy of the decorrelated signal.
- the values of ⁇ guaranteeing a satisfactory reconstruction according to the embodiment of the invention described here are chosen so as to minimize the total energy of the decorrelated signal d in the reconstructed signals.
- the figure 5 represents a synthesis device for decoding a TTO block, here called TTO -1 comprising a decorrelation module 510 of the sum signal, a synthesis module 520 able to apply a synthesis matrix to the decorrelated signal and to the sum signal.
- the coefficients of this synthesis matrix are determined according to a criterion of minimization of a quantitative function q relative to the amount of decorrelated signal as described above.
- the figure 5 also illustrates the steps of the spatial synthesis method according to the invention in which from a sum signal, at least two output signals 1 and r are obtained.
- the sum signal comes from a parametric encoding by mastering a multi-channel signal also providing spatialization parameters.
- This method is such that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion for minimizing a quantitative function, relating to the quantity of decorrelated signal taken into account in the step of applying the synthesis matrix.
- the spatialization parameters are parameters designating the energy ratio R between the channels of the original multi-channel signal and an inter-channel correlation measurement of this same signal.
- parameters derived from parametric coding can also be chosen. These parameters may for example be parameters designating the phase shift between the channels of the multi-channel signal, or time envelope parameters of the audio channels.
- the figure 6 illustrates another embodiment of the invention in which, according to a value range of at least one of the spatialization parameters received, here the interchannel correlation parameter I, a different synthesis matrix is chosen.
- the example shown in figure 6 shows two types of synthesis matrix.
- the first synthesis matrix M is for example that described in the state of the art in the MPEG Surround standard.
- the corresponding synthesis module is illustrated in 630. This synthesis matrix is applied here to the sum signal and the decorrelated signal d when the parameter I is positive.
- the synthesis matrix M Minq is that described with reference to the figure 5 .
- the corresponding synthesis module is represented in 620.
- the method implemented by this embodiment makes it possible to efficiently process multi-channel signals that exhibit negative interchannel correlations.
- This type of multi-channel signal is for example a surround-type signal. Indeed, this type of signal has channels in phase opposition.
- This characteristic element of the signals from surround sound is illustrated in the articles by M. Gerzon entitled “Hierarchical System of Surround Sound Transmission for HDTV “or” Ambisonic Decoders for HDTV ".
- synthesis matrices can be provided for ranges of different values of the spatialization parameters.
- Compatibility with existing systems within a certain operating range is then maintained.
- An improvement in the quality of the synthesis in a particular range of spatialization parameter value is then provided in this embodiment.
- This type of device TTO -1 as represented in figure 5 or in figure 6 is for example integrated in a digital signal decoder. Such a type of decoder is for example illustrated with reference to the figure 7 .
- the decoder shown in this figure is typically provided for decoding 5.1 type multi-channel signals.
- this decoder comprises a plurality of devices TTO -1 (TTO 0 -1 , TTO 1 -1 , TTO 2 -1 , TTO 3 -1 , TTO 4 -1 ) according to the invention for, from a signal S received, obtain a multi-channel signal with 6 channels (L, R, C, LFE, Ls, Rs).
- the decoding module 730 comprising this plurality of synthesis devices may, of course, be differently configured depending on the coding tree that has been used for the original multi-channel signal.
- the decoder as represented in figure 7 comprises a QMF analysis module (for "quadrature Mirror Filter” in English) capable of performing a transformation of the sum (or downmix) signal S from the encoder into a frequency signal per subband.
- the signal per frequency band is then supplied to the input of the decoding module 730.
- the processed signals enter the QMF synthesis module 720 able to perform an inverse transformation and to bring back the multi-channel signal obtained. in the time domain.
- QMF analysis and QMF synthesis modules may for example be those as described in the MPEG Surround standard.
- the decoder as represented in figure 7 receives from the encoder spatialization parameters P which are derived from the parametric encoding of the original multi-channel signal.
- these parameters may be energy ratio parameters between the channels, correlation measurement between the channels or else phase shift between the channels or finally time envelope.
- This decoder 700 can be integrated into a multimedia equipment type set-top box or "set-top box", computer or mobile phone, digital content player, personal electronic organizer, etc ...
- the figure 8 represents an example of such multimedia equipment which comprises in particular an input module E adapted to receive multi-channel audio signals compressed either by a communication network for example or by means of a multi-channel sound recording.
- These multi-channel signals have been compressed by a parametric coding method which, by mastering the original signal, generates a sum signal S and spatialization parameters P.
- This coding may in an alternative mode be provided in the multimedia equipment.
- This equipment comprises one or more synthesis devices according to the invention, represented here physically by a processor PROC cooperating with a memory block BM comprising a memory storage and / or working MEM.
- the memory block may advantageously comprise a computer program comprising code instructions for implementing the steps of the method in the sense of the invention, when these instructions are executed by the processor PROC, and in particular a decorrelation step of a signal.
- a computer program comprising code instructions for implementing the steps of the method in the sense of the invention, when these instructions are executed by the processor PROC, and in particular a decorrelation step of a signal.
- the synthesis matrix is such that, for at least one value range of at least one spatialization parameter, its coefficients are determined according to a criterion of minimization of a quantitative function, relative to the quantity of decorrelated signal taken into account in the step of applying the synthesis matrix.
- the description of the figure 5 takes the steps of an algorithm of such a computer program.
- the computer program can also be stored on a memory medium readable by a reader of the device or downloadable in the memory space of the equipment.
- the memory block thus comprises the coefficients of the synthesis matrix as defined above.
- This memory block may comprise in another embodiment of the invention as described with reference to FIG. figure 6 , defining coefficients several synthesis matrices which are applied to the sum signal and to the decorrelated signal as a function of the range of values of the spatialization parameters received.
- the processor of the equipment may also include instructions for implementing the steps of analysis and synthesis of the decoder as described with reference to the figure 7 .
- the multimedia equipment as illustrated also comprises an output S for delivering the reconstructed multi-channel signal S 'either by speaker-type reproduction means or by communication means capable of transmitting this multi-channel signal.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Algebra (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
La présente invention se rapporte au domaine du codage/décodage de signaux audio numériques multicanaux.The present invention relates to the field of coding / decoding multichannel digital audio signals.
Plus particulièrement, la présente invention se rapporte au codage/décodage paramétrique de signaux audio multicanaux.More particularly, the present invention relates to the parametric encoding / decoding of multichannel audio signals.
Ce type de codage/décodage se base sur l'extraction de paramètres de spatialisation pour qu'au décodage, la perception spatiale de l'auditeur puisse être reconstituée.This type of coding / decoding is based on the extraction of spatialization parameters so that at decoding, the spatial perception of the listener can be reconstituted.
Une telle technique de codage est connu sous le nom de "Binaural Cue Coding" en anglais (BCC) qui vise d'une part à extraire puis à coder les indices de spatialisation auditive et d'autre part à coder un signal monophonique ou stéréophonique issu d'un matriçage du signal multi-canal original.Such a coding technique is known under the name of "Binaural Cue Coding" in English (BCC) which aims on the one hand to extract and then code the indices of auditory spatialization and on the other hand to code a monophonic or stereophonic signal from a mastering of the original multi-channel signal.
Cette approche paramétrique est un codage à bas débit. Le principal intérêt de cette approche de codage est de permettre un taux de compression meilleur que les méthodes classiques de compression de signaux audionumériques multicanaux tout en assurant la rétrocompatibilité du format compressé obtenu avec les formats de codage et les systèmes de diffusions déjà existants.This parametric approach is a low rate coding. The main advantage of this coding approach is to allow a better compression rate than conventional multi-channel digital audio compression methods while ensuring the backward compatibility of the compressed format obtained with the existing coding formats and broadcasting systems.
Ainsi, l'invention porte plus particulièrement sur le décodage spatial d'une scène sonore 3 D à partir d'un nombre réduit de canaux transmis.Thus, the invention relates more particularly to the spatial decoding of a sound scene 3 D from a reduced number of transmitted channels.
La norme MPEG Surround décrite dans le document de la norme MPEG ISO/IEC 23003-1:2007 et dans le document de "
La
Au décodeur 150, le signal multicanal est reconstruit (S') par un module de synthèse 160 qui prend en compte à la fois le signal somme et les paramètres P transmis.At the
Le signal somme comporte un nombre réduit de canaux. Ces canaux peuvent être codés par un codeur audio classique avant transmission ou stockage. Typiquement, le signal somme comporte deux canaux et est compatible avec une diffusion stéréo classique. Avant transmission ou stockage, ce signal somme peut ainsi être codé par n'importe quel codeur stéréo classique. Le signal ainsi codé est alors compatible avec les dispositifs comportant le décodeur correspondant qui reconstruisent le signal somme en ignorant les données spatiales.The sum signal has a reduced number of channels. These channels can be encoded by a conventional audio encoder before transmission or storage. Typically, the sum signal has two channels and is compatible with conventional stereo broadcasting. Before transmission or storage, this sum signal can thus be encoded by any conventional stereo encoder. The signal thus coded is then compatible with the devices comprising the corresponding decoder which reconstruct the sum signal while ignoring the spatial data.
La norme MPEG Surround a retenu une structure spécifique pour la représentation des données spatiales : le codeur s'appuie sur une structure arborescente de codage construite à partir d'un nombre réduit de blocs élémentaires de codage permettant chacun d'extraire des paramètres spatiaux sur un nombre réduit de canaux. Il existe deux types de bloc élémentaires de codage :
- les blocs TTO (pour "Two To One" en anglais) qui permettent d'extraire les paramètres spatiaux entre deux canaux et de construire un signal somme monophonique à partir de ces deux canaux,
- les blocs TTT (pour "Three To Two" en anglais) qui permettent d'extraire les paramètres spatiaux entre trois canaux et de construire un signal somme contenant deux canaux à partir de ces trois canaux.
- the TTO blocks (for "Two To One" in English) which make it possible to extract the spatial parameters between two channels and to construct a monophonic sum signal from these two channels,
- the TTT blocks (for "Three To Two" in English) which make it possible to extract the spatial parameters between three channels and to construct a sum signal containing two channels from these three channels.
La
La
Le décodage des signaux monophoniques ou stéréophoniques ainsi reçus s'effectue en utilisant un arbre de décodage symétrique à ceux représentés aux
Ainsi, pour le décodage d'un signal encodé selon l'arbre de la
Dans ce cas la première étape de décodage consiste à reconstruire les signaux correspondant aux signaux d'entrée du bloc TTO0 à partir du signal somme S et des paramètres spatiaux extraits par le bloc TTO0, l'étape suivante consiste alors à reconstruire les signaux correspondant aux signaux d'entrée du bloc TTO1 à partir du signal reconstruit à l'étape précédente et des paramètres spatiaux extraits par le bloc TTO1, le décodage se poursuit ensuite de manière similaire jusqu'à la reconstruction de l'ensemble des canaux du signal multi-canal codé. En pratique, le décodeur construit une matrice permettant de passer directement du signal somme monophonique aux 6 canaux reconstruits par combinaison des matrices de taille inférieure des différents blocs TTO et TTT.In this case, the first decoding step consists in reconstructing the signals corresponding to the input signals of the block TTO 0 from the sum signal S and the spatial parameters extracted by the block TTO 0 , the next step then consists in reconstructing the signals corresponding to the input signals of the block TTO 1 from the signal reconstructed in the previous step and the spatial parameters extracted by the block TTO 1 , the decoding then proceeds in a similar manner until the reconstruction of all the channels encoded multi-channel signal. In practice, the decoder constructs a matrix making it possible to go directly from the monophonic sum signal to the 6 reconstructed channels by combining the smaller size matrices of the different blocks TTO and TTT.
La technique retenue dans la norme MPEG Surround pour le décodage des blocs TTO impose cependant une limitation très pénalisante pour le codage de signaux multicanaux comportant des canaux en opposition de phase.The technique used in the MPEG Surround standard for decoding TTO blocks, however, imposes a very disadvantageous limitation for the coding of multichannel signals comprising channels in phase opposition.
Cette technique de décodage est plus précisément décrite dans la demande de brevet intitulée "signal synthesizing" publiée sous le numéro
Cette technique consiste, comme représenté en référence à la
Le matriçage des signaux s et d se fait suivant les relations suivantes :
Or ce matriçage présente la limitation mentionnée ci-dessus et qui rend cette méthode inadaptée au codage des signaux audio multicanaux présentant des corrélations intercanal négatives.However, this matrixing has the limitation mentioned above and makes this method unsuitable for coding multichannel audio signals with negative inter-channel correlations.
En particulier, une telle technique n'est pas adaptée pour le décodage des signaux ambiophoniques qui comportent des oppositions de phase entre canaux.In particular, such a technique is not suitable for decoding the surround signals which comprise phase oppositions between channels.
En effet, lorsque la corrélation intercanal I est négative, et en particulier lorsqu'elle est proche de -1, la proportion de signal décorrélé utilisée pour synthétiser les signaux l et r devient très importante, dépassant dans certains cas de figure nettement la quantité de signal somme s utilisée. Dans le cas le plus problématique, on peut constater que pour une différence intercanal de niveau de 0dB, c'est à dire pour R=1, lorsque la corrélation intercanal I tend vers -1, la matrice de mixage tend vers la matrice suivante :
Cette matrice correspond à des signaux reconstruits
Le problème de reconstruction illustré dans l'exemple précédent dans un cas extrême se présente également pour d'autres valeurs de R et I, et est d'autant plus marqué que I est proche de -1. Ainsi, la forme d'onde des canaux reconstruits n'est dans ces cas pas aussi proche qu'elle pourrait l'être des signaux originaux, ce qui limite inutilement la qualité des signaux reconstruits.The reconstruction problem illustrated in the previous example in an extreme case is also present for other values of R and I, and is even more so. marked that I is close to -1. Thus, the waveform of the reconstructed channels is in these cases not as close as it could be the original signals, which unnecessarily limits the quality of the reconstructed signals.
L'effet de cette limitation est encore plus marqué lorsque le signal présente plusieurs canaux ayant des corrélations intercanal proches de -1. Dans ce cas, plus de deux canaux ont des formes d'onde proches, mais certains d'entre eux sont en opposition de phase.The effect of this limitation is even more pronounced when the signal has several channels with interchanal correlations close to -1. In this case, more than two channels have near waveforms, but some of them are out of phase.
Lors de la restitution du signal multi-canal original, les signaux de ces différents canaux qui ont des formes d'ondes proches vont interagir dans la zone de restitution en créant des interférences constructives et destructives qui vont permettre de reconstruire le champ sonore souhaité.During the rendering of the original multi-channel signal, the signals of these different channels which have similar waveforms will interact in the rendering zone by creating constructive and destructive interferences which will make it possible to reconstruct the desired sound field.
Après décodage, la forme d'onde des canaux sera fortement déformée en raison du problème évoqué précédemment.After decoding, the waveform of the channels will be strongly deformed due to the problem mentioned above.
De plus comme chaque décodeur de bloc TTO intervenant dans l'arbre de décodage, utilise un filtre de décorrélation différent, la déformation de la forme d'onde ne sera pas la même pour les différents canaux.In addition, since each TTO block decoder involved in the decoding tree uses a different decorrelation filter, the deformation of the waveform will not be the same for the different channels.
Les canaux reconstruits n'ont alors plus, comme dans le signal original, des formes d'onde proches et les interférences qui permettaient la reconstruction du champ sonore lors de la restitution, ne se font alors plus comme dans le signal original. Ceci aboutit d'une part à une mauvaise reconstruction spatiale de la scène sonore, et d'autre part à la création d'artefacts audibles, les différences de forme d'onde entraînant la création de composantes bruitées perceptibles.The reconstructed channels then no longer have, as in the original signal, near waveforms and the interferences that allowed the reconstruction of the sound field during the restitution, are then no longer as in the original signal. This leads, on the one hand, to a bad spatial reconstruction of the sound stage, and on the other hand to the creation of audible artifacts, the differences in waveforms leading to the creation of perceptible noisy components.
La présente invention vient améliorer la situation.The present invention improves the situation.
A cet effet, la présente invention propose un procédé de synthèse spatiale d'un signal somme pour obtenir au moins deux signaux de sortie, le signal somme ainsi que des paramètres de spatialisation étant issus d'un codage paramétrique par matriçage d'un signal multi-canal original. Le procédé comporte les étapes de:
- décorrélation du signal somme pour obtenir un signal décorrélé;
- application d'une matrice de synthèse dont les coefficients dépendent des paramètres de spatialisation, au signal décorrélé et au signal somme pour obtenir lesdits signaux de sortie,
- decorrelation of the sum signal to obtain a decorrelated signal;
- applying a synthesis matrix whose coefficients depend on the spatialization parameters, the decorrelated signal and the sum signal to obtain said output signals,
Ainsi, la prise en compte de la quantité de signal décorrélé dans chacun des signaux et donc dans l'étape de synthèse du signal, permet de s'affranchir du cas de figure mentionné précédemment où seul le signal décorrélé intervient dans le matriçage de synthèse. Le procédé selon l'invention permet ainsi de traiter les cas où un paramètre de spatialisation situé dans une plage de valeur prédéterminée entraine une telle situation.Thus, taking into account the amount of decorrelated signal in each of the signals and therefore in the signal synthesis step, makes it possible to dispense with the case mentioned above where only the decorrelated signal is involved in the synthesis matrix. The method according to the invention thus makes it possible to handle the cases where a spatialization parameter situated in a predetermined value range causes such a situation.
La minimisation une telle fonction quantitative permet de définir des coefficients de la matrice de synthèse qui permettent d'assurer un bon respect de la forme d'onde du signal d'entrée dans les signaux de sortie.Minimizing such a quantitative function makes it possible to define coefficients of the synthesis matrix that make it possible to ensure good compliance with the waveform of the input signal in the output signals.
Plus particulièrement et de façon simple, une telle fonction quantitative peut-être une fonction d'énergie du signal décorrélé.More particularly and simply, such a quantitative function may be a decorrelated signal energy function.
Cette fonction respecte bien les caractéristiques mentionnée précédemment.This function meets the characteristics mentioned above.
D'une façon plus générale, la fonction quantitative est du type:
Dans un mode particulier de réalisation, les paramètres de spatialisation sont un paramètre (R) de rapport d'énergie entre les canaux du signal multi-canal et un paramètre (I) de corrélation intercanal du signal multi-canal, une plage de valeur étant la plage dans laquelle le paramètre de corrélation intcrcanal est négatif.In a particular embodiment, the spatialization parameters are a parameter (R) of energy ratio between the channels of the multi-channel signal and a interchannel correlation parameter (I) of the multi-channel signal, a range of values being the range in which the intcrcanal correlation parameter is negative.
Ainsi, l'invention s'applique plus particulièrement pour les signaux multi-canaux présentant des corrélations intercanal négatives.Thus, the invention applies more particularly to multi-channel signals having negative interchannel correlations.
Elle peut donc être mise en oeuvre uniquement pour les valeurs du paramètre de corrélation intercanal négative ou pour toute valeur de ce paramètre.It can therefore be implemented only for the values of the negative inter-channel correlation parameter or for any value of this parameter.
Dans un autre mode de réalisation, une fonction quantitative différente est choisie par plage de valeur des paramètres de spatialisation.In another embodiment, a different quantitative function is chosen by value range of the spatialization parameters.
Il est alors possible de moduler l'importance relative que l'on veut donner aux différentes matrices de synthèse. Il est ainsi possible de donner un poids important à une matrice telle que définie dans l'état de l'art, pour une plage particulière de paramètres et inversement de donner un poids important à la matrice de synthèse au sens de l'invention pour une autre plage de paramètre. Ainsi, on peut conserver une compatibilité avec les systèmes existants dans une certaine plage de fonctionnement et améliorer la qualité du système dans une plage particulière. De plus, la possibilité d'utiliser plusieurs matrices de synthèse obtenues selon différents critères permet d'optimiser la qualité globale du système pour toute la plage de fonctionnement.It is then possible to modulate the relative importance that we want to give to the different synthesis matrices. It is thus possible to give a significant weight to a matrix as defined in the state of the art, for a particular range of parameters and conversely to give a significant weight to the synthesis matrix within the meaning of the invention for a other parameter range. Thus, one can maintain compatibility with existing systems within a certain operating range and improve the quality of the system in a particular range. In addition, the possibility of using several synthesis matrices obtained according to different criteria makes it possible to optimize the overall quality of the system for the entire operating range.
L'invention se rapporte également à un dispositif de synthèse spatiale d'un signal somme générant au moins deux signaux de sortie, le signal somme ainsi que des paramètres de spatialisation étant issus d'un dispositif de codage paramétrique mettant en oeuvre un matriçage d'un signal multi-canal original. Le dispositif comportant:
- des moyens de décorrélation (510) du signal somme pour obtenir un signal décorrélé;
- des moyens d'application (520) d'une matrice de synthèse (M Minq) dont les coefficients dépendent des paramètres de spatialisation, au signal décorrélé et au signal somme pour obtenir lesdits signaux de sortie,
- decorrelation means (510) of the sum signal for obtaining a decorrelated signal;
- application means (520) of a synthesis matrix (M Minq) whose coefficients depend on the spatialization parameters, the decorrelated signal and the sum signal to obtain said output signals,
Elle se rapporte à un décodeur comportant un dispositif de synthèse tel que décrit ci-dessus.It relates to a decoder comprising a synthesis device as described above.
L'invention vise aussi un équipement multimédia comportant un décodeur tel que décrit ci-dessus.The invention also relates to multimedia equipment comprising a decoder as described above.
De façon non limitative, un tel équipement peut-être par exemple un téléphone mobile, un agenda électronique ou lecteur de contenu numérique, un ordinateur, un décodeur de salon ("set-top box").Without limitation, such equipment may be for example a mobile phone, an electronic organizer or digital content player, a computer, a set-top box ("set-top box").
Enfin, l'invention vise un programme informatique comportant des instructions de code pour la mise en oeuvre des étapes du procédé tel que décrit ci-dessus, lorsque ces instructions sont exécutées par un processeur.Finally, the invention is directed to a computer program comprising code instructions for implementing the steps of the method as described above, when these instructions are executed by a processor.
D'autres caractéristiques et avantages de l'invention apparaîtront plus clairement à la lecture de la description suivante, donnée uniquement à titre d'exemple non limitatif, et faite en référence aux dessins annexés, sur lesquels:
- la
figure 1 illustre un système de codage/décodage paramétrique classique de l'état de l'art tel que décrit précédemment; - les
figures 2 illustrent des exemples d'arbres de codage tels que décrits précédemment, selon la norme MPEG Surround dans le cas d'un signal multi-canal de type 5.1;et 3 - la
figure 4 illustre un système de décodage de l'état de l'art d'un bloc TTO tel que décrit précédemment; - la
figure 5 illustre un dispositif de synthèse selon l'invention pour le décodage d'un bloc TTO; - la
figure 6 illustre un dispositif de synthèse pour le décodage d'un bloc TTO selon un mode particulier de réalisation; - la
figure 7 illustre un décodeur selon l'invention dans le cas de signaux multicanaux de type 5.1; et - la
figure 8 illustre un exemple d'équipement multimédia comportant au moins un dispositif de synthèse selon l'invention.
- the
figure 1 illustrates a conventional parametric coding / decoding system of the state of the art as described above; - the
Figures 2 and 3 illustrate examples of encoding trees as described above, according to the MPEG Surround standard in the case of a 5.1 type multi-channel signal; - the
figure 4 illustrates a decoding system of the state of the art of a TTO block as described above; - the
figure 5 illustrates a synthesis device according to the invention for decoding a TTO block; - the
figure 6 illustrates a synthesis device for decoding a TTO block according to a particular embodiment; - the
figure 7 illustrates a decoder according to the invention in the case of multichannel signals of type 5.1; and - the
figure 8 illustrates an example of multimedia equipment comprising at least one synthesis device according to the invention.
La
Cette étape de décorrélation est par exemple celle décrite dans la norme MPEG Surround citée précédemment.This decorrelation step is for example that described in the MPEG Surround standard mentioned above.
Ce signal décorrélé d et le signal somme s sont pris en compte dans un module de synthèse 520 utilisant une matrice M Minq dont les coefficients dépendent de paramètres de spatialisation R et I reçus et produisant des signaux de sortie l et r.This decorrelated signal d and the sum signal s are taken into account in a
Plus précisément, les signaux l et r sont générés par le matriçage suivant:
- l'énergie totale est conservée, c'est-à-dire:
- le rapport d'énergie entre l et r vaut R, c'est-à-dire:
- l'intercorrélation normalisée entre l et r vaut I, c'est-à-dire:
- the total energy is conserved, that is to say:
- the energy ratio between l and r is R, that is to say:
- the standardized cross-correlation between l and r is equal to I, that is to say:
En utilisant les deux premières conditions, on a
Les solutions peuvent donc s'écrire sous la forme :
La troisième condition, s'écrit alors :
On voit donc que les matrices solutions du problème sont l'ensemble des matrices paramétrées par β ∈ [0, 2π) de la forme :
Ainsi, deux valeurs de a sont possibles. La valeur de β est fonction de R et I et est choisi selon un mode de réalisation de l'invention de manière à limiter la quantité du signal décorrélé d introduite dans les signaux reconstruits quelque soit les valeurs de corrélation I, y compris pour des valeurs négatives.Thus, two values of a are possible. The value of β is a function of R and I and is chosen according to one embodiment of the invention so as to limit the quantity of the decorrelated signal d introduced into the reconstructed signals regardless of the correlation values I, including for values negative.
Ainsi, le choix de la valeur β peut être formalisé en introduisant une fonction quantitative q relative à la quantité de signal décorrélé pris en compte dans le matriçage pour la reconstruction des signaux.Thus, the choice of the value β can be formalized by introducing a quantitative function q relating to the quantity of decorrelated signal taken into account in the matrixing for the reconstruction of the signals.
D'une façon générale, la fonction quantitative q est telle que l'augmentation en valeur absolue des coefficients de la matrice de synthèse appliqués au signal décorrélé, fait augmenter la valeur de la fonction q appliquée à ces mêmes coefficients.In general, the quantitative function q is such that the increase in absolute value of the coefficients of the synthesis matrix applied to the decorrelated signal increases the value of the function q applied to these same coefficients.
Ainsi, cette fonction quantitative q est telle qu'elle vérifie les conditions suivantes :
- pour tous réels x, x', y si |x'| ≥ |x| alors q (x', y) ≥ q(x, y)
- et symétriquement pour tous réels x, y, y' si |y'|≥|y| alors q(x, y') ≥ q(x, y).
- for all real x, x ', y if | x ' | ≥ | x | then q ( x ' , y ) ≥ q ( x, y )
- and symmetrically for all real x, y, y 'si | y ' | ≥ | y | then q ( x, y ' ) ≥ q ( x, y ).
Pour I et R fixés, la valeur de β est alors choisie par minimisation de la fonction :
De nombreuses fonctions quantitative respectant les conditions décrites ci-dessus peuvent être choisies et permettront d'effectuer un choix satisfaisants de β.Numerous quantitative functions complying with the conditions described above can be chosen and will make it possible to make a satisfactory choice of β.
Ainsi, la fonction q peut-être par exemple de type:
avec p un entier supérieur ou égal à 1.Thus, the function q can for example be of type:
with p an integer greater than or equal to 1.
Dans un mode particulier de réalisation, la fonction quantitative q est une fonction d'énergie du signal décorrélé.In a particular embodiment, the quantitative function q is a function of energy of the decorrelated signal.
La fonction q est donc telle que:
Ainsi, les valeurs de β garantissant une reconstruction satisfaisante selon le mode de réalisation de l'invention décrit ici sont choisies de façon à minimiser l'énergie totale du signal décorrélé d dans les signaux reconstruits.Thus, the values of β guaranteeing a satisfactory reconstruction according to the embodiment of the invention described here are chosen so as to minimize the total energy of the decorrelated signal d in the reconstructed signals.
On cherche alors β minimisant :
La dérivée de g est :
Elle s'annule lorsque:
La valeur de β retenue est donc choisie parmi les valeurs vérifiant
Ainsi, la
La
Le procédé mis en oeuvre par le dispositif de synthèse comporte les étapes de:
- décorrélation (Décorr.) du signal somme pour obtenir un signal décorrélé d;
- application (Synth.) d'une matrice de synthèse (M Minq) dont les coefficients dépendent des paramètres de spatialisation (I, R), au signal décorrélé (d) et au signal somme (s) pour obtenir lesdits signaux de sortie.
- decorrelation (decorr.) of the sum signal to obtain a decorrelated signal d;
- application (Synth.) of a synthesis matrix (M Minq) whose coefficients depend on the spatialization parameters (I, R), the decorrelated signal (d) and the sum signal (s) to obtain the said output signals.
Ce procédé est tel que pour au moins une plage de valeur d'au moins un paramètre de spatialisation, les coefficients de la matrice de synthèse sont déterminés selon un critère de minimisation d'une fonction quantitative, relative à la quantité de signal décorrélé pris en compte dans l'étape d'application de la matrice de synthèse.This method is such that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion for minimizing a quantitative function, relating to the quantity of decorrelated signal taken into account in the step of applying the synthesis matrix.
Dans le mode de réalisation décrit précédemment en référence à la
D'autres paramètres de spatialisation issus du codage paramétrique peuvent également être choisis. Ces paramètres peuvent par exemple être des paramètres désignant le déphasage entre les canaux du signal multi-canal, ou des paramètres d'enveloppe temporelle des canaux audio.Other spatialization parameters derived from parametric coding can also be chosen. These parameters may for example be parameters designating the phase shift between the channels of the multi-channel signal, or time envelope parameters of the audio channels.
La
L'exemple illustré à la
La première matrice de synthèse M est par exemple celle décrite dans l'état de l'art dans la norme MPEG Surround. Le module de synthèse correspondant est illustré en 630. Cette matrice de synthèse est appliquée ici au signal somme s et au signal décorrélé d lorsque le paramètre I est positif.The first synthesis matrix M is for example that described in the state of the art in the MPEG Surround standard. The corresponding synthesis module is illustrated in 630. This synthesis matrix is applied here to the sum signal and the decorrelated signal d when the parameter I is positive.
Lorsque le paramètre I est négatif, la matrice de synthèse M Minq est celle décrite en référence à la
Ainsi, le procédé mis en oeuvre par ce mode de réalisation permet de traiter efficacement des signaux multi-canaux qui présentent des corrélations intercanal négatives.Thus, the method implemented by this embodiment makes it possible to efficiently process multi-channel signals that exhibit negative interchannel correlations.
Ce type de signal multi-canal est par exemple un signal de type ambiophonique. En effet, ce type de signal présente des canaux en opposition de phase. Cet élément caractéristique des signaux issus d'une prise de son ambiophonique est illustré dans les articles de M. Gerzon intitulés "Hierarchical System of Surround Sound Transmission for HDTV" ou "Ambisonic Decoders for HDTV".This type of multi-channel signal is for example a surround-type signal. Indeed, this type of signal has channels in phase opposition. This characteristic element of the signals from surround sound is illustrated in the articles by M. Gerzon entitled "Hierarchical System of Surround Sound Transmission for HDTV "or" Ambisonic Decoders for HDTV ".
Dans une variante de réalisation, plusieurs matrices de synthèse peuvent être prévues pour des plages de valeurs différentes des paramètres de spatialisation.In an alternative embodiment, several synthesis matrices can be provided for ranges of different values of the spatialization parameters.
Ainsi, on peut moduler l'importance relative que l'on veut donner aux différentes matrices de synthèse en fonction des valeurs de paramètres reçus.Thus, it is possible to modulate the relative importance that we want to give to the different synthesis matrices as a function of the received parameter values.
Par exemple, il est ainsi possible de donner un poids important à une matrice M telle que décrite dans l'état de l'art pour une plage particulière de paramètres et inversement de donner un poids important à la matrice de synthèse MMinq au sens de l'invention pour une autre plage de paramètre.For example, it is thus possible to give a significant weight to a matrix M as described in the state of the art for a particular range of parameters and conversely to give a significant weight to the synthesis matrix MMinq within the meaning of invention for another parameter range.
La compatibilité avec les systèmes existants dans une certaine plage de fonctionnement est alors conservée. Une amélioration de la qualité de la synthèse dans une plage particulière de valeur de paramètres de spatialisation est alors apportée dans ce mode de réalisation.Compatibility with existing systems within a certain operating range is then maintained. An improvement in the quality of the synthesis in a particular range of spatialization parameter value is then provided in this embodiment.
De plus, la possibilité d'utiliser plusieurs matrices de synthèse obtenues selon différents critères permet d'optimiser la qualité globale de la synthèse pour toute la plage de fonctionnement.In addition, the possibility of using several synthesis matrices obtained according to different criteria makes it possible to optimize the overall quality of the synthesis for the entire operating range.
On peut par exemple utiliser différentes matrices de synthèse selon que la valeur d'au moins un paramètre de spatialisation est faible ou au contraire importante.For example, it is possible to use different synthesis matrices depending on whether the value of at least one spatialization parameter is small or, on the contrary, important.
Ainsi dans cette variante du mode de réalisation, on utilisera deux matrices de synthèse telles que pour des valeurs positives de l'indice de corrélation I, on utilisera la matrice M telle que décrit dans l'état de l'art, et pour des valeurs négative de l'indice de corrélation I, on utilisera la matrice MMinq.Thus in this variant of the embodiment, use will be made of two synthetic matrices such that for positive values of the correlation index I, the matrix M as described in the state of the art, and for values negative of the correlation index I, we will use the matrix MMinq.
On pourra aussi définir différentes plages de fonctionnement comme par exemple:
- pour I > 0, on utilise une matrice Minter = M
- pour 0≥ I>-0.25, on utilisera une interpolation des deux matrices Minter = α M + (1- α) MMinq
- pour -0.25≥ I>-1, on utilisera la matrice Minter = MMinq
- for I> 0, we use a matrix Minter = M
- for 0≥ I> -0.25, we will use an interpolation of the two matrices Minter = α M + (1- α) MMinq
- for -0.25≥ I> -1, we will use the matrix Minter = MMinq
Ce type de dispositif TTO-1 tel que représenté en
Le décodeur représenté sur cette figure est typiquement prévu pour décoder des signaux multi-canaux de type 5.1. Ainsi, ce décodeur comporte une pluralité de dispositif TTO-1 (TTO0 -1, TTO1 -1, TTO2 -1, TTO3 -1, TTO4 -1) selon l'invention pour, à partir d'un signal S reçu, obtenir un signal multi-canal comportant 6 canaux (L, R, C, LFE, Ls, Rs).The decoder shown in this figure is typically provided for decoding 5.1 type multi-channel signals. Thus, this decoder comprises a plurality of devices TTO -1 (TTO 0 -1 , TTO 1 -1 , TTO 2 -1 , TTO 3 -1 , TTO 4 -1 ) according to the invention for, from a signal S received, obtain a multi-channel signal with 6 channels (L, R, C, LFE, Ls, Rs).
Le module de décodage 730 comportant cette pluralité de dispositif de synthèse peut, bien évidemment, être configuré de façon différente selon l'arbre de codage qui a été utilisé pour le signal multi-canal original.The
Le décodeur tel que représenté en
Ces modules d'analyse QMF et de synthèse QMF peuvent par exemple être ceux tels que décrits dans la norme MPEG Surround.These QMF analysis and QMF synthesis modules may for example be those as described in the MPEG Surround standard.
Le décodeur tel que représenté en
Typiquement, ces paramètres peuvent être des paramètres de rapport d'énergie entre les canaux, de mesure de corrélation entre les canaux ou encore de déphasage entre les canaux ou enfin d'enveloppe temporelle.Typically, these parameters may be energy ratio parameters between the channels, correlation measurement between the channels or else phase shift between the channels or finally time envelope.
Ce décodeur 700 peut être intégré à un équipement multimédia de type décodeur de salon ou "set-top box", ordinateur ou encore téléphone mobile, lecteur de contenu numérique, agenda électronique personnel, etc...This
La
Ces signaux multi-canaux ont été compressés par une méthode de codage paramétrique qui par matriçage du signal original génère un signal somme S et des paramètres de spatialisation P. Ce codage peut dans un mode alternatif être prévu dans l'équipement multimédia.These multi-channel signals have been compressed by a parametric coding method which, by mastering the original signal, generates a sum signal S and spatialization parameters P. This coding may in an alternative mode be provided in the multimedia equipment.
Cet équipement comporte un ou plusieurs dispositifs de synthèse selon l'invention représentés matériellement ici par un processeur PROC coopérant avec un bloc mémoire BM comportant une mémoire de stockage et/ou de travail MEM.This equipment comprises one or more synthesis devices according to the invention, represented here physically by a processor PROC cooperating with a memory block BM comprising a memory storage and / or working MEM.
Le bloc mémoire peut avantageusement comporter un programme informatique comportant des instructions de code pour la mise en oeuvre des étapes du procédé au sens de l'invention, lorsque ces instructions sont exécutées par le processeur PROC, et notamment une étape de décorrélation d'un signal somme reçu pour obtenir un signal décorrélé et une étape d'application d'une matrice de synthèse dont les coefficients dépendent des paramètres de spatialisation, au signal décorrélé et au signal somme pour obtenir au moins deux signaux de sortie. La matrice de synthèse est telle que , pour au moins une plage de valeur d'au moins un paramètre de spatialisation, ses coefficients sont déterminés selon un critère de minimisation d'une fonction quantitative, relative à la quantité de signal décorrélé pris en compte dans l'étape d'application de la matrice de synthèse.The memory block may advantageously comprise a computer program comprising code instructions for implementing the steps of the method in the sense of the invention, when these instructions are executed by the processor PROC, and in particular a decorrelation step of a signal. sum received to obtain a decorrelated signal and a step of applying a synthesis matrix whose coefficients depend on the spatialization parameters, the decorrelated signal and the sum signal to obtain at least two output signals. The synthesis matrix is such that, for at least one value range of at least one spatialization parameter, its coefficients are determined according to a criterion of minimization of a quantitative function, relative to the quantity of decorrelated signal taken into account in the step of applying the synthesis matrix.
Typiquement, la description de la
Le bloc mémoire comporte ainsi les coefficients de la matrice de synthèse telle que définie ci-dessus.The memory block thus comprises the coefficients of the synthesis matrix as defined above.
Ce bloc mémoire peut comporter dans un autre mode de réalisation de l'invention tel que décrit en référence à la
De même le processeur de l'équipement peut également comporter des instructions pour la mise en oeuvre des étapes d'analyse et de synthèse du décodeur tel que décrit en référence à la
L'équipement multimédia tel qu'illustré comporte également une sortie S pour délivrer le signal multi-canal reconstruit S' soit par des moyens de restitution de type haut-parleur soit par des moyens de communication apte à transmettre ce signal multi-canal.The multimedia equipment as illustrated also comprises an output S for delivering the reconstructed multi-channel signal S 'either by speaker-type reproduction means or by communication means capable of transmitting this multi-channel signal.
Claims (9)
- Method for spatially synthesizing a sum signal to obtain at least two output digital audio signals, the sum signal together with spatialization parameters being output by a parametric coding by matrixing of an original multi-channel digital audio signal, the method comprising the steps of:- decorrelation (Decorr.) of the sum signal (s) to obtain a decorrelated signal (d);- application (Synth.) of a synthesis matrix (M Minq) whose coefficients depend on the spatialization parameters (R, I), to the decorrelated signal and to the sum signal so as to obtain said output signals,characterized in that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion for minimizing a quantitative function (q), relating to the quantity of decorrelated signal in each of the output signals obtained by the step of applying the synthesis matrix, the quantitative function being such that the increase in absolute value of the coefficients of the synthesis matrix that are applied to the decorrelated signal increases the value of said function applied to these same coefficients.
- Method according to Claim 1, characterized in that the quantitative function is an energy function of the decorrelated signal.
- Method according to Claim 1, characterized in that the spatialization parameters are a parameter (R) of energy ratio between the channels of the multi-channel signal and a parameter (I) of interchannel correlation of the multi-channel signal, a value range being the range in which the interchannel correlation parameter is negative.
- Method according to Claim 1, characterized in that a different quantitative function is chosen per value range of the spatialization parameters.
- Device for spatially synthesizing a sum signal generating at least two output digital audio signals, the sum signal together with spatialization parameters being output by a parametric coding device implementing a matrixing of an original multi-channel digital audio signal, the device comprising:- means (510) for decorrelating the sum signal to obtain a decorrelated signal;- means (520) for applying a synthesis matrix (M Minq) whose coefficients depend on the spatialization parameters, to the decorrelated signal and to the sum signal so as to obtain said output signals,characterized in that for at least one value range of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion for minimizing a quantitative function, relating to the quantity of decorrelated signal in each of the output signals obtained by the means for applying the synthesis matrix, the quantitative function being such that the increase in absolute value of the coefficients of the synthesis matrix that are applied to the decorrelated signal increases the value of said function applied to these same coefficients.
- Digital audio signal decoder comprising at least one synthesis device according to Claim 6.
- Multimedia appliance comprising a decoder according to Claim 7.
- Computer program comprising code instructions for the implementation of the steps of the method according to one of Claims 1 to 5, when these instructions are executed by a processor.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0854282 | 2008-06-26 | ||
PCT/FR2009/051146 WO2010004155A1 (en) | 2008-06-26 | 2009-06-16 | Spatial synthesis of multichannel audio signals |
Publications (2)
Publication Number | Publication Date |
---|---|
EP2304721A1 EP2304721A1 (en) | 2011-04-06 |
EP2304721B1 true EP2304721B1 (en) | 2012-05-09 |
Family
ID=40328191
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP09794018A Active EP2304721B1 (en) | 2008-06-26 | 2009-06-16 | Spatial synthesis of multichannel audio signals |
Country Status (7)
Country | Link |
---|---|
US (1) | US8583424B2 (en) |
EP (1) | EP2304721B1 (en) |
JP (1) | JP5366104B2 (en) |
CN (1) | CN102077276B (en) |
AT (1) | ATE557386T1 (en) |
ES (1) | ES2387867T3 (en) |
WO (1) | WO2010004155A1 (en) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2362375A1 (en) * | 2010-02-26 | 2011-08-31 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | Apparatus and method for modifying an audio signal using harmonic locking |
EP2369861B1 (en) * | 2010-03-25 | 2016-07-27 | Nxp B.V. | Multi-channel audio signal processing |
KR101756838B1 (en) | 2010-10-13 | 2017-07-11 | 삼성전자주식회사 | Method and apparatus for down-mixing multi channel audio signals |
KR101842257B1 (en) * | 2011-09-14 | 2018-05-15 | 삼성전자주식회사 | Method for signal processing, encoding apparatus thereof, and decoding apparatus thereof |
CN104885150B (en) * | 2012-08-03 | 2019-06-28 | 弗劳恩霍夫应用研究促进协会 | The decoder and method of the universal space audio object coding parameter concept of situation are mixed/above mixed for multichannel contracting |
EP2717263B1 (en) | 2012-10-05 | 2016-11-02 | Nokia Technologies Oy | Method, apparatus, and computer program product for categorical spatial analysis-synthesis on the spectrum of a multichannel audio signal |
EP2830334A1 (en) | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals |
CA2919080C (en) * | 2013-07-22 | 2018-06-05 | Sascha Disch | Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals |
TWI671734B (en) | 2013-09-12 | 2019-09-11 | 瑞典商杜比國際公司 | Decoding method, encoding method, decoding device, and encoding device in multichannel audio system comprising three audio channels, computer program product comprising a non-transitory computer-readable medium with instructions for performing decoding m |
WO2016108655A1 (en) | 2014-12-31 | 2016-07-07 | 한국전자통신연구원 | Method for encoding multi-channel audio signal and encoding device for performing encoding method, and method for decoding multi-channel audio signal and decoding device for performing decoding method |
KR20160081844A (en) * | 2014-12-31 | 2016-07-08 | 한국전자통신연구원 | Encoding method and encoder for multi-channel audio signal, and decoding method and decoder for multi-channel audio signal |
FR3048808A1 (en) * | 2016-03-10 | 2017-09-15 | Orange | OPTIMIZED ENCODING AND DECODING OF SPATIALIZATION INFORMATION FOR PARAMETRIC CODING AND DECODING OF A MULTICANAL AUDIO SIGNAL |
CN111407268B (en) * | 2020-03-27 | 2021-05-14 | 华南理工大学 | Multichannel electroencephalogram signal compression method based on correlation function |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5956674A (en) * | 1995-12-01 | 1999-09-21 | Digital Theater Systems, Inc. | Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels |
US5835375A (en) * | 1996-01-02 | 1998-11-10 | Ati Technologies Inc. | Integrated MPEG audio decoder and signal processor |
DE19632734A1 (en) * | 1996-08-14 | 1998-02-19 | Thomson Brandt Gmbh | Method and device for generating a multi-tone signal from a mono signal |
US6199039B1 (en) * | 1998-08-03 | 2001-03-06 | National Science Council | Synthesis subband filter in MPEG-II audio decoding |
US7006636B2 (en) * | 2002-05-24 | 2006-02-28 | Agere Systems Inc. | Coherence-based audio coding and synthesis |
DE60311794T2 (en) | 2002-04-22 | 2007-10-31 | Koninklijke Philips Electronics N.V. | SIGNAL SYNTHESIS |
TWI315828B (en) * | 2002-08-07 | 2009-10-11 | Dolby Lab Licensing Corp | Audio channel spatial translation |
KR100923297B1 (en) * | 2002-12-14 | 2009-10-23 | 삼성전자주식회사 | Method for encoding stereo audio, apparatus thereof, method for decoding audio stream and apparatus thereof |
-
2009
- 2009-06-16 CN CN200980124551.6A patent/CN102077276B/en active Active
- 2009-06-16 JP JP2011515543A patent/JP5366104B2/en active Active
- 2009-06-16 WO PCT/FR2009/051146 patent/WO2010004155A1/en active Application Filing
- 2009-06-16 AT AT09794018T patent/ATE557386T1/en active
- 2009-06-16 US US12/996,406 patent/US8583424B2/en active Active
- 2009-06-16 EP EP09794018A patent/EP2304721B1/en active Active
- 2009-06-16 ES ES09794018T patent/ES2387867T3/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN102077276A (en) | 2011-05-25 |
US20110106543A1 (en) | 2011-05-05 |
US8583424B2 (en) | 2013-11-12 |
EP2304721A1 (en) | 2011-04-06 |
WO2010004155A1 (en) | 2010-01-14 |
ATE557386T1 (en) | 2012-05-15 |
ES2387867T3 (en) | 2012-10-03 |
CN102077276B (en) | 2014-04-09 |
JP2011525999A (en) | 2011-09-29 |
JP5366104B2 (en) | 2013-12-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2304721B1 (en) | Spatial synthesis of multichannel audio signals | |
EP2374123B1 (en) | Improved encoding of multichannel digital audio signals | |
EP2374124B1 (en) | Advanced encoding of multi-channel digital audio signals | |
EP2005420B1 (en) | Device and method for encoding by principal component analysis a multichannel audio signal | |
EP2002424B1 (en) | Device and method for scalable encoding of a multichannel audio signal based on a principal component analysis | |
EP1600042B1 (en) | Method for the treatment of compressed sound data for spatialization | |
EP2539892B1 (en) | Multichannel audio stream compression | |
EP2042001B1 (en) | Binaural spatialization of compression-encoded sound data | |
FR2966634A1 (en) | ENHANCED STEREO PARAMETRIC ENCODING / DECODING FOR PHASE OPPOSITION CHANNELS | |
EP2319037B1 (en) | Reconstruction of multi-channel audio data | |
WO2007101958A2 (en) | Optimization of binaural sound spatialization based on multichannel encoding | |
FR2875351A1 (en) | METHOD OF PROCESSING DATA BY PASSING BETWEEN DOMAINS DIFFERENT FROM SUB-BANDS | |
EP2489039A1 (en) | Optimized low-throughput parametric coding/decoding | |
WO2017103418A1 (en) | Adaptive channel-reduction processing for encoding a multi-channel audio signal | |
EP2168121A1 (en) | Quantification after linear conversion combining audio signals of a sound scene, and related encoder | |
EP2126905B1 (en) | Methods and devices for audio signals encoding and decoding, encoded audio signal | |
WO2011073600A1 (en) | Parametric stereo encoding/decoding having downmix optimisation | |
EP4042418B1 (en) | Determining corrections to be applied to a multichannel audio signal, associated coding and decoding | |
EP1362344A1 (en) | Multichannel signal spectral reconstruction method and device | |
WO2011045549A1 (en) | Optimized parametric stereo decoding | |
WO2023232823A1 (en) | Title: spatialized audio encoding with configuration of a decorrelation processing operation | |
WO2009081002A1 (en) | Processing of a 3d audio stream as a function of a level of presence of spatial components |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20110125 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA RS |
|
17Q | First examination report despatched |
Effective date: 20110622 |
|
DAX | Request for extension of the european patent (deleted) | ||
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: REF Ref document number: 557386 Country of ref document: AT Kind code of ref document: T Effective date: 20120515 Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D Free format text: LANGUAGE OF EP DOCUMENT: FRENCH |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 602009006883 Country of ref document: DE Effective date: 20120705 |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: VDEP Effective date: 20120509 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2387867 Country of ref document: ES Kind code of ref document: T3 Effective date: 20121003 |
|
REG | Reference to a national code |
Ref country code: LT Ref legal event code: MG4D Effective date: 20120509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120909 Ref country code: NO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120809 |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: MK05 Ref document number: 557386 Country of ref document: AT Kind code of ref document: T Effective date: 20120509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120910 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120810 |
|
BERE | Be: lapsed |
Owner name: FRANCE TELECOM Effective date: 20120630 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120630 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: MM4A |
|
26N | No opposition filed |
Effective date: 20130212 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120630 Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120616 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 602009006883 Country of ref document: DE Effective date: 20130212 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120809 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20130630 Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20130630 Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120616 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20090616 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 8 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 9 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 10 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: ES Payment date: 20230703 Year of fee payment: 15 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20240521 Year of fee payment: 16 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20240521 Year of fee payment: 16 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: IT Payment date: 20240522 Year of fee payment: 16 Ref country code: FR Payment date: 20240522 Year of fee payment: 16 |