EP2042001B1 - Binaurale spatialisierung kompressionsverschlüsselter tondaten - Google Patents
Binaurale spatialisierung kompressionsverschlüsselter tondaten Download PDFInfo
- Publication number
- EP2042001B1 EP2042001B1 EP07803885A EP07803885A EP2042001B1 EP 2042001 B1 EP2042001 B1 EP 2042001B1 EP 07803885 A EP07803885 A EP 07803885A EP 07803885 A EP07803885 A EP 07803885A EP 2042001 B1 EP2042001 B1 EP 2042001B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- hrtf
- channels
- loudspeaker
- listener
- ear
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000006870 function Effects 0.000 claims abstract description 88
- 230000010363 phase shift Effects 0.000 claims abstract description 5
- 238000000034 method Methods 0.000 claims description 32
- 230000006835 compression Effects 0.000 claims description 13
- 238000007906 compression Methods 0.000 claims description 13
- 210000005069 ears Anatomy 0.000 claims description 4
- 230000015654 memory Effects 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims description 2
- 241001362574 Decodes Species 0.000 claims 1
- 230000001419 dependent effect Effects 0.000 claims 1
- 238000001914 filtration Methods 0.000 abstract description 14
- 238000009877 rendering Methods 0.000 description 23
- 230000000875 corresponding effect Effects 0.000 description 9
- 230000000694 effects Effects 0.000 description 7
- 230000008901 benefit Effects 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 5
- 230000005236 sound signal Effects 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 230000002123 temporal effect Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000003672 processing method Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 241001080024 Telles Species 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004040 coloring Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000006837 decompression Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 235000021183 entrée Nutrition 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000008054 signal transmission Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000003936 working memory Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
- H04S3/004—For headphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/02—Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/05—Application of the precedence or Haas effect, i.e. the effect of first wavefront, in order to improve sound-source localisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
Definitions
- the invention relates to the processing of sound data for spatialized reproduction.
- 3D rendering of compressed audio signals occurs especially during the decompression of a 3D audio signal, for example encoded in compression and represented on a number of channels, to a number of different channels (two for example to allow the rendering of 3D audio effects on a headset).
- binaural aims at the reproduction on a stereophonic headphones of a sound signal with nevertheless effects of spatialization.
- the invention is however not limited to the aforementioned technique and applies, in particular, to techniques derived from the "binaural", such as the so-called technical rendering techniques TRANSAURAL (registered trademark), that is to say on remote speakers.
- Such techniques can then use a "crosstalk cancellation” (or “cross-talk cancellation” in English), which consists in canceling the crossed acoustic paths, so that a sound, thus processed and then emitted by the loudspeakers, speakers, can be perceived only by one of the two ears of a listener.
- these two binaural and transaural rendering techniques are commonly referred to as binaural restitution.
- the invention relates to the transmission of multichannel audio signals and their conversion for a spatialized rendering (with 3D rendering) on two channels.
- the rendering device simple headset with earflaps for example
- the conversion can aim for example the case of a reproduction of a sound scene initially in 5.1 multichannel format (or 7.1, or other) by a simple audio headset (in binaural technique).
- the invention also relates to the restitution, in the frame of a game or a video recording, for example, of one or more sound samples stored in files, with a view to their spatialization.
- HRTF Head Related Transfer Functions
- HRIR Head Related Impulse Response
- each sound source S i two signals (left and right) which are then added to the left and right signals from the spatialization of all other sound sources, to finally give the signals L and R which will be broadcast in the left and right ears of the listener through two respective speakers (headphones of a binaural technique or remote speakers in transaural technique).
- N denotes the number of sound sources or incident audio streams to be spatialized
- the number of filters, or transfer functions, necessary for the binaural synthesis is 2xN for rendering in static binaural spatialization and 4xN for rendering in dynamic binaural spatialization (with transitions of transfer functions).
- left rear speaker and BR for a right rear speaker are encoded in compression by an ENCOD module capable of delivering two compressed L and R channels, as well as SPAT spatialization information.
- the compressed channels L and R, as well as the spatialization information SPAT are then conveyed through one or more telecommunication networks RES, on one or two channels according to the available bit rate ( Figure 2B ).
- a decoder reconstructs the original signal in the initial multichannel format by means of the SPAT spatialization information delivered by the coder and, in the example of Figures 2A and 2C , there are still five channels, after decoding, feeding five speakers (HP-FL, HP-FR, HP-C, HP-BL and HP-BR) for a 5.1 format playback.
- Audio encoders use time-frequency representations of signals to compress information. These representations are based on an analysis by filter banks or by time-frequency transformation of the MDCT type (for "Modified Discrete Cosine Transform"). In the case where a binaural spatialization must be performed after an audio decoding, the filtering operations are advantageously performed from the outset in the transformed domain.
- the subband filters in the transformed domain are calculated for each ear and for each of the five positions of the loudspeakers. This technique is often called “virtual speaker technology”.
- the binaural spatialization can then be advantageously performed by applying these binaural filters directly to the transformed domain at the heart of the decoder.
- DECOD BIN audio as shown on the figure 3 .
- this type of DECOD BIN decoder uses a monophonic or stereophonic representation (compressed L, R channels) of the multichannel audio scene, representation to which are associated SPAT spatialization parameters (which may consist, for example, of energy differences between channels and correlation indices between channels). These SPAT parameters are used at decoding to reproduce the original multichannel sound scene.
- the decoding can use representations decorrelated from these L, R signals (which are obtained, for example, by the application of filters, all-pass decorrelation or reverb filters). These signals are then adjusted in energy thanks to interchannel energy differences, then recombined to obtain the multichannel signal for restitution.
- the parametric encoder (ENCOD - Figure 2A ) from the multichannel format to two compressed channels (stereo or mono) according to the draft standard "MPEG Surround” provides cross-channel decorrelation information in the initial multichannel format and this decorrelation information can be taken over by the peer parametric decoder (DECOD - Figure 2C ) when rendering in the initial multichannel format.
- DECOD - Figure 2C peer parametric decoder
- phase compensation a function of the target energy of the channels, is to avoid a so-called "coloring" effect resulting from the addition of two filters shifted in time (comb filtering).
- a decoder receives the SPAT spatialization parameters accompanying the signals compressed on two L and R channels in the example shown, and it has been illustrated on this same Figure 5A , how the above filter h L, L applies to the compressed channel L to form a component of the L-BIN signal for binaural restitution. Nevertheless, as represented again on the Figure 5A , it is also necessary to take into account the compressed signal on the channel R which must, for its part, be filtered by a filter involving HRTF transfer functions (denoted H L, FR and H L, BR ) relating to the cross paths H and E of the figure 4 , always to the left ear.
- HRTF transfer functions decoder
- the filter corresponding to these crossed paths (denoted h L, R ) is computed according to the gains, target energies and phase shifts, taken from the SPAT spatialization parameters, using an expression equivalent to the relation (1) given previously. .
- This filter h L, R is finally applied to the compressed signal on the channel R. It is also necessary to take into account the "contribution" of the central loudspeaker in the construction of the signal intended for the binaural restitution L-BIN and, for this is done by applying a filter h L, C ( Figure 5A ) to a combination (for example by addition) of the compressed signals of the two channels L and R to take into account here the path J to the left ear OL of the figure 4 .
- FIG. 5B another example is shown, in which a decoder receives the compressed signal on a single channel M, accompanying the spatialization parameters SPAT.
- the M channel is duplicated in two L and R channels and the rest of the processing is strictly equivalent to the treatment shown in FIG. Figure 5A .
- the two signals L-BIN and R-BIN resulting from these filterings can then be applied to two loudspeakers intended respectively for the left ear and the right ear of the listener after a transition from the transformed domain to the time domain.
- the present invention improves the situation.
- the initial multichannel format can be of ambiophonic type (or "ambisonic" in English and aiming at the decomposition of the sound signal in a spherical harmonics base). Alternatively, it can be a type of 5.1 or 7.1, or 10.2. It will be understood that for the latter types of format using channels intended to respectively supply at least pairs of speakers front-left / left-back, on the one hand, and front-right / right-back, d On the other hand, the decorrelation information can be directed to the respective channels of the front / rear speakers preferably associated with the same ear (left or right).
- this decorrelation information at the rear of a 3D scene is represented in the binaural or transaural restitution, a better representation of the ambiences is obtained, for example crowd noise or reverberation at the back of a scene, or otherwise, contrary to the achievements of the prior art.
- This weighting advantageously makes it possible to favor the gross transfer function of this rear loudspeaker, or the decorrelated version of this raw transfer function, depending on whether the signal in the back channel of the initial multichannel format is correlated or not with at least one signal. from one of the front channels.
- the encoding in compression implements a parametric encoder delivering, in the compressed stream including the spatialization parameters, cross-channel decorrelation information of the multichannel format, from which the aforementioned weighting can be determined dynamically.
- the aforementioned combination of transfer functions takes advantage of the information already present concerning the correlation between channel signals in multichannel format, this information simply being provided by the coder parametric, with the aforementioned spatialization parameters.
- the parametric decoder according to the draft MPEG Surround standard delivers such cross-channel decorrelation information in multichannel 5.1 format.
- the compressed signal is first recovered on two L and R channels in the example represented, as well as the SPAT spatialization parameters provided by an encoder such as the ENCOD module of the Figure 2A previously described.
- transfer functions are determined to construct a combination of filters ("+" sign of the Figure 6A ), each filter being to be applied to a channel L (filter h L, L of the Figure 5A ), or R (filter h L, R of the Figure 5A ), or a combination of these channels (filter h L, C of the Figure 5A ) to construct a signal feeding one of two binaural L-BIN recovery channels.
- transfer functions are representative of the disturbances experienced by an acoustic wave on a path between a loudspeaker that would have been fed by a channel of the initial multichannel format and an ear of the listener. For example, if the audio content is initially in 5.1 format, as described above with reference to the figure 4 a total of ten HRTF transfer functions are determined, five HRTF functions for the right ear (on paths B, D, G, F and I of the figure 4 ) and five HRTF functions for the left ear (on the A, C, H, E and J paths).
- the HRTF functions of front and rear speakers, located on the same side of the listener are thus grouped to construct each filter of a filter combination specific to a playback channel on an ear of a listener.
- a grouping of HRTF functions to construct a filter is for example an addition, by means of multiplicative coefficients, an example of which will be described later.
- a decorrelated version of the HRTF functions of the loudspeakers located at the rear of the listener is also determined. figure 4 ) and this decorrelated version is integrated with each grouping to form a filter to be applied to a compressed channel.
- a similar treatment is planned to construct the signal intended to feed the other binaural R-BIN restitution channel of the Figure 6B .
- HRTF functions of the paths leading to the right ear OD of the listener AU ( figure 4 ).
- a first grouping includes the HRTF-G functions (for the right front speaker in a direct path), HRTF-F (for the right rear speaker in a direct path) and the decorrelated version HRTF-F * of the HRTF-F function to form the filter to be applied to the compressed channel R.
- a second grouping includes the function HRTF-B (for the front left speaker according to a cross path), the function HRTF-D (for the rear speaker left in a cross path) and the decorrelated version, denoted HRTF-D *, of the HRTF-D function, to form the filter to be applied to the compressed channel L.
- the filter combinations integrating the decorrelated versions of the HRTF functions of the rear loudspeakers are applied to the L and R compressed channels to deliver the L-BIN and R-BIN rendering channels, for spatial binaural rendering with 3D rendering.
- the received sound data is encoded in compression on two stereo channels L and R as illustrated in the example of the Figure 5A .
- they could be encoded in compression on a single monophonic channel M, as shown in FIG. Figure 5B , in which case the filter combinations are applied to the monophonic channel (duplicated) as shown in FIG. Figure 5B , to supply two signals supplying the two L-BIN, R-BIN reproduction channels respectively.
- the initial sound data is in the 5.1 multichannel format and is encoded in compression by a parametric encoder according to the aforementioned draft standard MPEG Surround. More particularly, during such encoding, it is possible to obtain, among the spatialization parameters provided, decorrelation information between the right rear channel and the right front channel (respective loudspeakers HP-BR and HP-FR of the figure 4 ), as well as homologous decorrelation information between the left rear channel and the left front channel (respective loudspeakers HP-FR and HP-BR of the figure 4 ).
- This decorrelation information in a 5.1 format, is intended to make the reproduction of the rear speakers as independent as possible from the reproduction of the front speakers, to enrich, in 5.1 format, the enveloping effect by noise of reverb or audience for concert recordings for example. It is recalled that this enrichment of the 3D envelopment has not been proposed in binaural restitution and an advantage of the invention is to take advantage of the availability of the decorrelation information among the SPAT spatialization parameters to build uncorrelated versions of the functions. HRTF that integrate advantageously with filter combinations for binaural restitution.
- these filter combinations can be calculated directly in the transformed domain, for example in the field of sub-bands, and the filters representing the decorrelated versions of the HRTF functions of the rear loudspeakers can be obtained for example by applying the initial HRTF functions a phase shift function of the sub-frequency band considered.
- the decorrelation filters can be so-called "natural" reverb filters (recorded in a particular acoustic environment such as a concert hall for example), or “synthetic” (created by summation of multiple reflections of decreasing amplitudes in the time).
- the application of a decorrelated filter can therefore return to apply to the signal broken down into frequency subbands a different phase difference in each of the subbands, combined with the addition of a global delay.
- a parametric decoder of the aforementioned type this amounts to multiplying each sub-frequency band by a complex exponential, of different phase in each sub-band. .
- These decorrelation filters can therefore correspond to syntheses of phase-shifting all-pass filters.
- a weighting is provided between the transfer function of a rear loudspeaker and its uncorrelated version in the same group forming a filter.
- the decorrelated version is preferred for crossed paths (right rear speaker for the left ear and left rear speaker for the right ear), so that, in general, the coefficient ⁇ 1 , may often be greater than the coefficient ⁇ 2 .
- the coefficients ⁇ ( ⁇ 1 or ⁇ 2 ) are given by variable weighting functions so as to dynamically favor the raw version of the HRTF function of the rear loudspeaker or its uncorrelated version depending on whether the back signal is correlated or no with the signal before. This gives a better representation of the ambiences (crowd noise, reverb, or other) in the 3D rendering.
- the weighting function ⁇ can be defined dynamically by means of the decorrelation information provided with the spatialization parameters.
- an equivalent expression can be applied to calculate the weighting coefficient ⁇ involved in the homologous filter h R, R specific to direct acoustic paths to the right ear.
- the "sqrt" function no longer applies for the crossed paths and for the calculation of the corresponding coefficient ⁇ 2 , in the example described.
- the target energies and the correlation indices are terms between 0 and 1 so that the coefficient ⁇ 2 is generally lower than the coefficient ⁇ 1 .
- the global filter combination for the L-BIN pathway, has many groupings of HRTF functions forming filters h L, L and h L, R obtained by the formulas given above, and in each grouping, the HRTF function is well involved. a front speaker, the HRTF function of a rear speaker and a decorrelated version of this Last function HRTF, which allows to represent a decorrelation between the front and back channels directly in the combination of filters, and thus directly in the binaural synthesis.
- the combination of filters can be directly applied in the transformed domain as a function of the target energies ( ⁇ FL , ⁇ BL , ⁇ FR , ⁇ BR ) associated with the channels of the multichannel format, these target energies being determined from the SPAT spatialization parameters.
- the transformation from the transformed domain to the time domain is then again planned for the actual reproduction in binaural context (TRANS modules).
- Figures 6A and 6B ).
- the present invention also aims at a computer program, intended to be stored in a memory of a decoding module, such that the MEM memory of the DECOD-BIN module of the figure 7 , for spatialized three-dimensional reproduction on two L-BIN and R-BIN rendering channels.
- the program then comprises instructions for executing the method according to the invention and, in particular, to construct the filter combinations integrating the uncorrelated versions as illustrated on the Figures 6A and 6B described above.
- one or the other of these figures can constitute a flowchart representing the algorithm underlying the program.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Mathematical Physics (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Algebra (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Stereophonic System (AREA)
- Golf Clubs (AREA)
Claims (10)
- Verfahren zur Verarbeitung von Tondaten für eine dreidimensionale verräumlichte Wiedergabe auf zwei Wiedergabepfaden für die Ohren eines Hörers,
wobei die Tondaten anfangs in einem Mehrkanalformat dargestellt und dann auf eine reduzierte Anzahl von Kanälen (L,R) kompressionscodiert (ENCOD) werden,
wobei das Mehrkanalformat darin besteht, mehr als zwei Kanäle vorzusehen, die jeweilige Lautsprecher speisen können,
wobei das Verfahren die Schritte aufweist:- mit den auf die reduzierte Anzahl von Kanälen komprimierten Daten Verräumlichungsparameter (SPAT) zu erhalten,- für jeden einem Ohr des Hörers zugeordneten Wiedergabepfad ausgehend von den Verräumlichungsparametern eine Kombination von Filtern zu formen, die je für Transferfunktionen (HRTF) zwischen diesem Ohr des Hörers und Lautsprechern repräsentativ sind, die von Kanälen des Anfangs-Mehrkanalformats gespeist werden können, und- an die komprimierten Daten die Kombination von Filtern (hL,L, hL,R, hL,C; hR,R, hR,L, hR,C) anzuwenden, die jedem Wiedergabepfad (L-BIN; R-BIN) zugeordnet ist,dadurch gekennzeichnet, dass das Verfahren außerdem die Schritt aufweist:- für jeden einem Ohr des Hörers zugeordneten Wiedergabepfad ausgehend von den Verräumlichungsparametern mindestens eine Transferfunktion eines hinter dem Ohr des Hörers befindlichen Lautsprechers zu bestimmen, die für eine Dekorrelation zwischen den Kanälen des Mehrkanalformats repräsentativ ist, welche dem hinteren Lautsprecher bzw. mindestens einem vor dem Ohr des Hörers befindlichen Lautsprecher zugeordnet sind, und- für jeden Wiedergabepfad die für eine Dekorrelation repräsentative Transferfunktion in die Kombination von Filtern zu integrieren, die diesem Wiedergabepfad zugeordnet ist. - Verfahren nach Anspruch 1, dadurch gekennzeichnet, dass die einem Wiedergabepfad (L-BIN) zugeordnete Kombination von Filtern mindestens eine erste Gruppierung aufweist, die ein erstes Filter (hL,L) formt, ausgehend von:- der Transferfunktion eines vorderen Lautsprechers (HRTF-A),- der Transferfunktion eines hinteren Lautsprechers (HRTF-C), und- einer Version (HRTF-C*) der Transferfunktion des hinteren Lautsprechers, repräsentativ für eine Dekorrelation zwischen Kanälen,und dass der vordere und der hintere Lautsprecher sich bezüglich des Hörers auf der gleichen Seite befinden.
- Verfahren nach Anspruch 2, dadurch gekennzeichnet, dass die Gruppierung eine Gewichtung, gemäß einem gewählten Koeffizienten (α1; α2), aufweist zwischen:- der Transferfunktion des hinten befindlichen Lautsprechers, und- der für eine Dekorrelation dieser Transferfunktion des hinteren Lautsprechers repräsentativen Version.
- Verfahren nach Anspruch 3, dadurch gekennzeichnet, dass die Kompressionscodierung einen parametrischen Codierer (ENCOD) verwendet, der eine Dekorrelationsinformation zwischen Kanälen des Mehrkanalformats liefert, und dass der Gewichtungskoeffizient durch eine in Abhängigkeit von der Dekorrelationsfunktion (ICCL; ICCR), die der parametrische Codierer liefert, dynamisch variable Funktion dargestellt wird.
- Verfahren nach einem der Ansprüche 2 bis 4, bei dem die Tondaten auf zwei Kanälen (L,R) kompressionscodiert werden,
dadurch gekennzeichnet, dass die dem Wiedergabepfad (L-BIN) zugeordnete Kombination von Filtern außer der ersten ein Filter formenden Gruppierung (hL,L) eines der komprimierten Kanäle (L) eine zweite ein Filter formende Gruppierung (hL,R) des anderen der komprimierten Kanäle (R) aufweist, ausgehend von:- der Transferfunktion eines vorderen Lautsprechers (HRTF-H), der sich auf einer zweiten Seite entgegengesetzt zur ersten Seite bezüglich des Hörers befindet,- der Transferfunktion eines hinteren Lautsprechers (HRTF-E), der sich auf der zweiten Seite befindet, und- einer Version (HRTF-E*) der Transferfunktion dieses hinteren Lautsprechers, die für eine Dekorrelation zwischen Kanälen repräsentativ ist. - Verfahren nach einem der vorhergehenden Ansprüche,
dadurch gekennzeichnet, dass, wenn die Tondaten in einem transformierten Bereich kompressionscodiert werden, die Kombination von Filtern im transformierten Bereich in Abhängigkeit von Zielenergien angewendet wird, die den Kanälen des Mehrkanalformats zugeordnet sind, wobei diese Zielenergien ausgehend von den Verräumlichungsparametern bestimmt werden. - Verfahren nach einem der vorhergehenden Ansprüche,
dadurch gekennzeichnet, dass die Transferfunktionen der Lautsprecher vom Typ HRTF sind und akustische Störungen auf Wegen zwischen jedem Lautsprecher und einem Ohr für einen diesem Ohr zugeordneten Wiedergabepfad darstellen. - Verfahren nach den Ansprüchen 6 und 7, bei dem der transformierte Bereich der Bereich der Unterbänder ist, dadurch gekennzeichnet, dass die dekorrelierten Versionen der HRTF-Funktionen der hinteren Lautsprecher erhalten werden, indem an die HRTF-Anfangsfunktionen der hinteren Lautsprecher einer Phasenverzögerung angewendet wird, die von jedem Frequenz-Unterband abhängt.
- Decodiermodul (DECOD BIN) für eine verräumlichte dreidimensionale Wiedergabe auf zwei Wiedergabepfaden, dadurch gekennzeichnet, dass es Einrichtungen zur Verarbeitung von Tondaten zur Durchführung des Verfahrens nach einem der vorhergehenden Ansprüche aufweist.
- EDV-Programm, das dazu bestimmt ist, in einem Speicher eines Decodiermoduls für eine verräumlichte dreidimensionale Wiedergabe auf zwei Wiedergabepfaden gespeichert zu werden, dadurch gekennzeichnet, dass es Anweisungen zur Ausführung des Verfahrens nach einem der Ansprüche 1 bis 8 aufweist.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0606212A FR2903562A1 (fr) | 2006-07-07 | 2006-07-07 | Spatialisation binaurale de donnees sonores encodees en compression. |
PCT/FR2007/051457 WO2008003881A1 (fr) | 2006-07-07 | 2007-06-19 | Spatialisation binaurale de donnees sonores encodees en compression |
Publications (2)
Publication Number | Publication Date |
---|---|
EP2042001A1 EP2042001A1 (de) | 2009-04-01 |
EP2042001B1 true EP2042001B1 (de) | 2009-10-21 |
Family
ID=37684981
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07803885A Active EP2042001B1 (de) | 2006-07-07 | 2007-06-19 | Binaurale spatialisierung kompressionsverschlüsselter tondaten |
Country Status (7)
Country | Link |
---|---|
US (1) | US8880413B2 (de) |
EP (1) | EP2042001B1 (de) |
AT (1) | ATE446652T1 (de) |
DE (1) | DE602007002917D1 (de) |
ES (1) | ES2334856T3 (de) |
FR (1) | FR2903562A1 (de) |
WO (1) | WO2008003881A1 (de) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100954385B1 (ko) * | 2007-12-18 | 2010-04-26 | 한국전자통신연구원 | 개인화된 머리전달함수를 이용한 3차원 오디오 신호 처리장치 및 그 방법과, 그를 이용한 고현장감 멀티미디어 재생시스템 |
US8219409B2 (en) * | 2008-03-31 | 2012-07-10 | Ecole Polytechnique Federale De Lausanne | Audio wave field encoding |
KR20090110242A (ko) * | 2008-04-17 | 2009-10-21 | 삼성전자주식회사 | 오디오 신호를 처리하는 방법 및 장치 |
CA2820199C (en) | 2008-07-31 | 2017-02-28 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Signal generation for binaural signals |
US9167368B2 (en) * | 2011-12-23 | 2015-10-20 | Blackberry Limited | Event notification on a mobile device using binaural sounds |
EP2939443B1 (de) | 2012-12-27 | 2018-02-14 | DTS, Inc. | System und verfahren zur variablen dekorrelation von audiosignalen |
US9420393B2 (en) * | 2013-05-29 | 2016-08-16 | Qualcomm Incorporated | Binaural rendering of spherical harmonic coefficients |
EP2830335A3 (de) | 2013-07-22 | 2015-02-25 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung, Verfahren und Computerprogramm zur Zuordnung eines ersten und eines zweiten Eingabekanals an mindestens einen Ausgabekanal |
EP3412038A4 (de) * | 2016-02-03 | 2019-08-14 | Global Delight Technologies Pvt. Ltd. | Verfahren und systeme zur bereitstellung von virtuellem surround-sound bei kopfhörern |
FR3048808A1 (fr) * | 2016-03-10 | 2017-09-15 | Orange | Codage et decodage optimise d'informations de spatialisation pour le codage et le decodage parametrique d'un signal audio multicanal |
US10304468B2 (en) | 2017-03-20 | 2019-05-28 | Qualcomm Incorporated | Target sample generation |
FR3067511A1 (fr) * | 2017-06-09 | 2018-12-14 | Orange | Traitement de donnees sonores pour une separation de sources sonores dans un signal multicanal |
CN109327766B (zh) * | 2018-09-25 | 2021-04-30 | Oppo广东移动通信有限公司 | 3d音效处理方法及相关产品 |
JP7545960B2 (ja) | 2018-10-05 | 2024-09-05 | マジック リープ, インコーポレイテッド | オーディオ空間化のための強調 |
US11363402B2 (en) | 2019-12-30 | 2022-06-14 | Comhear Inc. | Method for providing a spatialized soundfield |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6175631B1 (en) * | 1999-07-09 | 2001-01-16 | Stephen A. Davis | Method and apparatus for decorrelating audio signals |
KR101016975B1 (ko) * | 2002-09-23 | 2011-02-28 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | 미디어 시스템, 이 미디어 시스템에서 적어도 하나의 출력 신호를 생성하는 방법, 및 컴퓨터 판독 가능한 매체 |
SE0400998D0 (sv) * | 2004-04-16 | 2004-04-16 | Cooding Technologies Sweden Ab | Method for representing multi-channel audio signals |
WO2007080211A1 (en) * | 2006-01-09 | 2007-07-19 | Nokia Corporation | Decoding of binaural audio signals |
KR100754220B1 (ko) * | 2006-03-07 | 2007-09-03 | 삼성전자주식회사 | Mpeg 서라운드를 위한 바이노럴 디코더 및 그 디코딩방법 |
-
2006
- 2006-07-07 FR FR0606212A patent/FR2903562A1/fr not_active Withdrawn
-
2007
- 2007-06-19 DE DE602007002917T patent/DE602007002917D1/de active Active
- 2007-06-19 US US12/309,074 patent/US8880413B2/en active Active
- 2007-06-19 AT AT07803885T patent/ATE446652T1/de not_active IP Right Cessation
- 2007-06-19 ES ES07803885T patent/ES2334856T3/es active Active
- 2007-06-19 WO PCT/FR2007/051457 patent/WO2008003881A1/fr active Application Filing
- 2007-06-19 EP EP07803885A patent/EP2042001B1/de active Active
Also Published As
Publication number | Publication date |
---|---|
DE602007002917D1 (de) | 2009-12-03 |
US20090292544A1 (en) | 2009-11-26 |
FR2903562A1 (fr) | 2008-01-11 |
ES2334856T3 (es) | 2010-03-16 |
WO2008003881A1 (fr) | 2008-01-10 |
ATE446652T1 (de) | 2009-11-15 |
EP2042001A1 (de) | 2009-04-01 |
US8880413B2 (en) | 2014-11-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2042001B1 (de) | Binaurale spatialisierung kompressionsverschlüsselter tondaten | |
EP1600042B1 (de) | Verfahren zum bearbeiten komprimierter audiodaten zur räumlichen wiedergabe | |
EP2000002B1 (de) | Verfahren und einrichtung zur effizienten binauralen raumklangerzeugung im transformierten bereich | |
EP2005420B1 (de) | Einrichtung und verfahren zur codierung durch hauptkomponentenanalyse eines mehrkanaligen audiosignals | |
EP2374123B1 (de) | Verbesserte codierung von mehrkanaligen digitalen audiosignalen | |
EP2374124B1 (de) | Verwaltete codierung von mehrkanaligen digitalen audiosignalen | |
EP2002424B1 (de) | Vorrichtung und verfahren zur skalierbaren kodierung eines mehrkanaligen audiosignals auf der basis einer hauptkomponentenanalyse | |
KR100928311B1 (ko) | 오디오 피스 또는 오디오 데이터스트림의 인코딩된스테레오 신호를 생성하는 장치 및 방법 | |
EP1992198B1 (de) | Optimierung des binauralen raumklangeffektes durch mehrkanalkodierung | |
EP3427260B1 (de) | Optimierte codierung und decodierung von verräumlichungsinformationen zur parametrischen codierung und decodierung eines mehrkanaligen audiosignals | |
EP2304721B1 (de) | Raumsynthese mehrkanaliger tonsignale | |
WO2011045506A1 (fr) | Traitement de donnees sonores encodees dans un domaine de sous-bandes | |
EP1905003A2 (de) | Verfahren und vorrichtung zum decodieren von audiosignalen | |
WO2007110520A1 (fr) | Procede de synthese binaurale prenant en compte un effet de salle | |
FR3045915A1 (fr) | Traitement de reduction de canaux adaptatif pour le codage d'un signal audio multicanal | |
EP3475943A1 (de) | Verfahren zur umwandlung, stereophonen codierung, decodierung und transcodierung eines dreidimensionalen audiosignals | |
EP2009891A2 (de) | Übertragung eines Audiosignals in einem immersiven Audiokonferenzsystem | |
FR3065137A1 (fr) | Procede de spatialisation sonore | |
EP3025514B1 (de) | Klangverräumlichung mit raumwirkung | |
WO2006075079A1 (fr) | Procede d’encodage de pistes audio d’un contenu multimedia destine a une diffusion sur terminaux mobiles | |
FR2857552A1 (fr) | Procede de decodage d'un signal permettant de reconstituer une scene sonore a transformation temps-frequence faible complexite, et dispositif correspondant |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20081223 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA HR MK RS |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |
|
REF | Corresponds to: |
Ref document number: 602007002917 Country of ref document: DE Date of ref document: 20091203 Kind code of ref document: P |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2334856 Country of ref document: ES Kind code of ref document: T3 |
|
LTIE | Lt: invalidation of european patent or patent extension |
Effective date: 20091021 |
|
NLV1 | Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act | ||
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100221 Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100222 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FD4D |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: IE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100121 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
26N | No opposition filed |
Effective date: 20100722 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100122 |
|
BERE | Be: lapsed |
Owner name: FRANCE TELECOM Effective date: 20100630 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100630 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100619 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100630 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20110630 Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20110630 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100619 Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100422 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 9 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 10 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 11 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 12 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: ES Payment date: 20230703 Year of fee payment: 17 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20240521 Year of fee payment: 18 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20240521 Year of fee payment: 18 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: IT Payment date: 20240522 Year of fee payment: 18 Ref country code: FR Payment date: 20240521 Year of fee payment: 18 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: ES Payment date: 20240701 Year of fee payment: 18 |