EP2042001B1 - Binaurale spatialisierung kompressionsverschlüsselter tondaten - Google Patents

Binaurale spatialisierung kompressionsverschlüsselter tondaten Download PDF

Info

Publication number
EP2042001B1
EP2042001B1 EP07803885A EP07803885A EP2042001B1 EP 2042001 B1 EP2042001 B1 EP 2042001B1 EP 07803885 A EP07803885 A EP 07803885A EP 07803885 A EP07803885 A EP 07803885A EP 2042001 B1 EP2042001 B1 EP 2042001B1
Authority
EP
European Patent Office
Prior art keywords
hrtf
channels
loudspeaker
listener
ear
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
EP07803885A
Other languages
English (en)
French (fr)
Other versions
EP2042001A1 (de
Inventor
David Virette
Alexandre Guerin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Orange SA
Original Assignee
France Telecom SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom SA filed Critical France Telecom SA
Publication of EP2042001A1 publication Critical patent/EP2042001A1/de
Application granted granted Critical
Publication of EP2042001B1 publication Critical patent/EP2042001B1/de
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • H04S3/004For headphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/02Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/05Application of the precedence or Haas effect, i.e. the effect of first wavefront, in order to improve sound-source localisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution

Definitions

  • the invention relates to the processing of sound data for spatialized reproduction.
  • 3D rendering of compressed audio signals occurs especially during the decompression of a 3D audio signal, for example encoded in compression and represented on a number of channels, to a number of different channels (two for example to allow the rendering of 3D audio effects on a headset).
  • binaural aims at the reproduction on a stereophonic headphones of a sound signal with nevertheless effects of spatialization.
  • the invention is however not limited to the aforementioned technique and applies, in particular, to techniques derived from the "binaural", such as the so-called technical rendering techniques TRANSAURAL (registered trademark), that is to say on remote speakers.
  • Such techniques can then use a "crosstalk cancellation” (or “cross-talk cancellation” in English), which consists in canceling the crossed acoustic paths, so that a sound, thus processed and then emitted by the loudspeakers, speakers, can be perceived only by one of the two ears of a listener.
  • these two binaural and transaural rendering techniques are commonly referred to as binaural restitution.
  • the invention relates to the transmission of multichannel audio signals and their conversion for a spatialized rendering (with 3D rendering) on two channels.
  • the rendering device simple headset with earflaps for example
  • the conversion can aim for example the case of a reproduction of a sound scene initially in 5.1 multichannel format (or 7.1, or other) by a simple audio headset (in binaural technique).
  • the invention also relates to the restitution, in the frame of a game or a video recording, for example, of one or more sound samples stored in files, with a view to their spatialization.
  • HRTF Head Related Transfer Functions
  • HRIR Head Related Impulse Response
  • each sound source S i two signals (left and right) which are then added to the left and right signals from the spatialization of all other sound sources, to finally give the signals L and R which will be broadcast in the left and right ears of the listener through two respective speakers (headphones of a binaural technique or remote speakers in transaural technique).
  • N denotes the number of sound sources or incident audio streams to be spatialized
  • the number of filters, or transfer functions, necessary for the binaural synthesis is 2xN for rendering in static binaural spatialization and 4xN for rendering in dynamic binaural spatialization (with transitions of transfer functions).
  • left rear speaker and BR for a right rear speaker are encoded in compression by an ENCOD module capable of delivering two compressed L and R channels, as well as SPAT spatialization information.
  • the compressed channels L and R, as well as the spatialization information SPAT are then conveyed through one or more telecommunication networks RES, on one or two channels according to the available bit rate ( Figure 2B ).
  • a decoder reconstructs the original signal in the initial multichannel format by means of the SPAT spatialization information delivered by the coder and, in the example of Figures 2A and 2C , there are still five channels, after decoding, feeding five speakers (HP-FL, HP-FR, HP-C, HP-BL and HP-BR) for a 5.1 format playback.
  • Audio encoders use time-frequency representations of signals to compress information. These representations are based on an analysis by filter banks or by time-frequency transformation of the MDCT type (for "Modified Discrete Cosine Transform"). In the case where a binaural spatialization must be performed after an audio decoding, the filtering operations are advantageously performed from the outset in the transformed domain.
  • the subband filters in the transformed domain are calculated for each ear and for each of the five positions of the loudspeakers. This technique is often called “virtual speaker technology”.
  • the binaural spatialization can then be advantageously performed by applying these binaural filters directly to the transformed domain at the heart of the decoder.
  • DECOD BIN audio as shown on the figure 3 .
  • this type of DECOD BIN decoder uses a monophonic or stereophonic representation (compressed L, R channels) of the multichannel audio scene, representation to which are associated SPAT spatialization parameters (which may consist, for example, of energy differences between channels and correlation indices between channels). These SPAT parameters are used at decoding to reproduce the original multichannel sound scene.
  • the decoding can use representations decorrelated from these L, R signals (which are obtained, for example, by the application of filters, all-pass decorrelation or reverb filters). These signals are then adjusted in energy thanks to interchannel energy differences, then recombined to obtain the multichannel signal for restitution.
  • the parametric encoder (ENCOD - Figure 2A ) from the multichannel format to two compressed channels (stereo or mono) according to the draft standard "MPEG Surround” provides cross-channel decorrelation information in the initial multichannel format and this decorrelation information can be taken over by the peer parametric decoder (DECOD - Figure 2C ) when rendering in the initial multichannel format.
  • DECOD - Figure 2C peer parametric decoder
  • phase compensation a function of the target energy of the channels, is to avoid a so-called "coloring" effect resulting from the addition of two filters shifted in time (comb filtering).
  • a decoder receives the SPAT spatialization parameters accompanying the signals compressed on two L and R channels in the example shown, and it has been illustrated on this same Figure 5A , how the above filter h L, L applies to the compressed channel L to form a component of the L-BIN signal for binaural restitution. Nevertheless, as represented again on the Figure 5A , it is also necessary to take into account the compressed signal on the channel R which must, for its part, be filtered by a filter involving HRTF transfer functions (denoted H L, FR and H L, BR ) relating to the cross paths H and E of the figure 4 , always to the left ear.
  • HRTF transfer functions decoder
  • the filter corresponding to these crossed paths (denoted h L, R ) is computed according to the gains, target energies and phase shifts, taken from the SPAT spatialization parameters, using an expression equivalent to the relation (1) given previously. .
  • This filter h L, R is finally applied to the compressed signal on the channel R. It is also necessary to take into account the "contribution" of the central loudspeaker in the construction of the signal intended for the binaural restitution L-BIN and, for this is done by applying a filter h L, C ( Figure 5A ) to a combination (for example by addition) of the compressed signals of the two channels L and R to take into account here the path J to the left ear OL of the figure 4 .
  • FIG. 5B another example is shown, in which a decoder receives the compressed signal on a single channel M, accompanying the spatialization parameters SPAT.
  • the M channel is duplicated in two L and R channels and the rest of the processing is strictly equivalent to the treatment shown in FIG. Figure 5A .
  • the two signals L-BIN and R-BIN resulting from these filterings can then be applied to two loudspeakers intended respectively for the left ear and the right ear of the listener after a transition from the transformed domain to the time domain.
  • the present invention improves the situation.
  • the initial multichannel format can be of ambiophonic type (or "ambisonic" in English and aiming at the decomposition of the sound signal in a spherical harmonics base). Alternatively, it can be a type of 5.1 or 7.1, or 10.2. It will be understood that for the latter types of format using channels intended to respectively supply at least pairs of speakers front-left / left-back, on the one hand, and front-right / right-back, d On the other hand, the decorrelation information can be directed to the respective channels of the front / rear speakers preferably associated with the same ear (left or right).
  • this decorrelation information at the rear of a 3D scene is represented in the binaural or transaural restitution, a better representation of the ambiences is obtained, for example crowd noise or reverberation at the back of a scene, or otherwise, contrary to the achievements of the prior art.
  • This weighting advantageously makes it possible to favor the gross transfer function of this rear loudspeaker, or the decorrelated version of this raw transfer function, depending on whether the signal in the back channel of the initial multichannel format is correlated or not with at least one signal. from one of the front channels.
  • the encoding in compression implements a parametric encoder delivering, in the compressed stream including the spatialization parameters, cross-channel decorrelation information of the multichannel format, from which the aforementioned weighting can be determined dynamically.
  • the aforementioned combination of transfer functions takes advantage of the information already present concerning the correlation between channel signals in multichannel format, this information simply being provided by the coder parametric, with the aforementioned spatialization parameters.
  • the parametric decoder according to the draft MPEG Surround standard delivers such cross-channel decorrelation information in multichannel 5.1 format.
  • the compressed signal is first recovered on two L and R channels in the example represented, as well as the SPAT spatialization parameters provided by an encoder such as the ENCOD module of the Figure 2A previously described.
  • transfer functions are determined to construct a combination of filters ("+" sign of the Figure 6A ), each filter being to be applied to a channel L (filter h L, L of the Figure 5A ), or R (filter h L, R of the Figure 5A ), or a combination of these channels (filter h L, C of the Figure 5A ) to construct a signal feeding one of two binaural L-BIN recovery channels.
  • transfer functions are representative of the disturbances experienced by an acoustic wave on a path between a loudspeaker that would have been fed by a channel of the initial multichannel format and an ear of the listener. For example, if the audio content is initially in 5.1 format, as described above with reference to the figure 4 a total of ten HRTF transfer functions are determined, five HRTF functions for the right ear (on paths B, D, G, F and I of the figure 4 ) and five HRTF functions for the left ear (on the A, C, H, E and J paths).
  • the HRTF functions of front and rear speakers, located on the same side of the listener are thus grouped to construct each filter of a filter combination specific to a playback channel on an ear of a listener.
  • a grouping of HRTF functions to construct a filter is for example an addition, by means of multiplicative coefficients, an example of which will be described later.
  • a decorrelated version of the HRTF functions of the loudspeakers located at the rear of the listener is also determined. figure 4 ) and this decorrelated version is integrated with each grouping to form a filter to be applied to a compressed channel.
  • a similar treatment is planned to construct the signal intended to feed the other binaural R-BIN restitution channel of the Figure 6B .
  • HRTF functions of the paths leading to the right ear OD of the listener AU ( figure 4 ).
  • a first grouping includes the HRTF-G functions (for the right front speaker in a direct path), HRTF-F (for the right rear speaker in a direct path) and the decorrelated version HRTF-F * of the HRTF-F function to form the filter to be applied to the compressed channel R.
  • a second grouping includes the function HRTF-B (for the front left speaker according to a cross path), the function HRTF-D (for the rear speaker left in a cross path) and the decorrelated version, denoted HRTF-D *, of the HRTF-D function, to form the filter to be applied to the compressed channel L.
  • the filter combinations integrating the decorrelated versions of the HRTF functions of the rear loudspeakers are applied to the L and R compressed channels to deliver the L-BIN and R-BIN rendering channels, for spatial binaural rendering with 3D rendering.
  • the received sound data is encoded in compression on two stereo channels L and R as illustrated in the example of the Figure 5A .
  • they could be encoded in compression on a single monophonic channel M, as shown in FIG. Figure 5B , in which case the filter combinations are applied to the monophonic channel (duplicated) as shown in FIG. Figure 5B , to supply two signals supplying the two L-BIN, R-BIN reproduction channels respectively.
  • the initial sound data is in the 5.1 multichannel format and is encoded in compression by a parametric encoder according to the aforementioned draft standard MPEG Surround. More particularly, during such encoding, it is possible to obtain, among the spatialization parameters provided, decorrelation information between the right rear channel and the right front channel (respective loudspeakers HP-BR and HP-FR of the figure 4 ), as well as homologous decorrelation information between the left rear channel and the left front channel (respective loudspeakers HP-FR and HP-BR of the figure 4 ).
  • This decorrelation information in a 5.1 format, is intended to make the reproduction of the rear speakers as independent as possible from the reproduction of the front speakers, to enrich, in 5.1 format, the enveloping effect by noise of reverb or audience for concert recordings for example. It is recalled that this enrichment of the 3D envelopment has not been proposed in binaural restitution and an advantage of the invention is to take advantage of the availability of the decorrelation information among the SPAT spatialization parameters to build uncorrelated versions of the functions. HRTF that integrate advantageously with filter combinations for binaural restitution.
  • these filter combinations can be calculated directly in the transformed domain, for example in the field of sub-bands, and the filters representing the decorrelated versions of the HRTF functions of the rear loudspeakers can be obtained for example by applying the initial HRTF functions a phase shift function of the sub-frequency band considered.
  • the decorrelation filters can be so-called "natural" reverb filters (recorded in a particular acoustic environment such as a concert hall for example), or “synthetic” (created by summation of multiple reflections of decreasing amplitudes in the time).
  • the application of a decorrelated filter can therefore return to apply to the signal broken down into frequency subbands a different phase difference in each of the subbands, combined with the addition of a global delay.
  • a parametric decoder of the aforementioned type this amounts to multiplying each sub-frequency band by a complex exponential, of different phase in each sub-band. .
  • These decorrelation filters can therefore correspond to syntheses of phase-shifting all-pass filters.
  • a weighting is provided between the transfer function of a rear loudspeaker and its uncorrelated version in the same group forming a filter.
  • the decorrelated version is preferred for crossed paths (right rear speaker for the left ear and left rear speaker for the right ear), so that, in general, the coefficient ⁇ 1 , may often be greater than the coefficient ⁇ 2 .
  • the coefficients ⁇ ( ⁇ 1 or ⁇ 2 ) are given by variable weighting functions so as to dynamically favor the raw version of the HRTF function of the rear loudspeaker or its uncorrelated version depending on whether the back signal is correlated or no with the signal before. This gives a better representation of the ambiences (crowd noise, reverb, or other) in the 3D rendering.
  • the weighting function ⁇ can be defined dynamically by means of the decorrelation information provided with the spatialization parameters.
  • an equivalent expression can be applied to calculate the weighting coefficient ⁇ involved in the homologous filter h R, R specific to direct acoustic paths to the right ear.
  • the "sqrt" function no longer applies for the crossed paths and for the calculation of the corresponding coefficient ⁇ 2 , in the example described.
  • the target energies and the correlation indices are terms between 0 and 1 so that the coefficient ⁇ 2 is generally lower than the coefficient ⁇ 1 .
  • the global filter combination for the L-BIN pathway, has many groupings of HRTF functions forming filters h L, L and h L, R obtained by the formulas given above, and in each grouping, the HRTF function is well involved. a front speaker, the HRTF function of a rear speaker and a decorrelated version of this Last function HRTF, which allows to represent a decorrelation between the front and back channels directly in the combination of filters, and thus directly in the binaural synthesis.
  • the combination of filters can be directly applied in the transformed domain as a function of the target energies ( ⁇ FL , ⁇ BL , ⁇ FR , ⁇ BR ) associated with the channels of the multichannel format, these target energies being determined from the SPAT spatialization parameters.
  • the transformation from the transformed domain to the time domain is then again planned for the actual reproduction in binaural context (TRANS modules).
  • Figures 6A and 6B ).
  • the present invention also aims at a computer program, intended to be stored in a memory of a decoding module, such that the MEM memory of the DECOD-BIN module of the figure 7 , for spatialized three-dimensional reproduction on two L-BIN and R-BIN rendering channels.
  • the program then comprises instructions for executing the method according to the invention and, in particular, to construct the filter combinations integrating the uncorrelated versions as illustrated on the Figures 6A and 6B described above.
  • one or the other of these figures can constitute a flowchart representing the algorithm underlying the program.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Mathematical Physics (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Algebra (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Stereophonic System (AREA)
  • Golf Clubs (AREA)

Claims (10)

  1. Verfahren zur Verarbeitung von Tondaten für eine dreidimensionale verräumlichte Wiedergabe auf zwei Wiedergabepfaden für die Ohren eines Hörers,
    wobei die Tondaten anfangs in einem Mehrkanalformat dargestellt und dann auf eine reduzierte Anzahl von Kanälen (L,R) kompressionscodiert (ENCOD) werden,
    wobei das Mehrkanalformat darin besteht, mehr als zwei Kanäle vorzusehen, die jeweilige Lautsprecher speisen können,
    wobei das Verfahren die Schritte aufweist:
    - mit den auf die reduzierte Anzahl von Kanälen komprimierten Daten Verräumlichungsparameter (SPAT) zu erhalten,
    - für jeden einem Ohr des Hörers zugeordneten Wiedergabepfad ausgehend von den Verräumlichungsparametern eine Kombination von Filtern zu formen, die je für Transferfunktionen (HRTF) zwischen diesem Ohr des Hörers und Lautsprechern repräsentativ sind, die von Kanälen des Anfangs-Mehrkanalformats gespeist werden können, und
    - an die komprimierten Daten die Kombination von Filtern (hL,L, hL,R, hL,C; hR,R, hR,L, hR,C) anzuwenden, die jedem Wiedergabepfad (L-BIN; R-BIN) zugeordnet ist,
    dadurch gekennzeichnet, dass das Verfahren außerdem die Schritt aufweist:
    - für jeden einem Ohr des Hörers zugeordneten Wiedergabepfad ausgehend von den Verräumlichungsparametern mindestens eine Transferfunktion eines hinter dem Ohr des Hörers befindlichen Lautsprechers zu bestimmen, die für eine Dekorrelation zwischen den Kanälen des Mehrkanalformats repräsentativ ist, welche dem hinteren Lautsprecher bzw. mindestens einem vor dem Ohr des Hörers befindlichen Lautsprecher zugeordnet sind, und
    - für jeden Wiedergabepfad die für eine Dekorrelation repräsentative Transferfunktion in die Kombination von Filtern zu integrieren, die diesem Wiedergabepfad zugeordnet ist.
  2. Verfahren nach Anspruch 1, dadurch gekennzeichnet, dass die einem Wiedergabepfad (L-BIN) zugeordnete Kombination von Filtern mindestens eine erste Gruppierung aufweist, die ein erstes Filter (hL,L) formt, ausgehend von:
    - der Transferfunktion eines vorderen Lautsprechers (HRTF-A),
    - der Transferfunktion eines hinteren Lautsprechers (HRTF-C), und
    - einer Version (HRTF-C*) der Transferfunktion des hinteren Lautsprechers, repräsentativ für eine Dekorrelation zwischen Kanälen,
    und dass der vordere und der hintere Lautsprecher sich bezüglich des Hörers auf der gleichen Seite befinden.
  3. Verfahren nach Anspruch 2, dadurch gekennzeichnet, dass die Gruppierung eine Gewichtung, gemäß einem gewählten Koeffizienten (α1; α2), aufweist zwischen:
    - der Transferfunktion des hinten befindlichen Lautsprechers, und
    - der für eine Dekorrelation dieser Transferfunktion des hinteren Lautsprechers repräsentativen Version.
  4. Verfahren nach Anspruch 3, dadurch gekennzeichnet, dass die Kompressionscodierung einen parametrischen Codierer (ENCOD) verwendet, der eine Dekorrelationsinformation zwischen Kanälen des Mehrkanalformats liefert, und dass der Gewichtungskoeffizient durch eine in Abhängigkeit von der Dekorrelationsfunktion (ICCL; ICCR), die der parametrische Codierer liefert, dynamisch variable Funktion dargestellt wird.
  5. Verfahren nach einem der Ansprüche 2 bis 4, bei dem die Tondaten auf zwei Kanälen (L,R) kompressionscodiert werden,
    dadurch gekennzeichnet, dass die dem Wiedergabepfad (L-BIN) zugeordnete Kombination von Filtern außer der ersten ein Filter formenden Gruppierung (hL,L) eines der komprimierten Kanäle (L) eine zweite ein Filter formende Gruppierung (hL,R) des anderen der komprimierten Kanäle (R) aufweist, ausgehend von:
    - der Transferfunktion eines vorderen Lautsprechers (HRTF-H), der sich auf einer zweiten Seite entgegengesetzt zur ersten Seite bezüglich des Hörers befindet,
    - der Transferfunktion eines hinteren Lautsprechers (HRTF-E), der sich auf der zweiten Seite befindet, und
    - einer Version (HRTF-E*) der Transferfunktion dieses hinteren Lautsprechers, die für eine Dekorrelation zwischen Kanälen repräsentativ ist.
  6. Verfahren nach einem der vorhergehenden Ansprüche,
    dadurch gekennzeichnet, dass, wenn die Tondaten in einem transformierten Bereich kompressionscodiert werden, die Kombination von Filtern im transformierten Bereich in Abhängigkeit von Zielenergien angewendet wird, die den Kanälen des Mehrkanalformats zugeordnet sind, wobei diese Zielenergien ausgehend von den Verräumlichungsparametern bestimmt werden.
  7. Verfahren nach einem der vorhergehenden Ansprüche,
    dadurch gekennzeichnet, dass die Transferfunktionen der Lautsprecher vom Typ HRTF sind und akustische Störungen auf Wegen zwischen jedem Lautsprecher und einem Ohr für einen diesem Ohr zugeordneten Wiedergabepfad darstellen.
  8. Verfahren nach den Ansprüchen 6 und 7, bei dem der transformierte Bereich der Bereich der Unterbänder ist, dadurch gekennzeichnet, dass die dekorrelierten Versionen der HRTF-Funktionen der hinteren Lautsprecher erhalten werden, indem an die HRTF-Anfangsfunktionen der hinteren Lautsprecher einer Phasenverzögerung angewendet wird, die von jedem Frequenz-Unterband abhängt.
  9. Decodiermodul (DECOD BIN) für eine verräumlichte dreidimensionale Wiedergabe auf zwei Wiedergabepfaden, dadurch gekennzeichnet, dass es Einrichtungen zur Verarbeitung von Tondaten zur Durchführung des Verfahrens nach einem der vorhergehenden Ansprüche aufweist.
  10. EDV-Programm, das dazu bestimmt ist, in einem Speicher eines Decodiermoduls für eine verräumlichte dreidimensionale Wiedergabe auf zwei Wiedergabepfaden gespeichert zu werden, dadurch gekennzeichnet, dass es Anweisungen zur Ausführung des Verfahrens nach einem der Ansprüche 1 bis 8 aufweist.
EP07803885A 2006-07-07 2007-06-19 Binaurale spatialisierung kompressionsverschlüsselter tondaten Active EP2042001B1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0606212A FR2903562A1 (fr) 2006-07-07 2006-07-07 Spatialisation binaurale de donnees sonores encodees en compression.
PCT/FR2007/051457 WO2008003881A1 (fr) 2006-07-07 2007-06-19 Spatialisation binaurale de donnees sonores encodees en compression

Publications (2)

Publication Number Publication Date
EP2042001A1 EP2042001A1 (de) 2009-04-01
EP2042001B1 true EP2042001B1 (de) 2009-10-21

Family

ID=37684981

Family Applications (1)

Application Number Title Priority Date Filing Date
EP07803885A Active EP2042001B1 (de) 2006-07-07 2007-06-19 Binaurale spatialisierung kompressionsverschlüsselter tondaten

Country Status (7)

Country Link
US (1) US8880413B2 (de)
EP (1) EP2042001B1 (de)
AT (1) ATE446652T1 (de)
DE (1) DE602007002917D1 (de)
ES (1) ES2334856T3 (de)
FR (1) FR2903562A1 (de)
WO (1) WO2008003881A1 (de)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100954385B1 (ko) * 2007-12-18 2010-04-26 한국전자통신연구원 개인화된 머리전달함수를 이용한 3차원 오디오 신호 처리장치 및 그 방법과, 그를 이용한 고현장감 멀티미디어 재생시스템
US8219409B2 (en) * 2008-03-31 2012-07-10 Ecole Polytechnique Federale De Lausanne Audio wave field encoding
KR20090110242A (ko) * 2008-04-17 2009-10-21 삼성전자주식회사 오디오 신호를 처리하는 방법 및 장치
CA2820199C (en) 2008-07-31 2017-02-28 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Signal generation for binaural signals
US9167368B2 (en) * 2011-12-23 2015-10-20 Blackberry Limited Event notification on a mobile device using binaural sounds
EP2939443B1 (de) 2012-12-27 2018-02-14 DTS, Inc. System und verfahren zur variablen dekorrelation von audiosignalen
US9420393B2 (en) * 2013-05-29 2016-08-16 Qualcomm Incorporated Binaural rendering of spherical harmonic coefficients
EP2830335A3 (de) 2013-07-22 2015-02-25 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung, Verfahren und Computerprogramm zur Zuordnung eines ersten und eines zweiten Eingabekanals an mindestens einen Ausgabekanal
EP3412038A4 (de) * 2016-02-03 2019-08-14 Global Delight Technologies Pvt. Ltd. Verfahren und systeme zur bereitstellung von virtuellem surround-sound bei kopfhörern
FR3048808A1 (fr) * 2016-03-10 2017-09-15 Orange Codage et decodage optimise d'informations de spatialisation pour le codage et le decodage parametrique d'un signal audio multicanal
US10304468B2 (en) 2017-03-20 2019-05-28 Qualcomm Incorporated Target sample generation
FR3067511A1 (fr) * 2017-06-09 2018-12-14 Orange Traitement de donnees sonores pour une separation de sources sonores dans un signal multicanal
CN109327766B (zh) * 2018-09-25 2021-04-30 Oppo广东移动通信有限公司 3d音效处理方法及相关产品
JP7545960B2 (ja) 2018-10-05 2024-09-05 マジック リープ, インコーポレイテッド オーディオ空間化のための強調
US11363402B2 (en) 2019-12-30 2022-06-14 Comhear Inc. Method for providing a spatialized soundfield

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6175631B1 (en) * 1999-07-09 2001-01-16 Stephen A. Davis Method and apparatus for decorrelating audio signals
KR101016975B1 (ko) * 2002-09-23 2011-02-28 코닌클리케 필립스 일렉트로닉스 엔.브이. 미디어 시스템, 이 미디어 시스템에서 적어도 하나의 출력 신호를 생성하는 방법, 및 컴퓨터 판독 가능한 매체
SE0400998D0 (sv) * 2004-04-16 2004-04-16 Cooding Technologies Sweden Ab Method for representing multi-channel audio signals
WO2007080211A1 (en) * 2006-01-09 2007-07-19 Nokia Corporation Decoding of binaural audio signals
KR100754220B1 (ko) * 2006-03-07 2007-09-03 삼성전자주식회사 Mpeg 서라운드를 위한 바이노럴 디코더 및 그 디코딩방법

Also Published As

Publication number Publication date
DE602007002917D1 (de) 2009-12-03
US20090292544A1 (en) 2009-11-26
FR2903562A1 (fr) 2008-01-11
ES2334856T3 (es) 2010-03-16
WO2008003881A1 (fr) 2008-01-10
ATE446652T1 (de) 2009-11-15
EP2042001A1 (de) 2009-04-01
US8880413B2 (en) 2014-11-04

Similar Documents

Publication Publication Date Title
EP2042001B1 (de) Binaurale spatialisierung kompressionsverschlüsselter tondaten
EP1600042B1 (de) Verfahren zum bearbeiten komprimierter audiodaten zur räumlichen wiedergabe
EP2000002B1 (de) Verfahren und einrichtung zur effizienten binauralen raumklangerzeugung im transformierten bereich
EP2005420B1 (de) Einrichtung und verfahren zur codierung durch hauptkomponentenanalyse eines mehrkanaligen audiosignals
EP2374123B1 (de) Verbesserte codierung von mehrkanaligen digitalen audiosignalen
EP2374124B1 (de) Verwaltete codierung von mehrkanaligen digitalen audiosignalen
EP2002424B1 (de) Vorrichtung und verfahren zur skalierbaren kodierung eines mehrkanaligen audiosignals auf der basis einer hauptkomponentenanalyse
KR100928311B1 (ko) 오디오 피스 또는 오디오 데이터스트림의 인코딩된스테레오 신호를 생성하는 장치 및 방법
EP1992198B1 (de) Optimierung des binauralen raumklangeffektes durch mehrkanalkodierung
EP3427260B1 (de) Optimierte codierung und decodierung von verräumlichungsinformationen zur parametrischen codierung und decodierung eines mehrkanaligen audiosignals
EP2304721B1 (de) Raumsynthese mehrkanaliger tonsignale
WO2011045506A1 (fr) Traitement de donnees sonores encodees dans un domaine de sous-bandes
EP1905003A2 (de) Verfahren und vorrichtung zum decodieren von audiosignalen
WO2007110520A1 (fr) Procede de synthese binaurale prenant en compte un effet de salle
FR3045915A1 (fr) Traitement de reduction de canaux adaptatif pour le codage d'un signal audio multicanal
EP3475943A1 (de) Verfahren zur umwandlung, stereophonen codierung, decodierung und transcodierung eines dreidimensionalen audiosignals
EP2009891A2 (de) Übertragung eines Audiosignals in einem immersiven Audiokonferenzsystem
FR3065137A1 (fr) Procede de spatialisation sonore
EP3025514B1 (de) Klangverräumlichung mit raumwirkung
WO2006075079A1 (fr) Procede d’encodage de pistes audio d’un contenu multimedia destine a une diffusion sur terminaux mobiles
FR2857552A1 (fr) Procede de decodage d'un signal permettant de reconstituer une scene sonore a transformation temps-frequence faible complexite, et dispositif correspondant

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20081223

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA HR MK RS

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 602007002917

Country of ref document: DE

Date of ref document: 20091203

Kind code of ref document: P

REG Reference to a national code

Ref country code: ES

Ref legal event code: FG2A

Ref document number: 2334856

Country of ref document: ES

Kind code of ref document: T3

LTIE Lt: invalidation of european patent or patent extension

Effective date: 20091021

NLV1 Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act
PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100221

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100222

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100121

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

26N No opposition filed

Effective date: 20100722

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100122

BERE Be: lapsed

Owner name: FRANCE TELECOM

Effective date: 20100630

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100630

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100619

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100630

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110630

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110630

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100619

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100422

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 9

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 10

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 11

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 12

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: ES

Payment date: 20230703

Year of fee payment: 17

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20240521

Year of fee payment: 18

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20240521

Year of fee payment: 18

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20240522

Year of fee payment: 18

Ref country code: FR

Payment date: 20240521

Year of fee payment: 18

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: ES

Payment date: 20240701

Year of fee payment: 18