EP3025514A1 - Sound spatialization with room effect - Google Patents
Sound spatialization with room effectInfo
- Publication number
- EP3025514A1 EP3025514A1 EP14748239.2A EP14748239A EP3025514A1 EP 3025514 A1 EP3025514 A1 EP 3025514A1 EP 14748239 A EP14748239 A EP 14748239A EP 3025514 A1 EP3025514 A1 EP 3025514A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- input signals
- transfer function
- room effect
- weighting
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000000694 effects Effects 0.000 title claims abstract description 49
- 238000012546 transfer Methods 0.000 claims abstract description 89
- 238000000034 method Methods 0.000 claims abstract description 35
- 238000001914 filtration Methods 0.000 claims abstract description 21
- 230000006870 function Effects 0.000 claims description 90
- 230000004044 response Effects 0.000 claims description 37
- 230000005236 sound signal Effects 0.000 claims description 10
- 238000010276 construction Methods 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims description 2
- 230000005669 field effect Effects 0.000 claims description 2
- 210000005069 ears Anatomy 0.000 description 11
- 238000012545 processing Methods 0.000 description 11
- 238000011282 treatment Methods 0.000 description 9
- 238000004364 calculation method Methods 0.000 description 8
- 238000009877 rendering Methods 0.000 description 6
- 238000007792 addition Methods 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 4
- 238000012935 Averaging Methods 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 230000003595 spectral effect Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 230000001934 delay Effects 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000003936 working memory Effects 0.000 description 2
- SVTBMSDMJJWYQN-UHFFFAOYSA-N 2-methylpentane-2,4-diol Chemical compound CC(O)CC(C)(C)O SVTBMSDMJJWYQN-UHFFFAOYSA-N 0.000 description 1
- 101100071959 Bartonella bacilliformis (strain ATCC 35685 / NCTC 12138 / KC583) ialB gene Proteins 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- 241001362574 Decodes Species 0.000 description 1
- 230000005355 Hall effect Effects 0.000 description 1
- 101100397045 Xenopus laevis invs-b gene Proteins 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 238000002592 echocardiography Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 101150114988 invA gene Proteins 0.000 description 1
- 238000000691 measurement method Methods 0.000 description 1
- 230000015654 memory Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000004091 panning Methods 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 238000002203 pretreatment Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 101150021607 rppH gene Proteins 0.000 description 1
- 101150082821 sacA gene Proteins 0.000 description 1
- 101150080136 sacC gene Proteins 0.000 description 1
- 230000035807 sensation Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 101150094617 spaK gene Proteins 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000004584 weight gain Effects 0.000 description 1
- 235000019786 weight gain Nutrition 0.000 description 1
- 101150070603 yadA gene Proteins 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/305—Electronic adaptation of stereophonic audio signals to reverberation of the listening space
- H04S7/306—For headphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
- H04S1/005—For headphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/03—Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/13—Aspects of volume control, not necessarily automatic, in stereophonic sound systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
Definitions
- the invention relates to the processing of sound data, and more particularly to the spatialization (called "3D rendering") of audio signals.
- Such an operation is for example performed when decoding a coded 3D audio signal, represented on a number of channels, to a number of different channels, for example two, to allow the reproduction of the 3D audio effects on a headset. listening.
- the invention also relates to the transmission and reproduction of multichannel audio signals and their conversion to a rendering device, transducer, imposed by the equipment of a user. This is for example the case for the reproduction of a 5.1 sound stage by an audio headset, or by a pair of loudspeakers.
- the invention also relates to the rendering, in the context of a game or video recording, for example, of one or more sound samples stored in files, with a view to their spatialization.
- the binauralization is based on the monophonic signal filtering by the transfer function between the desired position of the source and each of the two ears.
- the binaural signal (two channels) obtained can then feed a headphone and provide the listener with a feeling of the source at the simulated position.
- the term "binaural" refers to the reproduction of a sound signal with spatialization effects.
- Each of the transfer functions simulating different positions can be measured in a deaf chamber, thus resulting in a set of HRTFs (for "Head Related Transfer Functions") in which no room effect is present.
- BRIRs Binary Room Impulse Response
- the set of BRIRs thus correspond to a set of transfer functions between a given position and the ears of a listener (real or artificial head) placed in a room.
- the usual BRIR measurement technique consists of successively sending in each of the actual loudspeakers, positioned around a head (real or artificial) equipped with microphones in the ears, a test signal (for example a sweep signal, a sequence pseudo-random binary or white noise).
- This test signal makes it possible, during a non-real-time processing, to reconstitute (generally by deconvolution) the impulse response between the position of the loudspeaker and each of the two ears.
- the difference between a set of HRTF and BRIR lies mainly in the length of the impulse response, of the order of one millisecond for the HRTF, to the order of one second for the BRIRs.
- the index 1 such that l £ [i. i-j refers to one of the L speakers.
- L represents the number of FFTs to frequency transform the input signals (1 FFT per input signal)
- the 2 represents the number of inverse FFTs to obtain the time binaural signal (2 inverse FFTs for both binaural channels)
- the 6 indicates a coefficient of complexity per FFT
- the second 2 indicates a zero stuffing necessary to avoid the problems due to the circular convolution
- Fs indicates the size of each of the BRIRs
- nBlocs represents the fact of use block processing, more realistic in an approach where latency should not be excessively high, and. represents multiplication.
- the present invention improves the situation.
- the present invention proposes for this purpose a sound spatialization method, in which at least one filtering, with summation, is applied to at least two input signals (1 (1), 1 (2), I (L)), the filtering comprising: applying at least one first room effect transfer function (A k (1), A k (2),..., A k (L)), this first transfer function being specific to each input signal, and the application of at least one second room effect transfer function (B mean k ), this second transfer function being common to all the input signals.
- the method is such that it comprises a step of weighting at least one input signal with a weighting weight (W k (1)), said weighting weight being specific to each of the input signals.
- the input signals correspond for example to the different channels of a multichannel signal.
- Such a filtering can in particular deliver at least two output signals intended for restitution. spatialized (in binaural or transaural, or in ambiophonic restitution involving more than two output signals).
- the filter delivers precisely two output signals, the first output signal being spatialized for the left ear and the second output signal being spatialized for the right ear. This makes possible the preservation of the degree of natural correlation that can exist between the left and right ears at low frequencies.
- the physical properties (for example the energy or the correlation between the various transfer functions) of the transfer functions over certain time intervals make simplifications possible. At these intervals, the transfer functions can be approximated by an average filter.
- At least one first transfer function specific to each input signal may be applied for intervals where it is not possible to make approximations.
- At least one second transfer function approximated to an average filter may be applied for intervals where it is possible to make approximations.
- the application of a single transfer function common to each of the input signals substantially reduces the number of calculations to be performed for the spatialization.
- the complexity of this spatialization is therefore advantageously reduced.
- This simplification thus advantageously reduces the processing time while minimizing the CPU or processors used for these calculations.
- weighting weights specific to each of the input signals the energy differences between the different input signals can be taken into account even if the treatment applied to them is partly approximated by an average filter.
- the first and second transfer functions are respectively representative of: direct sound propagation and first sound reflections of these propagations.
- the method in the sense of the invention furthermore comprises: the application of first transfer functions respectively specific to the input signals, and the application of a second transfer function, identical for all the input signals, and resulting from an overall approximation of a diffuse sound field effect.
- the complexity of the treatment is advantageously reduced by this approximation.
- the influence of such an approximation on the quality of the processing is reduced because this approximation is related to the effects of diffuse sound field and not to direct sound propagation. These effects of diffuse sound field are indeed less sensitive to approximations.
- the first sound reflections are typically a first succession of echoes of the sound wave. In an exemplary practical embodiment, it is considered that these first reflections are two in number, at most.
- a preliminary step of constructing the first and second transfer functions from impulse responses incorporating a room effect comprises, for the construction of a first transfer function, the operations:
- the start time of diffuse field presence is determined from predetermined criteria.
- the detection of a monotonic decay of a spectral density of sound power in a given room can typically characterize the beginning of presence of the diffuse field, and hence give the moment of beginning of presence diffuse field.
- the moment of beginning of presence can be determined by an estimate according to the characteristics of the room, for example simply from the volume of the room as will be seen later.
- the start time of presence of the diffuse field occurs, for example, after N / 2 samples of the impulse response.
- the presence start time is predetermined and therefore corresponds to a fixed value.
- this value may correspond, for example, to the 2048th sample on 48000 samples of an impulse response incorporating a room effect.
- the instant of onset of the presence of direct sound waves may correspond, for example, to the beginning of the time signal of an impulse response with room effect.
- the second transfer function is constructed from a set of impulse response portions beginning temporally after the start time of the presence of the diffuse field.
- the second transfer function can be determined from the characteristics of the room, or from predetermined standard filters.
- the impulse responses incorporating a room effect are advantageously divided into two parts separated by a start of presence time.
- Such a separation makes possible a treatment adapted to each of these parts.
- a selection of the first samples (the first 2048) of an impulse response can be used to use it as the first transfer function in the filtering and then ignore the remaining samples (from 2048 to 48000 for example) or the average with those of other impulse responses.
- the advantage of such an embodiment is then, particularly advantageously, to simplify the filtering calculations specific to the input signals, and to add a form of noise from the sound diffusion that can be calculated from the second halves impulse responses (in the form of an average, for example as will be seen later), or simply from a predetermined impulse response, estimated simply as a function of characteristics of the given room (its volume, the walls of the walls of the room). room, or others), or a standard room.
- the second transfer function is given by applying a formula of the type: with k the index relating to an output signal, ⁇ E fl; Lj the index relating to an input signal,
- L the number of input signals, normalized transfer method obtained from a set of impulse response portions beginning temporally after the start time of presence of the diffuse field.
- the first and second transfer functions are derived from a plurality of binaural BRIR room impulse responses.
- these first and second transfer functions are obtained from experimental values derived from measurement of propagations and reverberations in a given room.
- the treatment is carried out from experimental data. Such data translate very precisely the room effects and thus guarantee a great realism of the rendering.
- the first and second transfer functions are obtained from reference filters, synthesized for example with a network of curly delays.
- truncation is applied at the beginning of the BRIRs.
- the first samples of BRIR for which the application to the input signals has no influence are advantageously eliminated.
- a BRIR start truncation compensation delay is applied. This compensation time makes it possible to compensate for the time offset introduced by the truncation.
- truncation is applied at the end of BRIR.
- the filtering comprises the application of at least one compensation delay corresponding to a time difference between the aforementioned instant of the start of direct sound waves and the start time of presence of diffuse field.
- the first and second room effect transfer functions are applied parallel to the input signals.
- at least one compensation delay is applied to the input signals filtered by the second transfer functions.
- an energy compensation gain is applied to the weighting weights.
- At least one input signal is applied, at least one energy compensation gain.
- the output amplitude is advantageously normalized. This energy compensation gain makes it possible to respect the energy of the binauralized signals. It corrects the energy of the binauralized signals according to the degree of correlation of the input signals
- the energy compensation gain is a function of the correlation between the input signals.
- the correlation between signals is advantageously taken into account.
- At least one output signal is given by applying a formula of the type:
- W k i a weight weighting among the weighting weight, s - TDD Corres 0n p (j is the application period for compensation, which is multiplication, and where * is the convolution operator.
- a decorrelation step is applied to the input signals prior to the application of the second transfer functions.
- at least one output signal is thus obtained by applying a formula of the type:
- I d (l) a decorrelated input signal among said input signals, the other values being those defined above.
- the energy differences due to the energy differences between the correlated signal additions and the decorrelated signal additions can be taken into account.
- the decorrelation is applied prior to filtering. Thus, it is possible to dispense with energy compensation steps during the filtering.
- At least one output signal is obtained by applying a formula of the type: ⁇ .3 ⁇ 4- * o
- G the determined energy compensation gain, the other values being those defined above.
- G does not depend on 1 (1).
- the weight for the weighting is given by applying a formula of the type: with k the index relating to an output signal, l € fl; Ij the index relating an input signal among the input signals, L the number of input signals, with S Bm k the energy of a transfer function with room effect among the second transfer functions with room effect, an energy relative to gain in normalization.
- the invention also relates to a computer program comprising instructions for implementing the method described above.
- the invention can be implemented by a sound spatialization device, comprising at least one summation filter applied to at least two input signals (1 (1), 1 (2), I (L)), the filter using: at least one first room effect transfer function (A k (1), A k (2), A k (L)), this first transfer function being specific to each input signal, and at least one second a room effect transfer function (B mean k ), this second transfer function being common to all the input signals.
- the device is such that it comprises weighting modules for weighting at least one input signal with a weighting weight, said weighting weight being specific to each of the input signals.
- Such a device can take the physical form of, for example, a processor and possibly a working memory, typically in a communication terminal.
- the invention can also be implemented in a sound signal decoding module, as input signals, comprising the spatialization device described above.
- FIG. 1 illustrates a method of spatialization of the FIG. 2 schematically illustrates the steps of a method in the sense of the invention, in an exemplary embodiment
- FIG. 3 represents a BRIR binaural impulse response
- FIG. 4 schematically illustrates the steps of FIG. a method in the sense of the invention, in an exemplary embodiment
- - Figure 5 schematically illustrates the steps of a method in the sense of the invention, in an exemplary embodiment
- Figure 6 schematically shows a device comprising means implementation of the method within the meaning of the invention.
- a TER terminal-connected device for example a telephone, smartphone or other device, or a connected tablet, a computer connected, or others.
- a device TER comprises reception means (typically an antenna) of audio signals Xc encoded in compression, a decoding device DECOD delivering decoded signals X ready to be processed by a spatialization device before the audio signals are returned (for example by in binaural on a CAS headset).
- a spatialization device for example in the field of sub-bands
- it may be advantageous to keep the partially decoded signals for example in the field of sub-bands
- the spatialization processing is carried out in the same domain (frequency processing in the field of sub-bands by example).
- the spatialization device is presented by a combination of elements: hardware typically comprising one or more CIR circuits cooperating with a working memory MEM and a processor PROC, and software, of which FIGS. are examples of flowcharts illustrating the general algorithm.
- the cooperation between the hardware and software elements produces a technical effect providing in particular an economy of complexity of the spatialization for substantially the same audio rendering (same sensation for a listener), as will be seen below.
- a data preparation is performed. This preparation is optional, the signals can be processed according to steps S22 and following without this pre-treatment.
- this preparation consists in truncating each BRIR to ignore the inaudible samples at the beginning and at the end of the impulse response.
- This preparation for the truncation at the beginning of the TRONC S impulse response, in step S211, consists in determining a start time of direct sound waves and can be implemented by the following steps:
- a cumulative sum of the energies of each of the BRIR filters (1) is calculated. Typically, this energy is computed by a sum squared of the amplitudes of samples 1 to j, with j included in [1; J] with J the sample number of a BRIR filter.
- the energy value of the maximum energy filter valMax (among the filters relating to the left ear and the right ear) is calculated.
- the index for which the energy of each of the BRIR filters (l) exceeds a certain threshold in dB calculated with respect to valMax is calculated.
- the truncation index iT retained for all BRIRs is the minimum index among all the indices of the BRIRs and is considered as the moment of beginning of direct sound waves.
- the index iT obtained therefore corresponds to the number of samples to be ignored for each of the BRIRs. Abrupt truncation at the beginning of an impulse response with a rectangular window can lead to audible artifacts if it is applied in too much energy. It may therefore be preferable to apply a suitable input fade window, however if precautions have been taken in the selected threshold, this windowing becomes useless, because inaudible (just cut the inaudible signal).
- the synchronism between BRIR makes it possible to apply a constant delay for all BRIRs for the sake of simplicity of implementation, even if an optimization of complexity is possible.
- each BRIR to ignore the inaudible samples at the end of the impulse response TRONC E, in step S212 can be performed from steps similar to those described above, adapted to suit the end of the impulse response. Sudden truncation at the end of an impulse response with a rectangular window may lead to audible artifacts on pulse signals where the reverb tail may be audible. Thus, in one embodiment, a suitable output fade window is applied.
- ISOL A / B synchronism isolation is performed.
- This isolation in synchronism consists of separating, for each BRIR, the part “direct sound” and “first reflections” (or Direct, noted A) and the part “diffuse sound” (or Diffus, noted B).
- the treatment to be performed on the "diffuse sound” part may advantageously be different from that to be performed on the "direct sound” part, since it is preferable to have a better quality of treatment on the "direct sound” part. Only on the "diffuse sound” part. This makes it possible to optimize the quality / complexity ratio.
- Figure 3 shows the iDD partitioning index at the 2000 sample.
- the left part of this iDD index corresponds to part A.
- the right part of this iDD index corresponds to part B.
- these two parts are isolated, without windowing, in order to undergo different treatments.
- a windowing between the parts A (1) and B (1) is applied.
- the iDD index may be specific to the room for which the BRIRs were determined. The calculation of this index may therefore depend on the spectral envelope, the correlation of the BRIRs or the echogram of these BRIRs.
- iDD is a fixed value, typically 2000. In one variant, iDD varies, advantageously dynamically, depending on the environment from which the input signals are captured.
- the output signal for the left (g) and right (d) ears, represented by O s' - & , is written as follows:
- the sample indices selected for A and B may also consider frame lengths in the case of integration into an audio encoder. Indeed, typical frame sizes of 1024 samples can lead to a choice such that A makes 1024 and B makes 2048, making sure that B is a diffuse field area for all BRIRs. In particular, it may be interesting that the size of B is a multiple of the size of A because if the filtering is implemented in blocks of FFT, then the calculation of an FFT for A can be reused for B.
- a diffuse field is characterized by the fact that it is statistically identical in all points of the room. Thus, its frequency response varies little depending on the speaker to simulate.
- the present invention exploits this feature in order to replace all Diffus D (l) filters of all BRIRs with a single and only one "mean" B mean filter in order to greatly reduce the complexity due to multiple convolutions.
- the value of the mean filter B mean is calculated.
- this average filter can be obtained by averaging time samples. In a variant, it can be obtained by any other type of averaging such as averaging power spectral densities.
- the energy of the average filter 3 ⁇ 4 " . can be measured directly from the filter built & MSA ? " ⁇ Alternatively, it can also be estimated by taking into account the assumption that Bnoim 3i filters" ⁇ i are uncorrelated. Indeed, in this case, as we sum unit energy signals, we have:
- the energy can be calculated on all the samples corresponding to the diffuse field part.
- step S23B2 the value of the weighting factor W S ⁇ a ( ⁇ ) is calculated.
- a single weighting factor to be applied to the input signal is calculated, taking into account the standardizations of the Diffus filters and the average filter:
- the L convolutions with the diffuse field portion are replaced by a single convolution with a mean filter, with a weighted sum of the input signal.
- step S23B3 it is possible to calculate a gain G correcting the gain of the average filter & msa. Indeed, in the case of the convolution between the input signals and the unmatched filters, whatever the correlation values between the input signals, the filtering by decorrelated filters that are the B Si d ⁇ 1) leads to signals to be summed, which are also decorrelated. Conversely, in the case of the convolution between the input signals and the approximated average filter, the energy of the signal resulting from the summation of the filtered signals will depend on the correlation value existing between the input signals.
- This case is equivalent to the previous one in the sense that the signals coming from the filtering are all decorrelated, thanks to the input signals in the first case, and thanks to the filters in the second case.
- the gain Sf / f) can be estimated by a calculation of correlation between each of the signals. It can also be estimated by comparing the energies of the signals before and after summations. In this case, the gain G may vary dynamically over time, depending for example on correlations between the input signals, which vary themselves over time.
- the constant gain G can then be applied offline to the weighting factors (thus 7777777 :), or to the -Bmean filter, which will avoid the application of an additional gain on the fly.
- the processing of the multichannel signal by applying the Direct (A) and Diffus (B) filters for each of the ears is carried out as follows:
- the S4A1 to S4AL is applied to the multichannel input signal effective filtering (eg direct convolution based -FFT) by the Direct filters (A), as described in the state of the art.
- a signal 0 ° ' As a function of the relationships between the input signals, in particular as a function of their correlation, it is optionally possible to correct, in step S4B11, the gain of the average filter Bmsayi S / P by the application of the gain G to the output signals after summation of the signal signals.
- input previously weighted steps M4B1 to M4BL.
- the multichannel signal B at step S4B1 is applied efficiently by means of the mean diffuse filter B mean . This step takes place after summing the previously weighted input signals (steps M4B1 to M4BL).
- a delay iDD is applied to the signal ⁇ ' ' "in order to compensate for the delay introduced during the step of isolating the signal B in step S4B2 .-
- the signals ⁇ ⁇ "" and ⁇ T " are summed.
- step S41 the input signal is applied with a delay iT corresponding to the inaudible samples deleted.
- the signals are not only calculated for the left and right ears (indices g and d above) but for k playback devices (typically loudspeakers).
- the gain G is applied prior to the summing of the input signals, that is to say during the weighting steps (steps M4B1 to M4BL).
- a decorrelation is applied to the input signals.
- the signals are decorrelated after convolution by the B mean filter regardless of the original correlations between input signals.
- An efficient implementation of decorrelation (for example using a loopback network) can be used to avoid the use of expensive decorrelating filters.
- the invention can find a direct application in the MPEG-H 3D Audio standard.
- the Direct A signal is not approximated by an average filter.
- an average filter of A it is possible to use an average filter of A to make the convolutions (steps S4A1 to S4AL) with the signals coming from the loudspeakers.
- An embodiment has been described above based on the processing of multichannel content generated for L speakers.
- the multichannel content can be generated by any type of audio source such as voice, a musical instrument, any noise, etc.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Stereophonic System (AREA)
Abstract
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR1357299A FR3009158A1 (en) | 2013-07-24 | 2013-07-24 | SPEECH SOUND WITH ROOM EFFECT |
PCT/FR2014/051728 WO2015011359A1 (en) | 2013-07-24 | 2014-07-04 | Sound spatialization with room effect |
Publications (2)
Publication Number | Publication Date |
---|---|
EP3025514A1 true EP3025514A1 (en) | 2016-06-01 |
EP3025514B1 EP3025514B1 (en) | 2019-09-11 |
Family
ID=49876752
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP14748239.2A Active EP3025514B1 (en) | 2013-07-24 | 2014-07-04 | Sound spatialization with room effect |
Country Status (8)
Country | Link |
---|---|
US (1) | US9848274B2 (en) |
EP (1) | EP3025514B1 (en) |
JP (1) | JP6486351B2 (en) |
KR (2) | KR102310859B1 (en) |
CN (1) | CN105684465B (en) |
ES (1) | ES2754245T3 (en) |
FR (1) | FR3009158A1 (en) |
WO (1) | WO2015011359A1 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3001701B1 (en) * | 2014-09-24 | 2018-11-14 | Harman Becker Automotive Systems GmbH | Audio reproduction systems and methods |
US10187740B2 (en) * | 2016-09-23 | 2019-01-22 | Apple Inc. | Producing headphone driver signals in a digital audio signal processing binaural rendering environment |
JP1640846S (en) * | 2018-10-16 | 2019-09-09 | ||
CN109584892A (en) * | 2018-11-29 | 2019-04-05 | 网易(杭州)网络有限公司 | Audio analogy method, device, medium and electronic equipment |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2851879A1 (en) * | 2003-02-27 | 2004-09-03 | France Telecom | PROCESS FOR PROCESSING COMPRESSED SOUND DATA FOR SPATIALIZATION. |
GB0419346D0 (en) * | 2004-09-01 | 2004-09-29 | Smyth Stephen M F | Method and apparatus for improved headphone virtualisation |
JP2006279588A (en) * | 2005-03-29 | 2006-10-12 | Yamaha Corp | Terminal for multispot teleconference |
CN102395098B (en) * | 2005-09-13 | 2015-01-28 | 皇家飞利浦电子股份有限公司 | Method of and device for generating 3D sound |
KR101562379B1 (en) * | 2005-09-13 | 2015-10-22 | 코닌클리케 필립스 엔.브이. | A spatial decoder and a method of producing a pair of binaural output channels |
JP2010118978A (en) * | 2008-11-14 | 2010-05-27 | Victor Co Of Japan Ltd | Controller of localization of sound, and method of controlling localization of sound |
US9431987B2 (en) * | 2013-06-04 | 2016-08-30 | Sony Interactive Entertainment America Llc | Sound synthesis with fixed partition size convolution of audio signals |
-
2013
- 2013-07-24 FR FR1357299A patent/FR3009158A1/en not_active Withdrawn
-
2014
- 2014-07-04 ES ES14748239T patent/ES2754245T3/en active Active
- 2014-07-04 US US14/906,311 patent/US9848274B2/en active Active
- 2014-07-04 KR KR1020217001620A patent/KR102310859B1/en active IP Right Grant
- 2014-07-04 WO PCT/FR2014/051728 patent/WO2015011359A1/en active Application Filing
- 2014-07-04 CN CN201480052602.XA patent/CN105684465B/en active Active
- 2014-07-04 EP EP14748239.2A patent/EP3025514B1/en active Active
- 2014-07-04 KR KR1020167003222A patent/KR102206572B1/en active IP Right Grant
- 2014-07-04 JP JP2016528570A patent/JP6486351B2/en active Active
Non-Patent Citations (1)
Title |
---|
See references of WO2015011359A1 * |
Also Published As
Publication number | Publication date |
---|---|
FR3009158A1 (en) | 2015-01-30 |
KR20210008952A (en) | 2021-01-25 |
US20160174013A1 (en) | 2016-06-16 |
KR102206572B1 (en) | 2021-01-22 |
KR102310859B1 (en) | 2021-10-12 |
WO2015011359A1 (en) | 2015-01-29 |
ES2754245T3 (en) | 2020-04-16 |
CN105684465B (en) | 2018-06-12 |
JP6486351B2 (en) | 2019-03-20 |
JP2016527815A (en) | 2016-09-08 |
EP3025514B1 (en) | 2019-09-11 |
CN105684465A (en) | 2016-06-15 |
KR20160034942A (en) | 2016-03-30 |
US9848274B2 (en) | 2017-12-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1600042B1 (en) | Method for the treatment of compressed sound data for spatialization | |
EP1992198B1 (en) | Optimization of binaural sound spatialization based on multichannel encoding | |
EP1999998B1 (en) | Method for binaural synthesis taking into account a spatial effect | |
JP5406956B2 (en) | System for extracting and modifying the echo content of an audio input signal | |
JP5090436B2 (en) | Method and device for efficient binaural sound spatialization within the transform domain | |
EP2005420A1 (en) | Device and method for encoding by principal component analysis a multichannel audio signal | |
EP3025514B1 (en) | Sound spatialization with room effect | |
EP3400599B1 (en) | Improved ambisonic encoder for a sound source having a plurality of reflections | |
JP7453997B2 (en) | Packet Loss Concealment for DirAC-based Spatial Audio Coding | |
WO2018115666A1 (en) | Processing in sub-bands of an actual ambisonic content for improved decoding | |
FR3065137A1 (en) | SOUND SPATIALIZATION METHOD | |
FR3009121A1 (en) | METHOD OF SUPPRESSING LATE REVERBERATION OF A SOUND SIGNAL | |
EP1994526B1 (en) | Joint sound synthesis and spatialization | |
EP4184505B1 (en) | Complexity optimized sound spatialization with room effect | |
EP4042418B1 (en) | Determining corrections to be applied to a multichannel audio signal, associated coding and decoding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20160107 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
AX | Request for extension of the european patent |
Extension state: BA ME |
|
DAX | Request for extension of the european patent (deleted) | ||
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
17Q | First examination report despatched |
Effective date: 20180322 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: GRANT OF PATENT IS INTENDED |
|
INTG | Intention to grant announced |
Effective date: 20190416 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE PATENT HAS BEEN GRANTED |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: REF Ref document number: 1180058 Country of ref document: AT Kind code of ref document: T Effective date: 20190915 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 602014053445 Country of ref document: DE Ref country code: IE Ref legal event code: FG4D Free format text: LANGUAGE OF EP DOCUMENT: FRENCH |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: MP Effective date: 20190911 |
|
REG | Reference to a national code |
Ref country code: LT Ref legal event code: MG4D |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20191211 Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: NO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20191211 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: HR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20191212 Ref country code: RS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: MK05 Ref document number: 1180058 Country of ref document: AT Kind code of ref document: T Effective date: 20190911 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2754245 Country of ref document: ES Kind code of ref document: T3 Effective date: 20200416 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20200113 Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20200224 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: SM Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 602014053445 Country of ref document: DE |
|
RAP2 | Party data changed (patent owner data changed or rights of a patent transferred) |
Owner name: ORANGE |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
PG2D | Information on lapse in contracting state deleted |
Ref country code: IS |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20200112 |
|
26N | No opposition filed |
Effective date: 20200615 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
REG | Reference to a national code |
Ref country code: BE Ref legal event code: MM Effective date: 20200731 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200704 Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200704 Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200731 Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200731 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200731 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: MT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20190911 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: ES Payment date: 20230801 Year of fee payment: 10 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20240620 Year of fee payment: 11 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20240619 Year of fee payment: 11 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: IT Payment date: 20240619 Year of fee payment: 11 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20240619 Year of fee payment: 11 |