EP3934282A1 - Verfahren zur umwandlung eines ersten satzes repräsentativer signale eines schallfelds in einen zweiten satz von signalen und entsprechende elektronische vorrichtung - Google Patents

Verfahren zur umwandlung eines ersten satzes repräsentativer signale eines schallfelds in einen zweiten satz von signalen und entsprechende elektronische vorrichtung Download PDF

Info

Publication number
EP3934282A1
EP3934282A1 EP21181285.4A EP21181285A EP3934282A1 EP 3934282 A1 EP3934282 A1 EP 3934282A1 EP 21181285 A EP21181285 A EP 21181285A EP 3934282 A1 EP3934282 A1 EP 3934282A1
Authority
EP
European Patent Office
Prior art keywords
frequency band
signals
temporal frequency
values
space
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP21181285.4A
Other languages
English (en)
French (fr)
Inventor
Nicolas Epain
François SALMON
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fondation B Com
Original Assignee
Fondation B Com
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fondation B Com filed Critical Fondation B Com
Publication of EP3934282A1 publication Critical patent/EP3934282A1/de
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/02Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/07Synergistic effects of band splitting and sub-band processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Definitions

  • the present invention relates to the technical field of processing signals representative of a sound field.
  • It relates in particular to a method of converting a first set of signals representative of a sound field into a second set of signals and an associated electronic device.
  • the signals of the first set sometimes have in this situation a format which is not directly usable by the reproduction system.
  • This is typically a scene-oriented format, such as an HOA (for "High-Order Ambisonics”) format.
  • this solution is based on the estimation of at least one dominant direction per frequency band by means of an analysis of the signals of the first set.
  • the electronic device stores for example, for each temporal frequency band, data associated with a number of particular directions in space equal to the number of signals in the first set of signals, which makes it possible to obtain optimal processing.
  • the input signals are converted into a representation in plane waves in the different directions associated with the frequency band concerned.
  • the particular directions associated with the data stored for a given temporal frequency band are for example distributed (possibly evenly) among all the directions in space.
  • the number of signals in the second set is for example strictly greater than the number of signals in the first set. In this case, the conversion allows an artificial increase in the spatial resolution of the sound scene represented.
  • the set of said particular directions can comprise at least 50 particular directions, for example between 50 and 5,000 particular directions.
  • the values respectively associated with said temporal frequency bands can be determined by time-frequency transformation on the basis of the signals of the first set.
  • Each signal of the second set can for its part be constructed by frequency-time transformation on the basis of the values associated with this signal of the second set and obtained respectively for the different temporal frequency bands.
  • the conversion step can be carried out in practice by matrix multiplication of a vector comprising the values associated with the relevant temporal frequency band and determined for the various signals of the first set.
  • the matrix used for this matrix multiplication with respect to a given temporal frequency band can comprise the data stored for this given temporal frequency band and associated with the different particular directions assigned to this given temporal frequency band.
  • the step of determining a plurality of values associated respectively with the different signals of the second set can be carried out by matrix multiplication of a vector comprising said at least one value representative of a source. sound virtual and obtained in the conversion step for the temporal frequency band concerned. It is thus possible to pass from a representation in plane waves (by means of the values representative of sound sources) to a representation corresponding to the signals of the second set (output signals).
  • the method can furthermore comprise preliminary steps of defining a plurality of directions of space by an optimization process, of attributing the directions of space of the plurality to said bands of temporal frequency, and of memorization, for each temporal frequency band, of said at least one datum associated with the direction of space attributed to the frequency band concerned.
  • the figure 1 represents an electronic device 2 for converting a first set of signals (or input signals) representative of a sound field in a space into a second set of signals (or output signals).
  • the space concerned is the space for the propagation of sound waves; this space is here three-dimensional. As a variant, however, this space could be two-dimensional (for example in the case of a two-dimensional representation of a three-dimensional system).
  • the figure 1 represents the electronic device 2 in the form of functional blocks (each forming a module or a unit such as (Ile) as described below).
  • each of these functional blocks can be achieved by the cooperation of software elements, such as computer program instructions executable by a processor of the electronic device, and hardware elements, for example this same processor and a memory of the electronic device 2.
  • This memory can moreover store the aforementioned computer program instructions.
  • the input signals are for example ambisonic signals of order L.
  • the first set in this case comprises (L + 1) 2 signals.
  • each vector b E (t) is therefore of dimension (L + 1) 2 , here of dimension 4.
  • the values taken by the different signals (and thus the different elements of the vectors b E (t)) are for example complex values; alternatively, these values could be actual values.
  • temporal frequency bands of the sound field
  • temporal frequency bands are disjoint two by two and cover (united) the spectrum of audible frequencies.
  • the plurality of temporal frequency bands comprises for example between 100 and 1000 temporal frequency bands, here 256 temporal frequency bands.
  • Each temporal frequency band has for example a width of between 10 Hz and 500 Hz.
  • the electronic device 2 comprises a storage unit 4 designed to store, for each temporal frequency band of this plurality of temporal frequency bands, at least one datum associated with a particular direction ⁇ j of space.
  • the storage unit 4 stores, for each temporal frequency band, data associated with a number of particular directions ⁇ j of space equal to the number of signals in the first set of signals (input signals), i.e. (L + 1) 2 in the case of ambisonic input signals of order L.
  • ⁇ 1 (f), ⁇ 2 (f), ..., ⁇ ( L + 1 ) 2 (f) the directions thus associated with a given temporal frequency band.
  • the datum associated with a particular direction ⁇ j of space can be a datum defining this particular direction of space, for example by means of an azimuth angle and / or an elevation angle.
  • the datum associated with a particular direction ⁇ j of space can also be a datum making it possible to perform a calculation linked to this particular direction ⁇ j .
  • a particular direction ⁇ k (f) several coefficients D k, i (f) (forming a row of a matrix D (f)) allowing respectively to obtain the contribution of different input signals to a plane wave in the particular direction ⁇ k (f), as explained later.
  • Each particular direction ⁇ j is here defined by an azimuth angle ⁇ (on the abscissa on the figure 2 ) and an elevation angle ⁇ (on the y-axis on the figure 2 ).
  • the set of particular directions ⁇ j of space associated with a datum stored for at least one temporal frequency band forms a mesh of all of the directions of space.
  • the set of particular directions ⁇ j comprises for example more than 50 particular directions.
  • this mesh is not a regular mesh in the example described. As a variant, however, it could be a regular mesh (for example with a constant pitch in azimuth and a constant pitch in elevation).
  • the set of particular directions ⁇ j comprises at least 5 particular directions ⁇ j defined by an azimuth ⁇ included in this range of azimuth values and an elevation ⁇ included in this range of elevation values.
  • the set of directions particulars comprises at least one other particular direction ⁇ j ' defined by an elevation ⁇ ' included in this range of elevation values and by an azimuth ⁇ 'which differs by less than 30 ° from the given azimuth ⁇ .
  • the set of particular directions includes at least one other particular direction ⁇ j ' defined by an azimuth ⁇ ' included in this range of azimuth values and by an elevation ⁇ 'which differs from less than 30 ° of the given elevation ⁇ .
  • the electronic device 2 comprises a reception module 6 designed to receive data representative of the input signals (signals of the first set), here the vectors b E (t) respectively associated with the successive instants of the time interval considered.
  • This reception module 6 can be a communication module capable of receiving the data representative of the input signals coming from another electronic device.
  • the reception module 6 can be a module for reading data representative of the input signals in a memory (such as the already mentioned memory of the electronic device 2).
  • the electronic device 2 comprises a configuration module 8 designed to configure the other modules, in particular as a function of the input signals b E (t) (in particular as a function of the format of the input signals b E (t)).
  • the electronic device 2 can include a detection module 10 designed to analyze the input signals b E (t) and to communicate to the configuration module information I indicative of the format of the input signals b E (t ).
  • This information I is for example the number of signals making up the input signals b E (t).
  • the data representative of the input signals b E (t) (received by the reception module 6) can comprise metadata M indicative of the format of the input signals b E (t).
  • the module of reception 6 transmits these metadata M to the configuration module 8, as shown in dotted lines on the figure 1 .
  • the electronic device 2 furthermore comprises a transformation module 12 designed to determine, for each of the input signals (signals of the first set), values respectively associated with the different temporal frequency bands.
  • the transformation module 12 determines, on the basis of the values ⁇ i (t) relating to a given input signal (denoted by the index i), values ⁇ i (f) respectively associated with the different frequency bands and representative of this same input signal in the frequency domain.
  • the values ⁇ i (f) respectively associated with the different temporal frequency bands are for example determined by time-frequency transformation (such as a short-term Fourier transformation) on the basis of the ⁇ values i (t) taken over time (over the time interval considered) by this signal from the first set.
  • ⁇ (f) [ ⁇ 1 (f ), ⁇ 2 (f), ..., ⁇ ( L + 1 ) 2 (f)] T.
  • the electronic device 2 comprises a decoding module 14 designed to convert, for each temporal frequency band, the values ⁇ 1 (f), ⁇ 2 (f), ⁇ (L + 1) 2 (f) associated with the band of temporal frequency concerned and determined respectively for the different signals of the first set, in values ⁇ 1 (f), ⁇ 2 (f), ⁇ (L + 1) 2 (f) each representative of a virtual sound source oriented in l 'one of the directions ⁇ 1 (f), Q 2 (f), ..., ⁇ ( L + 1 ) 2 (f) of the space associated with the data stored for the temporal frequency band concerned.
  • the decoding module 14 performs for example, for each temporal frequency band, the aforementioned conversion by matrix multiplication of the vector ⁇ (f), which comprises, as already indicated, the values ⁇ 1 (f), ⁇ 2 (f), ⁇ ( L + 1) 2 (f) associated with the relevant temporal frequency band and determined respectively for the different signals of the first set.
  • the matrices D (f) are such that the values ⁇ 1 (f), ⁇ 2 (f), ⁇ (L + 1) 2 (f) on the one hand and the values ⁇ 1 (f), ⁇ 2 (f ), ..., ⁇ (L + 1) 2 (f) on the other hand represent the same sound field, but in two different representations, here an ambisonic representation for the values ⁇ 1 (f), ⁇ 2 (f) , ⁇ (L + 1) 2 (f) and a representation in plane waves oriented according to the particular directions of space associated with the frequency band concerned for the values ⁇ 1 (f), ⁇ 2 (f), .. ., ⁇ (L + 1) 2 (f).
  • each matrix D (f) allows, for a temporal frequency band, the passage from an ambisonic representation to a plane wave representation.
  • Each matrix D (f) is therefore formed of elements D k, i which each represent the coefficient to be assigned to a value ⁇ i (f) (obtained for an input signal ⁇ i (t)) to determine its contribution to the plane wave emitted by the virtual sound source oriented in the direction ⁇ k (f).
  • each matrix D (f) is a square matrix, of dimension equal to the number of signals in the first set, here (L + 1) 2 .
  • the decoding module 14 can in practice comprise a plurality of conversion units 16 each designed to carry out the aforementioned conversion for a given temporal frequency band, that is to say here to carry out the multiplication of a vector ⁇ (f) received from the transformation module 12 by the matrix D (f) associated with this frequency band.
  • the electronic device 2 comprises an encoding module 18 designed to determine, for each temporal frequency band, a plurality of values ⁇ 1 (f), ⁇ 2 (f), ..., ⁇ N (f) respectively associated with the different signals of the second set (output signals), on the basis of the values ⁇ 1 (f), ⁇ 2 (f), ..., ⁇ (L + 1) 2 (f) representative of the virtual sound sources and obtained by the aforementioned conversion for the relevant temporal frequency band.
  • N the number of signals in the second set.
  • the number N of signals in the second set is strictly greater than the number of signals (here equal to (L + 1) 2 ) in the first set. This is particularly the case when the processing carried out by the electronic device, described below with reference to the figure 3 , aims to artificially increase the spatial resolution of sound scenes (functionality which is sometimes referred to by the English term "upscaling").
  • the order L ′ of the output signals is strictly greater than the order L of the input signals.
  • the encoding module 18 determines, for each temporal frequency band, the plurality of values ⁇ 1 (f), ⁇ 2 (f), ..., ⁇ N (f) associated respectively with the different signals of the second set by matrix multiplication (by means of a matrix E (f)) of the vector ⁇ (f) comprising the values ⁇ 1 (f), ⁇ 2 (f), ..., ⁇ (L + 1 ) 2 (f) representative of the virtual sound sources and obtained in the conversion step for the relevant temporal frequency band.
  • Such a matrix E (f) therefore has here a number of columns equal to the number of signals in the first set (here (L + 1) 2 ) and a number of rows equal to the number N of signals in the second set.
  • the encoding module 18 can in practice comprise a plurality of processing units 20 each designed to perform the transformation which has just been described for a given temporal frequency band, that is to say here to perform the transformation. multiplication of a vector ⁇ (f) received from the decoding module 14 (precisely here: received from a conversion unit 16) by the matrix E (f) associated with this frequency band.
  • the electronic device 2 finally comprises a construction module 22 designed to construct each signal ⁇ i (t) of the second set on the basis of the values ⁇ i (f) associated with this signal ⁇ i (t) of the second set and obtained respectively for the values. different time frequency bands.
  • the construction module 22 constructs for example each signal ⁇ ⁇ (t) of the second set by frequency-time transformation (such as an inverse short-term Fourier transformation) on the basis of the values ⁇ i (f) associated with this signal of the second set and obtained respectively for the different temporal frequency bands.
  • frequency-time transformation such as an inverse short-term Fourier transformation
  • N output signals (signals of the second set), precisely here, for each output signal, a set of values ⁇ i (t) forming this output signal for the different instants t (successive) of the interval of time considered.
  • b s (t) [ ⁇ 1 (t), ⁇ 2 (t), ..., ⁇ N (t)] T.
  • the figure 3 represents in the form of a flowchart a conversion method in accordance with the invention. This method is for example implemented by the electronic device of the figure 2 , as described below.
  • step E2 of determining the format of the input signals b E (t), received here by the reception module 6.
  • This step E2 is for example implemented by the detection module 10.
  • this step E2 could be implemented by the configuration module 8 by reading metadata M indicative of the format of the input signals b E (t).
  • This step E2 here makes it possible to determine the number of signals present in the first set of signals.
  • the process of figure 3 then comprises a step E4 of configuring the decoding module 14 and / or the encoding module 18 as a function of the format determined in step E2.
  • This configuration step E4 is implemented here by the configuration module 8.
  • This step E2 can also comprise the configuration (here by the configuration module 8) of other elements of the electronic device 2, such as the transformation module 12 and / or the construction module 22.
  • the configuration module 8 configures the transformation module 12 and / or the construction module 22 according to the number of temporal frequency bands to be used (this number can be stored in a memory of the electronic device 2 and / or entered by a user via a user interface - not shown - of the electronic device 2).
  • the configuration module 8 determines (as a function of the format determined in step E2) the matrices D (f) to be used, and configures the conversion units 16 respectively by means of these matrices D (f).
  • the configuration module 8 determines for example the matrices D (f) to be used as a function of the number of signals present in the first set of signals.
  • the configuration module 8 reads a set of matrices D (f) stored (for example in the memory of the electronic device 2) in association with this number of signals in the first set of signals.
  • the configuration module 8 could send this number of signals in the first set of signals to a remote server and receive in response the associated set of matrices D (f).
  • the configuration module 8 performs a method such as that described below in figure 4 to define a plurality of directions ⁇ j of space, assign these directions ⁇ j of space to the temporal frequency bands, and construct, for each temporal frequency band, the matrix D (f) using the directions ⁇ 1 (f), Q 2 (f), ..., ⁇ (L + 1) 2 (f) of the space assigned to the relevant temporal frequency band (construction of the matrix D (f) using different directions ⁇ 1 (f), ⁇ 2 (f), ..., ⁇ (L + 1) 2 (f) of the space being already presented above).
  • the matrices D (f) thus constructed can be stored (for example in the memory of the electronic device 2) for later use (in accordance with the first possibility indicated above).
  • the configuration module 8 can determine the matrices E (f) to be used (for example according to the format of the output signals, here the number of output signals, which can be stored and / or entered by a user by means of the user interface of the electronic device 2), and configures the processing units 20 respectively by means of these matrices E (f).
  • the configuration module 8 determines for example the matrices E (f) to be used as a function of the number of signals present in the second set of signals (output signals).
  • the configuration module 8 reads a set of matrices E (f) stored (for example in the memory of the electronic device 2) in association with this number of signals in the second set of signals.
  • the configuration module 8 could send this number of signals in the second set of signals to a remote server and receive in response the associated set of matrices E (f).
  • the configuration module 8 performs a process such as that described below on figure 4 to define a plurality of directions ⁇ j of space, assign these directions ⁇ j of space to the temporal frequency bands, and construct, for each temporal frequency band, the matrix E (f) using the directions ⁇ 1 (f), Q 2 (f), ..., ⁇ (L + 1) 2 (f) of the space assigned to the relevant temporal frequency band (construction of the matrix E (f) using different directions ⁇ 1 (f), Q 2 (f), ..., ⁇ (L + 1) 2 (f) of the space being already presented above).
  • the matrices E (f) thus constructed can be stored (for example in the memory of the electronic device 2) for later use (in accordance with the first possibility indicated above).
  • the process of figure 3 then provides, for each of the signals ⁇ i (t) of the first set (input signals), a step E6 of determining values ⁇ i (f) respectively associated with the different temporal frequency bands.
  • these different values ⁇ i (f) respectively associated with the different temporal frequency bands represent the signal ⁇ i (t) concerned in the frequency domain.
  • This determination step E6 is here carried out by the transformation module 12.
  • the values ⁇ i (f) respectively associated with said temporal frequency bands can be determined by time-frequency transformation on the basis of the signals ⁇ i (t) of the first set.
  • the process of figure 3 then comprises, for each temporal frequency band, a step E8 of converting the values ⁇ i (f) associated with the relevant temporal frequency band and determined for the various signals ⁇ 1 (t), ⁇ 2 (t), .. ., ⁇ (L + 1) 2 (t) of the first set, in values ⁇ 1 (f), ⁇ 2 (f), ⁇ (L + 1) 2 (f) representative of virtual sound sources oriented respectively in the different directions of space ⁇ 1 (f), Q 2 (f), ..., ⁇ (L + 1) 2 (f) associated (for example attributed) to the temporal frequency band concerned.
  • one of the conversion units 16 performs a matrix product D (f) ⁇ (f) to obtain a vector ⁇ (f) formed of the values ⁇ 1 (f), ⁇ 2 (f ), ⁇ (L + 1) 2 (f) representative of virtual sound sources oriented respectively in the different directions of space ⁇ 1 (f), ⁇ 2 (f), ..., ⁇ (L + 1) 2 (f) for the relevant temporal frequency band.
  • the process of figure 3 then comprises a step E10 of determining, for each temporal frequency band, on the basis of the values ⁇ 1 (f), ⁇ 2 (f), ⁇ (L + 1) 2 (f) representative of the virtual sound sources and obtained at the step of converting E8 for the temporal frequency band concerned, of a plurality of values ⁇ 1 (f), ⁇ 2 (f), ..., ⁇ N (f) associated respectively with the signals of the second set (c 'that is to say to the N output signals).
  • one of the processing units 20 performs a matrix product E (f) ⁇ (f) to obtain a vector ⁇ (f) formed of values ⁇ 1 (f), ⁇ 2 (f ), ..., ⁇ N (f) respectively associated with the signals ⁇ 1 (t), ⁇ 2 (t), ..., ⁇ N (t) of the second set.
  • the different values ⁇ i (f) obtained for the different temporal frequency bands and associated with the same signal ⁇ i (t) of the second set form a representation of this signal ⁇ i (t) of the second set in the frequency domain.
  • the process of figure 3 then comprises a step E12 of construction of each signal ⁇ i (t) of the second set on the basis of the values ⁇ i (f) associated with this signal ⁇ ⁇ (t) of the second set and obtained respectively for the different time frequency bands.
  • Step E12 is implemented here by the construction module 22.
  • each signal ⁇ i (t) of the second set can be constructed by frequency-time transformation on the basis of the values ⁇ i (f) associated with this signal ⁇ i (t) of the second set and obtained respectively for the different bands of temporal frequency.
  • the figure 4 presents a method for defining and assigning particular directions ⁇ j of space to different temporal frequency bands.
  • This method begins with a step E20 of defining a plurality of directions in space by an optimization process, here an optimization process called a “Thomson problem”.
  • the plurality of directions in space thus obtained forms a mesh of all the directions in space, as already indicated.
  • each group the particular directions are distributed in space and therefore form in the example described here a tetrahedron (for example a regular tetrahedron).
  • Each of the 4F particular directions ⁇ j is modeled as a charged particle located on the surface of a sphere, and moving in solidarity with the other directions belonging to the same group, that is to say to the same tetrahedron. Two charged particles exert a repulsive force on each other similar to electrostatic interaction.
  • the process of figure 4 then comprises a step E22 of attribution of the particular directions of space obtained in step E20 to the F temporal frequency bands.
  • the tetrahedron assigned to the second temporal frequency band is that which corresponds to the smallest rotation with respect to the tetrahedron assigned to the first temporal frequency band.
  • the other tetrahedra are thus assigned successively to the different temporal frequency bands so that the angular distance between two successive groups of directions is as small as possible.
  • Two particular directions allocated to two adjacent frequency bands are thus neighboring within the mesh, which makes it possible to avoid jumps in the processing operations carried out for two neighboring frequency bands.
  • the process of figure 4 comprises a step E24 of construction and storage, for each temporal frequency band, of data associated with the particular directions ⁇ 1 (f), Q 2 (f), ..., ⁇ (L + 1) 2 (f) of space allocated to the frequency band concerned.
  • step E24 comprises the construction and storage of the matrix D (f) and / or of the matrix E (f) as indicated above, on the basis of particular directions ⁇ 1 (f), Q 2 (f), ..., ⁇ (L + 1) 2 (f) assigned to the frequency band concerned.
  • the invention which has just been described can be applied in various situations where it is desired to convert a first set of signals having a first format into a second set of signals having a second format.
  • loudspeakers for example by means of 10 loudspeakers or plus
  • L-order ambisonics signals to L-order ambisonics signals. strictly greater than L and to reproduce the converted signals on the loudspeakers in order to avoid the production of artifacts which are unpleasant to the ear.
  • ambisonic signals b E (t) of order L and ambisonic signals b '(t) of order L' strictly greater than L.
  • the ambisonic signals b '(t) represent (in detail) a sound in direct propagation between a sound source and the user, while the ambisonic signals b E (t) represent sounds arriving at the user after reflection and / or reverberation.
  • the use of ambisonic signals b E (t) of low order makes it possible to lighten the processing carried out on these signals (for example to produce these signals).
  • the various signals are considered as a scene-oriented format in which the base of space functions used consists of so-called “panning” functions.
  • a panning function expresses the gains applied to different speakers to give the impression to a listener that a sound source is in a given direction.
  • the VBAP method (for "Vector Base Amplitude Panning"), for example, makes it possible to calculate panning functions for a given set of loudspeakers. For example, we can refer to this subject in the article " Virtual Sound Source Positioning Using Vector Base Amplitude Panning ", by V. Pulkki, in Journal of the Audio Engineering Society, 45 (6), pp. 456-466, June 1997 .
  • the matrices D (f) and E ( f) mentioned above can in this case be constructed by concatenating the vectors made up of the panning gains for the different directions ⁇ j of plane waves.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Mathematical Physics (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • General Physics & Mathematics (AREA)
  • Algebra (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
EP21181285.4A 2020-06-30 2021-06-23 Verfahren zur umwandlung eines ersten satzes repräsentativer signale eines schallfelds in einen zweiten satz von signalen und entsprechende elektronische vorrichtung Pending EP3934282A1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
FR2006878A FR3112016B1 (fr) 2020-06-30 2020-06-30 Procédé de conversion d’un premier ensemble de signaux représentatifs d’un champ sonore en un second ensemble de signaux et dispositif électronique associé

Publications (1)

Publication Number Publication Date
EP3934282A1 true EP3934282A1 (de) 2022-01-05

Family

ID=73038106

Family Applications (1)

Application Number Title Priority Date Filing Date
EP21181285.4A Pending EP3934282A1 (de) 2020-06-30 2021-06-23 Verfahren zur umwandlung eines ersten satzes repräsentativer signale eines schallfelds in einen zweiten satz von signalen und entsprechende elektronische vorrichtung

Country Status (3)

Country Link
US (1) US20210407524A1 (de)
EP (1) EP3934282A1 (de)
FR (1) FR3112016B1 (de)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992012607A1 (en) * 1991-01-08 1992-07-23 Dolby Laboratories Licensing Corporation Encoder/decoder for multidimensional sound fields
EP2824662A1 (de) * 2013-06-14 2015-01-14 Nokia Corporation Audioverarbeitung
FR3060830A1 (fr) * 2016-12-21 2018-06-22 Orange Traitement en sous-bandes d'un contenu ambisonique reel pour un decodage perfectionne
US20180182402A1 (en) * 2014-07-02 2018-06-28 Dolby International Ab Method and apparatus for encoding/decoding of directions of dominant directional signals within subbands of a hoa signal representation

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220159402A1 (en) * 2019-03-28 2022-05-19 Sony Group Corporation Signal processing device and method, and program

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992012607A1 (en) * 1991-01-08 1992-07-23 Dolby Laboratories Licensing Corporation Encoder/decoder for multidimensional sound fields
EP2824662A1 (de) * 2013-06-14 2015-01-14 Nokia Corporation Audioverarbeitung
US20180182402A1 (en) * 2014-07-02 2018-06-28 Dolby International Ab Method and apparatus for encoding/decoding of directions of dominant directional signals within subbands of a hoa signal representation
FR3060830A1 (fr) * 2016-12-21 2018-06-22 Orange Traitement en sous-bandes d'un contenu ambisonique reel pour un decodage perfectionne

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
A. POLITISS. TERVOV. PULKKI: "COMPASS : Coding and Multidirectional Parametrization of Ambisonic Sound Scenes", PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFÉRENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP, April 2018 (2018-04-01)
V. PULKKI: "Virtual Sound Source Positioning Using Vector Base Amplitude Panning", JOURNAL OF THE AUDIO ENGINEERING SOCIETY, vol. 45, no. 6, June 1997 (1997-06-01), pages 456 - 466, XP002719359

Also Published As

Publication number Publication date
FR3112016B1 (fr) 2023-04-14
FR3112016A1 (fr) 2021-12-31
US20210407524A1 (en) 2021-12-30

Similar Documents

Publication Publication Date Title
EP2898707B1 (de) Optimierte kalibrierung eines klangwiedergabesystems mit mehreren lautsprechern
EP1836876B1 (de) Verfahren und vorrichtung zur individualisierung von hrtfs durch modellierung
EP1992198B1 (de) Optimierung des binauralen raumklangeffektes durch mehrkanalkodierung
EP2374123B1 (de) Verbesserte codierung von mehrkanaligen digitalen audiosignalen
EP2374124B1 (de) Verwaltete codierung von mehrkanaligen digitalen audiosignalen
EP1946612B1 (de) Hrtfs-individualisierung durch modellierung mit finiten elementen gekoppelt mit einem korrekturmodell
EP3807669B1 (de) Ortung von schallquellen in einer bestimmten akustischen umgebung
EP2901718B1 (de) Verfahren und vorrichtung zur wiedergabe eines audiosignals
EP1586220B1 (de) Verfahren und einrichtung zur steuerung einer wiedergabeeinheitdurch verwendung eines mehrkanalsignals
FR2844894A1 (fr) Procede et systeme de traitement d'une representation d'un champ acoustique
EP3040989B1 (de) Verbessertes trennverfahren und computerprogrammprodukt
EP1479266A2 (de) Verfahren und vorrichtung zur steuerung einer anordnung zur wiedergabe eines schallfeldes
EP1502475B1 (de) Verfahren und system zum repräsentieren eines schallfeldes
FR2858403A1 (fr) Systeme et procede de determination d'une representation d'un champ acoustique
EP3400599B1 (de) Verbesserter ambisonic-codierer für eine tonquelle mit mehreren reflexionen
EP3025514B1 (de) Klangverräumlichung mit raumwirkung
EP3934282A1 (de) Verfahren zur umwandlung eines ersten satzes repräsentativer signale eines schallfelds in einen zweiten satz von signalen und entsprechende elektronische vorrichtung
EP1994526B1 (de) Gemeinsame schallsynthese und -spatialisierung
WO2022106765A1 (fr) Localisation perfectionnée d'une source acoustique
WO2021130132A1 (fr) Procédé et système d'estimation d'une grandeur représentative de l'énergie sonore
EP3384688B1 (de) Aufeinanderfolgende dekompositionen von audiofiltern
EP4042418B1 (de) Bestimmung von korrekturen zur anwendung auf ein mehrkanalaudiosignal, zugehörige codierung und decodierung
EP4365683A1 (de) Verfahren und vorrichtung zur schätzung einer tiefenkarte in zusammenhang mit einem digitalen hologramm zur darstellung einer szene und computerprogramm dafür
WO2023110549A1 (fr) Procédé d'estimation d'une pluralité de signaux représentatifs du champ sonore en un point, dispositif électronique et programme d'ordinateur associés
WO2010076412A2 (fr) Procede et un dispositif d'estimation de signaux de source issus d'un signal de melange

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN PUBLISHED

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

B565 Issuance of search results under rule 164(2) epc

Effective date: 20211109

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20220616

RBV Designated contracting states (corrected)

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20231107