WO2013068284A1 - Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field - Google Patents

Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field Download PDF

Info

Publication number
WO2013068284A1
WO2013068284A1 PCT/EP2012/071537 EP2012071537W WO2013068284A1 WO 2013068284 A1 WO2013068284 A1 WO 2013068284A1 EP 2012071537 W EP2012071537 W EP 2012071537W WO 2013068284 A1 WO2013068284 A1 WO 2013068284A1
Authority
WO
WIPO (PCT)
Prior art keywords
noise
power
transfer function
filter
microphone
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/EP2012/071537
Other languages
English (en)
French (fr)
Inventor
Sven Kordon
Johann-Markus Batke
Alexander Krüger
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thomson Licensing SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS filed Critical Thomson Licensing SAS
Priority to JP2014540396A priority Critical patent/JP6113739B2/ja
Priority to US14/356,265 priority patent/US9420372B2/en
Priority to CN201280066109.4A priority patent/CN104041074B/zh
Priority to EP12788472.4A priority patent/EP2777298B1/en
Priority to KR1020147015683A priority patent/KR101957544B1/ko
Publication of WO2013068284A1 publication Critical patent/WO2013068284A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers
    • H04R3/005Circuits for transducers for combining the signals of two or more microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/326Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only for microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/406Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/40Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
    • H04R2201/4012D or 3D arrays of transducers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00Monitoring arrangements; Testing arrangements
    • H04R29/004Monitoring arrangements; Testing arrangements for microphones
    • H04R29/005Microphone arrays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/027Spatial or constructional arrangements of microphones, e.g. in dummy heads
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/15Aspects of sound capture and related signal processing for recording or reproduction

Definitions

  • the invention relates to a method and to an apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an Ambisonics representa ⁇ tion of the sound field, wherein an equalisation filter is applied to the inverse microphone array response.
  • Spherical microphone arrays offer the ability to capture a three-dimensional sound field.
  • One way to store and process the sound field is the Ambisonics representation.
  • Ambisonics uses orthonormal spherical functions for describing the sound field in the area around the point of origin, also known as the sweet spot. The accuracy of that description is determined by the Ambisonics order N, where a finite number of Ambisonics coefficients describes the sound field.
  • Ambisonics representation is that the reproduction of the sound field can be adapted individually to any given loudspeaker arrangement. Furthermore, this rep ⁇ resentation enables the simulation of different microphone characteristics using beam forming techniques at the post production .
  • the B-format is one known example of Ambisonics.
  • a B-format microphone requires four capsules on a tetrahedron to cap ⁇ ture the sound field with an Ambisonics order of one.
  • Ambisonics of an order greater than one is called Higher Order Ambisonics (HOA)
  • HOA microphones are typically spherical microphone arrays on a rigid sphere, for example the Eigenmike of mhAcoustics.
  • HOA Higher Order Ambisonics
  • For the Ambisonics processing the pressure distribution on the surface of the sphere is sampled by the capsules of the array. The sampled pressure is then converted to the Ambisonics representation.
  • Am ⁇ bisonics representation describes the sound field, but in ⁇ cluding the impact of the microphone array.
  • the impact of the microphones on the captured sound field is removed using the inverse microphone array response, which transforms the sound field of a plane wave to the pressure measured at the microphone capsules. It simulates the directivity of the capsules and the interference of the microphone array with the sound field.
  • the distorted spectral power of a reconstructed Ambisonics signal captured by a spherical microphone array should be equalised.
  • that distortion is caused by the spatial aliasing signal power.
  • due to the noise reduction for spherical microphone arrays on a rigid sphere higher order coefficients are missing in the spheri ⁇ cal harmonics representation, and these missing coefficients unbalance the spectral power spectrum of the reconstructed signal, especially for beam forming applications.
  • a problem to be solved by the invention is to reduce the distortion of the spectral power of a reconstructed Ambison ⁇ ics signal captured by a spherical microphone array, and to equalise the spectral power. This problem is solved by the method disclosed in claim 1. An apparatus that utilises this method is disclosed in claim 2.
  • the inventive processing serves for determining a filter that balances the frequency spectrum of the reconstructed Ambisonics signal.
  • the signal power of the filtered and re ⁇ constructed Ambisonics signal is analysed, whereby the im ⁇ pact of the average spatial aliasing power and the missing higher order Ambisonics coefficients is described for Ambi ⁇ sonics decoding and beam forming applications. From these results an easy-to-use equalisation filter is derived that balances the average frequency spectrum of the reconstructed Ambisonics signal: dependent on the used decoding coeffi ⁇ cients and the signal-to-noise ratio SNR of the recording, the average power at the point of origin is estimated.
  • the equalisation filter is obtained from:
  • the frequency response of the equalisation filter is formed from the square root of the fraction of a given reference power and the computed average spatial signal power at the point of origin.
  • the resulting filter is applied to the spherical harmonics representation of the recorded sound field, or to the recon ⁇ structed signals.
  • the design of such filter is highly compu ⁇ tational complex.
  • the computational complex processing can be reduced by using the computation of con- stant filter design parameters. These parameters are con ⁇ stant for a given microphone array and can be stored in a look-up table. This facilitates a time-variant adaptive fil ⁇ ter design with a manageable computational complexity.
  • the filter removes the raised average signal power at high frequencies. Furthermore, the filter balances the frequency response of a beam forming decoder in the spherical harmonics representation at low frequencies. With ⁇ out usage of the inventive filter the reconstructed sound from a spherical microphone array recording sounds unbal- anced because the power of the recorded sound field is not reconstructed correctly in all frequency sub-bands.
  • the inventive method is suited for processing microphone capsule signals of a spherical microphone array on a rigid sphere, said method including the steps:
  • the inventive apparatus is suited for process ⁇ ing microphone capsule signals of a spherical microphone ar ⁇ ray on a rigid sphere, said apparatus including:
  • said means being adapted for converting said microphone cap- sule signals representing the pressure on the surface of said microphone array to a spherical harmonics or Ambisonics representation ATM(t);
  • means being adapted for computing per wave number k an estimation of the time-variant signal-to-noise ratio SNR(k) of said microphone capsule signals, using the average source power ⁇ P 0 (k) ⁇ 2 of the piane wave recorded from said microphone array and the corresponding noise power
  • - means being adapted for computing per wave number k the average spatial signal power at the point of origin for a diffuse sound field, using reference, aliasing and noise signal power components, and for forming the frequency response of an equalisation filter from the square root of the fraction of a given ref ⁇ erence power and said average spatial signal power at the point of origin,
  • means being adapted for applying said adapted transfer function to said spherical harmonics representation ATM(t) using a linear filter processing, resulting in adapted directional coefficients d (t).
  • Fig. 1 power of reference, aliasing and noise components from the resulting loudspeaker weight for a microphone array with 32 capsules on a rigid sphere;
  • Fig. 3 average power of weight components following the op- timisation filter of Fig. 2, using a conventional
  • Fig. 4 average power of the weight components after the
  • Fig. 5 optimised array response for a conventional Ambison ⁇ ics decoder and an SNR(k) of 20dB;
  • Fig. 6 optimised array response for a beam forming decoder and an SNR(k) of 20dB;
  • Fig. 7 block diagram for the adaptive Ambisonics processing according to the invention.
  • Fig. 8 average power of the resulting weight after the
  • noise optimisation filter n (/c) and the filter E Q(/C) have been applied, using conventional Ambisonics de ⁇ coding, whereby the power of the optimised weight, the reference weight and the noise weight are com ⁇ pared;
  • Ambisonics decoding is defined by assuming loudspeakers that are radiating the sound field of a plane wave, cf. M.A.
  • f the frequency and c sound is the speed of sound.
  • ⁇ dex n runs from 0 to the finite order N, whereas index m runs from —n to n for each index n.
  • Equation (1) defines the conversion of the Ambisonics coef- ficients dTM(k) to the loudspeaker weights w(/2j,/c). These weights are the driving functions of the loudspeakers. The superposition of all speaker weights reconstructs the sound field .
  • the decoding coefficients DTM(J2i) are describing the general Ambisonics decoding processing. This includes the conjugated complex coefficients of a beam pattern as shown in section 3 ⁇ ⁇ ⁇ ) i n Morag Agmon, Boaz Rafaely, "Beamforming for a
  • the Ambisonics coefficients dTM(k) can always be decomposed into a superposition of plane waves, as described in section 3 in Boaz Rafaely, "Plane-wave decomposition of the sound field on a sphere by spherical convolution", J. Acoustical Society of America, vol.116, no.4, pages 2149-2157, 2004. Therefore the analysis can be limited to the coefficients of a plane wave impinging from a direction s :
  • the coefficients of a plane wave dTM ⁇ ane (k) are defined for the assumption of loudspeakers that are radiating the sound field of a plane wave.
  • the pressure at the point of origin is defined by P (k) for the wave number k .
  • the conjugated complex spherical harmonics YTM( s y denote the directional coefficients of a plane wave.
  • the definition of the spheri ⁇ cal harmonics YTM( S ) given in the above-mentioned M.A. Po- letti article is used.
  • the spherical harmonics are the orthonormal base functions of the Ambisonics representations and satisfy
  • N (N + l) 2 of Ambisonics coefficients.
  • the conjugated complex spherical harmonics can be replaced by the columns of the pseudo-inverse matrix , which is obtained from the L X 0 spherical harmonics matrix Y_, where the 0 coefficients of the spherical harmonics
  • YTM( ⁇ Q C ) are the row-elements of Y_, cf. section 3.2.2 in the above-mentioned Moreau/Daniel/Bertet article:
  • a complete HOA processing chain for spherical microphone ar- rays on a rigid (stiff, fixed) sphere includes the estima ⁇ tion of the pressure at the capsules, the computation of the HOA coefficients and the decoding to the loudspeaker
  • the description of the microphone array in the spherical harmonics representation enables the estimation of the average spectral power at the point of origin for a given decoder.
  • the power for the mode matching Ambisonics decoder and a simple beam forming decoder is evaluated.
  • the estimated average power at the sweet spot is used to design an equalisation filter.
  • the following section describes the decomposition of w(/c) into the reference weight w ref (/c), the spatial aliasing weight w alias (/c) and a noise weight w noise (/c).
  • the aliasing is caused by the sampling of the continuous sound field for a finite or ⁇ der N and the noise simulates the spatially uncorrelated signal parts introduced for each capsule.
  • the spatial alias ⁇ ing cannot be removed for a given microphone array.
  • the isotropic noise signal P no i se (Jl c >k) is added to simulate transducer noise, where 'isotropic' means that the noise signals of the capsules are spatially uncorrelated, which does not include the correlation in the temporal domain.
  • the pressure can be separated into the pressure P Ye f(J2 c , kR) computed for the maximal order N of the microphone array and the pressure from the remaining orders, cf. section 7, equa- tion (24) in the above-mentioned Rafaely "Analysis and de ⁇ sign " article.
  • the pressure from the remaining orders Paiias (Ji c , kR) is called the spatial aliasing pressure because the order of the microphone array is not sufficient to re ⁇ construct these signal components.
  • the total pressure recorded at the capsule c is defined by:
  • the Ambisonics coefficients dTM(k are obtained from the pres ⁇ sure at the capsules by the inversion of equation (11) given in equation (13a), cf. section 3.2.2, equation (26) of the above-mentioned Moreau/Daniel/Bertet article.
  • the Ambisonics coefficients dTM(k can be separated into the reference coefficients dTM re ⁇ (k) , the aliasing coefficients d alias (/c) and the noise coefficients dTM noise (k) using equations (13a) and (12a) as shown in equations (13b) and (13c) .
  • Equation (14) provides w(/c) from equations (1) and (13b), where L is the number of loudspeakers:
  • Equation (14b) shows that w(/c) can also be separated into the three weights w ref (/c), w alias (/c) and noise (/c). For simplicity, the positioning error given in section 7, equation (24) of the above-mentioned Rafaely "Analysis and design " arti ⁇ cle is not considered here.
  • the reference coefficients are the weights that a synthetically generated plane wave of order n would create.
  • the reference pres ⁇ sure P Ye f(J2 c ,kR) from equation (12b) is substituted in equation (14a), whereby the pressure signals ⁇ are ignored (i.e. set to zero) :
  • Equation (15a) can be simplified to the sum of the weights of a plane wave in the Ambisonics representation from equation (3) .
  • equation (15a) can be simplified to the sum of the weights of a plane wave in the Ambisonics representation from equation (3) .
  • the maximal Ambi- sonics order N supported by this array is four.
  • the mode matching processing as described in the above-mentioned M.A.
  • Poletti article is used to obtain the decoding coefficients DTM(J2i) for 25 uniformly distributed loudspeaker positions according to Jorg Fliege, Ulrike Maier, "A Two-Stage Approach for Computing Cubature Formulae for the Sphere", Technical report, 1996, labor Schlauer, Universitat Dortmund, Germany.
  • the node numbers are shown at http: //www. mathematik . uni-dortmund . de/lsx/research/projects/fliege/nodes /nodes, html .
  • the power of the reference weight w ref (/c) is constant over the entire frequency range.
  • the resulting noise weight noise (/c) shows high power at low frequencies and decreases at higher frequencies.
  • the noise signal or power is simulated by a normally distributed unbiased pseudo-random noise with a variance of 20dB (i.e. 20dB lower than the power of the plane wave) .
  • the aliasing noise a j ias (/c) can be ignored at low frequencies but increases with rising frequency, and above 10kHz exceeds the reference power.
  • the slope of the aliasing power curve depends on the plane wave direction. However, the average tendency is consistent for all directions.
  • the noise signal is compensated using the method de ⁇ scribed in the European application with internal reference PD110039, filed on the same day by the same applicant and having the same inventors.
  • the overall signal power is equalised under consideration of the alias ⁇ ing signal and the first processing step.
  • the mean square error between the refer ⁇ ence weight and the distorted reference weight is minimised for all incoming plane wave directions.
  • the weight from the aliasing signal w alias (/c) is ignored because w alias (/c) cannot be corrected after having been spatially band-limited by the order of the Ambisonics representation. This is equivalent to the time domain aliasing where the aliasing cannot be re ⁇ moved from the sampled and band-limited time signal.
  • the average power of the reconstructed weight is estimated for all plane wave directions.
  • a filter is described below that balances the power of the recon ⁇ structed weight to the power of the reference weight. That filter equalises the power only at the sweet spot. However, the aliasing error still disrupts the sound field represen ⁇ tation for high frequencies.
  • the spatial frequency limit of a microphone array is called spatial aliasing frequency.
  • fa alniaass £ 2TMR0.73 i (20) is computed from the distance of the capsules (cf. WO 03/ 061336 Al), which is approximately 5594Hz for the Eigenmike with a radius R equal to 4.2cm .
  • the parameters of transfer function F n (k) depend on the number of microphone capsules and on the signal-to-noise ratio for the wave number k .
  • the filter is independent of the Am- bisonics decoder, which means that it is valid for three- dimensional Ambisonics decoding and directional beam form ⁇ ing.
  • the SNR(k) can be obtained from the above-mentioned European application with internal reference PD110039.
  • the filter is a high-pass filter that limits the order of the Ambisonics representation for low frequencies.
  • the cut-off frequency of the filter decreases for a higher SNR(k).
  • the transfer functions F n (k) of the filter for an SNR(k) of 20dB are shown in Fig.
  • the average power of the optimised weight w'(/c) is obtained from its squared magnitude expectation value.
  • the noise weight ' noise (/c) is spatially uncorrelated to the weights w 'ref(k) and ' alias (/c) so that the noise power can be computed independently as shown in equation (23a) .
  • the power of the reference and aliasing weight are derived from equation (23b).
  • the combination of the equations (22), (15a) and (17) results in equation (23c) , where ' noise (/c) is ignored in equa ⁇ tion (22) .
  • the expansion of the squared magnitude simplifies equations (23c) and (23d) using equation (4).
  • the resulting power depends on the used decoding processing. However, for conventional three-dimensional Ambisonics de- coding it is assumed that all directions are covered by the loudspeaker arrangement. In this case the coefficients with an order greater than zero are eliminated by the sum of the decoding coefficients DTM(J2i) given in equation (23) . This means that the pressure at the point of origin is equivalent to the zero order signal so that the missing higher order coefficients at low frequencies do not reduce the power at the sweet spot.
  • the power is equivalent to the sum of the squared magnitudes of DTM(J2i), so that for one loudspeaker I the power increases with the order N.
  • Fig. 3 The average power components of w'(/c), obtained from the noise optimisation filter, are shown in Fig. 3 for conventional Ambisonics decoding.
  • Fig. 3b shows the reference + alias power
  • Fig. 3c shows the noise power
  • Fig. 3a the sum of both.
  • the noise power is reduced to -35dB up to a frequency of 1kHz. Above 1kHz the noise power increases linearly to -lOdB.
  • the total power is raised by lOdB above 10kHz, which is caused by the alias ⁇ ing power.
  • Fig. 4b shows the reference + alias power
  • Fig. 4c shows the noise power
  • Fig. 4a the sum of both.
  • the first increase is caused by the extenuation of the higher order coeffi- cients because 3kHz is approximately the cut-off frequency of F n (k) for the fourth order coefficients shown in Fig. 2e.
  • the second increase is caused by the spatial aliasing power as discussed for the Ambisonics decoding.
  • Equation (26a) The real-valued equalisation filter E Q(/C) is given in equation (26a) . It compensates the average power of w'(/c) to the reference power of w ref (/c) .
  • equations (23e) and (27) are used to show in equation (26b) that E Q(/C) is also a function of the SNR(k). +w' alias (/c))
  • Equation (28d) it is shown that the highly complex compu- tation of E ⁇ w' rei (k) + ' alias (/c)
  • Each element of these sums is a multiplication of the filter F n (/c), its conjugated complex value, the infinite sums over n' and ⁇ ' of the product of ATM,' n , and its conjugated complex value.
  • the results of these sums give the constant filter design coefficients for each combination of n and n" . These coefficients are computed once for a given array and can be stored in a look-up table for a time- variant signal-to-noise ratio adaptive filter design.
  • the reciprocal of the transfer function b n (kR) converts ATM(t) to the directional co ⁇ efficients d (t) , where it is assumed that the sampled sound field is created by a superposition of plane waves that were scattered on the surface of the sphere.
  • the coefficients dTM(t) are representing the plane wave decomposition of the sound field described in section 3, equation (14) of the above-mentioned Rafaely "Plane-wave decomposition " arti- cle, and this representation is basically used for the transmission of Ambisonics signals.
  • the optimisation transfer function F n (k) reduces the contribution of the higher order coefficients in order to remove the HOA coefficients that are covered by noise.
  • the power of the reconstructed signal is equalised by the filter EQ (/C) for a known or assumed decoder processing.
  • the second processing step results in a convolution of ATM(t) with the designed time domain filter.
  • the resulting optimised array responses for the conventional Ambisonics decod ⁇ ing are shown in Fig. 5, and the resulting optimised array responses for the beam forming decoder example are shown in Fig. 6.
  • the processing of the coefficients ATM(t) can be regarded as a linear filtering operation, where the transfer function of the filter is determined by F niarray (/c) . This can be performed in the frequency domain as well as in the time domain.
  • the FFT can be used for transforming the coefficients ATM(t) to the frequency domain for the successive multiplication by the transfer function f ⁇ array CO ⁇
  • the inverse FFT of the prod ⁇ uct results in the time domain coefficients d (t).
  • This transfer function processing is also known as the fast convolution using the overlap-add or overlap-save method.
  • the linear filter can be approximated by an FIR filter, whose coefficients can be computed from the transfer function ⁇ arra C ⁇ ) by transforming it to the time do- main with an inverse FFT, performing a circular shift and applying a tapering window to the resulting filter impulse response to smooth the corresponding transfer function.
  • the linear filtering process is then performed in the time do ⁇ main by a convolution of the time domain coefficients of the transfer function ⁇ arra C ⁇ ) an d the coefficients ATM(t) for each combination of n and m.
  • Fig. 7 The inventive adaptive block based Ambisonics processing is depicted in Fig. 7.
  • the time do- main pressure signals P(Jl c ,t) of the microphone capsule sig ⁇ nals are converted in step or stage 71 to the Ambisonics representation ATM(t) using equation (13a), whereby the division by the microphone transfer function b n (kR) is not car- ried out (thereby ATM(t is calculated instead of dTM(k)) , and is instead carried out in step/stage 72.
  • Step/stage 72 per ⁇ forms then the described linear filtering operation in the time domain or frequency domain in order to obtain the coef- ficients d (t), whereby the microphone array response is re ⁇ moved from ATM(t).
  • the second processing path is used for an automatic adaptive filter design of the transfer function F narray (_k) .
  • the step/stage 73 performs the estimation of the signal-to-noise ratio SNR(k) for a considered time period (i.e. block of samples) .
  • the estimation is performed in the frequency domain for a finite number of discrete wave num ⁇ bers k .
  • the regarded pressure signals ⁇ ( ⁇ ⁇ , t) have to be transformed to the frequency domain using for example an FFT .
  • the SNR(k) value is specified by the two power signals lP n oise(k)l 2 ancl ⁇ P 0 (k) ⁇ 2 .
  • P no i S e(k)l 2 of the noise signal is constant for a given array and represents the noise pro ⁇ cuted by the capsules.
  • the power ⁇ Po(k) ⁇ 2 of the plane wave is estimated from the pressure signals ⁇ ( ⁇ ⁇ , t). The estimation is further described in section SNR estimation in the above- mentioned European application with internal reference
  • the transfer function F narray (k) with n ⁇ N is designed in step/stage 74 in the fre ⁇ quency domain using equations (30), (26c), (21) and (10).
  • the filter design can use a Wiener filter and the inverse array response or inverse transfer function l/b n (kR) .
  • the filter implementation is then adapted to the corresponding linear filter processing in the time or frequency domain of step/stage 72. The results of the inventive processing are discussed in the following. Therefore, the equalisation filter EQ (/C) from equation (26c) is applied to the expectation value E ⁇ w'(k) ⁇ 2 ⁇ .
  • 2 ⁇ , the reference power £ ⁇ l w ref(Ol 2 ⁇ an d the resulting noise power for the examples of the conventional Ambisonics decoding from Fig. 3 and the beam forming from Fig. 4 are discussed.
  • the resulting power spectra for a conventional Ambisonics decoder are depicted in Fig. 8, and for the beam forming decoder in Fig. 9, wherein curves a) to c) show
  • the power of the reference and the optimised weight are identical so that the resulting weight has a balanced fre- quency spectrum.
  • the resulting signal-to- noise ratio at the sweet spot has increased for the conven ⁇ tional Ambisonics decoding and decreased for the beam form ⁇ ing decoding, compared to the given SNR(k) of 20db.
  • the signal-to-noise ratio is equal to the given SNR(k) for both decoders.
  • the SNR at high frequencies is greater with respect to that at low frequencies, while for the Ambisonics decoder the SNR at high frequencies is smaller with respect to that at low frequencies.
  • Example beam pattern is a nar- row beam pattern that has strong high order coefficients.
  • Decoding coefficients that produce beam pattern with wider beams can increase the SNR. These beams have strong coeffi ⁇ cients in the low orders. Better results can be achieved by using different decoding coefficients for several frequency bands in order to adapt to the limited order at low frequen ⁇ cies .
  • optimised beam forming Other methods for optimised beam forming exist that minimise the resulting SNR, wherein the decoding coefficients DTM(J2i) are obtained by a numerical optimisation for a specific steering direction.
  • the optimal modal beam forming presented in Y. Shefeng, S. Haohai, U.P. Svensson, M. Xiaochuan, J.M. Hovem, "Optimal Modal Beamforming for Spherical Microphone Arrays", IEEE Transactions on Audio, Speech, and language processing, vol.19, no.2, pages 361-371, February 2011, and the maximum directivity beam forming discussed in M. Agmon, B. Rafaely, J. Tabrikian, "Maximum Directivity Beamformer for Spherical-Aperture Microphones", 2009 IEEE Workshop on Applcations of Signal Processing to Audio and Acoustics
  • the example Ambisonics decoder uses mode matching process ⁇ ing, where each loudspeaker weight is computed from the de ⁇ coding coefficients used in the beam forming example.
  • the loudspeaker sig ⁇ nals have the same SNR as for the beam forming decoder example. However, on one hand the superposition of the loud ⁇ speaker signals at the point of origin results in an excel ⁇ lent SNR. On the other hand, the SNR becomes lower if the listening position moves out of the sweet spot.
  • the described optimisation is produc ⁇ ing a balanced frequency spectrum with an increased SNR at the point of origin for a conventional Ambisonics decoder, i.e. the inventive time-variant adaptive filter design is advantageous for Ambisonics recordings.
  • the inventive procesing can also be used for designing a time-invariant filter if the SNR of the recording can be assumed constant over the time.
  • the inventive procesing can balance the resulting frequency spectrum, with the drawback of a low SNR at low frequencies.
  • the SNR can be increased by selecting appropriate decoding coefficients that produce wider beams, or by adapting the beam width on the Ambisonics order of different frequency sub-bands.
  • the invention is applicable to all spherical microphone re ⁇ cordings in the spherical harmonics representation, where the reproduced spectral power at the point of origin is un- balanced due to aliasing or missing spherical harmonic coef ⁇ ficients .

Landscapes

  • Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • General Health & Medical Sciences (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Stereophonic System (AREA)
  • Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)
PCT/EP2012/071537 2011-11-11 2012-10-31 Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field Ceased WO2013068284A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
JP2014540396A JP6113739B2 (ja) 2011-11-11 2012-10-31 音場のアンビソニックス表現を生成するために使われる剛体球上の球状マイクロホン・アレイの信号を処理する方法および装置
US14/356,265 US9420372B2 (en) 2011-11-11 2012-10-31 Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field
CN201280066109.4A CN104041074B (zh) 2011-11-11 2012-10-31 处理用于产生声场的高保真度立体声响复制表示的刚性球上的球形麦克风阵列的信号的方法和装置
EP12788472.4A EP2777298B1 (en) 2011-11-11 2012-10-31 Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating a spherical harmonics representation or an ambisonics representation of the sound field
KR1020147015683A KR101957544B1 (ko) 2011-11-11 2012-10-31 사운드 필드의 앰비소닉스 표현을 생성하는데 사용되는 강체구 상의 구면 마이크로폰 어레이의 신호들을 프로세싱하는 방법 및 장치

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP11306472.9A EP2592846A1 (en) 2011-11-11 2011-11-11 Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an Ambisonics representation of the sound field
EP11306472.9 2011-11-11

Publications (1)

Publication Number Publication Date
WO2013068284A1 true WO2013068284A1 (en) 2013-05-16

Family

ID=47216219

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2012/071537 Ceased WO2013068284A1 (en) 2011-11-11 2012-10-31 Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field

Country Status (6)

Country Link
US (1) US9420372B2 (https=)
EP (2) EP2592846A1 (https=)
JP (1) JP6113739B2 (https=)
KR (1) KR101957544B1 (https=)
CN (1) CN104041074B (https=)
WO (1) WO2013068284A1 (https=)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104581604A (zh) * 2013-10-17 2015-04-29 奥迪康有限公司 再现声学声场的方法
CN107995582A (zh) * 2013-11-28 2018-05-04 杜比国际公司 使用奇异值分解进行hoa编码和解码的方法和装置
RU2685997C2 (ru) * 2014-05-16 2019-04-23 Квэлкомм Инкорпорейтед Кодирование векторов, разложенных из аудиосигналов на основе амбиофонии высшего порядка
US10499176B2 (en) 2013-05-29 2019-12-03 Qualcomm Incorporated Identifying codebooks to use when coding spatial components of a sound field
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2592845A1 (en) * 2011-11-11 2013-05-15 Thomson Licensing Method and Apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an Ambisonics representation of the sound field
US10021508B2 (en) 2011-11-11 2018-07-10 Dolby Laboratories Licensing Corporation Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
DK3025130T3 (da) * 2013-07-22 2020-04-20 Brueel & Kjaer Sound & Vibration Measurement As Akustisk bredbåndsholografi
US20150127354A1 (en) * 2013-10-03 2015-05-07 Qualcomm Incorporated Near field compensation for decomposed representations of a sound field
WO2015101915A2 (en) 2013-12-31 2015-07-09 Distran Gmbh Acoustic transducer array device
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US20150332682A1 (en) * 2014-05-16 2015-11-19 Qualcomm Incorporated Spatial relation coding for higher order ambisonic coefficients
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
EP2988527A1 (en) 2014-08-21 2016-02-24 Patents Factory Ltd. Sp. z o.o. System and method for detecting location of sound sources in a three-dimensional space
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
CN105072557B (zh) * 2015-08-11 2017-04-19 北京大学 一种三维环绕声重放系统的扬声器环境自适应校准方法
JP6606784B2 (ja) * 2015-09-29 2019-11-20 本田技研工業株式会社 音声処理装置および音声処理方法
US10206040B2 (en) 2015-10-30 2019-02-12 Essential Products, Inc. Microphone array for generating virtual sound field
RU2687882C1 (ru) 2016-03-15 2019-05-16 Фраунхофер-Гезеллшафт Цур Фёрдерунг Дер Ангевандтен Форшунг Е.В. Устройство, способ формирования характеристики звукового поля и машиночитаемый носитель информации
WO2018053050A1 (en) * 2016-09-13 2018-03-22 VisiSonics Corporation Audio signal processor and generator
EP3520437A1 (en) * 2016-09-29 2019-08-07 Dolby Laboratories Licensing Corporation Method, systems and apparatus for determining audio representation(s) of one or more audio sources
FR3060830A1 (fr) * 2016-12-21 2018-06-22 Orange Traitement en sous-bandes d'un contenu ambisonique reel pour un decodage perfectionne
WO2018157098A1 (en) * 2017-02-27 2018-08-30 Essential Products, Inc. Microphone array for generating virtual sound field
EP3625974B1 (en) * 2017-05-15 2020-12-23 Dolby Laboratories Licensing Corporation Methods, systems and apparatus for conversion of spatial audio format(s) to speaker signals
CN109275084B (zh) * 2018-09-12 2021-01-01 北京小米智能科技有限公司 麦克风阵列的测试方法、装置、系统、设备和存储介质
JP6969793B2 (ja) 2018-10-04 2021-11-24 株式会社ズーム アンビソニックスのためのa/bフォーマット変換装置、a/bフォーマット変換ソフトウェア、レコーダー、再生ソフトウェア
CN111193990B (zh) * 2020-01-06 2021-01-19 北京大学 一种抗高频空间混叠的3d音频系统及实现方法
US11489505B2 (en) * 2020-08-10 2022-11-01 Cirrus Logic, Inc. Methods and systems for equalization
CN115002640B (zh) * 2021-10-21 2025-02-07 杭州爱华智能科技有限公司 传声器的声场特性转换方法与电容型测试传声器系统
CN114928788B (zh) * 2022-04-10 2025-02-21 西北工业大学 一种基于稀疏平面波分解的声场重放空间解码方法

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030016835A1 (en) * 2001-07-18 2003-01-23 Elko Gary W. Adaptive close-talking differential microphone array
EP1737271A1 (en) * 2005-06-23 2006-12-27 AKG Acoustics GmbH Array microphone
US20100008517A1 (en) * 2002-01-11 2010-01-14 Mh Acoustics,Llc Audio system based on at least second-order eigenbeams
US20100142732A1 (en) * 2006-10-06 2010-06-10 Craven Peter G Microphone array

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7558393B2 (en) * 2003-03-18 2009-07-07 Miller Iii Robert E System and method for compatible 2D/3D (full sphere with height) surround sound reproduction
EP1931169A4 (en) * 2005-09-02 2009-12-16 Japan Adv Inst Science & Tech Post filter for microphone array
GB0906269D0 (en) * 2009-04-09 2009-05-20 Ntnu Technology Transfer As Optimal modal beamformer for sensor arrays
EP2592845A1 (en) 2011-11-11 2013-05-15 Thomson Licensing Method and Apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an Ambisonics representation of the sound field
US9197962B2 (en) * 2013-03-15 2015-11-24 Mh Acoustics Llc Polyhedral audio system based on at least second-order eigenbeams

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030016835A1 (en) * 2001-07-18 2003-01-23 Elko Gary W. Adaptive close-talking differential microphone array
US20100008517A1 (en) * 2002-01-11 2010-01-14 Mh Acoustics,Llc Audio system based on at least second-order eigenbeams
EP1737271A1 (en) * 2005-06-23 2006-12-27 AKG Acoustics GmbH Array microphone
US20100142732A1 (en) * 2006-10-06 2010-06-10 Craven Peter G Microphone array

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
BOAZ RAFAELY: "Plane-wave decomposition of the sound field on a sphere by spherical convolution", J. ACOUSTICAL SOCIETY OF AMERICA, vol. 116, no. 4, 2004, pages 2149 - 2157, XP012072566, DOI: doi:10.1121/1.1792643
F. ZOTTER: "Sampling Strategies for Acoustic Holography/ Holophony on the Sphere", PROCEEDINGS OF THE NAG-DAGA, 23 March 2009 (2009-03-23)
JOHANN-MARKUS BATKE; FLORIAN KEILER: "Using VBAP-Derived Panning Functions for 3D Ambisonics Decoding", PROC. OF THE 2ND INTERNATIONAL SYMPOSIUM ON AMBISONICS AND SPHERICAL ACOUSTICS, 6 May 2010 (2010-05-06)
M. AGMON; B. RAFAELY; J. TABRIKIAN: "Maximum Directivity Beamformer for Spherical-Aperture Microphones", 2009 IEEE WORKSHOP ON APPLCATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS WASPAA '09, PROC. IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 18 October 2009 (2009-10-18), pages 153 - 156, XP031575097
M.A. POLETTI: "Three-Dimensional Surround Sound Systems Based on Spherical Harmonics", JOURNAL AUDIO ENGINEERING SOCIETY, vol. 53, no. LL, 2005, pages 1004 - 1025
MORAG AGMON; BOAZ RAFAELY: "Beamforming for a Spherical-Aperture Microphone", IEEEI, 2008, pages 227 - 230, XP031399568
MOREAU, DANIEL, BERTET: "3D Sound Field Recording with Higher Order Ambisonics- Objective Measurements and Validation of Spherical Microphone", AUDIO ENGINEERING SOCIETY, 6857, 20 May 2006 (2006-05-20) - 23 May 2006 (2006-05-23), XP002672712 *
Y. SHEFENG; S. HAOHAI; U.P. SVENSSON; M. XIAOCHUAN; J.M. HOVEM: "Optimal Modal Beamforming for Spherical Microphone Arrays", IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, vol. 19, no. 2, February 2011 (2011-02-01), pages 361 - 371, XP011307131, DOI: doi:10.1109/TASL.2010.2047815

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10499176B2 (en) 2013-05-29 2019-12-03 Qualcomm Incorporated Identifying codebooks to use when coding spatial components of a sound field
US11146903B2 (en) 2013-05-29 2021-10-12 Qualcomm Incorporated Compression of decomposed representations of a sound field
US11962990B2 (en) 2013-05-29 2024-04-16 Qualcomm Incorporated Reordering of foreground audio objects in the ambisonics domain
CN104581604A (zh) * 2013-10-17 2015-04-29 奥迪康有限公司 再现声学声场的方法
CN104581604B (zh) * 2013-10-17 2019-07-05 奥迪康有限公司 再现声学声场的方法
CN107995582A (zh) * 2013-11-28 2018-05-04 杜比国际公司 使用奇异值分解进行hoa编码和解码的方法和装置
RU2685997C2 (ru) * 2014-05-16 2019-04-23 Квэлкомм Инкорпорейтед Кодирование векторов, разложенных из аудиосигналов на основе амбиофонии высшего порядка
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals

Also Published As

Publication number Publication date
EP2592846A1 (en) 2013-05-15
KR20140089601A (ko) 2014-07-15
EP2777298A1 (en) 2014-09-17
US9420372B2 (en) 2016-08-16
US20140307894A1 (en) 2014-10-16
CN104041074B (zh) 2017-04-12
JP2014535232A (ja) 2014-12-25
CN104041074A (zh) 2014-09-10
KR101957544B1 (ko) 2019-03-12
JP6113739B2 (ja) 2017-04-12
EP2777298B1 (en) 2016-03-16

Similar Documents

Publication Publication Date Title
WO2013068284A1 (en) Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field
US9503818B2 (en) Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field
US10382849B2 (en) Spatial audio processing apparatus
KR20140138907A (ko) 통합 또는 하이브리드 사운드-필드 제어 전략을 적용하는 방법
US8005244B2 (en) Apparatus for implementing 3-dimensional virtual sound and method thereof
Sakamoto et al. Sound-space recording and binaural presentation system based on a 252-channel microphone array
MX2013013058A (es) Aparato y metodo para generar una señal de salida que emplea un descomponedor.
US10021508B2 (en) Method and apparatus for processing signals of a spherical microphone array on a rigid sphere used for generating an ambisonics representation of the sound field
Wang et al. Two-stage enhancement of noisy and reverberant microphone array speech for automatic speech recognition systems trained with only clean speech
JP6031364B2 (ja) 収音装置及び再生装置
Zotter et al. Higher-order ambisonic microphones and the wave equation (linear, lossless)
JP2014143570A (ja) 収音装置及び再生装置
Kashiwazaki et al. Attempt to improve the total performance of sound field reproduction system: Integration of wave-based methods and simple reproduction method
Suzuki et al. Speech Enhancement Network with Windowed Cross Attention Using Noise-Reference Microphone
Pedamallu Microphone Array Wiener Beamforming with emphasis on Reverberation
Garcia-Martinez et al. Integrating High Order Ambisonics and Deep Learning for Advanced Instrument Separation in Spatial Audio Applications
Harma Coding principles for virtual acoustic openings
KR20020080730A (ko) 머리 모델링을 이용한 입체음향 합성방법
Helwani et al. Calibration of microphone arrays with arbitrary geometries
Nilsson Suppression of reverberation in hearing aids
CHISAKI et al. Concurrent speech segregation using a microphone array for computer users
KR20060091966A (ko) 머리 모델링을 이용한 입체음향 합성방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12788472

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 14356265

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2012788472

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2014540396

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20147015683

Country of ref document: KR

Kind code of ref document: A