WO2011010292A1 - Audio beamforming - Google Patents

Audio beamforming Download PDF

Info

Publication number
WO2011010292A1
WO2011010292A1 PCT/IB2010/053335 IB2010053335W WO2011010292A1 WO 2011010292 A1 WO2011010292 A1 WO 2011010292A1 IB 2010053335 W IB2010053335 W IB 2010053335W WO 2011010292 A1 WO2011010292 A1 WO 2011010292A1
Authority
WO
WIPO (PCT)
Prior art keywords
angle
output signal
circuit
combining
estimate
Prior art date
Application number
PCT/IB2010/053335
Other languages
French (fr)
Inventor
Rene Martinus Maria Derkx
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V. filed Critical Koninklijke Philips Electronics N.V.
Priority to CN201080033006.9A priority Critical patent/CN102474680B/en
Priority to JP2012521148A priority patent/JP5777616B2/en
Priority to RU2012106592/28A priority patent/RU2550300C2/en
Priority to EP10745004.1A priority patent/EP2457384B1/en
Priority to US13/384,720 priority patent/US9084037B2/en
Publication of WO2011010292A1 publication Critical patent/WO2011010292A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/40Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
    • H04R2201/4012D or 3D arrays of transducers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00Signal processing covered by H04R, not provided for in its groups
    • H04R2430/20Processing of the output signals of the acoustic transducers of an array for obtaining a desired directivity characteristic
    • H04R2430/25Array processing for suppression of unwanted side-lobes in directivity characteristics, e.g. a blocking matrix

Definitions

  • the invention relates to audio beamforming and in particular, but not exclusively, to audio beamforming using microphone arrays substantially smaller than the wavelength of the audio signals being beamformed.
  • Advanced processing of audio signals has become increasingly important in many areas including e.g. telecommunication, content distribution etc.
  • complex processing of inputs from a plurality of microphones has been used to provide a configurable directional sensitivity for a microphone array comprising the microphones.
  • the processing of signals from a microphone array can generate an audio beam with a direction that can be changed simply by changing the characteristics of the combination of the individual microphone signals.
  • beam form algorithms seek to attenuate interferers while providing a high gain for a desired sound source.
  • a beamforming algorithm can be controlled to provide a strong attenuation (preferably a null) in the direction of a signal received from a main interferer.
  • the microphone array is relatively small.
  • many beamforming algorithms such as additive delay-and-sum beamforming algorithms, are not able to provide sufficient directivity as the beamwidth deteriorates substantially for such wavelengths.
  • superdirective beamforming techniques are based on filters with asymmetrical filter coefficients and the approach essentially corresponds to subtraction of signals or determining spatial derivatives of the sound pressure field.
  • an improved approach for audio beamforming would be advantageous and in particular an approach allowing improved adaptation to current conditions and audio environment, increased flexibility, facilitated implementation, improved performance for different operating scenarios and/or improved performance would be advantageous.
  • the Invention seeks to preferably mitigate, alleviate or eliminate one or more of the above mentioned disadvantages singly or in any combination.
  • an audio beamforming apparatus comprising: a receiving circuit for receiving signals from an at least two-dimensional microphone array comprising at least three microphones; a reference circuit for generating at least three reference beams from the microphone signals; combining circuit for generating an output signal corresponding to a desired beam pattern by combining the reference beams in response to a first direction of a desired sound source and a direction estimate for an interfering sound source; an estimation circuit for generating the direction estimate by: determining a first angle corresponding to a local minimum for a power measure of the output signal in a first angle interval, determining a second angle
  • the combining circuit is arranged to determine combination parameters for the combining of the reference beams to provide a notch in an angle corresponding to the direction estimate and a minimization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and an average gain.
  • the invention may allow improved performance.
  • an improved and/or facilitated adaptation to a current audio environment can be achieved.
  • the invention may allow a beamforming approach which provides high performance for both directional point interference cancellation and for diffuse noise attenuation.
  • the approach is particularly suitable for, and may provide particularly advantageous performance for, systems wherein the wavelength of the audio signals may be substantially larger than the size of the microphone array.
  • the invention may allow low complexity implementation and/or operation.
  • the approach may be suitable for providing improved directivity and may in particular be suitable for scenarios wherein the size of the microphone array is much smaller than a wavelength of interest.
  • the approach may allow a null to be directed towards a single point interference while substantially reducing diffuse noise.
  • the approach may in many scenarios allow a reduction of a single point interference corresponding to or better than many prior art interference reduction techniques, while at the same time providing improved diffuse noise.
  • the approach may in many scenarios allow a low complexity yet highly efficient and advantageous beam steering based on low complexity parallel local minima extraction.
  • the approach may ensure that at least one of the identified local minima is also a global minimum and thus may allow an efficient estimation of the angle of interference.
  • the reference beams may be non-adaptive and may be independent of the captured signals and/or the audio conditions.
  • the reference beams may be constant and may be generated by a constant/ non-adaptive combination of the signals from the at least three microphones.
  • the reference beams may specifically be Eigenbeams or orthogonal beams.
  • the first angle interval and the second angle interval may be disjoint intervals and may be adjacent intervals.
  • the first and second angle intervals may together cover the entire 360° interval.
  • the interfering sound source may be an assumed interfering sound source.
  • a direction estimate for a sound source may be generated independently of whether the sound source is present or not. Thus, even if no interfering point source is detected, the estimation circuit may generate the direction estimate from the microphone signals under the assumption that an interfering sound source is present.
  • the estimation circuit is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle separating the first angle interval and the second angle interval.
  • the angle may be any angle between the first angle interval and the second angle interval including the end points of one or both of the angle intervals.
  • the first angle interval comprises angles from 0 to ⁇ and the second angle interval comprises angles from ⁇ to 2 ⁇ .
  • This may provide particularly advantageous performance and may in particular allow adaptation for all possible directions of the interfering sound source.
  • the estimation circuit is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle of ⁇ .
  • This may provide a particularly efficient and low complexity determination of the direction estimate.
  • the combining circuit comprises a sidelobe canceller.
  • the sidelobe canceller is arranged to generate the output signal as a weighted combination of at least a primary signal, a first noise reference signal and a second noise reference signal.
  • the primary signal may correspond to a beam adapted in the direction of the desired sound source and each of the reference signals may correspond to beams adapted to cancel/ reduce noise.
  • the noise reference signals may specifically have notches in the direction of the desired sound source.
  • the combining circuit is arranged to calculate weights for the first and second noise reference signals in response to the direction estimate and a minimization of the directivity cost measure.
  • the weights may be determined as a function of the direction estimate wherein the function is selected to minimize the directivity cost measure.
  • the estimation circuit is arranged to determine at least one of the first and second angles by a gradient search applied to a sidelobe canceller corresponding to the side lobe canceller of the combining circuit and having an angle input variable.
  • a gradient search may provide a highly efficient approach for identifying potential minima that may optimize the beamforming operation.
  • An efficient and low complexity adaptation of the beamforming may be achieved which can reduce both diffuse noise and reduce/cancel a single point interference.
  • both the first and single angle are determined by a gradient search.
  • the gradient search may be performed using a sidelobe canceller operation which is identical to the sidelobe canceller operation used to generate the output signal but with a value of the angle input variable that may be different than the phase value (the direction estimate) used to generate the output signal (thus which can be varied
  • a gradient search may be applied in parallel in the two angle intervals using parallel sidelobe canceller operations with independent angle input variables.
  • the output signal of the combining circuit may be selected as the signal of the parallel sidelobe canceller corresponding to the selected angle of the first and second angles.
  • a sidelobe canceller corresponding to the sidelobe canceller of the combining circuit may be used to determine a gradient of a power measure of the output signal for a given angle (specifically ⁇ ) and the selection between the first and second angle may be in response to the gradient.
  • an update value for the angle input variable is determined as a function of an output signal of the sidelobe canceller for a current phase value of the angle input variable, and a first and second noise reference signal of the sidelobe canceller for the current phase value.
  • This may provide particularly advantageous performance and/or facilitated implementation and/or operation.
  • the first and second noise reference signals are weighted as a function of the current phase value. This may provide particularly advantageous performance and/or facilitated implementation or operation.
  • the estimation circuit is arranged to determine a power estimate for at least one of the first and second noise reference signals and to perform a normalization of the update value as a function of the power estimate.
  • This may provide particularly advantageous performance and/or facilitated implementation and/or operation.
  • the at least two- dimensional microphone array comprises at least four microphones and the apparatus comprises a circuit for combining signals from at least two of the at least four microphones prior to generating the reference beams.
  • This may provide particularly advantageous performance and/or facilitated implementation and/or operation. In particular, it may provide improved noise performance in many scenarios.
  • the apparatus of further comprises the at least two-dimensional microphone array, the at least two- dimensional microphone array comprising directional microphones having a maximum response in a direction outwardly of a perimeter of the at least two-dimensional microphone array.
  • This may provide particularly advantageous performance and/or facilitated implementation and/or operation.
  • a method of audio beamforming comprising: receiving signals from an at least two-dimensional microphone array comprising at least three microphones; generating at least three reference beams from the microphone signals; generating an output signal corresponding to a desired beam pattern by combining the reference beams in response to a first direction of a desired sound source and a direction estimate for an interfering sound source; generating the direction estimate by: determining a first angle corresponding to a local minimum for a power measure of the output signal in a first angle interval, determining a second angle corresponding to a local minimum for a power measure of the output signal in a second angle interval, and
  • the combining of the reference beams comprises determining combination parameters for the combining of the reference beams to provide a notch in an angle corresponding to the direction estimate and a minimization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and an energy averaged gain.
  • Fig. 1 illustrates an example of a system for capturing audio with an adaptable directional characteristic in accordance with some embodiments of the invention
  • Fig. 2 illustrates an example of a microphone configuration for a microphone array
  • Fig. 3 illustrates an example of Eigenbeams generated by the system of Fig. 1;
  • Fig. 4 illustrates an example of a sidelobe canceller used in the system of Fig. 1;
  • Fig. 5 illustrates an example of a cost function for adapting the system of Fig. 1;
  • Fig. 6 illustrates an example of local minima for the cost function of Fig. 5;
  • Fig. 7 illustrates an example of local maxima for the cost function of Fig. 5
  • Fig. 8 illustrates an example of a method for capturing audio with an adaptable directional characteristic in accordance with some embodiments of the invention.
  • Fig. 1 illustrates an example of a system for capturing audio with an adaptable directional characteristic.
  • the system processes signals from a plurality of microphones to generate a suitable desired beam pattern.
  • the processing is specifically adapted such that the generated output signal has substantially improved noise and interference characteristics.
  • the system provides for a joint improvement in both single point interference and diffuse noise performance.
  • the system is furthermore suitable for use in scenarios wherein the wavelength of the signals is substantially longer than the dimensions of the microphone array, i.e. than the distances between the microphones.
  • the system processes the received microphone signals to generate a set of constant non-adaptable reference beams. These reference beams are then adaptively combined to generate a desired beam pattern.
  • the combination is adapted such that the resulting beam form is adapted to cancel or substantially attenuate an assumed single point interference source while at the same time minimizing or reducing the impact of diffuse noise.
  • the system of Fig. 1 specifically includes an adaptive null-steering scheme with multiple gradient-estimates for adjusting the directivity pattern in such a way that this effective rejection of noise and interference can be achieved automatically.
  • the system of Fig. 1 comprises a microphone array 101 which is a two- dimensional microphone array.
  • the microphone array 101 comprises at least three microphones which are not arranged in a single one dimensional line. In most embodiments, the shortest distance from one microphone to a line going through two other microphones is at least a fifth of the distance between these two microphones.
  • the microphone array 101 comprises three microphones which are spaced uniformly on a circle as illustrated in Fig. 2.
  • a circular array of at least three (omni- or uni-directional) sensors in a planar geometry is used. It will be appreciated that in other embodiments, other arrangements of the microphones may be used. It will also be appreciated that for
  • the microphone array may be a three dimensional microphone array.
  • the following description will focus on a three microphone equidistant circular array arranged in the azimuth plane.
  • the microphone array 101 is coupled to a receiving circuit 103 which receives the microphone signals.
  • the receiving circuit 103 is arranged to amplify, filter and digitize the microphone signals as is well known to the skilled person.
  • the receiving circuit 103 is coupled to a reference processor 105 which is arranged to generate at least three reference beams from the microphone signals.
  • the reference beams are constant beams that are not adapted but are generated by a fixed combination of the digitized microphone signals from the receiving circuit 103.
  • three orthogonal Eigenbeams are generated by the reference processor 105.
  • the three microphones of the microphone array are directional microphones and are specifically uni-directional cardioid microphones which are arranged such that the main gain is pointing outwardly from the perimeter formed by joining the positions of the microphones (and thus outwardly of the circle of the circular array in the specific example).
  • the use of uni-directional cardioid microphones provides an advantage in that the sensitivity to sensor noise and sensor-mismatches is greatly reduced.
  • other microphone types may be used, such as omni- directional microphones.
  • ⁇ and ⁇ are the standard spherical coordinate angles: elevation and azimuth, c is the speed of sound and X i and y l are the x and y coordinates of the z'th microphone.
  • the three orthogonal Eigenbeams can be determined from:
  • Eigenbeams are frequency invariant and ideally equal to:
  • the zero'th-order Eigenbeam Em represents the monopole response corresponding to a sphere whereas the other Eigenbeams represent first order Eigenbeams corresponding to double spheres as illustrated in Fig. 3.
  • the two first order Eigenbeams are orthogonal dipoles.
  • the resulting signals from each of the three Eigenbeams are fed to a beamform circuit 107 which proceeds to adaptively combine these signals to provide a desired beam pattern.
  • a dipole can be steered to any angle ⁇ s .
  • a weighted summation of the orthogonal diagonals can be generated:
  • ⁇ s represents the desired angle for the resulting dipole.
  • the steered and scaled superdirectional microphone response can then be constructed by combining the steered dipole with the monopole, e.g. as:
  • ⁇ ⁇ 1 is a parameter for controlling the directional pattern of the first-order response and S is an arbitrary scaling factor (that can also have negative values).
  • the beamform circuit 107 can generate a suitable beam pattern by a suitable combination of the reference Eigenbeams.
  • the direction of the desired speaker is assumed to be known by the beamform circuit 107. It will be appreciated that any suitable way of determining a desired direction may be used without detracting from the invention. For example, a fixed direction may be used or e.g. a tracking algorithm for a desired speaker or sound source may be used. It will be appreciated that many different algorithms for determining a desired sound source direction will be known to the skilled person.
  • the beamform circuit 107 is furthermore arranged to adapt the beam such that the sensitivity to diffuse noise is minimized and a notch is generated in an estimated direction of an assumed interfering point source.
  • the system of Fig. 1 is specifically arranged to adapt the combination of the reference Eigenbeams such that the nominal gain is provided in the desired direction, a notch is generated in the direction estimated to correspond to a point source interference and with a minimization of the diffuse noise under these constraints. This is achieved by a highly efficient adaptation algorithm which will be described in the following.
  • the beamform circuit 107 is specifically coupled to an estimation circuit 109 which determines an estimate for the direction to an assumed point source interference. Based on the estimated direction, the beamform circuit 107 generates combination parameters for the combination of the Eigenbeams such that a notch (typically a null) is generated in the estimated direction.
  • a notch typically a null
  • the combination of three Eigenbeams provides sufficient degrees of freedom to allow a range of solutions to the constraint of providing a nominal gain in a desired direction and a notch in an interference direction. In the system, this additional degree of freedom is used to improve the diffuse noise performance. This is specifically achieved by the combination parameters being selected to minimize a directivity cost measure where the directivity cost measure is indicative of a ratio between a power/energy gain in the first direction and an average power/energy gain.
  • the directivity cost measure may be indicative of the gain in the desired direction relative to an average gain of the resulting beam where the averaging is over all angles in the azimuth plane (i.e. from 0-2 ⁇ ) or from all directions in the three dimensions.
  • the directivity cost measure is a function which indicates the attenuation of homogenous spatially diffuse noise (i.e. the same noise level in all direction) provided by the beam pattern.
  • the estimation circuit 109 is specifically arranged to determine the estimated angle of an interference point by searching for local minima of a power measure for the output signal. Thus, the estimation circuit 109 seeks to minimize the power of the output signal as this will correspond to the lowest noise/interference. In some embodiments, the estimation may only be performed when the desired sound source is inactive (e.g. when a desired speaker is not speaking) but it will be appreciated that this is not necessary for the minimization of the power of the output signal to be an indication of optimal
  • noise/interference operation specifically the presence of the desired signal may introduce an offset to the power measure but will not change the position of the minimum.
  • the estimation circuit 109 determines at least two local minima by searching in at least two angle intervals.
  • the two angle intervals are typically disjoint, although in some embodiments some overlap may occur.
  • the local minima are determined in the different angle intervals by a parallel processing based on different angles.
  • the estimation circuit 109 may copy the operation of the beamform circuit 107 and evaluate the resulting output signal for different angles in the different angle intervals. The estimation circuit 109 may then select one of the angles that have been found to correspond to a local minima for the output signals and the selected angle is then used as the estimate for the assumed single point interference source.
  • the selected angle is then fed to the beamform circuit 107 which proceeds to perform the combination such that a nominal gain is provided in the direction of the desired source and a notch is provided in the estimated direction of the main single point interference. Furthermore, the combination uses weights that are selected to further minimize the diffuse noise. This constraint is imposed by the weights being selected to minimize a directivity cost measure.
  • the estimation operation and adaptation is independent of the actual noise and interference conditions and specifically is independent of whether a significant single point interferer or diffuse noise is present or not.
  • the approach results in very efficient performance across a wide variety of scenarios including scenarios with a dominant single point interference and no diffuse noise as well as scenarios with no single point interference but substantial diffuse noise.
  • the approach and underlying assumptions result in an operation that not only adapts to the specific characteristics of the noise and single point interference characteristics but also adapts to the type of noise/interference scenario that is experienced. This also reduces complexity and facilitates operation as there is no need to adapt the algorithm to the type of audio environment being experienced. This also provides increased flexibility and a wider application of the approach.
  • the beamform circuit 107 implements a side lobe canceller and the local minima are determined using a gradient search within each angle interval. Once the direction to the assumed single point interference has been estimated, combination parameters in terms of the weights applied to the noise reference signals are determined under the constraint that the directivity cost measure is minimized.
  • Fig. 4 illustrates an example of a generalized sidelobe canceller used in the system of Fig. 1.
  • the two dipole reference beams are first combined to generate two dipoles which are angled in the desired directions.
  • the resulting dipoles are then combined with the monopole to generate a primary signal which corresponds to a beam directed towards the desired audio source.
  • the primary response may be given by
  • the primary signal thus corresponds to the desired audio signal but also comprises signals from undesired directions.
  • the impact of these sidelobes is reduced by generation of noise reference signals which are weighted and subtracted from the primary signal to generate the output signal.
  • the sidelobe canceller generates the noise reference signals given by
  • B is a blocking matrix given by:
  • the two noise reference signals are then weighted by weights W 1 and W 2 before being subtracted from the primary signal to provide the output signal.
  • W 1 and W 2 weights W 1 and W 2 before being subtracted from the primary signal to provide the output signal.
  • the beamform circuit 107 is arranged to generate a nominal gain, in the following a unity gain, in a desired angle ⁇ s and a notch, specifically a zero, in the direction ⁇ n of an assumed single point interference determined by the estimation circuit 109.
  • a unity gain in the direction of ⁇ s the weights required to steer a zero towards the angle ⁇ n can be calculated by solving the equation:
  • this degree of freedom is used to optimize diffuse noise performance.
  • the noise reference weights are selected such that a directivity cost measure is minimized.
  • a suitable directivity cost measure is given by:
  • the directivity cost measure represents a ratio between the gain in the desired direction and the overall (power) gain averaged over the entire sphere. It will be appreciated that in other embodiments, the gain averaging may e.g. only be in a two- dimensional plane such as the azimuth plane. For the response given by
  • Wi and W 2 can be calculated such that a unity gain is provided in the desired direction, a zero is formed in the direction of an assumed interferer and the diffuse noise attenuation is maximized under these constraints.
  • the estimation circuit 109 has determined a suitable angle estimate for the assumed point source interferer, the derived equations can be used to calculate suitable weights that will also minimize the directivity cost measure and thus optimize the diffuse noise performance.
  • w 2 can then be derived from:
  • the estimation circuit 109 proceeds to determine the direction estimate by minimizing a power measure for the output signal in different angle intervals.
  • the estimation circuit 109 seeks to minimize the cost function given by: where denotes the expected value.
  • Fig. 5 illustrates some examples of this cost function for a scenario wherein there is a single point interferer at the direction of ⁇ equal to 1, 2 and 3 radians respectively (i.e. the angle difference ⁇ between the desired direction and the direction between an actual interferer is 1, 2 and 3 radians respectively).
  • the cost function is shown as a function of the estimated direction, i.e. as a function of the steering of the null performed by the weights of the reference signals.
  • Fig. 6 illustrate the cost function in the presence of noise which either may be spherical (coming from all directions) or cylindrical (coming from all directions in a two-dimensional plane). The situation for spherical noise is shown by a full line and the situation for cylindrical noise is shown by the dashed line.
  • the correct minimum is always the only local minimum in the phase interval from either 0 to ⁇ or from ⁇ to 2 ⁇ .
  • the only local minimum in the interval of [0; ⁇ ] is the correct value.
  • the only local minimum in the interval of [ ⁇ ;2 ⁇ ] is the correct value.
  • the estimation circuit 109 is arranged to determine a local minimum in the angle interval of [0; ⁇ ] and a local minimum in the angle interval of [ ⁇ ;2 ⁇ ]. Thus, the estimation circuit 109 determines two angles for which the cost function corresponding to the power of the output signal is minimized. This approach ensures that one of the determined local minima will correspond to the correct estimated angle.
  • the estimation circuit 109 then proceeds to select one of the two estimated values as the estimated angle that is used to control the beamforming by the beamform circuit 107.
  • one of the local minima is selected and used to calculate the weights for the noise reference signals using the equations that also optimize diffuse noise performance.
  • a simple beamforming may be applied to the microphone signals such that a beam is formed in each of the two directions in order to measure the interference level in those directions. The direction having the highest level is then selected as it corresponds to the most dominant interference.
  • the selection of the correct local minima is based on the gradient of the cost function at a specific angle which separates the two angle intervals (i.e. it is inbetween the two angle intervals and may specifically be an endpoint of one or both of the intervals).
  • all the local minima of the function may be determined and separated into the two angle intervals. Indeed, in such an embodiment, the detection of two local minima in one interval may automatically lead to the selection of the other minimum (i.e. the one in the other angle interval).
  • the correct local minimum will be the only local minimum in the angle interval. It will also be appreciated that this leads to the conclusion that it is not necessary to identify more than one minimum in each angle interval as any non-identified local minima will inherently not be the correct minimum as it is in an angle interval with more than one minimum.
  • the determination of the local minima is performed by performing a gradient search in each angle interval.
  • the estimation circuit 109 performs a sidelobe cancelling operation corresponding to that of the beamform circuit 107 while using an input angle value that is constantly updated and biased in the direction that will reduce the cost function. This approach will result in the angle variable ending in a local minimum.
  • a steepest descent update equation for ⁇ can be derived by stepping in the direction opposite to the surface of the cost function with respect to ⁇ :
  • the update value for the angle input variable of the gradient search is a function of an output signal of the sidelobe canceller and of the first and second noise reference signals.
  • the update value is dependent on the power of the noise references.
  • the estimation circuit 109 may determine a power estimate for one or both of the noise reference signals and normalize the update value accordingly.
  • is a small value to prevent zero division and is the power estimate of the z'th noise reference signal. This can specifically be calculated by a recursive averaging: where ⁇ is a suitable design parameter.
  • the estimation circuit 109 operates a sidelobe canceller applied to the same signals as the sidelobe canceller of the beamform circuit 107.
  • the sidelobe cancellers are operated based on an input angle variable which corresponds to a current estimate of the angle to the assumed point source interferer.
  • the input angle variable is continuously updated using the gradient search approach such that it will converge on the local minimum in the angle interval.
  • the estimation circuit 109 selects between the current values of the input angle variables and uses this result as the estimated angle for the assumed point source interferer. The selection is based on the gradient of the cost function for an input variable of ⁇ .
  • the estimation circuit 109 may specifically determine this by operating a further sidelobe canceller process on the input signals but with a fixed angle value of ⁇ . Specifically, the estimation circuit 109 may continuously evaluate the update value:
  • the derived values may be averaged over time and the sign of the averaged value (i.e. the gradient of the cost function at ⁇ ) is then used to select which of the angles determined by the gradient searches is used.
  • the beamform circuit 107 will not repeat a sidelobe canceller operation for the estimated angle but will directly use the output signal calculated for the selected angle when performing the estimation.
  • the gradient search is arranged to re-initialize the gradient search if the value of the angle input variable moves out of the corresponding angle interval.
  • a re-initialization of the gradient search may be performed if the two gradient searches reach a scenario wherein the both have angle values in the same angle interval. For example, if during the gradient search in the [0; ⁇ ] interval, the updated angle value moves into the [ ⁇ ;2 ⁇ ] interval such that both gradient searches have current values within this interval, the gradient search is re-initialized.
  • the re-initialization is specifically performed by resetting the value of the input angle variable of one of the two gradient searches to an initial value.
  • the initial value may for example be a fixed value such as the midpoint in the interval (i.e. ⁇ /2 and 3 ⁇ /2).
  • step 801 the parameter values are initialized. Step 801 is followed by step 803 wherein it is ensured that is smaller
  • Step 803 is followed by step 805 wherein it is determined if the two gradient searches have resulted in angle values, in the same angle interval. If so, the
  • Step 805 is followed by step 807 wherein the weights for the noise reference signals, the resulting output signal and the cost function gradients are calculated.
  • Step 807 is followed by step 809 wherein the new values for the angle input variables, Cp 1 [A:] , Cp 2 [A:] , of the gradient searches are calculated. Furthermore, the filtered cost function gradient at ⁇ is calculated.
  • Step 809 is followed by step 811 wherein the appropriate angle value is selected based on the filtered cost function gradient at ⁇ .
  • Step 811 is followed by step 813 wherein the power estimates for the noise reference signals used in the update value determination are updated.
  • step 813 the method returns to step 803 to process the next sample.
  • a pseudo-code of an algorithm corresponding to Fig. 1 may be represented as:
  • the number of microphones in the microphone array 101 corresponded to the number of reference beams (i.e. three).
  • the microphone array may comprise more microphones than reference beams.
  • the microphone array 101 may comprise at least four microphones.
  • the system may still only generate three reference beams and may specifically be arranged to combine signals from at least two microphones prior to generating the reference beams.
  • the reference processor 105 may still only receive three input signals and generate three reference beams from these.
  • at least one of these input signals may be generated by combining (and specifically averaging or adding (e.g. by a weighted summation)) the signals from at least two microphones.
  • Such an approach may provide improved noise performance in many scenarios as the level of uncorrelated noise may be averaged.
  • using more microphones on a particular area has the advantage that spatial aliasing will occur at a higher frequency.
  • the invention can be implemented in any suitable form including hardware, software, firmware or any combination of these.
  • the invention may optionally be
  • the functionality may be implemented in a single unit, in a plurality of units or as part of other functional units.
  • the invention may be implemented in a single unit or may be physically and functionally distributed between different units, circuits and processors.

Landscapes

  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)

Abstract

An audio beamforming apparatus comprises a receiving circuit (103) which receives signals from an at least two-dimensional microphone array (101). A reference circuit (105) generates reference beams and a combining circuit (107) generates an output signal corresponding to a desired beam pattern by combining the reference beams. An estimation circuit (109) generates a direction estimate by determining angles corresponding to local minima for a power measure of the output signal in at least a first and respectively second angle interval. The direction estimate is generated by selecting one of the angles. The combining circuit (107) determines combination parameters to provide a notch in an angle corresponding to the direction estimate and a minimization of a directivity cost measure where the directivity cost measure is indicative of a ratio between a gain in the first direction and an energy averaged gain.

Description

Audio beamforming
FIELD OF THE INVENTION
The invention relates to audio beamforming and in particular, but not exclusively, to audio beamforming using microphone arrays substantially smaller than the wavelength of the audio signals being beamformed.
BACKGROUND OF THE INVENTION
Advanced processing of audio signals has become increasingly important in many areas including e.g. telecommunication, content distribution etc. For example, in some applications, such as hands-free communication and voice control systems, complex processing of inputs from a plurality of microphones has been used to provide a configurable directional sensitivity for a microphone array comprising the microphones. Specifically, the processing of signals from a microphone array can generate an audio beam with a direction that can be changed simply by changing the characteristics of the combination of the individual microphone signals.
Typically, beam form algorithms seek to attenuate interferers while providing a high gain for a desired sound source. For example, a beamforming algorithm can be controlled to provide a strong attenuation (preferably a null) in the direction of a signal received from a main interferer.
For practical reasons it is desirable that the microphone array is relatively small. However, when the wavelength of the sound of interest is much larger than the size of the array, many beamforming algorithms, such as additive delay-and-sum beamforming algorithms, are not able to provide sufficient directivity as the beamwidth deteriorates substantially for such wavelengths.
One approach for achieving an improved directivity is to apply so called superdirective beamforming techniques. Such superdirective beamforming techniques are based on filters with asymmetrical filter coefficients and the approach essentially corresponds to subtraction of signals or determining spatial derivatives of the sound pressure field.
However, although this may improve the directivity, it is also known that this is achieved at the expense of robustness, such as increased sensitivity to white (sensor) noise and an increased sensitivity to mismatches in microphones characteristics.
In the article "Optimal Azimuthal Steering of a First-order Superdirectional
Microphone Response" by R.M.M. Derkx, International Workshop on Acoustic Echo and Noise Control, September 2008, Seattle, a system is analyzed which generates Eigenbeams for a two dimensional microphone array. The Eigenbeams are then combined to maximize the attenuation of a single point interference source. In particular, a null is located in the direction of a single point interferer while maintaining a suitable gain for the desired direction.
However, although this approach provides improved performance in many scenarios, it provides non optimal performance in some practical scenarios. It also tends to require relatively complex and resource demanding processing.
Hence, an improved approach for audio beamforming would be advantageous and in particular an approach allowing improved adaptation to current conditions and audio environment, increased flexibility, facilitated implementation, improved performance for different operating scenarios and/or improved performance would be advantageous.
SUMMARY OF THE INVENTION
Accordingly, the Invention seeks to preferably mitigate, alleviate or eliminate one or more of the above mentioned disadvantages singly or in any combination.
According to an aspect of the invention there is provided an audio beamforming apparatus comprising: a receiving circuit for receiving signals from an at least two-dimensional microphone array comprising at least three microphones; a reference circuit for generating at least three reference beams from the microphone signals; combining circuit for generating an output signal corresponding to a desired beam pattern by combining the reference beams in response to a first direction of a desired sound source and a direction estimate for an interfering sound source; an estimation circuit for generating the direction estimate by: determining a first angle corresponding to a local minimum for a power measure of the output signal in a first angle interval, determining a second angle
corresponding to a local minimum for a power measure of the output signal in a second angle interval, and determining the direction estimate as an angle selected from a set of angles corresponding to local minima for a power measure of the output signal, the set of angles comprising at least the first angle and the second angle; and wherein the combining circuit is arranged to determine combination parameters for the combining of the reference beams to provide a notch in an angle corresponding to the direction estimate and a minimization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and an average gain.
The invention may allow improved performance. In particular, an improved and/or facilitated adaptation to a current audio environment can be achieved. The invention may allow a beamforming approach which provides high performance for both directional point interference cancellation and for diffuse noise attenuation. The approach is particularly suitable for, and may provide particularly advantageous performance for, systems wherein the wavelength of the audio signals may be substantially larger than the size of the microphone array.
The invention may allow low complexity implementation and/or operation. The approach may be suitable for providing improved directivity and may in particular be suitable for scenarios wherein the size of the microphone array is much smaller than a wavelength of interest.
In many embodiments and scenarios, the approach may allow a null to be directed towards a single point interference while substantially reducing diffuse noise. In particular, the approach may in many scenarios allow a reduction of a single point interference corresponding to or better than many prior art interference reduction techniques, while at the same time providing improved diffuse noise.
The approach may in many scenarios allow a low complexity yet highly efficient and advantageous beam steering based on low complexity parallel local minima extraction. In many embodiments, the approach may ensure that at least one of the identified local minima is also a global minimum and thus may allow an efficient estimation of the angle of interference.
The reference beams may be non-adaptive and may be independent of the captured signals and/or the audio conditions. The reference beams may be constant and may be generated by a constant/ non-adaptive combination of the signals from the at least three microphones. The reference beams may specifically be Eigenbeams or orthogonal beams.
The first angle interval and the second angle interval may be disjoint intervals and may be adjacent intervals. The first and second angle intervals may together cover the entire 360° interval.
The interfering sound source may be an assumed interfering sound source. A direction estimate for a sound source may be generated independently of whether the sound source is present or not. Thus, even if no interfering point source is detected, the estimation circuit may generate the direction estimate from the microphone signals under the assumption that an interfering sound source is present.
In accordance with an optional feature of the invention, the estimation circuit is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle separating the first angle interval and the second angle interval.
This may provide a particularly efficient and low complexity determination of the direction estimate. The angle may be any angle between the first angle interval and the second angle interval including the end points of one or both of the angle intervals.
In accordance with an optional feature of the invention, the first angle interval comprises angles from 0 to π and the second angle interval comprises angles from π to 2π.
This may provide particularly advantageous performance and may in particular allow adaptation for all possible directions of the interfering sound source.
In accordance with an optional feature of the invention, the estimation circuit is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle of π.
This may provide a particularly efficient and low complexity determination of the direction estimate.
In accordance with an optional feature of the invention, the combining circuit comprises a sidelobe canceller.
This may provide particularly advantageous performance and/or practical implementation.
In accordance with an optional feature of the invention, the sidelobe canceller is arranged to generate the output signal as a weighted combination of at least a primary signal, a first noise reference signal and a second noise reference signal.
This may provide particularly advantageous performance and/or practical implementation. The primary signal may correspond to a beam adapted in the direction of the desired sound source and each of the reference signals may correspond to beams adapted to cancel/ reduce noise. The noise reference signals may specifically have notches in the direction of the desired sound source.
In accordance with an optional feature of the invention, the combining circuit is arranged to calculate weights for the first and second noise reference signals in response to the direction estimate and a minimization of the directivity cost measure. This may provide a particularly advantageous performance and/or low complexity implementation. In particular, the weights may be determined as a function of the direction estimate wherein the function is selected to minimize the directivity cost measure.
In accordance with an optional feature of the invention, the estimation circuit is arranged to determine at least one of the first and second angles by a gradient search applied to a sidelobe canceller corresponding to the side lobe canceller of the combining circuit and having an angle input variable.
This may provide a particularly advantageous performance and/or low complexity implementation. In particular, a gradient search may provide a highly efficient approach for identifying potential minima that may optimize the beamforming operation. An efficient and low complexity adaptation of the beamforming may be achieved which can reduce both diffuse noise and reduce/cancel a single point interference.
In many embodiments both the first and single angle are determined by a gradient search. The gradient search may be performed using a sidelobe canceller operation which is identical to the sidelobe canceller operation used to generate the output signal but with a value of the angle input variable that may be different than the phase value (the direction estimate) used to generate the output signal (thus which can be varied
independently).
In some embodiments, a gradient search may be applied in parallel in the two angle intervals using parallel sidelobe canceller operations with independent angle input variables. The output signal of the combining circuit may be selected as the signal of the parallel sidelobe canceller corresponding to the selected angle of the first and second angles.
In some embodiments, a sidelobe canceller corresponding to the sidelobe canceller of the combining circuit may be used to determine a gradient of a power measure of the output signal for a given angle (specifically π) and the selection between the first and second angle may be in response to the gradient.
In accordance with an optional feature of the invention, an update value for the angle input variable is determined as a function of an output signal of the sidelobe canceller for a current phase value of the angle input variable, and a first and second noise reference signal of the sidelobe canceller for the current phase value.
This may provide particularly advantageous performance and/or facilitated implementation and/or operation.
In accordance with an optional feature of the invention, the first and second noise reference signals are weighted as a function of the current phase value. This may provide particularly advantageous performance and/or facilitated implementation or operation.
In accordance with an optional feature of the invention, the estimation circuit is arranged to determine a power estimate for at least one of the first and second noise reference signals and to perform a normalization of the update value as a function of the power estimate.
This may provide particularly advantageous performance and/or facilitated implementation and/or operation.
In accordance with an optional feature of the invention, the at least two- dimensional microphone array comprises at least four microphones and the apparatus comprises a circuit for combining signals from at least two of the at least four microphones prior to generating the reference beams.
This may provide particularly advantageous performance and/or facilitated implementation and/or operation. In particular, it may provide improved noise performance in many scenarios.
In accordance with an optional feature of the invention, the apparatus of further comprises the at least two-dimensional microphone array, the at least two- dimensional microphone array comprising directional microphones having a maximum response in a direction outwardly of a perimeter of the at least two-dimensional microphone array.
This may provide particularly advantageous performance and/or facilitated implementation and/or operation.
According to an aspect of the invention there is provided a method of audio beamforming comprising: receiving signals from an at least two-dimensional microphone array comprising at least three microphones; generating at least three reference beams from the microphone signals; generating an output signal corresponding to a desired beam pattern by combining the reference beams in response to a first direction of a desired sound source and a direction estimate for an interfering sound source; generating the direction estimate by: determining a first angle corresponding to a local minimum for a power measure of the output signal in a first angle interval, determining a second angle corresponding to a local minimum for a power measure of the output signal in a second angle interval, and
determining the direction estimate as an angle selected from a set of angles corresponding to local minima for a power measure of the output signal, the set of angles comprising at least the first angle and the second angle; and wherein the combining of the reference beams comprises determining combination parameters for the combining of the reference beams to provide a notch in an angle corresponding to the direction estimate and a minimization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and an energy averaged gain.
These and other aspects, features and advantages of the invention will be apparent from and elucidated with reference to the embodiment(s) described hereinafter.
BRIEF DESCRIPTION OF THE DRAWINGS
Embodiments of the invention will be described, by way of example only, with reference to the drawings, in which
Fig. 1 illustrates an example of a system for capturing audio with an adaptable directional characteristic in accordance with some embodiments of the invention;
Fig. 2 illustrates an example of a microphone configuration for a microphone array;
Fig. 3 illustrates an example of Eigenbeams generated by the system of Fig. 1;
Fig. 4 illustrates an example of a sidelobe canceller used in the system of Fig. 1;
Fig. 5 illustrates an example of a cost function for adapting the system of Fig. 1;
Fig. 6 illustrates an example of local minima for the cost function of Fig. 5; and
Fig. 7 illustrates an example of local maxima for the cost function of Fig. 5
Fig. 8 illustrates an example of a method for capturing audio with an adaptable directional characteristic in accordance with some embodiments of the invention.
DETAILED DESCRIPTION OF SOME EMBODIMENTS OF THE INVENTION
Fig. 1 illustrates an example of a system for capturing audio with an adaptable directional characteristic. The system processes signals from a plurality of microphones to generate a suitable desired beam pattern. The processing is specifically adapted such that the generated output signal has substantially improved noise and interference characteristics. The system provides for a joint improvement in both single point interference and diffuse noise performance. The system is furthermore suitable for use in scenarios wherein the wavelength of the signals is substantially longer than the dimensions of the microphone array, i.e. than the distances between the microphones. The system processes the received microphone signals to generate a set of constant non-adaptable reference beams. These reference beams are then adaptively combined to generate a desired beam pattern. The combination is adapted such that the resulting beam form is adapted to cancel or substantially attenuate an assumed single point interference source while at the same time minimizing or reducing the impact of diffuse noise.
The system provides an efficient adaptive beamforming where the main lobe can be steered towards the direction of a desired sound source while adapting the directional pattern such that a point interferer from another angle is effectively rejected and a
substantially optimal rejection of diffuse (isotropic) noise is achieved. The system of Fig. 1 specifically includes an adaptive null-steering scheme with multiple gradient-estimates for adjusting the directivity pattern in such a way that this effective rejection of noise and interference can be achieved automatically.
The system of Fig. 1 comprises a microphone array 101 which is a two- dimensional microphone array. The microphone array 101 comprises at least three microphones which are not arranged in a single one dimensional line. In most embodiments, the shortest distance from one microphone to a line going through two other microphones is at least a fifth of the distance between these two microphones.
In the specific example, the microphone array 101 comprises three microphones which are spaced uniformly on a circle as illustrated in Fig. 2.
Thus, in the example a circular array of at least three (omni- or uni-directional) sensors in a planar geometry is used. It will be appreciated that in other embodiments, other arrangements of the microphones may be used. It will also be appreciated that for
embodiments wherein more than three microphones are used, these may possibly be arranged in a non-planar geometry, i.e. the microphone array may be a three dimensional microphone array. However, the following description will focus on a three microphone equidistant circular array arranged in the azimuth plane.
The microphone array 101 is coupled to a receiving circuit 103 which receives the microphone signals. In the example of Fig. 1, the receiving circuit 103 is arranged to amplify, filter and digitize the microphone signals as is well known to the skilled person.
The receiving circuit 103 is coupled to a reference processor 105 which is arranged to generate at least three reference beams from the microphone signals. The reference beams are constant beams that are not adapted but are generated by a fixed combination of the digitized microphone signals from the receiving circuit 103. In the example of Fig. 1, three orthogonal Eigenbeams are generated by the reference processor 105.
In the example, the three microphones of the microphone array are directional microphones and are specifically uni-directional cardioid microphones which are arranged such that the main gain is pointing outwardly from the perimeter formed by joining the positions of the microphones (and thus outwardly of the circle of the circular array in the specific example). The use of uni-directional cardioid microphones provides an advantage in that the sensitivity to sensor noise and sensor-mismatches is greatly reduced. However, it will be appreciated that in other scenarios other microphone types may be used, such as omni- directional microphones.
Denoting the responses of the three cardioid microphones as respectively Ec° , and and ignoring any uncorrelated sensor-noise, the z'th cardioid microphone
Figure imgf000010_0004
Figure imgf000010_0005
response is ideally given by:
Figure imgf000010_0001
with
Figure imgf000010_0002
where θ and Φ are the standard spherical coordinate angles: elevation and azimuth, c is the speed of sound and Xi and yl are the x and y coordinates of the z'th microphone.
Using:
and
Figure imgf000010_0003
with r the radius of the circle we can write:
Figure imgf000011_0001
From the three cardioid microphones, the three orthogonal Eigenbeams can be determined from:
Figure imgf000011_0002
For wavelengths larger than the size of the array, the responses of the
Eigenbeams are frequency invariant and ideally equal to:
Figure imgf000011_0003
The directivity patterns of these Eigenbeams are illustrated in Fig. 3.
The zero'th-order Eigenbeam Em represents the monopole response corresponding to a sphere whereas the other Eigenbeams represent first order Eigenbeams corresponding to double spheres as illustrated in Fig. 3. Thus, the two first order Eigenbeams are orthogonal dipoles.
The resulting signals from each of the three Eigenbeams are fed to a beamform circuit 107 which proceeds to adaptively combine these signals to provide a desired beam pattern.
Specifically, by suitable combining the first order Eigenbeams, a dipole can be steered to any angle φs . E.g. a weighted summation of the orthogonal diagonals can be generated:
Figure imgf000011_0004
where φs represents the desired angle for the resulting dipole.
The steered and scaled superdirectional microphone response can then be constructed by combining the steered dipole with the monopole, e.g. as:
Figure imgf000012_0001
where α < 1 is a parameter for controlling the directional pattern of the first-order response and S is an arbitrary scaling factor (that can also have negative values).
Thus, the beamform circuit 107 can generate a suitable beam pattern by a suitable combination of the reference Eigenbeams. The beamform circuit 107 is arranged to generate a nominal (e.g. unity) gain in the direction of a desired speaker coming from an arbitrary azimuthal angle Φ=φs. The direction of the desired speaker is assumed to be known by the beamform circuit 107. It will be appreciated that any suitable way of determining a desired direction may be used without detracting from the invention. For example, a fixed direction may be used or e.g. a tracking algorithm for a desired speaker or sound source may be used. It will be appreciated that many different algorithms for determining a desired sound source direction will be known to the skilled person.
The beamform circuit 107 is furthermore arranged to adapt the beam such that the sensitivity to diffuse noise is minimized and a notch is generated in an estimated direction of an assumed interfering point source. The system of Fig. 1 is specifically arranged to adapt the combination of the reference Eigenbeams such that the nominal gain is provided in the desired direction, a notch is generated in the direction estimated to correspond to a point source interference and with a minimization of the diffuse noise under these constraints. This is achieved by a highly efficient adaptation algorithm which will be described in the following.
The beamform circuit 107 is specifically coupled to an estimation circuit 109 which determines an estimate for the direction to an assumed point source interference. Based on the estimated direction, the beamform circuit 107 generates combination parameters for the combination of the Eigenbeams such that a notch (typically a null) is generated in the estimated direction. However, the combination of three Eigenbeams provides sufficient degrees of freedom to allow a range of solutions to the constraint of providing a nominal gain in a desired direction and a notch in an interference direction. In the system, this additional degree of freedom is used to improve the diffuse noise performance. This is specifically achieved by the combination parameters being selected to minimize a directivity cost measure where the directivity cost measure is indicative of a ratio between a power/energy gain in the first direction and an average power/energy gain. Specifically, the directivity cost measure may be indicative of the gain in the desired direction relative to an average gain of the resulting beam where the averaging is over all angles in the azimuth plane (i.e. from 0-2π) or from all directions in the three dimensions. Thus, the directivity cost measure is a function which indicates the attenuation of homogenous spatially diffuse noise (i.e. the same noise level in all direction) provided by the beam pattern.
The estimation circuit 109 is specifically arranged to determine the estimated angle of an interference point by searching for local minima of a power measure for the output signal. Thus, the estimation circuit 109 seeks to minimize the power of the output signal as this will correspond to the lowest noise/interference. In some embodiments, the estimation may only be performed when the desired sound source is inactive (e.g. when a desired speaker is not speaking) but it will be appreciated that this is not necessary for the minimization of the power of the output signal to be an indication of optimal
noise/interference operation (specifically the presence of the desired signal may introduce an offset to the power measure but will not change the position of the minimum).
The estimation circuit 109 determines at least two local minima by searching in at least two angle intervals. The two angle intervals are typically disjoint, although in some embodiments some overlap may occur. The local minima are determined in the different angle intervals by a parallel processing based on different angles. Specifically, the estimation circuit 109 may copy the operation of the beamform circuit 107 and evaluate the resulting output signal for different angles in the different angle intervals. The estimation circuit 109 may then select one of the angles that have been found to correspond to a local minima for the output signals and the selected angle is then used as the estimate for the assumed single point interference source. The selected angle is then fed to the beamform circuit 107 which proceeds to perform the combination such that a nominal gain is provided in the direction of the desired source and a notch is provided in the estimated direction of the main single point interference. Furthermore, the combination uses weights that are selected to further minimize the diffuse noise. This constraint is imposed by the weights being selected to minimize a directivity cost measure.
The estimation operation and adaptation is independent of the actual noise and interference conditions and specifically is independent of whether a significant single point interferer or diffuse noise is present or not. However, the approach results in very efficient performance across a wide variety of scenarios including scenarios with a dominant single point interference and no diffuse noise as well as scenarios with no single point interference but substantial diffuse noise. Indeed, the approach and underlying assumptions result in an operation that not only adapts to the specific characteristics of the noise and single point interference characteristics but also adapts to the type of noise/interference scenario that is experienced. This also reduces complexity and facilitates operation as there is no need to adapt the algorithm to the type of audio environment being experienced. This also provides increased flexibility and a wider application of the approach.
In the following, a specific example of the system of Fig. 1 will be described.
In the example, the beamform circuit 107 implements a side lobe canceller and the local minima are determined using a gradient search within each angle interval. Once the direction to the assumed single point interference has been estimated, combination parameters in terms of the weights applied to the noise reference signals are determined under the constraint that the directivity cost measure is minimized.
Fig. 4 illustrates an example of a generalized sidelobe canceller used in the system of Fig. 1. The two dipole reference beams are first combined to generate two dipoles which are angled in the desired directions. The resulting dipoles are then combined with the monopole to generate a primary signal which corresponds to a beam directed towards the desired audio source.
The primary response may be given by
Figure imgf000014_0002
where
Figure imgf000014_0001
and
Figure imgf000015_0001
The primary signal thus corresponds to the desired audio signal but also comprises signals from undesired directions. The impact of these sidelobes is reduced by generation of noise reference signals which are weighted and subtracted from the primary signal to generate the output signal.
Thus, the sidelobe canceller generates the noise reference signals given by
Figure imgf000015_0002
where B is a blocking matrix given by:
Figure imgf000015_0003
It is noted that the noise-references are respectively a cardioid and a dipole response, with a null steered towards the primary signal at azimuth φs and elevation θ=π/2.
The two noise reference signals are then weighted by weights W1 and W2 before being subtracted from the primary signal to provide the output signal. Thus, the overall beam-pattern from the sidelobe canceller is given by:
Figure imgf000015_0004
The beamform circuit 107 is arranged to generate a nominal gain, in the following a unity gain, in a desired angle φs and a notch, specifically a zero, in the direction φn of an assumed single point interference determined by the estimation circuit 109. With a unity gain in the direction of φs, the weights required to steer a zero towards the angle φn can be calculated by solving the equation:
Figure imgf000016_0002
where
Figure imgf000016_0005
Solving the equation yields:
Figure imgf000016_0003
or alternatively:
Figure imgf000016_0004
As can be seen, the constraints of the unity gain and the direction of the zero do not uniquely define the required weights but provide an extra degree of freedom.
In the system, this degree of freedom is used to optimize diffuse noise performance. In particular, the noise reference weights are selected such that a directivity cost measure is minimized.
A suitable directivity cost measure is given by:
Figure imgf000016_0001
Thus, the directivity cost measure represents a ratio between the gain in the desired direction and the overall (power) gain averaged over the entire sphere. It will be appreciated that in other embodiments, the gain averaging may e.g. only be in a two- dimensional plane such as the azimuth plane. For the response given by
Figure imgf000017_0001
this can be shown to correspond to:
Figure imgf000017_0002
Inserting the output signal response given by:
Figure imgf000017_0004
and inserting the value of W1 as given by:
Figure imgf000017_0003
in the directivity cost measure, and differentiating with respect to W2 and setting the result to zero allows for the minima of the directivity cost measure with respect to W2 to be determined. Thus, the value of W2 for which the directivity cost measure is minimized and thus the diffuse noise sensitivity is minimized can be determined. This specifically yields:
Figure imgf000017_0005
With this value of W2, we can also compute W1 as:
Figure imgf000018_0001
Thus, Wi and W2 can be calculated such that a unity gain is provided in the desired direction, a zero is formed in the direction of an assumed interferer and the diffuse noise attenuation is maximized under these constraints.
Thus, once the estimation circuit 109 has determined a suitable angle estimate for the assumed point source interferer, the derived equations can be used to calculate suitable weights that will also minimize the directivity cost measure and thus optimize the diffuse noise performance.
Rather than using the previously derived equation for W1, a value compensated for the effect of the design parameter can be used:
Figure imgf000018_0003
It can be shown that w2 can then be derived from:
Figure imgf000018_0002
Thus, the output signal y[k] is given by:
Figure imgf000018_0004
with
Figure imgf000019_0001
where is the estimate of the angle of the assumed undesired interferer and φs is the angle of the desired audio source.
The estimation circuit 109 proceeds to determine the direction estimate by minimizing a power measure for the output signal in different angle intervals.
Specifically, the estimation circuit 109 seeks to minimize the cost function given by:
Figure imgf000019_0002
where denotes the expected value.
Fig. 5 illustrates some examples of this cost function for a scenario wherein there is a single point interferer at the direction of φ equal to 1, 2 and 3 radians respectively (i.e. the angle difference φ between the desired direction and the direction between an actual interferer is 1, 2 and 3 radians respectively). The cost function is shown as a function of the estimated direction, i.e. as a function of the steering of the null performed by the weights of the reference signals. Fig. 6 illustrate the cost function in the presence of noise which either may be spherical (coming from all directions) or cylindrical (coming from all directions in a two-dimensional plane). The situation for spherical noise is shown by a full line and the situation for cylindrical noise is shown by the dashed line.
Some observations can be made from Fig. 5. Firstly, it is clear that in all situations, a notch (and specifically a null) exists for the right estimate, i.e. when φ = φ . However, it is clear that whereas this null is indeed a local minimum for the cost function, it is not the only local minimum. In particular, in some cases local minima are found which do not correspond to a null and in some cases other nulls exist. Thus, it can be seen that merely determining the angle estimate by finding local minima is not a sufficient approach. This can further be seen in Fig. 6 which illustrates the local minima of the cost function for different directions φ of an actual point source interferer. Again, it can be seen that there is a local minimum for the correct value (i.e. a minimum exists φ = φ for as indicated by the diagonal line). However, in addition it can be seen that at least one and possibly two other local minima exist. For example, for an interferer at an angle of 1 radian, a cost function minimum exists at the right value of 1 radian but also at the wrong value of around 3.8 radians. Furthermore, for an interferer angle between around 2 radians to 4.3 radians, two wrong local minima exist.
However, a further observation is that the correct minimum is always the only local minimum in the phase interval from either 0 to π or from π to 2π. Thus, for an interferer angle within the interval of [0;π], the only local minimum in the interval of [0;π] is the correct value. Similarly, for an interferer angle within the interval of [π;2π], the only local minimum in the interval of [π;2π] is the correct value.
This realization is exploited in the system of Fig. 1. Specifically, the estimation circuit 109 is arranged to determine a local minimum in the angle interval of [0;π] and a local minimum in the angle interval of [π;2π]. Thus, the estimation circuit 109 determines two angles for which the cost function corresponding to the power of the output signal is minimized. This approach ensures that one of the determined local minima will correspond to the correct estimated angle.
The estimation circuit 109 then proceeds to select one of the two estimated values as the estimated angle that is used to control the beamforming by the beamform circuit 107. Thus, one of the local minima is selected and used to calculate the weights for the noise reference signals using the equations that also optimize diffuse noise performance.
It will be appreciated that different criteria for selecting between the determined local minima may be used. For example, in some embodiments, a simple beamforming may be applied to the microphone signals such that a beam is formed in each of the two directions in order to measure the interference level in those directions. The direction having the highest level is then selected as it corresponds to the most dominant interference.
However, in the specific example, the selection of the correct local minima is based on the gradient of the cost function at a specific angle which separates the two angle intervals (i.e. it is inbetween the two angle intervals and may specifically be an endpoint of one or both of the intervals). In the specific example, the gradient at π is determined and is used to select the appropriate local minimum. Specifically, if the cost function has a positive gradient for φ =π then the local minimum in the interval of [0;π] is selected and otherwise the local minimum in the interval of [π;2π] is selected. Indeed, it has been found that such a selection provides a very reliable indication of the correct local minima and thus provides a low complexity but efficient selection approach.
Intuitively, it can be understood as follows. The directional beam pattern for φ =π yields a cardioid response with only a single null. The gradient of the cost function for φ =π therefore yields the direction toward the true value of φ. If the gradient is negative, the true value of φ lies in the interval [π;2π]. If the gradient is positive, the true value of φ lies in the interval [0;π].
It should also be noted that in some embodiments, all the local minima of the function may be determined and separated into the two angle intervals. Indeed, in such an embodiment, the detection of two local minima in one interval may automatically lead to the selection of the other minimum (i.e. the one in the other angle interval). This approach is based on the realization that (as illustrated in Fig. 6), the correct local minimum will be the only local minimum in the angle interval. It will also be appreciated that this leads to the conclusion that it is not necessary to identify more than one minimum in each angle interval as any non-identified local minima will inherently not be the correct minimum as it is in an angle interval with more than one minimum.
In the system of Fig. 1, the determination of the local minima is performed by performing a gradient search in each angle interval.
Thus, the estimation circuit 109 performs a sidelobe cancelling operation corresponding to that of the beamform circuit 107 while using an input angle value that is constantly updated and biased in the direction that will reduce the cost function. This approach will result in the angle variable ending in a local minimum.
Specifically, a steepest descent update equation for φ can be derived by stepping in the direction opposite to the surface of the cost function with respect to φ :
Figure imgf000021_0001
with a gradient given by:
Figure imgf000021_0002
and where μ is the update step-size with 0 <μ< 1.
In practice, the mean value is not available and therefore an instantaneous estimate of the gradient is used:
Figure imgf000022_0001
by inserting the previously derived formulas for y[k] and performing the derivation, this can be shown to lead to:
Figure imgf000022_0002
where
Figure imgf000022_0003
Thus, the update value for the angle input variable of the gradient search is a function of an output signal of the sidelobe canceller and of the first and second noise reference signals.
In the above example, the update value is dependent on the power of the noise references. In order to compensate for this, the estimation circuit 109 may determine a power estimate for one or both of the noise reference signals and normalize the update value accordingly.
Accordingly, the following update equation for the gradient search may be used:
Figure imgf000023_0001
where ε is a small value to prevent zero division and is the power estimate of the z'th noise reference signal. This can specifically be calculated by a recursive averaging:
Figure imgf000023_0003
where β is a suitable design parameter.
Thus, for each angle interval, the estimation circuit 109 operates a sidelobe canceller applied to the same signals as the sidelobe canceller of the beamform circuit 107. However, the sidelobe cancellers are operated based on an input angle variable which corresponds to a current estimate of the angle to the assumed point source interferer. The input angle variable is continuously updated using the gradient search approach such that it will converge on the local minimum in the angle interval.
The estimation circuit 109 then selects between the current values of the input angle variables and uses this result as the estimated angle for the assumed point source interferer. The selection is based on the gradient of the cost function for an input variable of π. The estimation circuit 109 may specifically determine this by operating a further sidelobe canceller process on the input signals but with a fixed angle value of π. Specifically, the estimation circuit 109 may continuously evaluate the update value:
Figure imgf000023_0002
for φ =π. The derived values may be averaged over time and the sign of the averaged value (i.e. the gradient of the cost function at π) is then used to select which of the angles determined by the gradient searches is used.
It will be appreciated that whereas the previous discussion illustrated the principle by referring to the use of four sidelobe cancellers (one for the beamform circuit 107, one for each gradient search, and one for determining the gradient at π), this is merely used to illustrate the principle. Indeed, in many embodiments, the same sidelobe canceller may be implemented, e.g. as a subroutine, and used for the different purposes and with different input angles.
It will also be appreciated that typically, the beamform circuit 107 will not repeat a sidelobe canceller operation for the estimated angle but will directly use the output signal calculated for the selected angle when performing the estimation.
In the example, the gradient search is arranged to re-initialize the gradient search if the value of the angle input variable moves out of the corresponding angle interval. Specifically, a re-initialization of the gradient search may be performed if the two gradient searches reach a scenario wherein the both have angle values in the same angle interval. For example, if during the gradient search in the [0;π] interval, the updated angle value moves into the [π;2π] interval such that both gradient searches have current values within this interval, the gradient search is re-initialized. The re-initialization is specifically performed by resetting the value of the input angle variable of one of the two gradient searches to an initial value. The initial value may for example be a fixed value such as the midpoint in the interval (i.e. π/2 and 3π/2).
From Fig. 6, we can see that the relevant quadrant for the gradient search in the interval [0;π] is the lower-left quadrant. For the gradient search in the interval [π;2π], the upper-right quadrant is relevant.
Next looking at the lower-left quadrant for the gradient search in the interval
[0;π], we can see from Fig. 6 and Fig. 7 (showing respectively the minima and maxima of the cost function) that a re-initialization within this interval would only lead to a correct convergence φ =φ in case the re-initialization would be done in the range [0;η], where η~2.55.
When the re-initialization would be larger than η, there is a risk that the gradient search again ends up in the wrong quadrant, i.e. the upper-left quadrant. Especially when φ is equal or close to zero, it is mandatory that re-initialization is done in the range [0;η], where η~2.55.
Hence, for the re-initialization, it is safe to choose a mid-point in the interval [0;η], (i.e. η / 2), where η~2.55.
A specific example of the approach that may be used is illustrated in Fig. 8. In step 801, the parameter values are initialized. Step 801 is followed by step 803 wherein it is ensured that is smaller
Figure imgf000025_0001
than Cp2[A:] (if not the two variable values are simply swapped).
Step 803 is followed by step 805 wherein it is determined if the two gradient searches have resulted in angle values, in the same angle interval. If so, the
Figure imgf000025_0002
appropriate value is re-initialized to ensure there is one angle in each angle interval.
Step 805 is followed by step 807 wherein the weights for the noise reference signals, the resulting output signal and the cost function gradients are calculated.
Step 807 is followed by step 809 wherein the new values for the angle input variables, Cp1[A:] , Cp2[A:] , of the gradient searches are calculated. Furthermore, the filtered cost function gradient at π is calculated.
Step 809 is followed by step 811 wherein the appropriate angle value is selected based on the filtered cost function gradient at π.
Step 811 is followed by step 813 wherein the power estimates for the noise reference signals used in the update value determination are updated.
After step 813 the method returns to step 803 to process the next sample.
A pseudo-code of an algorithm corresponding to Fig. 1 may be represented as:
Figure imgf000026_0001
In the specific example, the number of microphones in the microphone array 101 corresponded to the number of reference beams (i.e. three). However, in some embodiments, the microphone array may comprise more microphones than reference beams.
Specifically the microphone array 101 may comprise at least four microphones. The system may still only generate three reference beams and may specifically be arranged to combine signals from at least two microphones prior to generating the reference beams. Thus, the reference processor 105 may still only receive three input signals and generate three reference beams from these. However, at least one of these input signals may be generated by combining (and specifically averaging or adding (e.g. by a weighted summation)) the signals from at least two microphones. Such an approach may provide improved noise performance in many scenarios as the level of uncorrelated noise may be averaged. Furthermore, using more microphones on a particular area, has the advantage that spatial aliasing will occur at a higher frequency.
It will be appreciated that the above description for clarity has described embodiments of the invention with reference to different functional circuits, units and processors. However, it will be apparent that any suitable distribution of functionality between different functional circuits, units or processors may be used without detracting from the invention. For example, functionality illustrated to be performed by separate processors or controllers may be performed by the same processor or controllers. Hence, references to specific functional units or circuits are only to be seen as references to suitable means for providing the described functionality rather than indicative of a strict logical or physical structure or organization.
The invention can be implemented in any suitable form including hardware, software, firmware or any combination of these. The invention may optionally be
implemented at least partly as computer software running on one or more data processors and/or digital signal processors. The elements and components of an embodiment of the invention may be physically, functionally and logically implemented in any suitable way.
Indeed the functionality may be implemented in a single unit, in a plurality of units or as part of other functional units. As such, the invention may be implemented in a single unit or may be physically and functionally distributed between different units, circuits and processors.
Although the present invention has been described in connection with some embodiments, it is not intended to be limited to the specific form set forth herein. Rather, the scope of the present invention is limited only by the accompanying claims. Additionally, although a feature may appear to be described in connection with particular embodiments, one skilled in the art would recognize that various features of the described embodiments may be combined in accordance with the invention. In the claims, the term comprising does not exclude the presence of other elements or steps.
Furthermore, although individually listed, a plurality of means, elements, circuits or method steps may be implemented by e.g. a single circuit, unit or processor.
Additionally, although individual features may be included in different claims, these may possibly be advantageously combined, and the inclusion in different claims does not imply that a combination of features is not feasible and/or advantageous. Also the inclusion of a feature in one category of claims does not imply a limitation to this category but rather indicates that the feature is equally applicable to other claim categories as appropriate.
Furthermore, the order of features in the claims do not imply any specific order in which the features must be worked and in particular the order of individual steps in a method claim does not imply that the steps must be performed in this order. Rather, the steps may be performed in any suitable order. In addition, singular references do not exclude a plurality. Thus references to "a", "an", "first", "second" etc do not preclude a plurality. Reference signs in the claims are provided merely as a clarifying example shall not be construed as limiting the scope of the claims in any way.

Claims

CLAIMS:
1. An audio beamforming apparatus comprising:
a receiving circuit (103) for receiving signals from an at least two-dimensional microphone array (101) comprising at least three microphones;
a reference circuit (105) for generating at least three reference beams from the microphone signals;
combining circuit (107) for generating an output signal corresponding to a desired beam pattern by combining the reference beams in response to a first direction of a desired sound source and a direction estimate for an interfering sound source;
an estimation circuit (109) for generating the direction estimate by:
determining a first angle corresponding to a local minimum for a power measure of the output signal in a first angle interval,
determining a second angle corresponding to a local minimum for a power measure of the output signal in a second angle interval, and
determining the direction estimate as an angle selected from a set of angles corresponding to local minima for a power measure of the output signal, the set of angles comprising at least the first angle and the second angle; and
wherein the combining circuit (107) is arranged to determine combination parameters for the combining of the reference beams to provide a notch in an angle corresponding to the direction estimate and a minimization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and an average gain.
2. The apparatus of claim 1 wherein the estimation circuit (109) is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle separating the first angle interval and the second angle interval.
3. The apparatus of claim 1 wherein the first angle interval comprises angles from 0 to π and the second angle interval comprises angles from π to 2π.
4. The apparatus of claim 3 wherein the estimation circuit (109) is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle of π.
5. The apparatus of claim 1 wherein the combining circuit (107) comprises a sidelobe canceller.
6. The apparatus of claim 5 wherein the sidelobe canceller is arranged to generate the output signal as a weighted combination of at least a primary signal, a first noise reference signal and a second noise reference signal.
7. The apparatus of claim 6 wherein the combining circuit (107) is arranged to calculate weights for the first and second noise reference signals in response to the direction estimate and a minimization of the directivity cost measure.
8. The apparatus of claim 5 wherein the estimation circuit (109) is arranged to determine at least one of the first and second angles by a gradient search applied to a sidelobe canceller corresponding to the sidelobe canceller of the combining circuit and having an angle input variable.
9. The apparatus of claim 8 wherein an update value for the angle input variable is determined as a function of an output signal of the sidelobe canceller for a current phase value of the angle input variable, and a first and second noise reference signal of the sidelobe canceller for the current phase value.
10. The apparatus of claim 9 wherein the first and second noise reference signals are weighted as a function of the current phase value.
11 The apparatus of claim 9 wherein the estimation circuit (109) is arranged to determine a power estimate for at least one of the first and second noise reference signals and to perform a normalization of the update value as a function of the power estimate.
12. The apparatus of claim 1 wherein the at least two-dimensional microphone array comprises at least four microphones and the apparatus comprises a circuit for combining signals from at least two of the at least four microphones prior to generating the reference beams.
13. The apparatus of claim 1 further comprising the at least two-dimensional microphone array (101), the at least two-dimensional microphone array (101) comprising directional microphones having a maximum response in a direction outwardly of a perimeter of the at least two-dimensional microphone array.
14. A method of audio beamforming comprising:
receiving signals from an at least two-dimensional microphone array comprising at least three microphones;
generating at least three reference beams from the microphone signals;
generating an output signal corresponding to a desired beam pattern by combining the reference beams in response to a first direction of a desired sound source and a direction estimate for an interfering sound source;
generating the direction estimate by:
determining a first angle corresponding to a local minimum for a power measure of the output signal in a first angle interval,
determining a second angle corresponding to a local minimum for a power measure of the output signal in a second angle interval, and
determining the direction estimate as an angle selected from a set of angles corresponding to local minima for a power measure of the output signal, the set of angles comprising at least the first angle and the second angle; and
wherein the combining of the reference beams comprises determining combination parameters for the combining of the reference beams to provide a notch in an angle corresponding to the direction estimate and a minimization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and an energy averaged gain.
15. A computer program product comprising a computer program enabling a processor to carry out the method of claim 14.
PCT/IB2010/053335 2009-07-24 2010-07-22 Audio beamforming WO2011010292A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
CN201080033006.9A CN102474680B (en) 2009-07-24 2010-07-22 Audio signal beam is formed
JP2012521148A JP5777616B2 (en) 2009-07-24 2010-07-22 Sound beam forming
RU2012106592/28A RU2550300C2 (en) 2009-07-24 2010-07-22 Audio signal beam forming
EP10745004.1A EP2457384B1 (en) 2009-07-24 2010-07-22 Audio beamforming
US13/384,720 US9084037B2 (en) 2009-07-24 2010-07-22 Audio beamforming

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP09166297.3 2009-07-24
EP09166297 2009-07-24

Publications (1)

Publication Number Publication Date
WO2011010292A1 true WO2011010292A1 (en) 2011-01-27

Family

ID=42989670

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2010/053335 WO2011010292A1 (en) 2009-07-24 2010-07-22 Audio beamforming

Country Status (6)

Country Link
US (1) US9084037B2 (en)
EP (1) EP2457384B1 (en)
JP (1) JP5777616B2 (en)
CN (1) CN102474680B (en)
RU (1) RU2550300C2 (en)
WO (1) WO2011010292A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102969002A (en) * 2012-11-28 2013-03-13 厦门大学 Microphone array speech enhancement device capable of suppressing mobile noise
CN104106267A (en) * 2011-06-21 2014-10-15 若威尔士有限公司 Signal-enhancing beamforming in augmented reality environment
ITUA20164622A1 (en) * 2016-06-23 2017-12-23 St Microelectronics Srl BEAMFORMING PROCEDURE BASED ON MICROPHONE DIES AND ITS APPARATUS
CN110517677A (en) * 2019-08-27 2019-11-29 腾讯科技(深圳)有限公司 Speech processing system, method, equipment, speech recognition system and storage medium

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104464739B (en) 2013-09-18 2017-08-11 华为技术有限公司 Acoustic signal processing method and device, Difference Beam forming method and device
US9854594B2 (en) * 2014-03-31 2017-12-26 International Business Machines Corporation Wireless cross-connect switch
US9516409B1 (en) 2014-05-19 2016-12-06 Apple Inc. Echo cancellation and control for microphone beam patterns
US9326060B2 (en) * 2014-08-04 2016-04-26 Apple Inc. Beamforming in varying sound pressure level
US20170164102A1 (en) * 2015-12-08 2017-06-08 Motorola Mobility Llc Reducing multiple sources of side interference with adaptive microphone arrays
CN105959872B (en) * 2016-04-21 2019-07-02 歌尔股份有限公司 Intelligent robot and Sounnd source direction discriminating conduct for intelligent robot
EP3520503B1 (en) * 2016-09-28 2021-06-09 IDAC Holdings, Inc. Uplink power control
CN110140359B (en) * 2017-01-03 2021-10-29 皇家飞利浦有限公司 Audio capture using beamforming
US10638224B2 (en) * 2017-01-03 2020-04-28 Koninklijke Philips N.V. Audio capture using beamforming
EP3416407B1 (en) * 2017-06-13 2020-04-08 Nxp B.V. Signal processor
CN109104683B (en) * 2018-07-13 2021-02-02 深圳市小瑞科技股份有限公司 Method and system for correcting phase measurement of double microphones
GB201814988D0 (en) 2018-09-14 2018-10-31 Squarehead Tech As Microphone Arrays
US11956590B2 (en) 2019-03-19 2024-04-09 Northwestern Polytechnical University Flexible differential microphone arrays with fractional order
US11438691B2 (en) * 2019-03-21 2022-09-06 Shure Acquisition Holdings, Inc. Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition functionality
CN112216299B (en) * 2019-07-12 2024-02-20 大众问问(北京)信息科技有限公司 Dual-microphone array beam forming method, device and equipment
KR20220022315A (en) * 2020-08-18 2022-02-25 삼성전자주식회사 Electronic device including a camera and a microphone

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1455552A2 (en) * 2003-03-06 2004-09-08 Samsung Electronics Co., Ltd. Microphone array, method and apparatus for forming constant directivity beams using the same, and method and apparatus for estimating acoustic source direction using the same
US20060140417A1 (en) * 2004-12-23 2006-06-29 Zurek Robert A Method and apparatus for audio signal enhancement
WO2009034524A1 (en) * 2007-09-13 2009-03-19 Koninklijke Philips Electronics N.V. Apparatus and method for audio beam forming

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0820210A3 (en) * 1997-08-20 1998-04-01 Phonak Ag A method for elctronically beam forming acoustical signals and acoustical sensorapparatus
WO2005065012A2 (en) * 2003-12-24 2005-07-21 Nokia Corporation A method for efficient beamforming using a complementary noise separation filter
JP2008061186A (en) * 2006-09-04 2008-03-13 Yamaha Corp Directional characteristic control apparatus, sound collecting device and sound collecting system
JP2008256448A (en) * 2007-04-03 2008-10-23 Toshiba Corp High-resolution system
US8934640B2 (en) * 2007-05-17 2015-01-13 Creative Technology Ltd Microphone array processor based on spatial analysis

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1455552A2 (en) * 2003-03-06 2004-09-08 Samsung Electronics Co., Ltd. Microphone array, method and apparatus for forming constant directivity beams using the same, and method and apparatus for estimating acoustic source direction using the same
US20060140417A1 (en) * 2004-12-23 2006-06-29 Zurek Robert A Method and apparatus for audio signal enhancement
WO2009034524A1 (en) * 2007-09-13 2009-03-19 Koninklijke Philips Electronics N.V. Apparatus and method for audio beam forming

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
DERKX, R.: "Adaptive Azimuthal Null-Steering for a First-order Microphone Response", PROCEEDINGS OF THE 12TH INTERNATIONAL WORKSHOP ON ACOUSTIC ECHO AND NOISE CONTROL, 30 August 2010 (2010-08-30) - 2 September 2010 (2010-09-02), Tel-Aviv, Israel, XP002608523 *
DERKX, R.: "Optimal Azimuthal Steering of a First-order Superdirectional Microphone Response", PROCEEDINGS OF THE 11TH INTERNATIONAL WORKSHOP ON ACOUSTIC ECHO AND NOISE CONTROL, 14 September 2008 (2008-09-14) - 17 September 2008 (2008-09-17), Seattle, WA, USA, XP002608524 *
NAGATA N., ABE M.: "Two-Channel Adaptive Microphone Array with Target Tracking", ELECTRONICS AND COMMUNICATIONS IN JAPAN, PART 3, vol. 83, no. 12, 2000, pages 19 - 24, XP002608525 *
R.M.M. DERKX: "Optimal Azimuthal Steering of a First-order Superdirectional Microphone Response", INTERNATIONAL WORKSHOP ON ACOUSTIC ECHO AND NOISE CONTROL, September 2008 (2008-09-01)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104106267A (en) * 2011-06-21 2014-10-15 若威尔士有限公司 Signal-enhancing beamforming in augmented reality environment
US9973848B2 (en) 2011-06-21 2018-05-15 Amazon Technologies, Inc. Signal-enhancing beamforming in an augmented reality environment
CN102969002A (en) * 2012-11-28 2013-03-13 厦门大学 Microphone array speech enhancement device capable of suppressing mobile noise
CN102969002B (en) * 2012-11-28 2014-09-03 厦门大学 Microphone array speech enhancement device capable of suppressing mobile noise
ITUA20164622A1 (en) * 2016-06-23 2017-12-23 St Microelectronics Srl BEAMFORMING PROCEDURE BASED ON MICROPHONE DIES AND ITS APPARATUS
EP3261361A1 (en) * 2016-06-23 2017-12-27 STMicroelectronics Srl Beamforming method based on arrays of microphones and corresponding apparatus
CN107544055A (en) * 2016-06-23 2018-01-05 意法半导体股份有限公司 Beamforming Method and corresponding device based on microphone array
US9913030B2 (en) 2016-06-23 2018-03-06 Stmicroelectronics S.R.L. Beamforming method based on arrays of microphones and corresponding apparatus
CN110517677A (en) * 2019-08-27 2019-11-29 腾讯科技(深圳)有限公司 Speech processing system, method, equipment, speech recognition system and storage medium
CN110517677B (en) * 2019-08-27 2022-02-08 腾讯科技(深圳)有限公司 Speech processing system, method, apparatus, speech recognition system, and storage medium

Also Published As

Publication number Publication date
CN102474680A (en) 2012-05-23
EP2457384B1 (en) 2020-09-09
US20120114128A1 (en) 2012-05-10
EP2457384A1 (en) 2012-05-30
CN102474680B (en) 2015-08-19
JP2013500617A (en) 2013-01-07
US9084037B2 (en) 2015-07-14
JP5777616B2 (en) 2015-09-09
RU2012106592A (en) 2013-08-27
RU2550300C2 (en) 2015-05-10

Similar Documents

Publication Publication Date Title
EP2457384B1 (en) Audio beamforming
EP2540094B1 (en) Audio source localization
US11381906B2 (en) Conference system with a microphone array system and a method of speech acquisition in a conference system
US20230124859A1 (en) Conferencing Device with Beamforming and Echo Cancellation
US11765498B2 (en) Microphone array system
JP3701940B2 (en) System and method for processing a signal emitted from a target signal source into a noisy environment
Rafaely et al. Spherical microphone array beamforming
WO2009034524A1 (en) Apparatus and method for audio beam forming
KR100831655B1 (en) Method for adjusting adaptation control of adaptive interference canceller
WO2010043998A1 (en) Microphone system and method of operating the same
WO2013085605A1 (en) Near-field null and beamforming
WO2006006935A1 (en) Capturing sound from a target region
WO2009034536A2 (en) Audio activity detection
Itzhak et al. Differential and constant-beamwidth beamforming with uniform rectangular arrays
Itzhak et al. Region-of-interest oriented constant-beamwidth beamforming with rectangular arrays
Markovich‐Golan et al. Spatial filtering
Sun et al. The deconvolved conventional beamforming for non-uniform line arrays
US20230224635A1 (en) Audio beamforming with nulling control system and methods
Lin et al. Recursive extended instrumental variable based LCMV beamformers for planar radial coprime arrays under spatially colored noise
Ali et al. Performance investigation of robust cylindrical antenna array beamformer in appearance of look direction mismatch
Chang A robust space-time generalized sidelobe canceller
Zheng et al. Robustness and distance discrimination of adaptive near field beamformers
Choi Robust adaptive array with variable uncertainty bound under weight vector norm Constraint
Choi Efficient generalized sidelobe canceller for partially adaptive beamforming
Selvan et al. Methods for preventing signal cancellation due to correlated interferences in adaptive array systems: A tutorial

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201080033006.9

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10745004

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2010745004

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 13384720

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2012521148

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 966/CHENP/2012

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2012106592

Country of ref document: RU