US20140241546A1

US20140241546A1 - Microphone sensitivity difference correction device, method, and noise suppression device

Info

Publication number: US20140241546A1
Application number: US14/155,731
Authority: US
Inventors: Chikako Matsumoto
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2013-02-28
Filing date: 2014-01-15
Publication date: 2014-08-28
Also published as: JP2014168188A; EP2773137A2; EP2773137A3; EP2773137B1; US9204218B2; JP6020258B2

Abstract

A microphone sensitivity difference correction device includes a detection section that detects a frequency domain signal expressing a stationary noise, based on frequency domain signals of input sound signals respectively input from plural microphones; a first correction section that employs the stationary noise to compute a first correction coefficient for correcting the sensitivity difference between the plural microphones by a frame unit; and a second correction section that employs the frequency domain signals that have been corrected by the first correction section to compute a second correction coefficient for correcting by frequency unit the sensitivity difference between the plural microphones for each of the frames.

Description

CROSS-REFERENCE TO RELATED APPLICATION

This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2013-039695, filed on Feb. 28, 2013, the entire contents of which are incorporated herein by reference.

FIELD

The embodiments discussed herein are related to a microphone sensitivity difference correction device, a microphone sensitivity difference correction method, a microphone sensitivity difference correction program and a noise suppression device.

BACKGROUND

In, for example, a vehicle mounted car navigation system, a hands-free phone, or a telephone conference system, noise suppression is conventionally performed to suppress noise contained in a speech signal that has mixed-in noise other than a target voice (for example voices of people talking). Technology employing a microphone array including plural microphones is known as such noise suppression technology.
In such conventional noise suppression technology using a microphone array, there is a method for noise suppression based on an amplitude ratio between signals received from plural microphones. The amplitude ratio becomes 1.0 when the distance between each of the microphones and the sound source is the same distance or when far away, and the amplitude ratio is a value that deviates from 1.0 when the distance between each of the microphones and the sound source is a different distance. Noise suppression based on the amplitude ratio is a method that employs the amplitude ratio, and so, for example, when a target sound source is present at a position that has different distances to each of the microphones, the method suppresses noise that has a value of amplitude ratio of close to 1.0 in the received signals from the plural microphones.
However, even when the distances between each of the microphones and the sound sources are the same distances, sometimes the value of the amplitude ratio deviates from 1.0 due to sensitivity differences that arise between each of the microphones. Since accurate noise suppression based on amplitude ratio is not be performed in such cases, there is accordingly a need for technology to correct for such sensitivity differences between the microphones.
As technology to correct sensitivity differences between microphones, there is, for example, a proposal for a device that corrects the level from at least one sound signal by deriving a correction coefficient when performing audio processing based on sound signals respectively generated from sound input to plural sound input sections. In such a device, for respective sounds input to the plural sound input sections, frequency components are detected of sound arriving from a substantially orthogonal direction with respect to a straight line defining the placement position of a first sound input section and a second sound input section among the plural sound input sections. The direction from which the sound arrives is detected based on phase differences between the sounds arriving from the first sound input section and the second sound input section. In order to match the levels of sound signal respectively generated by the first sound input section and the second sound input section based on the sound of the detected frequency components, correction coefficients are derived for correcting the level of at least one of the respective sound signals generated from the input sound by the first sound input section and the second sound input section.

Claims

What is claimed is:

1. A microphone sensitivity difference correction device comprising:

a detection section that detects a frequency domain signal expressing a stationary noise, based on frequency domain signals of input sound signals respectively input from a plurality of microphones contained in a microphone array that have been converted into signals in a frequency domain for each frame;

a first correction section that employs the frequency domain signal expressing the stationary noise to compute a first correction coefficient for correcting the sensitivity difference between the plurality of microphones by a frame unit, and that employs the first correction coefficient to correct the frequency domain signals by frame unit; and

a second correction section that employs the frequency domain signals that have been corrected by the first correction section to compute a second correction coefficient for correcting by frequency unit the sensitivity difference between the plurality of microphones for each of the frames, and that employs the second correction coefficient to correct for each of the frames by frequency unit the frequency domain signals that have been corrected by the first correction section.

2. The microphone sensitivity difference correction device of claim 1, further comprising:

a phase difference computation section that computes a phase difference for each frequency between frequency domain signals that correspond to each of the input sound signals,

wherein the detection section, based on the phase difference for each of the frequencies, detects, as a frequency domain signal expressing the stationary noise, the frequency domain signals that correspond to the input sound signal that has arrived from a direction other than a sound source direction of a target voice.

3. The microphone sensitivity difference correction device of claim 2, further comprising:

a phase difference utilization range setting section that, based on an inter-microphone distance between the plurality of microphones and a sampling frequency, sets, as a phase difference utilization range, a frequency band in which phase rotation of phase difference for each of the frequencies does not occur, wherein:

the phase difference computation section computes a phase difference for each of the frequencies in the phase difference utilization range, and

the detection section detects a frequency domain signal expressing the stationary noise in the phase difference utilization range.

4. The microphone sensitivity difference correction device of claim 3, further comprising an accuracy computation section that computes a probability that the input sound signal has arrived from the sound source direction of the target voice based on a phase difference for each frequency of the phase difference utilization range, and that, when the probability is higher than a predetermined probability threshold value, computes a degree of accuracy of correction by the first correction section and the second correction section based on respective frequency domain signals that correspond to each of the input sound signals.

5. The microphone sensitivity difference correction device of claim 4, wherein, based on the degree of accuracy, the accuracy computation section updates at least one of:

a first update coefficient expressing a degree to reflect the first correction coefficient value computed the previous time when the first correction coefficient is being computed by the first correction section,

a second update coefficient expressing a degree to reflect the second correction coefficient value computed the previous time when the second correction coefficient is being computed by the second correction section, or

a third update coefficient expressing a degree to reflect the degree of accuracy value computed the previous time when the degree of accuracy is being computed by the accuracy computation section.

6. The microphone sensitivity difference correction device of claim 4, wherein when the degree of accuracy has exceeded a predetermined end threshold value, the accuracy computation section ends degree of accuracy computation, and ends computation of the first correction coefficient by the first correction section and ends computation of the second correction coefficient by the second correction section.

7. A noise suppression device comprising:

the microphone sensitivity difference correction device of claim 1; and

a suppression section that suppresses noise contained in the input sound signal based on an amplitude ratio between the plurality of input sound signals derived using the frequency domain signals that have been corrected by the second correction section.

8. A noise suppression device comprising:

the microphone sensitivity difference correction device of claim 4; and

a suppression section that, when a degree of accuracy computed by the accuracy computation section is greater than a predetermined suppression threshold value, suppresses noise contained in the input sound signal based on an amplitude ratio between the plurality of input sound signals derived using the frequency domain signals that have been corrected by the second correction section.

9. A microphone sensitivity difference correction method that causes a computer to execute processing, the processing comprising:

detecting a frequency domain signal expressing a stationary noise, based on frequency domain signals of input sound signals respectively input from a plurality of microphones contained in a microphone array that have been converted into signals in a frequency domain for each frame;

employing the frequency domain signal expressing the stationary noise to compute a first correction coefficient for correcting the sensitivity difference between the plurality of microphones by a frame unit, and employing the first correction coefficient to correct the frequency domain signals by frame unit; and

employing the frequency domain signals that have been corrected employing the first correction coefficient to compute a second correction coefficient for correcting by frequency unit the sensitivity difference between the plurality of microphones for each of the frames, and employing the second correction coefficient to correct for each of the frames by frequency unit the frequency domain signals that have been corrected using the first correction coefficient.

10. The microphone sensitivity difference correction method of claim 9, wherein the processing further comprises:

computing a phase difference for each frequency between frequency domain signals that correspond to each of the input sound signals; and

based on the phase difference for each of the frequencies, detecting, as a frequency domain signal expressing the stationary noise, the frequency domain signals that correspond to the input sound signal that has arrived from a direction other than a sound source direction of a target voice.

11. The microphone sensitivity difference correction method of claim 10, wherein the processing further comprises:

based on an inter-microphone distance between the plurality of microphones and a sampling frequency, setting, as a phase difference utilization range, a frequency band in which phase rotation of phase difference for each of the frequencies does not occur;

computing a phase difference for each of the frequencies in the phase difference utilization range; and

detecting a frequency domain signal expressing the stationary noise in the phase difference utilization range.

12. The microphone sensitivity difference correction method of claim 11, wherein the processing further comprises:

computing a probability that the input sound signal has arrived from the sound source direction of the target voice based on a phase difference for each frequency of the phase difference utilization range, and, when the probability is higher than a predetermined probability threshold value, computing a degree of accuracy of correction based on respective frequency domain signals that correspond to each of the input sound signals.

13. The microphone sensitivity difference correction method of claim 12, wherein the processing further comprises, based on the degree of accuracy, updating at least one of:

a first update coefficient expressing a degree to reflect the first correction coefficient value computed the previous time when the first correction coefficient is being computed,

a second update coefficient expressing a degree to reflect the second correction coefficient value computed the previous time when the second correction coefficient is being computed, or

a third update coefficient expressing a degree to reflect the degree of accuracy value computed the previous time when the degree of accuracy is being computed.

14. A noise suppression method that causes a computer to execute processing, the processing comprising:

the processing of the microphone sensitivity difference correction method of claim 12; and

when a computed degree of accuracy is greater than a predetermined suppression threshold value, suppressing noise contained in the input sound signal based on an amplitude ratio between the plurality of input sound signals derived using the corrected frequency domain signals.

15. A storage medium storing a microphone sensitivity difference correction program that causes a computer to execute processing, the processing comprising:

detecting a frequency domain signal expressing a stationary noise based on frequency domain signals of input sound signals respectively input from a plurality of microphones contained in a microphone array that have been converted into signals in a frequency domain for each frame;

16. The storage medium storing a microphone sensitivity difference correction program of claim 15, wherein the processing further comprises:

17. The storage medium storing a microphone sensitivity difference correction program of claim 16, wherein the processing further comprises:

18. The storage medium storing a microphone sensitivity difference correction program of claim 17, wherein the processing further comprises:

19. The storage medium storing a microphone sensitivity difference correction program of claim 18, wherein the processing further comprises, based on the degree of accuracy, updating at least one of:

20. A storage medium storing a noise suppression program that causes a computer to execute processing, the processing comprising:

the processing of the microphone sensitivity difference correction program of claim 15; and