JP2012195772A

JP2012195772A - Audio signal processing device, control method thereof, and computer program

Info

Publication number: JP2012195772A
Application number: JP2011058297A
Authority: JP
Inventors: Koichi Washisu; 晃一鷲巣
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2011-03-16
Filing date: 2011-03-16
Publication date: 2012-10-11
Anticipated expiration: 2031-03-16
Also published as: JP5836616B2

Abstract

PROBLEM TO BE SOLVED: To provide a sound processing technique capable of reducing even noise produced at an initial stage of wind noise occurrence.SOLUTION: An audio signal processing device comprises: sound collection means; storage means for storing the audio signals collected by the sound collection means; noise detection means for detecting noise included in the audio signals stored in the storage means; noise reduction means for reducing the noise detected by the noise detection means from the audio signals; and setting means for setting a time constant which controls noise reduction characteristics of the noise reduction means according to a level of the noise detected by the noise detection means. The setting means starts setting of the time constant in the noise reduction means in response to detection of the noise detected by the noise detection means. The noise reduction means reads the audio signal from the storage means and reduces the noise when the setting of the time constant by the setting means according to the noise level is completed.

Description

本発明は、音声信号処理装置及びその制御方法、コンピュータプログラムに関する。 The present invention relates to an audio signal processing device, a control method thereof, and a computer program.

音声信号処理装置として、音声をマイクにより採取して録音する録音装置がある。音声を録音する録音機能は、録音装置のような録音のみを行う装置に採用されているだけではなく、撮像装置のような音声と映像とを同時に取得してムービーの生成が可能な装置にも搭載されている。 As an audio signal processing device, there is a recording device that collects and records audio with a microphone. The recording function for recording audio is not only used for recording devices such as recording devices, but also for devices capable of generating movies by simultaneously acquiring audio and video, such as imaging devices. It is installed.

このような録音機能を搭載する装置には、音声信号に重畳される風切り音（風雑音）を低減する機能が搭載されてきている。当該低減技術として、特許文献１はマイクにより集音する音声に重畳する風切り音をハイパスフィルタで減衰する技術、および、ハイパスフィルタの時定数を雑音の大きさに基づいて変更する技術を開示している。特許文献２は、風を検出するために熱電対を用い、その出力にあわせて風切り音のレベルを推測し、雑音低減の為に使用するハイパスフィルタを選択している。 Devices equipped with such a recording function have been equipped with a function of reducing wind noise (wind noise) superimposed on an audio signal. As the reduction technique, Patent Document 1 discloses a technique for attenuating wind noise superimposed on sound collected by a microphone with a high-pass filter, and a technique for changing the time constant of the high-pass filter based on the magnitude of noise. Yes. In Patent Document 2, a thermocouple is used to detect wind, the level of wind noise is estimated according to the output of the thermocouple, and a high-pass filter used for noise reduction is selected.

特開平５−１７６２１１号公報JP-A-5-176221 特開平５−３２８４８０号公報JP-A-5-328480

しかしながら、上述の提案技術では、風切り音を検出するまでに時間を要し、風切り音発生初期の雑音が精度良く低減できない。 However, the above-described proposed technique requires time until wind noise is detected, and noise at the initial stage of wind noise generation cannot be reduced with high accuracy.

そこで、本発明は、風切り音の発生初期からの雑音も低減可能とする音声処理技術を提供する。 Therefore, the present invention provides an audio processing technique that can reduce noise from the beginning of wind noise generation.

上記目的を達成するために、本発明の音声信号処理装置は、
音声採取手段と、
前記音声採取手段で採取した音声信号を記憶する記憶手段と、
前記記憶手段に記憶された前記音声信号に含まれる雑音を検出する雑音検出手段と、
前記雑音検出手段が検出した雑音を、前記音声信号から低減するための雑音低減手段と、
前記雑音低減手段の雑音低減特性を制御する時定数を、前記雑音検出手段が検出した前記雑音のレベルに応じて設定する設定手段と
を備え、
前記設定手段は、前記雑音検出手段による前記雑音の検出に応じて前記雑音低減手段への前記時定数の設定を開始し、
前記雑音低減手段は、前記設定手段による前記雑音のレベルに応じた前記時定数の設定が終了した場合に、前記記憶手段から前記音声信号を読み出して前記雑音の低減を行うことを特徴とする。 In order to achieve the above object, an audio signal processing device according to the present invention provides:
Voice collection means;
Storage means for storing the voice signal collected by the voice sampling means;
Noise detection means for detecting noise contained in the audio signal stored in the storage means;
Noise reduction means for reducing noise detected by the noise detection means from the voice signal;
A setting means for setting a time constant for controlling a noise reduction characteristic of the noise reduction means according to a level of the noise detected by the noise detection means,
The setting means starts setting the time constant to the noise reduction means in response to detection of the noise by the noise detection means,
The noise reduction unit reads the audio signal from the storage unit and performs the noise reduction when the setting of the time constant according to the noise level by the setting unit is completed.

本発明によれば、風切り音の発生初期より精度良く風切り音低減を行うことができる。 According to the present invention, wind noise can be reduced with high accuracy from the initial generation of wind noise.

発明の実施形態１に対応する音声信号処理装置のブロック図。The block diagram of the audio | voice signal processing apparatus corresponding to Embodiment 1 of invention. 発明の実施形態に対応する雑音低減部の特性を説明する図。The figure explaining the characteristic of the noise reduction part corresponding to embodiment of invention. 発明の実施形態に対応する雑音低減部の他の特性を説明する図。The figure explaining the other characteristic of the noise reduction part corresponding to embodiment of invention. 発明の実施形態に対応する雑音低減部の構成をアナログ回路で模式的に示す図。The figure which shows typically the structure of the noise reduction part corresponding to embodiment of invention with an analog circuit. 発明の実施形態に対応する雑音低減処理のタイミングチャート。The timing chart of the noise reduction process corresponding to embodiment of invention. 発明の実施形態に対応する雑音低減処理の効果を説明する図。The figure explaining the effect of the noise reduction process corresponding to embodiment of invention. 発明の実施形態に対応する雑音検出処理のフローチャート。The flowchart of the noise detection process corresponding to embodiment of invention. 発明の実施形態に対応する雑音低減処理のフローチャート。The flowchart of the noise reduction process corresponding to embodiment of invention. 発明の実施形態２に対応する音声信号処理装置のブロック図。The block diagram of the audio | voice signal processing apparatus corresponding to Embodiment 2 of invention. 発明の実施形態２に対応する雑音低減処理のフローチャート。The flowchart of the noise reduction process corresponding to Embodiment 2 of invention. 発明の実施形態２に対応する音声信号処理装置の別態様を示すブロック図。The block diagram which shows another aspect of the audio | voice signal processing apparatus corresponding to Embodiment 2 of invention.

以下、添付図面を参照して発明の実施形態を説明する。 Embodiments of the invention will be described below with reference to the accompanying drawings.

［実施形態１］
図１は、発明の実施形態１に対応する音声信号処理装置の構成例を示すブロック図であり、ステレオマイクを有する録音装置の例を示している。なお、音声信号処理装置は、当該録音装置の主要構成を含む撮像装置であってもよい。その場合、撮像装置は、音声と映像とを同期して記録可能な装置であればよく、たとえば、動画撮影機能を有するデジタルスチルカメラ、ビデオカメラ、携帯電話、ノートパソコン等が含まれるが、それらに限定されるものではない。 [Embodiment 1]
FIG. 1 is a block diagram showing a configuration example of an audio signal processing apparatus corresponding to Embodiment 1 of the invention, and shows an example of a recording apparatus having a stereo microphone. Note that the audio signal processing device may be an imaging device including a main configuration of the recording device. In that case, the imaging device may be any device that can record audio and video in synchronization, and includes, for example, a digital still camera, a video camera, a mobile phone, a laptop computer, and the like having a video shooting function. It is not limited to.

マイク１１Ｒ、１１Ｌは、音声を検出する音声検出部であって、ステレオマイクとして対になっている。マイク１１Ｒ、１１Ｌからのアナログ音声信号は、利得変更部１２に出力される。利得変更部１２は、利得変更部１２は、被写体音が小さい場合にはマイク１１Ｒ、１１Ｌの出力の増幅率を大きくしてＳ／Ｎを向上させる。一方で、被写体音が大きい時には増幅率を小さくして、増幅回路内での飽和を防止する。なお、音声信号をデジタル処理したい場合には、利得変更部１２がアナログ・デジタル変換器（ＡＤＣ）を含むことができ、マイクからのアナログ音声信号をデジタル音声信号に変換する。また、デジタル信号に変換する際に、利得を制御して所望の利得が得られるようにする。 The microphones 11R and 11L are audio detection units that detect audio and are paired as stereo microphones. Analog audio signals from the microphones 11R and 11L are output to the gain changing unit 12. The gain changing unit 12 increases the amplification factor of the outputs of the microphones 11R and 11L to improve the S / N when the subject sound is small. On the other hand, when the subject sound is loud, the amplification factor is reduced to prevent saturation in the amplification circuit. In the case where it is desired to digitally process the audio signal, the gain changing unit 12 can include an analog / digital converter (ADC), and converts the analog audio signal from the microphone into a digital audio signal. Further, when converting into a digital signal, the gain is controlled so that a desired gain can be obtained.

記憶部１３は利得変更部１２から入力された音声信号を一時的に記憶しておくバッファである。記憶部１３からの出力は雑音低減部１４に入力される。雑音低減部１４は図２ａや図２ｂに示す特性を有する複数のハイパスフィルタ段（１４ａ、１４ｂ）で構成される。フィルタの段数は、本実施形態では２段としてるが、３段以上としてもよい。なお、図１ではマイクを左右で２本設けてあるのに対し、その後の信号処理の出力ラインは１本しかないが、実際には左右のマイク毎に個別に雑音低減信号処理が行われている。 The storage unit 13 is a buffer that temporarily stores the audio signal input from the gain changing unit 12. The output from the storage unit 13 is input to the noise reduction unit 14. The noise reduction unit 14 includes a plurality of high-pass filter stages (14a, 14b) having the characteristics shown in FIGS. 2a and 2b. The number of filter stages is two in this embodiment, but may be three or more. In FIG. 1, although two microphones are provided on the left and right, there is only one output line for subsequent signal processing, but in practice, noise reduction signal processing is performed separately for each of the left and right microphones. Yes.

図２ａは、マイクに入力された音声のマイクからの出力音声の周波数特性を示す。横軸は周波数、縦軸は入力される音声信号の出力時の利得を示している。図２ａにおいて１０Ｈｚより低い周波数の音声信号が減衰している（利得が低い）ことが分かる。ここには人の声も含まれ、被写体音源には１０Ｈｚ以下の低周波は殆どないので、図２ａのような特性を有する雑音低減部による被写体音の劣化は起こらない。図２ｂも、同様にマイクの出力音声の周波数特性であるが、ここでは２００Ｈｚより低い周波数の音声信号が減衰している場合を示している。風切り音の様に風がマイクにあたって発生する雑音は、２００Ｈｚ以下の周波数成分が多い。そのため、図２ｂの特性にすることで音声信号に含まれる風切り音の低減が可能となる。 FIG. 2a shows the frequency characteristics of the output sound from the microphone of the sound input to the microphone. The horizontal axis indicates the frequency, and the vertical axis indicates the gain at the time of output of the input audio signal. In FIG. 2a, it can be seen that the audio signal having a frequency lower than 10 Hz is attenuated (low gain). Here, human voice is also included, and the subject sound source has almost no low frequency of 10 Hz or less, so that the subject sound is not deteriorated by the noise reduction unit having the characteristics as shown in FIG. FIG. 2b also shows the frequency characteristics of the output sound of the microphone, but here shows a case where an audio signal having a frequency lower than 200 Hz is attenuated. The noise generated by the wind, such as wind noise, has many frequency components of 200 Hz or less. Therefore, the wind noise included in the audio signal can be reduced by using the characteristics shown in FIG. 2b.

以下、図２ａの特性から図２ｂの特性に切り替える方法について説明する。まず、雑音低減部が図３のようなアナログ回路で形成されている場合を説明する。 Hereinafter, a method of switching from the characteristic of FIG. 2a to the characteristic of FIG. 2b will be described. First, the case where the noise reduction unit is formed of an analog circuit as shown in FIG. 3 will be described.

図３は公知のアナログ・ハイパス・フィルタであり、コンデンサ２２２と可変抵抗２２３とバッファアンプ２２１で構成されている。制御信号２２４は、可変抵抗２２３の抵抗値を制御するための制御信号であり、例えば雑音低減制御部１９から与えられる。コンデンサ２２２と可変抵抗２２３のインピーダンス積により図２ａの１０Ｈｚ、図２ｂの２００Ｈｚが設定できる。例えば、可変抵抗２２３の抵抗値を大きくして図２ａの特性にし、制御信号２２４により可変抵抗２２３の抵抗値を小さくして図２ｂの特性にすることができる。また、ハイパスフィルタを公知の差分方程式などのデジタルフィルタで形成する場合、入力値及び前回の出力値に掛け合わされる係数を調整することで図２ａ、図２ｂの特性が実現できる。また、ハイパスフィルタの動作中に上記係数を変更することで、図２ａの特性から図２ｂの特性に変更することができる。 FIG. 3 shows a known analog high-pass filter, which includes a capacitor 222, a variable resistor 223, and a buffer amplifier 221. The control signal 224 is a control signal for controlling the resistance value of the variable resistor 223 and is given from, for example, the noise reduction control unit 19. 10 Hz in FIG. 2 a and 200 Hz in FIG. 2 b can be set by the impedance product of the capacitor 222 and the variable resistor 223. For example, the resistance value of the variable resistor 223 can be increased to obtain the characteristics shown in FIG. 2A, and the resistance value of the variable resistor 223 can be reduced by the control signal 224 to obtain the characteristics shown in FIG. When the high-pass filter is formed by a digital filter such as a known difference equation, the characteristics shown in FIGS. 2a and 2b can be realized by adjusting the coefficient multiplied by the input value and the previous output value. Further, by changing the coefficient during the operation of the high-pass filter, the characteristic shown in FIG. 2a can be changed to the characteristic shown in FIG. 2b.

図１に戻って、第１雑音低減部１４ａからの出力は、第２雑音低減部１４ｂを経て、あるいは、そのまま後述する次数変更部１５、利得復元部１６を介して音声記録部１７に記録される。音声記録部１７は記録媒体に音声信号を記録する記録媒体であって、固体メモリ、ＤＶＤ、ハードディスク、テープ等を含むが、特定の種類の記録媒体に限定されるものではない。 Returning to FIG. 1, the output from the first noise reduction unit 14a is recorded in the audio recording unit 17 via the second noise reduction unit 14b or via the order changing unit 15 and the gain restoration unit 16 described later. The The audio recording unit 17 is a recording medium that records an audio signal on a recording medium, and includes a solid memory, a DVD, a hard disk, a tape, and the like, but is not limited to a specific type of recording medium.

雑音検出部１８は、記憶部１３から入力された音声信号より風切り音を検出する。一般的にステレオマイクに入力される被写体音声は、左右のチャンネル間で強い相関関係があるのに対して風切り音は無相関であることが多い。そこで雑音検出部１８はまず、マイク１１Ｒおよび１１Ｌにより得られた音声信号を比較する。もし、両者に相関が無い場合には風切り音が発生していると判定し、且つ、相関の無い信号振幅に基づいて、風切り音のレベルを判定することができる。具体的に振幅が大きければ、風切り音が大きいと判定することができる。なお、相関判定は、一部の帯域に注目して行ってもよい。例えば、雑音検出部１８が４００Ｈｚ以下の周波数帯における相関性の高さを判定することにより、風切り音が大きいか否かを判定しても良い。 The noise detection unit 18 detects wind noise from the audio signal input from the storage unit 13. In general, subject sound input to a stereo microphone has a strong correlation between left and right channels, whereas wind noise is often uncorrelated. Therefore, the noise detector 18 first compares the audio signals obtained by the microphones 11R and 11L. If there is no correlation between the two, it can be determined that a wind noise is generated, and the level of the wind noise can be determined based on the signal amplitude having no correlation. Specifically, if the amplitude is large, it can be determined that the wind noise is large. The correlation determination may be performed by paying attention to a part of the band. For example, the noise detection unit 18 may determine whether or not the wind noise is loud by determining the degree of correlation in a frequency band of 400 Hz or less.

雑音低減制御部１９は、雑音検出部１８で検出された雑音信号のタイミングと大きさとを受けて雑音低減部１４の特性を変更する。当該変更は、雑音低減部１４を構成するハイパスフィルタの雑音低減特性を制御する時定数を変更することで実現する。例えば、図２ａの状態から図２ｂの状態に変更する。 The noise reduction control unit 19 changes the characteristics of the noise reduction unit 14 in response to the timing and magnitude of the noise signal detected by the noise detection unit 18. This change is realized by changing the time constant that controls the noise reduction characteristics of the high-pass filter that constitutes the noise reduction unit 14. For example, the state shown in FIG. 2a is changed to the state shown in FIG. 2b.

本実施形態では、図２ａに示すフィルタ特性を、「時定数の大きなハイパスフィルタ特性」と呼ぶ。その一方で、図２ｂに示すフィルタ特性を、「時定数の小さなハイパスフィルタ特性」と呼ぶ。そして、雑音検出部１８が検出する雑音のレベル（マイク１１Ｒ、１１Ｌの無相関波形の振幅の大きさ）に応じて、時定数を決定できる。例えば、雑音レベルが一定値に満たないような風切り音の場合、図２ｂまで至らない１００Ｈｚ以下を減衰させるハイパスフィルタ特性を用いる。一方、雑音レベルが一定値以上になるような大きな風切り音の場合、図２ｂに示す２００Ｈｚ以下を減衰させるハイパスフィルタ特性を用いる。なお、更に大きな風切り音の場合、４００Ｈｚ以下を減衰させるハイパスフィルタ特性を用いてもよい。このように、風切り音の大きさに応じて、ハイパスフィルタのカットオフ周波数を変更することができる。 In the present embodiment, the filter characteristic shown in FIG. 2A is referred to as “high-pass filter characteristic having a large time constant”. On the other hand, the filter characteristic shown in FIG. 2B is referred to as “a high-pass filter characteristic with a small time constant”. The time constant can be determined according to the level of noise detected by the noise detection unit 18 (the magnitude of the amplitude of the uncorrelated waveform of the microphones 11R and 11L). For example, in the case of a wind noise whose noise level is less than a certain value, a high-pass filter characteristic that attenuates 100 Hz or less that does not reach FIG. 2b is used. On the other hand, in the case of a loud wind noise whose noise level is a certain value or higher, a high-pass filter characteristic that attenuates 200 Hz or less shown in FIG. In the case of a larger wind noise, a high-pass filter characteristic that attenuates 400 Hz or less may be used. In this way, the cut-off frequency of the high-pass filter can be changed according to the wind noise level.

雑音低減制御部１９は、第２雑音低減部１４ｂも制御し、その時定数を変更する。第２雑音低減部１４ｂは第１雑音低減部１４ａと同じ構成のハイパスフィルタとすることができる。本実施形態で第１雑音低減部１４ａおよび第２雑音低減部１４ｂの２つを設けた理由は、以下の通りである。すなわち、第１雑音低減部１４ａは風切り音の大きさに応じてその特性を変更してゆく。風切り音が大きい時はハイパスフィルタを４００Ｈｚ以下を減衰する特性にまで変更する。しかしながらそれ以上大きな風切り音の場合に更に時定数を小さくして、例えば６００Ｈｚ以下を減衰させるハイパスフィルタ特性にすることはない。これは減衰させる周波数を高くしてゆくと、被写体の音声信号まで影響を受けて、劣化してしまうためである。 The noise reduction control unit 19 also controls the second noise reduction unit 14b and changes its time constant. The second noise reduction unit 14b can be a high-pass filter having the same configuration as the first noise reduction unit 14a. The reason why the first noise reduction unit 14a and the second noise reduction unit 14b are provided in the present embodiment is as follows. That is, the first noise reduction unit 14a changes its characteristics according to the magnitude of the wind noise. When the wind noise is loud, the high-pass filter is changed to a characteristic that attenuates 400 Hz or less. However, in the case of wind noise that is larger than that, the time constant is not further reduced, and for example, a high-pass filter characteristic that attenuates 600 Hz or less is not obtained. This is because if the frequency to be attenuated is increased, the sound signal of the subject is affected and deteriorates.

そこで本実施形態では、ハイパスフィルタ時定数の変更ではなく、次数の変更で対応する。即ち、２つのハイパスフィルタを直接に接続することで減衰を開始させる周波数は変化させずに低周波の減衰性能を高める。雑音低減制御部１９は、次数変更部１５に対して切替信号を出力し、次数変更部１５の内部スイッチを切り替えて、雑音低減部１４を再構成する。次数変更部１５は、第１雑音低減部１４ａのみ通過した音声信号と、第１雑音低減部１４ａおよび第２雑音低減部１４ｂを通過した音声信号を次数変更部１５で切り替えて利得復元部１６に出力する。なお、次数変更部１５の配置箇所は、第１雑音低減部１４ａと第２雑音低減部１４ｂとの出力端でなくてもよい。たとえば、第１雑音低減部１４ａの出力端に接続し、第２雑音低減部１４ｂをバイパスするか、第２雑音低減部１４ｂを通過させるかを切り替えるようにしてもよい。また、第２の雑音低減部１４ｂではなく第１雑音低減部１４ａをバイパスするか否かを切り替えるようにしてもよい。この場合、第２の雑音低減部１４ｂは常に使用され、第１雑音低減部１４ａは、検出雑音レベルに応じて使用される。 Therefore, in the present embodiment, the change is made not by changing the high-pass filter time constant but by changing the order. That is, by directly connecting two high-pass filters, the low-frequency attenuation performance is improved without changing the frequency at which attenuation starts. The noise reduction control unit 19 outputs a switching signal to the order changing unit 15 and switches the internal switch of the order changing unit 15 to reconfigure the noise reducing unit 14. The order changing unit 15 switches the audio signal that has passed through only the first noise reducing unit 14a and the audio signal that has passed through the first noise reducing unit 14a and the second noise reducing unit 14b by the order changing unit 15 to the gain restoring unit 16. Output. In addition, the arrangement | positioning location of the order change part 15 does not need to be the output end of the 1st noise reduction part 14a and the 2nd noise reduction part 14b. For example, it may be connected to the output terminal of the first noise reduction unit 14a to switch between bypassing the second noise reduction unit 14b or passing the second noise reduction unit 14b. Moreover, you may make it switch whether the 1st noise reduction part 14a is bypassed instead of the 2nd noise reduction part 14b. In this case, the second noise reduction unit 14b is always used, and the first noise reduction unit 14a is used according to the detected noise level.

雑音低減制御部１９は風切り音のレベルが極めて大きくなり、雑音低減部１４ａだけでは対応できなくなった場合に、次数変更部１５を切り替えて第１雑音低減部１４ａ及び第２雑音低減部１４ｂを通過した音声信号を利得復元部１６に入力させる。利得復元部１６は利得変更部１２で変更された利得のばらつきを元に戻す役割を担っている。例えば、通常は利得変更部１２で２０倍のゲインを与えている時には利得復元部１６のゲインは１、利得変更部１２の利得が変更され、２倍のゲインを与えている時は利得復元部１６のゲインは１０となる。これにより、利得復元部１６から出力される音声信号は共通に２０倍のゲインが与えられていることになる。 The noise reduction control unit 19 switches the order change unit 15 and passes through the first noise reduction unit 14a and the second noise reduction unit 14b when the wind noise level becomes extremely high and cannot be dealt with by the noise reduction unit 14a alone. The obtained audio signal is input to the gain restoring unit 16. The gain restoring unit 16 plays a role of returning the gain variation changed by the gain changing unit 12. For example, normally, when the gain changing unit 12 gives a gain of 20 times, the gain of the gain restoring unit 16 is 1, and when the gain changing unit 12 is changed and the gain is given twice, the gain restoring unit The gain of 16 is 10. As a result, the audio signal output from the gain restoring unit 16 is commonly given a gain of 20 times.

利得変更部１２が音声信号を２０倍から２倍までの範囲でゲイン変更するのは、風切り音レベルが高くなって音声信号が飽和するのを防ぐためである。例えば、マイク１１に入力された音声信号に混入した風切り音のレベルが大きい場合には、利得変更部１２でのゲインを２倍にし、小さい場合には２０倍にする。そのため、雑音検出部１８において検出された雑音のレベルの検出結果を利得変更部１２にフィードバックして、ゲインを調整している。よって、利得復元部１６で１０倍のゲインを与えると再び飽和が生ずることになるが、実際には第１雑音低減部１４ａ及び第２雑音低減部１４ｂで風切り音レベルが低減されているので飽和が生ずることは無い。 The reason why the gain changing unit 12 changes the gain of the audio signal in the range of 20 to 2 times is to prevent the wind noise level from becoming high and saturating the audio signal. For example, when the level of wind noise mixed in the audio signal input to the microphone 11 is large, the gain in the gain changing unit 12 is doubled, and when the level is small, the gain is 20 times. Therefore, the detection result of the noise level detected by the noise detection unit 18 is fed back to the gain changing unit 12 to adjust the gain. Therefore, when a gain of 10 times is given by the gain restoration unit 16, saturation occurs again. However, since the wind noise level is actually reduced by the first noise reduction unit 14a and the second noise reduction unit 14b, saturation is achieved. Will not occur.

全体制御部２０は、音声信号処理装置の全体的な動作を制御する。本実施形態では、雑音低減処理を実行するための各機能ブロックの主な動作を説明するが、当該動作は全体制御部２０による制御の元に実行される。 The overall control unit 20 controls the overall operation of the audio signal processing device. In the present embodiment, the main operation of each functional block for executing the noise reduction process will be described. The operation is executed under the control of the overall control unit 20.

図４は図１のブロックのタイミングチャートであり、横軸は時間、縦軸は各要素の動作を表している。まず、雑音検出部１８は、記憶部１３に一時的に保存されている音声信号を読み出して雑音検出処理を行う。波形４１は雑音検出部１８の検出結果であり、対のマイク１１Ｒ、１１Ｌの相関状態およびその時の音圧レベルにより風切り音（風圧レベル）を検出している。波形４１では時刻ｔ１で風切り音が発生している。 FIG. 4 is a timing chart of the block of FIG. 1, where the horizontal axis represents time and the vertical axis represents the operation of each element. First, the noise detection unit 18 reads out an audio signal temporarily stored in the storage unit 13 and performs noise detection processing. A waveform 41 is a detection result of the noise detection unit 18 and detects wind noise (wind pressure level) based on the correlation state of the pair of microphones 11R and 11L and the sound pressure level at that time. In the waveform 41, wind noise is generated at time t1.

雑音検出部１８は、波形４１の振幅値に基づいて風切り音の発生及びそのレベルを判定する。レベル判定は雑音検出部１８に設定されたレベル判定用の閾値を用いて行ってもよい。雑音検出部１８は、風切り音の発生を検知すると、風切り音の発生と検出したレベルを後述するように、記憶部１３に通知し、音声処理指示として音声信号と同期して記憶させる。雑音低減制御部１９は、音声信号に記録された音声処理指示に含まれる雑音レベルに基づいて時定数及び次数を決定して第１雑音低減部１４ａ及び第２雑音低減部１４ｂと、次数変更部１５とを制御する。 The noise detector 18 determines the generation of wind noise and the level thereof based on the amplitude value of the waveform 41. The level determination may be performed using a threshold for level determination set in the noise detection unit 18. When detecting the occurrence of wind noise, the noise detection unit 18 notifies the storage unit 13 of the occurrence of wind noise and the detected level, and stores it as a voice processing instruction in synchronization with the audio signal. The noise reduction control unit 19 determines the time constant and the order based on the noise level included in the audio processing instruction recorded in the audio signal, and the first noise reduction unit 14a and the second noise reduction unit 14b, and the order change unit 15 is controlled.

図４では、時刻ｔ１で第１雑音低減部１４ａの時定数１の設定が開始される。よって、時刻ｔ１で風切り音が検出された後、実際に雑音低減処理が開始されるのは、波形４４で示すように時刻ｔ２である。この波形４４は、検出された雑音が処理されるタイミングを示している。時刻ｔ１からｔ２の間に、波形４２で示すように第１雑音低減部１４ａの時定数１が小さくなる方向に変化する。風切り音のレベルによっては、例えば、図２ａから図２ｂの時定数に対応するように変化させることができる。時刻ｔ２で時定数１の設定が終了すると、第１雑音低減部１４ａは記憶部１３から音声信号を読み出す。このように、本実施形態では音声信号を記憶部１３に一時的に保存しておくので音声処理の系のために音声信号を遅延させることができ、処理を行うタイミングで音声信号を読み出すことができる。 In FIG. 4, the setting of the time constant 1 of the first noise reduction unit 14a is started at time t1. Therefore, after the wind noise is detected at time t1, the noise reduction processing is actually started at time t2, as shown by the waveform 44. This waveform 44 shows the timing at which the detected noise is processed. Between time t1 and t2, as shown by the waveform 42, the time constant 1 of the first noise reduction unit 14a changes in the direction of decreasing. Depending on the level of wind noise, for example, it can be changed to correspond to the time constant of FIGS. 2a to 2b. When the setting of the time constant 1 is completed at time t2, the first noise reduction unit 14a reads an audio signal from the storage unit 13. As described above, in the present embodiment, since the audio signal is temporarily stored in the storage unit 13, the audio signal can be delayed for the audio processing system, and the audio signal can be read out at the timing of processing. it can.

図４に示す例では、雑音低減制御部１９は時刻ｔ１における波形４１の検出レベルでは図２ｂの特性までハイパスフィルタ特性を変更する必要は無いと判定する。その場合、雑音低減制御部１９は、例えば１００Ｈｚ以下を減衰させるフィルタ特性にまで時定数１を小さくした時点で時定数１の変更を終了する。レベルの判定手法としては、風切り音のレベルを閾値により複数段階に分け、段階毎に時定数を１つずつ関連づけておき、検出された風切り音のレベルが属する段階に応じて時定数を決定することができる。あるいは、風切り音の振幅値を入力とする線形関数に基づいて、検出された個々の風切り音のレベルに応じた時定数を決定してもよい。 In the example shown in FIG. 4, the noise reduction control unit 19 determines that there is no need to change the high-pass filter characteristics up to the characteristics shown in FIG. 2b at the detection level of the waveform 41 at time t1. In that case, the noise reduction control unit 19 ends the change of the time constant 1 when the time constant 1 is reduced to a filter characteristic that attenuates, for example, 100 Hz or less. As a method for determining the level, the wind noise level is divided into a plurality of stages according to a threshold value, one time constant is associated with each stage, and the time constant is determined according to the stage to which the detected wind noise level belongs. be able to. Alternatively, a time constant corresponding to the level of each detected wind noise may be determined based on a linear function that receives the amplitude value of the wind noise.

時定数の変更は時刻ｔ２にちょうど終了する様に設定してあり、時刻ｔ２で音声信号から雑音を低減する処理を開始するときにはハイパスフィルタは既に所望の特性で待機状態にある。また、時定数の変更を時刻ｔ１から時刻ｔ２の間で一定時間をかけて徐々に変更している理由は以下の通りである。 The change of the time constant is set to end just at time t2, and when the process of reducing noise from the audio signal is started at time t2, the high-pass filter is already in a standby state with desired characteristics. The reason for changing the time constant gradually over time from time t1 to time t2 is as follows.

まず、ハイパスフィルタの時定数を切り替える際に注意しなくてはならないのは出力位相の変化が生じることである。例えば図２ｂに示した２００Ｈｚ以下を減衰させる１次のハイパスフィルタでは２００Ｈｚの信号位相は元の信号と比べて４５度変化している。それに対して図２ａに示すような１０Ｈｚ以下を減衰させる１次のハイパスフィルタでは２００Ｈｚの信号位相の変化は少ない。そのため図２ａから図２ｂへ特性を瞬時に切り替えると２００Ｈｚ信号の連続性が失われ、そのタイミングで不要な雑音が発生する。それを避けるために所定時間（例えば１００ｍｓ）を費やして時定数を「大」から「小」に徐々に変更している。この所定時間は、時定数を変化させる場合には固定値としてもよい。その場合には、例えば、最大の変化幅（例えば、１０Ｈｚから２００Ｈｚ）で時定数を連続的に変化させた場合に、信号の連続性が損なわれない時間とすることができる。 First, care must be taken when switching the time constant of the high-pass filter because a change in the output phase occurs. For example, in the first-order high-pass filter that attenuates 200 Hz or less shown in FIG. 2B, the signal phase of 200 Hz changes by 45 degrees compared to the original signal. In contrast, a first-order high-pass filter that attenuates 10 Hz or less as shown in FIG. Therefore, when the characteristics are instantaneously switched from FIG. 2a to FIG. 2b, the continuity of the 200 Hz signal is lost, and unnecessary noise is generated at that timing. In order to avoid this, a predetermined time (for example, 100 ms) is spent, and the time constant is gradually changed from “large” to “small”. The predetermined time may be a fixed value when the time constant is changed. In that case, for example, when the time constant is continuously changed with the maximum change width (for example, 10 Hz to 200 Hz), it is possible to set the time so that the continuity of the signal is not impaired.

次に時刻ｔ３では、波形４１に示すように雑音検出部１８が更に大きな風切り音の発生を検出している。雑音低減制御部１９は、時刻ｔ２で設定したハイパスフィルタの特性では風切り音低減に不十分であるとして、第１雑音低減部１４ａのハイパスフィルタの時定数１を更に小さく設定する。ここでは、ハイパスフィルタの特性を図２ｂに示した２００Ｈｚ以下を減衰させる特性にまで時刻ｔ３からｔ４までの間で変更する。ｔ４−ｔ３の時間も上記の所定時間に相当する。 Next, at time t3, as indicated by the waveform 41, the noise detector 18 detects the generation of a larger wind noise. The noise reduction control unit 19 sets the time constant 1 of the high-pass filter of the first noise reduction unit 14a to be even smaller, assuming that the characteristics of the high-pass filter set at time t2 are insufficient for wind noise reduction. Here, the characteristic of the high-pass filter is changed from time t3 to time t4 to the characteristic of attenuating 200 Hz or less shown in FIG. 2b. The time t4-t3 also corresponds to the predetermined time.

次に、時刻ｔ５では波形４１に示すように雑音検出部１８が更に大きな風切り音の発生を検出している。この場合も、雑音低減制御部１９は雑音低減部１４ａであるハイパスフィルタの時定数１を更に小さくして音声の低周波領域の減衰を大きくし、雑音の低減を図ることも考えられる。例えば４００Ｈｚ以下を減衰させる特性にまで変更することである。しかしながら低減を開始する周波数を高周波側に変更（例えば２００Ｈｚを４００Ｈｚに変更）してゆくと被写体の音声までも減衰してしまい、違和感のある音声記録となってしまう。 Next, at time t5, as indicated by the waveform 41, the noise detector 18 detects the generation of a larger wind noise. In this case, the noise reduction control unit 19 may further reduce the noise by further reducing the time constant 1 of the high-pass filter that is the noise reduction unit 14a to increase the attenuation in the low frequency region of the voice. For example, the characteristic is changed to a characteristic that attenuates 400 Hz or less. However, if the frequency at which the reduction is started is changed to the high frequency side (for example, 200 Hz is changed to 400 Hz), even the sound of the subject is attenuated, resulting in an uncomfortable sound recording.

そこで本実施形態では、低減すべき雑音レベルが所定レベルより大きくなった場合は、第１雑音低減部１４ａであるハイパスフィルタ時定数１を小さくするのではなく、更にハイパスフィルタを直列に追加することで対応する。即ち、第１雑音低減部１４ａであるハイパスフィルタの出力を第２雑音低減部１４ｂであるハイパスフィルタに入力する。これによりハイパスフィルタの時定数では無く、次数を増やすことで雑音レベルの低減を図る。 Therefore, in the present embodiment, when the noise level to be reduced becomes higher than a predetermined level, the high-pass filter time constant 1 that is the first noise reduction unit 14a is not reduced, but a high-pass filter is further added in series. Correspond with. That is, the output of the high-pass filter that is the first noise reduction unit 14a is input to the high-pass filter that is the second noise reduction unit 14b. This reduces the noise level by increasing the order, not the time constant of the high-pass filter.

この場合、低減を開始する周波数を高周波側に変更（例えば２００Ｈｚを４００Ｈｚに変更）しないので被写体音への影響は少なくなる。雑音レベルが小さい場合に対しても上記次数を増やす対策を行うことも考えられる。しかし、次数を増やすことも被写体音への影響は少なからず生じる為に雑音レベルが小さい場合には、即ち雑音低減部１４ａであるハイパスフィルタのみを使用することで高品位の音声録音が行える。 In this case, since the frequency at which the reduction is started is not changed to the high frequency side (for example, 200 Hz is changed to 400 Hz), the influence on the subject sound is reduced. It is also conceivable to take measures to increase the order even when the noise level is low. However, increasing the order also has a considerable influence on the subject sound, so that when the noise level is small, that is, only the high-pass filter that is the noise reduction unit 14a is used, so that high-quality voice recording can be performed.

雑音低減部１４の次数の切り替えは図１に示した様に次数変更部１５のスイッチ切り替えで行う。即ち、次数変更部１５には第１雑音低減部１４ａであるハイパスフィルタの出力、および、第１雑音低減部１４ａであるハイパスフィルタと第２雑音低減部１４ｂであるハイパスフィルタとの直列接続出力信号の両者が入力されている。次数変更部１５は、波形４５に示すような雑音低減制御部１９からの次数切替信号に応じて、パスを切替える。図４の例では、時刻ｔ５に、第１雑音低減部１４ａ単体のパスから、第１雑音低減部１４ａ及び第２雑音低減部１４ｂのパスに切り替える。 The order of the noise reduction unit 14 is switched by switching the order changing unit 15 as shown in FIG. That is, the order changing unit 15 outputs the output of the high-pass filter that is the first noise reduction unit 14a, and the serial connection output signal of the high-pass filter that is the first noise reduction unit 14a and the high-pass filter that is the second noise reduction unit 14b. Both are entered. The order changing unit 15 switches paths in accordance with the order switching signal from the noise reduction control unit 19 as shown by the waveform 45. In the example of FIG. 4, at the time t5, the path of the first noise reduction unit 14a alone is switched to the path of the first noise reduction unit 14a and the second noise reduction unit 14b.

次数の切り替えに応じて第２雑音低減部１４ｂの時定数２が変更される。時定数２の変更は、第１雑音低減部１４ａの場合と同様に時間をかけて行う。図４では、時刻ｔ５からｔ６までの時間をかけて第２雑音低減部１４ｂの特性が、１０Ｈｚ以下を減衰させる特性から２００Ｈｚ以下を減衰させる特性に変更される。この結果、次数が変更され、第２雑音低減部１４ｂの時定数２の変更も終了した時刻ｔ６では、風切り音低減に十分な特性となっている。次に、波形４１に示すように時刻ｔ７では、風切り音のレベルが下がるので、波形４３に示すように第２雑音低減部１４ｂの時定数２を時刻ｔ８からｔ９にかけて大きくして初期の特性（例えば１０Ｈｚ以下を減衰させる特性）に戻す。また、第１雑音低減部１４ａの時定数１を、時刻ｔ７で検出された雑音レベルに応じて時刻ｔ８からｔ９にかけて大きくして、例えば５０Ｈｚ以下を減衰させる特性に変更する。時定数を大きくする場合、風切り音のレベルが下がる前の遅延された音声信号の処理が終了するまで時定数の変更タイミングを遅延させる。図４の例では、時刻ｔ７で風切り音のレベルの低下が検出されても、実際に時定数の変更が開始されるのは時刻ｔ８である。 The time constant 2 of the second noise reduction unit 14b is changed according to the switching of the order. The time constant 2 is changed over time as in the case of the first noise reduction unit 14a. In FIG. 4, the characteristic of the second noise reduction unit 14b is changed from a characteristic that attenuates 10 Hz or less to a characteristic that attenuates 200 Hz or less over time from time t5 to t6. As a result, at time t6 when the order is changed and the change of the time constant 2 of the second noise reduction unit 14b is completed, the characteristics are sufficient for reducing wind noise. Next, as shown in the waveform 41, the wind noise level decreases at time t7. Therefore, as shown in the waveform 43, the time constant 2 of the second noise reduction unit 14b is increased from time t8 to time t9 to obtain initial characteristics ( For example, the characteristic is attenuated to 10 Hz or less. Further, the time constant 1 of the first noise reduction unit 14a is increased from time t8 to t9 according to the noise level detected at time t7, and is changed to a characteristic that attenuates, for example, 50 Hz or less. When the time constant is increased, the time constant change timing is delayed until the processing of the delayed audio signal before the wind noise level is lowered. In the example of FIG. 4, even if a decrease in the level of wind noise is detected at time t7, the time constant change is actually started at time t8.

また、時刻ｔ９で時定数の変更が終了した後、雑音低減制御部１９は波形４５に示すように、次数変更部１５を制御して、第１雑音低減部１４ａのみのパスに切り替える。波形４１に示すように時刻ｔ１０では風切り音レベルが更に下がるので波形４２に示すように第１雑音低減部１４ａの時定数を時刻ｔ１１からｔ１２にかけて大きくして初期の特性（例えば１０Ｈｚ以下を減衰させる特性）に戻す。 In addition, after the change of the time constant is completed at time t9, the noise reduction control unit 19 controls the order change unit 15 to switch to the path of only the first noise reduction unit 14a as shown by the waveform 45. As shown in the waveform 41, the wind noise level further decreases at the time t10, so the time constant of the first noise reduction unit 14a is increased from the time t11 to the time t12 as shown in the waveform 42 to attenuate initial characteristics (for example, 10 Hz or less). Return to characteristics.

このようにして、記憶部１３に一時的に音声信号を保持し、必要な処理を順に行っていく音により、風切り音の発生初期から適切な時定数に基づく雑音低減を行うことが可能となる。なお、図４の波形４６は利得復元部１６の補正利得の様子をあらわしている。前述した様にマイク１１Ｒ、１１Ｌであるマイクの増幅レベルは被写体の大きさにより自動設定される（ＡＬＣ：オート・レベル・コントロール）。これは小さな被写体音もクリアに録音すること及び大きな被写体音も増幅による飽和無く録音するためである。そのため風切り音が生じるとそのレベルに応じてマイクの増幅レベルが小さくなる。前述したように風切り音は雑音低減部１４により低減されるので、マイクの増幅レベルが小さいと風切り音処理後の被写体音声は小さくなり不自然となる。 In this way, it is possible to perform noise reduction based on an appropriate time constant from the beginning of the generation of wind noise by using the sound that temporarily holds the audio signal in the storage unit 13 and sequentially performs the necessary processing. . Note that a waveform 46 in FIG. 4 represents the state of the correction gain of the gain restoring unit 16. As described above, the amplification levels of the microphones 11R and 11L are automatically set according to the size of the subject (ALC: auto level control). This is because a small subject sound is recorded clearly and a large subject sound is recorded without saturation due to amplification. Therefore, when wind noise is generated, the amplification level of the microphone is reduced according to the level. As described above, since the wind noise is reduced by the noise reduction unit 14, if the amplification level of the microphone is small, the subject sound after the wind noise processing becomes small and unnatural.

そこで風切り音処理後に変更された増幅レベルの補完（利得復元）を行う。図１においては次数変更部１５の出力が利得復元部１６に入力され、その利得を復元して音声記録部１７に出力する。尚、利得復元部１６にはマイクの増幅レベルを変更する利得変更部１２の利得変更信号が入力され、どのタイミングでどれくらい利得が変更されたかを伝えている。そのため利得復元部１６はその信号にあわせて利得の復元を行う。 Therefore, complementation (gain restoration) of the amplification level changed after the wind noise processing is performed. In FIG. 1, the output of the order changing unit 15 is input to the gain restoring unit 16, and the gain is restored and output to the audio recording unit 17. The gain restoring unit 16 receives a gain change signal from the gain changing unit 12 that changes the amplification level of the microphone, and reports how much the gain has been changed at what timing. Therefore, the gain restoring unit 16 restores the gain according to the signal.

図４の波形４６は利得復元レベルを表しており、ちょうど風切り音の発生レベル変化と各時刻で同期している。例えば時刻ｔ１で風切り音が発生するとマイクの増幅レベルが小さくなるので、雑音処理後の音声に対してはその分増幅の補完を行うべく波形４６に示す様に時刻ｔ２より徐々に増幅レベルを上げている。ここで利得を徐々に変化させているのは利得を急激に変化させることによる違和感を防ぐ為である。その後の時刻においても風切り音レベルに合わせて変化するマイクの増幅レベルを補うように利得の復元を行う。 A waveform 46 in FIG. 4 represents the gain restoration level, which is exactly synchronized with the change in the level of wind noise generation at each time. For example, if wind noise is generated at time t1, the amplification level of the microphone is reduced. Therefore, for the sound after noise processing, the amplification level is gradually increased from time t2 as shown in the waveform 46 in order to complement the amplification accordingly. ing. The reason why the gain is gradually changed is to prevent a sense of incongruity caused by abruptly changing the gain. At a later time, the gain is restored so as to compensate for the microphone amplification level that changes in accordance with the wind noise level.

次に図５（ａ）から（ｆ）を参照して、本発明の効果について対比説明する。本実施形態では、雑音低減部１４は雑音発生に先立って、特性を調整して待機することが可能であり、その結果として音声信号における雑音発生直後から雑音低減効果がある。図５（ａ）は被写体音に風切り音が重畳している波形であり、横軸は時間、縦軸は音圧レベルを示している。ここで５１ａ、５３ａは風切り音が無く被写体音のみの場合にマイク１１Ｒ或いは１１Ｌが検出する波形であり、男性の声のスペクトルにより擬似的に表している。５２ａは波形５１ａに風切り音が重畳している波形であり、風切り音のスペクトルより擬似的に表した波形に波形５１ａを混合している。 Next, the effects of the present invention will be described with reference to FIGS. In the present embodiment, the noise reduction unit 14 can wait by adjusting the characteristics prior to noise generation, and as a result, has a noise reduction effect immediately after noise generation in the audio signal. FIG. 5A shows a waveform in which wind noise is superimposed on the subject sound. The horizontal axis indicates time, and the vertical axis indicates the sound pressure level. Here, 51a and 53a are waveforms detected by the microphone 11R or 11L when there is no wind noise and only the subject sound, and are simulated by the spectrum of the male voice. Reference numeral 52a denotes a waveform in which wind noise is superimposed on the waveform 51a, and the waveform 51a is mixed with a waveform that is expressed in a pseudo manner from the spectrum of wind noise.

図５（ｂ）は図５（ａ）の波形を図２ｂで示す特性の雑音低減部１４で処理した結果であり、風切り音である低周波成分は低減している（波形５２ｂに低周波成分がない）。しかしながら波形５１ｂ、５３ｂを波形５１ａ、５３ａと比較すると若干低周波成分が劣化している。即ち、被写体音も雑音低減部１４による影響を受けている。そこで風切り音の発生している期間だけ雑音低減部１４を動作させることを考える。図５（ｃ）はその結果であり、波形５１ｃ、５３ｃでは雑音低減処理を行わず、波形５２ｃの区間のみ雑音低減部１４を動作させている。そのため波形５１ａ、５３ａと波形５１ｃ、５３ｃは差がない。波形５２ｃにおける風切り音の低減も十分行えているように見える。 FIG. 5B shows the result of processing the waveform of FIG. 5A by the noise reduction unit 14 having the characteristics shown in FIG. 2B, and the low-frequency component that is a wind noise is reduced (the low-frequency component is shown in the waveform 52b). There is no). However, when the waveforms 51b and 53b are compared with the waveforms 51a and 53a, the low frequency components are slightly deteriorated. That is, the subject sound is also affected by the noise reduction unit 14. Therefore, consider that the noise reduction unit 14 is operated only during a period in which wind noise is generated. FIG. 5C shows the result. The noise reduction processing is not performed on the waveforms 51c and 53c, and the noise reduction unit 14 is operated only in the section of the waveform 52c. Therefore, there is no difference between the waveforms 51a and 53a and the waveforms 51c and 53c. It seems that the wind noise in the waveform 52c is sufficiently reduced.

図５（ｄ）は図５(ｂ）と図５（ｃ）との差を示している。風切り音の発生しない波形５１ｄ、５３ｄの区間においては雑音低減部１４で劣化した波形５１ｂ、５３ｂと雑音低減部１４を作動させていない波形５１ｃ、５３ｃの差になるので雑音低減部１４による被写体音の劣化状態が示される。 FIG. 5 (d) shows the difference between FIG. 5 (b) and FIG. 5 (c). In the sections of the waveforms 51d and 53d in which no wind noise is generated, the difference between the waveforms 51b and 53b deteriorated by the noise reduction unit 14 and the waveforms 51c and 53c in which the noise reduction unit 14 is not operated is present. The degradation state of is shown.

風切り音が発生している波形５２ｄの区間においては共に雑音低減部１４を作動させている為に同じ波形になり波形５２ｃ、５２ｂの差はゼロになる筈である。ところが図５（ｄ）に示すように、波形５２ｄの初期においては丸で囲んだ範囲５４にて風切り音未処理の部分が残っている。これは雑音低減部１４を作動させても直ぐにはその効果が得られない為である。 In the section of the waveform 52d where the wind noise is generated, since the noise reduction unit 14 is operated, the waveform is the same and the difference between the waveforms 52c and 52b should be zero. However, as shown in FIG. 5 (d), at the initial stage of the waveform 52d, the unprocessed portion of the wind noise remains in a circled area 54. This is because the effect cannot be obtained immediately even if the noise reduction unit 14 is operated.

そこで、例えば図５（ｂ）の様に常に雑音低減部１４を作動させた波形と図５（ａ）のように雑音低減部１４未処理の波形の両者を常に用意しておく。そして、風切り音が発生したときにのみ図５（ｂ）の波形を図５（ａ）の波形に切り替える方法も考えられる。この場合、既に風切り音発生前に雑音低減部１４が作動しているので範囲５４の様な未処理の波形は無くなる。しかしながら、切替を行うとその接合部が不連続になる問題がある。これは雑音低減部１４により変形された信号（減衰や位相のズレあり）と、雑音低減部１４により変形されていない元の信号とを接続するためである。この不連続部分が音として聞こえる場合、品位のない録音となってしまう。 Therefore, for example, both a waveform in which the noise reduction unit 14 is always operated as shown in FIG. 5B and a waveform that has not been processed in the noise reduction unit 14 as shown in FIG. 5A are always prepared. A method of switching the waveform of FIG. 5B to the waveform of FIG. 5A only when a wind noise is generated is also conceivable. In this case, since the noise reduction unit 14 has already been operated before the generation of wind noise, the unprocessed waveform as in the range 54 is eliminated. However, when switching is performed, there is a problem that the joint becomes discontinuous. This is to connect the signal deformed by the noise reduction unit 14 (with attenuation or phase shift) and the original signal not transformed by the noise reduction unit 14. If this discontinuous part is heard as sound, the recording will be of poor quality.

図５（ｅ）は、本実施形態に対応する処理を示している。波形５１ｅ、５３ｅでは雑音低減部１４を作動させていないので図５（ｃ）の手法と同様に被写体音の劣化は生じない。風切り音の発生している波形５２ｅに関しても雑音低減部１４の作動により風切り音である低周波成分は減衰している。 FIG. 5E shows processing corresponding to this embodiment. In the waveforms 51e and 53e, since the noise reduction unit 14 is not operated, the subject sound is not deteriorated as in the method of FIG. As for the waveform 52e in which the wind noise is generated, the low frequency component which is the wind noise is attenuated by the operation of the noise reduction unit 14.

図４のタイミングチャートで説明したように、雑音低減部１４は風切り音発生に先立って周波数減衰特性を変化させていき（時定数を小さくしてゆき）、風切り音発生時点ではその風切り音レベルを低減するのに適したフィルタ特性になっている。即ち、波形５１ｅの後半期間５５で、雑音低減部１４が徐々に作動して、雑音低減効果が出てくる。このように雑音低減部１４の時定数を時間をかけて（期間５５）小さくしてゆく構成にしているので雑音低減部１４を突然作動させる図５（ｃ）の場合に比べて、雑音低減部１４の作動前後の波形の連続性が保たれる。 As described with reference to the timing chart of FIG. 4, the noise reduction unit 14 changes the frequency attenuation characteristic (decreases the time constant) prior to the generation of the wind noise, and at the time of the wind noise generation, the wind noise level is changed. The filter characteristics are suitable for reduction. That is, in the second half period 55 of the waveform 51e, the noise reduction unit 14 is gradually activated, and a noise reduction effect is produced. Since the time constant of the noise reduction unit 14 is thus reduced over time (period 55), the noise reduction unit is compared with the case of FIG. 5C in which the noise reduction unit 14 is suddenly operated. The continuity of the waveform before and after 14 operations is maintained.

図５（ｆ）は図５（ｅ）と図５（ｂ）の差を示している。風切り音の発生しない波形５１ｆ、５３ｆの区間においては雑音低減部で劣化した波形５１ｂ、５３ｂと雑音低減部を作動させていない波形５１e、５３eの差になるので雑音低減部による被写体音の劣化状態が示される。風切り音が発生している波形５２ｆの区間においては共に雑音低減部１４を作動させている為に同じ波形になり波形５２e、５２ｂの差はゼロになる筈である。図５（ｆ)に示すように本実施形態では丸で囲んだ範囲５６の風切り音発生初期においても風切り音の低減が行えており高品位の録音が可能になっている。 FIG. 5 (f) shows the difference between FIG. 5 (e) and FIG. 5 (b). In the sections of the waveforms 51f and 53f in which no wind noise is generated, the difference between the waveforms 51b and 53b deteriorated in the noise reduction unit and the waveforms 51e and 53e in which the noise reduction unit is not operated is present. Is shown. In the section of the waveform 52f where the wind noise is generated, since the noise reduction unit 14 is operated, the waveform is the same and the difference between the waveforms 52e and 52b should be zero. As shown in FIG. 5 (f), in this embodiment, wind noise can be reduced even in the early stage of wind noise generation in a circled range 56, and high-quality recording is possible.

次に図６を参照して、本実施形態における雑音検出部１８における処理を説明する。当該処理は音声の録音開始と共にスタートする。 Next, with reference to FIG. 6, the process in the noise detection part 18 in this embodiment is demonstrated. This process starts when voice recording starts.

マイク１１Ｒ、１１Ｌにて図１の記憶部１３に音声が記憶されるのと同期してスタートする。Ｓ６０１では、雑音検出部１８がマイク１１Ｒ、１１Ｌの比較を行うことで風切り音の検出を行う。もし、風切り音が検出されると（Ｓ６０１で「ＹＥＳ」）Ｓ６０２に進む。風切り音が検出されない間は（Ｓ６０１で「ＮＯ」）、風切り音の発生を監視する。続くＳ６０２では、雑音検出部１８が、雑音レベルをマイク１１Ｒ、１１Ｌの相関性のない波形の振幅（音圧）や、或いはローパスフィルタにより被写体音を減衰させた後の波形の振幅等を利用して測定する。 The operation starts in synchronization with the sound being stored in the storage unit 13 of FIG. 1 by the microphones 11R and 11L. In S601, the noise detector 18 detects wind noise by comparing the microphones 11R and 11L. If wind noise is detected (“YES” in S601), the process proceeds to S602. While the wind noise is not detected (“NO” in S601), the generation of the wind noise is monitored. In subsequent S602, the noise detector 18 uses the amplitude (sound pressure) of the waveform with no correlation between the microphones 11R and 11L, the amplitude of the waveform after the subject sound is attenuated by the low-pass filter, or the like as the noise level. To measure.

続くＳ６０３では、風切り音を検出したタイミングとそのレベルを、雑音検出部１８が記憶部１３に通知する。記憶部１３は、風切り音処理指示として、記憶部１３に記憶される音声信号と同期して記録する。この時、雑音検出部１８における検出遅れを見込んだタイミングで風切り音の処理を指示する「音声処理指示」を記録する。その後Ｓ６０１に戻る。 In subsequent S603, the noise detection unit 18 notifies the storage unit 13 of the timing and level at which the wind noise has been detected. The storage unit 13 records the wind noise processing instruction in synchronization with the audio signal stored in the storage unit 13. At this time, a “voice processing instruction” for instructing the processing of wind noise is recorded at the timing when the detection delay in the noise detection unit 18 is expected. Thereafter, the process returns to S601.

次に図７を参照して、本実施形態における雑音低減処理および利得復元、音声記録の処理を説明する。当該処理は、音声の録音開始より所定の時間遅れてスタートする。この時間遅れは、雑音検出部１８の検出遅れや雑音低減部１４の起動遅れを見込んだものである。まず、雑音低減制御部１９は、図６の処理で記録された音声処理指示の記録を確実に読み出せるだけの遅れを持って、記憶部１３から音声信号を読み出して処理を行う。第１雑音低減部１４ａはそれより更に、時定数を変更するために必要な所定時間の遅れをもって、記憶部１３から音声信号を読み出して処理を行う。あるいは、全体制御部２０が、各機能ブロックにおける、音声信号の読み出しタイミングを制御してもよい。 Next, noise reduction processing, gain restoration, and voice recording processing in the present embodiment will be described with reference to FIG. This process starts after a predetermined time from the start of voice recording. This time delay allows for the detection delay of the noise detection unit 18 and the startup delay of the noise reduction unit 14. First, the noise reduction control unit 19 reads and processes the audio signal from the storage unit 13 with a delay that can reliably read the recording of the audio processing instruction recorded in the process of FIG. The first noise reduction unit 14a further reads and processes the audio signal from the storage unit 13 with a predetermined time delay necessary for changing the time constant. Alternatively, the overall control unit 20 may control the readout timing of the audio signal in each functional block.

まず、Ｓ７０１では雑音低減制御部１９は、記憶部１３に記録されている音声信号を走査しつつ、図６のＳ６０３で記録された音声処理指示を探す。もし、音声処理指示を見つけた場合（Ｓ７０１で「ＹＥＳ」）、Ｓ７０２に移行する。音声処理指示が見つからない場合はＳ７０１で監視を継続する。 First, in S701, the noise reduction control unit 19 searches for the audio processing instruction recorded in S603 of FIG. 6 while scanning the audio signal recorded in the storage unit 13. If a voice processing instruction is found (“YES” in S701), the process proceeds to S702. If no voice processing instruction is found, monitoring is continued in S701.

続くＳ７０２、Ｓ７０３及びＳ７０４では、雑音低減制御部１９は、音声処理指示に記録されている雑音レベルを判定し、雑音レベルに応じた雑音低減の選択を行う。雑音レベルの判定は、各レベルに対応する閾値を用いて判定する。レベル１は閾値Ｔｈ１を、レベル２は閾値Ｔｈ２を、レベル３は閾値Ｔｈ３をそれぞれ用いる。まず、Ｓ７０２では、雑音レベルが最も高いレベル３（閾値Ｔｈ３）以上か否かを判定しており、レベル３以上の場合（Ｓ７０２で「ＹＥＳ」）Ｓ７０９に進み、そうでないときは（Ｓ７０２で「ＮＯ」）Ｓ７０３に進む。Ｓ７０３では、雑音レベルが中間レベルのレベル２（閾値Ｔｈ２）以上か否かを判定し、レベル２以上の場合（Ｓ７０３で「ＹＥＳ」）Ｓ７０８に進み、そうでないときは（Ｓ７０３「ＮＯ」）Ｓ７０４に進む。Ｓ７０４では雑音レベルが最も低いレベル１（閾値Ｔｈ１）以上か否かを判定し、レベル１以上の場合（Ｓ７０４で「ＹＥＳ」）Ｓ７０５に進む。雑音レベルが閾値Ｔｈ１に満たない場合（Ｓ７０４で「ＮＯ」）、レベル検出不能として、音声処理指示を無視してＳ７０１に戻る。 In subsequent S702, S703, and S704, the noise reduction control unit 19 determines the noise level recorded in the voice processing instruction and selects noise reduction according to the noise level. The noise level is determined using a threshold corresponding to each level. Level 1 uses threshold Th1, Level 2 uses threshold Th2, and Level 3 uses threshold Th3. First, in S702, it is determined whether or not the noise level is higher than level 3 (threshold Th3) or higher. If the noise level is higher than level 3 ("YES" in S702), the process proceeds to S709, otherwise (" NO ") Proceed to S703. In S703, it is determined whether or not the noise level is equal to or higher than the intermediate level 2 (threshold Th2). If the noise level is equal to or higher than level 2 ("YES" in S703), the process proceeds to S708, otherwise (S703 "NO") S704. Proceed to In S704, it is determined whether the noise level is equal to or higher than the lowest level 1 (threshold Th1). If the noise level is equal to or higher than Level 1 (“YES” in S704), the process proceeds to S705. If the noise level is less than the threshold value Th1 (“NO” in S704), it is determined that the level cannot be detected, the voice processing instruction is ignored, and the process returns to S701.

Ｓ７０５では第１雑音低減部１４ａの時定数１を対応する時定数に変更する。ここでは雑音レベルが低いので時定数をそれほど小さくせず、例えば１００Ｈｚ以下の雑音を減衰させるフィルタ特性に対応する時定数：τ１に時定数を変更する。本実施形態では、所定時間（例えば、１００ｍｓ）をかけて時定数を変更するが、当該変更は、雑音低減制御部１９からの制御信号２４に基づいて行うことができる。 In S705, the time constant 1 of the first noise reduction unit 14a is changed to a corresponding time constant. Here, since the noise level is low, the time constant is not reduced so much, for example, the time constant is changed to τ 1 corresponding to the filter characteristic for attenuating noise of 100 Hz or less. In this embodiment, the time constant is changed over a predetermined time (for example, 100 ms), but the change can be performed based on the control signal 24 from the noise reduction control unit 19.

続くＳ７０６では、雑音低減部１４で処理した音声波形に対して利得復元部１６により利得を復元する。前述のように、風切り音が発生したときには利得変更部１２により音声を電気信号に変換する利得が自動的に小さくなる。よって、風切り音低減処理を行った後で、その利得変化分を補完して録音レベルを一定に保っている。続くＳ７０７では利得復元された音声信号を記録媒体２１に記録してＳ７０１に戻る。 In subsequent S 706, the gain restoring unit 16 restores the gain of the speech waveform processed by the noise reducing unit 14. As described above, when wind noise is generated, the gain for converting the sound into an electrical signal by the gain changing unit 12 is automatically reduced. Therefore, after performing the wind noise reduction process, the gain change is complemented and the recording level is kept constant. In subsequent S707, the audio signal whose gain has been restored is recorded in the recording medium 21, and the process returns to S701.

次に、Ｓ７０３でレベル２以上と判定された場合を説明する。Ｓ７０８では、雑音低減制御部１９が、第１雑音低減部１４ａの時定数１を対応する時定数に変更する。このときの雑音レベルは比較的高いので、時定数を十分小さくし、例えば２００Ｈｚ以下の雑音を減衰させるフィルタ特性に対応する時定数：τ２にまで、所定時間をかけて変更する。その後Ｓ７０６に移行する。 Next, the case where it is determined in S703 that the level is 2 or higher will be described. In S708, the noise reduction control unit 19 changes the time constant 1 of the first noise reduction unit 14a to a corresponding time constant. Since the noise level at this time is relatively high, the time constant is made sufficiently small, for example, changed to a time constant corresponding to a filter characteristic that attenuates noise of 200 Hz or less: τ2 over a predetermined time. Thereafter, the process proceeds to S706.

次に、Ｓ７０２でレベル３以上と判定された場合を説明する。この場合、第１雑音低減部１４ａだけなく、第２雑音低減部１４ｂも作動させて雑音の低減を行う。そのためＳ７０９では、雑音低減制御部１９が、次数変更部１５を切り替えて第１雑音低減部１４ａ、第２雑音低減部１４ｂを直列接続した２次のハイパスフィルタによる処理を選択する。 Next, the case where it is determined in S702 that the level is 3 or higher will be described. In this case, not only the first noise reduction unit 14a but also the second noise reduction unit 14b are operated to reduce noise. Therefore, in S709, the noise reduction control unit 19 switches the order changing unit 15 and selects processing by a secondary high-pass filter in which the first noise reduction unit 14a and the second noise reduction unit 14b are connected in series.

続くＳ７１０では第１雑音低減部１４ａおよび第２雑音低減部１４ｂの各時定数を、次数変更部１５作動後に変更する。この時の雑音レベルは最大レベルであるので、時定数を変更する上限も最大レベルとなる。例えば２００Ｈｚ以下の雑音を減衰させるフィルタ特性に対応する時定数：τ２にまで所定時間をかけて変更する。尚、第１雑音低減部１４ａの時定数１が既に時定数τ２にまで変更済である場合は、第２雑音低減部１４ｂの時定数２のみ変更すればよい。その後、Ｓ７０６に移行する。以上の処理は、音声記録終了の指示があるまで継続する。 In subsequent S710, the time constants of the first noise reduction unit 14a and the second noise reduction unit 14b are changed after the order change unit 15 is actuated. Since the noise level at this time is the maximum level, the upper limit for changing the time constant is also the maximum level. For example, the time constant corresponding to the filter characteristic for attenuating noise of 200 Hz or less: τ2 is changed over a predetermined time. If the time constant 1 of the first noise reduction unit 14a has already been changed to the time constant τ2, only the time constant 2 of the second noise reduction unit 14b needs to be changed. Thereafter, the process proceeds to S706. The above processing continues until an instruction to end audio recording is given.

以上のように、本実施形態では音声信号を一時的に記憶部１３に記憶させておくと共に、予め風切り音の発生箇所を検出して記録しておく。そして実際の雑音低減処理では、検出された風切り音の発生タイミングに先だって、雑音低減部であるハイパスフィルタの時定数を検出された雑音レベルに対応する値に変更しておく。これにより、ハイパスフィルタの特性を、音声信号における風切り音の発生箇所直前で、十分な風切り音低減特性を有するように安定させておくことができる。よって、雑音低減処理に対する風切り音の検出遅れや雑音低減部の起動遅れの影響は解消され、かつ、雑音低減部の起動前後における信号の連続性も保たれて高品位な音声記録が可能となる。 As described above, in the present embodiment, the audio signal is temporarily stored in the storage unit 13 and the location where the wind noise is generated is detected and recorded in advance. In the actual noise reduction process, prior to the occurrence timing of the detected wind noise, the time constant of the high-pass filter that is the noise reduction unit is changed to a value corresponding to the detected noise level. Thereby, the characteristics of the high-pass filter can be stabilized so as to have sufficient wind noise reduction characteristics immediately before the wind noise generation location in the audio signal. As a result, the effects of wind noise detection delay and noise activation start delay on noise reduction processing are eliminated, and high-quality audio recording is possible while maintaining continuity of the signal before and after activation of the noise reduction unit. .

［実施形態２］
以下、実施形態２を説明する。本実施形態では、雑音検出の機構として熱電対を使用する構成を説明する。図８は、実施形態２に対応する音声信号処理装置の構成例を示す図である。図８では、音声採取のための機構としてモノラルマイクを用いている。 [Embodiment 2]
Hereinafter, Embodiment 2 will be described. In this embodiment, a configuration using a thermocouple as a noise detection mechanism will be described. FIG. 8 is a diagram illustrating a configuration example of an audio signal processing device corresponding to the second embodiment. In FIG. 8, a monaural microphone is used as a mechanism for collecting sound.

音声検出部８０はマイクユニット８１、マイク８２、マイク支持部材８３、熱電対８４で構成され、部分的に示してある音声信号処理装置の筐体８５に取り付けてある。筐体８５には孔８５ａが設けてあり、孔８５ａを通って音声がマイク８２に到達する。マイク８２はマイクユニット８１に対して防振ゴムで成型されたマイク支持部材８３により弾性支持されているので、録音機器自体が発生する振動が音声として記録されない構造になっている。 The voice detection unit 80 includes a microphone unit 81, a microphone 82, a microphone support member 83, and a thermocouple 84, and is attached to a casing 85 of a voice signal processing device shown partially. The housing 85 is provided with a hole 85a, and the sound reaches the microphone 82 through the hole 85a. Since the microphone 82 is elastically supported by the microphone support member 83 formed of vibration-proof rubber with respect to the microphone unit 81, the vibration generated by the recording device itself is not recorded as sound.

熱電対８４はマイクユニット８１に取り付けられている。熱電対８４は風速計などに利用されており、風で熱電対８４が冷えることによる抵抗変化により風速を検出する。孔８５ａより流入する風はマイクユニット８１内で対流するため、マイクにあたる風と熱電対８４にあたる風の位相がずれる可能性がある。そこで、熱電対８４はマイク８２のできるだけ近傍に配置される必要がある。熱電対８４をマイクユニット８１内に配置すると、筐体８５に対して熱電対８４が露出しないため、信頼性が高くなる。 The thermocouple 84 is attached to the microphone unit 81. The thermocouple 84 is used for an anemometer or the like, and detects the wind speed based on a resistance change caused by the thermocouple 84 being cooled by wind. Since the wind flowing in from the hole 85a is convected in the microphone unit 81, the phase of the wind hitting the microphone and the wind hitting the thermocouple 84 may be shifted. Therefore, the thermocouple 84 needs to be arranged as close to the microphone 82 as possible. When the thermocouple 84 is arranged in the microphone unit 81, the thermocouple 84 is not exposed to the housing 85, and thus the reliability is improved.

マイク８２の音声信号は、実施形態１と同様に雑音低減処理されて記録媒体２１に記録される。熱電対８４の信号は雑音検出部１８に入力される。雑音検出部１８は熱電対８４の信号を増幅し、公知のウィンドコンパレータ等を用いて風速の有無とそのレベルを検出する。雑音低減制御部１９は、雑音検出部１８からの風切り音レベルに応じて実施形態１で説明したのと同様に、雑音低減部１４を制御して、風切り音の低減処理を行う。 The audio signal from the microphone 82 is subjected to noise reduction processing and recorded on the recording medium 21 as in the first embodiment. The signal from the thermocouple 84 is input to the noise detector 18. The noise detector 18 amplifies the signal of the thermocouple 84 and detects the presence and level of wind speed using a known wind comparator or the like. The noise reduction control unit 19 controls the noise reduction unit 14 in accordance with the wind noise level from the noise detection unit 18 to perform wind noise reduction processing in the same manner as described in the first embodiment.

一般に、熱電対による風速の検出には遅れが生ずる。従って、リアルタイムで風切り音を処理しようとする場合、この検出遅延のために発生初期段階の風切り音の低減が困難であった。そこで、本実施形態では雑音検出部１８と雑音低減制御部１９との間に新たに遅れ補正部８６が設ける。遅れ補正部８６は、熱電対８４の検出遅れ（例えば０．２秒）が設定されており、図６のＳ６０３では、この検出遅れ分と時定数変更時間分を見込んで雑音低減部１４の起動タイミングを決定し、記憶部１３に記憶された音声信号に音声処理の指示を記録する。そのために記録媒体２１への音声記録時には風切り音の検出遅れは解消される。 In general, there is a delay in detecting wind speed by a thermocouple. Therefore, when trying to process wind noise in real time, it is difficult to reduce wind noise at the initial stage due to this detection delay. Therefore, in this embodiment, a delay correction unit 86 is newly provided between the noise detection unit 18 and the noise reduction control unit 19. The delay correction unit 86 is set with a detection delay (for example, 0.2 seconds) of the thermocouple 84. In S603 of FIG. 6, the noise reduction unit 14 is activated in anticipation of the detection delay and the time constant change time. The timing is determined, and an audio processing instruction is recorded in the audio signal stored in the storage unit 13. For this reason, the detection delay of wind noise is eliminated during recording of sound on the recording medium 21.

電源状態検出部８７は電源８８の消耗状態を検出しており、電源が所定レベルまで下がった場合に、雑音検出部１８に通知して熱電対８４の駆動を停止する。これは熱電対８４を駆動することで電源が更に消耗するためである。電源状態検出部８７は、同時に雑音低減制御部１９にも通知して雑音低減部１４に対する時定数をその時点で設定していた時定数に固定的とする。なお、録音開始時から電源が消耗している場合、熱電対８４を一度駆動して風切り音レベルを検出した後に、雑音低減部１４ａ、１４ｂの特性と次数変更部１５の状態とを決定し、それらを固定した後に熱電対８４の駆動を中止する。電源８８は、音声信号処理装置が動作するための電源を供給する、例えばバッテリーであって、図８では電源状態検出手段８７以外との接続関係を省略しているが、各機能ブロックに電源供給を行っている。なお、図１では、発明の動作と直接に関連しないため、記載を省略していたものである。 The power supply state detection unit 87 detects the consumption state of the power supply 88, and when the power supply is lowered to a predetermined level, notifies the noise detection unit 18 and stops driving the thermocouple 84. This is because driving the thermocouple 84 further consumes power. The power supply state detection unit 87 also notifies the noise reduction control unit 19 at the same time so that the time constant for the noise reduction unit 14 is fixed to the time constant set at that time. If the power has been consumed since the start of recording, after the thermocouple 84 is driven once and the wind noise level is detected, the characteristics of the noise reduction units 14a and 14b and the state of the order changing unit 15 are determined. After fixing them, the driving of the thermocouple 84 is stopped. The power supply 88 supplies power for operating the audio signal processing apparatus, for example, a battery. In FIG. 8, the connection relationship with other than the power supply state detection means 87 is omitted, but power is supplied to each functional block. It is carried out. In FIG. 1, the description is omitted because it is not directly related to the operation of the invention.

図９は上記を説明するフローチャートであり、このフローは音声信号処理装置の電源が投入後に開始される。Ｓ９０１では電源状態検出部８７により電源状態の検出を行い、所定レベルよりも消耗している状態が検出された場合（Ｓ９０１で「ＹＥＳ」）、Ｓ９０２に進む。電源が所定レベルより消耗していない場合は（Ｓ９０１で「ＮＯ」）、Ｓ９０１に戻って、電源状態の監視を継続する。この場合は、通常の雑音低減処理が実行されることになる。 FIG. 9 is a flowchart for explaining the above, and this flow is started after the power of the audio signal processing apparatus is turned on. In S901, the power supply state detection unit 87 detects the power supply state. If a state in which the power consumption is consumed below a predetermined level is detected (“YES” in S901), the process proceeds to S902. If the power is not consumed below a predetermined level (“NO” in S901), the process returns to S901 to continue monitoring the power supply state. In this case, normal noise reduction processing is performed.

続くＳ９０２では、録音を開始してから既に雑音検出が行われたか否かを判定し、検出済の場合（Ｓ９０２で「ＹＥＳ」）Ｓ９０７に移行する。まだ、雑音検出が行われていない場合（Ｓ９０２で「ＮＯ」）、Ｓ９０３に進む。この雑音検出処理の有無については、雑音検出部１８が、前回の検出値を有しているか否かに基づいて判定すればよい。なお、検出値は電源投入時にリセットされるものとして、電源投入時は雑音検出が行われていないと判定される。 In subsequent S902, it is determined whether or not noise detection has already been performed since the start of recording, and if it has been detected (“YES” in S902), the process proceeds to S907. If noise detection has not yet been performed (“NO” in S902), the process proceeds to S903. The presence or absence of this noise detection process may be determined based on whether or not the noise detection unit 18 has the previous detection value. Note that the detected value is reset when the power is turned on, and it is determined that no noise is detected when the power is turned on.

続くＳ９０３では、雑音検出部１８が熱電対８４の駆動して、続くＳ９０４では熱電対８４により風切り音レベルを検出する。Ｓ９０５では、検出した風切り音のレベルに応じて雑音低減制御部１９が雑音低減部１４ａ、１４ｂの時定数を決定し、次数変更部１５の状態を決める。この時、既に雑音検出が行われて時定数、次数が設定されている状態であるならば、既に設定されている時定数及び次数を変更しない。一方、新たに雑音検出を行った場合（即ち、Ｓ９０２で「ＮＯ」と判定された場合）は、Ｓ９０４で検出された雑音レベルに合わせた時定数及び次数を設定する。なお、設定値はその後固定される。なお、時定数、次数を変更する場合、例えば雑音低減部１４ａの時定数を変更後に次数変更部１５で次数を切り替え、その後に雑音低減部１４ｂの時定数を変更する。続くＳ９０６では、雑音検出部１８が熱電対８４の駆動を停止して省電力化を図りＳ９０１に戻る。 In subsequent S903, the noise detection unit 18 drives the thermocouple 84, and in subsequent S904, the wind noise level is detected by the thermocouple 84. In S905, the noise reduction control unit 19 determines the time constants of the noise reduction units 14a and 14b according to the detected wind noise level, and determines the state of the order changing unit 15. At this time, if noise detection has already been performed and the time constant and order are set, the time constant and order already set are not changed. On the other hand, when noise detection is newly performed (that is, when “NO” is determined in S902), the time constant and the order are set in accordance with the noise level detected in S904. The set value is fixed thereafter. When changing the time constant and the order, for example, after changing the time constant of the noise reduction unit 14a, the order is changed by the order change unit 15, and then the time constant of the noise reduction unit 14b is changed. In subsequent S906, the noise detection unit 18 stops driving the thermocouple 84 to save power, and the process returns to S901.

Ｓ９０７では、雑音検出部１８が、前回の雑音検出から一定時間が経過しているか否かを判定する。ここで一定時間は、例えば０．５ｓと設定することができる。もし、一定時間が経過している場合には（Ｓ９０７で「ＹＥＳ」）、Ｓ９０３に移行する。一方。一定時間が経過していない場合には（Ｓ９０７で「ＮＯ」）、Ｓ９０５に移行する。 In S907, the noise detection unit 18 determines whether or not a certain time has elapsed since the previous noise detection. Here, the certain time can be set to 0.5 s, for example. If the predetermined time has elapsed (“YES” in S907), the process proceeds to S903. on the other hand. If the predetermined time has not elapsed (“NO” in S907), the process proceeds to S905.

以上の構成によれば、熱電対８４の稼働による電源消耗を抑えることができる。この方法は省電力化には貢献するが、雑音低減部１４の時定数、次数を固定的に設定してしまうとその後の風切り音の変化に対応できない場合も出てくる。そこで、Ｓ９０７において前回の雑音検出後に一定時間が経過したか否かを判定し、熱電対８４を間欠的に駆動することを可能とする。これにより、風切り音の変化に対応できるとともに、電源消費を抑制して省電力化を図ることができる。 According to the above configuration, power consumption due to operation of the thermocouple 84 can be suppressed. This method contributes to power saving, but if the time constant and the order of the noise reduction unit 14 are fixedly set, there may be cases where it is not possible to cope with subsequent changes in wind noise. Therefore, in S907, it is determined whether or not a certain time has elapsed after the previous noise detection, and the thermocouple 84 can be driven intermittently. As a result, it is possible to cope with changes in wind noise and to save power by suppressing power consumption.

以上のように実施形態２では熱電対を用いて雑音検出を行ったが、実施形態１のような対のマイクの出力相関値に基づく風切り音検出と組み合わせて風切り音検出精度を上げることもできる。例えば、対のマイク出力相関値を用いて風切り音発生タイミングを掴み、そのレベルは熱電対で検出する方法がある。また、共に風切り音を検出しているときのみ風切り音が発生していると判定して雑音低減処理を行ってもよい。 As described above, the noise detection is performed using the thermocouple in the second embodiment, but the wind noise detection accuracy can be increased in combination with the wind noise detection based on the output correlation value of the pair of microphones as in the first embodiment. . For example, there is a method in which the wind noise generation timing is grasped using a pair of microphone output correlation values, and the level is detected by a thermocouple. Further, it may be determined that wind noise is generated only when wind noise is detected, and noise reduction processing may be performed.

また、図８ではモノラルマイクの実施形態であったが、ステレオマイクにおいても熱電対を用いることもできる。例えば図１０に示すように熱電対８４はステレオマイクの片方にのみ設けてあり、その出力で左右の各チャンネルの音声信号を処理しても良いし、ステレオマイクの両方に熱電対８４を設け、その平均値で音声信号処理を行っても良い。 Moreover, although FIG. 8 is an embodiment of a monaural microphone, a thermocouple can also be used in a stereo microphone. For example, as shown in FIG. 10, the thermocouple 84 is provided only on one side of the stereo microphone, and the audio signal of each of the left and right channels may be processed by the output, or the thermocouple 84 is provided on both of the stereo microphones. Audio signal processing may be performed with the average value.

以上説明した発明の実施形態２では、風切り音検出のために熱電対を設けることで更に高精度に風切り音の検出ができる。 In Embodiment 2 of the invention described above, wind noise can be detected with higher accuracy by providing a thermocouple for wind noise detection.

（その他の実施例）
また、本発明は、以下の処理を実行することによっても実現される。即ち、上述した実施形態の機能を実現するソフトウェア（プログラム）を、ネットワーク又は各種記憶媒体を介してシステム或いは装置に供給し、そのシステム或いは装置のコンピュータ（またはＣＰＵやＭＰＵ等）がプログラムを読み出して実行する処理である。 (Other examples)
The present invention can also be realized by executing the following processing. That is, software (program) that realizes the functions of the above-described embodiments is supplied to a system or apparatus via a network or various storage media, and a computer (or CPU, MPU, or the like) of the system or apparatus reads the program. It is a process to be executed.

Claims

Voice collection means;
Storage means for storing the voice signal collected by the voice sampling means;
Noise detection means for detecting noise contained in the audio signal stored in the storage means;
Noise reduction means for reducing noise detected by the noise detection means from the voice signal;
An audio signal processing apparatus comprising: a setting unit that sets a time constant for controlling a noise reduction characteristic of the noise reduction unit according to a level of the noise detected by the noise detection unit;
The setting means starts setting the time constant to the noise reduction means in response to detection of the noise by the noise detection means,
The noise reduction unit reads the audio signal from the storage unit and reduces the noise when the setting of the time constant according to the noise level by the setting unit is completed. Signal processing device.

The noise reduction means is configured by connecting a first noise reduction means and a second noise reduction means in series,
The audio signal processing apparatus further includes a switching unit that switches between using either the first noise reduction unit and the second noise reduction unit, or using only one of them.
The setting means uses both the first noise reduction means and the second noise reduction means when the noise level exceeds a level corresponding to a predetermined time constant set in the noise reduction means. The audio signal processing apparatus according to claim 1, wherein the switching means is switched so as to do so.

When both the first noise reduction unit and the second noise reduction unit are used, the setting unit switches the switching unit to use both, and then sets the time constant of the second noise reduction unit. The audio signal processing apparatus according to claim 2, wherein the setting of the audio signal is started.

The said noise detection means has a thermocouple provided in the vicinity of the said audio | voice collection means, The said noise is detected based on the signal detected by the said thermocouple, Any one of Claim 1 thru | or 3 characterized by the above-mentioned. The audio signal processing device according to claim 1.

The noise detection means detects the noise by activating the thermocouple every predetermined time,
5. The audio signal according to claim 4, wherein the setting unit fixedly sets a time constant of the noise reduction unit during the predetermined time based on a noise level detected by the noise detection unit. Processing equipment.

The audio signal processing device further includes power state detection means for detecting a power consumption state of the audio signal processing device,
When the power supply state detection unit determines that the power supply is consumed below a predetermined level, the noise detection unit turns the thermocouple only when it is determined that a time constant is not set in the noise reduction unit. Activate and detect the noise,
6. The audio signal processing apparatus according to claim 4, wherein the setting unit fixedly sets a time constant of the noise reduction unit based on a noise level detected by the noise detection unit.

The audio sampling means is a stereo microphone, and the thermocouple is provided only in the vicinity of the microphone of one of the left and right channels,
The audio signal according to any one of claims 4 to 6, wherein the noise detection unit detects the noise further based on a comparison of audio signals of left and right channels collected by the stereo microphone. Processing equipment.

The voice sampling means is a stereo microphone,
4. The audio signal processing according to claim 1, wherein the noise detection unit detects the noise based on a comparison of audio signals of left and right channels collected by the stereo microphone. 5. apparatus.

Voice collection means;
Storage means for storing the voice signal collected by the voice sampling means;
Noise detection means for detecting noise contained in the audio signal stored in the storage means;
Noise reduction means for reducing noise detected by the noise detection means from the voice signal;
A control method for an audio signal processing apparatus comprising: a setting unit that sets a time constant for controlling a noise reduction characteristic of the noise reduction unit according to a level of the noise detected by the noise detection unit,
The storage means storing the voice signal collected by the voice sampling means;
The noise detecting means detecting noise included in the audio signal stored in the storage means;
The setting means starting the setting of the time constant to the noise reduction means in response to detection of the noise by the noise detection means;
The noise reduction means comprises a step of reading the audio signal from the storage means and reducing the noise when the setting of the time constant according to the noise level by the setting means is completed. A control method of an audio signal processing device.