JP5219499B2

JP5219499B2 - Wind noise reduction device

Info

Publication number: JP5219499B2
Application number: JP2007334121A
Authority: JP
Inventors: 昌弘吉田; 智岐奥
Original assignee: Sanyo Electric Co Ltd
Current assignee: Sanyo Electric Co Ltd
Priority date: 2007-08-01
Filing date: 2007-12-26
Publication date: 2013-06-26
Anticipated expiration: 2027-12-26
Also published as: JP2009055583A

Description

本発明は、入力音響信号に含まれる風雑音を低減するための風雑音低減装置及び風雑音低減方法に関し、また、その風雑音低減装置を利用した録音装置及び撮像装置に関する。 The present invention relates to a wind noise reduction device and a wind noise reduction method for reducing wind noise contained in an input acoustic signal, and also relates to a recording device and an imaging device using the wind noise reduction device.

マイクロホンを備えた録音装置において、マイクロホンに風が当たると音響信号に風雑音が混入する。この風雑音は、マイクロホンの振動板に風圧が加わることによって発生し、本来の音響信号にとっての雑音となるため除去されることが望ましい。 In a recording apparatus equipped with a microphone, wind noise is mixed into an acoustic signal when wind strikes the microphone. This wind noise is generated when a wind pressure is applied to the diaphragm of the microphone, and is preferably removed because it becomes noise for the original acoustic signal.

風雑音が存在する周波数帯域は比較的低く、通常、風雑音は３００Ｈｚ以下程度の帯域に集中して存在する。このような特性を利用し、従来の風雑音低減装置では、低帯域信号を中心に風雑音の低減を図っていた。一般的には、図１１に示す如く、ハイパスフィルタ（ＨＰＦ）とローパスフィルタ（ＬＰＦ）を用いて入力音響信号を低帯域成分とそれ以上の帯域成分とに分離し、低帯域の信号を低減させてから（又はカットしてから）両者を再度足し合わせるという手法がとられる。 The frequency band in which wind noise exists is relatively low, and normally wind noise is concentrated in a band of about 300 Hz or less. Utilizing such characteristics, the conventional wind noise reduction apparatus has been designed to reduce wind noise centering on low-band signals. Generally, as shown in FIG. 11, a high-pass filter (HPF) and a low-pass filter (LPF) are used to separate an input acoustic signal into a low-band component and a higher-band component to reduce a low-band signal. After that (or after cutting), the method of adding both together again is taken.

また、風雑音の有無を判定する部位を備えた風雑音低減装置も提案されている。風雑音の有無の判定は、一般的に「風雑音が左右のチャンネル信号間で相互相関がない」という特徴を利用する。具体的には、入力音響信号を形成する左右のチャンネル信号間で相互相関を求め、相互相関を表す相関値が或る閾値以下である場合に、入力音響信号に風雑音が含まれていると判断する。また、単に風雑音の有無を判断するだけでなく、求められた相関値は風雑音の強さを表す指標としても利用される。例えば、相関値に応じて低帯域信号の低減度合いを変動させる手法も提案されている（例えば、下記特許文献１参照）。 There has also been proposed a wind noise reduction device having a portion for determining the presence or absence of wind noise. The presence / absence of wind noise generally uses the feature that “wind noise has no cross-correlation between left and right channel signals”. Specifically, the cross-correlation is obtained between the left and right channel signals forming the input acoustic signal, and the input acoustic signal contains wind noise when the correlation value representing the cross-correlation is a certain threshold value or less. to decide. In addition to simply determining the presence or absence of wind noise, the obtained correlation value is also used as an index representing the strength of wind noise. For example, a method of varying the degree of reduction of the low-band signal according to the correlation value has been proposed (see, for example, Patent Document 1 below).

特開平１１−６９４８０号公報Japanese Patent Laid-Open No. 11-69480

低帯域は、風雑音の周波数帯域を含み、風雑音の影響を多く受けるが、低帯域には音の重要な要素が含まれている。特に、人間の声に関しては、その声のピッチ（ピッチ周波数）が男性で９０〜１６０Ｈｚ、女性で２３０〜３７０Ｈｚ程度であり、音質を決定する上で非常に重要な要素が低帯域に含まれている。ピッチとは、声帯振動による信号の基本周波数のことである。このような重要要素を含む帯域の成分を、単純に低減したりカットしたりすると風雑音とは異なる信号成分の要素まで低減又はカットされてしまい、歪んだ音になってしまう。人間の声の場合では、その声が小さくなったり声色が変化してしまったりする。 The low band includes a frequency band of wind noise and is greatly affected by wind noise, but the low band includes important elements of sound. In particular, with regard to human voice, the pitch (pitch frequency) of the voice is 90 to 160 Hz for males and 230 to 370 Hz for females, and very important elements for determining sound quality are included in the low band. Yes. The pitch is a fundamental frequency of a signal due to vocal cord vibration. If a band component including such an important element is simply reduced or cut, the signal component element different from the wind noise is reduced or cut, resulting in a distorted sound. In the case of a human voice, the voice becomes smaller or the voice color changes.

また、低帯域だけに風雑音低減対策を施し、それ以外の帯域に対して風雑音低減対策を何ら施さなければ、比較的高い周波数の風雑音（コロコロといった音）が残存し、ユーザは違和感を覚える。 Also, if wind noise reduction measures are taken only in the low band and no wind noise reduction measures are taken in the other bands, wind noise with a relatively high frequency (sounds like a clog) will remain and the user will feel uncomfortable. Remember.

そこで本発明は、歪みの発生が少なく風雑音低減効果の高い風雑音低減装置及び風雑音低減方法を提供することを目的とする。また、その風雑音低減装置を利用した録音装置及び撮像装置を提供することを目的とする。 SUMMARY OF THE INVENTION An object of the present invention is to provide a wind noise reduction device and a wind noise reduction method that generate little distortion and have a high wind noise reduction effect. It is another object of the present invention to provide a recording device and an imaging device using the wind noise reduction device.

上記目的を達成するために本発明に係る風雑音低減装置は、入力音響信号から風雑音が低減された補正音響信号を生成する風雑音低減装置において、風雑音の帯域を含む所定帯域を第１帯域とし且つ前記第１帯域よりも周波数が高い所定帯域を第２帯域とした場合、当該風雑音低減装置は、前記入力音響信号に含まれる、第１帯域よりも周波数が高い帯域の音響信号に基づいて、前記入力音響信号に含まれる第１帯域の音響信号と異なる、前記第１帯域の音響信号を生成する信号生成手段を有し、前記信号生成手段が生成した前記音響信号に基づいて第１補正音響信号を生成する第１補正手段と、前記入力音響信号の第２帯域の音響信号の信号レベルを低減することにより、前記風雑音が低減された第２帯域の音響信号である第２補正音響信号を生成する第２補正手段と、前記第１及び第２補正音響信号に基づいて前記補正音響信号を出力する補正音響信号出力手段と、を備えたことを特徴とする。 In order to achieve the above object, a wind noise reduction device according to the present invention is a wind noise reduction device that generates a corrected acoustic signal in which wind noise is reduced from an input acoustic signal. When the second band is a predetermined band having a frequency higher than that of the first band, the wind noise reduction device converts the sound signal in the band having a frequency higher than the first band included in the input acoustic signal. Based on the acoustic signal generated by the signal generating means, and having a signal generating means for generating the acoustic signal of the first band different from the acoustic signal of the first band included in the input acoustic signal. A first correcting means for generating a first corrected acoustic signal, and a second acoustic signal in the second band in which the wind noise is reduced by reducing the signal level of the second acoustic signal in the input acoustic signal. Corrected acoustic signal A second correction means for generating, characterized by comprising a correction sound signal output means for outputting the corrected sound signal based on the first and second correction acoustic signal.

重要な音の要素を含む比較的低周波側の第１帯域に対しては音響信号の生成処理（例えば、信号復元処理）を実行し、音の歪みの発生を抑制しつつ風雑音の低減を図る。一方、第１帯域よりも高周波側の、信号低減によって歪みを生じにくい第２帯域に対しては、信号低減処理によって風雑音の低減を図る。これにより、歪みの発生を抑えつつ、高い風雑音低減効果が得られる。 Acoustic signal generation processing (for example, signal restoration processing) is performed on the first low frequency band containing important sound elements to reduce wind noise while suppressing sound distortion. Plan. On the other hand, wind noise is reduced by a signal reduction process for the second band on the higher frequency side than the first band, where distortion is less likely to occur due to signal reduction. Thereby, the high wind noise reduction effect is acquired, suppressing generation | occurrence | production of distortion.

具体的には例えば、前記第１補正手段は、前記入力音響信号に含まれる第１帯域の前記音響信号と前記信号生成手段が生成した前記音響信号に基づいて、前記第１補正音響信号を生成する。 Specifically, for example, the first correcting unit generates the first corrected acoustic signal based on the acoustic signal in the first band included in the input acoustic signal and the acoustic signal generated by the signal generating unit. To do.

更に具体的には例えば、前記入力音響信号は、複数のチャンネル信号から成り、当該風雑音低減装置は、更に、前記風雑音の帯域を含む各チャンネル信号の所定帯域成分の、互いに異なるチャンネル信号間における相互相関に基づいて、前記入力音響信号に対する前記風雑音の影響度合いを判定する風雑音判定手段を備え、前記第１補正手段は、前記風雑音判定手段の判定結果に応じて、前記第１補正音響信号を生成する。 More specifically, for example, the input acoustic signal includes a plurality of channel signals, and the wind noise reduction device further includes a predetermined band component of each channel signal including the wind noise band between different channel signals. Wind noise determination means for determining the degree of influence of the wind noise on the input acoustic signal based on the cross-correlation in the above, the first correction means according to the determination result of the wind noise determination means A corrected acoustic signal is generated.

また例えば、前記入力音響信号は、複数のチャンネル信号から成り、当該風雑音低減装置は、更に、前記風雑音の帯域を含む各チャンネル信号の所定帯域成分の、互いに異なるチャンネル信号間における相互相関に基づいて、前記入力音響信号に対する前記風雑音の影響度合いを判定する風雑音判定手段を備え、前記第２補正手段は、前記風雑音判定手段の判定結果に応じて、前記第２補正音響信号を生成する。 Further, for example, the input acoustic signal is composed of a plurality of channel signals, and the wind noise reduction device further performs cross-correlation between different channel signals of predetermined band components of each channel signal including the wind noise band. Based on the wind noise determining means for determining the degree of influence of the wind noise on the input acoustic signal, and the second correcting means determines the second corrected acoustic signal according to the determination result of the wind noise determining means. Generate.

或いは例えば、前記入力音響信号は周波数軸上の信号として当該風雑音低減装置に与えられ、且つ、前記入力音響信号は複数のチャンネル信号から成り、前記第２補正手段は、前記入力音響信号の第２帯域を複数の要素帯域に分割し、各要素帯域の音響信号の信号レベルを低減することにより周波数軸上の前記第２補正音響信号を生成し、前記複数の要素帯域の夫々に対して、互いに異なるチャンネル信号間における前記要素帯域の音響信号の相互相関を求め、各相互相関に基づいて前記要素帯域ごとに前記信号レベルの低減度合いを決定する。 Alternatively, for example, the input acoustic signal is given to the wind noise reduction device as a signal on the frequency axis, and the input acoustic signal is composed of a plurality of channel signals, and the second correcting means is The second corrected acoustic signal on the frequency axis is generated by dividing the two bands into a plurality of element bands and reducing the signal level of the acoustic signal of each element band, and for each of the plurality of element bands, The cross-correlation of the acoustic signal of the element band between different channel signals is obtained, and the degree of reduction of the signal level is determined for each element band based on each cross-correlation.

これにより、無駄に歪みを発生させることなく第２帯域に対する風雑音低減効果が得られる。 Thereby, the wind noise reduction effect with respect to the second band can be obtained without generating distortion unnecessarily.

また例えば、前記入力音響信号は時間軸上の信号として当該風雑音低減装置に与えられ、且つ、前記入力音響信号は複数のチャンネル信号から成り、前記第１補正手段によって生成される前記第１補正音響信号は、時間軸上の信号であり、当該音声信号補正装置は、更に、前記入力音響信号から前記第１帯域を含まず且つ前記第２帯域を含む所定帯域の成分を抽出する抽出手段と、前記第１補正音響信号と前記抽出手段の抽出信号との合成信号の信号形式を時間軸上から周波数軸上に変換する時間周波数変換手段と、を備え、前記第２補正手段は、周波数軸上の前記合成信号における第２帯域の音響信号の信号レベルを低減することにより、周波数軸上の前記第２補正音響信号を生成し、前記補正音響信号出力手段は、前記第２補正手段から得られる周波数軸上の前記第２補正音響信号と、前記時間周波数変換手段から得られる周波数軸上の前記第１補正音響信号を含む音響信号と、に基づいて周波数軸上の前記補正音響信号を出力する。 Also, for example, the input acoustic signal is given to the wind noise reduction device as a signal on the time axis, and the input acoustic signal is composed of a plurality of channel signals and is generated by the first correction means. The acoustic signal is a signal on a time axis, and the audio signal correction apparatus further includes an extraction unit that extracts a component of a predetermined band that does not include the first band and includes the second band from the input acoustic signal. A time frequency conversion means for converting a signal format of a synthesized signal of the first corrected acoustic signal and the extraction signal of the extraction means from a time axis to a frequency axis, and the second correction means includes a frequency axis The second corrected acoustic signal on the frequency axis is generated by reducing the signal level of the second band acoustic signal in the synthesized signal above, and the corrected acoustic signal output means is provided by the second correcting means. The corrected acoustic signal on the frequency axis is output based on the second corrected acoustic signal on the frequency axis and the acoustic signal including the first corrected acoustic signal on the frequency axis obtained from the time-frequency conversion means. To do.

このように構成すれば、風雑音低減装置のエンコーダへの組み込みが容易となる。 With this configuration, the wind noise reduction device can be easily incorporated into the encoder.

そして例えば、前記第２補正手段は、周波数軸上の前記合成信号の第２帯域を複数の要素帯域に分割し、各要素帯域の音響信号の信号レベルを低減することにより周波数軸上の前記第２補正音響信号を生成し、前記複数の要素帯域の夫々に対して、互いに異なるチャンネル信号間における前記要素帯域の音響信号の相互相関を求め、各相互相関に基づいて前記要素帯域ごとに前記信号レベルの低減度合いを決定する。 And, for example, the second correction means divides the second band of the synthesized signal on the frequency axis into a plurality of element bands, and reduces the signal level of the acoustic signal in each element band, thereby reducing the first level on the frequency axis. 2 corrected acoustic signals are generated, cross-correlation of the acoustic signals of the element bands between different channel signals is obtained for each of the plurality of element bands, and the signal for each element band is obtained based on each cross-correlation Determine the level of reduction.

また例えば、前記入力音響信号は、複数のチャンネル信号から成り、前記第２補正手段は、前記第２帯域の全部又は一部の帯域に着目し、前記入力音響信号に含まれる前記複数のチャンネル信号分の着目帯域の音響信号を平均化することにより、前記風雑音の影響を比較的多く受けているチャンネルにおける前記着目帯域の音響信号の信号レベルを低減させ、この平均化によって得られた信号から前記第２補正音響信号を生成する。 Further, for example, the input acoustic signal is composed of a plurality of channel signals, and the second correction means focuses on all or a part of the second band, and the plurality of channel signals included in the input acoustic signal. The signal level of the acoustic signal in the bandwidth of interest in a channel that is relatively affected by the wind noise is reduced by averaging the acoustic signal in the bandwidth of interest in minutes, and the signal obtained by this averaging is reduced. The second corrected acoustic signal is generated.

また例えば、前記入力音響信号は、複数のチャンネル信号から成り、前記第２補正手段は、前記第２帯域の全部又は一部の帯域に着目し、前記入力音響信号に含まれる前記複数のチャンネル信号分の着目帯域の音響信号の内、最小の信号レベルを有する音響信号を最小音響信号として且つそれ以外の音響信号を非最小音響信号として特定し、前記非最小音響信号を前記最小音響信号にて置き換えることにより、前記風雑音の影響を比較的多く受けているチャンネルにおける前記着目帯域の音響信号の信号レベルを低減させ、この置き換えによって得られた信号から前記第２補正音響信号を生成する。 Further, for example, the input acoustic signal is composed of a plurality of channel signals, and the second correction means focuses on all or a part of the second band, and the plurality of channel signals included in the input acoustic signal. Minute acoustic signal having the minimum signal level is specified as the minimum acoustic signal and other acoustic signals are specified as non-minimum acoustic signals, and the non-minimum acoustic signal is determined by the minimum acoustic signal. By replacing, the signal level of the acoustic signal in the band of interest in a channel that is relatively affected by the wind noise is reduced, and the second corrected acoustic signal is generated from the signal obtained by this replacement.

また、上記目的を達成するために本発明に係る録音装置は、前記風雑音低減装置と、前記風雑音低減装置に対する前記入力音響信号を生成するためのマイクロホンと、を備えたことを特徴とする。 In order to achieve the above object, a recording apparatus according to the present invention includes the wind noise reduction device and a microphone for generating the input acoustic signal for the wind noise reduction device. .

また、上記目的を達成するために本発明に係る撮像装置は、前記風雑音低減装置と、前記風雑音低減装置に対する前記入力音響信号を生成するためのマイクロホンと、画像を取得するための撮像手段と、を備えたことを特徴とする。 In order to achieve the above object, an imaging apparatus according to the present invention includes the wind noise reduction apparatus, a microphone for generating the input acoustic signal for the wind noise reduction apparatus, and an imaging means for acquiring an image. And.

また、上記目的を達成するために本発明に係る風雑音低減方法は、入力音響信号から風雑音が低減された補正音響信号を生成する風雑音低減方法において、風雑音の帯域を含む所定帯域を第１帯域とし且つ前記第１帯域よりも周波数が高い所定帯域を第２帯域とした場合、当該風雑音低減方法は、前記入力音響信号に含まれる、第１帯域よりも周波数が高い帯域の音響信号に基づいて、前記入力音響信号に含まれる第１帯域の音響信号と異なる、前記第１帯域の音響信号を生成する信号生成ステップと、前記信号生成ステップにて生成された前記音響信号に基づいて第１補正音響信号を生成する第１補正ステップと、前記入力音響信号の第２帯域の音響信号の信号レベルを低減することにより、前記風雑音が低減された第２帯域の音響信号である第２補正音響信号を生成する第２補正ステップと、を備え、前記第１及び第２補正音響信号に基づいて前記補正音響信号を生成することを特徴とする。 In order to achieve the above object, a wind noise reduction method according to the present invention is a wind noise reduction method for generating a corrected acoustic signal in which wind noise is reduced from an input acoustic signal. When the second band is a predetermined band having a frequency higher than that of the first band and the first band, the wind noise reducing method includes sound in a band higher in frequency than the first band included in the input acoustic signal. Based on the signal, a signal generation step of generating an acoustic signal of the first band different from the acoustic signal of the first band included in the input acoustic signal, and the acoustic signal generated in the signal generation step A first correction step for generating a first corrected acoustic signal, and a second-band acoustic signal in which the wind noise is reduced by reducing a signal level of the second-band acoustic signal of the input acoustic signal. That a second correction step of generating a second correction acoustic signals, comprises a, and generates the corrected sound signal based on the first and second correction acoustic signal.

本発明によれば、歪みの発生が少なく風雑音低減効果の高い風雑音低減装置及び風雑音低減方法並びに録音装置及び撮像装置を提供することができる。 According to the present invention, it is possible to provide a wind noise reduction device, a wind noise reduction method, a recording device, and an imaging device that generate little distortion and have a high wind noise reduction effect.

本発明の意義ないし効果は、以下に示す実施の形態の説明により更に明らかとなろう。ただし、以下の実施の形態は、あくまでも本発明の一つの実施形態であって、本発明ないし各構成要件の用語の意義は、以下の実施の形態に記載されたものに制限されるものではない。 The significance or effect of the present invention will become more apparent from the following description of embodiments. However, the following embodiment is merely one embodiment of the present invention, and the meaning of the term of the present invention or each constituent element is not limited to that described in the following embodiment. .

以下、本発明の実施の形態につき、図面を参照して具体的に説明する。参照される各図において、同一の部分には同一の符号を付し、同一の部分に関する重複する説明を原則として省略する。後に第１〜第５実施例を説明するが、まず、各実施例に共通する事項又は各実施例にて参照される事項について説明する。 Hereinafter, embodiments of the present invention will be specifically described with reference to the drawings. In each of the drawings to be referred to, the same part is denoted by the same reference numeral, and redundant description regarding the same part is omitted in principle. The first to fifth embodiments will be described later. First, matters common to each embodiment or items referred to in each embodiment will be described.

図１は、本発明の実施形態に係る撮像装置１の外観斜視図である。撮像装置１は、音声録音をも可能なデジタルビデオカメラである。撮像装置１の筐体の左側にはマイクロホンＭＩＣ１が備え付けられ、撮像装置１の筐体の右側にはマイクロホンＭＩＣ２が備え付けられている。マイクロホンＭＩＣ１が撮像装置１の左方向から到来する音を収音し、マイクロホンＭＩＣ２が撮像装置１の右方向から到来する音を収音することにより、マイクロホンＭＩＣ１及びＭＩＣ２はスレテオマイクを形成する。尚、図１に示す配置位置とは異なるが、マイクロホンＭＩＣ１及びＭＩＣ２は、例えば、表示ディスプレイが嵌め込まれた板状筐体の裏面（表示ディスプレイの反対側の面）に互いに近接して設置されうる。 FIG. 1 is an external perspective view of an imaging apparatus 1 according to an embodiment of the present invention. The imaging device 1 is a digital video camera that can also record audio. A microphone MIC1 is provided on the left side of the casing of the imaging apparatus 1, and a microphone MIC2 is provided on the right side of the casing of the imaging apparatus 1. The microphone MIC1 picks up the sound coming from the left direction of the imaging device 1, and the microphone MIC2 picks up the sound coming from the right direction of the imaging device 1, whereby the microphones MIC1 and MIC2 form a stereo microphone. Although different from the arrangement position shown in FIG. 1, the microphones MIC1 and MIC2 can be installed close to each other, for example, on the back surface of the plate-like housing in which the display display is fitted (the surface on the opposite side of the display display). .

図２に、撮像装置１の電気的構成を表す概略ブロック図を示す。撮像装置１は、マイクロホンＭＩＣ１及びＭＩＣ２の他、撮像部２、映像信号処理部３、音声信号処理部４及び記録媒体５を備える。図示を省略しているが、撮像装置１には、シャッタボタン及び録画ボタンを含む操作部、表示ディスプレイ、スピーカ、ＣＰＵ（Central Processing Unit）などが備えられている。 FIG. 2 is a schematic block diagram showing the electrical configuration of the imaging apparatus 1. The imaging device 1 includes an imaging unit 2, a video signal processing unit 3, an audio signal processing unit 4, and a recording medium 5 in addition to the microphones MIC 1 and MIC 2. Although not shown, the imaging apparatus 1 includes an operation unit including a shutter button and a recording button, a display display, a speaker, a CPU (Central Processing Unit), and the like.

撮像部２は、光学系と、ＣＣＤ（Charge Coupled Devices）又はＣＭＯＳ（Complementary Metal Oxide Semiconductor）イメージセンサなどの撮像素子とを含み、光学系を介して入射する光学像を電気信号に変換することによって該電気信号にて表される画像を取得する。映像信号処理部３は、その電気信号に基づき、撮像部２の取得画像を表す映像信号を生成する。撮像装置１に備えられた操作部（不図示）に対する操作に従って、該映像信号は、メモリカードや光ディスクなどの記録媒体５に記録される。 The imaging unit 2 includes an optical system and an imaging device such as a charge coupled device (CCD) or a complementary metal oxide semiconductor (CMOS) image sensor, and converts an optical image incident through the optical system into an electrical signal. An image represented by the electrical signal is acquired. The video signal processing unit 3 generates a video signal representing an acquired image of the imaging unit 2 based on the electrical signal. The video signal is recorded on a recording medium 5 such as a memory card or an optical disk in accordance with an operation on an operation unit (not shown) provided in the imaging apparatus 1.

マイクロホンＭＩＣ１及びＭＩＣ２は、夫々、自身が収音した音をアナログの電気信号に変換して出力する。マイクロホンＭＩＣ１及びＭＩＣ２の各出力信号は、音声信号処理部４内に設けられたＡ／Ｄ変換器（不図示）によってデジタル信号に変換され、音声信号処理部４は、このデジタル信号に所望の処理を施す。この処理を介して得た信号は、撮像装置１に備えられた操作部（不図示）に対する操作に従って記録媒体５に記録される。 Each of the microphones MIC1 and MIC2 converts the sound collected by itself into an analog electric signal and outputs it. Each output signal of the microphones MIC1 and MIC2 is converted into a digital signal by an A / D converter (not shown) provided in the audio signal processing unit 4, and the audio signal processing unit 4 converts the digital signal into a desired process. Apply. A signal obtained through this processing is recorded on the recording medium 5 in accordance with an operation on an operation unit (not shown) provided in the imaging apparatus 1.

マイクロホンＭＩＣ１及びＭＩＣ２は、夫々、振動体としての振動板（不図示）を有している。各振動板は、音波による空気振動によって振動する一方で振動板に作用した風圧によっても振動する。故に、振動板に音波と風圧が作用している時、振動板は音波と風圧に応じて振動する。マイクロホンＭＩＣ１及びＭＩＣ２の夫々は、各振動板の振動を電気信号に変換して出力する。マイクロホンの出力信号の内、風圧に由来する雑音（雑音成分）を風雑音という。風雑音は、音波として振動板に到来する雑音ではない。 The microphones MIC1 and MIC2 each have a diaphragm (not shown) as a vibrating body. Each diaphragm vibrates due to air vibration caused by sound waves, but also vibrates due to wind pressure acting on the diaphragm. Therefore, when the sound wave and the wind pressure act on the diaphragm, the diaphragm vibrates according to the sound wave and the wind pressure. Each of the microphones MIC1 and MIC2 converts the vibration of each diaphragm into an electrical signal and outputs it. Of the output signal of the microphone, noise (noise component) derived from wind pressure is called wind noise. Wind noise is not noise that arrives at the diaphragm as sound waves.

音声信号処理部４は、風雑音低減部６を備える。風雑音低減部６は、各マイクロホンの出力信号に基づく風雑音低減部６の入力信号から風雑音を低減し、風雑音が低減された音響信号を出力信号として出力する。 The audio signal processing unit 4 includes a wind noise reduction unit 6. The wind noise reduction unit 6 reduces wind noise from the input signal of the wind noise reduction unit 6 based on the output signal of each microphone, and outputs an acoustic signal with reduced wind noise as an output signal.

風雑音が存在する周波数帯域は比較的低く、通常、風雑音は３００Ｈｚ以下程度の帯域に集中して存在する。従って、風雑音低減部６は、３００Ｈｚを境界として取り扱い、３００Ｈｚ（ヘルツ）より小さい周波数帯域を「低帯域」として取り扱って、低帯域に対して風雑音を低減するための処理を施す。但し、強度は比較的小さいものの、３００Ｈｚ以上であって且つ低帯域に近い周波数帯域にも風雑音は存在する。そこで、風雑音低減部６は、３００Ｈｚ以上の周波数帯域を、更に、中帯域及び高帯域に分割して取り扱い、中帯域に対しても風雑音を低減するための処理を施す。具体的な数値例として、３００Ｈｚ以上であって且つ１．５ｋＨｚ（キロヘルツ）よりも小さい周波数帯域を「中帯域」として取り扱い、１．５ｋＨｚ以上の周波数帯域を「高帯域」として取り扱うこととする。 The frequency band in which wind noise exists is relatively low, and normally wind noise is concentrated in a band of about 300 Hz or less. Therefore, the wind noise reduction unit 6 treats 300 Hz as a boundary, treats a frequency band smaller than 300 Hz (Hertz) as a “low band”, and performs processing for reducing wind noise on the low band. However, although the intensity is relatively small, wind noise exists in a frequency band of 300 Hz or higher and close to a low band. Therefore, the wind noise reduction unit 6 further handles the frequency band of 300 Hz or more by dividing it into a medium band and a high band, and performs processing for reducing the wind noise for the medium band. As a specific numerical example, a frequency band of 300 Hz or more and smaller than 1.5 kHz (kilohertz) is treated as a “medium band”, and a frequency band of 1.5 kHz or more is treated as a “high band”.

そこで、風雑音低減部６では、風雑音を低減するための処理を２段に分割し、各処理を異なる帯域に対して適用する。２段の処理の内、一方の処理は、風雑音を含まない信号を復元する信号復元処理であり、他方の処理は、信号レベルを低減することによって風雑音を低減する信号低減処理である。 Therefore, the wind noise reduction unit 6 divides the process for reducing the wind noise into two stages, and applies each process to different bands. Of the two stages, one process is a signal restoration process that restores a signal that does not include wind noise, and the other process is a signal reduction process that reduces wind noise by reducing the signal level.

信号復元処理は、低帯域の信号に適用する。低帯域には強い風雑音とともに音の重要な要素が含まれているため、信号レベルを低減させるのではなく、風雑音を含まない信号を復元することで雑音除去を図る。信号復元処理を行えば、信号レベルを低減させる必要がなくなるため、音の歪みが生じにくくなる。 The signal restoration process is applied to a low-band signal. Since the low band includes important elements of sound along with strong wind noise, noise reduction is attempted by restoring a signal that does not include wind noise, rather than reducing the signal level. If the signal restoration process is performed, it is not necessary to reduce the signal level, so that distortion of sound is less likely to occur.

信号低減処理は、中帯域の信号に適用する。中帯域に対する風雑音の影響は小さいが、低帯域にのみ風雑音を低減するための処理を施して中帯域に対して風雑音低減対策を何ら施さなければ、比較的高い周波数の風雑音（コロコロといった音）が残存し、ユーザは違和感を覚える。但し、風雑音の影響が小さいが故に信号低減による音の歪みは少ないと想定され、音の要素に着目しても中帯域はピッチの高調波が存在する帯域であるので信号低減を行っても低帯域ほど歪みの影響を受けない。従って、上述の如く、中帯域の信号には信号低減処理を適用するようにする。 The signal reduction process is applied to a medium band signal. The effect of wind noise on the mid-band is small, but wind noise at a relatively high frequency (coro And the user feels uncomfortable. However, since the effects of wind noise are small, it is assumed that the sound distortion due to signal reduction is small, and even if attention is paid to sound elements, the middle band is a band where harmonics of pitch exist, so signal reduction can be performed. The lower the band, the less affected by distortion. Therefore, as described above, signal reduction processing is applied to the mid-band signal.

尚、中帯域に対しても信号復元処理を適用することが考えられるが、風雑音を含まない中帯域の信号を復元するためには、高帯域の信号中の高調波成分が必要となる。このような高調波成分は微弱であるため、良好な復元は困難である。故に、中帯域の信号には信号低減処理が適している。 Although it is conceivable to apply the signal restoration processing to the middle band as well, in order to restore the middle band signal that does not include wind noise, harmonic components in the high band signal are required. Since such harmonic components are weak, it is difficult to restore them satisfactorily. Therefore, signal reduction processing is suitable for a medium-band signal.

信号復元処理と信号低減処理を、どちらを先に行っても構わないし、夫々を並列に実行させてもよい。また、信号復元処理と信号低減処理の夫々は、時間軸上でも周波数軸上でも行うことができる。 Either the signal restoration process or the signal reduction process may be performed first, or each may be performed in parallel. Each of the signal restoration processing and the signal reduction processing can be performed on the time axis and the frequency axis.

また、風雑音の有無や強さを判定する風雑音判定部を設けるようにしても良い。風雑音判定部は、例えば、左右チャンネル間の相互相関を求めることにより風雑音の有無や強さを判定し、判定結果は、信号復元処理及び／又は信号低減処理に利用される。１つの風雑音判定部を、信号復元処理と信号低減処理で共有してもよいし、２つの風雑音判定部を設け、信号復元処理と信号低減処理の夫々に対して独立に風雑音判定部を割り当てるようにしてもよい。信号復元処理と信号低減処理の夫々に対して独立に風雑音判定部を割り当てる場合、各判定結果を相互利用することも可能である（詳細な具体例は後述）。 Further, a wind noise determination unit that determines the presence or absence and strength of wind noise may be provided. The wind noise determination unit determines, for example, the presence or intensity of wind noise by obtaining a cross-correlation between the left and right channels, and the determination result is used for signal restoration processing and / or signal reduction processing. One wind noise determination unit may be shared between the signal restoration process and the signal reduction process, or two wind noise determination units are provided, and the wind noise determination unit is independently provided for each of the signal restoration process and the signal reduction process. May be assigned. When the wind noise determination unit is independently assigned to each of the signal restoration process and the signal reduction process, it is possible to mutually use the respective determination results (detailed specific examples will be described later).

尚、相互相関とは、対比されるべき信号間における相互の相関性を意味する。後述の各実施例では、所定の演算を介して求めた相関値を相互相関を表す指標として取り扱っているが、相互相関の評価方法はこれに限定されない。 Cross-correlation means mutual correlation between signals to be compared. In each embodiment described later, a correlation value obtained through a predetermined calculation is handled as an index indicating cross-correlation, but the cross-correlation evaluation method is not limited to this.

以下に、風雑音低減部６に関与する具体的な実施例として、第１〜第５実施例を説明する。 Below, the 1st-5th Example is described as a specific Example in connection with the wind noise reduction part 6. FIG.

＜＜第１実施例＞＞
まず、第１実施例について説明する。第１実施例では、信号復元処理と信号低減処理を共に時間軸上で実行する。 << First Example >>
First, the first embodiment will be described. In the first embodiment, both the signal restoration process and the signal reduction process are executed on the time axis.

図３は、第１実施例に係る風雑音低減部６ａの内部ブロック図である。風雑音低減部６ａは、図２の風雑音低減部６として利用される。風雑音低減部６ａは、符号１１〜１５にて参照される各部位を備える。 FIG. 3 is an internal block diagram of the wind noise reduction unit 6a according to the first embodiment. The wind noise reduction unit 6a is used as the wind noise reduction unit 6 of FIG. The wind noise reduction part 6a is provided with each site | part referred with the codes | symbols 11-15.

風雑音低減部６ａに対する入力信号（入力音響信号）は、複数のチャンネル信号から成る時間軸上の音響信号（換言すれば、時間領域で表現された音響信号）である。具体的には、図２の音声信号処理部４が、マイクロホンＭＩＣ１及びＭＩＣ２からのアナログの出力信号を所定のサンプリング周波数にてデジタル信号に変換する。今、マイクロホンＭＩＣ１の出力信号に対応するデジタル信号を時系列で並べたチャンネル信号をＬ（ｔ）にて表し、マイクロホンＭＩＣ２の出力信号に対応するデジタル信号を時系列で並べたチャンネル信号をＲ（ｔ）にて表す。そして、マイクロホンＭＩＣ１の出力信号に対応するチャンネル信号をＬ信号と呼び、マイクロホンＭＩＣ２の出力信号に対応するチャンネル信号をＲ信号と呼ぶ。そうすると、図３の風雑音低減部６ａに対する入力信号は、Ｌ信号Ｌ（ｔ）及びＲ信号Ｒ（ｔ）から成る。この入力信号は風雑音低減部６ａによって補正される。従って、風雑音低減部６ａに対する入力信号を「原信号」と呼び、風雑音低減部６ａの出力信号を「補正信号」と呼ぶことにする。尚、以下、Ｌ信号Ｌ（ｔ）及びＲ信号Ｒ（ｔ）を、夫々、信号Ｌ（ｔ）及びＲ（ｔ）と略記することもある。 An input signal (input acoustic signal) to the wind noise reduction unit 6a is an acoustic signal on the time axis composed of a plurality of channel signals (in other words, an acoustic signal expressed in the time domain). Specifically, the audio signal processing unit 4 in FIG. 2 converts analog output signals from the microphones MIC1 and MIC2 into digital signals at a predetermined sampling frequency. Now, a channel signal in which digital signals corresponding to the output signal of the microphone MIC1 are arranged in time series is represented by L (t), and a channel signal in which digital signals corresponding to the output signal of the microphone MIC2 are arranged in time series is R ( t). A channel signal corresponding to the output signal of the microphone MIC1 is called an L signal, and a channel signal corresponding to the output signal of the microphone MIC2 is called an R signal. Then, the input signal to the wind noise reduction unit 6a in FIG. 3 includes the L signal L (t) and the R signal R (t). This input signal is corrected by the wind noise reduction unit 6a. Therefore, an input signal to the wind noise reduction unit 6a is referred to as an “original signal”, and an output signal of the wind noise reduction unit 6a is referred to as a “correction signal”. Hereinafter, the L signal L (t) and the R signal R (t) may be abbreviated as the signals L (t) and R (t), respectively.

信号Ｌ（ｔ）及びＲ（ｔ）は、ゼロを中心とした正負の値をとり、マイクロホンＭＩＣ１及びＭＩＣ２の振動板が振動しないとき、Ｌ（ｔ）及びＲ（ｔ）の値はゼロとなり（但し、オフセットやノイズ成分を無視）、それらの振動が大きくなるに従ってＬ（ｔ）及びＲ（ｔ）の振幅は大きくなる。 The signals L (t) and R (t) take positive and negative values centered on zero, and when the diaphragms of the microphones MIC1 and MIC2 do not vibrate, the values of L (t) and R (t) are zero ( However, offset and noise components are ignored), and the amplitudes of L (t) and R (t) increase as their vibration increases.

第１実施例において、原信号は、バンドパスフィルタ（以下、ＢＰＦという）２３及び３０、ローパスフィルタ（以下、ＬＰＦという）２１及び２６、並びに、ハイパスフィルタ（以下、ＨＰＦという）１４の夫々に対して入力される。 In the first embodiment, the original signal is supplied to band pass filters (hereinafter referred to as BPF) 23 and 30, low pass filters (hereinafter referred to as LPF) 21 and 26, and high pass filter (hereinafter referred to as HPF) 14, respectively. Is input.

風雑音判定部１１は、符号２１及び２２にて参照される各部位を備える。ＬＰＦ２１は、自身に対する入力信号の所定帯域成分を抽出して出力する。ＬＰＦ２１が抽出する帯域は、風雑音の周波数帯域を含み、典型的には上記の「低帯域」と同じとされる。但し、ＬＰＦ２１が抽出する帯域が、上記の「低帯域」と完全に合致する必要はなく、例えば２００Ｈｚより小さい周波数帯域をＬＰＦ２１が抽出する帯域としてもよい。 The wind noise determination unit 11 includes parts referred to by reference numerals 21 and 22. The LPF 21 extracts and outputs a predetermined band component of an input signal for itself. The band extracted by the LPF 21 includes the frequency band of wind noise, and is typically the same as the “low band” described above. However, the band extracted by the LPF 21 does not have to completely match the above-mentioned “low band”. For example, a frequency band smaller than 200 Hz may be a band extracted by the LPF 21.

ところで、風雑音低減部６ａ内の各部位は、複数のチャンネル信号の夫々に対して個別に必要な信号処理を行う。即ち例えば、ＬＰＦ２１は、Ｌ信号Ｌ（ｔ）の所定帯域成分とＲ信号Ｒ（ｔ）の所定帯域成分を抽出して出力する。これは、後述の風雑音低減部６ｂ及び６ｃについても同様に当てはまる。但し、当然ではあるが、相互相関演算によって相関値を算出する部位（本実施例において、相関値演算部２２）に対して、これは当てはまらない。 By the way, each part in the wind noise reduction unit 6a individually performs necessary signal processing on each of the plurality of channel signals. That is, for example, the LPF 21 extracts and outputs a predetermined band component of the L signal L (t) and a predetermined band component of the R signal R (t). This also applies to wind noise reduction units 6b and 6c described later. However, as a matter of course, this does not apply to the part (correlation value calculation unit 22 in the present embodiment) where the correlation value is calculated by cross-correlation calculation.

相関値演算部２２は、ＬＰＦ２１から出力されたチャンネル信号間の相互相関を表す相関値、即ち、ＬＰＦ２１によって抽出された帯域成分におけるチャンネル信号間の相関値を求める。具体的には、原信号及び原信号に基づく時間軸上の音響信号を所定間隔の単位区間で区切って考える。図４に示す如く、第１、第２、第３の単位区間、・・・、の順番で時間が進行するものとし、各単位区間には、離散化されたＮ個の信号（Ｎサンプルの信号）が含まれているとする。従って、１つの単位区間には、Ｎ個のＬ信号Ｌ（ｔ）とＮ個のＲ信号Ｒ（ｔ）が含まれる。 The correlation value calculation unit 22 obtains a correlation value representing a cross-correlation between channel signals output from the LPF 21, that is, a correlation value between channel signals in the band component extracted by the LPF 21. Specifically, the original signal and the acoustic signal on the time axis based on the original signal are considered by dividing them into unit intervals of a predetermined interval. As shown in FIG. 4, it is assumed that time advances in the order of the first, second, third unit intervals,..., And each unit interval includes N discrete signals (N samples). Signal). Accordingly, one unit section includes N L signals L (t) and N R signals R (t).

相関値演算部２２は、ＬＰＦ２１から出力される信号Ｌ（ｔ）及びＲ（ｔ）に基づき、各単位区間に対して、下記式（１）に従って相関値Ｋ［ｐ］を算出する。ここで、ｐは、単位区間の番号を表す。式（１）におけるＬ_i及びＲ_iは、夫々、着目した単位区間内における、ｉ番目のＬ信号Ｌ（ｔ）の値及びｉ番目のＲ信号Ｒ（ｔ）の値を示す。勿論、相関値演算部２２に対する信号Ｌ（ｔ）及びＲ（ｔ）は、ＬＰＦ２１を介して与えられるので、式（１）におけるＬ_i及びＲ_iは、ＬＰＦ２１の出力値にて規定される。 The correlation value calculation unit 22 calculates a correlation value K [p] for each unit section according to the following equation (1) based on the signals L (t) and R (t) output from the LPF 21. Here, p represents a unit interval number. L _i and R _i in equation (1) indicate the value of the i-th L signal L (t) and the value of the i-th R signal R (t), respectively, in the unit interval of interest. Of course, since the signals L (t) and R (t) for the correlation value calculation unit 22 are given through the LPF 21, L _i and R _i in the equation (1) are defined by the output value of the LPF 21.

風雑音は、左右のチャンネル信号間で相互相関がない。故に、原信号に含まれる風雑音が比較的多ければ相関値は比較的小さくなり、原信号に含まれる風雑音が比較的少なければ相関値は比較的大きくなる。従って、相関値Ｋ［ｐ］は、第ｐの単位区間における風雑音の強さに応じた値をとることになる。これを利用し、風雑音判定部１１は、相関値演算部２２が算出した相関値に基づいて各単位区間に対する風雑音の影響度合いを判定する。この判定結果は、信号復元部１２及び信号低減部１３での処理に利用される。 Wind noise has no cross-correlation between left and right channel signals. Therefore, the correlation value is relatively small if the wind noise contained in the original signal is relatively large, and the correlation value is relatively large if the wind noise contained in the original signal is relatively small. Accordingly, the correlation value K [p] takes a value corresponding to the wind noise intensity in the p-th unit interval. Using this, the wind noise determination unit 11 determines the degree of influence of wind noise on each unit section based on the correlation value calculated by the correlation value calculation unit 22. This determination result is used for processing in the signal restoration unit 12 and the signal reduction unit 13.

信号復元部１２は、符号２３〜２９にて参照される各部位を備える。信号復元部１２では、音声や楽器音の調波性を利用し、原信号の中帯域信号から低帯域に対する復元信号を生成する。 The signal restoration unit 12 includes each part referred to by reference numerals 23 to 29. The signal restoration unit 12 generates a restoration signal for the low band from the middle band signal of the original signal by using the harmonic nature of the voice or instrument sound.

調波性とは、周波数スペクトルが倍音構造で成り立っているという性質であり、音声や楽器音の多くはこの性質を有している。つまり、或る音の周波数スペクトルにおいて、最も低域側の成分の周波数をｆ０とすると、その音の周波数スペクトルは、ｆ０と、その倍音成分であるｆ０×２、ｆ０×３、ｆ０×４、・・・の周波数成分から形成される。この場合、ｆ０の周波数成分は基本波成分と呼ばれ、ｆ０×２、ｆ０×３、ｆ０×４、・・・の周波数成分は、夫々、２次、３次、４次、・・・の高調波成分と呼ばれる。 Harmonicity is the property that the frequency spectrum is composed of a harmonic structure, and many voices and instrument sounds have this property. That is, in the frequency spectrum of a certain sound, if the frequency of the lowest frequency component is f0, the frequency spectrum of the sound is f0 and its overtone components f0 × 2, f0 × 3, f0 × 4, Are formed from frequency components. In this case, the frequency component of f0 is called a fundamental wave component, and the frequency components of f0 × 2, f0 × 3, f0 × 4,... Are second order, third order, fourth order,. It is called a harmonic component.

調波性のある信号では、高次の高調波成分から基本波成分又は低次の高調波成分を復元できることが知られており、この復元に二乗処理、全波整流、半波整流などの非線形処理を利用できることが知られている（例えば、特開平８−１３０４９４号公報、特開平８−２７８８００号公報、特開平９−５５７７８号公報）。 For harmonic signals, it is known that the fundamental or low-order harmonic components can be restored from the higher-order harmonic components, and this restoration can be performed using nonlinear processing such as squaring, full-wave rectification, and half-wave rectification. It is known that processing can be used (for example, JP-A-8-130494, JP-A-8-278800, JP-A-9-55778).

図３の信号復元部１２でも、公知の任意の手法を用いて復元信号を生成することができる。具体的な例として、信号復元部１２では、符号２３〜２５にて参照される各部位によって復元信号が生成される。各部位の説明を行う。 The signal restoration unit 12 in FIG. 3 can also generate a restoration signal using any known method. As a specific example, in the signal restoration unit 12, a restoration signal is generated by each part referred to by reference numerals 23 to 25. Each part will be described.

ＢＰＦ２３は、自身に対する入力信号の所定帯域成分を抽出して出力する。低帯域の信号を復元するべく、ＢＰＦ２３が抽出する帯域は、上記の「中帯域」と同じとされる。但し、ＢＰＦ２３が抽出する帯域が、上記の「中帯域」と完全に合致する必要はない。 The BPF 23 extracts and outputs a predetermined band component of an input signal for itself. In order to restore a low-band signal, the band extracted by the BPF 23 is the same as the “medium band” described above. However, the band extracted by the BPF 23 does not have to completely match the above “medium band”.

非線形処理部２４は、ＢＰＦ２３を通過した信号（ＢＰＦ２３によって抽出された信号）に対して非線形処理を施す。この非線形処理は、二乗処理、全波整流（絶対値処理）又は半波整流などである。二乗処理を用いる場合、非線形処理部２４は、ＢＰＦ２３を通過した信号を二乗して出力する。人の声がマイクロホンＭＩＣ１及びＭＩＣ２に収音された場合、ＢＰＦ２３を通過した信号には音声のピッチ信号の高調波成分が含まれるが、この通過した信号を二乗することにより各高調波成分の周波数の差及び和に相当する周波数の信号が生成される。つまり、二乗処理によって、ＢＰＦ２３の通過帯域の低周波側及び高周波側にも調波成分（基本波成分又は高調波成分）が生成される。尚、二乗処理を用いる場合、生成される調波成分の振幅は、本来求めたい調波成分のそれを二乗したものになっている。このため、二乗処理を用いる場合、非線形処理部２４は、ＢＰＦ２３を通過した信号を二乗することによって得られる二乗信号に対して正規化等を行い、それによって振幅が調整された二乗信号を出力するようにする。 The nonlinear processing unit 24 performs nonlinear processing on the signal that has passed through the BPF 23 (the signal extracted by the BPF 23). This non-linear processing is square processing, full-wave rectification (absolute value processing), half-wave rectification, or the like. When square processing is used, the nonlinear processing unit 24 squares and outputs the signal that has passed through the BPF 23. When a human voice is picked up by the microphones MIC1 and MIC2, the signal that has passed through the BPF 23 contains harmonic components of the pitch signal of the voice. By squaring the passed signal, the frequency of each harmonic component is obtained. A signal having a frequency corresponding to the difference and the sum is generated. That is, harmonic components (fundamental wave components or harmonic components) are also generated on the low frequency side and the high frequency side of the passband of the BPF 23 by the square process. When the square process is used, the amplitude of the generated harmonic component is the square of that of the harmonic component to be originally obtained. Therefore, when using the square process, the nonlinear processing unit 24 performs normalization on the square signal obtained by squaring the signal that has passed through the BPF 23, and outputs a square signal whose amplitude is adjusted thereby. Like that.

非線形処理として、全波整流（絶対値処理）又は半波整流を用いる場合も同様である。例えば、全波整流を用いる場合、非線形処理部２４は、ＢＰＦ２３を通過した信号の絶対値を算出して出力する。 The same applies when full-wave rectification (absolute value processing) or half-wave rectification is used as the nonlinear processing. For example, when using full-wave rectification, the nonlinear processing unit 24 calculates and outputs the absolute value of the signal that has passed through the BPF 23.

信号復元部１２では、復元された信号の内、低帯域の信号成分のみを利用する。このため、ＬＰＦ２５は、非線形処理部２４の出力信号の低帯域成分のみを通過させる。ＬＰＦ２５の出力信号は、原信号中の中帯域の音響信号から復元された低帯域の音響信号である。そして、復元の元となる中帯域には風雑音が殆ど含まれていないため、復元された低帯域の音響信号にも、風雑音が殆ど含まれてない。つまり、符号２３〜２５にて参照される各部位によって、原信号の低帯域の音響信号よりも風雑音が低減された低帯域の音響信号が復元される。 The signal restoration unit 12 uses only the low-band signal component of the restored signal. For this reason, the LPF 25 passes only the low-band component of the output signal of the nonlinear processing unit 24. The output signal of the LPF 25 is a low-band acoustic signal restored from the middle-band acoustic signal in the original signal. And since the wind band is hardly contained in the middle band that is the source of restoration, the restored low-band acoustic signal contains almost no wind noise. That is, a low-band acoustic signal in which wind noise is reduced as compared with the low-band acoustic signal of the original signal is restored by each part referred to by reference numerals 23 to 25.

他方、信号復元部１２では、ＬＰＦ２６を用いてオリジナルの低帯域信号を用意しておく。つまり、自身に対する入力信号の低帯域成分のみを通過させるＬＰＦ２６を用い、ＬＰＦ２６から原信号の低帯域成分のみを出力させる。 On the other hand, the signal restoration unit 12 prepares an original low-band signal using the LPF 26. That is, the LPF 26 that passes only the low-band component of the input signal to itself is used, and only the low-band component of the original signal is output from the LPF 26.

乗算器２７及び２８及び加算器２９は、相関値演算部２２にて算出された相関値に応じてＬＰＦ２５及び２６の各出力信号値を加重加算することにより、信号復元部１２の出力信号（第１補正音響信号）を生成する。第ｐの単位区間におけるＬＰＦ２６の出力信号値をLPF_OUT_O（ｔ）にて表し、且つ、第ｐの単位区間におけるＬＰＦ２５の出力信号値をLPF_OUT_R（ｔ）にて表した場合、第ｐの単位区間に対応する信号復元部１２の出力信号値OUT₁₂（ｔ）は、下記式（２）にて表される。 The multipliers 27 and 28 and the adder 29 perform weighted addition of the output signal values of the LPFs 25 and 26 according to the correlation value calculated by the correlation value calculation unit 22, thereby outputting the output signal (first signal) of the signal restoration unit 12. 1 corrected acoustic signal) is generated. When the output signal value of the LPF 26 in the p-th unit interval is expressed by LPF_OUT _O (t) and the output signal value of the LPF 25 in the p-th unit interval is expressed by LPF_OUT _R (t), the p-th unit The output signal value OUT ₁₂ (t) of the signal restoration unit 12 corresponding to the section is expressed by the following equation (2).

つまり、相関値が比較的大きい場合は風雑音が比較的小さいと判断されるため、信号復元部１２の出力信号に対するオリジナルの低帯域信号の寄与度を増大させる。一方、相関値が比較的小さいは風雑音が比較的大きいと判断されるため、信号復元部１２の出力信号に対する復元信号（復元信号の低帯域信号）の寄与度を増大させる。 That is, when the correlation value is relatively large, it is determined that the wind noise is relatively small, so that the contribution of the original low-band signal to the output signal of the signal restoration unit 12 is increased. On the other hand, since it is determined that the wind noise is relatively large when the correlation value is relatively small, the degree of contribution of the restoration signal (the low-band signal of the restoration signal) to the output signal of the signal restoration unit 12 is increased.

尚、上記式（１）の算出式からも分かるように、相関値Ｋ［ｐ］は、不等式「０≦Ｋ［ｐ］≦１」を満たす。このため、OUT₁₂（ｔ）の算出に相関値Ｋ［ｐ］をそのまま用いているが、「０≦Ｋ［ｐ］≦１」を満たさない場合は式（２）を適宜変更すればよい。また、着目した単位区間に関し、相関値が所定の基準閾値よりも大きい場合は、風雑音はないと判断してＬＰＦ２６の出力信号をそのまま信号復元部１２の出力信号とするようにしてもよい。 As can be seen from the calculation formula of the above formula (1), the correlation value K [p] satisfies the inequality “0 ≦ K [p] ≦ 1”. For this reason, the correlation value K [p] is used as it is for the calculation of OUT ₁₂ (t), but if “0 ≦ K [p] ≦ 1” is not satisfied, Equation (2) may be changed as appropriate. Further, when the correlation value is larger than a predetermined reference threshold with respect to the focused unit section, it may be determined that there is no wind noise and the output signal of the LPF 26 may be directly used as the output signal of the signal restoration unit 12.

信号低減部１３は、符号３０及び３１にて参照される各部位を備える。ＢＰＦ３０は、自身に対する入力信号の中帯域成分を抽出して出力する。乗算器３１は、単位区間ごとに、ＢＰＦ３０を通過した信号（即ち、原信号から抽出された中帯域の音響信号）のレベルを相関値演算部２２にて算出された相関値に応じた低減率にて低減させ、低減後の信号を信号低減部１３の出力信号として出力する。或る信号のレベルとは、その信号の振幅（強度）を表している。 The signal reduction unit 13 includes portions referred to by reference numerals 30 and 31. The BPF 30 extracts and outputs the middle band component of the input signal for itself. For each unit section, the multiplier 31 reduces the level of the signal that has passed through the BPF 30 (that is, the middle-band acoustic signal extracted from the original signal) according to the correlation value calculated by the correlation value calculator 22. The reduced signal is output as an output signal of the signal reduction unit 13. The level of a certain signal represents the amplitude (intensity) of the signal.

この際、相関値から風雑音の影響が大きいと判断される場合はレベルを大きく低減させ、相関値から風雑音の影響が小さいと判断される場合はレベルをあまり低減させないようにする。つまり、第ｐの単位区間に着目した場合、相関値Ｋ［ｐ］が減少するに従って第ｐの単位区間に対応する低減率を増大させてより大きくレベルを低減させるようにする（逆に考えれば、相関値Ｋ［ｐ］が増大するに従って第ｐの単位区間に対応する低減率を減少させる）。乗算器３１による信号低減によって信号低減部１３の出力信号（第２補正音響信号）に含まれる風雑音は適切に低減されることになる。 At this time, if it is determined from the correlation value that the influence of wind noise is large, the level is greatly reduced, and if it is determined from the correlation value that the influence of wind noise is small, the level is not reduced so much. That is, when paying attention to the p-th unit interval, the level is reduced more greatly by increasing the reduction rate corresponding to the p-th unit interval as the correlation value K [p] decreases (conversely, As the correlation value K [p] increases, the reduction rate corresponding to the p-th unit interval is decreased). By the signal reduction by the multiplier 31, the wind noise included in the output signal (second corrected acoustic signal) of the signal reduction unit 13 is appropriately reduced.

尚、同じ結果を導くのであれば、低減させる手法は任意である。例えば、相関値演算部２２にて算出された相関値又は該相関値に応じた係数をＢＰＦ３０の出力信号に乗じればよい。 In addition, if the same result is led, the method of reducing is arbitrary. For example, the output value of the BPF 30 may be multiplied by the correlation value calculated by the correlation value calculation unit 22 or a coefficient corresponding to the correlation value.

ＨＰＦ１４は、自身に対する入力信号の高帯域成分のみを通過させる。 The HPF 14 passes only the high band component of the input signal to itself.

信号合成部１５は、信号復元処理によって風雑音が低減された低帯域の音響信号を表す信号復元部１２の出力信号と、信号低減処理によって風雑音が低減された中帯域の音響信号を表す信号低減部１３の出力信号と、ＨＰＦ１４の出力信号とを加算し、この加算によって得られた信号を風雑音低減部６ａの出力信号（即ち、補正信号）として出力する。第１実施例において、この補正信号も原信号と同じく複数のチャンネル信号から成る時間軸上の音響信号である。 The signal synthesis unit 15 outputs an output signal of the signal restoration unit 12 that represents a low-band acoustic signal in which wind noise has been reduced by signal restoration processing, and a signal that represents an intermediate-band acoustic signal in which wind noise has been reduced by signal reduction processing. The output signal of the reduction unit 13 and the output signal of the HPF 14 are added, and a signal obtained by this addition is output as an output signal (that is, a correction signal) of the wind noise reduction unit 6a. In the first embodiment, this correction signal is also an acoustic signal on the time axis composed of a plurality of channel signals, like the original signal.

尚、信号復元部１２、信号低減部１３及びＨＰＦ１４における各信号遅延量が異なる場合は、信号合成部１５内または信号合成部１５の前段にて、それらの信号遅延量間の差を打ち消す遅延処理を行ってから、信号合成部１５の加算処理を行うようにする。これは、乗算器２７及び２８及び加算器２９を用いた加重加算処理においても同様である。また、信号復元部１２による信号復元処理と信号低減部１３による信号低減処理の前に相関値を算出する必要はあるが、信号復元処理と信号低減処理の処理順序に対して特に制約はない。 When the signal delay amounts in the signal restoration unit 12, the signal reduction unit 13, and the HPF 14 are different, a delay process that cancels the difference between the signal delay amounts in the signal synthesis unit 15 or in the previous stage of the signal synthesis unit 15. Then, the signal synthesizer 15 performs the addition process. The same applies to the weighted addition processing using the multipliers 27 and 28 and the adder 29. Further, it is necessary to calculate a correlation value before the signal restoration process by the signal restoration unit 12 and the signal reduction process by the signal reduction unit 13, but there is no particular limitation on the processing order of the signal restoration process and the signal reduction process.

図２の音声信号処理部４は、信号合成部１５から出力される補正信号に所定の符号化処理（音声圧縮処理）を施して得た信号を記録媒体５に記録する。所定の符号化処理とは、例えば、ＭＰＥＧ（Moving Picture Experts Group）の規格に従ったＡＡＣ (Advanced Audio Coding)である。 The audio signal processing unit 4 in FIG. 2 records a signal obtained by performing a predetermined encoding process (audio compression process) on the correction signal output from the signal synthesis unit 15 on the recording medium 5. The predetermined encoding process is, for example, AAC (Advanced Audio Coding) according to the MPEG (Moving Picture Experts Group) standard.

上述の説明では、原則として、Ｌ信号とＲ信号に対する信号処理を個別に分けて記述していないが、上述したように、風雑音低減部６ａ内の各部位は、複数のチャンネル信号の夫々に対して個別に必要な信号処理を行う。 In the above description, in principle, the signal processing for the L signal and the R signal is not described separately. However, as described above, each part in the wind noise reduction unit 6a is assigned to each of a plurality of channel signals. On the other hand, necessary signal processing is performed individually.

つまり、ＬＰＦ２１は、原信号を形成するＬ信号及びＲ信号の各所定帯域成分（典型的には低帯域成分）を抽出して出力する。ＢＰＦ２３は、原信号を形成するＬ信号及びＲ信号の各所定帯域成分（典型的には中帯域成分）を抽出して出力する。非線形処理部２４は、ＢＰＦ２３を介して与えられるＬ信号及びＲ信号に対して個別に非線形処理を行い、ＬＰＦ２５は、この非線形処理後のＬ信号及びＲ信号の各低帯域成分のみを通過させる。ＬＰＦ２６は、原信号を形成するＬ信号及びＲ信号の各低帯域成分のみを通過させる。乗算器２７及び２８並びに加算器２９は、ＬＰＦ２５から出力されるＬ信号とＬＰＦ２６から出力されるＬ信号の加重加算処理を行うと共にＬＰＦ２５から出力されるＲ信号とＬＰＦ２６から出力されるＲ信号の加重加算処理を行う。
ＢＰＦ３０は、原信号を形成するＬ信号及びＲ信号の各中帯域成分を抽出して出力し、乗算器３１は、ＢＰＦ３０を通過したＬ信号及びＲ信号の各レベルを相関値に応じた低減率にて低減させる（低減率を決定する相関値は、Ｌ信号及びＲ信号間で共通である）。
ＨＰＦ１４は、原信号を形成するＬ信号及びＲ信号の各高帯域成分のみを通過させる。信号合成部１５は、信号復元部１２の出力信号におけるＬ信号と信号低減部１３の出力信号におけるＬ信号とＨＰＦ１４の出力信号におけるＬ信号とを加算する共に信号復元部１２の出力信号におけるＲ信号と信号低減部１３の出力信号におけるＲ信号とＨＰＦ１４の出力信号におけるＲ信号とを加算することによって、補正信号を生成する。 That is, the LPF 21 extracts and outputs each predetermined band component (typically a low band component) of the L signal and the R signal forming the original signal. The BPF 23 extracts and outputs each predetermined band component (typically a middle band component) of the L signal and the R signal forming the original signal. The nonlinear processing unit 24 individually performs nonlinear processing on the L signal and the R signal given through the BPF 23, and the LPF 25 passes only the low-band components of the L signal and R signal after the nonlinear processing. The LPF 26 passes only the low-band components of the L signal and R signal forming the original signal. The multipliers 27 and 28 and the adder 29 perform weighted addition processing of the L signal output from the LPF 25 and the L signal output from the LPF 26 and weight the R signal output from the LPF 25 and the R signal output from the LPF 26. Addition processing is performed.
The BPF 30 extracts and outputs each of the middle band components of the L signal and the R signal forming the original signal, and the multiplier 31 reduces the levels of the L signal and the R signal that have passed through the BPF 30 according to the correlation value. (The correlation value that determines the reduction rate is common between the L signal and the R signal).
The HPF 14 passes only the high-band components of the L signal and R signal forming the original signal. The signal synthesis unit 15 adds the L signal in the output signal of the signal restoration unit 12, the L signal in the output signal of the signal reduction unit 13, and the L signal in the output signal of the HPF 14, and also the R signal in the output signal of the signal restoration unit 12. And the R signal in the output signal of the signal reduction unit 13 and the R signal in the output signal of the HPF 14 are added to generate a correction signal.

また、風雑音低減部６ａから、風雑音判定部１１を省略することも可能である。風雑音判定部１１を省略した場合、乗算器２７及び２８及び加算器２９は、ＬＰＦ２５及び２６の各出力信号値を一定の比率で加重加算することにより、信号復元部１２の出力信号（第１補正音響信号）を生成する。つまり、この場合、上記式（２）におけるＫ［ｐ］は固定値とされる。また、風雑音判定部１１を省略した場合、乗算器３１は、ＢＰＦ３０を通過した信号のレベルを、一定の低減率にて低減させ、低減後の信号を信号低減部１３の出力信号として出力する。尚、風雑音判定部１１が省略された場合、風雑音低減部６ａに対する入力信号は１つのチャンネル信号から成るモノラル信号であってもよい。 In addition, the wind noise determination unit 11 can be omitted from the wind noise reduction unit 6a. When the wind noise determination unit 11 is omitted, the multipliers 27 and 28 and the adder 29 perform weighted addition of the output signal values of the LPFs 25 and 26 at a constant ratio, thereby generating an output signal (first signal) of the signal restoration unit 12. Corrected acoustic signal) is generated. That is, in this case, K [p] in the above equation (2) is a fixed value. When the wind noise determination unit 11 is omitted, the multiplier 31 reduces the level of the signal that has passed through the BPF 30 at a constant reduction rate, and outputs the reduced signal as an output signal of the signal reduction unit 13. . When the wind noise determination unit 11 is omitted, the input signal to the wind noise reduction unit 6a may be a monaural signal composed of one channel signal.

また、上述の例では、ＢＰＦ２３、非線形処理部２４及びＬＰＦ２５が、Ｌ信号とＲ信号に対して個別に必要な処理を行うことにより、Ｌ信号についての復元信号とＲ信号についての復元信号を生成することとしているが、原信号を形成するＬ信号とＲ信号からモノラル信号を生成し、該モノラル信号に基づいてモノラルの復元信号を生成するようにしてもよい。信号のモノラル化は、原信号から復元信号を生成する過程内の何れの段階でも行うことができる。通常は、ＢＰＦ２３の前段において、原信号を形成するＬ信号とＲ信号を平均化することによりモノラル信号を生成し、該モノラル信号をＢＰＦ２３に与えればよい。モノラルの復元信号は、Ｌ信号用の復元信号として利用されると共にＲ信号用の復元信号としても利用される。モノラル信号からモノラルの復元信号を生成するようにすれば、信号処理が１チャンネル分ですみ、処理の簡素化が図られる。低帯域ではステレオ感が乏しいため、モノラルの復元信号を利用しても問題は少ない。モノラルの復元信号を生成するという技術的事項を、後述の他の実施例に当てはめることも可能である。 In the above example, the BPF 23, the non-linear processing unit 24, and the LPF 25 perform necessary processing individually on the L signal and the R signal, thereby generating a restoration signal for the L signal and a restoration signal for the R signal. However, a monaural signal may be generated from the L signal and R signal forming the original signal, and a monaural restoration signal may be generated based on the monaural signal. Signal monauralization can be performed at any stage in the process of generating a restored signal from the original signal. Normally, a monaural signal is generated by averaging the L signal and the R signal forming the original signal before the BPF 23, and the monaural signal is supplied to the BPF 23. The monaural restoration signal is used as a restoration signal for the L signal and also as a restoration signal for the R signal. If a monaural restoration signal is generated from a monaural signal, signal processing is only required for one channel, and the processing can be simplified. Since the sense of stereo is poor in the low band, there are few problems even if a monaural restoration signal is used. The technical matter of generating a monaural restoration signal can also be applied to other embodiments described later.

［信号低減処理の変形例］
また、上述の信号低減処理では、乗算器３１を用い、ＢＰＦ３０を通過した信号のレベルを相関値Ｋ［ｐ］に応じた低減率にて低減させているが、信号レベルの低減方法はこれに限定されない。例として、第１実施例の信号低減処理に関する、第１及び第２の変形信号低減処理を例示する。尚、以下、Ｌ信号に対応するチャンネルをＬチャンネルと呼び、Ｒ信号に対応するチャンネルをＲチャンネルと呼ぶ。 [Modification of signal reduction processing]
In the signal reduction processing described above, the multiplier 31 is used to reduce the level of the signal that has passed through the BPF 30 at a reduction rate corresponding to the correlation value K [p]. It is not limited. As an example, first and second modified signal reduction processes related to the signal reduction process of the first embodiment will be exemplified. Hereinafter, a channel corresponding to the L signal is referred to as an L channel, and a channel corresponding to the R signal is referred to as an R channel.

まず、第１の変形信号低減処理を説明する。第１の変形信号低減処理において、信号低減部１３は、相関値Ｋ［ｐ］を所定の閾値Ｋ_THAと比較する。上述の如く、相関値Ｋ［ｐ］は、第ｐの単位区間における風雑音の影響度合いを表している。一方で、閾値Ｋ_THAは、その影響度合いと対比されるべき基準影響度合いを表している。相関値Ｋ［ｐ］が閾値Ｋ_THAよりも小さい場合、第ｐの単位区間おける風雑音の影響が比較的大きいと判断され、相関値Ｋ［ｐ］が閾値Ｋ_THAよりも大きい場合、それが比較的小さいと判断される（第２の変形信号低減処理についても同様）。 First, the first deformation signal reduction process will be described. In the first modified signal reduction process, the signal reduction unit 13 compares the correlation value K [p] with a predetermined threshold value K _THA . As described above, the correlation value K [p] represents the degree of wind noise influence in the p-th unit interval. On the other hand, the threshold value K _THA represents a reference influence degree to be compared with the influence degree. If the correlation value K [p] is smaller than the threshold value K _THA , it is judged that the influence of wind noise in the p-th unit interval is relatively large. If the correlation value K [p] is larger than the threshold value K _THA , It is determined that it is relatively small (the same applies to the second deformation signal reduction process).

そして、相関値Ｋ［ｐ］が閾値Ｋ_THAよりも小さい場合、信号低減部１３は、ＢＰＦ３０を通過したＬ信号及びＲ信号を平均化し、この平均化によって得られたモノラル信号を信号低減部１３の出力信号として信号合成部１５に与える。今、ＢＰＦ３０を通過した、第ｐの単位区間におけるＬ信号及びＲ信号の信号値を夫々BPF_OUT_L（ｔ）及びBPF_OUT_R（ｔ）で表し、且つ、信号低減部１３から出力される、第ｐの単位区間におけるＬ信号及びＲ信号の信号値を夫々BPF_OUT_L’（ｔ）及びBPF_OUT_R’（ｔ）で表す。そうすると、相関値Ｋ［ｐ］が閾値Ｋ_THAよりも小さいとき、「BPF_OUT_L’（ｔ）＝BPF_OUT_R’（ｔ）＝（BPF_OUT_L（ｔ）＋BPF_OUT_R（ｔ））／２」にて表される信号が信号低減部１３から出力される。 When the correlation value K [p] is smaller than the threshold value K _THA , the signal reduction unit 13 averages the L signal and the R signal that have passed through the BPF 30, and the monaural signal obtained by this averaging is used as the signal reduction unit 13. To the signal synthesizer 15 as an output signal. Now, the signal values of the L signal and the R signal in the p-th unit interval that have passed through the BPF 30 are represented by BPF_OUT _L (t) and BPF_OUT _R (t), respectively, and are output from the signal reduction unit 13. The signal values of the L signal and the R signal in the unit interval are represented by BPF_OUT _L ′ (t) and BPF_OUT _R ′ (t), respectively. Then, when the correlation value K [p] is smaller than the threshold value K _THA, it is expressed by “BPF_OUT _L '(t) = BPF_OUT _R ′ (t) = (BPF_OUT _L (t) + BPF_OUT _R (t)) / 2”. Is output from the signal reduction unit 13.

風雑音は、乱気流によって各チャンネルでランダムに発生する雑音であるため、風雑音の影響が強いチャンネルもあれば弱いチャンネルも存在する。上記の平均化を行えば、風雑音の影響が各チャンネル間で均等になり、風雑音の影響を比較的多く受けているチャンネルの雑音レベルが低減する。 Since wind noise is noise that is randomly generated in each channel due to turbulence, there are channels that are strongly influenced by wind noise and channels that are weak. If the above averaging is performed, the influence of wind noise is equalized between the channels, and the noise level of a channel that is relatively affected by wind noise is reduced.

図１２に、この平均化による信号低減処理の概念図を示す。第ｐの単位区間について議論する。図１２に示す例では、風雑音の影響がＬチャンネルに対して比較的多くＲチャンネルに対して比較的小さくなっており、これに由来して、ＢＰＦ３０を通過したＬ信号の信号レベルはＢＰＦ３０を通過したＲ信号のそれよりも大きくなっている。このような場合に上記の平均化を行うと、ＢＰＦ３０を通過したＬ信号及びＲ信号に含まれる風雑音成分も平均化され、結果として、ＢＰＦ３０の通過信号の内、Ｌ信号の信号レベルが低減される。 FIG. 12 shows a conceptual diagram of signal reduction processing by this averaging. The p-th unit interval will be discussed. In the example shown in FIG. 12, the influence of wind noise is relatively large for the L channel and relatively small for the R channel. Accordingly, the signal level of the L signal that has passed through the BPF 30 is equal to that of the BPF 30. It is larger than that of the R signal that has passed. If the above averaging is performed in such a case, the wind noise components included in the L signal and the R signal that have passed through the BPF 30 are also averaged, and as a result, the signal level of the L signal among the passing signals of the BPF 30 is reduced. Is done.

一方、相関値Ｋ［ｐ］が所定の閾値Ｋ_THAよりも大きい場合は、上記の平均化を行わず、ＢＰＦ３０を通過したＬ信号及びＲ信号を、そのまま、信号低減部１３から出力されるべきＬ信号及びＲ信号として信号合成部１５に与えるとよい。或いは、乗算器３１を用いた、原則通りの信号低減処理を行うようにしてもよい。即ち、相関値Ｋ［ｐ］が所定の閾値Ｋ_THAよりも大きい場合は、乗算器３１を用い、ＢＰＦ３０を通過したＬ信号及びＲ信号のレベルを相関値Ｋ［ｐ］に応じた低減率にて低減させ、これによって得た信号を信号低減部１３の出力信号としてもよい。 On the other hand, when the correlation value K [p] is larger than the predetermined threshold value K _THA , the L signal and the R signal that have passed through the BPF 30 should be output as they are from the signal reduction unit 13 without performing the above averaging. It is good to give to the signal composition part 15 as L signal and R signal. Alternatively, signal reduction processing as in principle using the multiplier 31 may be performed. That is, when the correlation value K [p] is larger than the predetermined threshold value K _THA , the multiplier 31 is used to reduce the level of the L signal and the R signal that have passed through the BPF 30 to a reduction rate according to the correlation value K [p]. The signal obtained by this may be used as the output signal of the signal reduction unit 13.

次に、第２の変形信号低減処理を説明する。第１の変形信号低減処理を利用した場合、図１２からも理解されるように、信号低減処理によって、風雑音の影響が比較的小さいチャンネル（図１２においてＲチャンネル）の風雑音成分が増加する。これを回避するため、第２の変形信号低減処理では、左右チャンネルの内、信号レベルが小さい方のチャンネルの信号にて信号レベルが大きい方のチャンネルの信号を置き換える。 Next, the second deformation signal reduction process will be described. When the first modified signal reduction process is used, the wind noise component of the channel (the R channel in FIG. 12) where the influence of the wind noise is relatively small is increased by the signal reduction process, as can be understood from FIG. . In order to avoid this, in the second modified signal reduction process, the signal of the channel with the higher signal level is replaced with the signal of the channel with the lower signal level of the left and right channels.

より具体的には、第２の変形信号低減処理において、信号低減部１３は、相関値Ｋ［ｐ］を所定の閾値Ｋ_THAと比較する。そして、相関値Ｋ［ｐ］が閾値Ｋ_THAよりも小さい場合は、ＢＰＦ３０を通過したＬ信号とＲ信号の内、信号レベルが小さい方の信号を最小音響信号として且つ他方の信号を非最小音響信号として特定し、非最小音響信号を最小音響信号にて置き換える。即ち、相関値Ｋ［ｐ］が閾値Ｋ_THAよりも小さい場合において、ＢＰＦ３０を通過したＬ信号とＲ信号の内、例えばＲ信号が最小音響信号として特定されたときは、「BPF_OUT_L’（ｔ）＝BPF_OUT_R’（ｔ）＝BPF_OUT_R（ｔ）」にて表される信号が信号低減部１３から出力される。 More specifically, in the second modified signal reduction process, the signal reduction unit 13 compares the correlation value K [p] with a predetermined threshold value K _THA . When the correlation value K [p] is smaller than the threshold value K _THA , the signal having the smaller signal level of the L signal and the R signal that have passed through the BPF 30 is set as the minimum acoustic signal and the other signal is set as the non-minimum acoustic signal. Identify as signal and replace non-minimum acoustic signal with minimum acoustic signal. That is, when the correlation value K [p] is smaller than the threshold value K _THA , among the L signal and R signal that have passed through the BPF 30, for example, the R signal is specified as the minimum acoustic signal, “BPF_OUT _L ′ (t ) = BPF_OUT _R '(t) = BPF_OUT _R (t) "is output from the signal reduction unit 13.

図１３に、この置き換えによる信号低減処理の概念図を示す。第ｐの単位区間について議論する。図１３に示す例では、風雑音の影響がＬチャンネルに対して比較的多くＲチャンネルに対して比較的小さくなっており、これに由来して、ＢＰＦ３０を通過したＬ信号の信号レベルはＢＰＦ３０を通過したＲ信号のそれよりも大きくなっている。このような場合に上記の置き換えを行うと、ＢＰＦ３０を通過したＬ信号に含まれる風雑音成分が低減される（Ｒ信号に対しては変化なし）。これにより、風雑音の影響が比較的少ないチャンネルの雑音レベルを増大させることなく、風雑音の影響を比較的多く受けているチャンネルの雑音レベルを低減させることができる。 FIG. 13 shows a conceptual diagram of signal reduction processing by this replacement. The p-th unit interval will be discussed. In the example shown in FIG. 13, the influence of wind noise is relatively large for the L channel and relatively small for the R channel. Accordingly, the signal level of the L signal that has passed through the BPF 30 is the same as that of the BPF 30. It is larger than that of the R signal that has passed. When the above replacement is performed in such a case, the wind noise component included in the L signal that has passed through the BPF 30 is reduced (no change with respect to the R signal). As a result, the noise level of a channel that is relatively affected by wind noise can be reduced without increasing the noise level of a channel that is relatively less affected by wind noise.

一方、相関値Ｋ［ｐ］が所定の閾値Ｋ_THAよりも大きい場合は、上記の置き換えを行わず、ＢＰＦ３０を通過したＬ信号及びＲ信号を、そのまま、信号低減部１３から出力されるべきＬ信号及びＲ信号として信号合成部１５に与えるとよい。或いは、乗算器３１を用いた、原則通りの信号低減処理を行うようにしてもよい。即ち、相関値Ｋ［ｐ］が所定の閾値Ｋ_THAよりも大きい場合は、乗算器３１を用い、ＢＰＦ３０を通過したＬ信号及びＲ信号のレベルを相関値Ｋ［ｐ］に応じた低減率にて低減させ、これによって得た信号を信号低減部１３の出力信号としてもよい。 On the other hand, when the correlation value K [p] is larger than the predetermined threshold value K _THA , the above-described replacement is not performed, and the L signal and the R signal that have passed through the BPF 30 are output from the signal reduction unit 13 as they are. It is good to give to the signal composition part 15 as a signal and R signal. Alternatively, signal reduction processing as in principle using the multiplier 31 may be performed. That is, when the correlation value K [p] is larger than the predetermined threshold value K _THA , the multiplier 31 is used to reduce the level of the L signal and the R signal that have passed through the BPF 30 to a reduction rate according to the correlation value K [p]. The signal obtained by this may be used as the output signal of the signal reduction unit 13.

＜＜第２実施例＞＞
次に、第２実施例について説明する。第２実施例では、信号復元処理と信号低減処理を共に周波数軸上で実行する。 << Second Example >>
Next, a second embodiment will be described. In the second embodiment, both signal restoration processing and signal reduction processing are executed on the frequency axis.

図５は、第２実施例に係る風雑音低減部６ｂの内部ブロック図である。風雑音低減部６ｂは、図２の風雑音低減部６として利用される。風雑音低減部６ｂは、低帯域用の風雑音判定部として機能する相関値演算部５１と、中帯域用の風雑音判定部５２と、信号低減部５３と、信号復元部５４と、信号合成部５５と、を備える。風雑音判定部５２は、ｎ個の相関値演算部５２＿１、５２＿２、・・・、５２＿ｎを備え、信号低減部５３は、ｎ個の乗算器５３＿１、５３＿２、・・・、５３＿ｎを備える（ｎは、２以上の整数）。信号復元部５４は、復元信号生成部６１及び信号選択器６２にて参照される各部位を備える。 FIG. 5 is an internal block diagram of the wind noise reduction unit 6b according to the second embodiment. The wind noise reduction unit 6b is used as the wind noise reduction unit 6 of FIG. The wind noise reduction unit 6b includes a correlation value calculation unit 51 that functions as a wind noise determination unit for low band, a wind noise determination unit 52 for medium band, a signal reduction unit 53, a signal restoration unit 54, and a signal synthesis. Part 55. The wind noise determination unit 52 includes n correlation value calculation units 52_1, 52_2,..., 52_n, and the signal reduction unit 53 includes n multipliers 53_1, 53_2,. Is an integer of 2 or more). The signal restoration unit 54 includes each part referred to by the restoration signal generation unit 61 and the signal selector 62.

風雑音低減部６ｂに対する入力信号（入力音響信号）は、複数のチャンネル信号から成る周波数軸上の音響信号（換言すれば、周波数領域で表現された音響信号）である。風雑音低減部６ｂに対する入力信号は、時間軸上の音響信号である図３の風雑音低減部６ａに対する入力信号（Ｌ（ｔ）及びＲ（ｔ））を時間周波数変換によって周波数軸上の音響信号に変換したものである。この時間周波数変換には、ＤＦＴ（Discrete Fourier Transform）やＤＣＴ（Discrete Cosine Transform）などを利用する。 The input signal (input acoustic signal) to the wind noise reduction unit 6b is an acoustic signal on the frequency axis composed of a plurality of channel signals (in other words, an acoustic signal expressed in the frequency domain). The input signal to the wind noise reduction unit 6b is an acoustic signal on the frequency axis by time-frequency conversion of the input signals (L (t) and R (t)) to the wind noise reduction unit 6a in FIG. It is converted into a signal. For this time-frequency conversion, DFT (Discrete Fourier Transform), DCT (Discrete Cosine Transform), or the like is used.

上記の時間周波数変換によって、時間Δｔ毎に時間軸方向にサンプリングされたＬ信号Ｌ（ｔ）及びＲ信号Ｒ（ｔ）が周波数Δｆ毎に周波数軸方向にサンプリングされたＬ信号Ｌ（ｆ）及びＲ信号Ｒ（ｆ）に変換される。Ｌ（ｔ）及びＬ（ｆ）に対応するチャンネル信号をＬ信号と呼び、Ｒ（ｔ）及びＲ（ｆ）に対応するチャンネル信号をＲ信号と呼ぶ。 By the time-frequency conversion, the L signal L (t) and the R signal R (t) sampled in the time axis direction every time Δt, and the L signal L (f) sampled in the frequency axis direction every frequency Δf and Converted to R signal R (f). Channel signals corresponding to L (t) and L (f) are called L signals, and channel signals corresponding to R (t) and R (f) are called R signals.

図５の風雑音低減部６ｂに対する入力信号は、上述のようなＬ信号Ｌ（ｆ）及びＲ信号Ｒ（ｆ）から形成される。この入力信号は風雑音低減部６ｂによって補正される。従って、風雑音低減部６ｂに対する入力信号を「原信号」と呼び、風雑音低減部６ｂの出力信号を「補正信号」と呼ぶことにする。尚、以下、Ｌ信号Ｌ（ｆ）及びＲ信号Ｒ（ｆ）を、夫々、信号Ｌ（ｆ）及びＲ（ｆ）と略記することもある。 The input signal to the wind noise reduction unit 6b in FIG. 5 is formed from the L signal L (f) and the R signal R (f) as described above. This input signal is corrected by the wind noise reduction unit 6b. Therefore, an input signal to the wind noise reduction unit 6b is referred to as an “original signal”, and an output signal of the wind noise reduction unit 6b is referred to as a “correction signal”. Hereinafter, the L signal L (f) and the R signal R (f) may be abbreviated as signals L (f) and R (f), respectively.

今、説明の具体化のため、時間周波数変換として修正離散コサイン変換(modified discrete cosine transform；ＭＤＣＴ)を用いる場合を想定する。ＭＤＣＴを用いる場合、時間軸上の各チャンネル信号を符号化の処理単位であるフレームに分割する。１つのフレームには、１つ以上のブロックが含まれるが、今、１つのフレームが１つのブロックから形成されるものとする。フレームの番号（即ち、ブロックの番号）をｍで表し、０から起算してｍ番目のフレームを第ｍのフレームと表現する。ｍは、０以上の整数値をとる。図６に、各フレームの関係を示す。第０のフレーム、第１のフレーム、第２のフレーム、・・・、の順番で時間が進行する。各ブロックは、直前のブロックとの間でブロックの半分の長さの重複部分を有する。今の例の場合、１つのフレームが１つのブロックから形成されるため、各フレームも、直前のフレームとの間で１フレームの半分の長さの重複部分を有する。 Now, for the sake of concrete explanation, it is assumed that a modified discrete cosine transform (MDCT) is used as the time-frequency transform. When MDCT is used, each channel signal on the time axis is divided into frames that are processing units of encoding. One frame includes one or more blocks. Now, it is assumed that one frame is formed from one block. The frame number (that is, the block number) is represented by m, and the m-th frame from 0 is represented as the m-th frame. m takes an integer value of 0 or more. FIG. 6 shows the relationship between the frames. Time advances in the order of the 0th frame, the first frame, the second frame,. Each block has an overlap portion that is half the length of the previous block. In the case of the present example, since one frame is formed from one block, each frame also has an overlap portion that is half the length of one frame with the immediately preceding frame.

そして、Ｎサンプルの信号Ｌ（ｔ）がＭサンプルの信号Ｌ（ｆ）に変換されると共にＮサンプルの信号Ｒ（ｔ）がＭサンプルの信号Ｒ（ｆ）に変換されるものとし、Ｎ＝２０４８且つＭ＝１０２４であるとする。また、サンプリング周波数が４８ｋＨｚである場合を想定し、上述のΔｔを４８ｋＨｚの逆数とする。更に、周波数番号を表す変数としてｋを導入する。Ｍ＝１０２４であるためｋは０以上且つ１０２３以下の各整数値をとり、更にΔｔ＝１／４８ｋＨｚであるため、Ｌ（ｆ）及びＲ（ｆ）が表す周波数スペクトルの周波数間隔、即ち、周波数番号（ｋ−１）とｋの間の周波数間隔は約２３Ｈｚとなる。従って、低帯域の上限として考えた３００Ｈｚはｋ＝１３に対応し、中帯域の上限として考えた１．５ｋＨｚはｋ＝６４に対応する。 An N sample signal L (t) is converted into an M sample signal L (f) and an N sample signal R (t) is converted into an M sample signal R (f). Assume that 2048 and M = 1024. Further, assuming that the sampling frequency is 48 kHz, the above-described Δt is the reciprocal of 48 kHz. Further, k is introduced as a variable representing the frequency number. Since M = 1024, k takes an integer value of 0 or more and 1023 or less, and Δt = 1/48 kHz, so that frequency interval of the frequency spectrum represented by L (f) and R (f), that is, frequency The frequency interval between the numbers (k-1) and k is about 23 Hz. Therefore, 300 Hz considered as the upper limit of the low band corresponds to k = 13, and 1.5 kHz considered as the upper limit of the medium band corresponds to k = 64.

そうすると、信号Ｌ（ｆ）及びＲ（ｆ）はＭＤＣＴ係数Ｌ_m,k及びＲ_m,kにて表現することができる。ＭＤＣＴ係数Ｌ_m,kは、信号Ｌ（ｆ）の、第ｍのフレームにおける周波数番号ｋの周波数成分の信号強度を表し、ＭＤＣＴ係数Ｒ_m,kは、信号Ｒ（ｆ）の、第ｍのフレームにおける周波数番号ｋの周波数成分の信号強度を表す。 Then, the signals L (f) and R (f) can be expressed by MDCT coefficients L _{m, k} and R _{m, k} . The MDCT coefficient L _{m, k} represents the signal strength of the frequency component of the frequency number k in the mth frame of the signal L (f), and the MDCT coefficient R _{m, k} represents the _mth of the signal R (f). It represents the signal strength of the frequency component of frequency number k in the frame.

原信号を形成する信号Ｌ（ｆ）及びＲ（ｆ）の内、周波数帯域が低帯域に属する信号が相関値演算部５１に入力される。即ち、０≦ｋ≦１３の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kが相関値演算部５１に入力される。相関値演算部５１は、フレームごとに下記式（３）に従って相関値Ｋ_A［ｍ］を算出する。Ｋ_A［ｍ］は、第ｍのフレームに対する相関値を表す。Ｋ_A［ｍ］は、０以上１以下の値となる。尚、当然ではあるが、本実施例の如く周波数軸上で信号を取り扱う場合は、一定の周波数間隔で信号が存在するため、第１実施例では必要であったＬＰＦ等は不要である。 Of the signals L (f) and R (f) forming the original signal, a signal whose frequency band belongs to a low band is input to the correlation value calculation unit 51. That is, the MDCT coefficients L _{m, k} and R _{m, k} within the range of 0 ≦ k ≦ 13 are input to the correlation value calculation unit 51. The correlation value calculation unit 51 calculates a correlation value K _A [m] for each frame according to the following equation (3). K _A [m] represents a correlation value for the m-th frame. K _A [m] is a value between 0 and 1. Needless to say, when a signal is handled on the frequency axis as in the present embodiment, the signal exists at a constant frequency interval, and thus the LPF or the like necessary in the first embodiment is unnecessary.

風雑音は、左右のチャンネル信号間で相互相関がない。故に、原信号に含まれる風雑音が比較的多ければ相関値は比較的小さくなり、原信号に含まれる風雑音が比較的少なければ相関値は比較的大きくなる。相関値Ｋ_A［ｍ］は、第ｍのフレームにおける風雑音の強さに応じた値をとることになる。これを利用し、低帯域用の風雑音判定部としての相関値演算部５１は、相関値に基づいて各フレームに対する風雑音の影響度合いを判定する。この判定結果は、信号復元部５４での処理に利用される。 Wind noise has no cross-correlation between left and right channel signals. Therefore, the correlation value is relatively small if the wind noise contained in the original signal is relatively large, and the correlation value is relatively large if the wind noise contained in the original signal is relatively small. The correlation value K _A [m] takes a value corresponding to the strength of wind noise in the m-th frame. Using this, the correlation value calculation unit 51 as a low-band wind noise determination unit determines the degree of influence of wind noise on each frame based on the correlation value. This determination result is used for processing in the signal restoration unit 54.

原信号を形成する信号Ｌ（ｆ）及びＲ（ｆ）の内、周波数帯域が中帯域に属する信号が風雑音判定部５２及び信号低減部５３に入力される。即ち、１４≦ｋ≦６４の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kが風雑音判定部５２及び信号低減部５３に入力される。風雑音判定部５２及び信号低減部５３に対する入力信号は、更にｎ個に細分化される。つまり、中帯域を更にｎ個に細分化し、細分化された帯域ごとに風雑音判定と信号低減を実施する。 Of the signals L (f) and R (f) forming the original signal, a signal whose frequency band belongs to the middle band is input to the wind noise determination unit 52 and the signal reduction unit 53. That is, MDCT coefficients L _{m, k} and R _{m, k} within the range of 14 ≦ k ≦ 64 are input to the wind noise determination unit 52 and the signal reduction unit 53. The input signals to the wind noise determination unit 52 and the signal reduction unit 53 are further divided into n pieces. That is, the middle band is further divided into n pieces, and wind noise determination and signal reduction are performed for each subdivided band.

具体的には、
１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kが相関値演算部５２＿１及び乗算器５３＿１に入力され、
ｋ₁＜ｋ≦ｋ₂の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kが相関値演算部５２＿２及び乗算器５３＿２に入力され、・・・、
ｋ_nー1＜ｋ≦ｋ_nの範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kが相関値演算部５２＿ｎ及び乗算器５３＿ｎに入力される。
ここで、１４＜ｋ₁＜ｋ₂＜、・・・＜ｋ_n-1＜ｋ_n＝６４、である。 In particular,
MDCT coefficients L _{m, k} and R _{m, k} within the range of 14 ≦ k ≦ k ₁ are input to the correlation value calculation unit 52_1 and the multiplier 53_1,
MDCT coefficients L _{m, k} and R _{m, k} within the range of k ₁ <k ≦ k ₂ are input to the correlation value calculator 52_2 and the multiplier 53_2,...
k _n over 1 <MDCT coefficient in the range of _k ≦ k _n L _m, k and R _{m, k} are input to the correlation value calculation section 52_n and the multiplier 53_N.
Here, 14 <k ₁ <k ₂ <,... <K _n-1 <k _n = 64.

風雑音判定部５２では、ｎ個に細分化された帯域ごとに相関値を算出する。即ち、相関値演算部５２＿１は、フレームごとに下記式（４−１）に従って相関値Ｋ_B1［ｍ］を算出し、相関値演算部５２＿２は、フレームごとに下記式（４−２）に従って相関値Ｋ_B2［ｍ］を算出し、・・・、相関値演算部５２＿ｎは、フレームごとに下記式（４−ｎ）に従って相関値Ｋ_Bn［ｍ］を算出する。相関値Ｋ_B1［ｍ］、Ｋ_B2［ｍ］、・・・Ｋ_Bn［ｍ］は、第ｍのフレームに対する相関値を表す。Ｋ_B1［ｍ］、Ｋ_B2［ｍ］、・・・Ｋ_Bn［ｍ］は、対応する帯域に関するＬ信号とＲ信号との間の相互相関を表し、夫々、０以上１以下の値となる。 The wind noise determination unit 52 calculates a correlation value for each of the n subdivided bands. That is, the correlation value calculation unit 52_1 calculates the correlation value K _B1 [m] for each frame according to the following equation (4-1), and the correlation value calculation unit 52_2 for each frame according to the following equation (4-2). The value K _B2 [m] is calculated, and the correlation value calculation unit 52_n calculates the correlation value K _Bn [m] for each frame according to the following equation (4-n). Correlation values K _B1 [m], K _B2 [m],... K _Bn [m] represent correlation values for the m-th frame. K _B1 [m], K _B2 [m],..., K _Bn [m] represent the cross-correlation between the L signal and the R signal for the corresponding band, and are values of 0 or more and 1 or less, respectively. .

乗算器５３＿１は、第ｍのフレームに関し、自身に対する入力信号のレベル（即ち、１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kの値）を相関値Ｋ_B1［ｍ］に応じた低減率にて低減させ、低減後の信号を出力する。同様に、
乗算器５３＿２は、第ｍのフレームに関し、自身に対する入力信号のレベル（即ち、ｋ₁＜ｋ≦ｋ₂の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kの値）を相関値Ｋ_B2［ｍ］に応じた低減率にて低減させ、低減後の信号を出力する。同様に、
乗算器５３＿ｎは、第ｍのフレームに関し、自身に対する入力信号のレベル（即ち、ｋ_nー1＜ｋ≦ｋ_nの範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kの値）を相関値Ｋ_Bn［ｍ］に応じた低減率にて低減させ、低減後の信号を出力する。
信号低減部５３内の他の乗算器についても同様である。 For the m-th frame, the multiplier 53_1 uses the level of the input signal to itself (that is, the value of the MDCT coefficients L _{m, k} and R _{m, k} within the range of 14 ≦ k ≦ k ₁ ) as the correlation value K _B1 [ m] and output a signal after the reduction. Similarly,
For the m-th frame, the multiplier 53_2 uses the level of the input signal to itself (that is, the values of the MDCT coefficients L _{m, k} and R _{m, k} within the range of k ₁ <k ≦ k ₂ ) as the correlation value K _B2. The signal is reduced at a reduction rate according to [m], and the signal after reduction is output. Similarly,
Multiplier 53_n relates frame of the m, the level of the input signal to itself (i.e., k _n over 1 <MDCT coefficient in the range of _k ≦ k _n L _m, k and R _m, the value of _k) correlation values The signal is reduced at a reduction rate corresponding to K _Bn [m], and the signal after reduction is output.
The same applies to the other multipliers in the signal reduction unit 53.

この際、ｊを１以上ｎ以下の整数とした場合、乗算器５３＿ｊは、相関値Ｋ_Bj［ｍ］から風雑音の影響が大きいと判断される場合はレベルを大きく低減させ、相関値Ｋ_Bj［ｍ］から風雑音の影響が小さいと判断される場合はレベルをあまり低減させないようにする。つまり、乗算器５３＿ｊは、相関値Ｋ_Bj［ｍ］が減少するに従って第ｍのフレームに対応する低減率を増大させ、相関値Ｋ_Bj［ｍ］が増大するに従って第ｍのフレームに対応する低減率を減少させる。低減率が大きいほど乗算器５３＿ｊにおけるレベルの低減度合いは大きくなる。低減対象となるレベルとは、具体的には、例えばｊ＝１の場合、１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kの値である。 At this time, when j is an integer of 1 to n, the multiplier 53_j greatly reduces the level when the influence of wind noise is determined to be large from the correlation value K _Bj [m], and the correlation value K _Bj When it is determined from [m] that the influence of wind noise is small, the level is not reduced so much. That is, the multiplier 53_j increases the reduction rate corresponding to the m-th frame as the correlation value K _Bj [m] decreases, and decreases corresponding to the m-th frame as the correlation value K _Bj [m] increases. Decrease rate. The greater the reduction rate, the greater the level reduction level in the multiplier 53_j. The level to be reduced is specifically the values of the MDCT coefficients L _{m, k} and R _{m, k} within the range of 14 ≦ k ≦ k ₁ when, for example, j = 1.

尚、同じ結果を導くのであれば、低減させる手法は任意である。例えば、相関値演算部５２＿ｊにて算出された相関値又は該相関値に応じた係数を、乗算器５３＿ｊの入力信号に乗じればよい。尚、相関値Ｋ_Bj［ｍ］が所定の基準閾値よりも大きい場合は、風雑音はないと判断して乗算器５３＿ｊの入力信号をそのまま乗算器５３＿ｊの出力信号とするようにしてもよい。 In addition, if the same result is led, the method of reducing is arbitrary. For example, the correlation value calculated by the correlation value calculation unit 52_j or a coefficient corresponding to the correlation value may be multiplied by the input signal of the multiplier 53_j. When the correlation value K _Bj [m] is larger than a predetermined reference threshold, it may be determined that there is no wind noise and the input signal of the multiplier 53_j may be used as the output signal of the multiplier 53_j as it is.

中帯域においては、風の強さ等に依存して風雑音の影響を受ける帯域が異なってくる。そこで、中帯域をより小さな帯域に細分化して細分化した帯域ごとに風雑音の影響度合いを相関値演算によって評価する。そして、細分化した帯域ごとに、風雑音の影響度合いに応じて信号レベルの低減度合いを調整する。これにより、風雑音の影響がある細分化帯域に対してのみ信号低減がなされ、或いは、より風雑音の影響が多い細分化帯域に対してより強い信号低減がなされる。結果、不必要に信号低減を行うことなく中帯域に対する風雑音低減効果が得られる。 In the middle band, the band affected by wind noise differs depending on the wind intensity and the like. Therefore, the influence level of wind noise is evaluated by correlation value calculation for each subdivided band by subdividing the middle band into smaller bands. Then, for each subdivided band, the signal level reduction degree is adjusted according to the wind noise influence degree. As a result, signal reduction is performed only for the subdivided band that is affected by wind noise, or stronger signal reduction is performed for the subdivided band that is more affected by wind noise. As a result, a wind noise reduction effect for the middle band can be obtained without unnecessary signal reduction.

乗算器５３＿１、５３＿２、・・・、５３＿ｎの各出力信号は合成され、その合成によって得られた中帯域のＭＤＣＴ係数は、信号低減部５３の出力信号（第２補正音響信号）として復元信号生成部６１及び信号合成部５５に送られる。 The output signals of the multipliers 53_1, 53_2,..., 53_n are combined, and the MDCT coefficient in the middle band obtained by the combination is generated as an output signal (second corrected acoustic signal) of the signal reduction unit 53. Sent to the unit 61 and the signal synthesis unit 55.

復元信号生成部６１は、信号低減部５３の出力信号に含まれる、中帯域のピッチ情報から低帯域のピッチを予測して周波数軸上における低帯域の音響信号を復元する。１つのフレームに着目して、この復元手法を説明する。図７を参照する。図７の符号３００が付された実線の折れ線は、復元信号生成部６１に与えられた、着目フレームにおける中帯域の周波数スペクトルを表している。本実施例において、周波数スペクトル３００は、信号低減部５３の出力信号にて規定されることになる。 The restoration signal generation unit 61 restores a low-band acoustic signal on the frequency axis by predicting a low-band pitch from the middle-band pitch information included in the output signal of the signal reduction unit 53. This restoration method will be described by focusing on one frame. Please refer to FIG. A solid broken line denoted by reference numeral 300 in FIG. 7 represents the frequency spectrum of the middle band in the frame of interest given to the restoration signal generation unit 61. In the present embodiment, the frequency spectrum 300 is defined by the output signal of the signal reduction unit 53.

図７において、横軸は周波数を表し、縦軸は周波数スペクトルのレベルを表す。周波数スペクトルのレベルは、ＭＤＣＴ係数の値によって表される。図７は、着目フレームにピッチが含まれている場合に対応している。着目フレームにピッチが含まれていると周波数スペクトルが周期的に変動し、周期的に極大値と極小値をとる。今、周波数スペクトル３００が周波数ｆ_A、ｆ_C、ｆ_E及びｆ_Gにて極大値をとり且つ周波数ｆ_B、ｆ_D、ｆ_F、ｆ_Hにて極小値をとったとする。ここで、ｆ_A＜ｆ_B＜ｆ_C＜ｆ_D＜ｆ_E＜ｆ_F＜ｆ_G＜ｆ_Hとする。 In FIG. 7, the horizontal axis represents the frequency, and the vertical axis represents the level of the frequency spectrum. The level of the frequency spectrum is represented by the value of the MDCT coefficient. FIG. 7 corresponds to the case where a pitch is included in the frame of interest. If a pitch is included in the frame of interest, the frequency spectrum periodically changes, and takes a local maximum value and a local minimum value periodically. Now, it is assumed that the frequency spectrum 300 has maximum values at frequencies f _A , f _C , f _E and f _G and has minimum values at frequencies f _B , f _D , f _F and f _H. Here, it is assumed that f _A <f _B <f _C <f _D <f _E <f _F <f _G <f _H.

復元信号生成部６１は、周波数スペクトル３００から周波数ｆ_A、ｆ_B、ｆ_C、ｆ_D、ｆ_E、ｆ_F、ｆ_G及びｆ_Hを検出し、隣接する極大値と極小値間の差分値を算出する、そして、その差分値が所定の差分閾値以上である場合に、その差分値に対応する極大値の周波数成分をピッチの高調波成分であると判断する。例えば、周波数スペクトル３００における周波数ｆ_Aのレベルから周波数ｆ_Bのレベルを差し引くことによって得られる差分値と上記の差分閾値を比較し、前者が後者以上である場合に周波数ｆ_Aの成分をピッチの高調波成分であると判断する一方、前者が後者未満である場合に周波数ｆ_Aの成分はピッチの高調波成分でないと判断する。他の極大値及び極小値に対応する周波数についても同様である。 The restoration signal generation unit 61 detects the frequencies f _A , f _B , f _C , f _D , f _E , f _F , f _G, and f _H from the frequency spectrum 300, and the difference value between the adjacent maximum value and the minimum value. When the difference value is equal to or greater than a predetermined difference threshold value, it is determined that the frequency component having the maximum value corresponding to the difference value is a harmonic component of the pitch. For example, the difference value obtained by subtracting the level of the frequency f _B from the level of the frequency f _A in the frequency spectrum 300 is compared with the above difference threshold, and if the former is greater than or equal to the latter, the component of the frequency f _A While it is determined that the component is a harmonic component, when the former is less than the latter, it is determined that the component of the frequency f _A is not a harmonic component of the pitch. The same applies to the frequencies corresponding to the other maximum and minimum values.

今、この判断によって、周波数ｆ_A、ｆ_C、ｆ_E及びｆ_Gがピッチの高調波成分であると判断されたとする。この場合、復元信号生成部６１は、隣接する高調波成分の周波数差からピッチ間隔Ｄｐを予測する。例えば、周波数差（ｆ_A−ｆ_C）と（ｆ_C−ｆ_E）と（ｆ_E−ｆ_G）の平均をピッチ間隔Ｄｐとする。更に、復元信号生成部６１は、周波数スペクトル３００における周波数ｆ_A、ｆ_C、ｆ_E及びｆ_GのレベルからピッチのレベルＧｐを予測する。 Now, it is assumed that the frequencies f _A , f _C , f _E and f _G are determined to be harmonic components of the pitch by this determination. In this case, the restoration signal generation unit 61 predicts the pitch interval Dp from the frequency difference between adjacent harmonic components. For example, an average of the frequency differences (f _A −f _C ), (f _C −f _E ), and (f _E −f _G ) is set as the pitch interval Dp. Further, the restoration signal generation unit 61 predicts the pitch level Gp from the levels of the frequencies f _A , f _C , f _E and f _{G in} the frequency spectrum 300.

復元信号生成部６１は、予測したピッチ間隔Ｄｐ及びピッチのレベルＧｐを含むピッチ情報から、低帯域の信号を予測して復元信号を生成する。即ち、中帯域の中で最も低周波数側にあるピッチの高調波成分の周波数からピッチ間隔Ｄｐだけ低い周波数ｆ_X（＝ｆ_A−Ｄｐ）にピッチが存在すると予測し、その周波数ｆ_XにレベルＧｐのピッチを復元する。この復元の様子を図７及び図８に示す。図７において、符号３０１が付された破線の折れ線は、復元信号生成部６１にて生成された周波数軸上における低帯域の復元信号の周波数スペクトルを表している。 The restoration signal generation unit 61 predicts a low-band signal from the pitch information including the predicted pitch interval Dp and pitch level Gp, and generates a restoration signal. That is, it is predicted that there is a pitch at a frequency f _X (= f _A −Dp) that is lower than the frequency of the harmonic component of the pitch on the lowest frequency side in the middle band by a pitch interval Dp, and the level is at that frequency f _X. Restore the pitch of Gp. The state of this restoration is shown in FIGS. In FIG. 7, a broken line with a reference numeral 301 represents a frequency spectrum of a low-band restoration signal on the frequency axis generated by the restoration signal generation unit 61.

レベルＧｐは、周波数スペクトル３００を表す座標面において、周波数スペクトル３００における周波数ｆ_A、ｆ_C、ｆ_E及びｆ_Gのレベルを線形補間又は曲線補間することによって算出する。例えば、周波数ｆ_A、ｆ_C、ｆ_E及びｆ_Gのレベルが、夫々、１０、８、６及び４によって数値化される場合、Ｇｐは１２と予測される。 The level Gp is calculated by linearly interpolating or curve interpolating the levels of the frequencies f _A , f _C , f _E and f _{G in} the frequency spectrum 300 on the coordinate plane representing the frequency spectrum 300. For example, if the levels of frequencies f _A , f _C , f _E and f _G are quantified by 10, 8, 6 and 4, respectively, Gp is predicted to be 12.

また、周波数ｆ_X以外の復元信号（即ち、図７の周波数スペクトル３０１の折れ線形状）は、周波数ｆ_Xから周波数が離れるに従ってレベルが徐々に減少するように、予測される。この予測の際、周波数スペクトル３００を考慮するようにしてもよい。例えば、周波数スペクトル３００における隣接する極大値と極小値間のスペクトル形状を考慮して、周波数ｆ_X以外の復元信号を予測しても良い。例えば、レベルＧｐと周波数ｆ_Cにおけるレベルとの比率（上記の数値例では、１２／８＝１．５）にて、周波数スペクトル３００内における周波数ｆ_B−ｆ_D間のスペクトル形状をレベル方向に引き伸ばすことによって得られるスペククトル形状を、周波数スペクトル３０１の形状としてもよい。尚、図７及び図８に示す例では、復元したピッチが１つしかないが、算出されたピッチ間隔Ｄｐが狭い場合は、低帯域内に複数のピッチが存在するように復元信号を生成するようにしても構わない。 In addition, a restoration signal other than the frequency f _X (that is, the polygonal line shape of the frequency spectrum 301 in FIG. 7) is predicted so that the level gradually decreases as the frequency increases from the frequency f _X. In this prediction, the frequency spectrum 300 may be taken into consideration. For example, a restored signal other than the frequency f _X may be predicted in consideration of the spectrum shape between adjacent maximum and minimum values in the frequency spectrum 300. For example, the spectrum shape between the frequencies f _B -f _D in the frequency spectrum 300 in the level direction at the ratio between the level Gp and the level at the frequency f _C (12/8 = 1.5 in the above numerical example). The spectrum shape obtained by stretching may be the shape of the frequency spectrum 301. In the example shown in FIGS. 7 and 8, there is only one restored pitch, but when the calculated pitch interval Dp is narrow, a restored signal is generated so that a plurality of pitches exist in the low band. It doesn't matter if you do.

信号選択器６２には、原信号内の低帯域の信号と復元信号生成部６１によって生成された復元信号とが与えられ、信号選択器６２は、フレーム毎に、前者と後者の内の一方を相関値演算部５１によって算出された相関値Ｋ_A［ｍ］に応じて選択して出力する。原信号内の低帯域の信号も復元信号生成部６１によって生成された復元信号も、０≦ｋ≦１３の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kにて表現されるが、両者間におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kの値は通常異なる。 The signal selector 62 is given a low-band signal in the original signal and the restored signal generated by the restored signal generator 61, and the signal selector 62 selects one of the former and the latter for each frame. _A correlation value K _A [m] calculated by the correlation value calculation unit 51 is selected and output. Both the low-band signal in the original signal and the restored signal generated by the restored signal generator 61 are expressed by MDCT coefficients L _{m, k} and R _{m, k} within the range of 0 ≦ k ≦ 13. The values of the MDCT coefficients L _{m, k} and R _{m, k} between are usually different.

具体的には、第ｍのフレームに着目した場合、信号選択器６２は、相関値Ｋ_A［ｍ］と所定の閾値を比較し、相関値Ｋ_A［ｍ］が所定の閾値以下の場合に風雑音があると判断して第ｍのフレームに対応する復元信号を選択及び出力し、相関値Ｋ_A［ｍ］が該閾値よりも大きい場合に風雑音がないと判断して第ｍのフレームに対応する原信号内の低帯域の信号を選択及び出力する。信号選択器６２の出力信号は、信号復元部５４の出力信号（第１補正音響信号）とされる。 Specifically, when focusing on the m-th frame, the signal selector 62 compares the correlation value K _A [m] with a predetermined threshold, and when the correlation value K _A [m] is equal to or smaller than the predetermined threshold. It is determined that there is wind noise, and a restoration signal corresponding to the mth frame is selected and output. If the correlation value K _A [m] is larger than the threshold, it is determined that there is no wind noise and the mth frame. A low-band signal in the original signal corresponding to is selected and output. The output signal of the signal selector 62 is used as the output signal (first corrected acoustic signal) of the signal restoration unit 54.

信号合成部５５には、信号復元部５４と信号低減部５３の各出力信号の他、原信号内の高帯域の信号がそのまま与えられる。信号合成部５５は、フレーム毎に、低帯域の音響信号を表す信号復元部５４の出力信号と、信号低減処理によって風雑音が低減された中帯域の音響信号を表す信号低減部５３の出力信号と、原信号内の高帯域の信号とを合成し、この合成によって得られた信号を、風雑音低減部６ｂの出力信号（即ち、補正信号）として出力する。第２実施例において、この補正信号も原信号と同じく複数のチャンネル信号から成る周波数軸上の音響信号である。 In addition to the output signals of the signal restoration unit 54 and the signal reduction unit 53, the signal synthesis unit 55 is given the high-band signal in the original signal as it is. The signal synthesis unit 55 outputs, for each frame, an output signal of the signal restoration unit 54 that represents a low-band acoustic signal and an output signal of the signal reduction unit 53 that represents a mid-band acoustic signal in which wind noise has been reduced by the signal reduction processing. Are combined with a high-band signal in the original signal, and a signal obtained by the synthesis is output as an output signal (that is, a correction signal) of the wind noise reduction unit 6b. In the second embodiment, this correction signal is also an acoustic signal on the frequency axis composed of a plurality of channel signals, like the original signal.

図２の音声信号処理部４において、信号合成部５５から出力される補正信号は、ＡＡＣの符号化方式に従って量子化され、符号化音声信号としてのビットストリームに変換される。この符号化音声信号（ビットストリーム）は、図２の記録媒体５に記録される。 In the audio signal processing unit 4 of FIG. 2, the correction signal output from the signal synthesis unit 55 is quantized according to the AAC encoding method and converted into a bit stream as an encoded audio signal. This encoded audio signal (bit stream) is recorded on the recording medium 5 of FIG.

上述の説明では、原則として、Ｌ信号とＲ信号に対する信号処理を個別に分けて記述していないが、上述したように、風雑音低減部６ｂ内の各部位は、複数のチャンネル信号の夫々に対して個別に必要な信号処理を行う。 In the above description, in principle, the signal processing for the L signal and the R signal is not described separately. However, as described above, each part in the wind noise reduction unit 6b is assigned to each of a plurality of channel signals. On the other hand, necessary signal processing is performed individually.

つまり、乗算器５３＿ｊは、原信号中の中帯域におけるＬ信号及びＲ信号の夫々に対して相関値演算部５２＿ｊが算出した相関値に応じた信号低減処理を行う（上述したように、ｊは１以上ｎ以下の整数）。復元信号生成部６１は、信号低減部５３の出力信号を形成するＬ信号及びＲ信号の夫々のピッチ情報を作成し、各ピッチ情報に基づいてＬ信号及びＲ信号の復元信号を生成する。信号選択器６２は、相関値演算部５１が算出した相関値に応じて、原信号中の低帯域におけるＬ信号及びＲ信号又は復元信号におけるＬ信号及びＲ信号を選択して出力する。信号合成部５５は、信号復元部５４の出力信号におけるＬ信号と信号低減部５３の出力信号におけるＬ信号と原信号中の高帯域におけるＬ信号とを合成すると共に信号復元部５４の出力信号におけるＲ信号と信号低減部５３の出力信号におけるＲ信号と原信号中の高帯域におけるＲ信号とを合成することによって、補正信号を生成する。 That is, the multiplier 53_j performs signal reduction processing corresponding to the correlation value calculated by the correlation value calculation unit 52_j for each of the L signal and the R signal in the middle band in the original signal (as described above, j is An integer of 1 to n). The restoration signal generation unit 61 creates pitch information of each of the L signal and the R signal that form the output signal of the signal reduction unit 53, and generates a restoration signal of the L signal and the R signal based on each pitch information. The signal selector 62 selects and outputs the L signal and R signal in the low band in the original signal or the L signal and R signal in the restored signal according to the correlation value calculated by the correlation value calculation unit 51. The signal synthesis unit 55 synthesizes the L signal in the output signal of the signal restoration unit 54, the L signal in the output signal of the signal reduction unit 53, and the L signal in the high band in the original signal, and in the output signal of the signal restoration unit 54 The correction signal is generated by combining the R signal and the R signal in the output signal of the signal reduction unit 53 and the R signal in the high band in the original signal.

尚、本実施例では、信号低減部５３の出力信号（即ち、信号低減処理後の中帯域の信号）に基づいて復元信号を生成するようにしているが、原信号中の中帯域の信号に基づいて復元信号を生成することも可能である。この場合、信号低減部５３の出力信号の代わりに、原信号中の中帯域の信号を復元信号生成部６１に与えればよい。しかし、信号低減処理によって風雑音を低減してからピッチ情報の抽出を行った方が、より精度の高い情報が得られるため、図５に示す如く構成した方が望ましい。 In this embodiment, the restoration signal is generated based on the output signal of the signal reduction unit 53 (that is, the signal in the middle band after the signal reduction process), but the signal in the middle band in the original signal is generated. It is also possible to generate a restoration signal based on this. In this case, instead of the output signal of the signal reduction unit 53, a medium band signal in the original signal may be supplied to the restoration signal generation unit 61. However, if pitch information is extracted after reducing wind noise by signal reduction processing, more accurate information can be obtained. Therefore, it is desirable to configure as shown in FIG.

また、風雑音低減部６ｂから、相関値演算部５１を省略することも可能である。相関値演算部５１を省略した場合、信号選択器６２も省略され、信号復元部５４は復元信号生成部６１にて生成された復元信号を無条件で出力する。これに類似して、風雑音低減部６ｂから、風雑音判定部５２を省略することも可能である。風雑音判定部５２を省略した場合、乗算器５３＿ｊは、原信号中の中帯域における信号のレベルを一定の低減率にて低減させ、低減後の信号を出力する。尚、相関値演算部５１及び風雑音判定部５２が省略された場合、風雑音低減部６ｂに対する入力信号は１つのチャンネル信号から成るモノラル信号であってもよい。 Further, the correlation value calculation unit 51 can be omitted from the wind noise reduction unit 6b. When the correlation value calculation unit 51 is omitted, the signal selector 62 is also omitted, and the signal restoration unit 54 outputs the restoration signal generated by the restoration signal generation unit 61 unconditionally. In a similar manner, the wind noise determination unit 52 can be omitted from the wind noise reduction unit 6b. When the wind noise determination unit 52 is omitted, the multiplier 53_j reduces the signal level in the middle band in the original signal at a constant reduction rate, and outputs the signal after the reduction. If the correlation value calculation unit 51 and the wind noise determination unit 52 are omitted, the input signal to the wind noise reduction unit 6b may be a monaural signal composed of one channel signal.

また、風雑音低減部６ｂに、相関値演算部５１としての低帯域用の風雑音判定部と中帯域用の風雑音判定部５２とを独立に設け、前者の判定結果を信号復元部５２の処理にのみ反映させ且つ後者の判定結果を信号低減部５３の処理にのみ反映させているが、以下のように各判定結果を相互利用するようにしてもよい。即ち例えば、相関値演算部５１が算出した相関値Ｋ_A［ｍ］と相関値演算部５２＿ｊが算出した相関値Ｋ_Bj［ｍ］に基づいて、第ｍのフレームにおける乗算器５３＿ｊでの低減率を決定するようにしてもよい。より具体的には例えば、相関値Ｋ_Bj［ｍ］の減少に従って該低減率を増大させるだけでなく、相関値Ｋ_A［ｍ］の減少に従っても該低減率を増大させるようにする。同様に、第ｍのフレームにおける信号選択器６２の選択を、相関値演算部５１が算出した相関値Ｋ_A［ｍ］と相関値演算部５２＿ｊが算出した相関値Ｋ_Bj［ｍ］に基づいて行うようにしても良い。 Further, the wind noise reduction unit 6 b is provided with a low-band wind noise determination unit and a mid-band wind noise determination unit 52 as the correlation value calculation unit 51 independently, and the former determination result is obtained from the signal restoration unit 52. Although only the processing is reflected and the latter determination result is reflected only on the processing of the signal reduction unit 53, each determination result may be mutually used as follows. That is, for example, based on the correlation value K _A [m] calculated by the correlation value calculation unit 51 and the correlation value K _Bj [m] calculated by the correlation value calculation unit 52_j, the reduction rate in the multiplier 53_j in the m-th frame May be determined. More specifically, for example, not only the reduction rate is increased according to the decrease of the correlation value K _Bj [m], but also the reduction rate is increased according to the decrease of the correlation value K _A [m]. Similarly, the selection of the signal selector 62 in the m-th frame is performed based on the correlation value K _A [m] calculated by the correlation value calculation unit 51 and the correlation value K _Bj [m] calculated by the correlation value calculation unit 52_j. You may make it do.

［信号低減処理の変形例］
また、上述の信号低減処理では、原信号中の中帯域におけるＬ信号及びＲ信号の夫々の信号レベルを乗算器５３＿ｊを用いて低減させ、この低減後の信号を信号合成部５５に与えているが、これに代えて以下のような処理を実行しても良い。例として、第２実施例の信号低減処理に関する、第３及び第４の変形信号低減処理を例示する。第３及び第４の変形信号低減処理は、夫々、第１実施例にて述べた第１及び第２の変形信号低減処理を、第２実施例に適応させた処理に相当する。 [Modification of signal reduction processing]
In the signal reduction process described above, the signal levels of the L signal and the R signal in the middle band of the original signal are reduced using the multiplier 53_j, and the signal after this reduction is given to the signal synthesis unit 55. However, instead of this, the following processing may be executed. As an example, third and fourth modified signal reduction processes related to the signal reduction process of the second embodiment will be exemplified. The third and fourth modified signal reduction processes correspond to processes in which the first and second modified signal reduction processes described in the first embodiment are adapted to the second embodiment, respectively.

第３の変形信号低減処理を説明する。説明の具体化のため、まず、細分化されたｎ個の帯域の１つである、相関値演算部５２＿１に対応する帯域に注目する。第３の変形信号低減処理において、信号低減部５３は、相関値演算部５２＿１によって算出された相関値Ｋ_B1［ｍ］を所定の閾値Ｋ_THB1と比較する。上述の如く、相関値Ｋ_B1［ｍ］は、第ｍのフレームにおける特定帯域への風雑音の影響度合いを表している。一方で、閾値Ｋ_THB1は、その影響度合いと対比されるべき基準影響度合いを表している。相関値Ｋ_B1［ｍ］が閾値Ｋ_THB1よりも小さい場合、第ｍのフレームおける特定帯域への風雑音の影響が比較的大きいと判断され、相関値Ｋ_B1［ｍ］が閾値Ｋ_THB1よりも大きい場合、それが比較的小さいと判断される（相関値Ｋ_B2［ｍ］〜Ｋ_Bn［ｍ］についても同様、且つ、第４の変形信号低減処理についても同様）。 The third deformation signal reduction process will be described. For the sake of specific description, first, attention is focused on a band corresponding to the correlation value calculation unit 52_1, which is one of the n subdivided bands. In the third modified signal reduction process, the signal reduction unit 53 compares the correlation value K _B1 [m] calculated by the correlation value calculation unit 52_1 with a predetermined threshold value K _THB1 . As described above, the correlation value K _B1 [m] represents the degree of influence of wind noise on the specific band in the m-th frame. On the other hand, the threshold value K _THB1 represents a reference influence degree to be compared with the influence degree. When the correlation value K _B1 [m] is smaller than the threshold value K _THB1 , it is determined that the influence of wind noise on the specific band in the m-th frame is relatively large, and the correlation value K _B1 [m] is smaller than the threshold value K _THB1. If it is larger, it is determined that it is relatively small (the same applies to the correlation values K _B2 [m] to K _Bn [m] and the same applies to the fourth modified signal reduction process).

そして、相関値Ｋ_B1［ｍ］が閾値Ｋ_THB1よりも小さい場合、信号低減部５３は、原信号に含まれる１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kを平均化したＭＤＣＴ係数（Ｌ_m,k＋Ｒ_m,k）／２を算出し、そのＭＤＣＴ係数（Ｌ_m,k＋Ｒ_m,k）／２を、信号低減部５３から出力されるべき１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kとして取り扱う。 When the correlation value K _B1 [m] is smaller than the threshold value K _THB1 , the signal reduction unit 53 determines the MDCT coefficients L _{m, k} and R _{m, k} within the range of 14 ≦ k ≦ k ₁ included in the original signal. MDCT coefficient (L _{m, k} + R _{m, k} ) / 2 is calculated, and the MDCT coefficient (L _{m, k} + R _{m, k} ) / 2 is output from the signal reduction unit 53 to 14 ≦ Treated as MDCT coefficients L _{m, k} and R _{m, k} in the range of k ≦ k ₁ .

一方、相関値Ｋ_B1［ｍ］が閾値Ｋ_THB1よりも大きい場合は、上記の平均化を行わず、原信号に含まれる１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kをそのまま信号低減部５３から出力されるべき１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kとして取り扱う（或いは、乗算器５３＿１による上述の信号低減処理を行ってもよい）。 On the other hand, when the correlation value K _B1 [m] is larger than the threshold value K _THB1 , the above averaging is not performed, and the MDCT coefficients L _{m, k} and R within the range of 14 ≦ k ≦ k ₁ included in the original signal. _{m, k} is handled as MDCT coefficients L _{m, k} and R _{m, k} within the range of 14 ≦ k ≦ k ₁ to be output from the signal reduction unit 53 as they are (or the above signal reduction processing by the multiplier 53_1 is performed). You may go).

上記の処理は、細分化されたｎ個の帯域の夫々に対して個別に行われる。変数ｊを用いて一般化する。信号低減部５３から出力されるべきｋ_j-1≦ｋ≦ｋ_jの範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kを、夫々、ＭＤＣＴ係数Ｌ_m,k’及びＲ_m,k’にて表す。上述したように、１４＜ｋ₁＜ｋ₂＜、・・・＜ｋ_n-1＜ｋ_n＝６４、であり、また、ｋ₀＝１４であるとする。 The above processing is performed individually for each of the subdivided n bands. Generalize using variable j. The MDCT coefficients L _{m, k} and R _{m, k} within the range of k _j−1 ≦ k ≦ k _j to be output from the signal reduction unit 53 are respectively converted into the MDCT coefficients L _{m, k} ′ and R _{m, k} ′. Represented by As described above, it is assumed that 14 <k ₁ <k ₂ <,... <K _n-1 <k _n = 64, and k ₀ = 14.

信号低減部５３は、ｊ＝１、２、・・・、ｎの夫々に関し、相関値演算部５２＿ｊによって算出された相関値Ｋ_Bj［ｍ］を所定の閾値Ｋ_THBjと比較する。そして、相関値Ｋ_Bj［ｍ］が閾値Ｋ_THBjよりも小さい場合、信号低減部５３は、原信号に含まれるｋ_j-1≦ｋ≦ｋ_jの範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kを平均化したＭＤＣＴ係数（Ｌ_m,k＋Ｒ_m,k）／２を算出し、そのＭＤＣＴ係数（Ｌ_m,k＋Ｒ_m,k）／２を、ｋ_j-1≦ｋ≦ｋ_jの範囲内におけるＭＤＣＴ係数Ｌ_m,k’及びＲ_m,k’として出力する。一方、相関値Ｋ_Bj［ｍ］が閾値Ｋ_THBjよりも大きい場合は、上記の平均化を行わず、原信号に含まれるｋ_j-1≦ｋ≦ｋ_jの範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kをそのままｋ_j-1≦ｋ≦ｋ_jの範囲内におけるＭＤＣＴ係数Ｌ_m,k’及びＲ_m,k’として出力する（或いは、乗算器５３＿ｊによる上述の信号低減処理を行ってもよい）。 The signal reduction unit 53 compares the correlation value K _Bj [m] calculated by the correlation value calculation unit 52 — j with a predetermined threshold value K _THBj for each of j = 1, 2 _,. When the correlation value K _Bj [m] is smaller than the threshold value K _THBj , the signal reduction unit 53 includes the MDCT coefficients L _{m, k} and R within the range of k _j−1 ≦ k ≦ k _j included in the original signal. _m, averaged MDCT coefficient _{_{k (L m, k + R}} m, k) / 2 is calculated, the MDCT coefficients _{_{(L m, k + R m}} , k) / 2 _{a, k j-1 ≦ k ≦} k Output as MDCT coefficients L _{m, k} ′ and R _{m, k} ′ within the range of _j . On the other hand, when the correlation value K _Bj [m] is larger than the threshold value K _THBj , the above averaging is not performed, and the MDCT coefficient L _m, within the range of k _j−1 ≦ k ≦ k _j included in the original signal _{. k} and R _{m, k} are output as they are as MDCT coefficients L _{m, k} ′ and R _{m, k} ′ within the range of k _j−1 ≦ k ≦ k _j (or the above signal reduction processing by the multiplier 53_j is performed). You may go).

上記の平均化を行えば、風雑音の影響が各チャンネル間で均等になり、風雑音の影響を比較的多く受けているチャンネルの雑音レベルが低減する。また、細分化された帯域ごとに信号低減処理を行うことで、風雑音の影響のある帯域のみの雑音レベルを効率よく低減することができる。 If the above averaging is performed, the influence of wind noise is equalized between the channels, and the noise level of a channel that is relatively affected by wind noise is reduced. Further, by performing signal reduction processing for each subdivided band, it is possible to efficiently reduce the noise level only in the band affected by wind noise.

第４の変形信号低減処理を説明する。説明の具体化のため、まず、細分化されたｎ個の帯域の１つである、相関値演算部５２＿１に対応する帯域に注目する。第４の変形信号低減処理において、信号低減部５３は、相関値演算部５２＿１によって算出された相関値Ｋ_B1［ｍ］を所定の閾値Ｋ_THB1と比較する。そして、相関値Ｋ_B1［ｍ］が閾値Ｋ_THB1よりも小さい場合は、原信号に含まれる１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kの内、信号レベルが小さい方の信号（即ち、絶対値が小さい方のＭＤＣＴ係数）を最小音響信号として且つ他方の信号（即ち、絶対値が大きい方のＭＤＣＴ係数）を非最小音響信号として特定し、非最小音響信号を最小音響信号にて置き換える。 A fourth modified signal reduction process will be described. For the sake of specific description, first, attention is focused on a band corresponding to the correlation value calculation unit 52_1, which is one of the n subdivided bands. In the fourth modified signal reduction process, the signal reduction unit 53 compares the correlation value K _B1 [m] calculated by the correlation value calculation unit 52_1 with a predetermined threshold value K _THB1 . When the correlation value K _B1 [m] is smaller than the threshold value K _THB1 , the signal level among the MDCT coefficients L _{m, k} and R _{m, k} within the range of 14 ≦ k ≦ k ₁ included in the original signal. Is specified as the minimum acoustic signal and the other signal (that is, the MDCT coefficient with the larger absolute value) is specified as the non-minimum acoustic signal. Replace the signal with the minimum acoustic signal.

即ち、相関値Ｋ_B1［ｍ］が閾値Ｋ_THB1よりも小さい場合において、原信号に含まれる１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kの内、例えばＭＤＣＴ係数Ｒ_m,kが最小音響信号として特定されたときは、最小音響信号を表すそのＭＤＣＴ係数Ｒ_m,kが、１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k’として、且つ、１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｒ_m,k’として出力される。 That is, when the correlation value K _B1 [m] is smaller than the threshold value K _THB1 , the MDCT coefficients L _{m, k} and R _{m, k} within the range of 14 ≦ k ≦ k ₁ included in the original signal, for example, MDCT When the coefficient R _{m, k} is specified as the minimum acoustic signal, the MDCT coefficient R _{m, k} representing the minimum acoustic signal is the MDCT coefficient L _{m, k} ′ within the range of 14 ≦ k ≦ k ₁ , and , 14 ≦ k ≦ k ₁ , and output as MDCT coefficients R _{m, k} ′.

一方、相関値Ｋ_B1［ｍ］が閾値Ｋ_THB1よりも大きい場合は、上記の置き換えを行わず、原信号に含まれる１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kをそのまま１４≦ｋ≦ｋ₁の範囲内におけるＭＤＣＴ係数Ｌ_m,k’及びＲ_m,k’として出力する（或いは、乗算器５３＿１による上述の信号低減処理を行ってもよい）。 On the other hand, when the correlation value K _B1 [m] is larger than the threshold value K _THB1 , the above replacement is not performed, and MDCT coefficients L _{m, k} and R _m within the range of 14 ≦ k ≦ k ₁ included in the original signal are not performed. _{, k} are output as MDCT coefficients L _{m, k} ′ and R _{m, k} ′ within the range of 14 ≦ k ≦ k ₁ (or the above signal reduction processing by the multiplier 53_1 may be performed).

上記の処理は、細分化されたｎ個の帯域の夫々に対して個別に行われる。変数ｊを用いて一般化する。信号低減部５３は、ｊ＝１、２、・・・、ｎの夫々に関し、相関値演算部５２＿ｊによって算出された相関値Ｋ_Bj［ｍ］を所定の閾値Ｋ_THBjと比較する。そして、相関値Ｋ_Bj［ｍ］が閾値Ｋ_THBjよりも小さい場合は、原信号に含まれるｋ_j-1≦ｋ≦ｋ_jの範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kの内、信号レベルが小さい方の信号（即ち、絶対値が小さい方のＭＤＣＴ係数）を最小音響信号として且つ他方の信号（即ち、絶対値が大きい方のＭＤＣＴ係数）を非最小音響信号として特定し、非最小音響信号を最小音響信号にて置き換える。そして、その置き換え後のＭＤＣＴ係数を、ｋ_j-1≦ｋ≦ｋ_jの範囲内におけるＭＤＣＴ係数Ｌ_m,k’及びＲ_m,k’として出力する。 The above processing is performed individually for each of the subdivided n bands. Generalize using variable j. The signal reduction unit 53 compares the correlation value K _Bj [m] calculated by the correlation value calculation unit 52 — j with a predetermined threshold value K _THBj for each of j = 1, 2 _,. When the correlation value K _Bj [m] is smaller than the threshold value K _THBj , the MDCT coefficients L _{m, k} and R _{m, k} within the range of k _j−1 ≦ k ≦ k _j included in the original signal are included. The signal having the smaller signal level (ie, the MDCT coefficient having the smaller absolute value) is specified as the minimum acoustic signal, and the other signal (that is, the MDCT coefficient having the larger absolute value) is specified as the non-minimum acoustic signal. Replace non-minimum acoustic signal with minimum acoustic signal. Then, the replaced MDCT coefficients are output as MDCT coefficients L _{m, k} ′ and R _{m, k} ′ within the range of k _j−1 ≦ k ≦ k _j .

一方、相関値Ｋ_Bj［ｍ］が閾値Ｋ_THBjよりも大きい場合は、上記の置き換えを行わず、原信号に含まれるｋ_j-1≦ｋ≦ｋ_jの範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kをそのままｋ_j-1≦ｋ≦ｋ_jの範囲内におけるＭＤＣＴ係数Ｌ_m,k’及びＲ_m,k’として出力する（或いは、乗算器５３＿ｊによる上述の信号低減処理を行ってもよい）。 On the other hand, when the correlation value K _Bj [m] is larger than the threshold value K _THBj , the above replacement is not performed, and the MDCT coefficient L _{m, k} within the range of k _j−1 ≦ k ≦ k _j included in the original signal. And R _{m, k} are output as MDCT coefficients L _{m, k} ′ and R _{m, k} ′ within the range of k _j−1 ≦ k ≦ k _j (or the above signal reduction processing by the multiplier 53_j is performed). May be)

上記の置き換えを行えば、風雑音の影響が比較的少ないチャンネルの雑音レベルを増大させることなく、風雑音の影響を比較的多く受けているチャンネルの雑音レベルを低減させることができる。また、細分化された帯域ごとに信号低減処理を行うことで、風雑音の影響のある帯域のみの雑音レベルを効率よく低減することができる。 By performing the above replacement, it is possible to reduce the noise level of a channel relatively affected by wind noise without increasing the noise level of a channel relatively affected by wind noise. Further, by performing signal reduction processing for each subdivided band, it is possible to efficiently reduce the noise level only in the band affected by wind noise.

第３又は第４の変形信号低減処理を行って得られた１４≦ｋ≦６４の範囲内におけるＭＤＣＴ係数Ｌ_m,k’及びＲ_m,k’は合成され、その合成によって得られた中帯域のＭＤＣＴ係数は、信号低減部５３の出力信号として復元信号生成部６１及び信号合成部５５に送られる。 MDCT coefficients L _{m, k} ′ and R _{m, k} ′ within the range of 14 ≦ k ≦ 64 obtained by performing the third or fourth modified signal reduction processing are synthesized, and the intermediate band obtained by the synthesis These MDCT coefficients are sent to the restoration signal generation unit 61 and the signal synthesis unit 55 as output signals of the signal reduction unit 53.

また、低帯域用の風雑音判定部として機能する相関値演算部５１の演算結果に基づいて閾値Ｋ_THBjを可変設定するようにしてもよい。具体的には例えば、相関値演算部５１にて求められた相関値Ｋ_A［ｍ］が小さくなるに従って、上記の平均化又は置き換えが実行されやすくなるように閾値Ｋ_THBjを可変設定する。即ち、相関値Ｋ_A［ｍ］が小さくなるに従って、相関値Ｋ_Bj［ｍ］と比較されるべき閾値Ｋ_THBjを大きくする。 Further, the threshold value K _THBj may be variably set based on the calculation result of the correlation value calculation unit 51 that functions as a low-band wind noise determination unit. Specifically, for example, the threshold value K _THBj is variably set so that the above averaging or replacement is easily performed as the correlation value K _A [m] obtained by the correlation value calculation unit 51 becomes smaller. That is, as the correlation value K _A [m] decreases, the threshold value K _THBj to be compared with the correlation value K _Bj [m] is increased.

＜＜第３実施例＞＞
次に、第３実施例について説明する。第３実施例では、信号復元処理を時間軸上で実行し、その後、時間周波数変換を行ってから信号低減処理を周波数軸上で実行する。実現しやすい領域（時間領域又は周波数領域）でそれぞれの処理を行うことにより、より高精度で処理負荷の少ない風雑音低減部を形成できる。 << Third Example >>
Next, a third embodiment will be described. In the third embodiment, the signal restoration process is executed on the time axis, and then the signal reduction process is executed on the frequency axis after performing time-frequency conversion. By performing each process in an easily realizable area (time domain or frequency domain), it is possible to form a wind noise reduction unit with higher accuracy and less processing load.

図９は、第３実施例に係る風雑音低減部６ｃの内部ブロック図である。風雑音低減部６ｃは、図２の風雑音低減部６として利用される。風雑音低減部６ｃは、低帯域用の風雑音判定部として機能する風雑音判定部１１と、信号復元部１２と、中帯域用の風雑音判定部として機能する風雑音判定部５２と、信号低減部５３と、ＨＰＦ８１と、信号合成部８２と、時間周波数変換部８３と、信号合成部８４と、を備える。 FIG. 9 is an internal block diagram of the wind noise reduction unit 6c according to the third embodiment. The wind noise reduction unit 6c is used as the wind noise reduction unit 6 of FIG. The wind noise reduction unit 6c includes a wind noise determination unit 11 that functions as a wind noise determination unit for low band, a signal restoration unit 12, a wind noise determination unit 52 that functions as a wind noise determination unit for medium band, and a signal A reduction unit 53, an HPF 81, a signal synthesis unit 82, a time frequency conversion unit 83, and a signal synthesis unit 84 are provided.

風雑音低減部６ｃに対する入力信号（入力音響信号）は、図３の風雑音低減部６ａに対する入力信号と同じ信号Ｌ（ｔ）及びＲ（ｔ）である。この入力信号は風雑音低減部６ｃによって補正される。従って、風雑音低減部６ｃに対する入力信号を「原信号」と呼び、風雑音低減部６ｃの出力信号を「補正信号」と呼ぶことにする。 The input signals (input acoustic signals) to the wind noise reduction unit 6c are the same signals L (t) and R (t) as the input signals to the wind noise reduction unit 6a in FIG. This input signal is corrected by the wind noise reduction unit 6c. Accordingly, an input signal to the wind noise reduction unit 6c is referred to as an “original signal”, and an output signal of the wind noise reduction unit 6c is referred to as a “correction signal”.

第３実施例において、原信号は、ＢＰＦ２３、ＬＰＦ２１及び２６、並びに、ＨＰＦ８１の夫々に対して入力される。 In the third embodiment, the original signal is input to each of the BPF 23, the LPFs 21 and 26, and the HPF 81.

風雑音低減部６ｃにおける風雑音判定部１１及び信号復元部１２は、図３の風雑音低減部６ａにおけるそれらと同じものである。即ち、風雑音判定部１１にて算出された相関値に応じて、信号復元部１２にて原信号の低帯域信号と復元信号の低帯域信号が加重加算され、これによって信号復元部１２の出力信号（第１補正音響信号）が生成される。 The wind noise determination unit 11 and the signal restoration unit 12 in the wind noise reduction unit 6c are the same as those in the wind noise reduction unit 6a in FIG. That is, according to the correlation value calculated by the wind noise determination unit 11, the signal restoration unit 12 performs weighted addition of the low-band signal of the original signal and the low-band signal of the restoration signal, and thereby the output of the signal restoration unit 12 A signal (first corrected acoustic signal) is generated.

ＨＰＦ８１は、自身に対する入力信号の中帯域成分と高帯域成分のみを通過させる。 The HPF 81 passes only the middle band component and the high band component of the input signal to itself.

信号合成部８２は、信号復元処理によって風雑音が低減された低帯域の音響信号を表す信号復元部１２の出力信号と、ＨＰＦ８１の出力信号とを加算し、この加算によって得られた信号を、時間周波数変換部８３に出力する。尚、信号復元部１２及びＨＰＦ８１における各信号遅延量が異なる場合は、信号合成部８２内または信号合成部８２の前段にて、それらの信号遅延量間の差を打ち消す遅延処理を行ってから、信号合成部８２の加算処理を行うようにする。これは、乗算器２７及び２８及び加算器２９を用いた加重加算処理においても同様である。 The signal synthesis unit 82 adds the output signal of the signal restoration unit 12 that represents a low-band acoustic signal in which wind noise has been reduced by the signal restoration process, and the output signal of the HPF 81, and the signal obtained by this addition is It outputs to the time frequency conversion part 83. In addition, when each signal delay amount in the signal restoration unit 12 and the HPF 81 is different, after performing delay processing to cancel the difference between the signal delay amounts in the signal synthesis unit 82 or in the previous stage of the signal synthesis unit 82, The addition processing of the signal synthesis unit 82 is performed. The same applies to the weighted addition processing using the multipliers 27 and 28 and the adder 29.

信号合成部８２から出力される音響信号は、Ｌ信号及びＲ信号から成る時間軸上の音響信号である。信号合成部８２の出力信号を形成するＬ信号及びＲ信号の各値は、原信号を形成するＬ信号及びＲ信号のそれと異なるのであるが、信号合成部８２の出力信号を形成するＬ信号及びＲ信号も、説明の便宜上、Ｌ（ｔ）及びＲ（ｔ）で表すこととする。 The acoustic signal output from the signal synthesis unit 82 is an acoustic signal on the time axis composed of an L signal and an R signal. Each value of the L signal and R signal forming the output signal of the signal synthesis unit 82 is different from that of the L signal and R signal forming the original signal, but the L signal and the R signal forming the output signal of the signal synthesis unit 82 are different. The R signal is also expressed by L (t) and R (t) for convenience of explanation.

時間周波数変換部８３は、信号合成部８２の出力信号を時間周波数変換によって周波数軸上の信号に変換する。この時間周波数変換は、第２実施例で述べた時間周波数変換と同様である。即ち、時間周波数変換部８３は、時間周波数変換によって、信号合成部８２の出力信号を形成する、時間Δｔ毎に時間軸方向にサンプリングされたＬ信号Ｌ（ｔ）及びＲ信号Ｒ（ｔ）を、周波数Δｆ毎に周波数軸方向にサンプリングされたＬ信号Ｌ（ｆ）及びＲ信号Ｒ（ｆ）に変換して出力する。時間周波数変換部８３の前段にて低帯域に対する信号復元処理がなされているため、この変換によって得られたＬ信号Ｌ（ｆ）及びＲ信号Ｒ（ｆ）の低帯域成分の値は、図５の風雑音低減部６ｂに対する原信号（第２実施例におけるＬ（ｆ）及びＲ（ｆ））のそれと異なっているが、説明の便宜上、本実施例では、時間周波数変換部８３から出力されるＬ信号及びＲ信号をＬ（ｆ）及びＲ（ｆ）と表記する。 The time frequency conversion unit 83 converts the output signal of the signal synthesis unit 82 into a signal on the frequency axis by time frequency conversion. This time-frequency conversion is the same as the time-frequency conversion described in the second embodiment. That is, the time-frequency conversion unit 83 forms the output signal of the signal synthesis unit 82 by time-frequency conversion, and the L signal L (t) and the R signal R (t) sampled in the time axis direction every time Δt. The L signal L (f) and the R signal R (f) sampled in the frequency axis direction for each frequency Δf are converted and output. Since the signal restoration processing for the low band is performed in the previous stage of the time frequency conversion unit 83, the values of the low band components of the L signal L (f) and the R signal R (f) obtained by this conversion are shown in FIG. This is different from that of the original signal (L (f) and R (f) in the second embodiment) for the wind noise reduction unit 6b, but is output from the time-frequency conversion unit 83 in this embodiment for convenience of explanation. The L signal and the R signal are expressed as L (f) and R (f).

そして、説明の具体化のため、時間周波数変換部８３における時間周波数変換として、第２実施例と同様、修正離散コサイン変換(ＭＤＣＴ)を用いる場合を想定する。そして、第２実施例で述べた、ＭＤＣＴに対する具体例を本実施例にも適用する（Ｎ、Ｍ、ｍ、ｋなどの値の具体例も適用する）。そうすると、時間周波数変換部８３の出力信号を形成する信号Ｌ（ｆ）及びＲ（ｆ）は、ＭＤＣＴ係数Ｌ_m,k及びＲ_m,kにて表現することができる。 For the sake of concrete explanation, it is assumed that the modified discrete cosine transform (MDCT) is used as the time-frequency transform in the time-frequency transform unit 83 as in the second embodiment. The specific example for MDCT described in the second example is also applied to this example (specific examples of values such as N, M, m, and k are also applied). Then, the signals L (f) and R (f) forming the output signal of the time frequency conversion unit 83 can be expressed by MDCT coefficients L _{m, k} and R _{m, k} .

時間周波数変換部８３の出力信号を形成する信号Ｌ（ｆ）及びＲ（ｆ）の内、周波数帯域が中帯域に属する信号が風雑音判定部５２及び信号低減部５３に入力される。即ち、１４≦ｋ≦６４の範囲内におけるＭＤＣＴ係数Ｌ_m,k及びＲ_m,kが風雑音判定部５２及び信号低減部５３に入力される。 Of the signals L (f) and R (f) forming the output signal of the time-frequency conversion unit 83, a signal whose frequency band belongs to the middle band is input to the wind noise determination unit 52 and the signal reduction unit 53. That is, MDCT coefficients L _{m, k} and R _{m, k} within the range of 14 ≦ k ≦ 64 are input to the wind noise determination unit 52 and the signal reduction unit 53.

風雑音低減部６ｃにおける風雑音判定部５２及び信号低減部５３は、図５の風雑音低減部６ｂにおけるそれらと同じものである。即ち、中帯域をｎ個に細分化し、細分化された帯域ごとに、風雑音判定部５２にて算出された相関値に応じた低減率にて時間周波数変換８３の出力信号の中帯域を低減させる。この低減後の信号である、乗算器５３＿１、５３＿２、・・・、５３＿ｎの各出力信号は合成され、その合成によって得られた中帯域のＭＤＣＴ係数は、信号低減部５３の出力信号（第２補正音響信号）として信号合成部８４に送られる。尚、乗算器５３＿１、５３＿２、・・・、５３＿ｎの各出力信号の合成は、信号合成部８４にて行われると考えても良い。 The wind noise determination unit 52 and the signal reduction unit 53 in the wind noise reduction unit 6c are the same as those in the wind noise reduction unit 6b in FIG. That is, the middle band is subdivided into n, and the middle band of the output signal of the time-frequency conversion 83 is reduced for each subdivided band at a reduction rate according to the correlation value calculated by the wind noise determination unit 52. Let The output signals of the multipliers 53_1, 53_2,..., 53_n, which are the signals after the reduction, are combined, and the midband MDCT coefficient obtained by the combination is the output signal of the signal reduction unit 53 (second signal). Corrected acoustic signal) is sent to the signal synthesizer 84. It should be noted that the output signals of the multipliers 53_1, 53_2,..., 53_n may be combined by the signal combining unit 84.

時間周波数変換部８３の出力信号を形成する信号Ｌ（ｆ）及びＲ（ｆ）の内、周波数帯域が低帯域及び高帯域に属する信号は、そのまま、信号合成部８４に与えられる。信号合成部８４は、フレーム毎に、時間周波数変換部８３から直接与えられた低帯域及び高帯域に属する信号と、信号低減部５３の出力信号と、を合成し、この合成によって得られた信号を風雑音低減部６ｃの出力信号（即ち、補正信号）として出力する。第３実施例において、この補正信号は、複数のチャンネル信号から成る周波数軸上の音響信号である。 Of the signals L (f) and R (f) forming the output signal of the time-frequency conversion unit 83, signals belonging to the low and high frequency bands are supplied to the signal synthesis unit 84 as they are. The signal synthesizer 84 synthesizes the signal belonging to the low band and the high band directly given from the time-frequency converter 83 and the output signal of the signal reducer 53 for each frame, and the signal obtained by this synthesis. Is output as an output signal (that is, a correction signal) of the wind noise reduction unit 6c. In the third embodiment, the correction signal is an acoustic signal on the frequency axis composed of a plurality of channel signals.

図２の音声信号処理部４において、信号合成部８４から出力される補正信号は、ＡＡＣの符号化方式に従って量子化され、符号化音声信号としてのビットストリームに変換される。この符号化音声信号（ビットストリーム）は、図２の記録媒体５に記録される。 In the audio signal processing unit 4 of FIG. 2, the correction signal output from the signal synthesis unit 84 is quantized according to the AAC encoding method and converted into a bit stream as an encoded audio signal. This encoded audio signal (bit stream) is recorded on the recording medium 5 of FIG.

上述の説明では、原則として、Ｌ信号とＲ信号に対する信号処理を個別に分けて記述していないが、上述したように、風雑音低減部６ｃ内の各部位は、複数のチャンネル信号の夫々に対して個別に必要な信号処理を行う。 In the above description, in principle, the signal processing for the L signal and the R signal is not separately described. However, as described above, each part in the wind noise reduction unit 6c is assigned to each of a plurality of channel signals. On the other hand, necessary signal processing is performed individually.

つまり、ＨＰＦ８１は、原信号を形成するＬ信号の中帯域及び高帯域成分と原信号を形成するＲ信号の中帯域及び高帯域成分のみを通過させる。信号合成部８２は、信号復元部１２の出力信号におけるＬ信号とＨＰＦ８１の出力信号におけるＬ信号とを加算する共に信号復元部１２の出力信号におけるＲ信号とＨＰＦ８１の出力信号におけるＲ信号とを加算する。時間周波数変換部８３は、与えられた時間軸上のＬ信号とＲ信号を個別に時間周波数変換する。信号合成部８４は、信号低減部５３の出力信号におけるＬ信号と時間周波数変換部８３の出力信号中の低帯域及び高帯域におけるＬ信号とを合成すると共に信号低減部５３の出力信号におけるＲ信号と時間周波数変換部８３の出力信号中の低帯域及び高帯域におけるＲ信号とを合成することによって、補正信号を生成する。尚、ＬＰＦ２１等に関しては、第１又は第２実施例にて述べた通りである。 That is, the HPF 81 passes only the middle band and high band components of the L signal forming the original signal and the middle band and high band components of the R signal forming the original signal. The signal synthesis unit 82 adds the L signal in the output signal of the signal restoration unit 12 and the L signal in the output signal of the HPF 81, and adds the R signal in the output signal of the signal restoration unit 12 and the R signal in the output signal of the HPF 81. To do. The time-frequency conversion unit 83 performs time-frequency conversion on the given L and R signals on the time axis individually. The signal synthesis unit 84 synthesizes the L signal in the output signal of the signal reduction unit 53 and the L signal in the low band and the high band in the output signal of the time frequency conversion unit 83 and also the R signal in the output signal of the signal reduction unit 53. And the R signal in the low band and the high band in the output signal of the time-frequency conversion unit 83 are combined to generate a correction signal. The LPF 21 and the like are as described in the first or second embodiment.

また、第１実施例でも述べたのと同様、風雑音低減部６ｃから、風雑音判定部１１を省略することも可能である。風雑音判定部１１を省略した場合、乗算器２７及び２８及び加算器２９は、ＬＰＦ２５及び２６の各出力信号値を一定の比率で加重加算することにより、信号復元部１２の出力信号（第１補正音響信号）を生成する。また、第２実施例でも述べたのと同様、風雑音低減部６ｃから、風雑音判定部５２を省略することも可能である。風雑音判定部５２を省略した場合、乗算器５３＿ｊは、時間周波数変換部８３の出力信号中の中帯域における信号のレベルを一定の低減率にて低減させ、低減後の信号を出力する。尚、風雑音判定部１１及び風雑音判定部５２が省略された場合、風雑音低減部６ｃに対する入力信号は１つのチャンネル信号から成るモノラル信号であってもよい。 Further, as described in the first embodiment, the wind noise determination unit 11 can be omitted from the wind noise reduction unit 6c. When the wind noise determination unit 11 is omitted, the multipliers 27 and 28 and the adder 29 perform weighted addition of the output signal values of the LPFs 25 and 26 at a constant ratio, thereby generating an output signal (first signal) of the signal restoration unit 12. Corrected acoustic signal) is generated. Further, as described in the second embodiment, the wind noise determination unit 52 can be omitted from the wind noise reduction unit 6c. When the wind noise determination unit 52 is omitted, the multiplier 53_j reduces the signal level in the middle band in the output signal of the time-frequency conversion unit 83 at a constant reduction rate, and outputs the signal after the reduction. When the wind noise determination unit 11 and the wind noise determination unit 52 are omitted, the input signal to the wind noise reduction unit 6c may be a monaural signal composed of one channel signal.

また、風雑音低減部６ｃに、低帯域用の風雑音判定部１１と中帯域用の風雑音判定部５２とを独立に設け、前者の判定結果を信号復元部１２の処理にのみ反映させ且つ後者の判定結果を信号低減部５３の処理にのみ反映させているが、第２実施例で述べたのと同様、各判定結果を相互利用するようにしてもよい。即ち例えば、或る着目フレームに関し、相関値演算部２２が算出した相関値と相関値演算部５２＿ｊが算出した相関値に基づいて、着目フレームにおける乗算器５３＿ｊでの低減率を決定する。より具体的には例えば、相関値演算部５２＿ｊが算出した相関値の減少に従って該低減率を増大させるだけでなく、相関値演算部２２が算出した相関値の減少に従っても該低減率を増大させるようにする。 Further, the wind noise reduction unit 6c is provided with the wind noise determination unit 11 for low band and the wind noise determination unit 52 for medium band independently, and the former determination result is reflected only in the processing of the signal restoration unit 12; Although the latter determination result is reflected only in the processing of the signal reduction unit 53, each determination result may be used mutually as described in the second embodiment. That is, for example, for a certain frame of interest, the reduction rate in the multiplier 53_j in the frame of interest is determined based on the correlation value calculated by the correlation value calculation unit 22 and the correlation value calculated by the correlation value calculation unit 52_j. More specifically, for example, not only the reduction rate is increased according to the decrease of the correlation value calculated by the correlation value calculation unit 52_j, but also the reduction rate is increased according to the decrease of the correlation value calculated by the correlation value calculation unit 22. Like that.

また、第２実施例にて上述した第３及び第４の変形信号低減処理は、第３実施例に対しても適用可能である。当然ではあるが、第３及び第４の変形信号低減処理を第３実施例に適用する場合、第３及び第４の変形信号低減処理の説明文で述べた「原信号及び信号合成部５５」は、夫々、「時間周波数変換部８３の出力信号及び信号合成部８４に読み替えられる。また、第３及び第４の変形信号低減処理を第３実施例に適用する場合、低帯域用の風雑音判定部１１の判定結果に応じて閾値Ｋ_THBjを可変設定するようにしてもよい。具体的には例えば、或る着目フレームに関し、相関値演算部２２にて求められた相関値が小さくなるに従って、上記の平均化又は置き換えが実行されやすくなるように閾値Ｋ_THBjを可変設定する。即ち、或る着目フレームに関し、相関値演算部２２にて求められた相関値が小さくなるに従って、相関値Ｋ_Bj［ｍ］と比較されるべき閾値Ｋ_THBjを大きくする。 Further, the third and fourth modified signal reduction processes described in the second embodiment can be applied to the third embodiment. Needless to say, when the third and fourth modified signal reduction processes are applied to the third embodiment, the “original signal and signal combining unit 55” described in the description of the third and fourth modified signal reduction processes is provided. Are read as “the output signal of the time-frequency converter 83 and the signal synthesizer 84. Also, when the third and fourth modified signal reduction processes are applied to the third embodiment, the wind noise for the low band is used. The threshold value K _THBj may be variably set according to the determination result of the determination unit 11. Specifically, for example, for a certain frame of interest, as the correlation value obtained by the correlation value calculation unit 22 decreases. The threshold value K _THBj is variably set so that the above averaging or replacement is easily performed, that is, as the correlation value obtained by the correlation value calculation unit 22 for a certain frame of _interest decreases, the correlation value K is compared with _Bj [m] A larger threshold K _THBj should.

第１〜第３実施例によれば、風雑音低減用の処理に起因する低帯域信号の歪みを解消することができる。また、中帯域についても、信号低減処理によって風雑音の影響を抑制することができる。 According to the first to third embodiments, it is possible to eliminate the distortion of the low-band signal caused by the wind noise reduction process. In addition, the influence of wind noise can be suppressed also in the middle band by the signal reduction process.

ここで、第１〜第３実施例の夫々の風雑音低減部の利点を考察する。
第１実施例に係る風雑音低減部６ａ（図３）では、信号復元処理と信号低減処理を並列に行うことができる。また、時間領域のみで信号処理を行うことができるため、時間周波数変換が不要となる。
第２実施例に係る風雑音低減部６ｂ（図５）では、周波数領域で信号処理を行うため、帯域別の処理を直感的に行うことができる。信号低減処理を施す中帯域を容易に細かく分けることが可能であり、風の影響のある帯域にのみ信号低減を行うといったことが可能となる。
第３実施例に係る風雑音低減部６ｃ（図９）は、ＡＡＣなどに従ったエンコーダ（符号化器）に組み込みやすく、実用性が高い。 Here, the advantages of the respective wind noise reduction units of the first to third embodiments will be considered.
In the wind noise reduction unit 6a (FIG. 3) according to the first embodiment, signal restoration processing and signal reduction processing can be performed in parallel. In addition, since signal processing can be performed only in the time domain, time-frequency conversion is not necessary.
Since the wind noise reduction unit 6b (FIG. 5) according to the second embodiment performs signal processing in the frequency domain, it can intuitively perform processing for each band. It is possible to easily divide the middle band on which signal reduction processing is performed, and it is possible to perform signal reduction only in a band affected by wind.
The wind noise reduction unit 6c (FIG. 9) according to the third embodiment is easy to incorporate into an encoder (encoder) according to AAC or the like, and has high practicality.

＜＜第４実施例＞＞
上述したように、第３実施例に係る風雑音低減部６ｃは、ＡＡＣなどに従ったエンコーダへの組み込みが容易である。例えば、時間周波数変換にＭＤＣＴを利用できたり、出力される周波数軸上の補正信号をエンコーダの量子化処理にそのまま利用できたりする。エンコーダへの組み込みに関する実施例として、第４実施例を説明する。 << 4th Example >>
As described above, the wind noise reduction unit 6c according to the third embodiment can be easily incorporated into an encoder according to AAC or the like. For example, MDCT can be used for time-frequency conversion, or the output correction signal on the frequency axis can be used as it is for quantization processing of the encoder. A fourth embodiment will be described as an embodiment relating to incorporation into an encoder.

図９の風雑音低減部６ｃと組み合わせて使用可能なＡＡＣエンコーダ１１０の内部ブロック図を、図１０に示す。ＡＡＣエンコーダ１１０は、図２の音声信号処理部４に内蔵される。ＡＡＣエンコーダ１１０内の各部位の動作は、ＡＡＣの規格に従うものであるため説明を割愛する。ＡＡＣエンコーダ１１０内に設けられたフィルタバンク１１１が、修正離散コサイン変換を行う部分であり、図９の時間周波数変換部８３に相当することになる。 FIG. 10 shows an internal block diagram of the AAC encoder 110 that can be used in combination with the wind noise reduction unit 6c of FIG. The AAC encoder 110 is built in the audio signal processing unit 4 of FIG. Since the operation of each part in the AAC encoder 110 conforms to the AAC standard, description thereof is omitted. The filter bank 111 provided in the AAC encoder 110 is a part that performs the modified discrete cosine transform, and corresponds to the time-frequency transform unit 83 in FIG.

ＡＡＣエンコーダ１１０の前段に、図９の風雑音低減部６ｃにおける風雑音判定部１１、信号復元部１２、ＨＰＦ８１及び信号合成部８２を設け、信号合成部８２の出力信号をＡＡＣエンコーダ１１０の入力信号として与える。そして、時間周波数変換部８３の出力信号に対応するフィルタバンク１１１の出力信号の中帯域を信号低減部５３にて補正し、この補正を介した信号（即ち、図９の信号合成部８４から出力される補正信号）を、フィルタバンク１１１の出力信号を必要とする部位（ＴＮＳ（Temporal Noise Shaping）及びビットストリームマルチプレクサ）に供給するようにする。この補正を介してＡＡＣエンコーダ１１０から出力されるビットストリームが、図２の記録媒体５に記録される。 The wind noise determination unit 11, the signal restoration unit 12, the HPF 81, and the signal synthesis unit 82 in the wind noise reduction unit 6 c of FIG. 9 are provided in the previous stage of the AAC encoder 110, and the output signal of the signal synthesis unit 82 is input to the input signal of the AAC encoder 110. Give as. Then, the signal reduction unit 53 corrects the intermediate band of the output signal of the filter bank 111 corresponding to the output signal of the time-frequency conversion unit 83, and outputs a signal (that is, output from the signal synthesis unit 84 of FIG. 9). Correction signal) is supplied to a portion (TNS (Temporal Noise Shaping) and bit stream multiplexer) that requires the output signal of the filter bank 111. The bit stream output from the AAC encoder 110 via this correction is recorded on the recording medium 5 in FIG.

尚、ＡＡＣエンコーダ１１０などのエンコーダに風雑音低減部（６ｂ又は６ｃ）を組み込む場合、帯域の分割の仕方を、組み込むエンコーダにおけるオーディオフォーマットに合わせておくと良い。これにより、処理を簡略化することができる。つまり例えば、第２又は第３実施例で述べたＭＤＣＴ係数Ｌ_m,k及びＲ_m,kの表現形式（ｋの取りうる値など）をエンコーダで用いるＭＤＣＴ係数のそれと合わせておくとよい。 Note that when the wind noise reduction unit (6b or 6c) is incorporated into an encoder such as the AAC encoder 110, the band division method may be matched to the audio format of the incorporated encoder. Thereby, processing can be simplified. That is, for example, the MDCT coefficients L _{m, k} and R _{m, k} described in the second or third embodiment may be combined with the MDCT coefficients used in the encoder.

また、ＡＡＣエンコーダ１１０などのエンコーダに風雑音低減部（６ｂ又は６ｃ）を組み込む場合、隣接するフレーム間で時間軸上の音響信号がオーバーラップされることがある。つまり例えば、第２又は第３実施例で述べたＭＤＣＴの具体例の場合、図６に示す如く、隣接するフレーム間で１０２４サンプルの時間軸上の音響信号がオーバーラップされる。このような場合、低帯域用の風雑音判定と中帯域用の風雑音判定が同等となるように、風雑音低減部６ｃの風雑音判定部１１及び信号復元部１２にて、以下のように処理することが望ましい。 In addition, when the wind noise reduction unit (6b or 6c) is incorporated in an encoder such as the AAC encoder 110, acoustic signals on the time axis may overlap between adjacent frames. That is, for example, in the case of the specific example of MDCT described in the second or third embodiment, as shown in FIG. 6, the acoustic signals on the time axis of 1024 samples are overlapped between adjacent frames. In such a case, in the wind noise determination unit 11 and the signal restoration unit 12 of the wind noise reduction unit 6c, the low-band wind noise determination and the mid-band wind noise determination are equivalent as follows. It is desirable to process.

即ち、風雑音判定部１１の相関値演算部２２は、各フレームに対し、上記式（１）に従って相関値を算出する。これは、第１実施例で導入した「単位区間」をＭＤＣＴに適応した「フレーム」として取り扱うことによって実現される。この場合、図４に示す状況と異なるが、隣接する単位区間同士は単位区間の半分だけ重なり合うことになる。そして例えば、時間軸上における１番目〜２０４８番目の音響信号に基づいて或るフレームに対する相関値が算出され、その後、時間軸上における１０２５番目〜３０７２番目の音響信号に基づいて次のフレームに対する相関値が算出されることになる。乗算器２７及び２８並びに加算器２９は、第ｍのフレームの前半（又は後半）１０２４サンプルについてのＬＰＦ２５及び２６の出力信号値を、相関値演算部２２が第ｍのフレームについて算出した相関値に応じて上記式（２）に従って加重加算し、これによって信号復元部１２の出力信号を形成するようにする。 That is, the correlation value calculation unit 22 of the wind noise determination unit 11 calculates a correlation value according to the above equation (1) for each frame. This is realized by handling the “unit section” introduced in the first embodiment as a “frame” adapted to MDCT. In this case, although different from the situation shown in FIG. 4, adjacent unit sections overlap each other by half of the unit sections. Then, for example, a correlation value for a certain frame is calculated based on the first to 2048th acoustic signals on the time axis, and thereafter, a correlation for the next frame is calculated based on the 1025th to 3072th acoustic signals on the time axis. A value will be calculated. The multipliers 27 and 28 and the adder 29 convert the output signal values of the LPFs 25 and 26 for 1024 samples of the first half (or second half) of the m-th frame to the correlation values calculated by the correlation value calculation unit 22 for the m-th frame. Accordingly, the weighted addition is performed according to the above equation (2), thereby forming the output signal of the signal restoration unit 12.

＜＜第５実施例＞＞
上述の各実施例では、各マイクロホンの出力信号に対してリアルタイムに風雑音低減用の補正処理（信号復元処理及び信号低減処理）を施し、これによって得られた補正信号を図２の記録媒体５に保存するようにしているが、補正処理を実行するタイミングは任意である。 << 5th Example >>
In each of the embodiments described above, correction processing (signal restoration processing and signal reduction processing) for reducing wind noise is performed in real time on the output signal of each microphone, and the correction signal obtained thereby is used as the recording medium 5 in FIG. However, the timing for executing the correction process is arbitrary.

例えば、マイクロホンＭＩＣ１及びＭＩＣ２の出力信号に基づく補正前の時間軸上の原信号又は補正前の周波数軸上の原信号を、一旦、記録媒体５に生データとして記録しておく。勿論、この記録の際に、適宜、圧縮処理等の信号処理を行っても構わない。そして、音声再生時などにおいて、その生データから補正前の時間軸上の原信号又は補正前の周波数軸上の原信号を再現し、その再現された原信号を風雑音低減部（６ａ、６ｂ又は６ｃ）に与えることによって補正信号を得るようにしてもよい。そして、音声再生時には、この補正信号を再生出力すれば良い。 For example, the original signal on the time axis before correction or the original signal on the frequency axis before correction based on the output signals of the microphones MIC1 and MIC2 is once recorded as raw data on the recording medium 5. Of course, signal processing such as compression processing may be appropriately performed during the recording. Then, at the time of audio reproduction or the like, the original signal on the time axis before correction or the original signal on the frequency axis before correction is reproduced from the raw data, and the reproduced original signal is reproduced in the wind noise reduction unit (6a, 6b or 6c). The correction signal may be obtained by giving the correction signal. Then, at the time of audio reproduction, the correction signal may be reproduced and output.

上述の説明から明らかなように、上記の生データから音響信号を再生する音響信号再生装置に風雑音低減部（６ａ、６ｂ又は６ｃ）を含む音声信号処理部を搭載することも可能であり、その場合も、風雑音低減部は有効に機能する。つまり、本発明は、音響信号再生装置にも適用可能である。集音時には生データを記録しておき、風雑音低減用の補正処理を音響信号再生装置側に担わせるようにしておけば、該補正処理の実施／不実施を再生時に自由に切り替えることも可能である。 As is clear from the above description, it is also possible to mount an audio signal processing unit including a wind noise reduction unit (6a, 6b or 6c) on an acoustic signal reproduction device that reproduces an acoustic signal from the raw data. Even in this case, the wind noise reduction unit functions effectively. That is, the present invention can also be applied to an acoustic signal reproduction device. If raw data is recorded at the time of sound collection and correction processing for reducing wind noise is performed on the sound signal playback device side, it is possible to freely switch execution / non-execution of the correction processing during playback. It is.

また、音声信号処理部４が設けられる装置として撮像装置１を例示したが、同様の音声信号処理部を、他の録音装置又は録音機能を備えた装置に設けることが可能である。他の録音装置又は録音機能を備えた装置には、例えば、ＩＣレコーダ等の携帯型録音装置や、録音機能を備えた携帯電話機が含まれる。これらの装置には、図２に示されるマイクロホンＭＩＣ１及びＭＩＣ２、音声信号処理部４及び記録媒体５が設けられることになる。 Further, although the imaging apparatus 1 is illustrated as an apparatus provided with the audio signal processing unit 4, a similar audio signal processing unit can be provided in another recording apparatus or an apparatus having a recording function. Other recording devices or devices having a recording function include, for example, portable recording devices such as IC recorders and mobile phones having a recording function. These devices are provided with the microphones MIC1 and MIC2, the audio signal processing unit 4, and the recording medium 5 shown in FIG.

＜＜変形等＞＞
上述した説明文中に示した具体的な数値は、単なる例示であって、当然の如く、それらを様々な数値に変更することができる。上述の実施形態の変形例または注釈事項として、以下に、注釈１〜注釈４を記す。各注釈に記載した内容は、矛盾なき限り、任意に組み合わせることが可能である。 << Deformation, etc. >>
The specific numerical values shown in the above description are merely examples, and as a matter of course, they can be changed to various numerical values. As modifications or annotations of the above-described embodiment, notes 1 to 4 are described below. The contents described in each comment can be arbitrarily combined as long as there is no contradiction.

［注釈１］
上述の説明では、説明の具体化のため、時間周波数変換として修正離散コサイン変換（ＭＤＣＴ）を用いる場合を例示したが、勿論、これは例示であり、その他の任意の時間周波数変換を利用することが可能である。 [Note 1]
In the above description, the case where the modified discrete cosine transform (MDCT) is used as the time frequency conversion is illustrated for the purpose of concrete description, but of course, this is an example, and any other time frequency conversion may be used. Is possible.

［注釈２］
説明の簡略化上、マイクロホンの個数を２に限定し、２つのチャンネル信号から形成される音響信号を補正する手法を上述したが、本発明において、マイクロホンの個数は２に限定されない。つまり、上述の各実施例に記載した技術を、３以上のマイクロホンの出力信号に基づく３以上のチャンネル信号から成るマルチチャンネル信号に対して適用することもできる。各実施例でチャンネル信号ごとに信号復元処理及び信号低減処理を行っていたのと同様、各実施例に記載した技術をマルチチャンネル信号に対して適用する場合も、基本的には、チャンネル信号ごとに信号復元処理及び信号低減処理を行えばよい。 [Note 2]
For simplification of description, the number of microphones is limited to 2 and the method of correcting an acoustic signal formed from two channel signals has been described above. However, in the present invention, the number of microphones is not limited to 2. That is, the technique described in each of the above-described embodiments can be applied to a multi-channel signal including three or more channel signals based on output signals of three or more microphones. As in the case where the signal restoration processing and the signal reduction processing are performed for each channel signal in each embodiment, when the technique described in each embodiment is applied to a multi-channel signal, basically, for each channel signal. Then, signal restoration processing and signal reduction processing may be performed.

また、各実施例に記載した技術を、第１、第２、・・・第ｑのチャンネル信号から成るマルチチャンネル信号に対して適用する場合、風雑音判定は、以下のように行えばよい（ｑは３以上の整数）。
例えば、第１〜第ｑのチャンネル信号の中から２つのチャンネル信号を選定しておき、その選定された２つのチャンネル信号を上記のＬ信号及びＲ信号と取り扱って、各実施例で述べたのと同様に、相関値演算を介して風雑音の影響度合いを判定を行う。
或いは例えば、第１〜第ｑのチャンネル信号の内の２つのチャンネル信号の各組み合わせに対して、組み合わせ毎に２つのチャンネル信号間の相互相関を表す相関値を求め、組み合わせ毎に求められた相関値の最大値、平均値、最小値などに基づいて風雑音の影響度合いを判定する。
或いは例えば、第１〜第ｑのチャンネル信号の内の３以上のチャンネル信号間の相互相関を表す相関値を求め、この相関値に基づいて風雑音の影響度合いを判定する。 In addition, when the technique described in each embodiment is applied to a multi-channel signal including first, second,..., Q-th channel signals, wind noise determination may be performed as follows ( q is an integer of 3 or more).
For example, two channel signals are selected from the first to q-th channel signals, and the selected two channel signals are handled as the L signal and the R signal described above in the respective embodiments. Similarly, the degree of wind noise influence is determined through correlation value calculation.
Or, for example, for each combination of two channel signals of the first to q-th channel signals, a correlation value representing a cross-correlation between the two channel signals is obtained for each combination, and the correlation obtained for each combination is obtained. The degree of influence of wind noise is determined based on the maximum value, average value, minimum value, and the like.
Alternatively, for example, a correlation value indicating a cross-correlation between three or more channel signals among the first to q-th channel signals is obtained, and the influence level of wind noise is determined based on the correlation value.

また、上述の第１〜第４の変形信号処理をマルチチャンネル信号に対して適用することもできる。 Further, the above-described first to fourth modified signal processes can be applied to the multichannel signal.

第１実施例に着目した場合、マルチチャンネル信号を形成する第１〜第ｑのチャンネル信号はＢＰＦ３０に与えられる。そして、マルチチャンネル信号に第１の変形信号処理を適用する場合は、相関値演算を介して風雑音の影響度合いを判定し、その影響度合いが比較的大きいと判断したならば、ＢＰＦ３０を通過した第１〜第ｑのチャンネル信号を平均化し、この平均化後の各チャンネル信号にて信号低減部１３の出力信号を形成すればよい。 When attention is paid to the first embodiment, the first to qth channel signals forming the multichannel signal are given to the BPF 30. When the first modified signal processing is applied to the multi-channel signal, the influence level of the wind noise is determined through the correlation value calculation. If it is determined that the influence degree is relatively large, the BPF 30 is passed. The first to q-th channel signals may be averaged, and the output signal of the signal reduction unit 13 may be formed by the averaged channel signals.

マルチチャンネル信号に第２の変形信号処理を適用する場合は、相関値演算を介して風雑音の影響度合いを判定し、その影響度合いが比較的大きいと判断したならば、ＢＰＦ３０を通過した第１〜第ｑのチャンネル信号の信号レベルを互いに対比する。そして、ＢＰＦ３０を通過した第１〜第ｑのチャンネル信号の内、信号レベルが最も小さいチャンネル信号を最小音響信号として且つそれ以外の各チャンネル信号を非最小音響信号として特定し、全ての非最小音響信号を最小音響信号にて置き換えて、置き換え後の各チャンネル信号にて信号低減部１３の出力信号を形成すればよい。 When the second modified signal processing is applied to the multi-channel signal, the influence level of the wind noise is determined through the correlation value calculation. If it is determined that the influence degree is relatively large, the first passing through the BPF 30 is determined. The signal levels of the q-th channel signals are compared with each other. Then, among the first to q-th channel signals that have passed through the BPF 30, the channel signal having the lowest signal level is specified as the minimum acoustic signal and each other channel signal is specified as the non-minimum acoustic signal, and all non-minimum acoustic signals The signal may be replaced with the minimum acoustic signal, and the output signal of the signal reduction unit 13 may be formed with each channel signal after replacement.

マルチチャンネル信号に第３の変形信号処理を適用する場合は、中帯域をｎ個に細分化し、ｎ個に細分化された帯域ごとに相関値演算を介して風雑音の影響度合いを判定する。そして、ｎ個に細分化された帯域ごとに影響度合いの大小を判別し、その影響度合いが比較的大きいと判断した帯域に対して、周波数軸上の第１〜第ｑのチャンネル信号（即ち、ＭＤＣＴ係数）を平均化し、この平均化後の各チャンネル信号にて信号低減部５３の出力信号を形成すればよい。 When the third modified signal processing is applied to the multi-channel signal, the middle band is subdivided into n, and the influence level of wind noise is determined for each band subdivided into n through correlation value calculation. Then, the degree of influence is determined for each of the n subdivided bands, and the first to q-th channel signals on the frequency axis (that is, the band whose influence degree is determined to be relatively large) MDCT coefficients) may be averaged, and the output signal of the signal reduction unit 53 may be formed with each channel signal after the averaging.

マルチチャンネル信号に第４の変形信号処理を適用する場合も、中帯域をｎ個に細分化し、ｎ個に細分化された帯域ごとに相関値演算を介して風雑音の影響度合いを判定する。そして、ｎ個に細分化された帯域ごとに影響度合いの大小を判別し、その影響度合いが比較的大きいと判断した帯域に対して、周波数軸上の第１〜第ｑのチャンネル信号（即ち、ＭＤＣＴ係数）間の大小関係を評価し、信号レベルが最も小さいチャンネル信号を最小音響信号として且つそれ以外の各チャンネル信号を非最小音響信号として特定する。その後、全ての非最小音響信号を最小音響信号にて置き換えて、置き換え後の各チャンネル信号にて信号低減部５３の出力信号を形成すればよい。 Even when the fourth modified signal processing is applied to the multi-channel signal, the middle band is subdivided into n pieces, and the influence level of wind noise is determined for each band subdivided into n pieces through correlation value calculation. Then, the degree of influence is determined for each of the n subdivided bands, and the first to q-th channel signals on the frequency axis (that is, the band whose influence degree is determined to be relatively large) The magnitude relationship between the MDCT coefficients) is evaluated, and the channel signal with the lowest signal level is specified as the minimum acoustic signal, and each other channel signal is specified as the non-minimum acoustic signal. Thereafter, all the non-minimum acoustic signals may be replaced with the minimum acoustic signals, and the output signal of the signal reduction unit 53 may be formed with each channel signal after replacement.

［注釈３］
図３、図５又は図９に示される風雑音低減部６ａ、６ｂ又は６ｃは、ハードウェア、ソフトウェア、またはハードウェアとソフトウェアの組み合わせによって実現可能である。ソフトウェアを用いて風雑音低減部（６ａ、６ｂ又は６ｃ）を構成する場合、ソフトウェアにて実現される部位についてのブロック図は、その部位の機能ブロック図を表すことになる。 [Note 3]
The wind noise reduction unit 6a, 6b, or 6c shown in FIG. 3, FIG. 5, or FIG. 9 can be realized by hardware, software, or a combination of hardware and software. When the wind noise reduction unit (6a, 6b, or 6c) is configured using software, a block diagram of a part realized by software represents a functional block diagram of the part.

また、風雑音低減部（６ａ、６ｂ又は６ｃ）にて実現される機能の全部または一部を、プログラムとして記述し、該プログラムをプログラム実行装置（例えばコンピュータ）上で実行することによって、その機能の全部または一部を実現するようにしてもよい。 Further, all or part of the functions realized by the wind noise reduction unit (6a, 6b or 6c) is described as a program, and the function is executed by executing the program on a program execution device (for example, a computer). You may make it implement | achieve all or one part.

［注釈４］
例えば、以下のように考えることができる。
本発明に係る風雑音低減装置は、入力音響信号に含まれる低帯域の音響信号とは異なる、低帯域の音響信号を信号復元処理によって生成する信号生成手段を備えており、風雑音低減装置６ａ又は６ｃでは、符号２３〜２５にて参照される各部位によって信号生成手段が形成され、風雑音低減装置６ｂでは、復元信号生成部６１が信号生成手段として機能する（図３、図５及び図９参照）。
低帯域の補正音響信号を生成する第１補正手段の機能は、風雑音低減装置６ａ又は６ｃでは信号復元部１２が担い、雑音低減装置６ｂでは信号復元部５４が担う。
中帯域の補正音響信号を生成する第２補正手段の機能は、風雑音低減装置６ａでは信号低減部１３が担い、雑音低減装置６ｂ又は６ｃでは信号低減部５３が担う。この第２補正手段に、風雑音判定部１１（図３）又は風雑音判定部５２（図５、図９）が含まれていると考えることも含まれていないと考えることもできる。 [Note 4]
For example, it can be considered as follows.
The wind noise reduction device according to the present invention includes signal generation means for generating a low-band acoustic signal different from the low-band acoustic signal included in the input acoustic signal by signal restoration processing, and the wind noise reduction device 6a. Alternatively, in 6c, a signal generation unit is formed by each part referred to by reference numerals 23 to 25, and in the wind noise reduction device 6b, the restoration signal generation unit 61 functions as a signal generation unit (FIGS. 3, 5, and 5). 9).
The function of the 1st correction | amendment means which produces | generates a correction | amendment acoustic signal of a low band is responsible for the signal restoration part 12 in the wind noise reduction apparatus 6a or 6c, and the signal restoration part 54 in the noise reduction apparatus 6b.
The function of the 2nd correction | amendment means which produces | generates the correction | amendment acoustic signal of a middle band is responsible for the signal reduction part 13 in the wind noise reduction apparatus 6a, and the signal reduction part 53 takes charge in the noise reduction apparatus 6b or 6c. It can be considered that the second correction means includes or does not include the wind noise determination unit 11 (FIG. 3) or the wind noise determination unit 52 (FIGS. 5 and 9).

本発明の実施形態に係る撮像装置の外観斜視図である。1 is an external perspective view of an imaging apparatus according to an embodiment of the present invention. 図１の撮像装置の電気的構成を表す概略ブロック図である。It is a schematic block diagram showing the electric constitution of the imaging device of FIG. 本発明の第１実施例に係る、図２の風雑音低減部の内部ブロック図である。FIG. 3 is an internal block diagram of the wind noise reduction unit of FIG. 2 according to the first embodiment of the present invention. 本発明の第１実施例に係る、信号処理の単位区間を表す図である。It is a figure showing the unit area of signal processing concerning the 1st example of the present invention. 本発明の第２実施例に係る、図２の風雑音低減部の内部ブロック図である。FIG. 6 is an internal block diagram of the wind noise reduction unit of FIG. 2 according to a second embodiment of the present invention. 本発明の第２実施例に係り、符号化の処理単位である各フレームの関係を示す図である。It is a figure which concerns on 2nd Example of this invention and shows the relationship of each flame | frame which is a process unit of an encoding. 図５の復元信号生成部による信号の復元手法を説明するための周波数スペクトル図である。It is a frequency spectrum figure for demonstrating the restoration method of the signal by the restoration signal generation part of FIG. 図５の復元信号生成部による信号の復元手法を説明するための図である。It is a figure for demonstrating the restoration method of the signal by the restoration signal production | generation part of FIG. 本発明の第３実施例に係る、図２の風雑音低減部の内部ブロック図である。It is an internal block diagram of the wind noise reduction part of FIG. 2 based on 3rd Example of this invention. 図９の風雑音低減部と組み合わせて使用可能なＡＡＣエンコーダの内部ブロック図である。FIG. 10 is an internal block diagram of an AAC encoder that can be used in combination with the wind noise reduction unit of FIG. 9. 従来の風雑音低減装置の内部ブロック図である。It is an internal block diagram of the conventional wind noise reduction apparatus. 本発明の第１実施例に係り、第１の変形信号低減処理を説明するための概念図である。FIG. 6 is a conceptual diagram for explaining a first modified signal reduction process according to the first embodiment of the present invention. 本発明の第１実施例に係り、第２の変形信号低減処理を説明するための概念図である。It is a conceptual diagram for demonstrating 2nd deformation | transformation signal reduction processing in 1st Example of this invention.

Explanation of symbols

ＭＩＣ１、ＭＩＣ２マイクロホン
１撮像装置
４音声信号処理部
５記録媒体
６、６ａ、６ｂ、６ｃ風雑音低減部
１１、５２風雑音判定部
５１相関値演算部（風雑音判定部）
１２、５４信号復元部
１３、５３信号低減部 MIC1, MIC2 Microphone 1 Imaging device 4 Audio signal processing unit 5 Recording medium 6, 6a, 6b, 6c Wind noise reduction unit 11, 52 Wind noise determination unit 51 Correlation value calculation unit (wind noise determination unit)
12, 54 Signal restoration unit 13, 53 Signal reduction unit

Claims

In a wind noise reduction device that generates a corrected acoustic signal in which wind noise is reduced from an input acoustic signal,
If the predetermined bandwidth is a higher frequency than the first band single且a first band a predetermined band including a bandwidth of wind noise and a second zone,
The wind noise reduction device is
Based on an acoustic signal in a band having a higher frequency than the first band, included in the input acoustic signal,
The acoustic signal generated by the signal generation means, the signal generation means including signal generation means for generating the first band acoustic signal in which the wind noise is reduced compared to the first band acoustic signal included in the input acoustic signal First correcting means for generating a first corrected acoustic signal based on
Second correcting means for generating a second corrected acoustic signal that is an acoustic signal of the second band in which the wind noise is reduced by reducing a signal level of the acoustic signal of the second band of the input acoustic signal;
A wind noise reduction apparatus comprising: a corrected sound signal output unit that outputs the corrected sound signal based on the first and second corrected sound signals.

The first correcting unit generates the first corrected acoustic signal based on the acoustic signal in the first band included in the input acoustic signal and the acoustic signal generated by the signal generating unit. The wind noise reduction device according to claim 1.

The input acoustic signal consists of a plurality of channel signals,
The wind noise reduction device further includes:
Wind noise determination means for determining an influence degree of the wind noise on the input acoustic signal based on a cross-correlation between different channel signals of a predetermined band component of each channel signal including the wind noise band,
3. The wind noise reduction device according to claim 1, wherein the first correction unit generates the first corrected acoustic signal in accordance with a determination result of the wind noise determination unit.

The input acoustic signal consists of a plurality of channel signals,
The wind noise reduction device further includes:
Wind noise determination means for determining an influence degree of the wind noise on the input acoustic signal based on a cross-correlation between different channel signals of a predetermined band component of each channel signal including the wind noise band,
3. The wind noise reduction device according to claim 1, wherein the second correction unit generates the second corrected acoustic signal in accordance with a determination result of the wind noise determination unit.

The input acoustic signal is given to the wind noise reduction device as a signal on the frequency axis, and the input acoustic signal is composed of a plurality of channel signals,
The second correction means includes
Dividing the second band of the input acoustic signal into a plurality of element bands, and generating the second corrected acoustic signal on the frequency axis by reducing the signal level of the acoustic signal of each element band;
For each of the plurality of element bands, a cross-correlation of the acoustic signals of the element bands between different channel signals is obtained, and a reduction level of the signal level is determined for each element band based on each cross-correlation. The wind noise reduction device according to claim 1 or 2, wherein

The input acoustic signal is given to the wind noise reduction device as a signal on the time axis, and the input acoustic signal is composed of a plurality of channel signals,
The first corrected acoustic signal generated by the first correction unit is a signal on a time axis,
The wind noise reduction device further includes:
Extraction means for extracting a component of a predetermined band not including the first band and including the second band from the input acoustic signal;
Time frequency conversion means for converting the signal format of the synthesized signal of the first corrected acoustic signal and the extraction signal of the extraction means from the time axis to the frequency axis,
The second correction means generates the second corrected acoustic signal on the frequency axis by reducing the signal level of the second band acoustic signal in the synthesized signal on the frequency axis,
The corrected acoustic signal output means includes an acoustic signal including the second corrected acoustic signal on the frequency axis obtained from the second correcting means and the first corrected acoustic signal on the frequency axis obtained from the time-frequency conversion means. The wind noise reduction device according to claim 1 or 2, wherein the corrected acoustic signal on the frequency axis is output based on

The second correction means includes
Dividing the second band of the synthesized signal on the frequency axis into a plurality of element bands, and generating the second corrected acoustic signal on the frequency axis by reducing the signal level of the acoustic signal of each element band;
For each of the plurality of element bands, a cross-correlation of the acoustic signals of the element bands between different channel signals is obtained, and a reduction level of the signal level is determined for each element band based on each cross-correlation. The wind noise reduction device according to claim 6.

The input acoustic signal consists of a plurality of channel signals,
The second correction means includes
Paying attention to all or part of the second band,
By averaging the sound signals of the band of interest for the plurality of channel signals included in the input sound signal, the signal level of the sound signal of the band of interest in the channel that is relatively affected by the wind noise is obtained. Reduce
The wind noise reduction device according to claim 1 or 2, wherein the second corrected acoustic signal is generated from a signal obtained by the averaging.

The input acoustic signal consists of a plurality of channel signals,
The second correction means includes
Paying attention to all or part of the second band,
Among the acoustic signals of the band of interest for the plurality of channel signals included in the input acoustic signal, the acoustic signal having the minimum signal level is specified as the minimum acoustic signal and the other acoustic signals are specified as non-minimum acoustic signals,
By replacing the non-minimum acoustic signal with the minimum acoustic signal, the signal level of the acoustic signal in the band of interest in a channel that is relatively affected by the wind noise is reduced,
The wind noise reduction device according to claim 1 or 2, wherein the second corrected acoustic signal is generated from a signal obtained by the replacement.

The wind noise reduction device according to any one of claims 1 to 9,
A recording device comprising: a microphone for generating the input acoustic signal for the wind noise reduction device.

The wind noise reduction device according to any one of claims 1 to 9,
A microphone for generating the input acoustic signal for the wind noise reduction device;
An imaging apparatus comprising: an imaging unit for acquiring an image.

In a wind noise reduction method for generating a corrected acoustic signal in which wind noise is reduced from an input acoustic signal,
If the predetermined bandwidth is a higher frequency than the first band single且a first band a predetermined band including a bandwidth of wind noise and a second zone,
The wind noise reduction method is
Based on an acoustic signal in a band having a higher frequency than the first band, included in the input acoustic signal,
A signal generation step of generating the first band acoustic signal in which the wind noise is reduced as compared to the first band acoustic signal included in the input acoustic signal;
A first correction step of generating a first corrected acoustic signal based on the acoustic signal generated in the signal generation step;
A second correction step of generating a second corrected acoustic signal that is an acoustic signal of the second band in which the wind noise is reduced by reducing a signal level of the acoustic signal of the second band of the input acoustic signal; Prepared,
A method for reducing wind noise, comprising generating the corrected acoustic signal based on the first and second corrected acoustic signals.