JP6114518B2

JP6114518B2 - Noise reduction device

Info

Publication number: JP6114518B2
Application number: JP2012186602A
Authority: JP
Inventors: 智岐奥
Original assignee: Xacti Corp
Current assignee: Xacti Corp
Priority date: 2012-08-27
Filing date: 2012-08-27
Publication date: 2017-04-12
Anticipated expiration: 2032-08-27
Also published as: JP2014044313A

Description

本発明は、雑音低減装置に関する。 The present invention relates to a noise reduction device.

入力音響信号に重畳した雑音を除去する方法として、スペクトル減算法（Spectral Subtraction）が知られている。図１２に、スペクトル減算法を用いる従来の雑音低減装置９０１のブロック図を示す。雑音低減装置９０１は、予め保持している雑音のスペクトル信号９２０に減算係数αを乗じたものを、音声及び雑音を含む入力音響信号９２１に基づく入力スペクトル信号９２２から減算することで減算スペクトル信号９３１を生成し、減算スペクトル信号９３１を用いて、雑音低減の成された出力スペクトル信号９３３及び出力音響信号９３４を生成する。低減対象の雑音は、例えば、撮像装置におけるレンズの駆動音である。ここで、減算係数αは、雑音を抑制するために大きな値をとる。このため、入力音響信号９２１中の音声と雑音のレベル差等によっては、減算結果がゼロ以下になることもある。このような減算結果をそのまま出力に含めると、出力スペクトルが大きく歪む。歪みは異音（所謂ミュージカルノイズ）につながり、音響信号の聞き手に違和感を覚えさせる。 A spectral subtraction method is known as a method for removing noise superimposed on an input acoustic signal. FIG. 12 shows a block diagram of a conventional noise reduction apparatus 901 using the spectral subtraction method. The noise reduction device 901 subtracts a noise spectrum signal 920 held in advance by a subtraction coefficient α from an input spectrum signal 922 based on an input acoustic signal 921 including speech and noise, thereby subtracting a spectrum signal 931. , And the subtracted spectrum signal 931 is used to generate an output spectrum signal 933 and an output acoustic signal 934 with reduced noise. The noise to be reduced is, for example, lens driving sound in the imaging apparatus. Here, the subtraction coefficient α takes a large value in order to suppress noise. For this reason, depending on the level difference between the voice and noise in the input acoustic signal 921, the subtraction result may become zero or less. If such a subtraction result is included in the output as it is, the output spectrum is greatly distorted. Distortion leads to abnormal noise (so-called musical noise), and makes the listener of the acoustic signal feel uncomfortable.

これを防ぐため、入力スペクトル信号９２２にフロアリング係数β（０．１等）を乗じた信号９３２を生成し、信号９３２の方が信号９３１よりも大きければ、信号９３２を出力（９３３、９３４）に含めるようにする。図１３（ａ）〜（ｃ）に、夫々、信号９２２、９２０及び９３３のスペクトル例を示す。図１３（ｃ）の破線枠９４０内の部分では、入力スペクトル信号９２２に基づく信号９３２が出力スペクトル信号９３３に反映されている。 In order to prevent this, a signal 932 obtained by multiplying the input spectrum signal 922 by a flooring coefficient β (0.1 or the like) is generated, and if the signal 932 is larger than the signal 931, the signal 932 is output (933, 934). To include. FIGS. 13A to 13C show spectrum examples of the signals 922, 920, and 933, respectively. In the portion within the broken line frame 940 in FIG. 13C, the signal 932 based on the input spectrum signal 922 is reflected in the output spectrum signal 933.

尚、撮像装置において、ズーム動作直前の入力音響信号のパワー値を算出し、そのパワー値に基づいてフロアリング係数を決定するという第１従来方法も提案されている（下記特許文献１参照）。また、入力音響信号の集音と同期して雑音信号を集音する装置を用意し、適応フィルタによって雑音を除去する第２従来方法も知られている。当該方法は、一般的にフィードフォワード型ＡＮＣ（Active Noise Control）と呼ばれる。 Note that a first conventional method is also proposed in which an imaging apparatus calculates a power value of an input acoustic signal immediately before a zoom operation and determines a flooring coefficient based on the power value (see Patent Document 1 below). A second conventional method is also known in which a device for collecting a noise signal in synchronization with the sound collection of an input acoustic signal is prepared and noise is removed by an adaptive filter. This method is generally referred to as feedforward ANC (Active Noise Control).

特開２００８−２５２３８９号公報JP 2008-252389 A

フロアリング係数βを用いた信号９３２の利用により歪みは幾分抑制されるのではあるが、その抑制効果が不十分になることもある。例えば、元々の入力音響信号９２１が小さい場合、破線枠９４０内の帯域における出力レベルが異常なまでに低くなり、出力音響信号９３４の聞き手が違和感を覚えることがある。 Although the distortion is somewhat suppressed by using the signal 932 using the flooring coefficient β, the suppression effect may be insufficient. For example, when the original input acoustic signal 921 is small, the output level in the band within the broken line frame 940 becomes low enough to be abnormal, and the listener of the output acoustic signal 934 may feel uncomfortable.

尚、第１従来方法では、フロアリング係数の調整が歪みの抑制に対して或る程度の効果を発揮するかもしれないが、ズーム動作中に入力音響信号のレベルが変動した場合には、その調整が有効に機能しない。また、第２従来方法を用いると、その時々の雑音信号に応じた雑音低減処理が可能になるが、実際の入力音響信号に含まれる雑音成分と雑音信号として集音された雑音成分との間では雑音の伝達経路が異なるため、それらの間に位相ズレ等が発生して適正な雑音低減をできないことがある。 In the first conventional method, the adjustment of the flooring coefficient may have a certain effect on distortion suppression. However, if the level of the input acoustic signal fluctuates during the zoom operation, Adjustment does not work effectively. Further, when the second conventional method is used, noise reduction processing according to the noise signal at that time can be performed, but between the noise component included in the actual input acoustic signal and the noise component collected as the noise signal. However, since the noise transmission paths are different, a phase shift or the like may occur between them, and noise may not be reduced appropriately.

そこで本発明は、雑音低減の適正化に寄与する雑音低減装置、特に例えば、スペクトル減算を用いた雑音低減によって生じうる歪み（聞き手の違和感）を抑制可能な雑音低減装置を提供することを目的とする。 Therefore, the present invention has an object to provide a noise reduction device that contributes to optimization of noise reduction, in particular, a noise reduction device that can suppress distortion (discomfort of the listener) that can be caused by noise reduction using spectral subtraction, for example. To do.

本発明に係る第１の雑音低減装置は、集音部にて収集された入力音響信号に基づく入力スペクトル信号から、雑音のスペクトル信号に所定の係数を乗じて得た信号を減算することで減算スペクトル信号を生成し、前記減算スペクトル信号を用いて出力音響信号を生成する雑音低減装置において、予め取得された暗騒音スペクトル信号を保持する暗騒音信号保持部と、前記減算スペクトル信号及び前記暗騒音スペクトル信号を用いて前記出力音響信号を生成する出力生成部と、を備えたことを特徴とする。 A first noise reduction apparatus according to the present invention subtracts a signal obtained by multiplying a noise spectrum signal by a predetermined coefficient from an input spectrum signal based on an input acoustic signal collected by a sound collection unit. In a noise reduction apparatus that generates a spectrum signal and generates an output acoustic signal using the subtracted spectrum signal, a background noise signal holding unit that holds a background noise spectrum signal acquired in advance, the subtracted spectrum signal, and the background noise An output generation unit configured to generate the output acoustic signal using a spectrum signal.

これにより例えば、出力音響信号において、少なくとも暗騒音スペクトルのレベル程度の信号を確保することができ、出力レベルが異常に低くなるような（聞き手に違和感を覚えさせるような）信号歪みを抑制することが可能となる。 As a result, for example, in the output acoustic signal, at least a signal of the level of the background noise spectrum can be secured, and signal distortion that causes the output level to be abnormally low (such as making the listener feel uncomfortable) is suppressed. Is possible.

具体的には例えば、前記出力生成部は、前記減算スペクトル信号としての第１スペクトル信号、前記入力スペクトル信号に所定の第２係数を乗じて得られる第２スペクトル信号、及び、前記暗騒音スペクトル信号に所定の第３係数を乗じて得られる第３スペクトル信号に基づき、前記出力音響信号を生成すると良い。 Specifically, for example, the output generation unit includes a first spectrum signal as the subtracted spectrum signal, a second spectrum signal obtained by multiplying the input spectrum signal by a predetermined second coefficient, and the background noise spectrum signal. The output acoustic signal may be generated based on a third spectrum signal obtained by multiplying a predetermined third coefficient.

より具体的には例えば、前記出力生成部は、複数の周波数帯域の夫々において、前記第１、第２及び第３スペクトル信号の大きさを比較し、前記第１、第２及び第３スペクトル信号の内、最大の大きさを持つスペクトル信号を用いて前記出力音響信号を生成すると良い。 More specifically, for example, the output generation unit compares the magnitudes of the first, second, and third spectrum signals in each of a plurality of frequency bands, and the first, second, and third spectrum signals. Of these, the output acoustic signal may be generated using a spectrum signal having the maximum magnitude.

また例えば、前記第１、第２及び第３スペクトル信号の大きさの大小関係に基づき、前記雑音のスペクトル信号に乗じられる前記係数としての第１係数、前記第２係数及び前記第３係数の内、少なくとも１つの係数を更新対象係数として更新する係数更新部を、前記第１の雑音低減装置に更に設けてもよい。 Also, for example, based on the magnitude relationship between the magnitudes of the first, second, and third spectral signals, the first coefficient, the second coefficient, and the third coefficient as the coefficients to be multiplied by the noise spectrum signal. A coefficient updating unit that updates at least one coefficient as an update target coefficient may be further provided in the first noise reduction device.

これにより、係数の最適化が可能となり、結果、雑音低減の適正化が図られる。 As a result, the coefficients can be optimized, and as a result, noise reduction can be optimized.

具体的には例えば、前記係数更新部は、第１時刻における前記第１、第２及び第３スペクトル信号の大きさの大小関係に基づき、前記第１時刻よりも後の第２時刻の前記更新対象係数の値が決定されるように、前記更新を行うとよい。 Specifically, for example, the coefficient updating unit updates the second time after the first time based on a magnitude relationship between the magnitudes of the first, second, and third spectrum signals at the first time. The update may be performed so that the value of the target coefficient is determined.

また例えば、前記減算スペクトル信号の大きさと、前記暗騒音スペクトル信号に所定の係数を乗じて得たスペクトル信号の大きさとの大小関係に基づき、前記雑音のスペクトル信号に乗じられる前記係数を更新する係数更新部を、前記第１の雑音低減装置に更に設けてもよい。 Further, for example, based on the magnitude relationship between the magnitude of the subtracted spectrum signal and the magnitude of the spectrum signal obtained by multiplying the background noise spectrum signal by a predetermined coefficient, a coefficient for updating the coefficient multiplied by the noise spectrum signal An update unit may be further provided in the first noise reduction device.

これにより、雑音のスペクトル信号に乗じられる係数の最適化が可能になり、結果、雑音低減の適正化が図られる。 As a result, it is possible to optimize the coefficient to be multiplied by the noise spectrum signal, and as a result, noise reduction is optimized.

また例えば、前記集音部による前記入力音響信号の収集と同期して前記雑音を収集することで前記雑音の時間領域上の音響信号を生成し、その生成音響信号を周波数領域上の信号に変換することで前記雑音のスペクトル信号を生成する信号生成部を、前記第１の雑音低減装置に更に設けてもよい。 Further, for example, the noise is collected in synchronization with the collection of the input acoustic signal by the sound collecting unit to generate an acoustic signal in the time domain of the noise, and the generated acoustic signal is converted into a signal in the frequency domain. Thus, a signal generation unit that generates the noise spectrum signal may be further provided in the first noise reduction device.

これにより、その時々の雑音に適した雑音低減が可能になる。入力音響信号に同期して収集された雑音の音響信号を周波数領域上の信号に変換して、その変換信号（雑音のスペクトル信号）を元に減算スペクトル信号を生成するようにすれば、フィードフォワード型ＡＮＣと異なり、位相に関係なく適正な雑音低減が可能となる。適正な雑音低減は歪みの低減につながる。 Thereby, noise reduction suitable for the noise at that time becomes possible. If the acoustic signal of noise collected in synchronization with the input acoustic signal is converted into a signal on the frequency domain and a subtracted spectrum signal is generated based on the converted signal (noise spectrum signal), feed forward Unlike type ANC, appropriate noise reduction is possible regardless of phase. Proper noise reduction leads to distortion reduction.

本発明に係る第２の雑音低減装置は、集音部にて収集された入力音響信号に基づく入力スペクトル信号から、雑音のスペクトル信号に所定の係数を乗じて得た信号を減算することで減算スペクトル信号を生成し、前記減算スペクトル信号を用いて出力音響信号を生成する雑音低減装置において、前記集音部による前記入力音響信号の収集と同期して前記雑音を収集することで前記雑音の時間領域上の音響信号を生成し、その生成音響信号を周波数領域上の信号に変換することで前記雑音のスペクトル信号を生成する信号生成部を備えたことを特徴とする。 The second noise reduction apparatus according to the present invention subtracts a signal obtained by multiplying a noise spectrum signal by a predetermined coefficient from an input spectrum signal based on the input acoustic signal collected by the sound collection unit. In the noise reduction device that generates a spectrum signal and generates an output acoustic signal using the subtracted spectrum signal, the noise time is acquired by collecting the noise in synchronization with the collection of the input acoustic signal by the sound collection unit. A signal generation unit is provided that generates an acoustic signal on a region and converts the generated acoustic signal into a signal on a frequency region to generate a spectrum signal of the noise.

また例えば、前記減算スペクトル信号としての第１スペクトル信号の大きさと、前記入力スペクトル信号に所定の第２係数を乗じて得られる第２スペクトル信号の大きさとの大小関係に基づき、前記雑音のスペクトル信号に乗じられる前記係数を更新する係数更新部を、前記第２の雑音低減装置に更に設けてもよい。 Further, for example, based on the magnitude relationship between the magnitude of the first spectrum signal as the subtracted spectrum signal and the magnitude of the second spectrum signal obtained by multiplying the input spectrum signal by a predetermined second coefficient, the spectrum signal of the noise A coefficient updating unit that updates the coefficient multiplied by may be further provided in the second noise reduction device.

本発明によれば、雑音低減の適正化に寄与する雑音低減装置、特に例えば、スペクトル減算を用いた雑音低減によって生じうる歪み（聞き手の違和感）を抑制可能な雑音低減装置を提供することが可能である。 ADVANTAGE OF THE INVENTION According to this invention, it is possible to provide the noise reduction apparatus which contributes to optimization of noise reduction, especially the noise reduction apparatus which can suppress the distortion (listener's uncomfortable feeling) which may arise by noise reduction using spectrum subtraction, for example. It is.

本発明の第１実施形態に係る雑音低減装置及び雑音低減装置に接続される部位を示す図である。It is a figure which shows the site | part connected to the noise reduction apparatus which concerns on 1st Embodiment of this invention, and a noise reduction apparatus. 雑音低減装置を備えた電子機器を示す図である。It is a figure which shows the electronic device provided with the noise reduction apparatus. 雑音低減装置内の３つのスペクトル信号の構成を示す図である。It is a figure which shows the structure of three spectrum signals in a noise reduction apparatus. 雑音低減装置における出力スペクトル信号の構成を示す図である。It is a figure which shows the structure of the output spectrum signal in a noise reduction apparatus. 雑音低減装置内の２つのスペクトル信号の例を示す図である。It is a figure which shows the example of the two spectrum signals in a noise reduction apparatus. 暗騒音スペクトル信号の存在を無視した場合において、図５（ａ）及び（ｂ）の２つのスペクトル信号に対しスペクトル減算処理を適用して得られる結果信号を示す図である。It is a figure which shows the result signal obtained by applying a spectrum subtraction process with respect to two spectrum signals of Fig.5 (a) and (b), when the presence of a background noise spectrum signal is disregarded. 雑音低減装置内の暗騒音スペクトル信号の例を示す図である。It is a figure which shows the example of the background noise spectrum signal in a noise reduction apparatus. 図５（ａ）及び（ｂ）並びに図７の３つのスペクトル信号に基づく、出力スペクトル信号の例を示す図である。It is a figure which shows the example of an output spectrum signal based on three spectrum signals of Fig.5 (a) and (b) and FIG. ２つのフレームの時間的関係を示す図である。It is a figure which shows the temporal relationship of two frames. 本発明の第２実施形態に係る雑音低減装置及び雑音低減装置に接続される部位を示す図である。It is a figure which shows the site | part connected to the noise reduction apparatus which concerns on 2nd Embodiment of this invention, and a noise reduction apparatus. 図１０の非目的音信号生成部の内部構成例を示す図である。It is a figure which shows the internal structural example of the non-target sound signal generation part of FIG. 従来の雑音低減装置の内部ブロック図である。It is an internal block diagram of the conventional noise reduction apparatus. 図１２の雑音低減装置における入力、雑音及び出力スペクトル信号を示す図である。It is a figure which shows the input in the noise reduction apparatus of FIG. 12, a noise, and an output spectrum signal.

以下、本発明の実施形態の例を、図面を参照して具体的に説明する。参照される各図において、同一の部分には同一の符号を付し、同一の部分に関する重複する説明を原則として省略する。尚、本明細書では、記述の簡略化上、情報、信号、物理量、状態量又は部材等を参照する記号又は符号を記すことによって該記号又は符号に対応する情報、信号、物理量、状態量又は部材等の名称を省略又は略記することがある。 Hereinafter, an example of an embodiment of the present invention will be specifically described with reference to the drawings. In each of the drawings to be referred to, the same part is denoted by the same reference numeral, and redundant description regarding the same part is omitted in principle. In this specification, for simplification of description, a symbol or reference that refers to information, signal, physical quantity, state quantity, member, or the like is written to indicate information, signal, physical quantity, state quantity or Names of members and the like may be omitted or abbreviated.

＜＜第１実施形態＞＞
図１に、本発明の第１実施形態に係る雑音低減装置１と、雑音低減装置１に接続される部位を示す。図１の雑音低減装置１は、符号１１〜２０によって参照される各部位を備える。雑音低減装置１は、集音部２及び前処理部３と共に、集音機能を備えた任意の電子機器１００に搭載される（図２参照）。電子機器１００は、例えば、集音機能を備えた撮像装置又はボイスレコーダ等の集音装置を含む。撮像装置は、静止画像及び動画像を撮影及び記録可能なデジタルビデオカメラ、又は、静止画像のみを撮影及び記録可能なデジタルスチルカメラである。電子機器１００としての撮像装置は、固体撮像素子、被写体像を固体撮像素子に結像させるためのフォーカスレンズ及び撮影画角を調節するためのズームレンズを備えると共に、フォーカスレンズ、ズームレンズ又は任意の素子を移動させるためのモータを備える（何れも不図示）。雑音低減装置１は、音響信号中の雑音を低減する機能を持つが、低減対象の雑音として、例えば、上記フォーカスレンズ又はズームレンズの移動に伴う駆動音や、上記モータの駆動音が想定される。 << First Embodiment >>
In FIG. 1, the noise reduction apparatus 1 which concerns on 1st Embodiment of this invention, and the site | part connected to the noise reduction apparatus 1 are shown. The noise reduction apparatus 1 of FIG. 1 includes each part referred to by reference numerals 11 to 20. The noise reduction apparatus 1 is mounted on an arbitrary electronic device 100 having a sound collection function together with the sound collection unit 2 and the preprocessing unit 3 (see FIG. 2). The electronic device 100 includes, for example, a sound collection device such as an imaging device or a voice recorder having a sound collection function. The imaging device is a digital video camera that can capture and record still images and moving images, or a digital still camera that can capture and record only still images. An imaging apparatus as the electronic apparatus 100 includes a solid-state imaging device, a focus lens for forming a subject image on the solid-state imaging device, and a zoom lens for adjusting a shooting angle of view, and a focus lens, a zoom lens, or an arbitrary lens A motor for moving the element is provided (both not shown). The noise reduction device 1 has a function of reducing noise in an acoustic signal. As noise to be reduced, for example, driving sound accompanying movement of the focus lens or zoom lens and driving sound of the motor are assumed. .

集音部２は、１以上のマイクロホンから成る。集音部２は、目的音を含む電子機器１００の周辺音を集音し、集音した音をアナログ音響信号に変換して出力する。前処理部３は、集音部２の出力信号に所定の信号処理を施すことで時間領域上の音響信号である入力音響信号Ｓ_IN［ｔ］を生成する。当該信号処理は、集音部２から出力されるアナログ音響信号をデジタル音響信号に変換するＡ／Ｄ変換処理、及び、該デジタル音響信号を所定時間長を有するフレーム単位で分割するフレーム分割処理などを含む。 The sound collection unit 2 is composed of one or more microphones. The sound collection unit 2 collects the peripheral sound of the electronic device 100 including the target sound, converts the collected sound into an analog sound signal, and outputs the analog sound signal. The preprocessing unit 3 performs predetermined signal processing on the output signal of the sound collection unit 2 to generate an input acoustic signal S _IN [t] that is an acoustic signal in the time domain. The signal processing includes an A / D conversion process for converting an analog sound signal output from the sound collection unit 2 into a digital sound signal, a frame division process for dividing the digital sound signal into frame units having a predetermined time length, and the like. including.

ＦＦＴ部１１は、フレーム単位で分割された入力音響信号Ｓ_IN［ｔ］に対して、フレームごとに、時間／周波数変換の一種である離散フーリエ変換を行うことで、入力スペクトル信号Ｓ_IN［ｆ］を生成する。入力スペクトル信号Ｓ_IN［ｆ］は、入力音響信号Ｓ_IN［ｔ］を周波数領域上の信号に変換したものである。 The FFT unit 11 performs a discrete Fourier transform, which is a type of time / frequency conversion, on the input acoustic signal S _IN [t] divided in units of frames for each frame, whereby the input spectrum signal S _IN [f ] Is generated. The input spectrum signal S _IN [f] is obtained by converting the input acoustic signal S _IN [t] into a signal on the frequency domain.

入力音響信号Ｓ_IN［ｔ］及び入力スペクトル信号Ｓ_IN［ｆ］は、集音部２によって集音されるべき目的音の音響信号を含んでいると共に、目的音とは異なる非目的音の音響信号も含んでいる。
目的音は、人間の音声、動物の鳴き声、音楽など、集音部２の集音対象の音である。入力音響信号Ｓ_IN［ｔ］及び入力スペクトル信号Ｓ_IN［ｆ］に含まれる音の信号成分の内、非目的音の信号成分以外の信号成分は、目的音の信号成分であると考えてもよい。
非目的音は、目的音にとっての雑音の一種であり、雑音低減装置１において低減されるべき低減対象雑音（特定雑音）である。非目的音は、特定の雑音源の発生音であっても良い（この場合、特定の雑音源は非目的音の発生源であると言える）。特定の雑音源の発生音は、例えば、上記フォーカスレンズ又はズームレンズの移動に伴う駆動音や上記モータの駆動音である。 The input sound signal S _IN [t] and the input spectrum signal S _IN [f] include the sound signal of the target sound to be collected by the sound collecting unit 2 and the sound of the non-target sound different from the target sound. It also includes a signal.
The target sound is a sound to be collected by the sound collection unit 2, such as a human voice, an animal call, or music. Of the sound signal components included in the input acoustic signal S _IN [t] and the input spectrum signal S _IN [f], signal components other than the non-target sound signal components may be considered to be the target sound signal components. Good.
The non-target sound is a kind of noise for the target sound, and is reduction target noise (specific noise) to be reduced in the noise reduction device 1. The non-target sound may be a sound generated from a specific noise source (in this case, it can be said that the specific noise source is a source of non-target sound). The sound generated by the specific noise source is, for example, a driving sound accompanying the movement of the focus lens or the zoom lens or a driving sound of the motor.

尚、集音部２の出力音響信号、上記デジタル音響信号、入力音響信号Ｓ_IN［ｔ］又は入力スペクトル信号Ｓ_IN［ｆ］は、それらの何れかを記録する記録媒体（不図示）を介して、電子機器１００又は雑音低減装置１に提供されても良い。即ち例えば、前処理部３にて生成された入力音響信号Ｓ_IN［ｔ］を図示されない記録媒体に記録しておき、その後の任意のタイミングにおいて当該入力音響信号Ｓ_IN［ｔ］が雑音低減装置１に提供されても良い。 Note that the output sound signal of the sound collection unit 2, the digital sound signal, the input sound signal S _IN [t], or the input spectrum signal S _IN [f] is passed through a recording medium (not shown) for recording any of them. The electronic device 100 or the noise reduction device 1 may be provided. That is, for example, the input acoustic signal S _IN [t] generated by the preprocessing unit 3 is recorded on a recording medium (not shown), and the input acoustic signal S _IN [t] is converted into a noise reduction device at an arbitrary timing thereafter. 1 may be provided.

非目的音信号保持部１２は、非目的音に対応するスペクトル信号Ｓ_A［ｆ］を保持するメモリである。本実施形態では、入力音響信号Ｓ_IN［ｔ］が雑音低減装置１に入力されるのに先立ち、スペクトル信号Ｓ_A［ｆ］が予め用意されて保持部１２に保持されているものとする。例えば、非目的音（上記フォーカスレンズ等の駆動音）の音響信号を予め取得しておき、その取得した音響信号に対して離散フーリエ変換を施すことでスペクトル信号Ｓ_A［ｆ］を予め得ておくことができる。非目的音の音響信号及びスペクトル信号Ｓ_A［ｆ］の取得のために、集音部２が用いられても良いし、集音部２と異なる集音装置（マイクロホン）が用いられても良い。或いは、非目的音の発生源の振動の成分を計測部（加速度センサ等）を用いて計測し、その計測結果から、非目的音の音響信号及びスペクトル信号Ｓ_A［ｆ］が取得されても良い。 The non-target sound signal holding unit 12 is a memory that holds the spectrum signal S _A [f] corresponding to the non-target sound. In the present embodiment, it is assumed that the spectrum signal S _A [f] is prepared in advance and held in the holding unit 12 before the input acoustic signal S _IN [t] is input to the noise reduction device 1. For example, an acoustic signal of a non-target sound (driving sound of the focus lens or the like) is acquired in advance, and a spectrum signal S _A [f] is acquired in advance by performing discrete Fourier transform on the acquired acoustic signal. I can leave. In order to acquire the acoustic signal of the non-target sound and the spectrum signal S _A [f], the sound collecting unit 2 may be used, or a sound collecting device (microphone) different from the sound collecting unit 2 may be used. . Alternatively, the vibration component of the non-target sound source is measured using a measurement unit (acceleration sensor or the like), and the non-target sound signal and the spectrum signal S _A [f] are acquired from the measurement result. good.

乗算部１３は、スペクトル信号Ｓ_A［ｆ］に対して所定の係数αを乗じることでスペクトル信号Ｓ_A’［ｆ］を生成する。減算部１４は、スペクトル信号Ｓ_A’［ｆ］を入力音響信号Ｓ_IN［ｔ］から減算することにより、減算スペクトル信号とも言うべきスペクトル信号Ｓ₁［ｆ］を生成する。乗算部１５は、入力スペクトル信号Ｓ_IN［ｆ］に対して所定の係数βを乗じることでスペクトル信号Ｓ₂［ｆ］を生成する。係数αは減算係数とも呼ばれ、係数βはフロアリング係数とも呼ばれる。 The multiplier 13 generates a spectrum signal S _A ′ [f] by multiplying the spectrum signal S _A [f] by a predetermined coefficient α. The subtractor 14 subtracts the spectrum signal S _A ′ [f] from the input acoustic signal S _IN [t], thereby generating a spectrum signal S ₁ [f] that can be called a subtracted spectrum signal. The multiplier 15 generates a spectrum signal S ₂ [f] by multiplying the input spectrum signal S _IN [f] by a predetermined coefficient β. The coefficient α is also called a subtraction coefficient, and the coefficient β is also called a flooring coefficient.

暗騒音信号保持部１６は、暗騒音に対応するスペクトル信号Ｓ_B［ｆ］を保持するメモリである。暗騒音も雑音の一種であるが、本実施形態で想定される暗騒音は、非目的音とは異なる。暗騒音は、目的音及び非目的音が無いときに集音部２にて集音されることになる雑音を指し、背景雑音又は背景音とも呼ばれる。例えば、雑音低減装置１の実稼働に先立ち、所定の集音環境において暗騒音の音響信号を取得し、取得した暗騒音の音響信号に対して離散フーリエ変換を施すことでスペクトル信号Ｓ_B［ｆ］を予め得ておくことができる。所定の集音環境とは、例えば、集音部２及び電子機器１にとって十分に静かな環境である。暗騒音の音響信号及びスペクトル信号Ｓ_B［ｆ］の取得のために、集音部２が用いられる。 The background noise signal holding unit 16 is a memory that holds the spectrum signal S _B [f] corresponding to the background noise. Although the background noise is a kind of noise, the background noise assumed in the present embodiment is different from the non-target sound. Background noise refers to noise that is collected by the sound collection unit 2 when there is no target sound and non-target sound, and is also referred to as background noise or background sound. For example, prior to actual operation of the noise reduction apparatus 1, an acoustic signal of background noise is acquired in a predetermined sound collection environment, and the spectrum signal S _B [f is obtained by performing discrete Fourier transform on the acquired acoustic signal of background noise. ] Can be obtained in advance. The predetermined sound collection environment is, for example, a sufficiently quiet environment for the sound collection unit 2 and the electronic device 1. The sound collection unit 2 is used to acquire the acoustic signal of the background noise and the spectrum signal S _B [f].

乗算部１７は、暗騒音スペクトル信号であるスペクトル信号Ｓ_B［ｆ］に対して、所定の係数（暗騒音用係数）γを乗じることでスペクトル信号Ｓ₃［ｆ］を生成する。 The multiplier 17 multiplies the spectrum signal S _B [f], which is a background noise spectrum signal, by a predetermined coefficient (background noise coefficient) γ to generate the spectrum signal S ₃ [f].

上述の説明から理解されるように、スペクトル信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］は、下記式（Ａ１）〜（Ａ３）を満たす。
Ｓ₁［ｆ］＝Ｓ_IN［ｆ］−Ｓ_A［ｆ］×α …（Ａ１）
Ｓ₂［ｆ］＝Ｓ_IN［ｆ］×β …（Ａ２）
Ｓ₃［ｆ］＝Ｓ_B［ｆ］×γ …（Ａ３） As understood from the above description, the spectrum signals S ₁ [f] to S ₃ [f] satisfy the following expressions (A1) to (A3).
S ₁ [f] = S _IN [f] −S _A [f] × α (A1)
S ₂ [f] = S _IN [f] × β (A2)
S ₃ [f] = S _B [f] × γ (A3)

本実施形態においてスペクトル信号とは周波数領域上の音響信号を指し、任意のスペクトル信号において周波数の標本間隔はΔｆである。任意のスペクトル信号は、周波数番号０の信号成分から周波数番号（Ｍ−１）の信号成分までの計Ｍ個の信号成分より成る。Ｍは、２以上の整数であり、例えば１２８である。記述の明確化上、周波数の標本間隔Δｆで分割された１つ１つの周波数帯域を単位帯域と呼ぶ。そうすると、任意のスペクトル信号の全周波数帯域はＭ個の単位帯域から形成される（Ｍ個の単位帯域にて分割される）。
第ｍ単位帯域は、（ｍ×Δｆ）の周波数から（（ｍ＋１）×Δｆ）の周波数までの周波数帯域を指す（ｍは整数）。 In this embodiment, the spectrum signal refers to an acoustic signal in the frequency domain, and the frequency sampling interval is Δf in an arbitrary spectrum signal. An arbitrary spectrum signal is composed of a total of M signal components from a signal component with frequency number 0 to a signal component with frequency number (M−1). M is an integer greater than or equal to 2, for example 128. For clarity of description, each frequency band divided by the frequency sampling interval Δf is called a unit band. Then, the entire frequency band of an arbitrary spectrum signal is formed from M unit bands (divided by M unit bands).
The m-th unit band refers to a frequency band from a frequency of (m × Δf) to a frequency of ((m + 1) × Δf) (m is an integer).

従って、図３（ａ）〜（ｃ）に示す如く、入力スペクトル信号Ｓ_IN［ｆ］は、信号成分Ｓ_IN［０・Δｆ］〜Ｓ_IN［Ｍ−１・Δｆ］から成り、非目的音のスペクトル信号Ｓ_A［ｆ］は、信号成分Ｓ_A［０・Δｆ］〜Ｓ_A［Ｍ−１・Δｆ］から成り、暗騒音のスペクトル信号Ｓ_B［ｆ］は、信号成分Ｓ_B［０・Δｆ］〜Ｓ_B［Ｍ−１・Δｆ］から成る。信号成分Ｓ_IN［ｍ・Δｆ］、Ｓ_A［ｍ・Δｆ］、Ｓ_B［ｍ・Δｆ］は、夫々、スペクトル信号Ｓ_IN［ｆ］、Ｓ_A［ｆ］、Ｓ_B［ｆ］に含まれる、第ｍ単位帯域内の信号成分を示す。 Therefore, as shown in FIGS. 3A to 3C, the input spectrum signal S _IN [f] is composed of signal components S _IN [0 · Δf] to S _IN [M−1 · Δf], Spectrum signal S _A [f] is composed of signal components S _A [0 · Δf] to S _A [M−1 · Δf], and background noise spectrum signal S _B [f] is signal component S _B [0]. Δf] to S _B [M−1 · Δf]. The signal components S _IN [m · Δf], S _A [m · Δf], and S _B [m · Δf] are included in the spectrum signals S _IN [f], S _A [f], and S _B [f], respectively. The signal component in the mth unit band is shown.

乗算部１３、１５及び１７の各乗算並びに減算部１４の減算は、単位帯域ごとに行われるため、上記式（Ａ１）〜（Ａ３）に対応する下記式（Ａ４）〜（Ａ６）が成立する。ｉ＝１、ｉ＝２又はｉ＝３とした場合、スペクトル信号Ｓ₁［ｆ］、Ｓ₂［ｆ］又はＳ₃［ｆ］であるスペクトル信号Ｓ_i［ｆ］は、信号成分Ｓ_i［０・Δｆ］〜Ｓ_i［Ｍ−１・Δｆ］から成り、信号成分Ｓ_i［ｍ・Δｆ］は、スペクトル信号Ｓ_i［ｆ］に含まれる、第ｍ単位帯域内の信号成分を示す。
Ｓ₁［ｍ・Δｆ］＝Ｓ_IN［ｍ・Δｆ］−Ｓ_A［ｍ・Δｆ］×α …（Ａ４）
Ｓ₂［ｍ・Δｆ］＝Ｓ_IN［ｍ・Δｆ］×β …（Ａ５）
Ｓ₃［ｍ・Δｆ］＝Ｓ_B［ｍ・Δｆ］×γ …（Ａ６） Since the multiplications of the multiplication units 13, 15 and 17 and the subtraction of the subtraction unit 14 are performed for each unit band, the following equations (A4) to (A6) corresponding to the above equations (A1) to (A3) are established. . When i = 1, i = 2, or i = 3, the spectrum signal S _i [f], which is the spectrum signal S ₁ [f], S ₂ [f], or S ₃ [f], is a signal component S _i [ 0 · Δf] to S _i [M−1 · Δf], and the signal component S _i [m · Δf] indicates a signal component in the m-th unit band included in the spectrum signal S _i [f].
S ₁ [m · Δf] = S _IN [m · Δf] −S _A [m · Δf] × α (A4)
S ₂ [m · Δf] = S _IN [m · Δf] × β (A5)
S ₃ [m · Δf] = S _B [m · Δf] × γ (A6)

比較処理部１８は、スペクトル信号Ｓ₁［ｆ］、Ｓ₂［ｆ］及びＳ₃［ｆ］に基づき、それらの信号レベルの比較を介して、出力スペクトル信号Ｓ_OUT［ｆ］を生成する。図４に示す如く、出力スペクトル信号Ｓ_OUT［ｆ］は、信号成分Ｓ_OUT［０・Δｆ］〜Ｓ_OUT［Ｍ−１・Δｆ］から成り、信号成分Ｓ_OUT［ｍ・Δｆ］は、出力スペクトル信号Ｓ_OUT［ｆ］に含まれる、第ｍ単位帯域内の信号成分を示す。 Based on the spectrum signals S ₁ [f], S ₂ [f] and S ₃ [f], the comparison processing unit 18 generates an output spectrum signal S _OUT [f] through comparison of the signal levels. As shown in FIG. 4, the output spectrum signal S _OUT [f] is composed of signal components S _OUT [0 · Δf] to S _OUT [M−1 · Δf], and the signal component S _OUT [m · Δf] is output. The signal component in the m-th unit band included in the spectrum signal S _OUT [f] is shown.

ＩＦＦＴ１９部は、周波数／時間変換の一種である逆離散フーリエ変換を用いて、出力スペクトル信号Ｓ_OUT［ｆ］を時間領域上の出力音響信号Ｓ_OUT［ｔ］に変換する。出力音響信号Ｓ_OUT［ｔ］は、入力音響信号Ｓ_IN［ｔ］に含まれる雑音（非目的音を含む）が抑制された音響信号として、雑音低減装置１から出力される。係数設定部２０は、係数α、β及びγの値を設定する。係数α、β及びγは、“０＜α”、“０＜β＜１”及び“０＜γ＜１”を満たす。係数設定部２０は、係数更新部としての機能も備えているが、その機能については後述する。 The IFFT 19 unit converts the output spectrum signal S _OUT [f] into an output acoustic signal S _OUT [t] in the time domain using inverse discrete Fourier transform, which is a type of frequency / time conversion. The output acoustic signal S _OUT [t] is output from the noise reduction device 1 as an acoustic signal in which noise (including non-target sounds) included in the input acoustic signal S _IN [t] is suppressed. The coefficient setting unit 20 sets the values of the coefficients α, β, and γ. The coefficients α, β, and γ satisfy “0 <α”, “0 <β <1”, and “0 <γ <1”. The coefficient setting unit 20 also has a function as a coefficient update unit, which will be described later.

今、スペクトル信号Ｓ_IN［ｆ］及びＳ_A［ｆ］が、夫々、図５（ａ）及び（ｂ）のスペクトル信号３００_IN［ｆ］及び３００_A［ｆ］であった場合を考える。尚、便宜上、図５（ａ）及び（ｂ）並びに後述の図６〜図８では、離散化された信号成分をつなぎ合わせることで、図示すべき各スペクトル信号を連続した曲線として表現している。また、信号３００_IN［ｆ］及び３００_A［ｆ］を参照した説明文では、説明の簡略化及び便宜上、α＝β＝１と仮定する。 Consider the case where the spectral signals S _IN [f] and S _A [f] are the spectral signals 300 _IN [f] and 300 _A [f] in FIGS. 5 (a) and 5 (b), respectively. For convenience, in FIGS. 5A and 5B and FIGS. 6 to 8 described later, the spectrum signals to be illustrated are represented as continuous curves by connecting the discretized signal components. . Also, in the explanatory text referring to the signals 300 _IN [f] and 300 _A [f], it is assumed that α = β = 1 for the sake of simplification and convenience.

スペクトル減算法における雑音低減が有効に機能するためには、信号３００_A［ｆ］の信号レベルが信号３００_IN［ｆ］の信号レベルより大きくなっている必要がある。しかしながら、破線枠３０１で囲まれた周波数帯域（以下、周波数帯域３０１という）において、信号３００_A［ｆ］の信号レベルが信号３００_IN［ｆ］の信号レベルよりも小さくなっている（これを、便宜上、逆転現象と呼ぶ）。 In order for noise reduction in the spectral subtraction method to function effectively, the signal level of the signal 300 _A [f] needs to be higher than the signal level of the signal 300 _IN [f]. However, the signal level of the signal 300 _A [f] is lower than the signal level of the signal 300 _IN [f] in the frequency band surrounded by the broken line frame 301 (hereinafter referred to as the frequency band 301) ( For convenience, this is called the reverse phenomenon).

スペクトル信号３００_A［ｆ］は非目的音を評価することで予め用意されている一方で、入力スペクトル信号３００_IN［ｆ］は目的音と非目的音の音響信号を足し合わせたものに相当するため、本来的には、信号３００_IN［ｆ］の信号レベルは信号３００_A［ｆ］の信号レベルよりも大きいはずである。
しかしながら、その時々で発生する非目的音は時系列上でばらつき、結果、注目タイミングの非目的音（即ち入力音響信号中の非目的音）と上記評価のタイミングの非目的音（即ち保持部１２に保持された非目的音）との差が大きくなることもある。その差の増大は上記逆転現象を発生させる。特に例えば、非目的音が撮像装置のレンズの駆動による雑音等である場合には、レンズユニットの経年変化により、その時々で発生する非目的音は時系列上でばらつく。
また、非目的音の評価を介して得たスペクトル信号３００_A［ｆ］を、複数の電子機器１００に共通して適用した場合、電子機器１００の個体差ばらつきが影響して上記逆転現象が発生することもある（共通適用されたスペクトル信号３００_A［ｆ］は、或る電子機器１００にとっては最適であるが、或る他の電子機器１００にとっては大きすぎることもある）。特に例えば、非目的音が撮像装置のレンズの駆動による雑音等である場合には、レンズユニットの個体差ばらつきが上記逆転現象の発生を招きうる。
また、目的音と非目的音の位相関係に依存して局所的な周波数帯域において入力音響信号のパワーが相当に小さくなることもあり、そのような場合にも上記逆転現象が生じる。 While the spectrum signal 300 _A [f] is prepared in advance by evaluating the non-target sound, the input spectrum signal 300 _IN [f] corresponds to the sum of the acoustic signals of the target sound and the non-target sound. Therefore, originally, the signal level of the signal 300 _IN [f] should be larger than the signal level of the signal 300 _A [f].
However, the non-target sound generated at that time varies in time series. As a result, the non-target sound at the timing of interest (that is, the non-target sound in the input acoustic signal) and the non-target sound at the timing of the evaluation (that is, the holding unit 12). The difference from the non-target sound held in the sound may increase. The increase in the difference causes the above reversal phenomenon. In particular, for example, when the non-target sound is noise or the like due to the driving of the lens of the imaging apparatus, the non-target sound generated at that time varies over time due to the secular change of the lens unit.
Further, when the spectrum signal 300 _A [f] obtained through the evaluation of the non-target sound is commonly applied to a plurality of electronic devices 100, the above inversion phenomenon occurs due to the influence of individual differences in the electronic devices 100. (The commonly applied spectral signal 300 _A [f] is optimal for one electronic device 100, but may be too large for some other electronic device 100). In particular, for example, when the non-target sound is noise or the like due to the driving of the lens of the imaging device, the individual difference variation of the lens unit may cause the reverse phenomenon.
In addition, depending on the phase relationship between the target sound and the non-target sound, the power of the input acoustic signal may be considerably reduced in the local frequency band. In such a case, the reverse phenomenon occurs.

スペクトル信号３００_IN［ｆ］及び３００_A［ｆ］に対して、仮に、図１２の構成に対応する従来のスペクトル減算処理を適用した場合には、図６に示すような結果信号３１０_OUT［ｆ］がスペクトル減算処理の結果信号として得られることになる（信号３１０_OUT［ｆ］は、便宜上、暗騒音スペクトル信号が無いと仮定した場合に得られる信号Ｓ_OUT［ｆ］に相当する）。当該周波数帯域３０１において、スペクトル信号３００_A［ｆ］に減算係数αを乗じて得た信号がスペクトル信号３００_IN［ｆ］にフロアリング係数βを乗じて得た信号よりも小さい場合、前者の信号（３００_A［ｆ］×α）の代わりに後者の信号（３００_IN［ｆ］×β）が結果信号３１０_OUT［ｆ］に含められる。これにより、周波数帯域３０１において、前者の信号（３００_A［ｆ］×α）を結果信号３１０_OUT［ｆ］に含める場合よりは、結果信号３１０_OUT［ｆ］の歪み（歪みは異音につながり、音響信号の聞き手に違和感を覚えさせる）が幾分抑制されはするが、その抑制効果が不十分になることもある。 If the conventional spectral subtraction processing corresponding to the configuration of FIG. 12 is applied to the spectral signals 300 _IN [f] and 300 _A [f], the result signal 310 _OUT [f] as shown in FIG. ] Is obtained as a result signal of the spectral subtraction process (the signal 310 _OUT [f] corresponds to the signal S _OUT [f] obtained when it is assumed that there is no background noise spectrum signal for convenience). In the frequency band 301, when the signal obtained by multiplying the spectrum signal 300 _A [f] by the subtraction coefficient α is smaller than the signal obtained by multiplying the spectrum signal 300 _IN [f] by the flooring coefficient β, the former signal Instead of (300 _A [f] × α), the latter signal (300 _IN [f] × β) is included in the result signal 310 _OUT [f]. Thus, in the frequency band 301, than to include the former signal _{(300 A [f] × α} ) results in signal 310 _OUT [f] is, the distortion of the result signal 310 _OUT [f] (distortion leads to abnormal noise Although the sound signal listener feels uncomfortable, the suppression effect may be insufficient.

例えば、元々の入力音響信号Ｓ_IN［ｔ］が小さい場合、周波数領域３０１において、結果信号３１０_OUT［ｆ］の信号レベルが暗騒音の信号レベルよりも小さくなることがある。暗騒音は、目的音及び非目的音が無いときに入力音響信号Ｓ_IN［ｔ］に含まれる背景雑音であるため、入力音響信号の補正信号であるべき結果信号３１０_OUT［ｆ］の信号レベルが暗騒音の信号レベルよりも小さくなることは、本来的に不自然であり、結果信号３１０_OUT［ｆ］の信号レベルが暗騒音の信号レベルよりも小さくなるような結果信号３１０_OUT［ｆ］の歪みは、結果信号３１０_OUT［ｆ］の音響信号の聞き手に違和感を覚えさせる。 For example, when the original input acoustic signal S _IN [t] is small, the signal level of the result signal 310 _OUT [f] may be lower than the signal level of background noise in the frequency domain 301. The background noise is background noise included in the input sound signal S _IN [t] when there is no target sound and non-target sound, and thus the signal level of the result signal 310 _OUT [f] that should be a correction signal of the input sound signal Is inherently unnatural and the result signal 310 _OUT [f] is such that the signal level of the result signal 310 _OUT [f] is smaller than the signal level of the background noise. This makes the listener of the acoustic signal of the result signal 310 _OUT [f] feel uncomfortable.

これらを考慮し、第１実施形態では、上述の如く暗騒音信号保持部１６を設け、暗騒音のスペクトル信号Ｓ_B［ｆ］をも用いて出力スペクトル信号Ｓ_OUT［ｆ］を生成する。このため、比較処理部１８は、スペクトル信号Ｓ₁［ｆ］及びＳ₂［ｆ］に加えて、暗騒音のスペクトル信号Ｓ_B［ｆ］に基づくスペクトル信号Ｓ₃［ｆ］をも参照し、スペクトル信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の信号レベルの大小関係に基づき、出力スペクトル信号Ｓ_OUT［ｆ］を生成する。より具体的には、比較処理部１８は、単位帯域ごとに、スペクトル信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の信号レベルを比較し、スペクトル信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の内、最大の信号レベルを持つスペクトル信号を出力スペクトル信号Ｓ_OUT［ｆ］に含めるようにする。 Considering these, in the first embodiment, the background noise signal holding unit 16 is provided as described above, and the output spectrum signal S _OUT [f] is generated also using the background noise spectrum signal S _B [f]. Therefore, the comparison processing unit 18 refers to the spectrum signal S ₃ [f] based on the background signal S _B [f] in addition to the spectrum signals S ₁ [f] and S ₂ [f], An output spectrum signal S _OUT [f] is generated based on the magnitude relationship between the signal levels of the spectrum signals S ₁ [f] to S ₃ [f]. More specifically, the comparison processing unit 18 compares the signal levels of the spectrum signals S ₁ [f] to S ₃ [f] for each unit band, and the spectrum signals S ₁ [f] to S ₃ [f]. Of these, the spectrum signal having the maximum signal level is included in the output spectrum signal S _OUT [f].

スペクトル信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の内、スペクトル信号Ｓ₁［ｆ］の信号レベルが最大となる条件を第１条件と呼び、スペクトル信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の内、スペクトル信号Ｓ₂［ｆ］の信号レベルが最大となる条件を第２条件と呼び、スペクトル信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の内、スペクトル信号Ｓ₃［ｆ］の信号レベルが最大となる条件を第３条件と呼ぶ。 Among spectrum signal _{_{S 1 [f] ~S 3 [}} f], the condition signal level of the spectral signal S ₁ [f] is maximized is referred to as a first condition, the spectrum signal _{_{S 1 [f] ~S 3 [}} f ], The condition where the signal level of the spectrum signal S ₂ [f] is maximized is called a second condition. Of the spectrum signals S ₁ [f] to S ₃ [f], the spectrum signal S ₃ [f] The condition that maximizes the signal level is called the third condition.

第ｍ単位帯域について、第１条件が成立するときには下記式（Ａ７）に従って、第２条件が成立するときには下記式（Ａ８）に従って、第３条件が成立するときには下記式（Ａ９）に従って、出力スペクトル信号Ｓ_OUT［ｆ］の信号成分Ｓ_OUT［ｍ・Δｆ］が生成され、比較処理部１８は、この生成処理を単位帯域ごとに実行する。第ｍ単位帯域について第ｉ条件が成立するとは、信号成分Ｓ₁［ｍ・Δｆ］、Ｓ₂［ｍ・Δｆ］及びＳ₃［ｍ・Δｆ］の内、信号成分Ｓ_i［ｍ・Δｆ］の信号レベルが最大であることを指す（ここで、ｉは１、２又は３）。
Ｓ_OUT［ｍ・Δｆ］＝Ｓ₁［ｍ・Δｆ］ …（Ａ７）
Ｓ_OUT［ｍ・Δｆ］＝Ｓ₂［ｍ・Δｆ］ …（Ａ８）
Ｓ_OUT［ｍ・Δｆ］＝Ｓ₃［ｍ・Δｆ］ …（Ａ９） For the m-th unit band, an output spectrum according to the following equation (A7) when the first condition is satisfied, according to the following equation (A8) when the second condition is satisfied, and according to the following equation (A9) when the third condition is satisfied. signal S _OUT signal component S _OUT of the [f] [m · Δf] is generated, the comparison processing unit 18 executes the generation processing for each unit band. The i-th condition is satisfied for the m-th unit band when the signal component S _i [m · Δf] is selected from the signal components S ₁ [m · Δf], S ₂ [m · Δf] and S ₃ [m · Δf]. Is the maximum signal level (where i is 1, 2 or 3).
S _OUT [m · Δf] = S ₁ [m · Δf] (A7)
S _OUT [m · Δf] = S ₂ [m · Δf] (A8)
S _OUT [m · Δf] = S ₃ [m · Δf] (A9)

スペクトル信号Ｓ_IN［ｆ］及びＳ_A［ｆ］が夫々図５（ａ）及び（ｂ）のスペクトル信号３００_IN［ｆ］及び３００_A［ｆ］であって、且つ、暗騒音のスペクトル信号Ｓ_B［ｆ］が図７のスペクトル信号３００_B［ｆ］である場合を考える。加えて、周波数帯域３０１において、第３条件が成立することを想定する。この想定下では、周波数帯域３０１において、スペクトル信号Ｓ_B［ｆ］（３００_B［ｆ］）に基づくスペクトル信号Ｓ₃［ｆ］が出力スペクトル信号Ｓ_OUT［ｆ］に含められ、結果、図８のスペクトル信号３２０_OUT［ｆ］が出力スペクトル信号Ｓ_OUT［ｆ］として比較処理部１８から出力されることになる。即ち、図６の信号３１０_OUT［ｆ］を基準として、周波数帯域３０１の信号を、スペクトル信号Ｓ_B［ｆ］（３００_B［ｆ］）に対応するスペクトル信号Ｓ₃［ｆ］に置き換えた結果、出力スペクトル信号Ｓ_OUT［ｆ］（３２０_OUT［ｆ］）における周波数帯域３０１内の信号レベルが増大補正されていることが分かる。 The spectral signals S _IN [f] and S _A [f] are the spectral signals 300 _IN [f] and 300 _A [f] of FIGS. 5A and 5B, respectively, and the background noise spectral signal S. Consider the case where _B [f] is the spectrum signal 300 _B [f] in FIG. In addition, it is assumed that the third condition is satisfied in the frequency band 301. Under this assumption, in the frequency band 301, the spectrum signal S ₃ [f] based on the spectrum signal S _B [f] (300 _B [f]) is included in the output spectrum signal S _OUT [f]. Spectrum signal 320 _OUT [f] is output from the comparison processing unit 18 as an output spectrum signal S _OUT [f]. That is, the result of replacing the signal in the frequency band 301 with the spectrum signal S ₃ [f] corresponding to the spectrum signal S _B [f] (300 _B [f]) using the signal 310 _OUT [f] in FIG. 6 as a reference. It can be seen that the signal level in the frequency band 301 in the output spectrum signal S _OUT [f] (320 _OUT [f]) is corrected to be increased.

上述の如く、スペクトル減算法を基本としつつ、暗騒音のスペクトル信号をも用いて出力音響信号を生成することにより、出力音響信号における歪みを低減することができる。つまり、暗騒音の信号レベルよりも小さな信号が出力音響信号に含められるような、出力音響信号の歪み（図６の周波数帯域３０１における信号歪みに相当）を補償することができ（図６の信号３１０_OUT［ｆ］から図８の信号３２０_OUT［ｆ］への変化が当該補償に相当）、その歪みによって生じうる、出力音響信号の聞き手の違和感を軽減することが可能となる。 As described above, distortion in the output sound signal can be reduced by generating the output sound signal using the spectrum signal of the background noise while using the spectrum subtraction method as a basis. That is, it is possible to compensate for distortion of the output acoustic signal (corresponding to signal distortion in the frequency band 301 in FIG. 6) such that a signal smaller than the background noise signal level is included in the output acoustic signal (signal in FIG. 6). The change from 310 _OUT [f] to the signal 320 _OUT [f] in FIG. 8 corresponds to the compensation), and it is possible to reduce the uncomfortable feeling of the listener of the output acoustic signal that may be caused by the distortion.

次に、図２の係数設定部２０が有する係数更新機能について説明する。係数更新機能において、係数更新部としての機能を持つ係数設定部２０は、３つの係数α、β及びγの内、少なくとも１以上の係数を更新対象係数に設定し、更新対象係数の値を更新することができる。上述の説明では特に意識しなかったが、係数設定部２０は、係数α、β及びγの値を単位帯域ごとに個別に設定することができ、上記更新を単位帯域ごとに行うことができる。 Next, the coefficient update function of the coefficient setting unit 20 in FIG. 2 will be described. In the coefficient updating function, the coefficient setting unit 20 having a function as a coefficient updating unit sets at least one coefficient among the three coefficients α, β, and γ as the update target coefficient, and updates the value of the update target coefficient. can do. Although not particularly conscious in the above description, the coefficient setting unit 20 can individually set the values of the coefficients α, β, and γ for each unit band, and can perform the update for each unit band.

ここで、説明の明確化上、図９に示す第１フレーム及び第２フレームを定義する。上述したように、入力音響信号Ｓ_IN［ｔ］はフレーム単位で分割され、フレームごとに入力音響信号Ｓ_IN［ｔ］から入力スペクトル信号Ｓ_IN［ｆ］が生成されるが、第２フレームは、第１フレームよりも時間的に後の任意のフレームである。第２フレームは第１フレームに隣接するフレームであっても良い。第１及び第２フレームの一部同士は互いにオーバラップしうる。第１フレームに適用される、第ｍ単位帯域に対する係数α、β、γを、特に、夫々、記号α［ｍ］、β［ｍ］、γ［ｍ］にて表し、第２フレームに適用される、第ｍ単位帯域に対する係数α、β、γを、特に、夫々、記号α_NEW［ｍ］、β_NEW［ｍ］、γ_NEW［ｍ］にて表す。 Here, for clarity of explanation, the first frame and the second frame shown in FIG. 9 are defined. As described above, the input acoustic signal S _IN [t] is divided in units of frames, and the input spectrum signal S _IN [f] is generated from the input acoustic signal S _IN [t] for each frame. , Any frame that is later in time than the first frame. The second frame may be a frame adjacent to the first frame. Part of the first and second frames may overlap each other. The coefficients α, β, and γ for the mth unit band applied to the first frame are represented by symbols α [m], β [m], and γ [m], respectively, and applied to the second frame. In particular, the coefficients α, β, and γ for the m-th unit band are represented by symbols α _NEW [m], β _NEW [m], and γ _NEW [m], respectively.

［減算係数αの更新方法］
係数αが更新対象係数に設定されたときの係数更新方法を説明する。係数設定部２０は、第１フレームの第ｍ単位帯域について、第１条件が成立するときには式（Ｂ１）に従って、第２条件が成立するときには式（Ｂ２）に従って、第３条件が成立するときには式（Ｂ３）に従って、第２フレームの係数α_NEW［ｍ］の値を決定する。但し、式（Ｂ１）〜（Ｂ３）における定数Ｋ１〜Ｋ３は、“１＜Ｋ１”及び“１＜Ｋ３＜Ｋ２＜１”を満たす。
α_NEW［ｍ］＝α［ｍ］×Ｋ１ …（Ｂ１）
α_NEW［ｍ］＝α［ｍ］×Ｋ２ …（Ｂ２）
α_NEW［ｍ］＝α［ｍ］×Ｋ３ …（Ｂ３） [How to update the subtraction coefficient α]
A coefficient update method when the coefficient α is set as the update target coefficient will be described. For the mth unit band of the first frame, the coefficient setting unit 20 follows the equation (B1) when the first condition is satisfied, follows the equation (B2) when the second condition is satisfied, and applies the equation when the third condition is satisfied. According to (B3), the value of the coefficient α _NEW [m] of the second frame is determined. However, the constants K1 to K3 in the formulas (B1) to (B3) satisfy “1 <K1” and “1 <K3 <K2 <1”.
α _NEW [m] = α [m] × K1 (B1)
α _NEW [m] = α [m] × K2 (B2)
α _NEW [m] = α [m] × K3 (B3)

係数αは、本来、非目的音を出力音響信号からなるだけ除去すべく、なるだけ大きくされた方が良い。第１条件成立時には、信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の内、信号Ｓ₁［ｆ］が最大であるため、非目的音の除去を更に増大させることが可能であると判断し、減算部１４における減算量を増やすべく、式（Ｂ１）により係数αを増大させて信号Ｓ₁［ｆ］の低減を図る。一方、第２条件成立時には、減算部１４での減算量が大きすぎる可能性があるため、式（Ｂ２）により係数αを減少させて信号Ｓ₁［ｆ］の増大を図る。第３条件が成立する状態は、信号Ｓ₁［ｆ］が暗騒音に対応する信号Ｓ₃［ｆ］よりも小さい状態であり、第２条件成立時よりも、減算部１４での減算量が更に大きすぎると考えられる。故に、第３条件成立時には、第２条件成立時よりも更に係数αを減少させるべく、式（Ｂ３）にて係数αの更新を行い、信号Ｓ₁［ｆ］の更なる増大を図る。 The coefficient α should be as large as possible in order to remove non-target sound from the output acoustic signal as much as possible. When the first condition is satisfied, the signal S ₁ [f] is the largest of the signals S ₁ [f] to S ₃ [f], and therefore it is determined that the removal of the non-target sound can be further increased. In order to increase the subtraction amount in the subtraction unit 14, the coefficient α is increased by the equation (B1) to reduce the signal S ₁ [f]. On the other hand, when the second condition is satisfied, there is a possibility that the subtraction amount in the subtraction unit 14 is too large. Therefore, the coefficient α is decreased by the equation (B2) to increase the signal S ₁ [f]. The state where the third condition is satisfied is a state where the signal S ₁ [f] is smaller than the signal S ₃ [f] corresponding to the background noise, and the subtraction amount in the subtracting unit 14 is smaller than that when the second condition is satisfied. It is considered to be too large. Therefore, when the third condition is satisfied, the coefficient α is updated by the equation (B3) so as to further decrease the coefficient α compared to when the second condition is satisfied, and the signal S ₁ [f] is further increased.

［フロアリング係数βの更新方法］
係数βが更新対象係数に設定されたときの係数更新方法を説明する。係数設定部２０は、第１フレームの第ｍ単位帯域について、第１条件が成立するときには式（Ｃ１）に従って、第２条件が成立するときには式（Ｃ２）に従って、第３条件が成立するときには式（Ｃ３）に従って、第２フレームの係数β_NEW［ｍ］の値を決定する。但し、式（Ｃ１）〜（Ｃ３）における定数Ｋ１〜Ｋ３は、“１＜Ｋ１”、“０＜Ｋ２＜１”及び“１＜Ｋ３”を満たす。
β_NEW［ｍ］＝β［ｍ］×Ｋ１ …（Ｃ１）
β_NEW［ｍ］＝β［ｍ］×Ｋ２ …（Ｃ２）
β_NEW［ｍ］＝β［ｍ］×Ｋ３ …（Ｃ３） [How to update the flooring coefficient β]
A coefficient update method when the coefficient β is set as the update target coefficient will be described. For the mth unit band of the first frame, the coefficient setting unit 20 follows the equation (C1) when the first condition is satisfied, follows the equation (C2) when the second condition is satisfied, and applies the equation when the third condition is satisfied. According to (C3), the value of the coefficient β _NEW [m] of the second frame is determined. However, the constants K1 to K3 in the equations (C1) to (C3) satisfy “1 <K1”, “0 <K2 <1”, and “1 <K3”.
β _NEW [m] = β [m] × K1 (C1)
β _NEW [m] = β [m] × K2 (C2)
β _NEW [m] = β [m] × K3 (C3)

第１条件成立時、信号Ｓ₂［ｆ］が小さすぎて、信号Ｓ₂［ｆ］を比較処理部１８に入力した意義（Ｓ₁［ｆ］が小さすぎるときにおいて、Ｓ₁［ｆ］をＳ_OUT［ｆ］を含めたならば生じうる歪みをＳ₂［ｆ］で補償するという意義）が薄れている可能性がある。故に、第１条件成立時には、式（Ｃ１）により係数βを増大させて信号Ｓ₂［ｆ］の増大を図る。第２条件成立時には、信号Ｓ₂［ｆ］が大きすぎて信号Ｓ₁［ｆ］を用いた本来の雑音低減ができていない可能性があるため、式（Ｃ２）により係数β及び信号Ｓ₂［ｆ］を小さくする。第３条件が成立する状態は、信号Ｓ₂［ｆ］が暗騒音に対応する信号Ｓ₃［ｆ］よりも小さい状態である。目的音を含まない暗騒音の信号レベルが目的音を含む信号の信号レベルよりも低い状態は本来発生すべきではなく、その状態の発生は、係数βが小さいことに原因があるとも言える。故に、第３条件成立時には、式（Ｃ３）により係数β及び信号Ｓ₂［ｆ］を大きくする。 When the first condition is satisfied, the signal S ₂ [f] is too small and the significance of inputting the signal S ₂ [f] to the comparison processing unit 18 (when S ₁ [f] is too small, S ₁ [f] is There is a possibility that the distortion that may occur if S _OUT [f] is included, is compensated with S ₂ [f]). Therefore, when the first condition is satisfied, the coefficient S is increased by the equation (C1) to increase the signal S ₂ [f]. During the second condition is established, the signal S ₂ [f] is because it may not be the original noise reduction using the signal S ₁ [f] is too large, the coefficient β and the signal S ₂ by the equation (C2) [F] is reduced. The state in which the third condition is satisfied is a state in which the signal S ₂ [f] is smaller than the signal S ₃ [f] corresponding to background noise. A state where the signal level of background noise that does not include the target sound should not be lower than the signal level of the signal that includes the target sound, and it can be said that this state is caused by a small coefficient β. Therefore, when the third condition is satisfied, the coefficient β and the signal S ₂ [f] are increased by the equation (C3).

［暗騒音用係数γの更新方法］
係数γが更新対象係数に設定されたときの係数更新方法を説明する。係数設定部２０は、第１フレームの第ｍ単位帯域について、第１条件が成立するときには式（Ｄ１）に従って、第２条件が成立するときには式（Ｄ２）に従って、第３条件が成立するときには式（Ｄ３）に従って、第２フレームの係数γ_NEW［ｍ］の値を決定する。但し、式（Ｄ１）〜（Ｄ３）における定数Ｋ１〜Ｋ３は、“１＜Ｋ１”、“１＜Ｋ２”及び“０＜Ｋ３＜１”を満たす。
γ_NEW［ｍ］＝γ［ｍ］×Ｋ１ …（Ｄ１）
γ_NEW［ｍ］＝γ［ｍ］×Ｋ２ …（Ｄ２）
γ_NEW［ｍ］＝γ［ｍ］×Ｋ３ …（Ｄ３） [How to update coefficient γ for background noise]
A coefficient updating method when the coefficient γ is set as the update target coefficient will be described. For the mth unit band of the first frame, the coefficient setting unit 20 follows the equation (D1) when the first condition is satisfied, follows the equation (D2) when the second condition is satisfied, and applies the equation when the third condition is satisfied. According to (D3), the value of the coefficient γ _NEW [m] of the second frame is determined. However, the constants K1 to K3 in the equations (D1) to (D3) satisfy “1 <K1”, “1 <K2”, and “0 <K3 <1”.
γ _NEW [m] = γ [m] × K1 (D1)
γ _NEW [m] = γ [m] × K2 (D2)
γ _NEW [m] = γ [m] × K3 (D3)

第１条件成立時、信号Ｓ₃［ｆ］が小さすぎて、信号Ｓ₃［ｆ］を比較処理部１８に入力した意義（図６の周波数帯域３０１における歪みを、信号Ｓ₃［ｆ］にて図８の如く補償するという意義）が薄れている可能性がある。故に、第１条件成立時には、式（Ｄ１）により係数γを増大させて信号Ｓ₃［ｆ］の増大を図る。第２条件成立時も、第１条件成立時と同様、信号Ｓ₃［ｆ］が小さすぎる可能性があるため、式（Ｄ２）により係数γ及び信号Ｓ₃［ｆ］を大きくする。第３条件成立時には、信号Ｓ₃［ｆ］が大きすぎて、本来達成されるべき非目的音の低減（Ｓ₁［ｆ］のＳ_OUT［ｆ］への反映）をできていない可能性があるため、式（Ｄ３）により係数γ及び信号Ｓ₃［ｆ］を小さくする。 When the first condition is satisfied, the signal S ₃ [f] is too small, and the significance of inputting the signal S ₃ [f] to the comparison processing unit 18 (the distortion in the frequency band 301 in FIG. 6 is changed to the signal S ₃ [f]). The meaning of compensation as shown in FIG. Therefore, when the first condition is satisfied, the coefficient γ is increased by the equation (D1) to increase the signal S ₃ [f]. Even when the second condition is satisfied, similarly to the case when the first condition is satisfied, the signal S ₃ [f] may be too small. Therefore, the coefficient γ and the signal S ₃ [f] are increased according to the equation (D2). When the third condition is established, there is a possibility that the signal S ₃ [f] is too large to reduce the non-target sound that should be originally achieved (reflection of S ₁ [f] to S _OUT [f]). Therefore, the coefficient γ and the signal S ₃ [f] are reduced by the equation (D3).

［係数更新の共通事項］
係数設定部２０は、次々と訪れる各フレームを第１フレームと捉えて、フレームごとに更新対象係数を更新することができる。但し、更新対象係数の更新が行われないフレームが存在していても良い。 [Common items for coefficient update]
The coefficient setting unit 20 can regard each frame that is visited one after another as the first frame, and can update the update target coefficient for each frame. However, there may be a frame in which the update target coefficient is not updated.

α［ｍ］に定数（Ｋ１、Ｋ２又はＫ３）を乗じることでα_NEW［ｍ］を求める方法を例示したが、上述の更新の主旨に従っている限り、α［ｍ］からα_NEW［ｍ］を決定する演算式は任意に変更可能である（係数β及びγについても同様）。例えば、係数αを増大させる場合には、α［ｍ］に正の所定値を加算することでα_NEW［ｍ］を決定し、係数αを減少させる場合には、α［ｍ］から正の所定値を減算することでα_NEW［ｍ］を決定すれば良い（係数β及びγについても同様）。 The method of obtaining α _NEW [m] by multiplying α [m] by a constant (K1, K2 or K3) is illustrated, but as long as the above update is followed, α _NEW [m] The arithmetic expression to be determined can be arbitrarily changed (the same applies to the coefficients β and γ). For example, when the coefficient α is increased, α _NEW [m] is determined by adding a positive predetermined value to α [m], and when the coefficient α is decreased, a positive value is determined from α [m]. Α _NEW [m] may be determined by subtracting a predetermined value (the same applies to the coefficients β and γ).

また、“０＜α”、“０＜β＜１”及び“０＜γ＜１”が満たされる範囲内で、係数α、β及びγの更新が行われるよう、更新処理に制限を加えると良い。つまり、入力スペクトル信号Ｓ_IN［ｆ］から非目的音に対応する信号を“減算”するため、係数αを常に正とする。信号Ｓ₂［ｆ］が信号Ｓ_IN［ｆ］よりも大きくなると、常に信号Ｓ₂［ｆ］が信号Ｓ₁［ｆ］よりも大きくなって本来の雑音低減が不能になるため、係数βは１未満の正の値とする。入力音響信号が暗騒音のレベルに近いような状況下において信号Ｓ₃［ｆ］が有効に機能することを想定しており、そのような状況下において、信号Ｓ₃［ｆ］が実際の暗騒音よりも大きくなるべきではない（暗騒音よりも大きな信号が出力音響信号に混入するため）。故に、係数γは１未満の正の値とする。 Further, if the update process is limited so that the coefficients α, β, and γ are updated within the range where “0 <α”, “0 <β <1”, and “0 <γ <1” are satisfied. good. That is, since the signal corresponding to the non-target sound is “subtracted” from the input spectrum signal S _IN [f], the coefficient α is always positive. When the signal S ₂ [f] becomes larger than the signal S _IN [f], the signal S ₂ [f] is always larger than the signal S ₁ [f], and the inherent noise reduction becomes impossible. A positive value less than 1. It is assumed that the signal S ₃ [f] functions effectively in a situation where the input acoustic signal is close to the background noise level. In such a situation, the signal S ₃ [f] is actually dark. It should not be louder than noise (because a signal larger than background noise is mixed in the output acoustic signal). Therefore, the coefficient γ is a positive value less than 1.

上述の如く、係数設定部２０は、スペクトル信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の信号レベルの大小関係に基づき（信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の内、何れを信号Ｓ_OUT［ｆ］に含めたかに基づき）、更新対象係数の値を逐次更新することができる。つまり、係数設定部２０は、第１時刻（第１フレーム）におけるスペクトル信号Ｓ₁［ｆ］〜Ｓ₃［ｆ］の信号レベルの大小関係に基づき、第１時刻よりも後の第２時刻（第２フレーム）の更新対象係数の値が決定されるように、更新対象係数を更新することができる。 As described above, the coefficient setting unit 20, spectrum signal S ₁ [f] ~S ₃ of on a relationship in magnitude between the signal levels of the [f] (signal _{_{S 1 [f] ~S 3 [}} f], any signal The value of the update target coefficient can be sequentially updated based on whether it is included in S _OUT [f]. That is, the coefficient setting unit 20 determines the second time after the first time (based on the magnitude relationship between the signal levels of the spectrum signals S ₁ [f] to S ₃ [f] at the first time (first frame). The update target coefficient can be updated so that the value of the update target coefficient in the second frame) is determined.

この更新処理により、実際に使用される信号Ｓ_IN［ｆ］、Ｓ_A［ｆ］又はＳ_B［ｆ］に適した、スペクトル減算又は暗騒音信号による補償（図６の周波数帯域３０１における歪みの、信号Ｓ₃［ｆ］による補償；図８参照）が可能になる。係数αの更新により減算部１４での減算量が適正化され、実際に発生する非目的音の時系列上でのばらつき（非目的音の経年変化）にも対応することができると共に、保持部１２の保持信号と実際に発生する非目的音の信号との相違（個体差ばらつき）にも対応することができる。また、特開２００８−２５２３８９号公報に記載された、ズーム動作直前の入力音響信号のパワー値に基づきフロアリング係数を調整する方法では、ズーム動作中に入力音響信号のレベルが変動するケースに対応できないが（当該調整内容が有効でなくなるが）、上述の如くフロアリング係数βを逐次更新することで、そのようなケースにも適切に対応可能である。 By this update process, compensation by spectral subtraction or background noise signal suitable for the actually used signal S _IN [f], S _A [f] or S _B [f] (distortion in the frequency band 301 in FIG. 6). , Compensation by the signal S ₃ [f]; see FIG. 8). The amount of subtraction in the subtracting unit 14 is optimized by updating the coefficient α, and it is possible to cope with variations in time series of non-target sounds that actually occur (aging of non-target sounds) and a holding unit. It is possible to cope with the difference (individual difference variation) between the 12 held signals and the actually generated non-target sound signal. Further, the method of adjusting the flooring coefficient based on the power value of the input acoustic signal immediately before the zoom operation described in Japanese Patent Application Laid-Open No. 2008-252389 supports a case where the level of the input acoustic signal varies during the zoom operation. Although it is not possible (although the adjustment contents become invalid), such a case can be appropriately handled by sequentially updating the flooring coefficient β as described above.

但し、係数α、β及びγの内、何れか係数又は全ての係数の更新処理を行わないようにすることも可能である。即ち、雑音低減装置１において、３つの係数α、β及びγの内、１以上の係数の値を固定値にしておくことも可能である。 However, it is possible not to update any or all of the coefficients α, β and γ. In other words, in the noise reduction apparatus 1, it is possible to set one or more of the three coefficients α, β, and γ to a fixed value.

＜＜第２実施形態＞＞
本発明の第２実施形態を説明する。第２実施形態は第１実施形態を基礎とする実施形態であり、第２実施形態において特に述べない事項に関しては、特に記述無き限り且つ矛盾無き限り、第１実施形態の記載が第２実施形態にも適用される。第１実施形態の記載を第２実施形態に適用する場合、第１実施形態の記載文中の雑音低減装置１は第２実施形態において雑音低減装置１ａに読み替えられる。 << Second Embodiment >>
A second embodiment of the present invention will be described. The second embodiment is an embodiment based on the first embodiment. Regarding matters not specifically described in the second embodiment, the description of the first embodiment is the second embodiment unless there is a description and no contradiction. Also applies. When the description of the first embodiment is applied to the second embodiment, the noise reduction device 1 in the description of the first embodiment is replaced with the noise reduction device 1a in the second embodiment.

図１０に、本発明の第２実施形態に係る雑音低減装置１ａと、雑音低減装置１ａに接続される部位を示す。図１０の雑音低減装置１ａは、符号１１、１３〜２０、３１及び３２によって参照される各部位を備える。図１０の雑音低減装置１ａは、図１の雑音低減装置１における保持部１２を非目的音信号生成部３１及びＦＦＴ部３２に置換することで得られ、その置換と後述の内容を除き、装置１ａは装置１と同様である。 FIG. 10 shows a noise reduction device 1a according to the second embodiment of the present invention and parts connected to the noise reduction device 1a. The noise reduction apparatus 1a of FIG. 10 includes each part referred to by reference numerals 11, 13 to 20, 31 and 32. The noise reduction device 1a of FIG. 10 is obtained by replacing the holding unit 12 in the noise reduction device 1 of FIG. 1 with a non-target sound signal generation unit 31 and an FFT unit 32, except for the replacement and the contents described later. 1a is the same as the apparatus 1.

非目的音信号生成部３１は、集音部２を用いた入力音響信号Ｓ_IN［ｔ］の収集（集音）と同期して非目的音を収集し、収集した非目的音の音響信号Ｓ_A［ｔ］を生成する。生成部３１において、又は、生成部３１及びＦＦＴ部３２間に設けられたフレーム分割部（不図示）において、時間領域上の音響信号である音響信号Ｓ_A［ｔ］は上記フレーム単位で分割される。ＦＦＴ部３２は、フレーム単位で分割された音響信号Ｓ_A［ｔ］に対して、フレームごとに、時間／周波数変換の一種である離散フーリエ変換を行うことで、非目的音のスペクトル信号Ｓ_A［ｆ］を生成する。スペクトル信号Ｓ_A［ｆ］は、音響信号Ｓ_A［ｔ］を周波数領域上の信号に変換したものである。 The non-target sound signal generation unit 31 collects the non-target sound in synchronization with the collection (sound collection) of the input acoustic signal S _IN [t] using the sound collection unit 2, and the collected non-target sound acoustic signal S _A [t] is generated. In the generation unit 31 or in a frame division unit (not shown) provided between the generation unit 31 and the FFT unit 32, the acoustic signal S _A [t], which is an acoustic signal in the time domain, is divided in units of frames. The The FFT unit 32 performs a discrete Fourier transform, which is a type of time / frequency conversion, for each frame on the acoustic signal S _A [t] divided in units of frames, so that the spectrum signal S _A of the non-target sound is obtained. [F] is generated. The spectrum signal S _A [f] is obtained by converting the acoustic signal S _A [t] into a signal on the frequency domain.

図１１（ａ）に示す如く、生成部３１に１以上のマイクロホンから成る集音部３１ａを設けても良く、集音部３１ａを用いて音響信号Ｓ_A［ｔ］を生成しても良い。集音部３１ａは、非目的音のみを集音できるように配置及び形成され、集音した非目的音を音響信号に変換して出力する。生成部３１は、集音部３１ａの出力音響信号をデジタル化することで音響信号Ｓ_A［ｔ］を生成することができる。或いは、図１１（ｂ）に示す如く、生成部３１に計測部３１ｂを設けても良い。計測部３１ｂは、非目的音の発生源の振動の成分を計測する加速度センサ等にて形成され、生成部３１は、計測部３１ｂの計測結果を音響信号に変換することで音響信号Ｓ_A［ｔ］を生成しても良い。 As shown in FIG. 11A, the generating unit 31 may be provided with a sound collecting unit 31a including one or more microphones, and the sound signal S _A [t] may be generated using the sound collecting unit 31a. The sound collection unit 31a is arranged and formed so as to collect only the non-target sound, and converts the collected non-purpose sound into an acoustic signal and outputs it. The generating unit 31 can generate the acoustic signal S _A [t] by digitizing the output acoustic signal of the sound collecting unit 31a. Or you may provide the measurement part 31b in the production | generation part 31, as shown in FIG.11 (b). The measurement unit 31b is formed by an acceleration sensor or the like that measures the vibration component of the non-target sound source, and the generation unit 31 converts the measurement result of the measurement unit 31b into an acoustic signal, thereby converting the acoustic signal S _A [ t] may be generated.

リアルタイムに収集した非目的音の音響信号を用いてスペクトル減算を行うことにより、その時々の非目的音に適した最適なスペクトル減算が可能となる。最適なスペクトル減算は、減算スペクトル信号Ｓ₁［ｆ］の適正化を実現するため（例えば、Ｓ₁［ｆ］＜Ｓ₂［ｆ］になるような事態が極力回避されるようになるため）、図６を参照して説明したような歪みの低減（出力音響信号の聞き手の違和感の低減）にもつながる。 By performing spectral subtraction using a non-target sound signal collected in real time, it is possible to perform optimal spectral subtraction suitable for the non-target sound at that time. Optimal spectral subtraction is to optimize the subtracted spectral signal S ₁ [f] (for example, a situation where S ₁ [f] <S ₂ [f] is avoided as much as possible). This also leads to a reduction in distortion as described with reference to FIG. 6 (a reduction in the uncomfortable feeling of the listener of the output acoustic signal).

また、従来のフィードフォワード型ＡＮＣ（Active Noise Control）では、入力音響信号中の雑音成分と雑音信号として集音された雑音成分との間の位相ズレ等が問題になるが、本実施形態の如く、雑音（非目的音）の音響信号Ｓ_A［ｔ］を時間／周波数変換して、周波数領域上でスペクトル演算（スペクトルパワー比較等）を行えば位相に関係なく雑音低減が可能である。 Further, in the conventional feedforward type ANC (Active Noise Control), a phase shift or the like between a noise component in an input acoustic signal and a noise component collected as a noise signal becomes a problem. Noise can be reduced regardless of the phase by performing time / frequency conversion of the noise (non-target sound) acoustic signal S _A [t] and performing spectrum calculation (spectral power comparison or the like) in the frequency domain.

第２実施形態においても、第１実施形態で述べた、係数設定部２０による係数α、β及びγの更新処理を行っても良い。但し、雑音低減装置１ａにおいて、３つの係数α、β及びγの内、１以上の係数の値を固定値にしておくことも可能である。 Also in the second embodiment, the coefficient α, β, and γ update processing by the coefficient setting unit 20 described in the first embodiment may be performed. However, in the noise reduction apparatus 1a, it is also possible to set a value of one or more of the three coefficients α, β and γ to a fixed value.

雑音低減装置１ａから、暗騒音信号保持部１６及び乗算部１７を削除することも可能である。この場合、比較処理部１８は、単位帯域ごとに、スペクトル信号Ｓ₁［ｆ］及びＳ₂［ｆ］の信号レベルを比較し、単位帯域ごとに、スペクトル信号Ｓ₁［ｆ］及びＳ₂［ｆ］の内、より大きな信号レベルを持つスペクトル信号を出力スペクトル信号Ｓ_OUT［ｆ］に含める（即ち、信号成分Ｓ₁［ｍ・Δｆ］及びＳ₂［ｍ・Δｆ］の内、信号レベルの大きい方の信号成分を、信号成分Ｓ_OUT［ｍ・Δｆ］に代入する）。 It is also possible to delete the background noise signal holding unit 16 and the multiplication unit 17 from the noise reduction device 1a. In this case, comparison processing unit 18, for each unit band, to compare the signal level of the spectral signal S ₁ [f] and S ₂ [f] for each unit band, the spectrum signal S ₁ [f] and S ₂ [ f], a spectrum signal having a larger signal level is included in the output spectrum signal S _OUT [f] (that is, of the signal components S ₁ [m · Δf] and S ₂ [m · Δf], The larger signal component is substituted into the signal component S _OUT [m · Δf]).

＜＜変形等＞＞
本発明の実施形態は、特許請求の範囲に示された技術的思想の範囲内において、適宜、種々の変更が可能である。以上の実施形態は、あくまでも、本発明の実施形態の例であって、本発明ないし各構成要件の用語の意義は、以上の実施形態に記載されたものに制限されるものではない。上述の説明文中に示した具体的な数値は、単なる例示であって、当然の如く、それらを様々な数値に変更することができる。上述の実施形態に適用可能な注釈事項として、以下に、注釈１〜注釈３を記す。各注釈に記載した内容は、矛盾なき限り、任意に組み合わせることが可能である。 << Deformation, etc. >>
The embodiment of the present invention can be appropriately modified in various ways within the scope of the technical idea shown in the claims. The above embodiment is merely an example of the embodiment of the present invention, and the meaning of the term of the present invention or each constituent element is not limited to that described in the above embodiment. The specific numerical values shown in the above description are merely examples, and as a matter of course, they can be changed to various numerical values. As annotations applicable to the above-described embodiment, notes 1 to 3 are described below. The contents described in each comment can be arbitrarily combined as long as there is no contradiction.

［注釈１］
各実施形態の説明文における“信号レベル”を“大きさ”又は“パワー”に読み替えても良い。時間領域又は周波数領域上の任意の音響信号に関し、信号レベルは、その音響信号の大きさを表す指標の一種である。時間領域又は周波数領域上の任意の音響信号に関し、パワーが、その音響信号の大きさを表す指標であっても良い。即ち、時間領域又は周波数領域上の任意の音響信号に関し、当該音響信号の大きさとは、当該音響信号の信号レベル又はパワーを表す。 [Note 1]
“Signal level” in the description of each embodiment may be read as “size” or “power”. Regarding an arbitrary acoustic signal in the time domain or the frequency domain, the signal level is a kind of index indicating the magnitude of the acoustic signal. Regarding an arbitrary acoustic signal in the time domain or the frequency domain, the power may be an index representing the magnitude of the acoustic signal. That is, regarding an arbitrary acoustic signal in the time domain or the frequency domain, the magnitude of the acoustic signal represents the signal level or power of the acoustic signal.

［注釈２］
雑音低減装置１若しくは１ａ又は電子機器１００である対象装置を、ハードウェア、或いは、ハードウェアとソフトウェアの組み合わせによって構成することができる。対象装置にて実現される機能の全部又は一部である任意の特定の機能をプログラムとして記述して、該プログラムを対象装置に搭載可能なフラッシュメモリに保存しておき、該プログラムをプログラム実行装置（例えば、対象装置に搭載可能なマイクロコンピュータ）上で実行することによって、その特定の機能を実現するようにしてもよい。上記プログラムは任意の記録媒体（不図示）に記憶及び固定されうる。上記プログラムを記憶及び固定する記録媒体（不図示）は対象装置と異なる機器（サーバ機器等）に搭載又は接続されても良い。 [Note 2]
The target device that is the noise reduction device 1 or 1a or the electronic device 100 can be configured by hardware or a combination of hardware and software. Arbitrary specific functions that are all or part of the functions realized in the target device are described as a program, the program is stored in a flash memory that can be mounted on the target device, and the program is executed by the program execution device. The specific function may be realized by executing on a microcomputer (for example, a microcomputer that can be mounted on the target device). The program can be stored and fixed in an arbitrary recording medium (not shown). A recording medium (not shown) for storing and fixing the program may be mounted or connected to a device (such as a server device) different from the target device.

［注釈３］
例えば、以下のように考えることができる。雑音低減装置１又は１ａは、雑音（低減対象雑音）のスペクトル信号Ｓ_A［ｆ］に減算係数αを乗じて得た信号Ｓ_A’［ｆ］を入力スペクトル信号Ｓ_IN［ｆ］から減算することで減算スペクトル信号Ｓ₁［ｆ］を生成し、減算スペクトル信号Ｓ₁［ｆ］を用いて、雑音（低減対象雑音）が抑制された出力音響信号を生成することができる。上述の各実施形態では、低減対象雑音を非目的音と呼んでいる。 [Note 3]
For example, it can be considered as follows. The noise reduction device 1 or 1a subtracts the signal S _A '[f] obtained by multiplying the spectrum signal S _A [f] of noise (noise to be reduced) by the subtraction coefficient α from the input spectrum signal S _IN [f]. Thus, a subtracted spectrum signal S ₁ [f] can be generated, and an output acoustic signal in which noise (noise to be reduced) is suppressed can be generated using the subtracted spectrum signal S ₁ [f]. In each of the above-described embodiments, the noise to be reduced is called a non-target sound.

１、１ａ雑音低減装置
１２非目的音保持部
１６暗騒音信号保持部
１８比較処理部
２０係数設定部
３１非目的信号生成部
１００電子機器 DESCRIPTION OF SYMBOLS 1, 1a Noise reduction apparatus 12 Non-purpose sound holding part 16 Background noise signal holding part 18 Comparison processing part 20 Coefficient setting part 31 Non-purpose signal generation part 100 Electronic device

Claims

A subtracted spectrum signal is generated by subtracting a signal obtained by multiplying a noise spectrum signal by a predetermined coefficient from an input spectrum signal based on the input acoustic signal collected by the sound collecting unit, and the subtracted spectrum signal is used. In the noise reduction device that generates the output acoustic signal,
A background noise signal holding unit that holds a background noise spectrum signal acquired in advance;
An output generation unit that generates the output acoustic signal using the subtracted spectrum signal and the background noise spectrum signal , and
The output generation unit includes a first spectrum signal as the subtracted spectrum signal, a second spectrum signal obtained by multiplying the input spectrum signal by a predetermined second coefficient, and a predetermined third coefficient to the background noise spectrum signal. Based on the third spectral signal obtained by multiplying by
In each of a plurality of frequency bands, the magnitudes of the first, second, and third spectrum signals are compared, and the spectrum signal having the maximum magnitude among the first, second, and third spectrum signals is used. The noise reduction device is characterized by generating the output acoustic signal .

Based on the magnitude relationship between the magnitudes of the first, second, and third spectral signals, at least one of the first coefficient, the second coefficient, and the third coefficient as the coefficient multiplied by the noise spectrum signal. The noise reduction apparatus according to claim 1 , further comprising a coefficient updating unit that updates one coefficient as an update target coefficient.

The coefficient updating unit determines a value of the update target coefficient at a second time after the first time based on a magnitude relationship between the magnitudes of the first, second, and third spectrum signals at the first time. The noise reduction apparatus according to claim 2 , wherein the update is performed.

A coefficient updating unit that updates the coefficient multiplied by the noise spectrum signal based on a magnitude relationship between the magnitude of the subtracted spectrum signal and the magnitude of the spectrum signal obtained by multiplying the background noise spectrum signal by a predetermined coefficient; The noise reduction device according to claim 1 , further comprising:

By generating the acoustic signal in the time domain of the noise by collecting the noise in synchronization with the collection of the input acoustic signal by the sound collection unit, and converting the generated acoustic signal into a signal in the frequency domain noise reduction device according to any one of claims 1 to 4, further comprising a signal generator for generating a spectral signal of the noise.