JP5340127B2

JP5340127B2 - Audio signal processing apparatus and control method of audio signal processing apparatus

Info

Publication number: JP5340127B2
Application number: JP2009278954A
Authority: JP
Inventors: 信吾池田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2009-12-08
Filing date: 2009-12-08
Publication date: 2013-11-13
Anticipated expiration: 2029-12-08
Also published as: JP2011124661A

Description

本発明は、音声信号処理装置に関し、特に、取得した音声信号に含まれる、風雑音（ウィンドウノイズ）を低減することができる装置に関する。 The present invention relates to an audio signal processing apparatus, and more particularly to an apparatus that can reduce wind noise (window noise) included in an acquired audio signal.

従来、音声信号を処理する装置として、画像信号を記録すると共に音声信号を記録する撮像装置が知られている。これらの撮像装置では、外部の音声を集音し音声信号を生成するためのマイクロホンを備えているが、その表面等に風が当たると、ユーザにとって耳障りな、雑音が発生してしまうことがあった。この様な装置周辺の風の影響により発生する雑音を一般に風雑音という。 2. Description of the Related Art Conventionally, as an apparatus that processes an audio signal, an imaging apparatus that records an image signal and an audio signal is known. These imaging devices include a microphone for collecting external sound and generating a sound signal. However, when wind hits the surface or the like, noise that is annoying to the user may occur. It was. Such noise generated by the influence of wind around the device is generally called wind noise.

このような問題に対し、マイクから入力された音声の例えば、１ｋＨｚ以下の低域成分等を減衰させるハイパスフィルタにより、風雑音の影響が大きい低周波成分を減衰させる技術などが提案されている。例えば、特許文献１の技術のように、風雑音の低減をした後に、オートレベルコントローラ（以後、ＡＬＣ）によって、音声のレベルを調整するものが提案されている。なお、ＡＬＣは、音声のレベルが低いときには、音声信号を増幅し、音声のレベルが高いときには音声信号を減衰させるようにしている。 In order to solve such a problem, a technique of attenuating a low frequency component that is greatly affected by wind noise by a high-pass filter that attenuates, for example, a low-frequency component of 1 kHz or less of audio input from a microphone has been proposed. For example, as in the technique of Patent Document 1, after reducing wind noise, an audio level is adjusted by an auto level controller (hereinafter ALC). The ALC amplifies the audio signal when the audio level is low, and attenuates the audio signal when the audio level is high.

特開平０５−３２８４８０号公報JP 05-328480 A

しかし、従来は、減衰しきれない周波数に風雑音が含まれていた場合などには、風雑音を完全に低減させることは難しく、音声信号に音量（レベル）が不安定な風雑音成分が残ってしまうことがあった。そのため、特許文献１では、音声信号に音量の不安定な風雑音が残ると、ＡＬＣが音声信号の増幅と減衰を繰り返してしまい、例えば、人の声などの一定の音量であるべき音が不安定な音声になってしまうという問題がった。このような音声信号を再生すると、本来聞き取りたい人の声などの音声の音量が安定せず聞きづらいものとなってしまう。 However, in the past, when wind noise was included in frequencies that could not be attenuated, it was difficult to completely reduce wind noise, and wind noise components with unstable volume (level) remained in the audio signal. There was a case. For this reason, in Patent Document 1, if wind noise with unstable volume remains in an audio signal, the ALC repeatedly amplifies and attenuates the audio signal. For example, a sound that should be at a constant volume such as a human voice is unacceptable. There was a problem that the sound became stable. When such an audio signal is reproduced, the volume of the voice such as the voice of the person who originally wants to hear becomes unstable and difficult to hear.

そこで、本発明は、風雑音（風量、風速、風圧）のレベルが高いときに、ＡＬＣの増幅率の変化速度を遅くすることによって、ＡＬＣの出力音声の不安定さを低減することができる音声信号処理装置を提供することを目的とする。 Therefore, the present invention reduces the instability of the ALC output sound by slowing the change rate of the ALC gain when the level of wind noise (air volume, wind speed, wind pressure) is high. An object is to provide a signal processing device.

なお、風量のレベルが高いときや風速の速いとき、風圧の高いとき等であっても、音声信号に含まれる風雑音のレベルが高くなる可能性が大きい。 Even when the air volume level is high, the wind speed is high, or the wind pressure is high, there is a high possibility that the level of wind noise included in the audio signal will be high.

本発明の音声処理装置は、かかる目的を達成するために、集音手段を備える音声信号処理装置であって、前記集音手段により得られる音声信号に含まれる前記装置周辺の風の影響による雑音成分のレベルを検出する検出手段と、前記音声信号を音声信号のレベルに応じた増幅率で増幅させる調整手段とを有し、前記調整手段は、前記音声信号のレベルが高くなると前記増幅率を下げ、音声信号のレベルが低くなると前記増幅率を上げ、前記調整手段は、前記増幅率を上げる場合に、前記検出手段により検出された前記雑音成分のレベルが高いときほど、前記増幅率の変化速度を遅くする構成とした。 Audio processing apparatus of the present invention, in order to achieve the above object, an audio signal processing apparatus including a sound collection unit, according to the wind effects of the device around included in the audio signal obtained by said sound collecting means Detecting means for detecting a level of a noise component; and adjusting means for amplifying the audio signal with an amplification factor corresponding to the level of the audio signal, wherein the adjusting means increases the amplification factor when the level of the audio signal increases. the lowered, and the level of the audio signal that a low increase the amplification factor, the adjustment means, when increasing the gain, the more when the higher level of said noise component detected by said detecting means, said amplifier The rate change rate is slow.

本発明によれば、風雑音のレベルが大きいときにＡＬＣの増幅率の変化速度を遅くすることによって、出力音声の不安定さを低減することができる。これにより、風雑音のレベルが大きいとき等に、人の声などの音声のレベルが不安定になりにくくした音声信号を得ることができる。 According to the present invention, the instability of output sound can be reduced by slowing the rate of change of the ALC gain when the wind noise level is high. As a result, when the wind noise level is high, it is possible to obtain an audio signal in which the audio level such as a human voice is less likely to become unstable.

本実施例の撮像装置のブロック図である。It is a block diagram of the imaging device of a present Example. 音声入力部１０２のブロック図である。3 is a block diagram of a voice input unit 102. FIG. 振幅レベルに対応する増幅率を示す図である。It is a figure which shows the amplification factor corresponding to an amplitude level. 音声の振幅レベルの検出結果を示す図である。It is a figure which shows the detection result of the amplitude level of an audio | voice.

以下、図面を参照して本発明の実施例を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

音声信号を処理することができる音声信号処理装置として、撮像装置を例にとって説明する。 As an audio signal processing device capable of processing an audio signal, an imaging device will be described as an example.

図１は、実施例１の撮像装置１００の構成を示すブロック図である。 FIG. 1 is a block diagram illustrating the configuration of the imaging apparatus 100 according to the first embodiment.

図１において、撮像部１０１は、撮影レンズにより取り込まれた被写体の光学像を撮像素子により画像信号に変換し、アナログデジタル変換、画像調整処理などを行い、画像データを生成する。音声入力部１０２は、内蔵または音声端子を介して接続された複数のマイクにより、撮像装置１００の周辺の音声を集音し、アナログデジタル変換、音声処理などを行い音声データを生成する。マイクは、音声振動を音声信号に変換するものである。メモリ１０３は、撮像部１０１により得られた画像データや、音声入力部１０２により得られた音声データを一時的に記憶する。表示制御部１０４は、撮像部１０１により得られた画像データに係る映像や、撮像装置１００の操作画面、メニュー画面等を表示部１０５や、不図示の映像端子を介して外部のディスプレイに表示させる。符号化処理部１０６は、メモリ１０３に一時的に記憶された画像データや音声データを読み出して所定の符号化を行い、圧縮画像データ、圧縮音声データ等を生成する。記録再生部１０７は、記録媒体１０８に対して、符号化処理部１０６で生成された圧縮画像データ、圧縮音声データ等を記録したり、記録媒体１０８に記録された圧縮画像データ、圧縮音声データ、各種データ、プログラムを読み出す。ここで、記録媒体１０８は、圧縮画像データ、圧縮音声データ、等を記録することができれば、磁気ディスク、光学式ディスク、半導体メモリなどのあらゆる方式の記録媒体を含む。 In FIG. 1, an imaging unit 101 converts an optical image of a subject captured by a photographing lens into an image signal by an imaging element, performs analog-digital conversion, image adjustment processing, and the like, and generates image data. The audio input unit 102 collects audio around the imaging device 100 by a plurality of microphones built in or connected via audio terminals, and performs analog-digital conversion, audio processing, and the like to generate audio data. The microphone converts voice vibration into a voice signal. The memory 103 temporarily stores the image data obtained by the imaging unit 101 and the audio data obtained by the audio input unit 102. The display control unit 104 displays a video related to the image data obtained by the imaging unit 101, an operation screen of the imaging device 100, a menu screen, and the like on the display unit 105 or an external display via a video terminal (not shown). . The encoding processing unit 106 reads out image data and audio data temporarily stored in the memory 103, performs predetermined encoding, and generates compressed image data, compressed audio data, and the like. The recording / reproducing unit 107 records the compressed image data, the compressed audio data, and the like generated by the encoding processing unit 106 on the recording medium 108, or the compressed image data, the compressed audio data recorded on the recording medium 108, Read various data and programs. Here, the recording medium 108 includes all types of recording media such as a magnetic disk, an optical disk, and a semiconductor memory as long as compressed image data, compressed audio data, and the like can be recorded.

制御部１０９は、撮像装置１００の各ブロックに制御信号を送信することで撮像装置１００の各ブロックを制御することができ、各種制御を実行するためのＣＰＵやメモリなどからなる。操作部１１０は、ボタンやダイヤルなどからなり、ユーザの操作に応じて、指示信号を制御部１０９に送信する。音声出力部１１１は、記録再生部１０７により再生された圧縮音声データや、制御部１０９により出力される音声データをスピーカ１１２や音声端子などに出力する。外部出力部１１３は、記録再生部１０７により再生された圧縮映像データや圧縮音声データなどを外部機器に出力する。データバス１１４は、音声データや画像データ等の各種データ、各種制御信号を撮像装置１００の各ブロックに供給する。 The control unit 109 can control each block of the imaging apparatus 100 by transmitting a control signal to each block of the imaging apparatus 100, and includes a CPU, a memory, and the like for performing various controls. The operation unit 110 includes buttons, a dial, and the like, and transmits an instruction signal to the control unit 109 according to a user operation. The audio output unit 111 outputs the compressed audio data reproduced by the recording / reproducing unit 107 and the audio data output by the control unit 109 to the speaker 112, the audio terminal, and the like. The external output unit 113 outputs the compressed video data and the compressed audio data reproduced by the recording / reproducing unit 107 to an external device. The data bus 114 supplies various data such as audio data and image data and various control signals to each block of the imaging apparatus 100.

ここで、本実施例の撮像装置１００の通常の動作について説明する。 Here, the normal operation of the imaging apparatus 100 of the present embodiment will be described.

本実施例の撮像装置１００は、ユーザが操作部１１０を操作して電源を投入する指示が出されたことに応じて、不図示の電源供給部から、撮像装置の各ブロックに電源を供給する。 The imaging apparatus 100 according to the present embodiment supplies power to each block of the imaging apparatus from a power supply unit (not illustrated) in response to an instruction to turn on the power by operating the operation unit 110 by the user. .

電源が供給されると、制御部１０９は、例えば、操作部１１０のモード切り換えスイッチが、例えば、撮影モード、再生モード等のどのモードであるかを操作部１１０からの指示信号により確認する。動画記録モードでは、撮像部１０１により得られた画像データと音声入力部１０２により得られた音声データとを１つのファイルとして保存する。再生モードでは、記録媒体１０８に記録された圧縮画像データを記録再生部１０７により再生して表示部１０５に表示させる。 When the power is supplied, the control unit 109 checks, for example, which mode the mode selector switch of the operation unit 110 is in, for example, a shooting mode, a reproduction mode, or the like by an instruction signal from the operation unit 110. In the moving image recording mode, the image data obtained by the imaging unit 101 and the audio data obtained by the audio input unit 102 are stored as one file. In the reproduction mode, the compressed image data recorded on the recording medium 108 is reproduced by the recording / reproducing unit 107 and displayed on the display unit 105.

動画記録モードでは、まず、制御部１０９は、撮影待機状態に移行させるように制御信号を撮像装置１００の各ブロックに送信し、以下のような動作をさせる。 In the moving image recording mode, first, the control unit 109 transmits a control signal to each block of the imaging apparatus 100 so as to shift to the shooting standby state, and performs the following operation.

撮像部１０１は、撮影レンズにより取り込まれた被写体の光学像を撮像素子により画像信号に変換し、アナログデジタル変換、画像調整処理などを行い、画像データを生成する。そして、得られた画像データを表示処理部１０４に送信し、表示部１０５に表示させる。ユーザはこの様にして表示された画面を見ながら撮影の準備を行う。 The imaging unit 101 converts an optical image of a subject captured by a photographing lens into an image signal by an imaging element, performs analog-digital conversion, image adjustment processing, and the like, and generates image data. Then, the obtained image data is transmitted to the display processing unit 104 and displayed on the display unit 105. The user prepares for shooting while viewing the screen displayed in this way.

音声入力部１０２は、複数のマイクにより得られたアナログ音声信号をデジタル変換し、得られた複数のデジタル音声信号を処理して、マルチチャンネルの音声データを生成する。そして、得られた音声データを音声出力部１１１に送信し、接続されたスピーカ１１２や不図示のイヤホンから音声として出力させる。ユーザは、この様にして出力された音声を聞きながら記録音量を決定するためのマニュアルボリュームの調整をすることもできる。 The audio input unit 102 digitally converts analog audio signals obtained by a plurality of microphones and processes the obtained digital audio signals to generate multi-channel audio data. Then, the obtained audio data is transmitted to the audio output unit 111 and is output as audio from the connected speaker 112 or an unillustrated earphone. The user can also adjust the manual volume to determine the recording volume while listening to the sound output in this way.

次に、ユーザが操作部１１０の記録ボタンを操作することにより撮影開始の指示信号が制御部１０９に送信されると、制御部１０９は、撮像装置１００の各ブロックに撮影開始の指示信号を送信し、以下のような動作をさせる。 Next, when a shooting start instruction signal is transmitted to the control unit 109 by the user operating the recording button of the operation unit 110, the control unit 109 transmits a shooting start instruction signal to each block of the imaging apparatus 100. Then, the following operation is performed.

撮像部１０１は、撮影レンズにより取り込まれた被写体の光学像を撮像素子により画像信号に変換し、アナログデジタル変換、画像調整処理などを行い、画像データを生成する。そして、得られた画像データを表示処理部１０４に送信し、表示部１０５に表示させる。また、得られた画像データをメモリ１０３送信する。 The imaging unit 101 converts an optical image of a subject captured by a photographing lens into an image signal by an imaging element, performs analog-digital conversion, image adjustment processing, and the like, and generates image data. Then, the obtained image data is transmitted to the display processing unit 104 and displayed on the display unit 105. Further, the obtained image data is transmitted to the memory 103.

音声入力部１０２は、複数のマイクにより得られたアナログ音声信号をデジタル変換し、得られた複数のデジタル音声信号を処理して、マルチチャンネルの音声データを生成する。そして、得られた音声データをメモリ１０３に送信する。 The audio input unit 102 digitally converts analog audio signals obtained by a plurality of microphones and processes the obtained digital audio signals to generate multi-channel audio data. Then, the obtained audio data is transmitted to the memory 103.

符号化処理部１０６は、メモリ１０３に一時的に記憶された画像データや音声データを読み出して所定の符号化を行い、圧縮画像データ、圧縮音声データ等を生成する。 The encoding processing unit 106 reads out image data and audio data temporarily stored in the memory 103, performs predetermined encoding, and generates compressed image data, compressed audio data, and the like.

そして、制御部１０９は、これらの圧縮画像データ、圧縮音声データを合成し、データストリームを形成し、記録再生部１０７に出力する。 Then, the control unit 109 synthesizes these compressed image data and compressed audio data, forms a data stream, and outputs the data stream to the recording / reproducing unit 107.

記録再生部１０７は、ＵＤＦ、ＦＡＴ等のファイルシステム管理のもとに、データストリームを一つの動画ファイルとして記録媒体１０８に書き込んでいく。 The recording / playback unit 107 writes the data stream to the recording medium 108 as one moving image file under the management of a file system such as UDF or FAT.

以上の動作を撮影中は継続する。 The above operation is continued during shooting.

そして、ユーザが操作部１１０の記録ボタンを操作することにより撮影終了の指示信号が制御部１０９に送信されると、制御部１０９は、撮像装置１００の各ブロックに撮影終了の指示信号を送信し、以下のような動作をさせる。 When the user operates the recording button of the operation unit 110 and a shooting end instruction signal is transmitted to the control unit 109, the control unit 109 transmits a shooting end instruction signal to each block of the imaging apparatus 100. The following operations are performed.

撮像部１０１、音声入力部１０２は、それぞれ画像データ、音声データの生成を停止する。 The imaging unit 101 and the audio input unit 102 stop generating image data and audio data, respectively.

符号化処理部１０６は、メモリに記憶されている残りの画像データと音声データとを読出して所定の符号化を行い、圧縮画像データ、圧縮音声データ等を生成し終えたら動作を停止する。 The encoding processing unit 106 reads the remaining image data and audio data stored in the memory, performs predetermined encoding, and stops the operation when generation of compressed image data, compressed audio data, and the like is completed.

そして、制御部１０９は、これらの最後の圧縮画像データ、圧縮音声データを合成し、データストリームを形成し、記録再生部１０７に出力する。 Then, the control unit 109 synthesizes these last compressed image data and compressed audio data, forms a data stream, and outputs the data stream to the recording / reproducing unit 107.

記録再生部１０７は、ＵＤＦ、ＦＡＴ等のファイルシステム管理のもとに、データストリームを一つの動画ファイルとして記録媒体１０８に書き込んでいく。そして、データストリームの供給が停止したら、動画ファイルを完成させて、記録動作を停止させる。 The recording / playback unit 107 writes the data stream to the recording medium 108 as one moving image file under the management of a file system such as UDF or FAT. When the supply of the data stream is stopped, the moving image file is completed and the recording operation is stopped.

制御部１０９は、記録動作が停止すると、撮影待機状態に移行させるように制御信号を撮像装置１００の各ブロックに送信して、撮影待機状態に戻る。 When the recording operation stops, the control unit 109 transmits a control signal to each block of the imaging apparatus 100 so as to shift to the shooting standby state, and returns to the shooting standby state.

次に、再生モードでは、制御部１０９は、再生状態に移行させるように制御信号を撮像装置１００の各ブロックに送信し、以下のような動作をさせる。 Next, in the playback mode, the control unit 109 transmits a control signal to each block of the imaging apparatus 100 so as to shift to the playback state, and performs the following operation.

記録媒体１０８に記録された圧縮画像データと圧縮音声データとからなる動画ファイルを記録再生部１０７が読出して、読出された圧縮画像データ、圧縮音声データは、符号化処理部１０６に送る。符号化処理部１０６は、圧縮画像データ、圧縮音声データを復号してそれぞれ、表示制御部１０４、音声出力部１１１に送信する。表示制御部１０４は、復号された画像データを表示部１０５に表示させる。音声出力部１１１は、復号された音声データを内蔵または、取付けられた外部スピーカから出力させる。 The recording / playback unit 107 reads a moving image file composed of compressed image data and compressed audio data recorded on the recording medium 108, and sends the read compressed image data and compressed audio data to the encoding processing unit 106. The encoding processing unit 106 decodes the compressed image data and the compressed audio data and transmits them to the display control unit 104 and the audio output unit 111, respectively. The display control unit 104 causes the display unit 105 to display the decoded image data. The audio output unit 111 outputs the decoded audio data from an external speaker built in or attached.

本実施例の撮像装置は以上のように、画像、音声の記録再生を行う。 As described above, the image pickup apparatus of the present embodiment records and reproduces images and sounds.

ここで、音声入力部１０２において、動画撮影中に行われる動作について、図２を用いて説明する。 Here, the operation performed during moving image shooting in the voice input unit 102 will be described with reference to FIG.

図２において、マイクユニット２０１は音声を集音し、音声振動を電気信号に変換し、音声信号を得るもので、先述したように、撮像装置に内蔵されていても、撮像装置の音声入力端子に接続された外付けマイクユニットでも良い。本実施例においては、マイクユニット２０１は、２つのマイクエレメントからなる例について説明するが、マイクエレメントの数はいくつでも良い。ＡＤ変換部２０２はマイクユニット２０１により得られた音声信号をアナログ信号からデジタル信号に変換する。ステレオ変換部２０３は、マイクユニット２０１により得られた音声信号をステレオ音声データに変換する。２つの音声信号からステレオ音声信号を生成する方法は、既存の技術を用いるため、本実施例では説明を省略する。風雑音低減部２０４は、装置の周囲の風の影響で発生する低周波数成分（例えば２ｋＨｚ程度の周波数以下）の雑音（風雑音）を低減する。オートレベルコントローラ（以降、ＡＬＣ）２０５Ｌ、２０５Ｒは、入力された音声信号のレベルを適正なレベルに保つ機能を有している。ＡＬＣ２０５Ｌ／２０５Ｒでは、入力された音声信号のレベルが低いときには音声信号に対する増幅率を増加させ、音声信号のレベルが高いときには増幅率を低下させるようにしている。 In FIG. 2, a microphone unit 201 collects sound and converts sound vibration into an electric signal to obtain a sound signal. As described above, even if the microphone unit 201 is built in the image capturing apparatus, the sound input terminal of the image capturing apparatus An external microphone unit connected to may be used. In this embodiment, an example in which the microphone unit 201 includes two microphone elements will be described, but any number of microphone elements may be used. The AD converter 202 converts the audio signal obtained by the microphone unit 201 from an analog signal to a digital signal. The stereo conversion unit 203 converts the audio signal obtained by the microphone unit 201 into stereo audio data. Since a method for generating a stereo audio signal from two audio signals uses an existing technique, the description thereof is omitted in this embodiment. The wind noise reduction unit 204 reduces noise (wind noise) of low frequency components (for example, a frequency of about 2 kHz or less) generated due to the influence of wind around the apparatus. The auto level controllers (hereinafter, ALC) 205L and 205R have a function of keeping the level of the input audio signal at an appropriate level. In the ALC 205L / 205R, the amplification factor for the audio signal is increased when the level of the input audio signal is low, and the amplification factor is decreased when the level of the audio signal is high.

風雑音除去部２０４は、マイクユニットにより得られた２つの音声信号が、風の影響により発生する雑音については、低域成分（例えば１ｋＨｚ以下）において相関性が小さくなるがあることを利用して風雑音除去をしている。そのために、ステレオ変換部２０３により得られたステレオ音声Ｌｃｈ、Ｒｃｈの音声に対し、処理を行う。 The wind noise removing unit 204 uses the fact that the two audio signals obtained by the microphone unit have a low correlation in the low frequency component (for example, 1 kHz or less) with respect to noise generated by the influence of wind. Wind noise is removed. For this purpose, processing is performed on stereo audio Lch and Rch audio obtained by the stereo conversion unit 203.

具体的には、まず、減算部２０６により、Ｌｃｈ−Ｒｃｈの信号（差信号）を出力する。一般に、風雑音は、複数のマイクから相関性のない音声信号が出力されることが知られている。そのため、装置周辺の風雑音のレベルが小さいときにはこの差信号の値が小さく、装置周辺の風雑音のレベルが大きいときにはこの差信号の値が大きくなる。本実施例では、差信号の値に基づいて、風雑音（雑音成分）のレベルが大きいか、小さいかを検出するようにしている。風量検出部２０７では、差信号の絶対値が大きければ風雑音のレベルが大きい（風量が多い、風速が速い、風圧が大きい）ことを示す信号を、差信号の値が小さければ風雑音のレベルが小さい（風量が少ない、風速が遅い、風圧が小さい）ことを示す信号を風量情報として出力する。 Specifically, first, the subtractor 206 outputs an Lch-Rch signal (difference signal). In general, it is known that wind noise is output from a plurality of microphones as an uncorrelated voice signal. Therefore, the value of the difference signal is small when the wind noise level around the apparatus is small, and the value of the difference signal is large when the wind noise level around the apparatus is large. In the present embodiment, whether the level of wind noise (noise component) is large or small is detected based on the value of the difference signal. In the air volume detection unit 207, if the absolute value of the difference signal is large, the level of the wind noise is large (the volume of air is large, the wind speed is fast, the wind pressure is large), and if the value of the difference signal is small, the level of the wind noise is Is output as air volume information indicating that the air volume is small (the air volume is low, the wind speed is slow, and the wind pressure is low).

本実施例では、、風が吹いていないような状況で、差信号の値が極端に小さければ、風雑音のレベルを示す風量情報を出力しないようにしてもよい。本実施例では、風雑音のレベルが大きいか小さいかを示す２段階の信号を出力する例について説明するが、当然、２段階以上の信号であってもよい。 In this embodiment, if the value of the difference signal is extremely small in a situation where the wind is not blowing, the air volume information indicating the wind noise level may not be output. In the present embodiment, an example of outputting a two-stage signal indicating whether the level of wind noise is large or small will be described.

つづいて、減算部２０６により出力された差信号の低域成分を減衰するためのハイパスフィルタ（ＨＰＦ）２０８に入力し、差信号の低域成分の信号を減衰させた信号を出力する。このＨＰＦ２０８は、風量検出部２０７により検出された風量情報の値が風雑音のレベルが大きいことを示していれば、減衰させる低域成分のカットオフ周波数を例えば２ｋＨｚにする。また、風量情報の値が風雑音のレベルが小さいことを示していれば、カットオフ周波数を５００Ｈｚにする。または、風量情報の値が風雑音のレベルが大いことを示していれば低域成分の減衰量を多くし、風量情報の値が風雑音のレベルが小さいことを示していれば低域成分の減衰量を少なくするようにしてもよい。すなわち、ＨＰＦ２０８は、風量情報の値に応じて、減衰特性を変更するように構成されている。当然のことながら、ＨＰＦ２０８は、ハイパスフィルタでなく、バンドパスフィルタなどの他のフィルタで構成されても良い。 Subsequently, the signal is input to a high-pass filter (HPF) 208 for attenuating the low-frequency component of the difference signal output by the subtracting unit 206, and a signal obtained by attenuating the low-frequency component signal of the difference signal is output. If the value of the air volume information detected by the air volume detector 207 indicates that the level of wind noise is high, the HPF 208 sets the cutoff frequency of the low-frequency component to be attenuated to 2 kHz, for example. Further, if the value of the air volume information indicates that the level of wind noise is small, the cutoff frequency is set to 500 Hz. Or, if the value of the airflow information indicates that the level of wind noise is high, increase the attenuation of the low frequency component, and if the value of the airflow information indicates that the level of wind noise is low, the low frequency component The amount of attenuation may be reduced. That is, the HPF 208 is configured to change the attenuation characteristic according to the value of the air volume information. As a matter of course, the HPF 208 may be configured by other filters such as a band pass filter instead of the high pass filter.

このようにして得られた差信号の低域成分を減衰させた信号を、加算部２０９により、Ｌｃｈ＋Ｒｃｈの信号（和信号）と加算、または減算することにより、Ｌｃｈ、Ｒｃｈの音声を得ることができる。 The adder 209 adds or subtracts the signal obtained by attenuating the low-frequency component of the difference signal thus obtained with the Lch + Rch signal (sum signal), thereby obtaining Lch and Rch audio. it can.

具体的には、加算部２１０で、差信号の低域成分を減衰させた信号と、和信号とを加算することにより、低域成分において、ＬｃｈとＲｃｈとで位相の異なる成分を低減したＬｃｈの音声信号が生成される。そして、減算部２１１で、和信号から、差信号の低域成分を減衰させた信号を減算することにより、低域成分において、ＬｃｈとＲｃｈとで位相の異なる成分を低減したＲｃｈの音声信号が生成される。この処理により、装置周囲の風に起因する雑音（風雑音）を低減したＬｃｈ、Ｒｃｈの音声信号を生成することができる。 Specifically, the addition unit 210 adds the signal obtained by attenuating the low-frequency component of the difference signal and the sum signal, thereby reducing the Lch component having a phase difference between Lch and Rch in the low-frequency component. Audio signals are generated. Then, the subtracting unit 211 subtracts a signal obtained by attenuating the low frequency component of the difference signal from the sum signal, whereby an Rch audio signal in which a component having a different phase between Lch and Rch is reduced in the low frequency component. Generated. With this process, it is possible to generate Lch and Rch audio signals with reduced noise (wind noise) caused by wind around the apparatus.

ここで、さらに、ＡＬＣ２０５Ｌ／２０５Ｒについて説明する。 Here, the ALC 205L / 205R will be further described.

本実施例のＡＬＣ２０５Ｌ／２０５Ｒでは、入力された音声信号のレベルが低いときには音声信号に対する増幅率を増加させ、音声信号のレベルが高いときには増幅率を低下させるようにしている。 In the ALC 205L / 205R of this embodiment, the amplification factor for the audio signal is increased when the level of the input audio signal is low, and the amplification factor is decreased when the level of the audio signal is high.

ここでは、Ｌｃｈの音声信号について説明するが、Ｒｃｈの音声信号についても同様の処理を行う。Ｒｃｈ用にも同様の回路が備えられている。 Here, the Lch audio signal will be described, but the same processing is performed for the Rch audio signal. A similar circuit is provided for the Rch.

まず、風雑音のレベルがほぼゼロである場合の動作について説明する。 First, the operation when the wind noise level is almost zero will be described.

ＡＬＣ２０５に入力されたＬｃｈの音声信号は、遅延部２１２で数ｍｓ遅延され、振幅制御部２１３で、音声信号の振幅を増幅または減衰される。振幅制御部２１３における増幅率は、振幅検出部２１４により、ＡＬＣ２０５に入力された音声信号の振幅の大きさを検出し、検出された値に応じた検出結果を増幅率設定部２１５に送信することで設定される。 The Lch audio signal input to the ALC 205 is delayed by several ms by the delay unit 212, and the amplitude of the audio signal is amplified or attenuated by the amplitude control unit 213. As for the amplification factor in the amplitude control unit 213, the amplitude detection unit 214 detects the amplitude of the audio signal input to the ALC 205, and transmits a detection result corresponding to the detected value to the amplification factor setting unit 215. Set by.

本実施例の振幅検出部２１４（音声レベル検出部）では、音声信号（デジタル信号）の１サンプル毎に音声レベルを検出していく。そして、検出結果として、ｍサンプルのレベルが、ｍ−１サンプルのレベルより高いレベルであれば、ｍサンプルのレベルを検出結果として出力する動作を繰り返す。一方で、ｎサンプルのレベルが、ｍ−１サンプルのレベルより低いレベルであれば、ｍ−１サンプルのレベルから所定の値Ａを引いた値をｎサンプルのレベルとし、検出結果として出力する動作を繰り返している。言い換えれば、本実施例の振幅検出部２１４は、音声信号のピークをむかえるまでは音声信号の音声レベルに追従したレベルを検出結果として出力する。そして、音声信号のピーク以降は、前回の検出結果が音声信号のレベルより大きくなるまで所定の値Ａを引き続けた値が検出結果として出力されることになる。（ただしｎは１以上の整数）。 The amplitude detection unit 214 (audio level detection unit) of this embodiment detects the audio level for each sample of the audio signal (digital signal). If the level of m samples is higher than the level of m−1 samples as the detection result, the operation of outputting the level of m samples as the detection result is repeated. On the other hand, if the level of n samples is lower than the level of m−1 samples, the value obtained by subtracting a predetermined value A from the level of m−1 samples is set to the level of n samples and is output as a detection result. Is repeated. In other words, the amplitude detection unit 214 of the present embodiment outputs a level that follows the audio level of the audio signal as a detection result until the peak of the audio signal is changed. Then, after the peak of the audio signal, a value obtained by continuing to subtract the predetermined value A until the previous detection result becomes larger than the level of the audio signal is output as the detection result. (Where n is an integer of 1 or more).

増幅率設定部２１５は、振幅検出部２１４により出力された検出結果に応じて振幅調整部２１３での音声信号の増幅率を設定する。増幅率設定部２１５における、振幅検出部２１４の検出結果と、振幅調整部２１３での音声信号の増幅率の関係は図３に示すような関係となる。図３において、横軸が入力された振幅検出部２１４の検出結果であり、縦軸が増幅率である。図３からわかるように振幅検出部２１４の検出結果のレベルがＹ以上Ｘ以下のときは、増幅率Ｇｎｏｒを振幅調整部２１３に設定し、音声信号を増幅させる。また、検出結果のレベルがＸ以上になると、検出結果のレベルに応じて増幅率をＧｎｏｒから徐々に下げた増幅率とし、検出結果のレベルが所定の高い値以上になったときには増幅率をＧｌｉｍに設定する。また、検出結果のレベルがＹ以下になると、検出結果のレベルに応じて増幅率をＧｎｏｒから徐々に上げていき、検出結果のレベルが所定の低い値以下になると、増幅率をＧｒｅｃに設定する。ここで、Ｇｒｅｃ＞Ｇｎｏｒ＞Ｇｌｉｍである。すなわち、先述したように、振幅検出部２１４から出力された、検出結果のレベルが低いときには音声信号の増幅率を増加させ、検出結果のレベルが高いときには増幅率を低下させるようにしているのである。本実施例では、増幅率設定部２１５における、振幅検出部２１４の検出結果と、振幅調整部２１３での音声信号の増幅率の関係は図３に示すような関係となる例について説明した。しかし、例えば、検出結果のレベルにリニアに応答するような関係でも良い。例えば、Ｇｌｉｍが設定される検出結果のレベルからＧｒｅｃが設定される検出結果のレベルまで、検出結果のレベルが下がるにつれて徐々に増幅率が大きくなるような関係でも良い。また、本実施例では検出結果のレベルがＹ以下になると、増幅率がＧｎｏｒから徐々にＧｒｅｃに上げていく用にしたが、上げなくても良い。 The amplification factor setting unit 215 sets the amplification factor of the audio signal in the amplitude adjustment unit 213 according to the detection result output from the amplitude detection unit 214. The relationship between the detection result of the amplitude detection unit 214 in the amplification factor setting unit 215 and the amplification factor of the audio signal in the amplitude adjustment unit 213 is as shown in FIG. In FIG. 3, the horizontal axis represents the detection result of the amplitude detection unit 214 that is input, and the vertical axis represents the amplification factor. As can be seen from FIG. 3, when the level of the detection result of the amplitude detection unit 214 is Y or more and X or less, the amplification factor Gnor is set in the amplitude adjustment unit 213 to amplify the audio signal. Further, when the detection result level becomes X or more, the amplification factor is gradually decreased from Gnor according to the detection result level, and when the detection result level becomes a predetermined high value or more, the amplification factor is set to Glim. Set to. Further, when the detection result level becomes Y or less, the amplification factor is gradually increased from Gnor according to the detection result level, and when the detection result level becomes a predetermined low value or less, the amplification factor is set to Grec. . Here, Grec> Gnor> Glim. That is, as described above, when the detection result level output from the amplitude detection unit 214 is low, the amplification factor of the audio signal is increased, and when the detection result level is high, the amplification factor is decreased. . In the present embodiment, the example in which the relationship between the detection result of the amplitude detection unit 214 in the amplification factor setting unit 215 and the amplification factor of the audio signal in the amplitude adjustment unit 213 is as shown in FIG. However, for example, a relationship that linearly responds to the level of the detection result may be used. For example, the relationship may be such that the amplification factor gradually increases as the detection result level decreases from the detection result level at which Glim is set to the detection result level at which Grec is set. In this embodiment, when the level of the detection result becomes Y or less, the amplification factor is gradually increased from Gnor to Grec. However, it may not be increased.

ＡＬＣ２０５は、音声のレベルが大きくなりすぎてしまい、音声の再現性が無くなってしまうことを防止するように設計されている。そのため、音声信号が大きくなり続ける場合には、入力された音声信号のレベルに応じた増幅率がすぐに反映されるように、振幅検出部２１４は、先述したような構成としている。すなわち、音声信号のピークをむかえるまでは音声信号に追従したレベルを検出結果として出力するようにしているのである。こうすることで、増幅率設定部２１５では、音声信号のピークを迎えるまでは、入力される音声信号のレベルに応じて、増幅率を設定することになるのである。 The ALC 205 is designed to prevent the sound level from becoming excessively high and the sound reproducibility from being lost. Therefore, when the audio signal continues to increase, the amplitude detection unit 214 is configured as described above so that the amplification factor according to the level of the input audio signal is immediately reflected. That is, the level following the audio signal is output as a detection result until the peak of the audio signal is changed. By doing so, the amplification factor setting unit 215 sets the amplification factor according to the level of the input audio signal until the peak of the audio signal is reached.

また、ＡＬＣ２０５は、音声のレベルが下がる場合には、徐々に増幅率が増加するようにしている。これは、増幅率が音声のレベルに追従して変化すると、音声レベルは一定になるが、抑揚が失われてしまいかえって聞きづらくなってしまうからである。そのため、振幅検出部２１４は、音声信号のピーク以降は、前回の検出結果が音声信号のレベルより大きくなるまで所定の値Ａを引き続けた値が検出結果として出力されるようにしている。こうすることで、増幅率設定部２１５では、音声信号のピーク以降は、振幅検出部２１４の検出結果のレベルに応じて増幅率を設定することになる。すなわち、検出結果は所定の値Ａを引き続けた値が出力されるので、検出結果がＸ以上であれば、Ｇｎｏｒまで徐々に増幅率が増えていくことになり、検出結果がＹ以下になると、Ｇｎｏｒから徐々に増幅率が増えていくことになる。 In addition, the ALC 205 gradually increases the amplification factor when the sound level decreases. This is because if the amplification factor changes following the sound level, the sound level becomes constant, but the inflection is lost, making it difficult to hear. Therefore, after the peak of the audio signal, the amplitude detection unit 214 outputs a value obtained by continuously pulling the predetermined value A until the previous detection result becomes larger than the level of the audio signal. By doing so, the amplification factor setting unit 215 sets the amplification factor according to the level of the detection result of the amplitude detection unit 214 after the peak of the audio signal. That is, since the detection result is a value obtained by continuously pulling the predetermined value A, if the detection result is X or more, the amplification factor gradually increases to Gnor, and when the detection result becomes Y or less. , The amplification factor gradually increases from Gnor.

この様な構成により、例えば、増幅率設定部２１５によりＧｌｉｍが設定された状態からＧｎｏｒまで戻るには、０．４秒程度の時間がかかるようにしている。 With such a configuration, for example, it takes about 0.4 seconds to return from the state where Glim is set by the amplification factor setting unit 215 to Gnor.

次に、本実施例の撮像装置における、風雑音の発生している場合のＡＬＣ２０５の動作について説明する。 Next, the operation of the ALC 205 when wind noise occurs in the imaging apparatus of the present embodiment will be described.

本実施例の撮像装置は、音声信号のピーク以降の増幅率の変化速度が、風雑音のレベルが大きいときの方が、風雑音の小さいときよりも緩やかになるようにしている。例えば、風雑音のレベルが大きいときは、増幅率設定部２１５によりＧｌｉｍが設定された状態からＧｎｏｒまで戻るには、２．０秒程度の時間がかかるようにしている。そして、風雑音のレベルが小さいときは、例えば、増幅率設定部２１５によりＧｌｉｍが設定された状態からＧｎｏｒまで戻るには、１．２秒程度の時間がかかるようにしている。 In the image pickup apparatus of the present embodiment, the rate of change of the amplification factor after the peak of the audio signal is made gentler when the wind noise level is large than when the wind noise is small. For example, when the wind noise level is high, it takes about 2.0 seconds to return from the state where Glim is set by the amplification factor setting unit 215 to Gnor. When the wind noise level is low, for example, it takes about 1.2 seconds to return from the state where Glim is set by the amplification factor setting unit 215 to Gnor.

この様にすることにより、風雑音のレベルが大きいときには、風雑音によって増幅率が不安定にならないようにすることができるのである。 In this way, when the wind noise level is high, it is possible to prevent the amplification factor from becoming unstable due to the wind noise.

以下、具体的な動作について説明する。 A specific operation will be described below.

遅延部２１２、振幅調整部２１３は、風雑音のレベルがほぼゼロである場合と同様に動作する。振幅検出部２１４も、風雑音のレベルがほぼゼロである場合と同様に動作するが、一部異なる動作をする。音声信号のピークをむかえるまでは音声信号に追従したレベルを検出結果として出力する点では同様である。しかし、風雑音のレベルが小さい場合には、音声信号のピーク以降は、前回の検出結果が音声信号のレベルより大きくなるまで所定の値Ｂを引き続けた値が検出結果として出力される。また、風雑音のレベルが大きい場合には、音声信号のピーク以降は、前回の検出結果が音声信号のレベルより大きくなるまで所定の値Ｃを引き続けた値が検出結果として出力される。ただし、Ａ＞Ｂ＞Ｃである。 The delay unit 212 and the amplitude adjustment unit 213 operate in the same manner as when the wind noise level is almost zero. The amplitude detection unit 214 operates in the same manner as when the wind noise level is almost zero, but operates partially differently. The same is true in that the level following the audio signal is output as a detection result until the peak of the audio signal is changed. However, when the wind noise level is low, after the peak of the audio signal, a value obtained by continuing to pull the predetermined value B until the previous detection result becomes larger than the level of the audio signal is output as the detection result. When the wind noise level is high, after the peak of the audio signal, a value obtained by continuing to subtract the predetermined value C until the previous detection result becomes larger than the level of the audio signal is output as the detection result. However, A> B> C.

この様にすることにより、音声信号のピーク以降、風雑音のレベルが大きいときには、風雑音のレベルが低いときに比べて、検出結果のレベルが下がっていく時間が遅くなることになる。これを示しているのが図４である。図４において、横軸は時間、縦軸は音声のレベル（検出結果のレベル）を示している。図４において、破線は、入力された音声信号の絶対値を示している。図４（ａ）は、風雑音のレベルが低いとき（Ｂを引く場合）の振幅検出部２１４から出力される検出結果の値を示すもので、４０１が、検出結果の検出レベルである。図４（ｂ）は、風雑音のレベルが大きいとき（Ｃを引く場合）の振幅検出部２１４から出力される検出結果の値を示すもので、４０２が検出結果の検出レベルである。 By doing so, when the wind noise level is high after the peak of the audio signal, the time for the detection result level to be lowered is delayed compared to when the wind noise level is low. This is shown in FIG. In FIG. 4, the horizontal axis represents time, and the vertical axis represents the audio level (detection result level). In FIG. 4, the broken line indicates the absolute value of the input audio signal. FIG. 4A shows the value of the detection result output from the amplitude detection unit 214 when the wind noise level is low (when B is subtracted), and 401 is the detection level of the detection result. FIG. 4B shows the detection result value output from the amplitude detection unit 214 when the wind noise level is high (when C is subtracted), and 402 is the detection level of the detection result.

４０１、４０２からわかるように、風雑音のレベルが高いときは、風雑音のレベルが低いときより検出結果の値の低下する時間が長くなる。そして、増幅率設定部２１５は、この検出結果に応じて、図３に示す対応関係に従った増幅率を振幅調整部２１３に設定する。 As can be seen from 401 and 402, when the wind noise level is high, the time during which the detection result value decreases is longer than when the wind noise level is low. Then, the amplification factor setting unit 215 sets the amplification factor according to the correspondence shown in FIG. 3 in the amplitude adjustment unit 213 according to the detection result.

以上のような制御によれば、本実施例の撮像装置は、風雑音のレベルが高いときは、風雑音のレベルが低いときより、増幅率の変化する速度が遅く（変動が緩やかに）なる。従って、風雑音のレベルが高いときは、風雑音のレベルの増減によって増幅率が不安定に変動してしまうことを防止し、安定的な増幅率を設定することができるのである。すなわち、風雑音のレベルが大きいときに、人の声などの音声のレベルが不安定になりにくくした音声信号を得ることができる。 According to the control as described above, in the imaging apparatus of the present embodiment, when the wind noise level is high, the rate at which the amplification factor changes is slower (the fluctuation is gradual) than when the wind noise level is low. . Therefore, when the wind noise level is high, it is possible to prevent the amplification factor from fluctuating unstable due to the increase or decrease in the wind noise level, and to set a stable amplification factor. That is, when the level of wind noise is high, it is possible to obtain a voice signal in which the level of voice such as a human voice is less likely to become unstable.

本実施例では、振幅検出部２１４で、音声信号のレベルから引く値を調整することにより、風雑音のレベルが高いときは、風雑音のレベルが低いときより、増幅率の変化する速度が遅く（変動が緩やかに）なるようにしていた。しかし、音声信号のレベルに対して１より小さいの所定の値を乗算するようにして、乗算する値を調整することにより対応しても良い。この場合は、振幅検出部２１４は、音声信号のピークをむかえるまでは音声信号に追従したレベルを検出結果として出力する。そして、風雑音のレベルが小さい場合には、音声信号のピーク以降は、前回の検出結果が音声信号のレベルより大きくなるまで所定の値Ｄ（Ｄ＜１）を乗じ続けた値を検出結果として順次出力する。また、風雑音のレベルが大きい場合には、音声信号のピーク以降は、前回の検出結果が音声信号のレベルより大きくなるまで所定の値Ｅ（Ｄ＜Ｅ＜１）を乗じ続けた値を検出結果として出力する。 In the present embodiment, the amplitude detection unit 214 adjusts the value subtracted from the level of the audio signal, so that when the wind noise level is high, the rate at which the amplification factor changes is slower than when the wind noise level is low. (Slow fluctuations). However, the level of the audio signal may be multiplied by a predetermined value smaller than 1, and the value to be multiplied may be adjusted. In this case, the amplitude detection unit 214 outputs a level following the audio signal as a detection result until the peak of the audio signal is obtained. When the wind noise level is low, after the peak of the audio signal, the detection result is a value obtained by continuously multiplying the predetermined value D (D <1) until the previous detection result becomes larger than the audio signal level. Output sequentially. Also, when the wind noise level is high, after the peak of the audio signal, a value continuously multiplied by a predetermined value E (D <E <1) is detected until the previous detection result becomes larger than the audio signal level. Output as a result.

また、振幅検出部２１４においては、音声信号のレベルをそのまま検出結果として増幅率設定部２１５に出力してもよい。この場合は、増幅率設定部２１５では、増幅率を上げる場合に、風雑音のレベルが高いときは、風雑音のレベルが低いときより、増幅率の変化する速度が遅くなるように設定する。 Further, the amplitude detection unit 214 may output the level of the audio signal as it is to the amplification factor setting unit 215 as a detection result. In this case, when the amplification factor is increased, the amplification factor setting unit 215 sets the rate of change of the amplification factor to be slower when the wind noise level is high than when the wind noise level is low.

本実施例においては、２つの音声信号の差信号に基づいて風雑音のレベルを検出していたが、単に２つの音声信号の低域成分を比較してもよい。また、別途風圧センサ、風速計、レーザとカメラを用いる装置など空気変位を検出できるユニット用いて風に関する情報を検出しても良い。これは、風雑音のレベルが高くなるときは、風圧が高い、風速が速い、風量が多いときであることがほとんどであることを利用している。 In this embodiment, the wind noise level is detected based on the difference signal between the two audio signals, but the low frequency components of the two audio signals may be simply compared. Further, information related to wind may be detected using a unit capable of detecting air displacement such as a wind pressure sensor, an anemometer, or a device using a laser and a camera. This utilizes the fact that the wind noise level is high when the wind pressure is high, the wind speed is fast, and the air volume is large.

また、本実施例においては、撮像装置について説明したが、本実施例の音声入力部１０２にの音声処理は、外部の音声を記録、または入力して出力するするような装置であればどのような装置であっても適用することができる。例えば、ＩＣレコーダ、携帯電話等に適用しても良い。 In this embodiment, the image pickup apparatus has been described. However, the sound processing to the sound input unit 102 according to this embodiment may be any apparatus that records or inputs external sound and outputs the sound. Even a simple device can be applied. For example, you may apply to an IC recorder, a mobile telephone, etc.

また、本実施例では音声が２ｃｈの場合について説明したが、マイクの数を２つ以上として、５．１ｃｈの音声を生成する場合であっても同様とすることができる。また、２つの音声信号の差信号を検出する以外の方法であれば風量（風速、風圧）に応じて、増幅率の変動を緩やかにする。 In this embodiment, the case where the sound is 2ch has been described. However, the same can be applied to the case where 5.1ch sound is generated with two or more microphones. If the method is other than detecting the difference signal between the two audio signals, the variation of the amplification factor is moderated according to the air volume (wind speed, wind pressure).

Claims

An audio signal processing apparatus including a sound collecting unit,
Detecting means for detecting the level of the noise component due to the wind effects of the device around included in the audio signal obtained by said sound collecting means,
Adjusting means for amplifying the audio signal at an amplification factor according to the level of the audio signal;
It said adjusting means lowers the amplification factor level is high in the audio signal, increasing the amplification factor and the level of the audio signal that a low,
The adjustment means, when increasing the amplification factor, slows the rate of change of the amplification factor as the level of the noise component detected by the detection means is higher.

It said adjusting means, chromatic and audio level detecting means for outputting a detection result detected a level of the audio signal, and a determination means for determining the amplification factor according to the detection result output by the audio level detector And
The audio level detecting means, wherein when the level of the audio signal may turn low, as at higher levels of the noise component detected by said detection means, to ensure that the rate of change in the detection result is slow the audio signal processing apparatus according to claim 1, wherein the.

The detecting device, the audio signal processing apparatus according to claim 1 or 2, wherein the detecting the level of the noise component based on the audio signal obtained by a plurality of said sound collecting means.

Said adjusting means determines a voice level detecting means for outputting a detection result by detecting the level of the audio signal at least one sample per, the amplification factor in accordance with a detection result output by the audio level detector Determining means,
The audio level detecting unit, when the audio signal level may turn low, n-1-th sample of the output to continue (n as a result of detection of a value obtained by subtracting from the detection result of the predetermined value n samples An integer greater than or equal to 1),
The audio level detecting means, the smaller the higher the level of said noise component detected by the detecting means, the audio signal according to any one of claims 1 to 3, characterized in that to reduce the predetermined value Processing equipment.

5. The audio signal processing apparatus according to claim 4 , wherein the audio level detection unit increases the predetermined value as the level of the noise component detected by the detection unit is lower.

Said adjusting means determines a voice level detecting means for outputting a detection result by detecting the level of the audio signal at least one sample per, the amplification factor in accordance with a detection result output by the audio level detector Determining means,
When the level of the audio signal decreases, the audio level detection means continues to output a value obtained by multiplying the detection result of the (n-1) th sample by a predetermined value smaller than 1 as the detection result of n samples (however, n Is an integer of 1 or more),
The audio level detecting means, the smaller the higher the level of said noise component detected by the detecting means, the audio signal according to any one of claims 1 to 3, characterized in that to increase the predetermined value Processing equipment.

The audio signal processing apparatus according to claim 6 , wherein the audio level detection unit decreases the predetermined value as the level of the noise component detected by the detection unit is lower.

The audio signal processing apparatus according to claim 1, wherein the audio signal processing apparatus is a mobile phone.

A method for controlling an audio signal processing device including sound collection means,
A detection step of detecting a level of a noise component due to an influence of wind around the device included in the sound signal obtained by the sound collecting means;
An adjustment step of amplifying the audio signal at an amplification factor according to the level of the audio signal,
In the adjustment step, when the level of the audio signal becomes high, the amplification factor is lowered, and when the level of the audio signal becomes low, the amplification factor is raised,
In the adjustment step, when the amplification factor is increased, the rate of change of the amplification factor is decreased as the level of the noise component detected by the detection unit is higher. Method.

The adjustment step includes
An audio level detection step of detecting the level of the audio signal and outputting a detection result;
Determining the amplification factor according to the detection result of the audio level detection step,
In the sound level detection step, when the level of the sound signal becomes low, the change rate of the detection result becomes slower as the level of the noise component detected by the detection means becomes higher. The method for controlling an audio signal processing device according to claim 9, wherein the method is a control method.

11. The method of controlling an audio signal processing apparatus according to claim 9, wherein in the detecting step, the level of the noise component is detected based on audio signals obtained by a plurality of the sound collecting means.

The adjustment step includes
An audio level detection step of detecting the level of the audio signal at least for each sample and outputting a detection result;
Determining the amplification factor according to the detection result of the audio level detection step,
In the audio level detection step, when the level of the audio signal becomes low, a value obtained by subtracting a predetermined value from the detection result of the (n-1) th sample is continuously output as the detection result of n samples (where n is 1). An integer greater than or equal to)
The audio signal according to any one of claims 9 to 11, wherein the audio level detection step decreases the predetermined value as the level of the noise component detected in the detection step is higher. A method for controlling a processing apparatus.

13. The method of controlling an audio signal processing device according to claim 12, wherein in the audio level detection step, the predetermined value is increased as the level of the noise component detected in the detection step is lower.

The adjustment step includes
An audio level detection step of detecting the level of the audio signal at least for each sample and outputting a detection result;
Determining the amplification factor according to the detection result output by the audio level detection step,
In the audio level detection step, when the level of the audio signal decreases, a value obtained by multiplying the detection result of the (n−1) th sample by a predetermined value smaller than 1 is continuously output as the detection result of n samples (however, n Is an integer of 1 or more),
12. The audio signal according to claim 9, wherein the audio level detection step increases the predetermined value as the level of the noise component detected in the detection step is higher. A method for controlling a processing apparatus.

15. The method of controlling an audio signal processing device according to claim 14, wherein the audio level detection step decreases the predetermined value as the level of the noise component detected by the detection means is lower.

16. The method of controlling an audio signal processing device according to claim 9, wherein the audio signal processing device is a mobile phone.