JP6497878B2

JP6497878B2 - Electronic device and control method

Info

Publication number: JP6497878B2
Application number: JP2014180497A
Authority: JP
Inventors: 悠貴辻本; 啓太園田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2014-09-04
Filing date: 2014-09-04
Publication date: 2019-04-10
Anticipated expiration: 2034-09-04
Also published as: JP2016053697A

Description

The present invention relates to a technique for reducing noise.

Recent imaging apparatuses, typified by digital cameras, can record not only still images but also moving images with sound. That is, a moving image obtained by capturing a subject continuously over time can be recorded on a storage medium such as a memory card together with data of the surrounding sound. Hereinafter, sound that is the intended target of recording, such as the sound around the subject, is referred to as “environmental sound”.

An imaging apparatus can also focus on or zoom in on a subject during imaging by moving an optical lens. However, driving the optical lens generates driving sound. As the housings of recent digital cameras have become smaller, the distance between the source of the driving sound and the microphone has become shorter. The microphone of a digital camera is therefore likely to pick up the driving sound, which is then superimposed on the environmental sound as noise.

A technique called the “spectral subtraction method” has long been known for reducing such noise (for example, Patent Document 1). This method is briefly described with reference to FIG. 21, which shows part of the block configuration of a digital camera. The apparatus includes a control unit 2109 that controls the entire apparatus, an operation unit 2110 that receives instructions from a user, an optical lens, a lens control unit, and the like. It further includes an imaging unit 2101 that captures images to obtain image data, a microphone 2205, an audio input unit 2102 that acquires sound as audio data, and a memory 2103 that stores image data and audio data. Normally, the image data and audio data stored in the memory 2103 are encoded and stored on a storage medium as encoded data.

While a moving image with sound is being recorded, when the control unit 2109 detects a user instruction such as zoom-in or zoom-out via the operation unit 2110, it controls the imaging unit 2101 so as to change the position of the optical lens. In response, the imaging unit 2101 drives a drive source such as a motor to change the position of the optical lens. At this time the microphone 2205 picks up the driving sound of the optical lens, so the acoustic data obtained from the microphone 2205 is a mixture of the environmental sound and the driving sound (noise). The audio input unit 2102 shown in the figure has a function of reducing this driving sound.

The sound detected by the microphone 2205 is converted by an ADC (analog-to-digital converter) 2206 into 16-bit digital data (hereinafter referred to as acoustic data) at a sampling rate of, for example, 48 kHz. An FFT 2207 applies FFT (fast Fourier transform) processing to the time-series acoustic data (for example, 1024 samples) to convert it into per-frequency data (an amplitude spectrum). A noise reduction unit 2200 performs noise reduction by subtracting the per-frequency noise data from the data at each frequency. For this purpose, the noise reduction unit 2200 has a profile storage unit 2210, which stores in advance the amplitude data of the noise at each frequency (a noise profile), and an amplitude spectrum subtraction unit 2211. The amplitude spectrum subtraction unit 2211 subtracts, from the amplitude spectrum, the noise amplitude data for each frequency recorded in the profile storage unit 2210. The amplitude spectrum from which the noise has been subtracted is then inverse-FFT processed by an IFFT 2214 and returned to time-series acoustic data. After that, an audio processing unit 2216 performs various kinds of processing on the acoustic data. Finally, an ALC (auto level controller) 2217 adjusts the level of the acoustic data, and the result is stored in the memory 2103.
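The subtraction step of this conventional pipeline can be sketched as follows. This is a minimal illustration, not the circuit of FIG. 21: the function name `spectral_subtract` is hypothetical, and clamping negative results to zero is an assumption, since the description only says that the stored noise amplitude is subtracted at each frequency.

```python
def spectral_subtract(amplitudes, noise_profile):
    """Subtract a stored noise profile from one frame's amplitude spectrum.

    Both arguments are lists holding one amplitude value per frequency bin.
    """
    if len(amplitudes) != len(noise_profile):
        raise ValueError("spectrum and profile must have the same number of bins")
    # Assumption: a negative amplitude has no meaning, so clamp at zero.
    return [max(a - n, 0.0) for a, n in zip(amplitudes, noise_profile)]


# Toy 4-bin example (a real frame would have far more bins).
frame = [5.0, 3.0, 0.5, 2.0]
profile = [1.0, 1.0, 1.0, 0.0]
cleaned = spectral_subtract(frame, profile)
```

The quality of the result depends entirely on how well `profile` matches the noise actually present in `frame`.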

The above is an outline of the “spectral subtraction method”. As this description implies, the noise profile stored in advance in the profile storage unit 2210 should ideally represent the driving sound actually generated by the imaging unit 2101.

Patent Document 1: JP 2006-279185 A

When the technique of Patent Document 1 is applied to an imaging apparatus, the following factors cause the driving sound actually generated in the apparatus to deviate from the driving sound represented by the noise profile stored in advance.
・Individual differences in the noise generated by drive parts such as motors and gears
・Differences in noise due to how the imaging apparatus is assembled
・Differences in noise depending on the zoom position
・Wear and aging of parts
・Temperature conditions during operation
・Attitude of the imaging apparatus
・Replacement of parts of the drive unit or the recording unit after shipment, for example for repair
For this reason, it is difficult to reduce noise with a single noise profile stored in advance. In addition, when recording left and right channels, as in stereo sound, changes in the environmental sound and the driving sound change how the driving sound affects the microphone for each channel.

In view of these problems, an object of the present invention is to reduce noise with high accuracy in the sound of each channel when recording left and right channel sound, even if the environmental sound or the driving sound changes.

An electronic device according to the present invention comprises:
a first microphone;
a second microphone;
input means for inputting a drive instruction for driving drive means;
first conversion means for acquiring audio spectrum data by Fourier-transforming audio data obtained from the first microphone;
second conversion means for acquiring audio spectrum data by Fourier-transforming audio data obtained from the second microphone;
first creation means for creating first noise spectrum data representing drive noise of the drive means, based on the difference between a first average value and a second average value, the first average value being the average of the audio spectrum data obtained by converting, with the first conversion means, audio data obtained from the first microphone before the drive instruction is input, and the second average value being the average of the audio spectrum data obtained by converting, with the first conversion means, audio data obtained from the first microphone between the input of the drive instruction and the elapse of a predetermined stabilization period required for the audio spectrum data to stabilize;
second creation means for creating second noise spectrum data representing the drive noise of the drive means, based on the difference between a third average value and a fourth average value, the third average value being the average of the audio spectrum data obtained by converting, with the second conversion means, audio data obtained from the second microphone before the drive instruction is input, and the fourth average value being the average of the audio spectrum data obtained by converting, with the second conversion means, audio data obtained from the second microphone between the input of the drive instruction and the elapse of the predetermined stabilization period required for the audio spectrum data to stabilize; and
control means for performing control such that, when a value indicating the ratio of the difference between the audio spectrum data obtained from the first conversion means and the audio spectrum data obtained from the second conversion means to the sum of the audio spectrum data obtained from the first conversion means and the audio spectrum data obtained from the second conversion means is equal to or less than a predetermined value, the first noise spectrum data is used to reduce the drive noise, and, when the value is not equal to or less than the predetermined value, the first noise spectrum data is not used to reduce the drive noise.
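The decision made by the control means can be illustrated with a short sketch. Several details here are assumptions rather than statements from the claim: each channel's spectrum is reduced to a single level by summing its bins, the difference is taken as an absolute value, a silent input is treated as balanced, and the threshold of 0.2 is an arbitrary placeholder for the "predetermined value". The function names are hypothetical.

```python
def channel_imbalance(spec_1, spec_2):
    """Ratio of the difference between two channel levels to their sum."""
    level_1 = sum(spec_1)  # assumption: one level per channel = sum over bins
    level_2 = sum(spec_2)
    total = level_1 + level_2
    if total == 0:
        return 0.0  # assumption: treat silence on both channels as balanced
    return abs(level_1 - level_2) / total


def use_first_profile(spec_1, spec_2, threshold=0.2):
    """True if the first noise spectrum data should be used for reduction."""
    return channel_imbalance(spec_1, spec_2) <= threshold


balanced = use_first_profile([1.0, 1.0], [1.0, 1.0])  # ratio 0.0
lopsided = use_first_profile([3.0], [1.0])            # ratio 0.5
```

With these toy inputs the balanced pair passes the check and the lopsided pair does not, which mirrors the two branches of the control means.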

According to the present invention, when left and right channel sound is recorded, noise can be reduced with high accuracy in the sound of each channel even if the environmental sound or the driving sound changes.

Block diagram showing an example of the imaging apparatus of the embodiment.
Block diagram showing an example of the configuration of the imaging unit and the audio input unit of the embodiment.
Flowchart showing an example of the moving image recording process in the embodiment.
Diagram showing an example of the amplitude spectrum for each frequency before and during the zoom operation in the embodiment.
Example of a timing chart showing the noise profile creation process of the embodiment.
Example of a timing chart showing the noise profile creation process of the embodiment.
Flowchart showing an example of the noise profile creation process of the embodiment.
Example of a timing chart showing the enlargement correction process for the noise profile of the embodiment.
Example of a timing chart showing the reduction correction process for the noise profile of the embodiment.
Flowchart showing an example of the noise profile correction process of the embodiment.
Diagram showing an example of the time constant settings for the noise profile correction process in the embodiment.
Diagram showing an example of the relationship between an external sound source and the audio input unit of the embodiment.
Timing chart showing an example of the noise profile correction process for Rch and Lch in the embodiment.
Flowchart showing an example of the noise profile correction process for Rch and Lch in the embodiment.
Timing chart showing an example of the noise reduction process in the embodiment.
Flowchart showing an example of the noise reduction process in the embodiment.
Diagram showing an example of the relationship between the coefficient α and the environmental sound in the embodiment.
Timing chart showing an example of the post-processing of the embodiment.
Flowchart showing an example of the post-processing in the embodiment.
Block diagram showing an example of the audio input unit in the embodiment.
Block diagram showing an example of a conventional imaging apparatus.

Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. The embodiments are described using an imaging apparatus 100 such as a digital camera as an example of the electronic device. However, the electronic device is not limited to the imaging apparatus 100 and may be any device that has a microphone, such as a mobile phone or an IC recorder.

FIG. 1 is a block diagram showing an example of the configuration of the imaging apparatus 100. The imaging apparatus 100 includes an imaging unit 101, an audio input unit 102, a memory 103, a display control unit 104, a display unit 105, an encoding processing unit 106, a recording/reproducing unit 107, a recording medium 108, and a control unit 109. The imaging apparatus 100 further includes an operation unit 110, an audio output unit 111, a speaker 112, an external output unit 113, and a system bus 114 that connects these components.

The imaging unit 101 converts an optical image of a subject into an image signal, performs image processing on it, and generates image data. The audio input unit 102 collects sound around the imaging apparatus 100, performs audio processing on it, and generates audio data.

The memory 103 stores the image data supplied from the imaging unit 101 and the audio data supplied from the audio input unit 102. The display control unit 104 causes the display unit 105 to display the image data obtained from the imaging unit 101, the menu screen of the imaging apparatus 100, and the like. The encoding processing unit 106 applies predetermined encoding to the image data stored in the memory 103 to generate compressed image data. It likewise applies predetermined encoding to the audio data stored in the memory 103 to generate compressed audio data. The recording/reproducing unit 107 records at least one of the compressed image data, the compressed audio data, and the compressed moving image data generated by the encoding processing unit 106 on the recording medium 108. The recording/reproducing unit 107 also reads at least one of the image data, audio data, and moving image data recorded on the recording medium 108.

The control unit 109 controls each unit of the imaging apparatus 100 via the system bus 114. The control unit 109 has a CPU and a memory. The memory of the control unit 109 stores a program for controlling each unit of the imaging apparatus 100.

The operation unit 110 accepts operations for inputting user instructions to the imaging apparatus 100 and transmits a signal corresponding to the specific operation performed by the user to the control unit 109. The operation unit 110 has a button for instructing still image shooting, a recording button for instructing the start and stop of moving image recording, a zoom button for instructing the imaging apparatus 100 to zoom the image optically, and the like. The operation unit 110 also has a mode selection button for selecting the operation mode of the imaging apparatus 100 from a still image shooting mode, a moving image shooting mode, and a playback mode.

The audio output unit 111 outputs the audio data read by the recording/reproducing unit 107 to the speaker 112. The external output unit 113 outputs the audio data read by the recording/reproducing unit 107 to an external device.

Next, the operation of the imaging apparatus 100 in the moving image shooting mode is described. In this mode, when the recording button of the operation unit 110 is turned ON, the control unit 109 controls the imaging unit 101 to capture images at a predetermined frame rate and controls the audio input unit 102 to acquire audio data. The image data captured by the imaging unit 101 and the audio data are compressed and recorded on the recording medium 108 as moving image data by the recording/reproducing unit 107. When the recording button of the operation unit 110 is subsequently turned OFF, the control unit 109 closes the moving image data recorded on the recording medium 108 and generates one moving image file. In the moving image shooting mode, the recording button of the operation unit 110 is assumed to be OFF until the user turns it ON.

FIG. 20 shows the relationship between the imaging unit 101 and the audio input unit 102.
The imaging unit 101 includes an optical lens 201, an image sensor 202, a lens control unit 203, and an image processing unit 204.
The optical lens 201 is, for example, a focus lens or a zoom lens for optically focusing on a subject. The optical lens 201 can perform zooming optically. Hereinafter, zooming optically with the optical lens 201 is referred to as a “zoom operation”. In a zoom operation, the lens control unit 203 moves the optical lens 201 in response to an instruction from the control unit 109, thereby zooming the optical image of the subject. The image sensor 202 converts the optical image of the subject into an image signal and outputs it. The lens control unit 203 drives a motor or the like for moving the optical lens 201. The image processing unit 204 performs image processing on the image signal output from the image sensor 202 and generates image data.

For example, when an instruction to start a zoom operation, focus adjustment, or the like is input via the operation unit 110, the control unit 109 changes to ON the zoom control signal that causes the lens control unit 203 to move the optical lens 201. When the zoom control signal changes to ON, the lens control unit 203 drives the motor or the like and moves the optical lens 201.

When the optical lens 201 is moved, the imaging apparatus 100 generates noise from the movement of the optical lens 201 and from the driving of the motor that moves it. Hereinafter, these two kinds of noise are collectively referred to as “drive noise”.

Although FIG. 20 has been described on the assumption that the imaging apparatus 100 includes the optical lens 201 and the lens control unit 203, this is not a limitation. The optical lens 201 and the lens control unit 203 may be detachable from the imaging apparatus 100.

To realize stereo recording, the audio input unit 102 of the imaging apparatus 100 has an R (right) channel audio input unit 102a and an L (left) channel audio input unit 102b. Because the two have the same configuration, only the R channel audio input unit 102a is described below. The R channel audio input unit 102a includes a microphone 205a, an ADC 206a, an FFT 207a, a noise reduction unit 200a, an IFFT 214a, a noise application unit 215a, an audio processing unit 216a, and an ALC 217a. Hereinafter, the R channel is referred to as “Rch” and the L channel as “Lch”.

The microphone 205a converts sound vibration into an electrical signal and outputs an analog audio signal. The ADC (analog-to-digital converter) 206a converts the analog audio signal obtained by the microphone 205a into a digital audio signal. For example, the ADC 206a samples at 48 kHz and outputs time-series digital data of 16 bits per sample. The FFT (fast Fourier transformer) 207a takes as one frame, for example, 1024 time-series audio samples output from the ADC 206a. It applies a fast Fourier transform to the audio data of one frame, generates the amplitude level at each frequency (amplitude spectrum data), and supplies it to the noise reduction unit 200a. The amplitude spectrum generated by the FFT 207a consists of amplitude data for each of 1024 frequency points from 0 to 48 kHz. Although one frame contains 1024 samples in the embodiment, the first 512 samples of the next frame to be processed are the same as the last 512 samples of the immediately preceding frame, so consecutive frames partially overlap.
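The half-overlapping framing described above can be sketched as follows. The function name `split_frames` is hypothetical, and dropping trailing samples that do not fill a whole frame is an assumption made to keep the example short.

```python
def split_frames(samples, frame_len=1024, hop=512):
    """Split a sample sequence into frames that overlap by frame_len - hop.

    With the defaults, the first 512 samples of each frame repeat the last
    512 samples of the previous frame.
    """
    frames = []
    start = 0
    while start + frame_len <= len(samples):
        frames.append(samples[start:start + frame_len])
        start += hop  # advance by half a frame
    return frames  # assumption: an incomplete final frame is discarded


# 2048 samples yield three 1024-sample frames starting at 0, 512 and 1024.
frames = split_frames(list(range(2048)))
```

Each frame would then be passed to the FFT to obtain one amplitude spectrum.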

The noise reduction unit 200a subtracts the per-frequency noise amplitude data, which represents the drive noise generated while the imaging apparatus 100 performs a zoom operation, from the amplitude data of the corresponding frequency output from the FFT 207a. The noise reduction unit 200a supplies the amplitude spectrum data after subtraction to the IFFT (inverse fast Fourier transformer) 214a.

The IFFT (inverse fast Fourier transformer) 214a uses the phase information supplied from the FFT 207a to apply an inverse fast Fourier transform (inverse transform) to the amplitude spectrum supplied from the noise reduction unit 200a, thereby generating audio data in the original time-series format. That is, the IFFT 214a returns the data to a time-series audio signal using the phase information of the audio as it was when input to the FFT 207a.
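The role of the retained phase information can be illustrated with a small sketch. It uses a naive O(N²) discrete Fourier transform from the Python standard library instead of a real FFT, purely to stay self-contained; the function names are hypothetical. Taking the real part of the result assumes that any modification of the amplitudes keeps the spectrum symmetric, as is the case for a real input signal.

```python
import cmath


def dft(x):
    """Naive discrete Fourier transform of a real or complex sequence."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft_from_polar(amplitudes, phases):
    """Rebuild a time-series signal from amplitudes and the original phases."""
    n = len(amplitudes)
    bins = [cmath.rect(a, p) for a, p in zip(amplitudes, phases)]
    return [(sum(bins[k] * cmath.exp(2j * cmath.pi * k * t / n)
                 for k in range(n)) / n).real
            for t in range(n)]


signal = [0.0, 1.0, 0.0, -1.0, 0.5, 0.0, -0.5, 0.0]
spectrum = dft(signal)
amplitudes = [abs(c) for c in spectrum]       # what noise reduction would edit
phases = [cmath.phase(c) for c in spectrum]   # kept unchanged for the inverse
restored = idft_from_polar(amplitudes, phases)
```

With the amplitudes left untouched, `restored` reproduces `signal` up to rounding error; in the device the amplitudes would first be reduced by the noise profile.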

The noise application unit 215a applies a noise signal to the time-series audio signal supplied from the IFFT 214a. The noise signal applied by the noise application unit 215a is a signal at the noise floor level. The audio processing unit 216a performs processing such as wind noise reduction, stereo enhancement, and equalization. The ALC (auto level controller) 217a then adjusts the amplitude of the time-series audio signal to a predetermined level and outputs the adjusted audio data to the memory 103.

Next, the noise reduction unit 200a of the R channel audio input unit 102a in the embodiment is described with reference to FIG. 2.

FIG. 2 is a block diagram showing an example of the configuration of the noise reduction unit 200a. The noise reduction unit 200a includes an integration circuit 250a, a memory 251a, a profile creation unit 252a, a profile storage unit 253a, an amplitude spectrum subtraction unit 254a, a post-processing unit 255a, and a profile correction unit 256a.

The noise reduction unit 200a operates to reduce the drive noise generated while the imaging apparatus 100 performs a zoom operation. Its operation is described below with reference to FIG. 4.
FIG. 4 shows an example of the amplitude spectrum for each frequency before the imaging apparatus 100 performs a zoom operation and while it is performing one. The horizontal axis in FIG. 4 is frequency and covers 1024 points in the range from 0 to 48 kHz (of which 512 points of the frequency spectrum lie at or below the Nyquist frequency of 24 kHz). Reference numeral 401 in FIG. 4 denotes the amplitude spectrum of the environmental sound before the imaging apparatus 100 performs the zoom operation (with the optical lens 201 not moving). Reference numeral 402 denotes the amplitude spectrum of the environmental sound while the imaging apparatus 100 is performing the zoom operation (with the optical lens 201 moving). The amplitude spectrum 402 contains drive noise. The noise reduction unit 200a creates the noise profile used to reduce the drive noise from the difference between the amplitude spectrum 401 and the amplitude spectrum 402. Each part of the noise reduction unit 200a is described below.
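The idea of deriving a noise profile from the two spectra of FIG. 4 can be sketched as a per-bin difference. The function name `noise_profile` and the toy values are hypothetical. The sketch performs a plain subtraction and does not address what to do if a bin comes out negative (for example, if the environmental sound became quieter during the zoom), because that handling is not described at this point.

```python
def noise_profile(before_zoom, during_zoom):
    """Per-bin difference between the spectrum during and before the zoom."""
    if len(before_zoom) != len(during_zoom):
        raise ValueError("spectra must have the same number of bins")
    return [d - b for b, d in zip(before_zoom, during_zoom)]


spectrum_401 = [1.0, 2.0, 0.5]   # environmental sound only
spectrum_402 = [1.5, 4.0, 0.5]   # environmental sound plus drive noise
profile = noise_profile(spectrum_401, spectrum_402)
```

Here the second bin carries most of the estimated drive noise and the third bin carries none.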

In response to an instruction from the control unit 109, the integration circuit 250a integrates over time the amplitude value at each frequency of the amplitude spectrum produced by the FFT 207a. While doing so, the integration circuit 250a counts the number of frames integrated. Let A(fi) denote the amplitude value at frequency fi (where i is one of 0, 1, …, 1023) in the amplitude spectrum data obtained from one frame from the FFT 207a. The integration circuit 250a then obtains the integral (cumulative sum) S(fi) for each frequency by the following equation.
S(fi) = ΣA(fi)

While the lens control unit 203 is not moving the optical lens 201, the integration circuit 250a integrates the amplitude value of each frequency as described above. The integration circuit 250a then outputs the result of dividing the integral value of each frequency by the number of frames n representing the integration period. That is, the integration circuit 250a calculates the average amplitude value Aave(fi) for each frequency as in the following equation and outputs the result.
Aave(fi) = S(fi) / n
The data represented by the average amplitude values Aave(fi) (i = 0, 1, ..., 1023) corresponds to the amplitude spectrum indicated by 401 in FIG. 4. The integration circuit 250a stores the calculated average amplitude values Aave(fi) in the memory 251a.
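The integration and averaging described above can be illustrated with a minimal sketch. The class name SpectrumIntegrator and its methods are hypothetical and are not taken from the embodiment; only a 4-bin spectrum is used to keep the example short, whereas the embodiment uses 1024 points.

```python
from typing import List, Sequence


class SpectrumIntegrator:
    """Hypothetical stand-in for the integration circuit 250a."""

    def __init__(self, num_bins: int) -> None:
        self.sums: List[float] = [0.0] * num_bins  # S(fi) for each bin i
        self.frames = 0                            # n, frames integrated so far

    def add_frame(self, amplitudes: Sequence[float]) -> None:
        """Add one frame of amplitude values A(fi) to the running sums."""
        if len(amplitudes) != len(self.sums):
            raise ValueError("frame has the wrong number of bins")
        for i, a in enumerate(amplitudes):
            self.sums[i] += a
        self.frames += 1

    def average(self) -> List[float]:
        """Return Aave(fi) = S(fi) / n for every bin."""
        if self.frames == 0:
            raise ValueError("no frames integrated yet")
        return [s / self.frames for s in self.sums]


integrator = SpectrumIntegrator(num_bins=4)
for frame in ([1.0, 2.0, 3.0, 4.0], [3.0, 2.0, 1.0, 0.0]):
    integrator.add_frame(frame)
a_ave = integrator.average()
print(a_ave)
```

In this sketch the frame counter plays the role of the frame count kept by the integration circuit, so the same object could be reset and reused for the stabilization period.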

積分回路２５０ａは、レンズ制御部２０３が光学レンズ２０１の移動を開始させてから安定化期間が経過するまでの間、上述のように各周波数の振幅値を積分していく。安定化期間とは、積分回路２５０ａの時定数により積分回路２５０ａに入力される振幅スペクトルが安定するまでの期間である。安定化期間が経過するまでの間において、ＦＦＴ２０７ａから出力される振幅スペクトルには、駆動ノイズが含まれている。安定化期間（例えば、ｍフレーム分に相当するものとする）が経過した場合、積分回路２５０ａは、Ｓ（ｆｉ）／ｍをプロファイル作成部２５２ａに出力する。Ｓ（ｆｉ）／ｍは、図４の４０２によって示されている振幅スペクトルに対応する。 The integration circuit 250a integrates the amplitude value of each frequency as described above from when the lens control unit 203 starts moving the optical lens 201 until the stabilization period elapses. The stabilization period is a period until the amplitude spectrum input to the integration circuit 250a is stabilized by the time constant of the integration circuit 250a. Until the stabilization period elapses, the amplitude spectrum output from the FFT 207a includes drive noise. When a stabilization period (for example, corresponding to m frames) has elapsed, the integration circuit 250a outputs S (fi) / m to the profile creation unit 252a. S (fi) / m corresponds to the amplitude spectrum indicated by 402 in FIG.

The profile creation unit 252a subtracts S(fi)/n stored in the memory 251a from S(fi)/m supplied from the integration circuit 250a, thereby calculating N(fi), the amplitude value corresponding to the drive noise at each frequency, as in the following equation. Here, S(fi)/m is the average over the m frames of the stabilization period, and S(fi)/n is the average amplitude value Aave(fi) obtained before the zoom operation.
N(fi) = S(fi) / m − S(fi) / n
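As a rough illustration of this calculation, the sketch below averages the frames captured while the lens is moving and subtracts the stored pre-zoom average. The function names mean_spectrum and make_noise_profile are hypothetical, and the 3-bin spectra are made-up values.

```python
from typing import List, Sequence


def mean_spectrum(frames: Sequence[Sequence[float]]) -> List[float]:
    """Per-bin average over the given frames, i.e. S(fi) / m."""
    count = len(frames)
    if count == 0:
        raise ValueError("at least one frame is required")
    return [sum(column) / count for column in zip(*frames)]


def make_noise_profile(zoom_frames: Sequence[Sequence[float]],
                       a_ave: Sequence[float]) -> List[float]:
    """N(fi) = (average during zoom) - (average before zoom)."""
    during_zoom = mean_spectrum(zoom_frames)
    if len(during_zoom) != len(a_ave):
        raise ValueError("spectra must have the same number of bins")
    return [d - q for d, q in zip(during_zoom, a_ave)]


a_ave = [1.0, 1.0, 2.0]                            # pre-zoom average, Aave(fi)
zoom_frames = [[1.0, 3.0, 2.0], [1.0, 5.0, 4.0]]   # m = 2 frames during zoom
noise_profile = make_noise_profile(zoom_frames, a_ave)
print(noise_profile)
```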

Ｎ（ｆｉ）が算出された後、プロファイル作成部２５２ａは、Ｎ（ｆｉ）をノイズプロファイルとしてプロファイル格納部２５３ａに格納する。ノイズプロファイルとは、ズーム動作が行われている場合に発生する駆動ノイズを示すデータである。 After N (fi) is calculated, the profile creation unit 252a stores N (fi) as a noise profile in the profile storage unit 253a. The noise profile is data indicating drive noise that occurs when a zoom operation is performed.

Thereafter, the amplitude spectrum subtraction unit 254a subtracts the drive noise amplitude value N(fi) read from the profile storage unit 253a from the amplitude value A(fi) of the amplitude spectrum supplied from the FFT 207a. Hereinafter, the processing of subtracting the amplitude value N(fi), which is the noise profile read from the profile storage unit 253a, from A(fi) of the amplitude spectrum supplied from the FFT 207a is referred to as "subtraction processing". The amplitude spectrum subtraction unit 254a outputs the amplitude spectrum A_NR(fi) obtained by the following equation to the IFFT 214a or the IFFT 214b.
A_NR(fi) = A(fi) − N(fi)
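The subtraction processing can be sketched per frequency bin as follows. The function name subtract_profile is hypothetical. The optional floor argument is an assumption added for illustration only: the text above does not state what happens when N(fi) exceeds A(fi), so by default the sketch leaves the difference as computed.

```python
from typing import List, Optional, Sequence


def subtract_profile(amplitudes: Sequence[float],
                     noise_profile: Sequence[float],
                     floor: Optional[float] = None) -> List[float]:
    """Return A_NR(fi) = A(fi) - N(fi) for every bin.

    `floor` is NOT part of the described embodiment; when given, results
    below it are raised to it (a common safeguard in spectral subtraction).
    """
    if len(amplitudes) != len(noise_profile):
        raise ValueError("spectrum and profile must have the same length")
    result = [a - n for a, n in zip(amplitudes, noise_profile)]
    if floor is not None:
        result = [max(value, floor) for value in result]
    return result


frame = [5.0, 4.0, 1.0]      # A(fi) for one frame
profile = [0.0, 3.0, 2.0]    # N(fi) read from the profile store
plain = subtract_profile(frame, profile)
floored = subtract_profile(frame, profile, floor=0.0)
print(plain, floored)
```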

Note that during the period from when the instruction to start the zoom operation is received from the user until the stabilization period elapses, the profile creation unit 252a has not yet completed the noise profile. To shorten the period until the profile creation unit 252a completes the noise profile, "m" must be made small. However, if "m" is extremely small, the accuracy with which the noise profile reduces drive noise may degrade. When the lens control unit 203 starts control for moving the optical lens 201, the start-up sound of the optical lens 201, sound fluctuation, and the like, which are kinds of drive noise, occur for about 70 ms. To reduce the start-up sound, sound fluctuation, and the like of the optical lens 201, "m" is set to, for example, "15" so that the profile creation unit 252a creates the noise profile over a period exceeding 70 ms.

In the embodiment, one frame consists of 1024 time-series audio samples, and each frame overlaps the adjacent frame by half. Since the sampling rate of the audio data is 48 kHz, with m = 15 the noise profile creation period T is
T = period of m frames = m × (1024 / 2) / 48 kHz = 160 ms
The profile creation unit 252a creates the noise profile between the time the instruction to start the zoom operation is received from the user and the time the creation period T elapses. The profile creation unit 252a can therefore create a highly accurate noise profile for reducing the start-up sound, sound fluctuation, and the like of the optical lens 201.
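The arithmetic for the creation period can be checked with a short sketch. The function names are hypothetical. The helper min_frames_exceeding is not part of the embodiment; it is included only to show how the chosen value of m compares with the 70 ms start-up interval.

```python
import math


def hop_size(frame_len: int = 1024, overlap: float = 0.5) -> int:
    """Samples by which consecutive frames advance (512 for half overlap)."""
    return int(round(frame_len * (1.0 - overlap)))


def creation_period_s(m_frames: int, frame_len: int = 1024,
                      overlap: float = 0.5,
                      sample_rate_hz: int = 48_000) -> float:
    """T = m x (frame_len / 2) / sample rate, in seconds."""
    return m_frames * hop_size(frame_len, overlap) / sample_rate_hz


def min_frames_exceeding(duration_s: float, frame_len: int = 1024,
                         overlap: float = 0.5,
                         sample_rate_hz: int = 48_000) -> int:
    """Smallest m whose period is strictly longer than duration_s."""
    hop_s = hop_size(frame_len, overlap) / sample_rate_hz
    return math.floor(duration_s / hop_s) + 1


period = creation_period_s(15)
fewest = min_frames_exceeding(0.070)
print(f"T = {period * 1000:.0f} ms, fewest frames over 70 ms = {fewest}")
```

Under these parameters, 7 frames would already exceed 70 ms, so m = 15 leaves a margin beyond the start-up interval.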

後処理部２５５ａは、振幅スペクトル減算部２５４ａによって減算処理が行われた後の振幅スペクトルを補正して、ＩＦＦＴ２１４ａに出力する。 The post-processing unit 255a corrects the amplitude spectrum after the subtraction process is performed by the amplitude spectrum subtracting unit 254a and outputs the corrected amplitude spectrum to the IFFT 214a.

プロファイル補正部２５６ａは、環境音の大きさに応じて、プロファイル格納部２５３ａに格納されたノイズプロファイルを補正する処理を行う。プロファイル補正部２５６ａによって行われるノイズプロファイルの補正として、拡大補正（増加補正）と縮小補正（減少補正）とがある。プロファイル補正部２５６ａは、ノイズプロファイルの拡大補正を行うプロファイル拡大部２７１ａと、ノイズプロファイルの縮小補正を行うプロファイル縮小部２７２ａとを有する。 The profile correction unit 256a performs a process of correcting the noise profile stored in the profile storage unit 253a according to the magnitude of the environmental sound. As correction of the noise profile performed by the profile correction unit 256a, there are enlargement correction (increase correction) and reduction correction (decrease correction). The profile correction unit 256a includes a profile enlargement unit 271a that performs noise profile enlargement correction and a profile reduction unit 272a that performs noise profile reduction correction.

Enlargement correction of the noise profile is correction that increases the amplitude spectrum of the noise profile created by the profile creation unit 252a or of the noise profile corrected by the profile correction unit 256a. That is, performing enlargement correction on the noise profile makes the amplitude spectrum A_NR(fi) obtained after the subtraction processing by the amplitude spectrum subtraction unit 254a smaller. Reduction correction of the noise profile is correction that decreases the amplitude spectrum of the noise profile created by the profile creation unit 252a or of the noise profile corrected by the profile correction unit 256a. That is, performing reduction correction on the noise profile makes the amplitude spectrum A_NR(fi) obtained after the subtraction processing by the amplitude spectrum subtraction unit 254a larger. The noise profile correction by the profile correction unit 256a is performed as necessary for the amplitude spectrum A(fi) of each frame supplied from the FFT 207a. While the imaging apparatus 100 is performing a zoom operation, the profile correction unit 256a can correct the noise profile appropriately in accordance with fluctuations in the environmental sound and the drive noise.

Ｌチャネル音声入力部１０２ｂもＲチャネル音声入力部１０２ａと同様に、マイク２０５ｂ、ＡＤＣ２０６ｂ、ＦＦＴ２０７ｂ、ノイズ低減部２００ｂ、ＩＦＦＴ２１４ｂ、ノイズ印加部２１５ｂ、音声処理部２１６ｂ、ＡＬＣ２１７ｂを有する。マイク２０５ａとマイク２０５ｂとは、同様の構成であり、ＦＦＴ２０７ａとＦＦＴ２０７ｂとは、同様の構成であり、ノイズ低減部２００ａとノイズ低減部２００ｂは、同様の構成である。さらに、ＩＦＦＴ２１４ａとＩＦＦＴ２１４ｂとは、同様の構成であり、ノイズ印加部２１５ａとノイズ印加部２１５ｂとは、同様の構成である。さらに、音声処理部２１６ａと音声処理部２１６ｂとは、同様の構成であり、ＡＬＣ２１７ａとＡＬＣ２１７ｂとは同様の構成である。 Similarly to the R channel audio input unit 102a, the L channel audio input unit 102b includes a microphone 205b, an ADC 206b, an FFT 207b, a noise reduction unit 200b, an IFFT 214b, a noise application unit 215b, an audio processing unit 216b, and an ALC 217b. The microphone 205a and the microphone 205b have the same configuration, the FFT 207a and the FFT 207b have the same configuration, and the noise reduction unit 200a and the noise reduction unit 200b have the same configuration. Furthermore, IFFT 214a and IFFT 214b have the same configuration, and noise application unit 215a and noise application unit 215b have the same configuration. Furthermore, the audio processing unit 216a and the audio processing unit 216b have the same configuration, and the ALC 217a and the ALC 217b have the same configuration.

図３は、撮像装置１００のモードとして動画撮影モードが選択された場合に制御部１０９によって行われる動画記録処理の一例を示すフローチャートである。マイク２０５ａからＡＤＣ２０６ａにアナログの音声信号が出力される場合を一例に挙げて、以下、動画記録処理について説明する。 FIG. 3 is a flowchart illustrating an example of a moving image recording process performed by the control unit 109 when the moving image shooting mode is selected as the mode of the imaging apparatus 100. The moving image recording process will be described below by taking an example in which an analog audio signal is output from the microphone 205a to the ADC 206a.

When the imaging apparatus 100 is changed to the moving image shooting mode, the control unit 109 clears the profile storage unit 253a in the noise reduction unit 200a to zero (S301). Thereafter, the control unit 109 causes the integration circuit 250a to start integrating the amplitude spectrum input from the FFT 207a (S302). The control unit 109 then determines whether the recording button of the operation unit 110 has been turned ON, that is, whether an instruction to cause the imaging apparatus 100 to start recording moving image data has been input (S303). When such an instruction has been input (Yes in S303), the control unit 109 starts recording moving image data (S304). In this case, the control unit 109 starts encoding the image data and audio data that are stored in the memory 103 from the imaging unit 101 and the audio input unit 102, in order to generate moving image data, and causes the recording/playback unit 107 to start recording to the recording medium 108.

In S305, the control unit 109 determines whether an instruction to start a zoom operation has been input via the operation unit 110. When no instruction to start a zoom operation has been input via the operation unit 110 (No in S305), the control unit 109 determines whether an instruction to cause the imaging apparatus 100 to end recording of moving image data has been input via the operation unit 110 (S306). When such an instruction has been input (Yes in S306), the control unit 109 starts encoding the moving image data stored in the memory 103 and causes it to be recorded on the recording medium 108. The control unit 109 further performs close processing on the moving image data stored in the recording medium 108 and completes it as a moving image file (S312). When no instruction to end recording of moving image data has been input via the operation unit 110 (No in S306), the process returns from S306 to S302.

一方、操作部１１０からズーム動作を開始する指示が入力された場合、処理は、Ｓ３０５からＳ３０７に進む。Ｓ３０７において、制御部１０９は、ノイズ低減部２００ａにノイズプロファイルを作成させるために、ノイズプロファイル作成処理を行う。Ｓ３０７において行われるノイズプロファイル作成処理については後述する。ノイズプロファイル作成処理が実行されることによって作成されたノイズプロファイルは、プロファイル格納部２５３ａに格納される。 On the other hand, when an instruction to start the zoom operation is input from the operation unit 110, the process proceeds from S305 to S307. In step S307, the control unit 109 performs noise profile creation processing to cause the noise reduction unit 200a to create a noise profile. The noise profile creation process performed in S307 will be described later. The noise profile created by executing the noise profile creation process is stored in the profile storage unit 253a.

Next, the control unit 109 performs noise reduction processing that subtracts the amplitude value of the specific frequency contained in the noise profile from the amplitude value at each frequency of the amplitude spectrum produced by the fast Fourier transform in the FFT 207a (S308). When the noise reduction processing is performed, the control unit 109 controls the amplitude spectrum subtraction unit 254a to perform the subtraction processing. Next, the control unit 109 performs noise profile correction processing, in which it controls the profile correction unit 256a to correct the noise profile stored in the profile storage unit 253a (S309). The noise profile correction processing performed in S309 will be described later. The noise profile corrected by the profile correction unit 256a is applied in the subtraction processing for the next frame. Next, when both the Rch amplitude spectrum obtained after the subtraction processing by the amplitude spectrum subtraction unit 254a and the Lch amplitude spectrum obtained after the subtraction processing by the amplitude spectrum subtraction unit 254b are available, the control unit 109 performs post-processing (S310). Post-processing is processing that corrects the Rch amplitude spectrum and the Lch amplitude spectrum so that they become the same. The post-processing performed in S310 will be described later.

そして、操作部１１０からズーム動作を停止する指示が入力されたか否かが判定される（Ｓ３１１）。操作部１１０からズーム動作を停止する指示が入力されていない場合、撮像装置１００においてズーム動作が継続して実行されるので、制御部１０９は、Ｓ３０８からＳ３１０までの処理を繰り返す。また、制御部１０９は、操作部１１０からズーム動作を停止する指示が入力された場合、撮像装置１００におけるズーム動作を停止し、Ｓ３０１の処理に戻る。 Then, it is determined whether or not an instruction to stop the zoom operation is input from the operation unit 110 (S311). When an instruction to stop the zoom operation is not input from the operation unit 110, the zoom operation is continuously executed in the imaging apparatus 100, and thus the control unit 109 repeats the processing from S308 to S310. In addition, when an instruction to stop the zoom operation is input from the operation unit 110, the control unit 109 stops the zoom operation in the imaging apparatus 100 and returns to the process of S301.

なお、図３の動画記録処理について、マイク２０５ａからＡＤＣ２０６ａにアナログの音声信号が出力される場合を一例に挙げて説明を行った。しかしながら、マイク２０５ｂからＡＤＣ２０６ｂにアナログの音声信号が出力される場合も、図３の動画記録処理と同様に動画の記録を行う。 Note that the moving image recording process in FIG. 3 has been described by taking an example in which an analog audio signal is output from the microphone 205a to the ADC 206a. However, even when an analog audio signal is output from the microphone 205b to the ADC 206b, a moving image is recorded in the same manner as the moving image recording process of FIG.

[Noise Profile Creation Processing (S307)]
The noise profile creation processing executed by the control unit 109 in S307 will be described with reference to FIGS. 4, 5, 6, and 7. The following description takes as an example the case where an analog audio signal is output from the microphone 205a to the ADC 206a.

図５は、周波数毎の振幅スペクトルに対するノイズプロファイル作成処理を示すタイミングチャート図である。図６は、撮像装置１００によってズーム動作が開始される前における環境音が大きい場合における周波数毎の振幅スペクトルに対するノイズプロファイル作成処理を示すタイミングチャート図である。 FIG. 5 is a timing chart showing a noise profile creation process for the amplitude spectrum for each frequency. FIG. 6 is a timing chart illustrating a noise profile creation process for the amplitude spectrum for each frequency when the environmental sound is large before the zoom operation is started by the imaging apparatus 100.

以下、図５及び６について説明を行う。図５及び６おいて、制御部１０９が光学レンズ２０１を移動させるようにレンズ制御部２０３を制御するタイミングを「ｔ１」とする。制御部１０９は、光学レンズ２０１を移動させるようにレンズ制御部２０３を制御するために、ズーム制御信号をＯＮに変更する。制御部１０９は、光学レンズ２０１を移動させるようにレンズ制御部２０３を制御しない場合、ズーム制御信号をＯＮにしないので、この場合、ズーム制御信号は、ＯＦＦになる。また、制御部１０９は、光学レンズ２０１の移動を停止させるようにレンズ制御部２０３を制御する場合、ズーム制御信号をＯＦＦに変更する。ズーム制御信号がＯＮに変更された場合、レンズ制御部２０３は、光学レンズ２０１の移動を開始させる。ズーム制御信号がＯＦＦに変更された場合、レンズ制御部２０３は、光学レンズ２０１の移動を停止させる。さらに、図５及び６おいて、振幅スペクトル減算部２５４ａにノイズプロファイルを用いた減算処理を開始させるタイミングを「ｔ２」とする。さらに、図５及び６おいて、制御部１０９がズーム制御信号をＯＦＦにするタイミングを「ｔ３」とする。ノイズプロファイルを用いた減算処理は、タイミングｔ２からタイミングｔ３までの期間、振幅スペクトル減算部２５４ａによって行われる。 Hereinafter, FIGS. 5 and 6 will be described. 5 and 6, the timing at which the control unit 109 controls the lens control unit 203 to move the optical lens 201 is “t1”. The control unit 109 changes the zoom control signal to ON in order to control the lens control unit 203 to move the optical lens 201. If the control unit 109 does not control the lens control unit 203 so as to move the optical lens 201, the zoom control signal is not turned on. In this case, the zoom control signal is turned off. In addition, when the control unit 109 controls the lens control unit 203 to stop the movement of the optical lens 201, the control unit 109 changes the zoom control signal to OFF. When the zoom control signal is changed to ON, the lens control unit 203 starts to move the optical lens 201. When the zoom control signal is changed to OFF, the lens control unit 203 stops the movement of the optical lens 201. Further, in FIGS. 5 and 6, the timing at which the amplitude spectrum subtraction unit 254a starts the subtraction process using the noise profile is “t2”. Further, in FIGS. 5 and 6, the timing at which the control unit 109 turns off the zoom control signal is “t3”. The subtraction process using the noise profile is performed by the amplitude spectrum subtraction unit 254a during the period from the timing t2 to the timing t3.

In FIGS. 5 and 6, the amplitude spectrum at a predetermined frequency fi produced by the fast Fourier transform in the FFT 207a is denoted "It". The amplitude spectrum indicating the amplitude at the predetermined frequency fi integrated by the integration circuit 250a is denoted "Dt". The noise profile corresponding to the predetermined frequency fi created by the profile creation unit 252a is denoted "Pt", and the amplitude spectrum at the predetermined frequency fi output from the amplitude spectrum subtraction unit 254a is denoted "Ut". Further, the time-series digital audio signal at the predetermined frequency fi after the noise signal has been applied by the noise application unit 215a is denoted "Nt".

ノイズプロファイルＰｔは、ナイキスト周波数である２４ｋＨｚまでにおいて５１２ポイントの振幅スペクトルを持つ。図５及び６における５１２ポイントの振幅スペクトルＤｔ１は、撮像装置１００がズーム動作を行う前（光学レンズ２０１が移動していない状態）における環境音を示す振幅スペクトルを示し、図４における４０１に対応する。図５及び６における５１２ポイントの振幅スペクトル（Ｄｔ２）は、撮像装置１００がズーム動作を行っている場合（光学レンズ２０１が移動している状態）における環境音を示す振幅スペクトルを示し、図４における４０２に対応する。 The noise profile Pt has an amplitude spectrum of 512 points up to the Nyquist frequency of 24 kHz. A 512-point amplitude spectrum Dt1 in FIGS. 5 and 6 indicates an amplitude spectrum indicating an environmental sound before the imaging apparatus 100 performs a zoom operation (a state in which the optical lens 201 is not moved), and corresponds to 401 in FIG. . A 512-point amplitude spectrum (Dt2) in FIGS. 5 and 6 indicates an amplitude spectrum indicating an environmental sound when the imaging apparatus 100 is performing a zoom operation (a state in which the optical lens 201 is moving). Corresponding to 402.

図７は、制御部１０９によって行わせるノイズプロファイル作成処理を示すフローチャートである。次に、図７を用いて、制御部１０９によって行われるノイズプロファイル作成処理について説明する。なお、プロファイル作成部２５２ａがノイズプロファイルを作成する場合を一例に挙げて、以下、ノイズプロファイル作成処理について説明する。操作部１１０からズーム動作を開始する指示が入力された場合（Ｓ３０５でＹｅｓ）に、制御部１０９によってズーム制御信号はＯＦＦからＯＮに変更される（タイミングｔ１）。タイミングｔ１において、制御部１０９は、上述のように、Ａａｖｅ（ｆｉ）を算出するように積分回路２５０ａを制御する。積分回路２５０ａによってＡａｖｅ（ｆｉ）が算出された場合、制御部１０９は、Ａａｖｅ（ｆｉ）を振幅スペクトルＤｔ１としてメモリ２５１ａに保存する（Ｓ７０１）。 FIG. 7 is a flowchart showing a noise profile creation process to be performed by the control unit 109. Next, the noise profile creation process performed by the control unit 109 will be described with reference to FIG. The case where the profile creating unit 252a creates a noise profile will be described as an example, and the noise profile creating process will be described below. When an instruction to start the zoom operation is input from the operation unit 110 (Yes in S305), the zoom control signal is changed from OFF to ON by the control unit 109 (timing t1). At timing t1, the control unit 109 controls the integration circuit 250a so as to calculate Aave (fi) as described above. When Aave (fi) is calculated by the integrating circuit 250a, the control unit 109 stores Aave (fi) in the memory 251a as the amplitude spectrum Dt1 (S701).

Next, the control unit 109 controls the integration circuit 250a so as to obtain, from timing t1 until the stabilization period elapses, the integral value for each frequency of the amplitude spectrum while the lens control unit 203 is moving the optical lens 201. Thereafter, the control unit 109 determines whether the stabilization period has elapsed (S702). When the stabilization period has elapsed (timing t2) (Yes in S702), the control unit 109 controls the integration circuit 250a to calculate S(fi)/m as described above. When S(fi)/m has been calculated by the integration circuit 250a, the control unit 109 stores S(fi)/m in the memory 251a as the amplitude spectrum Dt2 (S703).

Next, the control unit 109 determines whether the amplitude spectrum Dt1 is equal to or less than a predetermined amplitude spectrum Dtth (S704). The predetermined amplitude spectrum Dtth is assumed to be stored in the memory 103 in advance. The predetermined amplitude spectrum Dtth is set so that drive noise can be reduced even when the environmental sound before the imaging apparatus 100 starts the zoom operation is loud. The predetermined amplitude spectrum Dtth is set to a level that is a fixed amount lower than the noise level predicted for the acoustic noise of the imaging apparatus 100.

振幅スペクトルＤｔ１が所定の振幅スペクトルＤｔｔｈよりも大きいと判定された場合（Ｓ７０４でＮｏ）、制御部１０９は、ズーム動作が開始される前の環境音が大きいと判定する。振幅スペクトルＤｔ１が所定の振幅スペクトルＤｔｔｈより大きいと判定された場合（Ｓ７０４でＮｏ）、ノイズプロファイル作成処理は、図６のようなタイミングチャートになる。この場合（Ｓ７０４でＮｏ）、制御部１０９は、振幅スペクトルＤｔ１としてメモリ２５１ａに保存されているＡａｖｅ（ｆｉ）を消去し、所定の振幅スペクトルＤｔｔｈが振幅スペクトルＤｔ１としてメモリ２５１ａに保存されるようにする（Ｓ７０５）。所定の振幅スペクトルＤｔｔｈが振幅スペクトルＤｔ１としてメモリ２５１ａに保存された場合、制御部１０９は、Ｓ７０６の処理を行う。振幅スペクトルＤｔ１が所定の振幅スペクトルＤｔｔｈ以下であると判定された場合（Ｓ７０４でＹｅｓ）、ノイズプロファイル作成処理は、図５のようなタイミングチャートになる。この場合（Ｓ７０４でＹｅｓ）、制御部１０９は、Ｓ７０６の処理を行う。 When it is determined that the amplitude spectrum Dt1 is larger than the predetermined amplitude spectrum Dtth (No in S704), the control unit 109 determines that the environmental sound before the zoom operation is started is large. When it is determined that the amplitude spectrum Dt1 is larger than the predetermined amplitude spectrum Dtth (No in S704), the noise profile creation process is a timing chart as shown in FIG. In this case (No in S704), the control unit 109 deletes Aave (fi) stored in the memory 251a as the amplitude spectrum Dt1 so that the predetermined amplitude spectrum Dtth is stored in the memory 251a as the amplitude spectrum Dt1. (S705). When the predetermined amplitude spectrum Dtth is stored in the memory 251a as the amplitude spectrum Dt1, the control unit 109 performs the process of S706. When it is determined that the amplitude spectrum Dt1 is equal to or less than the predetermined amplitude spectrum Dtth (Yes in S704), the noise profile creation process is a timing chart as shown in FIG. In this case (Yes in S704), the control unit 109 performs the process of S706.

次に、制御部１０９は、振幅スペクトルＤｔ２から振幅スペクトルＤｔ１を減算することによって、ノイズプロファイルＰｔを作成するようにプロファイル作成部２５２ａを制御する（Ｓ７０６）。振幅スペクトルＤｔ１が所定の振幅スペクトルＤｔｔｈ以下である場合、プロファイル作成部２５２ａは、振幅スペクトルＤｔ２からＡａｖｅ（ｆｉ）を減算することによって、ノイズプロファイルＰｔを作成する。振幅スペクトルＤｔ１が所定の振幅スペクトルＤｔｔｈよりも大きい場合、プロファイル作成部２５２ａは、振幅スペクトルＤｔ２から所定の振幅スペクトルＤｔｔｈを減算することによって、ノイズプロファイルＰｔを作成する。プロファイル作成部２５２ａによって作成されたノイズプロファイルＰｔは、プロファイル格納部２５３ａに格納される。 Next, the control unit 109 controls the profile creation unit 252a to create the noise profile Pt by subtracting the amplitude spectrum Dt1 from the amplitude spectrum Dt2 (S706). When the amplitude spectrum Dt1 is equal to or smaller than the predetermined amplitude spectrum Dtth, the profile creation unit 252a creates a noise profile Pt by subtracting Aave (fi) from the amplitude spectrum Dt2. When the amplitude spectrum Dt1 is larger than the predetermined amplitude spectrum Dtth, the profile creation unit 252a creates a noise profile Pt by subtracting the predetermined amplitude spectrum Dtth from the amplitude spectrum Dt2. The noise profile Pt created by the profile creation unit 252a is stored in the profile storage unit 253a.
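Steps S704 to S706 can be sketched as follows. The function name create_noise_profile is hypothetical. The sketch assumes that the comparison with Dtth is made independently for each frequency bin, which is one reading of the description and is not stated explicitly; the sample values are made up.

```python
from typing import List, Sequence


def create_noise_profile(dt1: Sequence[float],
                         dt2: Sequence[float],
                         dtth: Sequence[float]) -> List[float]:
    """Return Pt = Dt2 - Dt1, with Dt1 replaced by Dtth where it exceeds Dtth.

    Assumption: the S704 comparison is applied bin by bin.
    """
    if not (len(dt1) == len(dt2) == len(dtth)):
        raise ValueError("all spectra must have the same number of bins")
    # S704 / S705: a loud pre-zoom spectrum is replaced by the threshold.
    baseline = [min(before, limit) for before, limit in zip(dt1, dtth)]
    # S706: subtract the (possibly replaced) pre-zoom spectrum from Dt2.
    return [during - before for during, before in zip(dt2, baseline)]


dt1 = [1.0, 6.0]    # pre-zoom average; second bin is louder than Dtth
dtth = [2.0, 2.0]   # predetermined threshold spectrum
dt2 = [4.0, 7.0]    # average during the stabilization period
pt = create_noise_profile(dt1, dt2, dtth)
print(pt)
```

In the second bin, subtracting the measured pre-zoom value would give a profile of only 1.0; substituting the threshold yields 5.0, so loud environmental sound before the zoom does not shrink the profile.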

安定化期間が経過していない場合（Ｓ７０２でＮｏ）、プロファイル格納部２５３ａには、ノイズプロファイルＰｔは格納されていないので、ノイズプロファイルＰｔを用いて駆動ノイズを低減することはできない。そこで、制御部１０９は、振幅スペクトルＵｔが振幅スペクトルＤｔ１と同一になるように振幅スペクトル減算部２５４ａを制御する。安定化期間が経過していない場合、図５の５０１のように環境音が途中から急激に変動する場合がある。この場合、制御部１０９は、振幅スペクトルＩｔが振幅スペクトルＤｔ１以上であるか否かを判定する（Ｓ７０７）。 When the stabilization period has not elapsed (No in S702), since the noise profile Pt is not stored in the profile storage unit 253a, driving noise cannot be reduced using the noise profile Pt. Therefore, the control unit 109 controls the amplitude spectrum subtraction unit 254a so that the amplitude spectrum Ut is the same as the amplitude spectrum Dt1. When the stabilization period has not elapsed, the environmental sound may fluctuate abruptly from the middle as indicated by reference numeral 501 in FIG. In this case, the control unit 109 determines whether or not the amplitude spectrum It is greater than or equal to the amplitude spectrum Dt1 (S707).

When it is determined that the amplitude spectrum It is equal to or greater than the amplitude spectrum Dt1 (Yes in S707), the control unit 109 controls the amplitude spectrum subtraction unit 254a so that the amplitude spectrum Ut becomes the same as the amplitude spectrum Dt1 (S708). In this case, the amplitude spectrum Ut is controlled to be the same as the amplitude spectrum Dt1 until the stabilization period elapses (from timing t1 to timing t2). When it is determined that the amplitude spectrum It is not equal to or greater than the amplitude spectrum Dt1 (No in S707), the control unit 109 controls the amplitude spectrum subtraction unit 254a so that the amplitude spectrum Ut becomes the same as the amplitude spectrum It (S709). In this case, that is, when the amplitude spectrum It is smaller than the amplitude spectrum Dt1, the amplitude spectrum Ut is controlled to be the same as the amplitude spectrum It until the stabilization period elapses (from timing t1 to timing t2).
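The selection made in S707 to S709 amounts to limiting the output to the pre-zoom level at each frequency. A minimal sketch, with the hypothetical function name output_before_profile and made-up values:

```python
from typing import List, Sequence


def output_before_profile(it: Sequence[float],
                          dt1: Sequence[float]) -> List[float]:
    """Ut for the stabilization period (timing t1 to t2).

    S707: compare It with Dt1 at each frequency.
    S708: if It >= Dt1, output Dt1.
    S709: otherwise, output It unchanged.
    """
    if len(it) != len(dt1):
        raise ValueError("spectra must have the same number of bins")
    return [before if current >= before else current
            for current, before in zip(it, dt1)]


it = [3.0, 1.0, 2.0]    # current frame from the FFT
dt1 = [2.0, 2.0, 2.0]   # pre-zoom average stored in memory
ut = output_before_profile(it, dt1)
print(ut)
```

The result never exceeds Dt1 in any bin, which is why drive noise added on top of the environmental sound is held down even before the noise profile exists.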

なお、図７のノイズプロファイル作成処理について、プロファイル作成部２５２ａがノイズプロファイルを作成する場合を一例に挙げて説明を行った。しかしながら、プロファイル作成部２５２ｂがノイズプロファイルを作成する場合も、図７のノイズプロファイル作成処理と同様にノイズプロファイルを作成する。 Note that the noise profile creation processing in FIG. 7 has been described by taking as an example the case where the profile creation unit 252a creates a noise profile. However, even when the profile creation unit 252b creates a noise profile, it creates a noise profile in the same manner as the noise profile creation processing of FIG.

なお、図６のような場合、振幅スペクトルＩｔが振幅スペクトルＤｔ１以上の状態になったり、振幅スペクトルＩｔが振幅スペクトルＤｔ１よりも小さい状態になったりを繰り返す場合がある。このような場合であっても、振幅スペクトルＵｔは、振幅スペクトルＤｔ１を超えないように制御される。これにより、撮像装置１００は、安定化期間が経過するまで（タイミングｔ１からタイミングｔ２まで）の期間において、駆動ノイズを低減することができる。 In the case shown in FIG. 6, the amplitude spectrum It may be in a state equal to or greater than the amplitude spectrum Dt1, or the amplitude spectrum It may be smaller than the amplitude spectrum Dt1. Even in such a case, the amplitude spectrum Ut is controlled so as not to exceed the amplitude spectrum Dt1. Thereby, the imaging device 100 can reduce drive noise during a period until the stabilization period elapses (from timing t1 to timing t2).

In this way, during the period from when the zoom control signal is turned ON until the stabilization period elapses (from timing t1 to timing t2), the control unit 109 performs control so that the amplitude spectrum Ut becomes the amplitude spectrum It or the amplitude spectrum Dt1. The imaging apparatus 100 can thereby reduce drive noise in the period from when the zoom control signal is turned ON until the stabilization period elapses (from timing t1 to timing t2). Further, in the period after the stabilization period has elapsed (from timing t2 to timing t3), the control unit 109 can use the noise profile Pt to reduce the drive noise in that period. The imaging apparatus 100 can thereby reduce drive noise seamlessly.

[Noise Profile Correction Processing (S309)]
The noise profile correction processing executed by the control unit 109 in S309 will be described with reference to FIGS. 8, 9, 10, and 11. In the following description, the case where the profile correction unit 256a corrects the noise profile created by the profile creation unit 252a is used as an example.

FIG. 8 is a timing chart showing processing for performing enlargement correction of the noise profile Pt. FIG. 9 is a timing chart showing processing for performing reduction correction of the noise profile Pt. The symbols t1, t2, t3, It, Dt, Pt, Ut, and Nt in FIGS. 8 and 9 are the same as t1, t2, t3, It, Dt, Pt, Ut, and Nt in FIGS. 5 and 6, and their description is omitted. FIG. 11 is a diagram showing the time constant settings used in the noise profile correction processing.

FIG. 10 is a flowchart showing the noise profile correction processing performed by the control unit 109. This processing is described next with reference to FIG. 10, again using as an example the case where the profile correction unit 256a corrects the noise profile created by the profile creation unit 252a. After the noise profile creation processing in S307 has been performed, the integration circuit 250a integrates the amplitude value of each frequency over a preset number of frames and divides the integrated amplitude value by that number of frames, thereby calculating an average amplitude value for each frequency. The preset number of frames may be set by the user. When the preset number of frames is 1, the amplitude spectrum output from the integration circuit 250a is equal to the value output from the FFT 207a. The integration circuit 250a outputs the average amplitude value for each frequency as the amplitude spectrum Dt.
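The averaging performed by the integration circuit can be sketched as follows. This is only an illustration under the assumption that each frame is a list of per-frequency amplitude values of equal length; the function name `average_amplitude_spectrum` and the sample data are hypothetical.

```python
def average_amplitude_spectrum(frames):
    """Average per-frequency amplitudes over the given frames to obtain Dt."""
    if not frames:
        raise ValueError("at least one frame is required")
    num_frames = len(frames)
    sums = [0.0] * len(frames[0])
    # Integrate the amplitude of each frequency bin over all frames.
    for frame in frames:
        for i, amplitude in enumerate(frame):
            sums[i] += amplitude
    # Divide by the number of frames to get the per-frequency average.
    return [s / num_frames for s in sums]


frames = [[1.0, 2.0, 4.0], [3.0, 2.0, 0.0]]
dt = average_amplitude_spectrum(frames)
single = average_amplitude_spectrum([frames[0]])
```

With a single frame the result equals the input frame, which matches the statement that a frame count of 1 reproduces the FFT output.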

The control unit 109 determines whether the amplitude spectrum Dt output from the integration circuit 250a is equal to or less than the amplitude spectrum Dt2 (S1001). When the amplitude spectrum Dt is determined to be larger than the amplitude spectrum Dt2 (No in S1001), the control unit 109 determines whether the noise profile Pt stored in the profile storage unit 253a is equal to or less than a first value Pmax (S1002). The first value Pmax is a threshold for limiting the enlargement correction of the noise profile Pt. It is used to prevent the unnatural sound that results from reducing drive noise excessively.

As shown in FIG. 8, during the period from timing t2 to timing t3, the amplitude spectrum Dt becomes larger than the amplitude spectrum Dt2 as the drive noise increases. Consequently, merely causing the amplitude spectrum subtraction unit 211a to perform the subtraction processing using the noise profile Pt generated by the profile creation unit 252a does not reduce the drive noise corresponding to the difference between the amplitude spectrum Dt and the amplitude spectrum Dt2. Therefore, when the noise profile Pt is determined to be equal to or less than the first value Pmax (Yes in S1002), the control unit 109 causes the profile enlargement unit 271a to perform enlargement correction of the noise profile Pt in accordance with the time constant inc(fi) (S1003). After the enlargement correction of the noise profile Pt has been performed, the control unit 109 performs the processing of S1004.

When the noise profile Pt is determined not to be equal to or less than the first value Pmax (No in S1002), the control unit 109 does not cause the profile enlargement unit 271a to perform enlargement correction of the noise profile Pt, in order to avoid reducing drive noise excessively, and proceeds to the processing of S1004. When the amplitude spectrum Dt is determined to be equal to or less than the amplitude spectrum Dt2 (Yes in S1001), the control unit 109 likewise performs the processing of S1004.

The control unit 109 determines whether the amplitude spectrum Ut output from the amplitude spectrum subtraction unit 254a is equal to or greater than a second value Umin (S1004). The second value Umin is a threshold that limits the reduction correction of the noise profile Pt. It is the noise floor level, that is, the minimum noise value that is recorded even when no sound is input to the audio input unit 102. When the amplitude spectrum Ut is determined to be equal to or greater than the second value Umin (Yes in S1004), the control unit 109 does not cause the profile reduction unit 272a to perform reduction correction of the noise profile Pt, and ends the noise profile correction processing.

As shown in FIG. 9, during the period from timing t2 to timing t3, the amplitude spectrum Ut becomes smaller than the second value Umin as the drive noise decreases. Consequently, merely causing the amplitude spectrum subtraction unit 254a to perform the subtraction processing using the noise profile Pt created by the profile creation unit 252a may erase the sound corresponding to the difference between the amplitude spectrum Ut and the second value Umin. Therefore, when the amplitude spectrum Ut is determined not to be equal to or greater than the second value Umin (No in S1004), the control unit 109 causes the profile reduction unit 272a to perform reduction correction of the noise profile Pt in accordance with the time constant dec(fi) (S1005). After the reduction correction of the noise profile Pt has been performed, the control unit 109 ends the noise profile correction processing.

The noise profile correction processing of FIG. 10 has been described using, as an example, the case where the profile correction unit 256a corrects the noise profile created by the profile creation unit 252a. The profile correction unit 256b corrects the noise profile created by the profile creation unit 252b in the same manner as in the noise profile correction processing of FIG. 10.
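The flow of FIG. 10 (S1001 to S1005) can be summarized in a short sketch for one frequency bin. The branch structure follows the description above. How a time constant translates into a per-frame change of Pt is not stated in the text, so the update rule used here (changing Pt by 1/inc or 1/dec of its current value) is purely an assumption for illustration, as is the function name `correct_noise_profile`.

```python
def correct_noise_profile(dt, dt2, pt, ut, p_max, u_min, inc, dec):
    """One pass of the FIG. 10 flow for a single frequency bin; returns the new Pt."""
    # S1001: is the averaged amplitude Dt at or below Dt2?
    if dt > dt2:
        # S1002: enlarge only while Pt is at or below the limit Pmax.
        if pt <= p_max:
            # S1003 (assumed update rule): grow Pt by 1/inc of itself.
            pt += pt / inc
    # S1004: is the subtracted output Ut at or above the noise floor Umin?
    if ut < u_min:
        # S1005 (assumed update rule): shrink Pt by 1/dec of itself.
        pt -= pt / dec
    return pt


# Drive noise has grown: Pt is enlarged.
enlarged = correct_noise_profile(dt=2.0, dt2=1.0, pt=1.0, ut=0.5,
                                 p_max=4.0, u_min=0.1, inc=4.0, dec=8.0)
# Output fell below the noise floor: Pt is reduced.
reduced = correct_noise_profile(dt=0.5, dt2=1.0, pt=1.0, ut=0.05,
                                p_max=4.0, u_min=0.1, inc=4.0, dec=8.0)
# Pt already exceeds Pmax: no enlargement.
capped = correct_noise_profile(dt=2.0, dt2=1.0, pt=5.0, ut=0.5,
                               p_max=4.0, u_min=0.1, inc=4.0, dec=8.0)
```

With this assumed rule, a smaller time constant produces a larger per-frame step, which matches the idea that a small time constant follows changes quickly.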

Next, a method of setting the time constant inc(fi) for the enlargement correction of the noise profile Pt by the profile enlargement unit 271a and the time constant dec(fi) for the reduction correction of the noise profile Pt by the profile reduction unit 272a will be described with reference to FIG. 11.

FIG. 11(A) is a diagram showing the characteristics of drive noise for each frequency. FIG. 11(B) is a diagram showing the setting of the time constant inc(fi) according to frequency when enlargement correction of the noise profile Pt is performed. FIG. 11(C) is a diagram showing the setting of the time constant dec(fi) according to frequency when reduction correction of the noise profile Pt is performed.

In FIG. 11(A), 1101 denotes the amplitude spectrum obtained while the imaging apparatus 100 performs a zoom operation, shown as a 512-point amplitude spectrum. 1102 denotes the variation in the amplitude spectrum of the drive noise while the imaging apparatus 100 performs a zoom operation. As 1102 shows, the higher the frequency band, the larger the variation in drive noise during the zoom operation.

Accordingly, as shown in FIG. 11(B), when the profile enlargement unit 271a performs enlargement correction of the noise profile Pt, the time constant inc(fi) is set to be smaller as the frequency band becomes higher. This makes the enlargement correction of the noise profile Pt follow changes in drive noise quickly, which prevents drive noise from remaining unreduced.

Likewise, as shown in FIG. 11(C), when the profile reduction unit 272a performs reduction correction of the noise profile Pt, the time constant dec(fi) is set to be larger as the frequency band becomes higher. This makes the reduction correction of the noise profile Pt follow changes in drive noise slowly, which also prevents drive noise from remaining unreduced.

In the present embodiment, the time constant dec(fi) used for reduction correction of the noise profile Pt is set larger than the time constant inc(fi) used for enlargement correction of the noise profile Pt.
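The three constraints on the time constants (inc(fi) falls with frequency, dec(fi) rises with frequency, and dec(fi) is larger than inc(fi)) can be sketched as below. The text does not give the actual curves of FIG. 11, so the linear ramps and the numeric ranges here are hypothetical placeholders, as is the function name `time_constants`.

```python
def time_constants(num_bins, inc_range=(8.0, 2.0), dec_range=(16.0, 64.0)):
    """Return (inc, dec): one time constant per frequency bin, low to high frequency."""
    def ramp(start, end):
        # Linear interpolation from the lowest bin to the highest bin (an assumption).
        if num_bins == 1:
            return [start]
        step = (end - start) / (num_bins - 1)
        return [start + step * i for i in range(num_bins)]

    inc = ramp(*inc_range)  # becomes smaller toward high frequencies
    dec = ramp(*dec_range)  # becomes larger toward high frequencies
    return inc, dec


# 512 bins, matching the 512-point spectrum mentioned for FIG. 11(A).
inc, dec = time_constants(512)
```

Any monotonic curve satisfying the same three constraints would serve equally well for this illustration.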

After the IFFT 214a has converted the signal back into a time-series audio signal, the noise application unit 215a applies a noise signal to the audio signal supplied from the IFFT 214a. The noise application unit 215a applies the noise signal in order to prevent the unnatural sound caused by excessive reduction of drive noise by the noise reduction unit 200a. The noise signal applied by the noise application unit 215a is a signal at the noise floor level. This allows the subtraction processing by the amplitude spectrum subtraction unit 254a to prioritize the reduction of drive noise.

FIG. 12 is a diagram showing an example of the relationship between an external sound source 1201 and the audio input unit 102. As shown in FIG. 12, when the external sound source 1201 is sufficiently far from the imaging apparatus 100, the distance between the external sound source 1201 and the R channel audio input unit 102a is substantially the same as the distance between the external sound source 1201 and the L channel audio input unit 102b. The difference between the environmental sound acquired by the microphone 205a and the environmental sound acquired by the microphone 205b is therefore small.

However, the influence of drive noise differs between the two channels because the distance between the optical lens 201 and the R channel audio input unit 102a differs from the distance between the optical lens 201 and the L channel audio input unit 102b. The influence of drive noise on the R channel audio input unit 102a and the influence of drive noise on the L channel audio input unit 102b therefore need to be considered separately.

As the following equation shows, the difference between the influence of drive noise on the R channel audio input unit 102a and the influence of drive noise on the L channel audio input unit 102b becomes large. In the equation, DtL is the amplitude value of the L (Left) channel before noise reduction processing is performed, DtR is the amplitude value of the R (Right) channel before noise reduction processing is performed, and βt is the left-right correlation amplitude spectrum.
βt = |DtL − DtR| / (DtL + DtR)

For environmental sound, the louder the volume, the larger the difference between Lch and Rch. In a case such as that of FIG. 12, however, the distance between the external sound source 1201 and the R channel audio input unit 102a is substantially the same as the distance between the external sound source 1201 and the L channel audio input unit 102b, so the left-right correlation amplitude spectrum βt is small. For drive noise, the left-right correlation amplitude spectrum βt is large because of the difference between the distance from the optical lens 201 to the R channel audio input unit 102a and the distance from the optical lens 201 to the L channel audio input unit 102b. The left-right correlation amplitude spectrum βt therefore makes it possible to determine whether drive noise is dominant over environmental sound.
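The left-right correlation amplitude spectrum can be computed per frequency bin as in the following sketch. The formula is the one given in the text, βt = |DtL − DtR| / (DtL + DtR); the function name `left_right_beta` and the handling of the case where both amplitudes are zero are assumptions added for illustration.

```python
def left_right_beta(dt_l, dt_r):
    """Compute beta_t = |DtL - DtR| / (DtL + DtR) for one frequency bin."""
    total = dt_l + dt_r
    if total == 0:
        # Assumption: with no signal on either channel, treat the channels as identical.
        return 0.0
    return abs(dt_l - dt_r) / total


# Distant source: both channels pick up nearly the same amplitude.
beta_balanced = left_right_beta(1.0, 1.0)
# Nearby source such as the lens drive: one channel is clearly louder.
beta_unbalanced = left_right_beta(3.0, 1.0)
```

Equal amplitudes give a value of 0, while a strongly one-sided amplitude pushes the value toward 1.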

Next, noise profile correction processing that takes into account both the influence of drive noise on Rch and the influence of drive noise on Lch will be described with reference to FIGS. 13 and 14.

FIG. 13 is a timing chart showing processing for correcting the noise profiles for Rch and Lch. The timings t1, t2, and t3 in FIG. 13 are the same as t1, t2, and t3 in FIGS. 5 and 6, and their description is omitted.
In FIG. 13, the amplitude spectrum at a predetermined frequency fi obtained by the fast Fourier transform in the FFT 207a is denoted ItR, and the amplitude spectrum at the predetermined frequency fi obtained by the fast Fourier transform in the FFT 207b is denoted ItL. The amplitude spectrum ItR is indicated by a dotted line, and the amplitude spectrum ItL by a solid line. The amplitude spectrum indicating the amplitude at the predetermined frequency fi integrated by the integration circuit 250a is denoted DtR, and the amplitude spectrum indicating the amplitude at the predetermined frequency fi integrated by the integration circuit 250b is denoted DtL. The amplitude spectrum DtR is indicated by a dotted line, and the amplitude spectrum DtL by a solid line. The noise profile corresponding to the predetermined frequency fi created by the profile creation unit 252a is denoted PtR, and the noise profile corresponding to the predetermined frequency fi created by the profile creation unit 252b is denoted PtL. The noise profile PtR is indicated by a dotted line, and the noise profile PtL by a solid line. At timing t2, the profile creation unit 252a creates the noise profile PtR and the profile creation unit 252b creates the noise profile PtL.

In FIG. 13, the amplitude spectrum at the predetermined frequency fi output from the amplitude spectrum subtraction unit 254a is denoted UtR, and the amplitude spectrum at the predetermined frequency fi output from the amplitude spectrum subtraction unit 254b is denoted UtL. The amplitude spectrum UtR is indicated by a dotted line, and the amplitude spectrum UtL by a solid line. The time-series digital audio signal at the predetermined frequency fi after the noise application unit 215a has applied the noise signal is denoted NtR, and the time-series digital audio signal at the predetermined frequency fi after the noise application unit 215b has applied the noise signal is denoted NtL. NtR is indicated by a dotted line, and NtL by a solid line. The absolute value of the difference between the amplitude spectrum ItL and the amplitude spectrum ItR, |ItL − ItR|, is indicated by a solid line. The absolute value of the difference between the amplitude spectrum UtL and the amplitude spectrum UtR, |UtL − UtR|, is indicated by a dotted line.

As shown in FIG. 13, |UtL − UtR| may exceed |ItL − ItR|. This indicates that drive noise has been reduced excessively by the subtraction processing in one of the amplitude spectra UtL and UtR, which happens because one of the noise profiles PtL and PtR is too large.

FIG. 14 is a flowchart showing an example of the noise profile correction processing for Rch and Lch. This processing, performed by the control unit 109, is described next with reference to FIG. 14. The noise profile correction processing of FIG. 14 for Rch and Lch is performed after the noise profile correction processing of FIG. 10 has been performed on the noise profile PtR and on the noise profile PtL.

Thereafter, the control unit 109 detects the amplitude spectrum ItL, the amplitude spectrum ItR, the amplitude spectrum UtL, and the amplitude spectrum UtR, and determines whether the following condition is satisfied (S1401).
Condition: |ItL − ItR| ≤ |UtL − UtR|
When the condition |ItL − ItR| ≤ |UtL − UtR| is determined to be satisfied (Yes in S1401), the control unit 109 determines whether the amplitude spectrum UtL is equal to or greater than the amplitude spectrum UtR (S1402). When the amplitude spectrum UtL is equal to or greater than the amplitude spectrum UtR (Yes in S1402), the control unit 109 causes the profile enlargement unit 271b to perform enlargement correction of the noise profile PtL in accordance with the time constant inc_L(fi) (S1403). The time constant inc_L(fi) is the time constant corresponding to the profile enlargement unit 271b. The control unit 109 then causes the profile reduction unit 272a to perform reduction correction of the noise profile PtR in accordance with the time constant dec_R(fi) (S1404). The time constant dec_R(fi) is the time constant corresponding to the profile reduction unit 272a. After the processing of S1404 has been performed, the noise profile correction processing for Rch and Lch ends. The time constant dec_R(fi) is larger than the time constant inc_L(fi).

When the amplitude spectrum UtL is determined to be smaller than the amplitude spectrum UtR (No in S1402), the control unit 109 causes the profile enlargement unit 271a to perform enlargement correction of the noise profile PtR in accordance with the time constant inc_R(fi) (S1405). The time constant inc_R(fi) is the time constant corresponding to the profile enlargement unit 271a. The control unit 109 then causes the profile reduction unit 272b to perform reduction correction of the noise profile PtL in accordance with the time constant dec_L(fi) (S1406). The time constant dec_L(fi) is the time constant corresponding to the profile reduction unit 272b. After the processing of S1406 has been performed, the noise profile correction processing for Rch and Lch ends. The time constant dec_L(fi) is larger than the time constant inc_R(fi).

In this way, the control unit 109 corrects the noise profile PtR and the noise profile PtL in accordance with changes in environmental sound and drive noise. This allows the imaging apparatus 100 to perform the noise reduction processing on the Rch audio and the noise reduction processing on the Lch audio appropriately. The imaging apparatus 100 can therefore prevent the environmental sound from sounding unnatural because of residual drive noise or excessive reduction of drive noise.
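The FIG. 14 flow (S1401 to S1406) can be sketched for one frequency bin as follows. The branching follows the description above. Two points are assumptions made only for this illustration: the profiles are left unchanged when the S1401 condition is not satisfied, and each correction changes the profile by 1/inc or 1/dec of its current value, since the text does not state how a time constant maps to a step size. The function name `correct_stereo_profiles` is hypothetical.

```python
def correct_stereo_profiles(it_l, it_r, ut_l, ut_r, pt_l, pt_r,
                            inc_l, dec_l, inc_r, dec_r):
    """One pass of the FIG. 14 flow for a single bin; returns (PtL, PtR)."""
    # S1401: has subtraction widened the left-right difference?
    if abs(it_l - it_r) > abs(ut_l - ut_r):
        # Assumption: nothing to correct when the condition does not hold.
        return pt_l, pt_r
    if ut_l >= ut_r:            # S1402: Yes
        pt_l += pt_l / inc_l    # S1403: enlarge PtL (assumed update rule)
        pt_r -= pt_r / dec_r    # S1404: reduce PtR (assumed update rule)
    else:                       # S1402: No
        pt_r += pt_r / inc_r    # S1405: enlarge PtR (assumed update rule)
        pt_l -= pt_l / dec_l    # S1406: reduce PtL (assumed update rule)
    return pt_l, pt_r


# Inputs were balanced, but the subtracted outputs differ and L is the larger one.
corrected = correct_stereo_profiles(1.0, 1.0, 0.8, 0.2, 1.0, 1.0,
                                    inc_l=4.0, dec_l=8.0, inc_r=4.0, dec_r=8.0)
# The difference shrank after subtraction: the profiles stay as they are.
unchanged = correct_stereo_profiles(2.0, 1.0, 0.6, 0.5, 1.0, 1.0,
                                    inc_l=4.0, dec_l=8.0, inc_r=4.0, dec_r=8.0)
```

In the first call the channel with the larger residual has its profile enlarged while the other channel's profile is reduced, which pulls the two outputs back toward each other.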

[Noise Reduction Processing (S308)]
The noise reduction processing executed by the control unit 109 in S308 will be described with reference to FIGS. 15, 16, and 17.

FIG. 15 is a timing chart showing the noise reduction processing for Rch and Lch. The timings t1, t2, and t3 in FIG. 15 are the same as t1, t2, and t3 in FIGS. 5 and 6, and their description is omitted. ItR, ItL, DtR, DtL, PtR, PtL, UtR, UtL, NtR, and NtL in FIG. 15 are the same as ItR, ItL, DtR, DtL, PtR, PtL, UtR, UtL, NtR, and NtL in FIG. 13, and their description is omitted.

If the environmental sound or the drive noise changes abruptly while the imaging apparatus 100 is performing a zoom operation, drive noise may remain or the environmental sound may sound unnatural even when drive noise is reduced using the noise profile PtR and the noise profile PtL. To prevent this, the control unit 109 performs the noise reduction processing in accordance with the left-right correlation amplitude spectrum βt.

FIG. 16 is a flowchart showing an example of the noise reduction processing. FIG. 17 is a diagram showing the relationship between a coefficient α and the environmental sound. The horizontal axis of FIG. 17 indicates the level of the environmental sound, and the vertical axis indicates the value of the coefficient α. In FIG. 17, the coefficient α is associated with the level of the environmental sound. The solid line 1701 in FIG. 17 indicates the value of the coefficient α corresponding to the level of the environmental sound. The broken line 1702 indicates the level of the drive noise, and the broken line 1703 indicates the level at which the drive noise is drowned out by the environmental sound. The coefficient α becomes smaller as the level of the environmental sound becomes larger. When the level of the environmental sound is at the level of the broken line 1702, the coefficient α is 0.125.
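A mapping of the kind shown in FIG. 17 could look like the sketch below. Only two properties come from the text: α decreases as the environmental sound level rises, and α is 0.125 at the drive noise level (broken line 1702). Everything else is a hypothetical placeholder chosen for illustration, including the piecewise-linear shape, the value of α at silence, the assumption that α reaches 0 at the masking level (broken line 1703), the level units, and the function name `alpha_for_level`.

```python
def alpha_for_level(level, drive_noise_level=1.0, masking_level=4.0,
                    alpha_at_silence=1.0, alpha_at_drive_noise=0.125):
    """Map an environmental sound level to the coefficient alpha (illustrative only)."""
    if level <= 0.0:
        return alpha_at_silence
    if level < drive_noise_level:
        # Assumed linear fall from the silence value down to 0.125.
        frac = level / drive_noise_level
        return alpha_at_silence + frac * (alpha_at_drive_noise - alpha_at_silence)
    if level < masking_level:
        # Assumed linear fall from 0.125 to 0 once the sound masks the drive noise.
        frac = (level - drive_noise_level) / (masking_level - drive_noise_level)
        return alpha_at_drive_noise * (1.0 - frac)
    return 0.0


alpha_at_noise_level = alpha_for_level(1.0)
```

The only anchored point is the value 0.125 at the drive noise level; the rest of the curve would need to be taken from FIG. 17 itself.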

Next, the noise reduction processing performed by the control unit 109 will be described with reference to FIGS. 16(a) and 17. The noise reduction processing of FIG. 16(a) is described using, as an example, the case where the amplitude spectrum subtraction unit 254a performs the subtraction processing.

At timing t1, the control unit 109 determines the coefficient α in accordance with the amplitude spectrum Dt1 stored in the memory 251a (S1601). The coefficient α is a coefficient by which the noise profile is multiplied. In S1601, the control unit 109 detects the level of the environmental sound in FIG. 17 that corresponds to the amplitude spectrum Dt1, and determines the value of the coefficient α corresponding to the detected level.

Next, the control unit 109 calculates the left-right correlation amplitude spectrum βt as described above (S1602). The control unit 109 then determines whether the left-right correlation amplitude spectrum βt is equal to or less than a third value βth (S1603). The third value βth is set in accordance with the value of the left-right correlation amplitude spectrum βt calculated when there is no environmental sound. The larger the level of the environmental sound, the closer the left-right correlation amplitude spectrum βt is to 0. When drive noise is dominant over the environmental sound, the left-right correlation amplitude spectrum βt is 0.2 or more.

When the left-right correlation amplitude spectrum βt is determined to be equal to or less than the third value βth (Yes in S1603), the control unit 109 performs the processing of S1604. In S1604, the control unit 109 controls the amplitude spectrum subtraction unit 254a so as to multiply the noise profile PtR by the coefficient α determined in S1601 and subtract the product from the amplitude spectrum ItR. When the subtraction processing is performed by the amplitude spectrum subtraction unit 254a in S1604, the amplitude spectrum UtR output from the amplitude spectrum subtraction unit 254a is given by the following equation.
UtR = ItR − α · PtR

When the left-right correlation amplitude spectrum βt is determined to be larger than the third value βth (No in S1603), the control unit 109 performs the processing of S1605. In S1605, the control unit 109 controls the amplitude spectrum subtraction unit 254a so as to multiply the first value Pmax by the coefficient α determined in S1601 and subtract the product from the amplitude spectrum ItR. When the subtraction processing is performed by the amplitude spectrum subtraction unit 254a in S1605, the amplitude spectrum UtR output from the amplitude spectrum subtraction unit 254a is given by the following equation.
UtR = ItR − α · Pmax

That is, when the left-right correlation amplitude spectrum βt is determined not to be equal to or less than the third value βth (No in S1603), the control unit 109 does not use the noise profile PtR. The noise reduction processing of FIG. 16(a) has been described using, as an example, the case where the amplitude spectrum subtraction unit 254a performs the subtraction processing. When the amplitude spectrum subtraction unit 254b performs the subtraction processing, drive noise is reduced in the same manner as in the noise reduction processing of FIG. 16(a).
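The branch between S1604 and S1605 in FIG. 16(a) can be sketched for one Rch frequency bin as follows. The two subtraction formulas are those given in the text; the coefficient α and the left-right correlation value are taken as inputs, since they are determined in S1601 and S1602. The function name `reduce_noise_bin` and the sample values are hypothetical, and the sketch does not address what happens if the result becomes negative, because the text does not specify it.

```python
def reduce_noise_bin(it_r, pt_r, alpha, beta, beta_th, p_max):
    """Return UtR for one frequency bin following S1603 to S1605 of FIG. 16(a)."""
    if beta <= beta_th:
        # S1604: subtract the scaled noise profile, UtR = ItR - alpha * PtR.
        return it_r - alpha * pt_r
    # S1605: drive noise dominates, so subtract the scaled limit instead,
    # UtR = ItR - alpha * Pmax, and the noise profile PtR is not used.
    return it_r - alpha * p_max


# beta at or below the threshold: the noise profile is used.
ut_profile = reduce_noise_bin(it_r=2.0, pt_r=1.0, alpha=0.5,
                              beta=0.1, beta_th=0.2, p_max=3.0)
# beta above the threshold: Pmax is used in place of the profile.
ut_limit = reduce_noise_bin(it_r=2.0, pt_r=1.0, alpha=0.5,
                            beta=0.3, beta_th=0.2, p_max=3.0)
```

The Lch bin would be handled the same way with PtL and ItL in place of PtR and ItR.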

Next, the noise reduction processing performed by the control unit 109 will be described with reference to FIGS. 16(b) and 17. The noise reduction processing of FIG. 16(b) is described using, as an example, the case where the amplitude spectrum subtraction unit 254a performs the subtraction processing.

S1602, S1603, and S1604 in FIG. 16B are the same processes as S1602, S1603, and S1604 in FIG. 16A, so their description is omitted. The control unit 109 determines the coefficient α according to the amplitude spectrum Ut−1 obtained after the subtraction process of the previous frame (S1606). In S1606, the control unit 109 detects the environmental sound level in FIG. 17 that corresponds to the amplitude spectrum Ut−1 and determines the value of the coefficient α corresponding to the detected level. When the amplitude spectrum subtraction unit 254a performs the subtraction, the control unit 109 determines the coefficient α in S1606 according to the amplitude spectrum Ut−1R obtained after the subtraction process that the amplitude spectrum subtraction unit 254a performed on the previous frame. Thereafter, the control unit 109 performs the processes of S1602 and S1603. When it is determined that the left-right correlation amplitude spectrum βt is equal to or smaller than the third value βth (Yes in S1603), the control unit 109 performs the process of S1604. When it is determined that the left-right correlation amplitude spectrum βt is not equal to or smaller than the third value βth (No in S1603), the control unit 109 performs the process of S1607.

In S1607, the control unit 109 multiplies the second value Umin by the coefficient α determined in S1606 and controls the amplitude spectrum subtraction unit 254a to subtract the product from the amplitude spectrum ItR. When the amplitude spectrum subtraction unit 254a performs this subtraction in S1607, the amplitude spectrum UtR that it outputs is given by the following equation.
UtR = ItR − α · Umin

When it is determined that the left-right correlation amplitude spectrum βt is not equal to or smaller than the third value βth (No in S1603), the control unit 109 does not use the noise profile PtR. The noise reduction process of FIG. 16B has been described taking as an example the case where the amplitude spectrum subtraction unit 254a performs the subtraction. When the amplitude spectrum subtraction unit 254b performs the subtraction, the drive noise is reduced in the same manner as in the noise reduction process of FIG. 16B.
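The frame-to-frame feedback of FIG. 16B, in which the coefficient α for the current frame depends on the previous frame's output Ut−1R, can be sketched as below. The sketch rests on stated assumptions: the level-to-α relation of FIG. 17 is not reproduced in this passage, so it is passed in as a caller-supplied function (`alpha_for_level`, a hypothetical name); S1604 is assumed to subtract α · PtR; and the value used as "previous output" for the very first frame is an arbitrary choice.

```python
def reduce_frame_16b(it_r, pt_r, prev_ut_r, beta_t, beta_th, u_min, alpha_for_level):
    """Return UtR for one amplitude value, following the FIG. 16B branch."""
    # S1606: choose alpha from the previous frame's subtracted output.
    # The actual mapping is the one shown in FIG. 17; here it is supplied
    # by the caller because its shape is not given in this passage.
    alpha = alpha_for_level(prev_ut_r)
    if beta_t <= beta_th:
        # Yes in S1603 -> S1604. ASSUMPTION: subtract alpha * PtR.
        return it_r - alpha * pt_r
    # No in S1603 -> S1607: PtR is not used; the second value Umin is used.
    return it_r - alpha * u_min


def reduce_sequence_16b(frames, initial_ut_r, beta_th, u_min, alpha_for_level):
    """Process (ItR, PtR, beta_t) tuples in order, feeding each output forward."""
    prev_ut_r = initial_ut_r  # ASSUMPTION: seed value for the first frame
    outputs = []
    for it_r, pt_r, beta_t in frames:
        prev_ut_r = reduce_frame_16b(
            it_r, pt_r, prev_ut_r, beta_t, beta_th, u_min, alpha_for_level
        )
        outputs.append(prev_ut_r)
    return outputs
```

Passing the FIG. 17 relation as a function keeps the sketch independent of whether that relation is a lookup table or a formula.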

Although the noise reduction process has been described with reference to FIGS. 16A and 16B, it is sufficient for the control unit 109 to perform either one of the noise reduction processes of FIGS. 16A and 16B.

As described above, the control unit 109 changes the process for reducing noise according to the left-right correlation amplitude spectrum βt. As a result, the imaging apparatus 100 can reduce the drive noise appropriately depending on whether the drive noise is dominant over the environmental sound.

[Post-processing (S310)]
The post-processing executed by the control unit 109 in S310 will be described with reference to FIGS. 18 and 19.

FIG. 18 is a timing chart showing the post-processing. t1, t2, and t3 in FIG. 18 are the same as t1, t2, and t3 in FIGS. 5 and 6, so their description is omitted. UtR and UtL in FIG. 18 are the same as UtR and UtL in FIG. 13, so their description is omitted. The amplitude spectrum Qt in FIG. 18 is the amplitude spectrum output after the post-processing is performed.

Next, the post-processing performed by the control unit 109 will be described with reference to FIGS. 19A and 18A. When the amplitude spectrum UtR is output from the amplitude spectrum subtraction unit 254a and the amplitude spectrum UtL is output from the amplitude spectrum subtraction unit 254b, the control unit 109 determines whether the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (S1901).

When it is determined that the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (Yes in S1901), the control unit 109 controls the post-correction unit 255b to output the amplitude spectrum UtL to the IFFT 214b as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255a to output the amplitude spectrum UtL, instead of the amplitude spectrum UtR, to the IFFT 214a as the amplitude spectrum Qt (S1902). After the process of S1902 is performed, the control unit 109 ends the post-processing.

When it is determined that the amplitude spectrum UtL is larger than the amplitude spectrum UtR (No in S1901), the control unit 109 controls the post-correction unit 255a to output the amplitude spectrum UtR to the IFFT 214a as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255b to output the amplitude spectrum UtR, instead of the amplitude spectrum UtL, to the IFFT 214b as the amplitude spectrum Qt (S1903). After the process of S1903 is performed, the control unit 109 ends the post-processing.

When the post-processing of FIG. 19A is performed, the smaller of the amplitude spectrum UtL and the amplitude spectrum UtR is input to both the IFFT 214a and the IFFT 214b as the amplitude spectrum Qt, as shown in FIG. 18A.
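The selection in S1901 to S1903 reduces to taking the smaller of the two subtracted amplitudes and sending it to both channels. A minimal sketch for one amplitude value follows; the function name is hypothetical, and treating UtL and UtR as single numbers (for example, one frequency bin of one frame) is an assumption made for illustration.

```python
def post_process_19a(ut_l, ut_r):
    """Return Qt, the value supplied to both IFFT 214a and IFFT 214b."""
    if ut_l <= ut_r:
        # Yes in S1901 -> S1902: UtL is output to both IFFTs.
        return ut_l
    # No in S1901 -> S1903: UtR is output to both IFFTs.
    return ut_r
```

Because both channels receive the same Qt, the left and right amplitudes are equal after this step; only the phase supplied to each IFFT differs.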

Next, the post-processing performed by the control unit 109 will be described with reference to FIGS. 19B and 18B. When the amplitude spectrum UtR is output from the amplitude spectrum subtraction unit 254a and the amplitude spectrum UtL is output from the amplitude spectrum subtraction unit 254b, the control unit 109 performs the process of S1910. In S1910, the control unit 109 determines whether at least one of the amplitude spectrum UtL and the amplitude spectrum UtR is equal to or smaller than the fourth value Qmin. The fourth value Qmin is used to prevent a sense of incongruity caused by the post-processing. The fourth value Qmin may be the same value as the second value Umin.

When at least one of the amplitude spectrum UtL and the amplitude spectrum UtR is equal to or smaller than the fourth value Qmin (Yes in S1910), the control unit 109 determines whether the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (S1914). When it is determined that the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (Yes in S1914), the control unit 109 controls the post-correction unit 255a to output the amplitude spectrum UtR to the IFFT 214a as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255b to output the amplitude spectrum UtR, instead of the amplitude spectrum UtL, to the IFFT 214b as the amplitude spectrum Qt (S1915). After the process of S1915 is performed, the control unit 109 ends the post-processing. When it is determined that the amplitude spectrum UtL is larger than the amplitude spectrum UtR (No in S1914), the control unit 109 controls the post-correction unit 255b to output the amplitude spectrum UtL to the IFFT 214b as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255a to output the amplitude spectrum UtL, instead of the amplitude spectrum UtR, to the IFFT 214a as the amplitude spectrum Qt (S1916). After the process of S1916 is performed, the control unit 109 ends the post-processing.

When both the amplitude spectrum UtL and the amplitude spectrum UtR are larger than the fourth value Qmin (No in S1910), the control unit 109 determines whether the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (S1911).

When it is determined that the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (Yes in S1911), the control unit 109 controls the post-correction unit 255b to output the amplitude spectrum UtL to the IFFT 214b as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255a to output the amplitude spectrum UtL, instead of the amplitude spectrum UtR, to the IFFT 214a as the amplitude spectrum Qt (S1912). After the process of S1912 is performed, the control unit 109 ends the post-processing.

When it is determined that the amplitude spectrum UtL is larger than the amplitude spectrum UtR (No in S1911), the control unit 109 controls the post-correction unit 255a to output the amplitude spectrum UtR to the IFFT 214a as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255b to output the amplitude spectrum UtR, instead of the amplitude spectrum UtL, to the IFFT 214b as the amplitude spectrum Qt (S1913). After the process of S1913 is performed, the control unit 109 ends the post-processing.

First, consider the case where the post-processing of FIG. 19B is performed and both the amplitude spectrum UtL and the amplitude spectrum UtR are larger than the fourth value Qmin. In this case, the smaller of the amplitude spectrum UtL and the amplitude spectrum UtR is input to both the IFFT 214a and the IFFT 214b as the amplitude spectrum Qt, as shown in FIG. 18B.

Next, consider the case where the post-processing of FIG. 19B is performed and at least one of the amplitude spectrum UtL and the amplitude spectrum UtR is equal to or smaller than the fourth value Qmin. In this case, the larger of the amplitude spectrum UtL and the amplitude spectrum UtR is input to both the IFFT 214a and the IFFT 214b as the amplitude spectrum Qt, as shown in FIG. 18B.
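The two cases of FIG. 19B can be summarized in a short sketch. The function name is hypothetical, the inputs are treated as single amplitude values for illustration, and "any one of" in S1910 is read as "at least one", which matches the No branch being the case where both values exceed Qmin.

```python
def post_process_19b(ut_l, ut_r, q_min):
    """Return Qt, the value supplied to both IFFT 214a and IFFT 214b."""
    if ut_l <= q_min or ut_r <= q_min:
        # Yes in S1910 -> S1914 to S1916: the larger value is output,
        # so a channel that was subtracted down to Qmin or below is not
        # copied to the other channel.
        return max(ut_l, ut_r)
    # No in S1910 -> S1911 to S1913: the smaller value is output.
    return min(ut_l, ut_r)
```

With `q_min` set to the same value as Umin, which the text permits, the floor used in the subtraction and the floor used here coincide.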

Next, the post-processing performed by the control unit 109 will be described with reference to FIGS. 19C and 18C. When the amplitude spectrum UtR is output from the amplitude spectrum subtraction unit 254a and the amplitude spectrum UtL is output from the amplitude spectrum subtraction unit 254b, the control unit 109 performs the process of S1921. In S1921, the control unit 109 calculates ΔtL and ΔtR, and then calculates |ΔtL − ΔtR|. ΔtL is the difference between the amplitude spectrum ItL and the amplitude spectrum UtL, and ΔtR is the difference between the amplitude spectrum ItR and the amplitude spectrum UtR. The control unit 109 further determines whether the following condition is satisfied.
|ΔtL − ΔtR| ≤ |ΔtL − ΔtR|max
Note that |ΔtL − ΔtR|max is a predetermined threshold used to prevent a sense of incongruity between the left and right environmental sound caused by the difference between ΔtL and ΔtR.

When it is determined that the condition |ΔtL − ΔtR| ≤ |ΔtL − ΔtR|max is satisfied (Yes in S1921), the control unit 109 determines whether the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (S1922). When it is determined that the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (Yes in S1922), the control unit 109 controls the post-correction unit 255b to output the amplitude spectrum UtL to the IFFT 214b as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255a to output the amplitude spectrum UtL, instead of the amplitude spectrum UtR, to the IFFT 214a as the amplitude spectrum Qt (S1923). After the process of S1923 is performed, the control unit 109 ends the post-processing. When it is determined that the amplitude spectrum UtL is larger than the amplitude spectrum UtR (No in S1922), the control unit 109 controls the post-correction unit 255a to output the amplitude spectrum UtR to the IFFT 214a as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255b to output the amplitude spectrum UtR, instead of the amplitude spectrum UtL, to the IFFT 214b as the amplitude spectrum Qt (S1924). After the process of S1924 is performed, the control unit 109 ends the post-processing.

When it is determined that the condition |ΔtL − ΔtR| ≤ |ΔtL − ΔtR|max is not satisfied (No in S1921), the control unit 109 determines whether the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (S1925). When it is determined that the amplitude spectrum UtL is equal to or smaller than the amplitude spectrum UtR (Yes in S1925), the control unit 109 controls the post-correction unit 255a to output the amplitude spectrum UtR to the IFFT 214a as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255b to output the amplitude spectrum UtR, instead of the amplitude spectrum UtL, to the IFFT 214b as the amplitude spectrum Qt (S1926). After the process of S1926 is performed, the control unit 109 ends the post-processing. When it is determined that the amplitude spectrum UtL is larger than the amplitude spectrum UtR (No in S1925), the control unit 109 controls the post-correction unit 255b to output the amplitude spectrum UtL to the IFFT 214b as the amplitude spectrum Qt. The control unit 109 then controls the post-correction unit 255a to output the amplitude spectrum UtL, instead of the amplitude spectrum UtR, to the IFFT 214a as the amplitude spectrum Qt (S1927). After the process of S1927 is performed, the control unit 109 ends the post-processing.

First, consider the case where the post-processing of FIG. 19C is performed and |ΔtL − ΔtR| ≤ |ΔtL − ΔtR|max holds. In this case, the smaller of the amplitude spectrum UtL and the amplitude spectrum UtR is input to both the IFFT 214a and the IFFT 214b as the amplitude spectrum Qt, as shown in FIG. 18C.
Next, consider the case where the post-processing of FIG. 19C is performed and |ΔtL − ΔtR| ≤ |ΔtL − ΔtR|max does not hold. In this case, the larger of the amplitude spectrum UtL and the amplitude spectrum UtR is input to both the IFFT 214a and the IFFT 214b as the amplitude spectrum Qt, as shown in FIG. 18C.
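The rule of FIG. 19C compares how much was subtracted from each channel. A minimal sketch under stated assumptions follows: the function and parameter names are hypothetical, the inputs are single amplitude values, and `delta_max` stands for the predetermined threshold |ΔtL − ΔtR|max.

```python
def post_process_19c(it_l, it_r, ut_l, ut_r, delta_max):
    """Return Qt, the value supplied to both IFFT 214a and IFFT 214b."""
    delta_l = it_l - ut_l  # amount subtracted from the L channel
    delta_r = it_r - ut_r  # amount subtracted from the R channel
    if abs(delta_l - delta_r) <= delta_max:
        # Yes in S1921: the smaller of UtL and UtR is output.
        return min(ut_l, ut_r)
    # No in S1921: the larger of UtL and UtR is output.
    return max(ut_l, ut_r)
```

For example, with ItL = ItR = 10, UtL = 8 and UtR = 7, the subtracted amounts are 2 and 3, so |ΔtL − ΔtR| = 1; a threshold of 2 selects the smaller value 7, while a threshold of 0.5 selects the larger value 8.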

Although the post-processing has been described with reference to FIGS. 19A, 19B, and 19C, it is sufficient for the control unit 109 to perform any one of the post-processing procedures of FIGS. 19A, 19B, and 19C.

After any one of the post-processing procedures of FIGS. 19A, 19B, and 19C is performed, the IFFT 214a generates audio data in the original time-series format by performing an inverse fast Fourier transform on the amplitude spectrum Qt using the phase information supplied from the FFT 207a. Likewise, the IFFT 214b generates audio data in the original time-series format by performing an inverse fast Fourier transform on the amplitude spectrum Qt using the phase information supplied from the FFT 207b.
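Recombining an amplitude spectrum with separately stored phase information before the inverse transform can be sketched as follows. This is an illustration only: a naive O(N²) discrete Fourier transform from the Python standard library stands in for the FFT 207a/207b and IFFT 214a/214b units, and all function names are hypothetical.

```python
import cmath
import math


def dft(samples):
    """Naive forward DFT, standing in for the FFT units."""
    n = len(samples)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(samples))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT, standing in for the IFFT units."""
    n = len(spectrum)
    return [
        sum(c * cmath.exp(2j * math.pi * k * i / n) for k, c in enumerate(spectrum)) / n
        for i in range(n)
    ]


def resynthesize(q_t, phase):
    """Combine amplitude Qt with the channel's phase and return time samples."""
    spectrum = [q * cmath.exp(1j * p) for q, p in zip(q_t, phase)]
    return [value.real for value in idft(spectrum)]


# Round trip: with an unmodified amplitude the original frame is recovered.
frame = [0.0, 0.5, 1.0, 0.5, 0.0, -0.5, -1.0, -0.5]
bins = dft(frame)
amplitude = [abs(c) for c in bins]
phase = [cmath.phase(c) for c in bins]
restored = resynthesize(amplitude, phase)
```

In the flow described above, the same Qt would be passed to `resynthesize` twice, once with the R-channel phase and once with the L-channel phase, so the two output channels share an amplitude but keep their own phase.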

As described above, the control unit 109 performs processing that corrects the Rch audio and the Lch audio so that their levels match. As a result, the imaging apparatus 100 can prevent a sense of incongruity caused by a difference between the left and right environmental sound.

In the present embodiment, the imaging apparatus 100 has been described as having a configuration in which two channels of audio, Rch and Lch, are input. However, it may have a configuration in which audio of two or more channels is input. The imaging apparatus 100 may also have a configuration in which a single channel of audio is input.
(Other Embodiments)
The above-described embodiment can be realized by causing a computer (or a CPU, MPU, or the like) to execute software such as a program. In this case, the software is supplied to the computer (or the CPU, MPU, or the like) via a network or a storage medium.

Claims

An electronic device comprising:
a first microphone;
a second microphone;
input means for inputting a drive instruction for driving drive means;
first conversion means for acquiring audio spectrum data by performing a Fourier transform on audio data obtained from the first microphone;
second conversion means for acquiring audio spectrum data by performing a Fourier transform on audio data obtained from the second microphone;
first generation means for generating first noise spectrum data representing drive noise of the drive means, based on a difference between a first average value and a second average value, the first average value being an average value of the audio spectrum data obtained by the first conversion means converting the audio data obtained from the first microphone before the drive instruction is input, and the second average value being an average value of the audio spectrum data obtained by the first conversion means converting the audio data obtained from the first microphone after a predetermined stabilization period, lasting from the input of the drive instruction until the audio spectrum data stabilizes, has elapsed;
second generation means for generating second noise spectrum data representing the drive noise of the drive means, based on a difference between a third average value and a fourth average value, the third average value being an average value of the audio spectrum data obtained by the second conversion means converting the audio data obtained from the second microphone before the drive instruction is input, and the fourth average value being an average value of the audio spectrum data obtained by the second conversion means converting the audio data obtained from the second microphone after the predetermined stabilization period, lasting from the input of the drive instruction until the audio spectrum data stabilizes, has elapsed; and
control means for performing control such that the first noise spectrum data is used to reduce the drive noise when a value indicating a ratio of a difference between the audio spectrum data obtained from the first conversion means and the audio spectrum data obtained from the second conversion means to a sum of the audio spectrum data obtained from the first conversion means and the audio spectrum data obtained from the second conversion means is equal to or less than a predetermined value, and such that the first noise spectrum data is not used to reduce the drive noise when the value is not equal to or less than the predetermined value.

The electronic device according to claim 1, wherein the control means performs control such that the second noise spectrum data is used to reduce the drive noise when the value indicating the ratio of the difference between the audio spectrum data obtained from the first conversion means and the audio spectrum data obtained from the second conversion means to the sum of the audio spectrum data obtained from the first conversion means and the audio spectrum data obtained from the second conversion means is equal to or less than the predetermined value, and such that the second noise spectrum data is not used to reduce the drive noise when the value is not equal to or less than the predetermined value.

The electronic device according to claim 1 or 2, wherein the drive means performs control so as to move a lens.

The electronic device according to any one of claims 1 to 3, further comprising imaging means for capturing image data.

A control method for controlling an electronic device having a first microphone, a second microphone, and input means for inputting a drive instruction for driving drive means, the control method comprising:
a first conversion step of acquiring audio spectrum data by performing a Fourier transform on audio data obtained from the first microphone;
a second conversion step of acquiring audio spectrum data by performing a Fourier transform on audio data obtained from the second microphone;
a first generation step of generating first noise spectrum data representing drive noise of the drive means, based on a difference between a first average value and a second average value, the first average value being an average value of the audio spectrum data obtained by converting, in the first conversion step, the audio data obtained from the first microphone before the drive instruction is input, and the second average value being an average value of the audio spectrum data obtained by converting, in the first conversion step, the audio data obtained from the first microphone after a predetermined stabilization period, lasting from the input of the drive instruction until the audio spectrum data stabilizes, has elapsed;
a second generation step of generating second noise spectrum data representing the drive noise of the drive means, based on a difference between a third average value and a fourth average value, the third average value being an average value of the audio spectrum data obtained by converting, in the second conversion step, the audio data obtained from the second microphone before the drive instruction is input, and the fourth average value being an average value of the audio spectrum data obtained by converting, in the second conversion step, the audio data obtained from the second microphone after the predetermined stabilization period, lasting from the input of the drive instruction until the audio spectrum data stabilizes, has elapsed; and
a control step of performing control such that the first noise spectrum data is used to reduce the drive noise when a value indicating a ratio of a difference between the audio spectrum data obtained in the first conversion step and the audio spectrum data obtained in the second conversion step to a sum of the audio spectrum data obtained in the first conversion step and the audio spectrum data obtained in the second conversion step is equal to or less than a predetermined value, and such that the first noise spectrum data is not used to reduce the drive noise when the value is not equal to or less than the predetermined value.