JP2017229086A

JP2017229086A - Sound processing device and sound processing program

Info

Publication number: JP2017229086A
Application number: JP2017160246A
Authority: JP
Inventors: 岡崎　光宏; Mitsuhiro Okazaki; 光宏岡崎
Original assignee: Nikon Corp
Current assignee: Nikon Corp
Priority date: 2012-02-01
Filing date: 2017-08-23
Publication date: 2017-12-28
Anticipated expiration: 2033-01-31
Also published as: JP2013179585A; JP6369612B2; JP6610725B2; JP6197298B2; JP2018182751A

Abstract

PROBLEM TO BE SOLVED: To provide a sound processing device and a sound processing program capable of suppressing a displacement of a sound image due to noise reduction processing.SOLUTION: The sound processing device includes: a calculation unit for calculating a reference relation which is a relation between a first sound collected by a first sound collecting unit and a second sound collected by a second sound collecting unit of sounds collected by a plurality of sound collecting units; and a processing unit for processing the sound collected by the plurality of sound collecting units so that the relation between the first sound and the second sound is included in a predetermined range including the reference relation calculated by the calculation unit.SELECTED DRAWING: Figure 2

Description

本発明は、音処理装置および音処理プログラムに関するものである。 The present invention relates to a sound processing device and a sound processing program.

複数の集音装置を備えたステレオ録音が可能な撮像装置として、動画撮影時にオートフォーカス（以後、「ＡＦ」と略記する）等の駆動音の発生に合わせてノイズ低減処理を行うものがある。
ステレオ等の複数チャンネルの有する音信号の雑音を抑制する雑音抑制装置においては、ステレオ成分の雑音を抑制する技術が知られている（特許文献１等参照）。 2. Description of the Related Art As an imaging apparatus including a plurality of sound collectors capable of stereo recording, there is an apparatus that performs noise reduction processing in accordance with the generation of driving sound such as autofocus (hereinafter abbreviated as “AF”) during moving image shooting.
In a noise suppression apparatus that suppresses noise of sound signals of a plurality of channels such as stereo, a technique for suppressing noise of a stereo component is known (see Patent Document 1 and the like).

特開２００８−２８３３８５号公報JP 2008-283385 A

ところで、ステレオ録音時において、駆動音の発生に合わせてノイズ低減処理を行うと、ノイズ低減処理に起因して音信号のバランスが変化してしまうことがあり、その結果、音像が変位し、再生時に違和感を生じさせるという問題がある。 By the way, during stereo recording, if noise reduction processing is performed in accordance with the generation of driving sound, the balance of the sound signal may change due to the noise reduction processing. As a result, the sound image is displaced and reproduced. There is a problem that it sometimes causes discomfort.

本発明の課題は、ノイズ低減処理に伴う音像の変位を抑制できる音処理装置および音処理プログラムを提供することである。 The subject of this invention is providing the sound processing apparatus and sound processing program which can suppress the displacement of the sound image accompanying a noise reduction process.

本発明の音処理装置は、第１音データを出力する第１集音部と第２音データを出力する第２集音部とを有する集音部と、前記第１音データと前記第２音データとの、複数の周波数領域における振幅の比を算出する算出部と、前記第１音データと前記第２音データとのうち少なくとも一方から、一部の音データを除去する除去部と、少なくとも一方から前記一部の音データが除去された前記第１音データと前記第２音データの前記周波数領域における振幅の比を、除去前の前記第１音データと前記第２音データとの前記複数の周波数領域における振幅の比に近づける処理部とを備える構成とした。
本発明のプログラムは、第１集音部による第１音データと、第２集音部による第２音データとを入力する処理と、前記第１音データと前記第２音データとの、複数の周波数領域における振幅の比を算出する処理と、前記第１音データと前記第２音データとのうち少なくとも一方から、一部の音データを除去する処理と、少なくとも一方から前記一部の音データが除去された前記第１音データと前記第２音データの前記周波数領域における振幅の比を、除去前の前記第１音データと前記第２音データとの前記複数の周波数領域における振幅の比に近づける処理とをコンピュータに実行させる構成とした。 The sound processing apparatus of the present invention includes a sound collecting unit having a first sound collecting unit that outputs first sound data and a second sound collecting unit that outputs second sound data, the first sound data, and the second sound data. A calculation unit that calculates a ratio of amplitudes in a plurality of frequency regions to sound data; and a removal unit that removes part of sound data from at least one of the first sound data and the second sound data; The ratio of the amplitude in the frequency domain of the first sound data and the second sound data from which the partial sound data has been removed from at least one of the first sound data and the second sound data before removal is determined. And a processing unit that approximates the amplitude ratio in the plurality of frequency regions.
The program of the present invention includes a plurality of processes of inputting the first sound data by the first sound collection unit and the second sound data by the second sound collection unit, and the first sound data and the second sound data. Calculating a ratio of amplitudes in the frequency domain, a process of removing part of sound data from at least one of the first sound data and the second sound data, and the part of sound from at least one of the first sound data and the second sound data The ratio of the amplitude in the frequency domain of the first sound data and the second sound data from which the data has been removed is the amplitude ratio in the frequency domain of the first sound data and the second sound data before the removal. The computer is configured to execute the process of approaching the ratio.

本発明によれば、ノイズ低減処理に伴う音像の変位を抑制できる音処理装置および音処理プログラムを提供できる。 ADVANTAGE OF THE INVENTION According to this invention, the sound processing apparatus and sound processing program which can suppress the displacement of the sound image accompanying a noise reduction process can be provided.

本発明における音処理装置の一実施形態であるカメラを示し、（ａ）はそのブロック構成図、（ｂ）は概念正面図である。The camera which is one Embodiment of the sound processing apparatus in this invention is shown, (a) is the block block diagram, (b) is a conceptual front view. 音情報処理部におけるノイズ低減処理とその補正の説明図である。It is explanatory drawing of the noise reduction process in the sound information processing part, and its correction | amendment. 音情報処理部におけるノイズ低減処理と補正のフローチャートである。It is a flowchart of the noise reduction process and correction | amendment in a sound information processing part. 音像変位を説明する図である。It is a figure explaining a sound image displacement. 第２実施形態にかかる音情報処理部におけるノイズ低減処理と補正のフローチャートである。It is a flowchart of the noise reduction process and correction | amendment in the sound information processing part concerning 2nd Embodiment. ノイズ低減処理部分の前後の信号比変化と対応させた補正を説明する図である。It is a figure explaining the correction | amendment matched with the signal ratio change before and behind a noise reduction process part.

以下、図面等を参照して、本発明の実施形態について説明する。
（第１実施形態）
図１は、本発明における音処理装置の一実施形態であるカメラ１を示し、図１（ａ）はそのブロック構成図、図１（ｂ）はカメラ１の概念正面図である。
図１（ａ）に示すように、カメラ１は、カメラ本体１０と、レンズ鏡筒２０とにより構成されている。カメラ１は、自動的に合焦するオートフォーカス（以下ＡＦと略記する）機能を備えている。また、カメラ１は、静止画と動画の何れも撮影可能であって、動画撮影時には画像と同時に音をステレオで記録可能である。 Embodiments of the present invention will be described below with reference to the drawings.
(First embodiment)
FIG. 1 shows a camera 1 which is an embodiment of a sound processing apparatus according to the present invention. FIG. 1 (a) is a block diagram of the camera 1 and FIG. 1 (b) is a conceptual front view of the camera 1.
As shown in FIG. 1A, the camera 1 includes a camera body 10 and a lens barrel 20. The camera 1 has an autofocus (hereinafter abbreviated as AF) function for automatically focusing. In addition, the camera 1 can shoot both still images and moving images, and can record sound in stereo at the same time as images during moving image shooting.

カメラ本体１０は、撮像素子１１と、画像処理部１２と、ステレオ集音装置１３と、音情報処理部１４と、記憶部１５と、制御部１６と、出力部１８と、入力部１９とを備えている。
撮像素子１１は、ＣＣＤ等の光電変換素子により構成され、レンズ鏡筒２０の結像光学系によって結像された被写体像光を電気信号に変換する。
画像処理部１２は、撮像素子１１から出力されるアナログの画像情報をＡ／Ｄ変換すると共に画像処理して画像データを生成する。 The camera body 10 includes an image sensor 11, an image processing unit 12, a stereo sound collecting device 13, a sound information processing unit 14, a storage unit 15, a control unit 16, an output unit 18, and an input unit 19. I have.
The image sensor 11 is composed of a photoelectric conversion element such as a CCD, and converts the subject image light imaged by the imaging optical system of the lens barrel 20 into an electrical signal.
The image processing unit 12 A / D converts analog image information output from the image sensor 11 and performs image processing to generate image data.

ステレオ集音装置１３は、図１（ｂ）に示すように、左右一対のマイク（左マイク１３Ｌ，右マイク１３Ｒ）を備えている。左マイク１３Ｌと右マイク１３Ｒとは、カメラ１を横位置で構えた状態においてレンズ鏡筒２０の中心を通る鉛直線を挟む略対称位置に配置されている。各マイク１３Ｌ，１３Ｒは、それぞれ外部の音を集音してアナログ信号として検出し、音情報処理部１４に出力する。
音情報処理部１４は、ステレオ集音装置１３から入力される音信号をＡ／Ｄ変換してデジタル信号とすると共にノイズ低減処理を行う。音情報処理部１４は、ノイズ低減処理係る機能部として、ノイズ低減処理部１４Ａと、補正部１４Ｂと、を備えている。これらについては、後に詳述する。 As shown in FIG. 1B, the stereo sound collecting device 13 includes a pair of left and right microphones (a left microphone 13L and a right microphone 13R). The left microphone 13L and the right microphone 13R are disposed at substantially symmetrical positions with a vertical line passing through the center of the lens barrel 20 in a state where the camera 1 is held in the horizontal position. The microphones 13L and 13R collect external sounds, detect them as analog signals, and output them to the sound information processing unit 14.
The sound information processing unit 14 A / D converts the sound signal input from the stereo sound collecting device 13 into a digital signal and performs noise reduction processing. The sound information processing unit 14 includes a noise reduction processing unit 14A and a correction unit 14B as functional units related to noise reduction processing. These will be described in detail later.

記憶部１５は、画像処理部１２が出力する画像データおよび音情報処理部１４が出力する音データを記憶する。記憶部１５は、バッファーやカメラに内蔵されたメモリでもよいし、またＳＤカードやＨＤＤ等の外部の記憶媒体でもよい。 The storage unit 15 stores the image data output from the image processing unit 12 and the sound data output from the sound information processing unit 14. The storage unit 15 may be a buffer or a memory built in the camera, or may be an external storage medium such as an SD card or HDD.

出力部１８は、記憶部１５に記憶された画像データ及び音データを出力する。出力部１８は、外部機器へ音情報（電気信号）を出力するためのインターフェース等である。外部機器とは、これに限定されないが、例えばＰＣ、外部スピーカ、携帯電話等である。ただし、これに限定されず、出力部１８は、カメラ１に設けられた背面液晶及びスピーカであってもよい。なお、出力部１８がスピーカの場合、出力部１８は音情報（電気信号）を音に変換する変換部も備える。 The output unit 18 outputs image data and sound data stored in the storage unit 15. The output unit 18 is an interface for outputting sound information (electric signal) to an external device. Examples of the external device include, but are not limited to, a PC, an external speaker, and a mobile phone. However, the present invention is not limited to this, and the output unit 18 may be a rear liquid crystal and a speaker provided in the camera 1. When the output unit 18 is a speaker, the output unit 18 also includes a conversion unit that converts sound information (electric signal) into sound.

入力部１９は、外部機器からデータを入力するためのインターフェース等である。
外部機器とデータのやり取り（通信）をする際には、出力部１８と入力部１９は別体となっていなくてもよく、入力部１９と出力部１８が一体となっているような構成であってもよい。
なお、外部機器とは、これに限定されないが、例えばＰＣ、外部マイク、携帯電話等である。 The input unit 19 is an interface for inputting data from an external device.
When exchanging (communication) data with an external device, the output unit 18 and the input unit 19 do not have to be separated, and the input unit 19 and the output unit 18 are integrated. There may be.
The external device is not limited to this, but is, for example, a PC, an external microphone, a mobile phone, or the like.

制御部１６は、ＣＰＵ等を備えて構成され、設定された撮像条件（例えば、絞り値、露出値等）に応じて、レンズ鏡筒２０の後述する各構成要素を含めたカメラ１の各構成要素を統括制御する。たとえば、制御部１６は、後述するレンズ鏡筒２０におけるＡＦ駆動用モータ２２を駆動する駆動制御信号を生成し、レンズ制御部２４に出力する。 The control unit 16 includes a CPU and the like, and each configuration of the camera 1 including each component to be described later of the lens barrel 20 according to the set imaging conditions (for example, an aperture value, an exposure value, etc.). Supervise and control elements. For example, the control unit 16 generates a drive control signal for driving an AF drive motor 22 in a lens barrel 20 described later, and outputs the drive control signal to the lens control unit 24.

レンズ鏡筒２０は、フォーカシングレンズ、手振れ補正レンズ、ズーミングレンズ等を備える結像光学系（図示省略）と、ＡＦエンコーダ２１と、ＡＦ駆動用モータ２２と、を備えている。
ＡＦエンコーダ２１は、フォーカシングレンズの位置を検出してレンズ制御部２４および制御部１６に出力する。レンズ制御部２４は、検出されたフォーカシングレンズの位置情報を制御部１６に出力する。
ＡＦ駆動用モータ２２は、レンズ制御部２４から入力されるＡＦレンズの位置を制御するための駆動制御信号に応じて、ＡＦレンズを移動駆動する。 The lens barrel 20 includes an imaging optical system (not shown) including a focusing lens, a camera shake correction lens, a zooming lens, and the like, an AF encoder 21, and an AF drive motor 22.
The AF encoder 21 detects the position of the focusing lens and outputs it to the lens control unit 24 and the control unit 16. The lens control unit 24 outputs the detected position information of the focusing lens to the control unit 16.
The AF drive motor 22 moves and drives the AF lens according to a drive control signal for controlling the position of the AF lens input from the lens control unit 24.

そして、カメラ１は、使用者による図示しないシャッタボタンの押圧操作によって撮影が指令されると、制御部１６によって制御されて撮影作用を行う。
すなわち、撮像素子１１によって被写体像光を電気信号に変換し、画像処理部１２によって処理した画像データを、記憶部１５に記憶させる（撮影する）。制御部１６は、撮影時において、レンズ制御部２４、ＡＦ駆動用モータ２２を介してＡＦレンズを移動駆動するＡＦ制御を行う。 The camera 1 is controlled by the control unit 16 to perform a shooting operation when a shooting command is issued by a user pressing a shutter button (not shown).
That is, the subject image light is converted into an electrical signal by the image sensor 11 and the image data processed by the image processing unit 12 is stored (captured) in the storage unit 15. The control unit 16 performs AF control for moving and driving the AF lens via the lens control unit 24 and the AF driving motor 22 during photographing.

動画撮影時においては、撮像素子１１は、被写体像光を電気信号に変換して順次取り込み、記憶部１５を介して１秒間に所定のフレーム（コマ数）の画像を記憶する。また、前述したように、音情報処理部１４が集音した音データを、画像データと共に記憶部１５を介して記憶（録音）する。動画撮影時には、撮影期間を通してＡＦ制御が行われる。 At the time of moving image shooting, the image pickup device 11 converts subject image light into an electrical signal and sequentially captures it, and stores an image of a predetermined frame (number of frames) per second via the storage unit 15. Further, as described above, the sound data collected by the sound information processing unit 14 is stored (recorded) through the storage unit 15 together with the image data. During moving image shooting, AF control is performed throughout the shooting period.

ここで、ステレオ集音装置１３が集音した音情報は、音情報処理部１４に入力される。音情報処理部１４は、ステレオ集音装置１３が集音した音に含まれるＡＦ制御にかかる駆動ノイズ（ＡＦ駆動音）を低減処理する。そして、音情報処理部１４は、駆動ノイズ（ＡＦ駆動音）が低減処理された音情報を記憶部１５に出力する。 Here, the sound information collected by the stereo sound collecting device 13 is input to the sound information processing unit 14. The sound information processing unit 14 performs a reduction process on driving noise (AF driving sound) related to AF control included in the sound collected by the stereo sound collecting device 13. Then, the sound information processing unit 14 outputs the sound information on which the drive noise (AF drive sound) has been reduced to the storage unit 15.

ただし、上記の処理の流れに限定されない。例えば変形形態として、１）制御部１６は、ステレオ集音装置１３が集音した音を、一旦、記憶部１５に記憶させる、２）制御部１６は、その記憶された音データをノイズ低減処理部１４Ａへ出力する、３）低減処理部１４Ａは音データに対して低減処理を施す、４）次いで、制御部１６は、低減処理された音データを、再度、記憶部１５に記憶する、といった処理の流れでも良い。 However, it is not limited to the above processing flow. For example, as a modification, 1) the control unit 16 temporarily stores the sound collected by the stereo sound collecting device 13 in the storage unit 15. 2) The control unit 16 performs noise reduction processing on the stored sound data. 3) The reduction processing unit 14A performs a reduction process on the sound data. 4) Next, the control unit 16 stores the reduced sound data in the storage unit 15 again. Processing flow may be used.

本実施形態の処理の流れに戻り、前述した図１に加えて図２〜図４を参照し、音情報処理部１４について詳細に説明する。図２は、音情報処理部１４におけるノイズ低減処理とその補正の説明図である。図３は、音情報処理部１４におけるノイズ低減処理と補正のフローチャートである。図４は、音像変位を説明する図である。 Returning to the processing flow of the present embodiment, the sound information processing unit 14 will be described in detail with reference to FIGS. 2 to 4 in addition to FIG. 1 described above. FIG. 2 is an explanatory diagram of noise reduction processing and correction in the sound information processing unit 14. FIG. 3 is a flowchart of noise reduction processing and correction in the sound information processing unit 14. FIG. 4 is a diagram for explaining the sound image displacement.

音情報処理部１４は、前述したように、ノイズ低減処理部１４Ａと、補正部１４Ｂとを備えている。
ノイズ低減処理部１４Ａは、ノイズ周波数スペクトルＳＮを用い、スペクトル減算法によってＡＦ駆動音に対するノイズ低減処理を行う。ノイズ周波数スペクトルＳＮは、図２（ｂ）に一例を示すような、予め記憶している動作ノイズ情報又は過去に集音した音情報から推定したものである。 As described above, the sound information processing unit 14 includes the noise reduction processing unit 14A and the correction unit 14B.
The noise reduction processing unit 14A uses the noise frequency spectrum SN and performs noise reduction processing on the AF driving sound by a spectral subtraction method. The noise frequency spectrum SN is estimated from previously stored operation noise information or sound information collected in the past, as shown in FIG. 2B as an example.

具体的に説明すると、ノイズ低減処理部１４Ａは、ステレオ集音装置１３（左マイク１３Ｌ，右マイク１３Ｒ）から入力されてデジタル化された音信号を、所定の長さで区切ったフレーム単位でフーリエ変換等により周波数解析を行う。
そして、図２（ａ）に一例を示すような複数の周波数帯域（ｆ１〜ｆ８）に分割した周波数スペクトルＳＩＬ，ＳＩＲを得る。
その周波数スペクトルＳＩＬ，ＳＩＲから図２（ｂ）に示すノイズ周波数スペクトルＳＮを減算してノイズ成分を除去する。
さらに、必要に応じて、信号の下限規制等のフロアリング処理を行って、図２（ｃ）に示すノイズ低減処理後の周波数スペクトルＳＳＬ，ＳＳＲを補正部１４Ｂに出力する。 More specifically, the noise reduction processing unit 14A performs Fourier transform in units of frames obtained by dividing a digitized sound signal input from the stereo sound collector 13 (the left microphone 13L and the right microphone 13R) by a predetermined length. Perform frequency analysis by conversion.
Then, frequency spectra SIL and SIR divided into a plurality of frequency bands (f1 to f8) as shown in FIG. 2A are obtained.
The noise component is removed by subtracting the noise frequency spectrum SN shown in FIG. 2B from the frequency spectra SIL and SIR.
Further, flooring processing such as signal lower limit regulation is performed as necessary, and the frequency spectra SSL and SSR after the noise reduction processing shown in FIG. 2C are output to the correction unit 14B.

このノイズ低減処理部１４Ａによるノイズ低減処理は、ＡＦ駆動音が含まれるフレームに対して、フレーム毎に行われる。
ＡＦ駆動音が含まれるフレームの検知は、たとえば、ＡＦレンズの位置を検出するＡＦエンコーダ２１の出力に基づいて（ＡＦレンズが移動するとＡＦエンコーダ２１の出力が変化する）行われる。
なお、図２（ａ）における周波数スペクトルＳＩＬ，ＳＩＲに対する網掛け部位は、ＡＦ駆動音が含まれない目的音のみの周波数スペクトルを参考的に示すものである。 The noise reduction processing by the noise reduction processing unit 14A is performed for each frame with respect to the frame including the AF driving sound.
The detection of the frame including the AF driving sound is performed, for example, based on the output of the AF encoder 21 that detects the position of the AF lens (the output of the AF encoder 21 changes when the AF lens moves).
Note that the shaded portions for the frequency spectra SIL and SIR in FIG. 2A refer to the frequency spectrum of only the target sound that does not include the AF driving sound.

ここで、ノイズ低減処理部１４Ａによるノイズ低減処理は、ステレオ集音装置１３における左右のマイク（左マイク１３Ｌ，右マイク１３Ｒ）からの音信号に対して、それぞれ独立して行われる。
ただし、左マイク１３Ｌおよび右マイク１３Ｒはレンズ鏡筒２０に対して略対称に配置されているため、入力されるＡＦノイズ（ＡＦ駆動音）は同一であるものとしてノイズ周波数スペクトルＳＮは同一のものを用いる。
なお、左マイク１３Ｌおよび右マイク１３Ｒはレンズ鏡筒２０に対して略対称に配置される形態に限定されず、光軸に対した左右非対称であってもよい。 Here, the noise reduction processing by the noise reduction processing unit 14A is performed independently on the sound signals from the left and right microphones (the left microphone 13L and the right microphone 13R) in the stereo sound collecting device 13.
However, since the left microphone 13L and the right microphone 13R are arranged substantially symmetrically with respect to the lens barrel 20, the input AF noise (AF drive sound) is the same and the noise frequency spectrum SN is the same. Is used.
Note that the left microphone 13L and the right microphone 13R are not limited to a form in which the left microphone 13L and the right microphone 13R are arranged substantially symmetrically with respect to the lens barrel 20, and may be asymmetrical with respect to the optical axis.

補正部１４Ｂは、
・ノイズ低減処理部１４Ａによるノイズ低減処理前の周波数スペクトル（処理前スペクトル）ＳＩＬ，ＳＩＲの、各周波数帯域（ｆ１〜ｆ８）における左右の信号比（処理前比、基準比）と、
・ノイズ低減処理部１４Ａによるノイズ低減処理後の周波数スペクトル（処理後スペクトル）ＳＳＬ，ＳＳＲの各周波数帯域（ｆ１〜ｆ８）における左右の信号比（処理後比、第１の関係）と、
を各々比較する。 The correction unit 14B
A left / right signal ratio (pre-processing ratio, reference ratio) of each frequency band (f1 to f8) of the frequency spectrum (pre-processing spectrum) SIL, SIR before the noise reduction processing by the noise reduction processing unit 14A;
The left / right signal ratio (post-processing ratio, first relationship) in each frequency band (f1 to f8) of the frequency spectrum (processed spectrum) SSL, SSR after the noise reduction processing by the noise reduction processing unit 14A,
Are compared.

補正部１４Ｂは、その比較結果に基づいて、処理後比ＲＳが処理前比ＲＩと、各周波数帯域において、それぞれ略一致するように補正して補正後比ＲＣ（第２の関係）、補正後の周波数スペクトル（補正後スペクトル）ＳＣＬ，ＳＣＲを求める。
そして、補正部１４Ｂは、この補正後スペクトルＳＣＬ，ＳＣＲを記憶部１５に出力する。 Based on the comparison result, the correction unit 14B corrects the post-processing ratio RS so that it substantially coincides with the pre-processing ratio RI in each frequency band, thereby correcting the post-correction ratio RC (second relationship). The frequency spectrum (corrected spectrum) SCL, SCR is obtained.
Then, the correction unit 14B outputs the corrected spectra SCL and SCR to the storage unit 15.

以下、この補正部１４Ｂによる補正について、図２に即してより詳細に説明する。
（処理前スペクトル）
図２（ａ）に示すように、ノイズ低減処理部１４Ａによるノイズ低減処理前における左マイク１３Ｌから入力した音（音信号Ｌ）の周波数スペクトル（処理前スペクトル（Ｌ））における各周波数帯域（ｆ１〜ｆ８）の振幅をＳＩＬ１〜ＳＩＬ８とする。
右マイク１３Ｒから入力した音（音信号Ｒ）の周波数スペクトル（処理前スペクトル（Ｒ））における各周波数帯域（ｆ１〜ｆ８）の振幅をＳＩＲ１〜ＳＩＲ８とする。
処理前スペクトルの周波数帯域（ｆ１〜ｆ８）ごとの振幅の左／右信号比（以下、この左／右信号比を処理前比とする）は、ＲＩ１＝ＳＩＬ１／ＳＩＲ１，・・・，ＲＩ８＝ＳＩＬ８／ＳＩＲ８となる。 Hereinafter, the correction by the correction unit 14B will be described in more detail with reference to FIG.
(Spectrum before processing)
As shown in FIG. 2A, each frequency band (f1) in the frequency spectrum (pre-processing spectrum (L)) of the sound (sound signal L) input from the left microphone 13L before the noise reduction processing by the noise reduction processing unit 14A. ˜f8) is assumed to be SIL1 to SIL8.
The amplitudes of the frequency bands (f1 to f8) in the frequency spectrum (pre-processing spectrum (R)) of the sound (sound signal R) input from the right microphone 13R are SIR1 to SIR8.
The left / right signal ratio of the amplitude for each frequency band (f1 to f8) of the spectrum before processing (hereinafter, this left / right signal ratio is referred to as the ratio before processing) is RI1 = SIL1 / SIR1,..., RI8 = SIL8 / SIR8.

（処理後スペクトル）
また、図２（ｃ）に示すように、ノイズ低減処理部１４Ａによるノイズ低減処理後の音信号Ｌの周波数スペクトル（処理後スペクトル（Ｌ））における各周波数帯域（ｆ１〜ｆ８）の振幅をＳＳＬ１〜ＳＳＬ８とする。
ノイズ低減処理部１４Ａによるノイズ低減処理後の音信号Ｒの周波数スペクトル（処理後スペクトル（Ｒ））における各周波数帯域（ｆ１〜ｆ８）の振幅をＳＳＲ１〜ＳＳＲ８とする。
処理後スペクトルの周波数帯域（ｆ１〜ｆ８）ごとの振幅の左／右信号比（以下、この左／右信号比を処理後比とする）は、ＲＳ１＝ＳＳＬ１／ＳＳＲ１，・・・，ＲＳ８＝ＳＳＬ８／ＳＳＲ８となる。 (Processed spectrum)
Further, as shown in FIG. 2C, the amplitude of each frequency band (f1 to f8) in the frequency spectrum (processed spectrum (L)) of the sound signal L after the noise reduction processing by the noise reduction processing unit 14A is expressed as SSL1. ˜SSL8.
The amplitudes of the frequency bands (f1 to f8) in the frequency spectrum (processed spectrum (R)) of the sound signal R after the noise reduction processing by the noise reduction processing unit 14A are defined as SSR1 to SSR8.
The left / right signal ratio of the amplitude for each frequency band (f1 to f8) of the processed spectrum (hereinafter, this left / right signal ratio is referred to as the processed ratio) is RS1 = SSL1 / SSR1,..., RS8 = SSL8 / SSR8.

（補正後スペクトル）
補正部１４Ｂは、処理前比（ＲＩ１〜ＲＩ８）と、処理後比（ＲＳ１〜ＲＳ８）と、
を各周波数帯域（ｆ１〜ｆ８）において比較する。
そして、補正部１４Ｂは、図２（ｄ）に示すように、処理後比（ＲＳ１〜ＲＳ８）が処理前比（ＲＩ１〜ＲＩ８）と各々等しくなるように補正する。そして、補正後スペクトル（Ｌ）（ＳＣＬ１〜ＳＣＬ８）及び補正後スペクトル（Ｒ）（ＳＣＲ１〜ＳＣＲ８）を得る。 (Corrected spectrum)
The correction unit 14B includes a pre-processing ratio (RI1 to RI8), a post-processing ratio (RS1 to RS8),
Are compared in each frequency band (f1 to f8).
And the correction | amendment part 14B correct | amends so that post-process ratio (RS1-RS8) may become equal to pre-process ratio (RI1-RI8), respectively, as shown in FIG.2 (d). Then, a corrected spectrum (L) (SCL1 to SCL8) and a corrected spectrum (R) (SCR1 to SCR8) are obtained.

ここで、補正後スペクトルを得る方式には、増加補正と、減少補正と、平均補正と、がある。 Here, methods for obtaining a corrected spectrum include an increase correction, a decrease correction, and an average correction.

（増加補正）
増加補正は、処理後スペクトル（Ｌ）又は処理後スペクトル（Ｒ）の何れかの振幅を大きく補正して、処理後比ＲＳを処理前比ＲＩに一致させるものである。
１．処理後比ＲＳｎが処理前比ＲＩｎより大きい場合
（１）補正後スペクトル（Ｌ）を求める（Ｌ固定）
処理後スペクトル（Ｌ）ＳＳＬｎを補正後スペクトル（Ｌ）ＳＣＬｎとする（ＳＣＬｎ＝ＳＳＬｎ）
（２）補正後スペクトル（Ｒ）を求める
そして、（１）で求めた補正後スペクトル（Ｌ）ＳＣＬｎに対する比が、処理前比ＲＩｎと等しくなるように、補正後スペクトル（Ｒ）ＳＣＲｎを求める。
このとき、処理後比ＲＳｎは、処理前比ＲＩｎより大きいので、処理後スペクトル（Ｌ）と同じ値の補正後スペクトル（Ｌ）ＳＣＬに対して処理前比を満たすように、処理後スペクトル（Ｒ）ＳＳＲを補正すると、補正後スペクトル（Ｒ）ＳＣＲｎは、処理後スペクトル（Ｒ）ＳＳＲｎより大きくなる（ＳＣＲｎ＞ＳＳＲｎ）。 (Increase correction)
In the increase correction, the amplitude of either the post-processing spectrum (L) or the post-processing spectrum (R) is largely corrected to make the post-processing ratio RS coincide with the pre-processing ratio RI.
1. When the post-process ratio RSn is larger than the pre-process ratio RIn (1) Obtain the corrected spectrum (L) (fixed L)
The processed spectrum (L) SSLn is set as a corrected spectrum (L) SCLn (SCLn = SSLn).
(2) Obtaining the corrected spectrum (R) Then, the corrected spectrum (R) SCRn is obtained so that the ratio of the corrected spectrum (L) SCLn obtained in (1) is equal to the pre-processing ratio RIn.
At this time, since the post-processing ratio RSn is larger than the pre-processing ratio RIn, the post-processing spectrum (R) so as to satisfy the pre-processing ratio with respect to the corrected spectrum (L) SCL having the same value as the post-processing spectrum (L). ) When the SSR is corrected, the corrected spectrum (R) SCRn becomes larger than the processed spectrum (R) SSRn (SCRn> SSRn).

２．処理後比ＲＳｎが処理前比ＲＩｎより小さい場合
（１）補正後スペクトル（Ｒ）を求める（Ｒ固定）
処理後スペクトル（Ｒ）ＳＳＲｎを補正後スペクトル（Ｒ）ＳＣＲｎとする（ＳＣＲｎ＝ＳＳＲｎ）
（２）補正後スペクトル（Ｌ）を求める
そして、（１）で求めた補正後スペクトル（Ｒ）ＳＣＲｎに対する比が、処理前比ＲＩｎと等しくなるように、補正後スペクトル（Ｌ）ＳＣＬｎを求める。
このとき、ＳＣＬｎ＞ＳＳＬｎとなる。
なお、上記「ｎ」には、各周波数帯域を示す数字（１〜８）が入る。 2. When the post-process ratio RSn is smaller than the pre-process ratio RIn (1) Obtain the corrected spectrum (R) (fixed R)
The post-processing spectrum (R) SSRn is used as the corrected spectrum (R) SCRn (SCRn = SSRn).
(2) Obtaining the corrected spectrum (L) Then, the corrected spectrum (L) SCLn is obtained so that the ratio to the corrected spectrum (R) SCRn obtained in (1) is equal to the pre-processing ratio RIn.
At this time, SCLn> SSLn.
Note that “n” is a number (1 to 8) indicating each frequency band.

上記の増加補正において、補正後スペクトルの振幅は、本実施形態においてノイズ低減処理前の振幅以下であるが、これに限定されない。例えば、ノイズ低減処理後のスペクトルを一旦増幅した後にスペクトルの振幅を補正した場合には、補正後のスペクトルの振幅はノイズ低減処理前の振幅よりも大きくなることがある。 In the above increase correction, the amplitude of the corrected spectrum is equal to or smaller than the amplitude before the noise reduction processing in the present embodiment, but is not limited thereto. For example, when the spectrum after the noise reduction process is once amplified and the amplitude of the spectrum is corrected, the amplitude of the spectrum after the correction may be larger than the amplitude before the noise reduction process.

３．具体例
具体例として、図２（ａ）〜（ｅ）中に示すように、周波数スペクトルにおける周波数帯域ｆ３に左右で差があり、周波数帯域ｆ３における左右（Ｌ，Ｒ）の振幅値がノイズ低減処理前（６，３）で、ノイズ低減処理によって（４，１）に変化したとする。
この場合、処理前比ＲＩ３は６／３＝２、処理後比ＲＳ３は４／１＝４、と異なる。補正後における左右信号比（補正後比）ＲＣ３を処理前比ＲＩ３と等しくするため、ノイズ低減処理後の右（Ｒ）の振幅値を１から２に補正する。
その結果、補正後におけるＬ、Ｒの振幅値は（４，２）となり、処理前比２と等しくなる。
このような増加補正によれば、目的音の劣化を抑えることができ、人の音がある場合や目的音が大きくノイズがあまり気にならない場合等に適する。 3. Specific Example As a specific example, as shown in FIGS. 2A to 2E, there is a difference between the left and right frequency bands f3 in the frequency spectrum, and the left and right (L, R) amplitude values in the frequency band f3 are noise reduced. It is assumed that before the processing (6, 3), the noise reduction processing is changed to (4, 1).
In this case, the pre-processing ratio RI3 is different from 6/3 = 2 and the post-processing ratio RS3 is different from 4/1 = 4. In order to make the right / left signal ratio (corrected ratio) RC3 after correction equal to the pre-processing ratio RI3, the right (R) amplitude value after noise reduction processing is corrected from 1 to 2.
As a result, the amplitude values of L and R after correction are (4, 2), which is equal to the pre-processing ratio 2.
Such increase correction can suppress the deterioration of the target sound, and is suitable when there is a human sound or when the target sound is large and noise is not a concern.

（減少補正）
減少補正は、処理後スペクトル（Ｌ）又は処理後スペクトル（Ｒ）の何れかの振幅を小さく補正して、処理後比ＲＳを処理前比ＲＩに一致させるものである。
１．処理後比ＲＳｎが処理前比ＲＩｎより大きい場合
（１）補正後スペクトル（Ｒ）を求める（Ｒ固定）
処理後スペクトル（Ｒ）ＳＳＲｎを補正後スペクトル（Ｒ）ＳＣＲｎとする（ＳＣＲｎ＝ＳＳＲｎ）
（２）補正後スペクトル（Ｌ）を求める
そして、（１）で求めた補正後スペクトル（Ｒ）ＳＣＲｎに対する比が、処理前比ＲＩｎと等しくなるように、補正後スペクトル（Ｌ）ＳＣＬｎを求める。
このとき、ＳＣＬｎ＜ＳＳＬｎとなる。 (Decrease correction)
In the reduction correction, the amplitude of either the processed spectrum (L) or the processed spectrum (R) is corrected to be small so that the processed ratio RS matches the preprocessed ratio RI.
1. When the post-process ratio RSn is larger than the pre-process ratio RIn (1) Obtain the corrected spectrum (R) (fixed R)
The post-processing spectrum (R) SSRn is used as the corrected spectrum (R) SCRn (SCRn = SSRn).
(2) Obtaining the corrected spectrum (L) Then, the corrected spectrum (L) SCLn is obtained so that the ratio to the corrected spectrum (R) SCRn obtained in (1) is equal to the pre-processing ratio RIn.
At this time, SCLn <SSLn.

２．処理後比ＲＳｎが処理前比ＲＩｎより小さい場合
（１）補正後スペクトル（Ｌ）を求める（Ｌ固定）
処理後スペクトル（Ｌ）ＳＳＬｎを補正後スペクトル（Ｌ）ＳＣＬｎとする（ＳＣＬｎ＝ＳＳＬｎ）
（２）補正後スペクトル（Ｒ）を求める
そして、（１）で求めた補正後スペクトル（Ｌ）ＳＣＬｎに対する比が、処理前比ＲＩｎと等しくなるように、補正後スペクトル（Ｒ）ＳＣＲｎを求める。
このとき、ＳＣＲｎ＜ＳＳＲｎとなる。
このような減少補正は、ノイズ低減効果が高く、人声のない静かな場合等に適する。 2. When the post-processing ratio RSn is smaller than the pre-processing ratio RIn (1) Obtain the corrected spectrum (L) (fixed L)
The processed spectrum (L) SSLn is set as a corrected spectrum (L) SCLn (SCLn = SSLn).
(2) Obtaining the corrected spectrum (R) Then, the corrected spectrum (R) SCRn is obtained so that the ratio of the corrected spectrum (L) SCLn obtained in (1) is equal to the pre-processing ratio RIn.
At this time, SCRn <SSRn.
Such reduction correction is suitable for a case where the noise reduction effect is high and there is no human voice.

なお上記の減少補正において、補正後スペクトルの振幅は、本実施形態においてノイズ低減処理後の振幅以下であるが、これに限定されない。例えば、ノイズ低減処理後のスペクトルを一旦増幅した後にスペクトルの振幅を補正した場合には、補正後のスペクトルの振幅はノイズ低減処理後の振幅よりも大きくなることがある。また、増幅の度合いに応じては、ノイズ低減処理前の振幅よりも大きくなることもある。 In the above reduction correction, the amplitude of the corrected spectrum is equal to or smaller than the amplitude after the noise reduction processing in the present embodiment, but is not limited thereto. For example, when the spectrum amplitude after correction after the noise reduction processing is once amplified, the amplitude of the corrected spectrum may be larger than the amplitude after the noise reduction processing. Depending on the degree of amplification, the amplitude before the noise reduction process may be larger.

（平均補正）
平均補正は、前述した増加補正と減少補正とを折衷したものである。ノイズ低減処理後の左右の周波数スペクトルにおける振幅の和を、処理後比ＲＳｎ＝処理前比ＲＩｎとなるように左右に振り分けて補正するものである。 (Average correction)
The average correction is a compromise between the above-described increase correction and decrease correction. The sum of the amplitudes in the left and right frequency spectra after the noise reduction processing is distributed and corrected so that the post-processing ratio RSn = the pre-processing ratio RIn.

上記各補正方式は、補正する対象や状況に応じて、補正方式を切り換えて適用するように構成してもよい。補正方式の切り換えは、公知の技術である音認識や撮像情報から顔認識や人物認識を利用して行うことができる。たとえば、人物が大きく撮影されている場合や人の音入力が認識された場合および入力が大きい場合には増加補正を適用し、人物が認識されないその他の場合には減少補正を適用するように構成すれば良い。 Each of the above correction methods may be configured to be applied by switching the correction method according to the object to be corrected and the situation. Switching of the correction method can be performed by using face recognition or person recognition from sound recognition or imaging information, which are publicly known techniques. For example, it is configured to apply an increase correction when a person is photographed large, when a person's sound input is recognized and when the input is large, and to apply a decrease correction in other cases where the person is not recognized Just do it.

なお、本実施形態では、処理後比ＲＳ（第１の関係）を処理前比ＲＩ（基準関係）に一致させる例について説明した。しかし、本実施形態はそれに限定されない。補正後比ＲＣは必ずしもＲＣ＝処理前比ＲＩでなくても良く、ＲＣは処理前比ＲＩを含む所定の範囲内であればよい。また、補正後比ＲＣの所定の範囲とは、処理後比ＲＳよりも処理前比ＲＩに近い値となる範囲である。 In the present embodiment, the example in which the post-processing ratio RS (first relationship) matches the pre-processing ratio RI (reference relationship) has been described. However, the present embodiment is not limited to this. The corrected ratio RC does not necessarily have to be RC = pre-processing ratio RI, and RC may be within a predetermined range including the pre-processing ratio RI. The predetermined range of the post-correction ratio RC is a range that is closer to the pre-processing ratio RI than the post-processing ratio RS.

すなわち、仮に、処理後比ＲＳ（第１の関係）の音を聞くことができたとすると、補正後比ＲＣの音の定位は、第１の関係（処理後比ＲＳ）の音の定位よりも、処理前比ＲＩの音の定位に近い。
また、補正後比ＲＣの所定の範囲とは、補正後比ＲＣが処理前比ＲＩのプラスマイナス５％以内に含まれるような範囲と定めてもよい。 That is, if the sound of the post-processing ratio RS (first relation) can be heard, the localization of the sound of the post-correction ratio RC is more than the localization of the sound of the first relation (post-processing ratio RS). It is close to the sound localization of the pre-processing ratio RI.
The predetermined range of the corrected ratio RC may be determined as a range in which the corrected ratio RC is included within plus or minus 5% of the pre-processing ratio RI.

また、補正後比ＲＣの所定の範囲とは、ノイズ低減処理前の音像の位置に対して、補正後の音像の位置がプラスマイナス３０°以内に含まれるような範囲であってもよい。このように、補正後比ＲＣの所定の範囲を、補正後の音像の位置が所定の角度の範囲に含まれるような範囲として定めてもよい。また補正後比ＲＣの所定の範囲とは、補正後の音像の位置がプラスマイナス３０°よりも狭い、プラスマイナス１５°以内に含まれる範囲であってもよい。 Further, the predetermined range of the corrected ratio RC may be a range in which the position of the corrected sound image is included within ± 30 ° with respect to the position of the sound image before the noise reduction processing. As described above, the predetermined range of the corrected ratio RC may be determined as a range in which the position of the corrected sound image is included in the range of the predetermined angle. Further, the predetermined range of the corrected ratio RC may be a range in which the position of the corrected sound image is narrower than plus / minus 30 ° and included within plus / minus 15 °.

つぎに、図３に示すフローチャートに沿って、ノイズ低減処理部１４Ａおよび補正部１４Ｂによるノイズ低減処理と補正制御の流れを説明する。なお、図３中および以下の説明では、ステップを「Ｓ」とも略記する。
ノイズ低減処理部１４Ａによるノイズ低減処理と補正部１４Ｂによる補正は、前述したようにＡＦエンコーダ２１の出力等のＡＦ駆動情報に基づいてスタートする。つまり、ＡＦ駆動時のみに機能する。 Next, the flow of noise reduction processing and correction control by the noise reduction processing unit 14A and the correction unit 14B will be described with reference to the flowchart shown in FIG. In FIG. 3 and the following description, step is also abbreviated as “S”.
The noise reduction processing by the noise reduction processing unit 14A and the correction by the correction unit 14B start based on AF drive information such as the output of the AF encoder 21 as described above. That is, it functions only during AF driving.

ノイズ低減処理と補正制御は、まず、補正部１４Ｂがノイズ低減処理部１４Ａによるノイズ低減処理前におけるそのフレームの処理前比ＲＩを演算し（Ｓ３０１）、ノイズ低減処理部１４Ａによってノイズ低減処理を行う（Ｓ３０２）。
ついで、補正部１４Ｂが、ノイズ低減処理部１４Ａによるノイズ低減処理後の処理後比ＲＳを演算し（Ｓ３０３）、その処理後比ＲＳと処理前比ＲＩとを比較する（Ｓ３０４）。
ステップ３０４において両者が等しくないと判断された場合（Ｎｏ）には、補正部１４Ｂによってノイズ低減処理後の信号に補正を行う（Ｓ３０５）。一方、ステップ３０４において両者が等しいと判断された場合（Ｙｅｓ）には、補正することなく制御を終了する。 In the noise reduction processing and correction control, first, the correction unit 14B calculates a pre-processing ratio RI of the frame before the noise reduction processing by the noise reduction processing unit 14A (S301), and the noise reduction processing unit 14A performs the noise reduction processing. (S302).
Next, the correction unit 14B calculates the post-processing ratio RS after the noise reduction processing by the noise reduction processing unit 14A (S303), and compares the post-processing ratio RS with the pre-processing ratio RI (S304).
If it is determined in step 304 that they are not equal (No), the correction unit 14B corrects the signal after the noise reduction processing (S305). On the other hand, if it is determined in step 304 that both are equal (Yes), the control is terminated without correction.

上記のように、補正部１４Ｂは、周波数スペクトルの各周波数帯域における処理後比を、処理前比と略一致するように補正する。
これにより、ステレオ信号をノイズ発生タイミングに合わせてノイズ低減処理を行った際に、そのノイズ低減処理に起因する目的音の音像変位を抑制することができる。 As described above, the correction unit 14B corrects the post-processing ratio in each frequency band of the frequency spectrum so as to substantially match the pre-processing ratio.
Thereby, when the noise reduction process is performed on the stereo signal in accordance with the noise generation timing, it is possible to suppress the sound image displacement of the target sound due to the noise reduction process.

すなわち、図４に概念図を示すように、人物Ｍから見た目的音の音像位置に対して、ノイズ低減処理のみで補正しない処理音の音像が大きく移動してしまう場合でも、補正によって音像の移動を小さく抑えることができる。その結果、ノイズ低減処理時（ＡＦ駆動時）において映像と音像とが突然乖離するといった違和感のある音像変位を防ぐことができるものである。
また、本実施形態において音処理は、全周波数帯域において行うものでなくてもよく、一部の周波数帯域に対して音処理を行ってもよい。一部の周波数帯域の例としては、ノイズが特に検出される周波数帯域や、可聴の周波数帯域、極端な高周波や低周波をカットした周波数帯域があげられる。 That is, as shown in the conceptual diagram of FIG. 4, even if the sound image of the processed sound that is not corrected only by the noise reduction process moves greatly relative to the sound image position of the target sound viewed from the person M, the sound image is moved by the correction. Can be kept small. As a result, it is possible to prevent an uncomfortable sound image displacement such that the image and the sound image suddenly deviate during noise reduction processing (AF driving).
In the present embodiment, the sound processing may not be performed in the entire frequency band, and the sound processing may be performed on a part of the frequency bands. Examples of some frequency bands include a frequency band in which noise is particularly detected, an audible frequency band, and a frequency band in which extreme high and low frequencies are cut.

（第２実施形態）
つぎに、本発明の第２実施形態について説明する。
図５は、第２実施形態にかかる音情報処理部１４におけるノイズ低減処理と補正のフローチャートである。図２、図３と同様に、周波数スペクトルにおける周波数帯域ｆ３について説明する。
本第２実施形態は、補正の基準とする左右信号比（処理前比）を、ノイズ（ＡＦ駆動音）発生の無い部分（フレーム）から取得するものである。なお、機構的な構成は、前述した第１実施形態と全く同様であり、説明は省略する。以下の説明中における構成要素の符号等は、図１参照のこと。
本第２実施形態では、補正の基準とする左右信号比を、ノイズ低減処理部分の直前または直後の部分から求める。なお、直前の信号比を利用する場合にはリアルタイムの処理（逐次処理）が可能であるが、直後の信号比を利用する場合には逐次処理が難しく後処理の場合にのみ適用可能である。 (Second Embodiment)
Next, a second embodiment of the present invention will be described.
FIG. 5 is a flowchart of noise reduction processing and correction in the sound information processing unit 14 according to the second embodiment. Similar to FIGS. 2 and 3, the frequency band f <b> 3 in the frequency spectrum will be described.
In the second embodiment, the left / right signal ratio (pre-processing ratio) as a reference for correction is acquired from a portion (frame) where no noise (AF drive sound) is generated. The mechanical configuration is exactly the same as that of the first embodiment described above, and a description thereof is omitted. Refer to FIG. 1 for the reference numerals of the constituent elements in the following description.
In the second embodiment, the left / right signal ratio as a reference for correction is obtained from a portion immediately before or after the noise reduction processing portion. Note that real-time processing (sequential processing) is possible when the immediately preceding signal ratio is used, but when the immediately following signal ratio is used, sequential processing is difficult and can be applied only to post-processing.

このように、ノイズ（ＡＦ駆動音）が混入していない部分から左右信号比を求めてこれを補正の基準とすることで、ノイズの影響を受けずに目的音の左右比を求めることができる。
ただし、目的音の時間変化が大きい場合は、ノイズ低減処理部分の直前と、ノイズ低減処理部分とで、目的音のスペクトル（左右信号比）が大きく変化することがあり、実際に発生している目的音の音像移動に追従できないことがある。このようなことを防ぐため、補正の基準とする左右信号比を、ノイズ低減処理部分の直前から求めた左右信号比と、前述した第１実施形態のようにノイズ低減処理部分の左右信号比と、の何れかを選択可能とすることが好ましい。 In this way, by obtaining the left / right signal ratio from the portion where noise (AF driving sound) is not mixed and using this as the reference for correction, the right / left ratio of the target sound can be obtained without being affected by noise. .
However, when the time variation of the target sound is large, the spectrum (right / left signal ratio) of the target sound may change greatly between the noise reduction processing part and the noise reduction processing part, which is actually occurring. It may not be possible to follow the movement of the target sound. In order to prevent this, the right / left signal ratio used as a reference for correction is determined as the right / left signal ratio obtained immediately before the noise reduction processing portion, and the left / right signal ratio of the noise reduction processing portion as in the first embodiment described above. It is preferable that any one of these can be selected.

この補正基準となる左右信号比を選択して適用する場合におけるノイズ低減処理および補正を、図５に示すフローチャートに沿って説明する。
まず、補正部１４Ｂが、ＡＦエンコーダ２１の出力等のＡＦ駆動情報に基づいてノイズ低減処理をスタートする直前のフレームの左右の信号比ＲＩｂを演算し（Ｓ５０１）、ノイズ低減処理に入った後にノイズ低減処理部１４Ａによるノイズ低減処理前における各フレームの左右の信号比ＲＩａを演算する（Ｓ５０２）。 The noise reduction processing and correction in the case of selecting and applying the right / left signal ratio as the correction reference will be described with reference to the flowchart shown in FIG.
First, the correction unit 14B calculates the left / right signal ratio RIb of the frame immediately before starting the noise reduction process based on the AF drive information such as the output of the AF encoder 21 (S501), and after entering the noise reduction process, the noise The left / right signal ratio RIa of each frame before the noise reduction processing by the reduction processing unit 14A is calculated (S502).

そして、信号比ＲＩｂと信号比ＲＩａとの差（絶対値）を、予め定められた閾値Ａと比較判定する（Ｓ５０３）。
ステップ５０３において、信号比ＲＩｂと信号比ＲＩａとの差が閾値Ａ以下と判定された場合（Ｙｅｓ）には、信号比ＲＩｂを基準比ＲＩとして設定する（Ｓ５０４）。一方、ステップ５０３において、信号比ＲＩｂと信号比ＲＩａとの差が閾値Ａを越えていると判定された場合（Ｎｏ）には、信号比ＲＩａを基準比ＲＩとして設定する（Ｓ５０５）。 Then, the difference (absolute value) between the signal ratio RIb and the signal ratio RIa is compared with a predetermined threshold A (S503).
If it is determined in step 503 that the difference between the signal ratio RIb and the signal ratio RIa is equal to or less than the threshold A (Yes), the signal ratio RIb is set as the reference ratio RI (S504). On the other hand, if it is determined in step 503 that the difference between the signal ratio RIb and the signal ratio RIa exceeds the threshold A (No), the signal ratio RIa is set as the reference ratio RI (S505).

その後、ノイズ低減処理部１４Ａによってノイズ低減処理を行い（Ｓ５０６）、ついで、補正部１４Ｂが、ノイズ低減処理部１４Ａによるノイズ低減処理後の左右の信号比ＲＳを演算し（Ｓ５０７）、そのノイズ低減処理後左右信号比ＲＳとノイズ低減処理前左右信号比ＲＩとを比較する（Ｓ５０８）。
ステップ５０８において両者が等しくないと判断された場合（Ｎｏ）には、補正部１４Ｂによってノイズ低減処理後の信号に補正を行う（Ｓ５０９）。一方、ステップ５０８において両者が等しいと判断された場合（Ｙｅｓ）には、補正することなく制御を終了する。 Thereafter, noise reduction processing is performed by the noise reduction processing unit 14A (S506), and then the correction unit 14B calculates a left / right signal ratio RS after the noise reduction processing by the noise reduction processing unit 14A (S507), and the noise reduction is performed. The processed right / left signal ratio RS is compared with the left / right signal ratio RI before noise reduction processing (S508).
If it is determined in step 508 that the two are not equal (No), the correction unit 14B corrects the signal after the noise reduction processing (S509). On the other hand, if it is determined in step 508 that both are equal (Yes), the control is terminated without correction.

上記構成では、ノイズ低減処理部分の直前から求めた左右信号比ＲＩｂとノイズ低減処理部分の左右信号比ＲＩａとを比較し、その差が小さい場合には、目的音の音像の移動が小さいと判断してノイズの影響を受けない信号比ＲＩｂを基準信号比ＲＩとして採用し、差が所定量より大きい場合には、目的音の音像の移動が大きいと判断して信号比ＲＩａを基準信号比ＲＩとして採用するものである。
このような構成によれば、目的音の音像の移動が小さい場合には処理部分の直前と処理部分の音像の連続性を保つことができ、目的音の音像の移動が大きい場合には違和感のない円滑な音像移動を再現できる。 In the above configuration, the left / right signal ratio RIb obtained immediately before the noise reduction processing portion is compared with the left / right signal ratio RIa of the noise reduction processing portion, and if the difference is small, it is determined that the movement of the sound image of the target sound is small. When the signal ratio RIb that is not affected by noise is adopted as the reference signal ratio RI and the difference is larger than a predetermined amount, it is determined that the movement of the sound image of the target sound is large, and the signal ratio RIa is determined as the reference signal ratio RI. Is to be adopted.
According to such a configuration, when the movement of the target sound image is small, the continuity of the sound image immediately before the processing portion and the processing portion can be maintained, and when the movement of the target sound image is large, the sense of discomfort is maintained. Reproducible smooth sound image movement.

なお、事後処理（逐次処理でなく一旦記録した後に、読み出して行う処理）となるが、ノイズ低減処理部分の直前と直後の部分（フレーム）の左右信号比をそれぞれ求め、その変化率に対応させて左右の信号比率を変化させても良い。つまり、ノイズ低減処理部分の直前と直後において左右信号比が大きく異なる場合は、音源が左右に移動したと考えられるため、ノイズ低減処理部分の直前と直後の左右の信号比の変化と対応するように音像を移動させる処理を行うものである。 In addition, post processing (processing that is read after recording once instead of sequential processing) is performed, and the left and right signal ratios of the portion (frame) immediately before and after the noise reduction processing portion are respectively determined and corresponded to the rate of change. Thus, the left / right signal ratio may be changed. In other words, if the left / right signal ratio is significantly different between immediately before and after the noise reduction processing part, it is considered that the sound source has moved left and right, so that it corresponds to the change in the left / right signal ratio immediately before and after the noise reduction processing part. The process of moving the sound image is performed.

図６は、このような処理の説明図である。
図６（ａ）に示すように、フレーム４〜１０がノイズ低減処理フレームである場合、フレーム３が直前部分、フレーム１１が直後部分のフレームである。
図６（ｂ）において、ＳＦＬ３は直前（フレーム３）の左スペクトル、ＳＦＲ３は直前（フレーム３）の右スペクトル、ＳＦＬ１１は直後（フレーム１１）の左スペクトル、ＳＦＲ１１は直後（フレーム１１）の右スペクトルである。 FIG. 6 is an explanatory diagram of such processing.
As shown in FIG. 6A, when the frames 4 to 10 are noise reduction processing frames, the frame 3 is the immediately preceding portion and the frame 11 is the immediately following portion.
In FIG. 6B, SFL3 is the left spectrum immediately before (frame 3), SFR3 is the right spectrum immediately before (frame 3), SFL11 is the left spectrum immediately after (frame 11), and SFR11 is the right spectrum immediately after (frame 11). It is.

ここで、たとえば、周波数帯域ｆ３について見ると、
左側：ＳＦＬ１１のｆ３（振幅１．５）は、ＳＦＬ３のｆ３（振幅３）より減少している。
右側：ＳＦＲ１１のｆ３（振幅３）は、ＳＦＲ３のｆ３（振幅１）より増加している。
これは、ノイズ低減処理フレーム４〜１０の間に音源が左側から右側に移動していることを示す。 Here, for example, when looking at the frequency band f3,
Left side: f3 (amplitude 1.5) of SFL11 is smaller than f3 (amplitude 3) of SFL3.
Right: f3 (amplitude 3) of SFR11 is greater than f3 (amplitude 1) of SFR3.
This indicates that the sound source is moving from the left side to the right side during the noise reduction processing frames 4 to 10.

そこで、ノイズ低減処理フレーム４〜１０における左右の信号比（処理前比）については、図６（ｃ）に示すように、直前（フレーム３）の左右の信号比（３／１＝３）から、直後（フレーム１１）の左右の信号比（１．５／３＝０．５）へ、連続して変化するようにして補正の基準値となる信号比を求める。
具体的には、直前と直後の値（３と０．５）と直前と直後の間にあるフレーム（７つ）とに基づいて、各フレームでの左右の信号比の値を求める。具体的にはフレーム４〜１０間で２．５／８の値ずつ左右比を減少させるような補正を行う。
ｆ３以外の周波数帯域についても、各々同様の処理を行う。
その結果、処理直前から処理中、処理直後の左右の信号比が連続的に変化し、音像の移動が滑らかになり、違和感を軽減することができる。 Therefore, the left / right signal ratio (pre-processing ratio) in the noise reduction processing frames 4 to 10 is as shown in FIG. 6C from the right / left signal ratio (3/1 = 3) immediately before (frame 3). Then, the signal ratio that is the reference value for correction is obtained so as to continuously change to the right / left signal ratio (1.5 / 3 = 0.5) immediately after (frame 11).
Specifically, based on the immediately preceding and immediately following values (3 and 0.5) and the immediately preceding and immediately following frames (seven), the left and right signal ratio values in each frame are obtained. Specifically, correction is performed so as to decrease the left / right ratio by a value of 2.5 / 8 between frames 4-10.
The same processing is performed for frequency bands other than f3.
As a result, during the process from immediately before the process, the right / left signal ratio immediately after the process changes continuously, the movement of the sound image becomes smooth, and the uncomfortable feeling can be reduced.

以上、本実施形態によると、以下の効果を有する。
（１）カメラ１における補正部１４Ｂは、ノイズ低減処理後における周波数スペクトルの各周波数帯域における左右の信号比を、ノイズ低減処理前における周波数スペクトルの各周波数帯域における左右の信号比と略一致するように補正する。これにより、ステレオ信号をノイズ発生タイミングに合わせてノイズ低減処理する際に、そのノイズ低減処理に起因して生ずる目的音の音像変位を抑制することができる。その結果、ノイズ低減処理時（ＡＦ駆動時）における音像変位による違和感を防ぐことができる。 As described above, this embodiment has the following effects.
(1) The correction unit 14B in the camera 1 substantially matches the left / right signal ratio in each frequency band of the frequency spectrum after the noise reduction processing with the left / right signal ratio in each frequency band of the frequency spectrum before the noise reduction processing. To correct. Thereby, when the noise reduction process is performed on the stereo signal in accordance with the noise generation timing, the sound image displacement of the target sound caused by the noise reduction process can be suppressed. As a result, it is possible to prevent a sense of incongruity due to sound image displacement during noise reduction processing (AF driving).

（変形形態）
以上、説明した実施形態に限定されることなく、以下に示すような種々の変形や変更が可能であり、それらも本発明の範囲内である。
（１）上記実施形態は、本発明を音処理装置としてのカメラに適用したものである。しかし、本発明はこれに限らず、コンピュータを上記各構成要素として機能させるプログラムとして提供されるものであっても良い。 (Deformation)
The present invention is not limited to the above-described embodiment, and various modifications and changes as described below are possible, and these are also within the scope of the present invention.
(1) In the above embodiment, the present invention is applied to a camera as a sound processing apparatus. However, the present invention is not limited to this, and may be provided as a program that causes a computer to function as each of the above components.

（２）上記実施形態は、本発明をカメラにおけるＡＦ駆動音によるノイズを低減するように構成したものである。しかし、本発明はこれに限らず、ズーミングやブレ補正装置の作動ノイズの低減にも適用可能なものであり、さらに、カメラに限らず録音機能を備える光学機器に適用可能である。 (2) In the above embodiment, the present invention is configured to reduce noise caused by AF driving sound in a camera. However, the present invention is not limited to this, and can also be applied to the reduction of operating noise of zooming and blur correction devices. Furthermore, the present invention can be applied not only to cameras but also to optical equipment having a recording function.

（３）本実施形態では、カメラ本体１０に音情報処理部１４が含まれている例について説明したが、これに限定されず、カメラに備わるステレオマイクで録音した後、音処理装置のほうにデータを送信し、音処理装置で低減処理を行ってもよい。すなわち、音を集音する部分と、音の低減処理を施す部分とが分離していてもよい。
この場合、一例として以下のような流れで処理が行われる。
カメラ等に備わるステレオマイクで周囲の音が録音される。
そして、そのステレオマイクで録音した音が音データに変換され、記憶部に記憶される。
録音の際にＡＦ等のカメラ備わる機能の動作が行われた場合は、周囲の音を録音した音データとカメラに備わる機能の動作（例えばＡＦの動作）を行ったタイミングとを関連づけて記憶させる。
次に、記憶部に記憶された音データと動作タイミングとが出力部を介して、別体の音処理装置、例えばＰＣ等に出力される。
音処理装置は、制御部、記憶部、ノイズ低減処理部（以下、これらをＳＰ制御部、ＳＰ記憶部、ＳＰノイズ低減処理部という）を備える。
ＳＰ制御部は、カメラから入力部を介して入力されたその音データと動作タイミングと音データをＳＰ記憶部に記憶させる。
ＳＰ制御部は、ＳＰ記憶部に記憶された音データをＳＰ低減処理部へ出力し、ＳＰ低減処理部は音データに対してＡＦ音などの雑音の低減を行う。
なお、音の低減処理は、音データと共に記憶されている機能の動作タイミングに基づいて行う。その後、ＳＰ制御部は、低減処理された音データをＳＰ記憶部に記憶させる。このようにして、音データに対して低減処理を施してもよい。 (3) In the present embodiment, an example in which the sound information processing unit 14 is included in the camera body 10 has been described. However, the present invention is not limited to this, and after recording with a stereo microphone provided in the camera, Data may be transmitted and reduction processing may be performed by the sound processing device. That is, the part that collects the sound and the part that performs the sound reduction process may be separated.
In this case, as an example, processing is performed in the following flow.
Ambient sounds are recorded with a stereo microphone on the camera.
The sound recorded by the stereo microphone is converted into sound data and stored in the storage unit.
When an operation of a function such as AF is performed at the time of recording, sound data obtained by recording surrounding sounds and the timing at which the function operation (for example, AF operation) of the camera is performed are stored in association with each other. .
Next, the sound data and the operation timing stored in the storage unit are output to a separate sound processing device, such as a PC, via the output unit.
The sound processing apparatus includes a control unit, a storage unit, and a noise reduction processing unit (hereinafter referred to as an SP control unit, an SP storage unit, and an SP noise reduction processing unit).
The SP control unit stores the sound data, operation timing, and sound data input from the camera via the input unit in the SP storage unit.
The SP control unit outputs the sound data stored in the SP storage unit to the SP reduction processing unit, and the SP reduction processing unit reduces noise such as AF sound on the sound data.
Note that the sound reduction processing is performed based on the operation timing of the function stored together with the sound data. Thereafter, the SP control unit stores the reduced sound data in the SP storage unit. In this way, reduction processing may be performed on the sound data.

なお、実施形態及び変形形態は、適宜組み合わせて用いることもできるが、詳細な説明は省略する。また、本発明は以上説明した実施形態によって限定されることはない。 In addition, although embodiment and a deformation | transformation form can also be used in combination as appropriate, detailed description is abbreviate | omitted. Further, the present invention is not limited to the embodiment described above.

１：カメラ、１３：ステレオ集音装置、１３Ｌ：左マイク、１３Ｒ：右マイク、１４：音情報処理部、１４Ａ：ノイズ低減処理部、１４Ｂ：補正部、２０：レンズ鏡筒、２１：ＡＦエンコーダ、２２：ＡＦ駆動用モータ、ＳＩＬ，ＳＩＲ：ノイズ低減処理前の周波数スペクトル、ＲＩ：処理前比、ＳＮ：ノイズ周波数スペクトル、ＳＳＬ，ＳＳＲ：ノイズ低減処理後の周波数スペクトル、ＲＳ：処理後比、ＳＣＬ，ＳＣＲ：補正後の周波数スペクトル 1: Camera, 13: Stereo sound collector, 13L: Left microphone, 13R: Right microphone, 14: Sound information processing unit, 14A: Noise reduction processing unit, 14B: Correction unit, 20: Lens barrel, 21: AF encoder 22: AF driving motor, SIL, SIR: frequency spectrum before noise reduction processing, RI: pre-processing ratio, SN: noise frequency spectrum, SSL, SSR: frequency spectrum after noise reduction processing, RS: post-processing ratio, SCL, SCR: Frequency spectrum after correction

Claims

A sound collection unit having a first sound collection unit that outputs first sound data and a second sound collection unit that outputs second sound data;
A calculation unit that calculates a ratio of amplitudes in a plurality of frequency regions between the first sound data and the second sound data;
A removing unit that removes part of the sound data from at least one of the first sound data and the second sound data;
The ratio of the amplitude in the frequency domain of the first sound data and the second sound data from which the partial sound data has been removed from at least one of the first sound data and the second sound data before removal is determined. A sound processing apparatus comprising: a processing unit that approximates an amplitude ratio in the plurality of frequency regions.

The sound processing device according to claim 1, wherein the sound data removed by the removing unit is sound data generated by driving a mechanism of a camera lens.

The sound processing apparatus according to claim 2,
A detection unit that detects sound data generated by driving the mechanism of the camera lens;
The sound processing device, wherein the processing unit removes the sound data from at least one of the first sound data and the second sound data based on the sound data detected by the detection unit.

A process of inputting the first sound data by the first sound collection unit and the second sound data by the second sound collection unit;
A process of calculating a ratio of amplitudes in a plurality of frequency regions between the first sound data and the second sound data;
A process of removing a part of sound data from at least one of the first sound data and the second sound data;
The ratio of the amplitude in the frequency domain of the first sound data and the second sound data from which the partial sound data has been removed from at least one of the first sound data and the second sound data before removal is determined. A program for causing a computer to execute a process of approaching an amplitude ratio in the plurality of frequency regions.