JP4656395B2

JP4656395B2 - Recording apparatus, recording method, and recording program

Info

Publication number: JP4656395B2
Application number: JP2005098286A
Authority: JP
Inventors: 善樹石毛
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2005-03-30
Filing date: 2005-03-30
Publication date: 2011-03-23
Anticipated expiration: 2025-03-30
Also published as: JP2006279737A

Description

本発明は、記録装置、記録方法、記録プログラム、再生装置、再生方法、再生プログラムおよび記録再生装置に関する。 The present invention relates to a recording apparatus, a recording method, a recording program, a reproducing apparatus, a reproducing method, a reproducing program, and a recording / reproducing apparatus.

会議などにおいては、会議内容を記録するためにビデオなどにより動画を記録しておくことも可能であるが、動画の場合、データ容量が大きくなるため、長時間に渡っての撮影は困難である。そこで、音声だけでも全て記録しておきたい。しかし、音声だけでは情報が足りず、会議の様子、例えば誰が話しているのかなどを知ることができない、あるいは思い出すことができないなど課題がある。 In meetings, etc., it is possible to record a video with video to record the contents of the meeting, but in the case of a video, the data capacity is large, so it is difficult to shoot for a long time. . So, I want to record everything with just audio. However, there is a problem that information cannot be obtained by voice alone, and it is impossible to know or remember the state of the meeting, for example, who is talking.

そこで、従来、音声と静止画とを関連付けて記録する技術が提案されている。例えば、音声データの録音中に音量が変化した場合のみ、静止画を撮影記録するようにした技術（例えば特許文献１参照）や、音声を検出した方向にカメラを向けて撮影を行う技術（例えば特許文献２参照）が知られている。
特開２００４−１３５２３３号公報特開平０７−０５７１９９号公報 Therefore, conventionally, a technique for recording audio and a still image in association with each other has been proposed. For example, a technique for capturing and recording a still image only when the volume changes during recording of audio data (for example, see Patent Document 1), or a technique for performing imaging with a camera directed in the direction in which sound is detected (for example, Patent Document 2) is known.
JP 2004-135233 A Japanese Patent Application Laid-Open No. 07-057199

しかしながら、上記従来技術においては、音声データを利用して静止画の撮影を行うことを目的とした技術であり、音声データの再生時に撮影された静止画データを利用することはできない。つまり、音声データを再生する場合には、先頭から再生するか、おおよその感で早送りや巻き戻し操作を行って再生開始位置を決めて再生するしかなく、所望する位置から再生しようとしても操作が煩雑になるという問題があった。 However, the above-described conventional technique is a technique for capturing still images using audio data, and cannot use still image data captured during reproduction of audio data. In other words, when playing back audio data, there is no choice but to play from the beginning or determine the playback start position by performing fast forward or rewind operation with an approximate feeling, and even if you try to play from the desired position There was a problem of becoming complicated.

そこで本発明は、音声データの録音中に撮影した静止画データを該音声データの再生制御のために用いることで、容易かつ柔軟に音声データの再生制御を行うことができる記録装置、記録方法、記録プログラム、再生装置、再生方法、再生プログラムおよび記録再生装置を提供することを目的とする。 Accordingly, the present invention provides a recording apparatus, a recording method, and a recording method that can easily and flexibly control the reproduction of audio data by using still image data captured during recording of the audio data for reproduction control of the audio data. It is an object to provide a recording program, a reproducing apparatus, a reproducing method, a reproducing program, and a recording / reproducing apparatus.

上記目的達成のため、請求項１記載の発明による記録装置は、音声を録音して音声データとして取り込む録音手段と、前記録音手段による音声データの録音中の音声変化を検知する音声変化検知手段と、前記音声変化検知手段により所定の静止画撮影条件を満たす音声変化が検知される度に、静止画を撮影して静止画データとして取り込む撮影手段と、前記撮影手段により静止画が撮影された撮影時刻を取得する撮影時刻取得手段と、前記撮影手段により取り込まれた静止画データと前記撮影時刻取得手段により取得された撮影時刻とを、前記録音手段により取り込まれた音声データに対応付けて記録する記録手段と、前記音声変化検知手段により所定の静止画撮影条件を満たす音声変化が音声発生源の方向変化である場合、該音声発生源の方向で過去に撮影された静止画データが存在するか否かを判別する判別手段と、を具備し、前記判別手段により前記音声発生源の方向で過去に撮影された静止画データが存在すると判別された場合、前記撮影手段により取り込むべき静止画データに代えて、当該過去に撮影された静止画データを前記音声変化に対する静止画データとすることを特徴とする。
In order to achieve the above object, a recording apparatus according to the first aspect of the present invention comprises a recording means for recording voice and recording it as voice data, and a voice change detecting means for detecting a voice change during recording of the voice data by the recording means. Each time a change in sound that satisfies a predetermined still image shooting condition is detected by the sound change detection unit, a shooting unit that takes a still image and captures it as still image data; and a shooting in which a still image is shot by the shooting unit Recording time acquisition means for acquiring time, still image data acquired by the imaging means, and imaging time acquired by the imaging time acquisition means are recorded in association with the audio data acquired by the recording means. If a recording unit, given the still image shooting condition is satisfied voice change by the voice change detecting means is a direction change of the sound source, the voice source Discriminating means for discriminating whether or not there is still image data photographed in the past in the direction, and discriminating that there is still image data photographed in the past in the direction of the sound source by the discriminating means. In this case, the still image data photographed in the past is used as still image data corresponding to the sound change, instead of the still image data to be captured by the photographing means .

また、好ましい態様として、例えば請求項２記載のように、請求項１記載の記録装置において、前記撮影時刻取得手段は、前記録音手段による録音開始からの録音経過時間を計時し、前記撮影手段により静止画が撮影された時点における録音経過時間を、前記静止画の撮影時刻として取得するようにしてもよい。 Further, as a preferred aspect, for example, as in claim 2, in the recording apparatus according to claim 1, the photographing time acquisition means measures an elapsed recording time from the start of recording by the recording means, and the photographing means The elapsed recording time at the time when the still image was shot may be acquired as the shooting time of the still image.

また、好ましい態様として、例えば請求項３記載のように、請求項１または２に記載の記録装置において、前記撮影手段は、前記音声変化検知手段により、前記所定の静止画撮影条件を満たす音声変化として、音量変化、周波数帯変化、または音声方向変化の少なくともいずれかであることが検知されると、静止画を撮影して静止画データとして取り込むようにしてもよい。 Further, as a preferred aspect, for example, as in claim 3, in the recording apparatus according to claim 1 or 2, the photographing unit is configured to change the sound that satisfies the predetermined still image photographing condition by the sound change detecting unit. If a change in volume, a change in frequency band, or a change in voice direction is detected, a still image may be captured and captured as still image data.

また、好ましい態様として、例えば請求項４記載のように、請求項１ないし３のいずれかに記載の記録装置において、前記音声変化検知手段により前記撮影手段の撮影パラメータの変更条件を満たす音声変化が検知されると、該音声変化に基づいて前記撮影手段の撮影パラメータを変更する撮影パラメータ変更手段をさらに具備し、前記撮影手段は、前記音声変化検知手段により所定の静止画撮影条件を満たす音声変化が検知されると、前記撮影パラメータ変更手段により変更された撮影パラメータに基づいて静止画を撮影して静止画データとして取り込むようにしてもよい。 Further, as a preferred aspect, for example, as in claim 4, in the recording apparatus according to any one of claims 1 to 3, the sound change detection means causes a sound change that satisfies a change condition of a shooting parameter of the shooting means. When detected, the camera further includes a shooting parameter changing unit that changes a shooting parameter of the shooting unit based on the sound change, and the shooting unit is configured to change a sound that satisfies a predetermined still image shooting condition by the sound change detecting unit. Is detected, a still image may be captured based on the imaging parameter changed by the imaging parameter changing means and captured as still image data.

また、好ましい態様として、例えば請求項５記載のように、請求項４記載の記録装置において、前記撮影パラメータ変更手段は、前記撮影パラメータの変更条件として、前記音声変化検知手段により音声発生源の方向が所定角度以上変化したことが検知されると、前記撮影手段の撮影方向を音声発生源の方向に向けるように撮影パラメータを変更し、前記撮影パラメータに基づいて前記撮影手段による撮影方向を音声発生源の方向に向ける撮影方向変更手段をさらに具備するようにしてもよい。 Further, as a preferable aspect, for example, as in claim 5, in the recording apparatus according to claim 4, the photographing parameter changing unit is configured to change the direction of the sound source by the sound change detecting unit as the photographing parameter changing condition. When it is detected that the camera has changed more than a predetermined angle, the shooting parameter is changed so that the shooting direction of the shooting unit is directed to the direction of the sound source, and the shooting direction by the shooting unit is generated based on the shooting parameter. You may make it further comprise the imaging | photography direction change means which orient | assigns to the direction of a source.

また、好ましい態様として、例えば請求項６記載のように、請求項４記載の記録装置において、前記撮影パラメータ変更手段は、前記撮影パラメータの変更条件として、前記音声変化検知手段により所定以上の音量を有する周波数帯の音声方向が所定角度以上変化したことが検知されると、前記撮影手段の撮影方向を変更するように撮影パラメータを変更し、前記撮影パラメータ変更手段により変更された撮影パラメータに基づいて、前記撮影手段の撮影方向を所定以上の音量を有する周波数帯の音声方向に向ける撮影方向変更手段をさらに具備するようにしてもよい。
Further, as a preferred aspect, for example, as in claim 6, in the recording apparatus according to claim 4, the shooting parameter changing means sets the sound volume more than a predetermined volume by the sound change detecting means as the changing condition of the shooting parameter. When it is detected that the sound direction of the frequency band has changed by a predetermined angle or more, the shooting parameter is changed so as to change the shooting direction of the shooting unit, and based on the shooting parameter changed by the shooting parameter changing unit Further, it may further comprise a photographing direction changing means for directing the photographing direction of the photographing means to the sound direction of a frequency band having a predetermined volume or higher .

また、好ましい態様として、例えば請求項７記載のように、請求項１記載の記録装置において、前記撮影手段は、前記過去に撮影された静止画データに対応する音声データと前記録音手段により取り込まれた音声データとの間に、所定の静止画撮影条件を満たす音声変化がある場合、静止画を撮影して新たな静止画データとして取り込むようにしてもよい。
Further, as a preferred aspect, for example, as in claim 7, in the recording apparatus according to claim 1 , the photographing unit is captured by audio data corresponding to the still image data photographed in the past and the recording unit. If there is a change in sound that satisfies a predetermined still image shooting condition with the sound data, the still image may be shot and captured as new still image data.

また、上記目的達成のため、請求項８記載の発明による記録方法は、音声データの録音中に音声変化を検知し、所定の静止画撮影条件を満たす音声変化が検知される度に、静止画を撮影して静止画データとして取り込み、静止画が撮影された撮影時刻を取得し、前記取り込まれた静止画データと前記取得された撮影時刻とを、前記録音中の音声データに対応付けて記録し、前記所定の静止画撮影条件を満たす音声変化が音声発生源の方向変化であって該音声発生源の方向で過去に撮影された静止画データが存在すると判別された場合は、前記取り込むべき静止画データに代えて、前記過去に撮影された静止画データを前記音声変化に対する静止画データとすることを特徴とする。
In order to achieve the above object, the recording method according to the eighth aspect of the invention detects a change in sound during recording of sound data, and each time a change in sound that satisfies a predetermined still image shooting condition is detected, a still image is recorded. Is captured and captured as still image data, the shooting time when the still image was captured is acquired, and the captured still image data and the acquired shooting time are recorded in association with the audio data being recorded. If it is determined that the sound change that satisfies the predetermined still image shooting condition is a change in the direction of the sound generation source and there is still image data shot in the past in the direction of the sound generation source, Instead of still image data, still image data taken in the past is used as still image data corresponding to the sound change .

また、上記目的達成のため、請求項９記載の発明による記録プログラムは、音声を録音して音声データとして取り込む手順と、前記音声データの音声変化を検知する手順と、所定の静止画撮影条件を満たす音声変化が検知される度に、静止画を撮影して静止画データとして取り込む手順と、前記静止画が撮影された撮影時刻を取得する手順と、前記取り込まれた静止画データと前記取得された撮影時刻とを、前記取り込まれた音声データに対応付けて記録する手順と、所定の静止画撮影条件を満たす音声変化が音声発生源の方向変化である場合、該音声発生源の方向で過去に撮影された静止画データが存在するか否かを判別する手順と、前記音声発生源の方向で過去に撮影された静止画データが存在すると判別された場合、前記取り込むべき静止画データに代えて、当該過去に撮影された静止画データを前記音声変化に対する静止画データとする手順と、をコンピュータに実行させることを特徴とする。
In order to achieve the above object, a recording program according to claim 9 includes a procedure for recording voice and recording it as voice data, a procedure for detecting a voice change in the voice data, and predetermined still image shooting conditions. Each time a change in sound is detected, a procedure for capturing a still image and capturing it as still image data, a procedure for acquiring a shooting time when the still image was captured, and the captured still image data and the acquired If the change in sound that satisfies the predetermined still image shooting condition is a change in direction of the sound source, the past recording is performed in the direction of the sound source. If it is determined that there is still image data captured in the past in the direction of the sound source, it should be captured. Tomega instead data, characterized in that to execute the still image data captured in the past and steps to still image data, to a computer for the voice change.

本発明によって、静止画データの撮影時刻を該音声データの再生制御に用いれば、容易かつ柔軟に音声データの再生制御を行うことができるという利点が得られる。また、音声発生源の方向で過去に撮影した既存の静止画データを流用することでデータ容量を小さくすることができるという利点が得られる。
According to the present invention, if the shooting time of still image data is used for the reproduction control of the audio data, there is an advantage that the reproduction control of the audio data can be easily and flexibly performed. Moreover, there is an advantage that the data capacity can be reduced by diverting existing still image data taken in the past in the direction of the sound source.

以下、本発明の実施の形態を、図面を参照して説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

Ａ．第１実施形態
Ａ−１．第１実施形態の構成
図１は、本発明の第１実施形態によるデジタルカメラの構成を示すブロック図である。図において、画像取得部１０は、レンズ１１、シャッター１２、ＬＰＦ１３からなる。レンズ１１は、通常の光学レンズであり、非球面レンズを重ねたレンズ群からなる。シャッター１２は、シャッタボタンが操作されると、制御部２０によって駆動されるドライバ１４により動作する、所謂メカニカルシャッタである。なお、デジタルカメラによっては、メカニカルシャッタを備えない場合もあり、沈胴式のレンズ構造、メカニカルズームを搭載する機種の場合、これらの駆動制御もドライバ１４で行う。ＬＰＦ１３は、水晶ローパスフィルタであり、モアレの発生を防ぐために搭載されている。 A. First embodiment A-1. Configuration of First Embodiment FIG. 1 is a block diagram showing a configuration of a digital camera according to a first embodiment of the present invention. In the figure, the image acquisition unit 10 includes a lens 11, a shutter 12, and an LPF 13. The lens 11 is a normal optical lens and includes a lens group in which aspherical lenses are stacked. The shutter 12 is a so-called mechanical shutter that is operated by a driver 14 driven by the control unit 20 when a shutter button is operated. Depending on the digital camera, a mechanical shutter may not be provided. In the case of a model equipped with a retractable lens structure and a mechanical zoom, these drivers are also controlled by the driver 14. The LPF 13 is a crystal low-pass filter and is mounted to prevent the occurrence of moire.

次に、アナログ信号処理部１５は、撮像センサ（ＣＣＤ，ＣＭＯＳ）１６、サンプリング／信号増幅処理部１７、Ａ／Ｄコンバータ１８からなる。撮像センサ１６は、被写体画像（イメージ）を結像し、ＲＧＢの各色の光の強さを、電流値に変換する。サンプリング／信号増幅処理部１７は、ノイズや色むらを抑えるための相関二重サンプリング処理や信号増幅処理を行う。Ａ／Ｄコンバータ１８は、アナログフロントエンドとも呼ばれ、サンプリング・増幅したアナログ信号をデジタル信号に変換する（ＲＧＢ，ＣＭＹＧ各色について１２ｂｉｔデータに変換してバスラインに出力する）。 Next, the analog signal processing unit 15 includes an image sensor (CCD, CMOS) 16, a sampling / signal amplification processing unit 17, and an A / D converter 18. The imaging sensor 16 forms a subject image (image), and converts the intensity of light of each color of RGB into a current value. The sampling / signal amplification processing unit 17 performs correlated double sampling processing and signal amplification processing for suppressing noise and color unevenness. The A / D converter 18 is also called an analog front end, and converts the sampled / amplified analog signal into a digital signal (converts each color of RGB and CMYG into 12-bit data and outputs it to the bus line).

次に、制御部（ＣＰＵ）２０は、後述するプログラムメモリ格納されるプログラムに従ってデジタルカメラ１（撮像装置）の全体を制御する。特に、本第１実施形態では、音声データの録音中に、静止画の撮影条件を満たす音声変化があった場合に静止画を撮影し、その撮影時刻（音声データの録音開始からの経過時間）と静止画データとを、音声データと共に記録保存する一方、記録した音声データの再生時には、静止画データの撮影時刻（音声データの録音開始からの経過時間＝音声データの再生開始からの経過時間）に従って、音声データの再生開始位置を制御するようになっている。 Next, the control unit (CPU) 20 controls the entire digital camera 1 (imaging device) according to a program stored in a program memory described later. In particular, in the first embodiment, when there is a change in sound that satisfies the still image shooting condition during recording of the sound data, the still image is shot, and the shooting time (elapsed time from the start of recording of the audio data). And still image data are recorded and saved together with the audio data, and when the recorded audio data is played back, the shooting time of the still image data (elapsed time from the start of recording of the audio data = elapsed time from the start of playback of the audio data) Accordingly, the playback start position of the audio data is controlled.

プレビューエンジン２２は、録画モード（記録モード、撮影モードともいう）において、画像取得部１０、アナログ信号処理部１５を介して入力されたデジタルデータ、もしくはシャッター操作検出直後、イメージバッファ２６に格納されたデジタルデータ、および、画像メモリ３１に格納されたデジタルデータを表示部２５に表示させるために間引き処理を行う。Ｄ／Ａコンバータ２３は、プレビューエンジン２２により間引き処理されたデジタルデータを変換し、後段のドライバ２４に出力する。ドライバ２４は、後段の表示部２５に表示されるデジタルデータを一時記憶するバッファ領域を備え、キー操作部２７、制御部２０を介して入力された制御信号に基づいて表示部２５を駆動させる。表示部２５は、カラーＴＦＴ液晶や、ＳＴＮ液晶などからなり、プレビュー画像や、撮影後の画像データ、設定メニューなどを表示する。 The preview engine 22 is stored in the image buffer 26 immediately after detection of digital data input via the image acquisition unit 10 or the analog signal processing unit 15 in the recording mode (also referred to as recording mode or photographing mode) or immediately after the shutter operation is detected. Thinning processing is performed in order to display the digital data and the digital data stored in the image memory 31 on the display unit 25. The D / A converter 23 converts the digital data thinned out by the preview engine 22 and outputs it to the driver 24 at the subsequent stage. The driver 24 includes a buffer area for temporarily storing digital data displayed on the display unit 25 at the subsequent stage, and drives the display unit 25 based on a control signal input via the key operation unit 27 and the control unit 20. The display unit 25 includes a color TFT liquid crystal, an STN liquid crystal, or the like, and displays a preview image, image data after shooting, a setting menu, and the like.

イメージバッファ２６は、アナログ信号処理部１５、もしくはデジタル信号処理部２８を介して入力され、デジタル信号処理部２８に渡すまで一時的に撮影直後のデジタルデータを格納する。キー操作部２７は、シャッタボタンや、記録／再生モード選択スライドスイッチ、メニューボタン、十字キー（中央押しで決定）などからなる。 The image buffer 26 temporarily stores digital data immediately after photographing until it is input via the analog signal processing unit 15 or the digital signal processing unit 28 and passed to the digital signal processing unit 28. The key operation unit 27 includes a shutter button, a recording / playback mode selection slide switch, a menu button, a cross key (determined by pressing the center), and the like.

デジタル信号処理部２８は、アナログ信号処理部１５を介して入力されたデジタルデータに対して、ホワイトバランス処理、色処理、階調処理、輸郭強調、ＲＧＢ形式からＹＵＶ形式への変換、ＹＵＶ形式からＪＰＥＧ形式への変換を行う。また、デジタル信号処理部２８は、画像取得部１０およびアナログ信号処理部１５により取り込んだ画像データからＥｘｉｆ規格に従った画像ファイルを生成する。画像圧縮／伸張処理部２９は、デジタル信号処理部２８を介して入力されたデジタルデータをＪＰＥＧ方式に圧縮符号化したり、再生モードにおいては、ＪＰＥＧ形式のファイルを伸張したりする。 The digital signal processing unit 28 performs white balance processing, color processing, gradation processing, contour emphasis, conversion from RGB format to YUV format, YUV format for digital data input via the analog signal processing unit 15. To JPEG format. The digital signal processing unit 28 also generates an image file according to the Exif standard from the image data captured by the image acquisition unit 10 and the analog signal processing unit 15. The image compression / decompression processing unit 29 compresses and encodes the digital data input via the digital signal processing unit 28 into the JPEG format, or decompresses the JPEG format file in the reproduction mode.

プログラムメモリ３０は、制御部２０にロードされる各種プログラムや、ベストショット機能におけるＥＶ値、色補正情報などを格納する。画像メモリ３１は、イメージバッファ２６に一時的に保持された画像データや、各種ファイル形式に変換されたデジタルデータを格納する。カードＩ／Ｆ３２は、外部記録媒体３３と撮像装置本体との間のデータ交換を制御する。外部記録媒体３３は、コンパクトフラッシュ（登録商標）、メモリースティック、ＳＤカード等からなる着脱可能な記録媒体である。外部接続用Ｉ／Ｆ３４は、ＵＳＢコネクター用スロットなどからなり、パーソナルコンピュータなどと接続され、撮影した画像データの転送などに用いられる。電池３５は、使い捨ての一次電池や、充電可能な二次電池などからなり、上述した各部を駆動するための電力を供給する。 The program memory 30 stores various programs loaded in the control unit 20, EV values in the best shot function, color correction information, and the like. The image memory 31 stores image data temporarily held in the image buffer 26 and digital data converted into various file formats. The card I / F 32 controls data exchange between the external recording medium 33 and the imaging apparatus main body. The external recording medium 33 is a detachable recording medium composed of a compact flash (registered trademark), a memory stick, an SD card, or the like. The external connection I / F 34 includes a USB connector slot and the like, is connected to a personal computer or the like, and is used for transferring photographed image data. The battery 35 includes a disposable primary battery, a rechargeable secondary battery, and the like, and supplies power for driving the above-described units.

Ａ−２．第１実施形態の動作
次に、上述した第１実施形態の動作について説明する。
（１）録音処理
図２は、本第１実施形態によるデジタルカメラ１の録音処理における動作を説明するためのフローチャートである。まず、録音開始の指示があったか否かを判断し（ステップＳ１０）、録音開始の指示があると、マイクから新たに入力される音声データを記録する（ステップＳ１２）。次に、音声データの記録中に、静止画の撮影条件を満たす音声変化があったか否かを判断する（ステップＳ１４）。 A-2. Operation of First Embodiment Next, the operation of the first embodiment described above will be described.
(1) Recording Process FIG. 2 is a flowchart for explaining the operation in the recording process of the digital camera 1 according to the first embodiment. First, it is determined whether or not an instruction to start recording is given (step S10). When there is an instruction to start recording, voice data newly input from the microphone is recorded (step S12). Next, it is determined whether or not there is a change in sound that satisfies the still image shooting condition during recording of the sound data (step S14).

ここで、本第１実施形態では、上記静止画の撮影条件としては、図４に示すように、音量の変化が１０ｄＢ以上あった場合、周波数帯の変化が２ｋＨｚ以上あった場合、音声方向の変化が１０度以上あった場合などを想定している。例示した撮影条件のいずれか１つでも該当するか、あるいは２つ以上の組み合わせに該当した場合に静止画を撮影する。また、これらの撮影条件、およびその組み合わせは、ユーザにより任意に登録可能であってもよい。 Here, in the first embodiment, as the still image shooting conditions, as shown in FIG. 4, when the change in volume is 10 dB or more, when the change in frequency band is 2 kHz or more, It is assumed that the change is 10 degrees or more. A still image is shot when any one of the exemplified shooting conditions is met or when a combination of two or more of the shooting conditions is met. Further, these shooting conditions and combinations thereof may be arbitrarily registered by the user.

そして、静止画の撮影条件を満たす音声変化がない場合には、録音終了の指示があったか否かを判断し（ステップＳ２０）、録音終了の指示がなければ、ステップＳ１４へ戻り、音声データの記録を継続する。 If there is no change in audio that satisfies the still image shooting condition, it is determined whether or not an instruction to end recording is given (step S20). If there is no instruction to end recording, the process returns to step S14 to record audio data. Continue.

一方、音声データの記録中に、上述した静止画の撮影条件を満たす音声変化があった場合には、撮像素子から得られた画像データを静止画として記録し（ステップＳ１６）、静止画データと撮影時刻（音声データの録音開始からの経過時間）とを関連付けるマーキング情報を記憶する（ステップＳ１８）。そして、上述したように、録音終了の指示があったか否かを判断し（ステップＳ２０）、録音終了の指示がなければ、ステップＳ１４へ戻り、音声データの記録を継続する。 On the other hand, if there is an audio change that satisfies the above-described still image shooting conditions during recording of the audio data, the image data obtained from the image sensor is recorded as a still image (step S16). Marking information that correlates the shooting time (elapsed time from the start of recording of audio data) is stored (step S18). Then, as described above, it is determined whether or not an instruction to end recording is given (step S20). If there is no instruction to end recording, the process returns to step S14 and recording of audio data is continued.

以降、音声データの記録中に、静止画の撮影条件を満たす音声変化があった場合には、撮像素子から得られた画像データを静止画として記録し（ステップＳ１６）、静止画データと撮影時刻（音声データの録音開始からの経過時間）とを関連付けるマーキング情報を記憶する（ステップＳ１８）、という動作を繰り返す。 Thereafter, when there is a change in sound that satisfies the still image shooting condition during recording of the sound data, the image data obtained from the image sensor is recorded as a still image (step S16), and the still image data and the shooting time are recorded. The operation of storing the marking information that associates (the elapsed time from the start of recording of the audio data) is repeated (step S18).

そして、録音終了の指示があると、保存ファイル形式（ＷＡＶ形式やＭｏｔｉｏｎ−ＪＰＥＧ形式など）を指定し（ステップＳ２２）、指定されたファイル形式で、録音された音声データおよびマーキング情報をファイル内に格納して保存する（ステップＳ２４）。そして、当該録音処理を終了する。 When there is an instruction to end recording, a storage file format (WAV format, Motion-JPEG format, etc.) is designated (step S22), and the recorded audio data and marking information in the designated file format are stored in the file. Store and save (step S24). Then, the recording process ends.

（２）再生処理
次に、図３は、本第１実施形態によるデジタルカメラ１の再生処理における動作を説明するためのフローチャートである。まず、ユーザは、再生対象となる音声データのファイルを指定する（ステップＳ３０）。音声データが指定されると、指定された音声データのファイル内のマーキング情報に基づいて、各静止画データを縮小して、図５に示すように、表示部２５のインデックス表示領域２５１に一覧表示する（ステップＳ３２）。 (2) Reproduction Process Next, FIG. 3 is a flowchart for explaining the operation in the reproduction process of the digital camera 1 according to the first embodiment. First, the user designates an audio data file to be reproduced (step S30). When the audio data is designated, each still image data is reduced based on the marking information in the file of the designated audio data, and a list is displayed in the index display area 251 of the display unit 25 as shown in FIG. (Step S32).

次に、音声データの再生位置を先頭時刻に設定し（ステップＳ３４）、インデックス表示領域において静止画データが選択されたか否かを判断する（ステップＳ３６）。すなわち、ユーザは、インデックス表示領域に一覧表示された静止画データを見て、音声データの再生位置を指示するために、再生位置（再生時刻）に対応する静止画データを選択する。 Next, the playback position of the audio data is set to the start time (step S34), and it is determined whether still image data has been selected in the index display area (step S36). That is, the user views still image data displayed in a list in the index display area, and selects still image data corresponding to the reproduction position (reproduction time) in order to indicate the reproduction position of the audio data.

そして、ユーザによりいずれかの静止画データが選択された場合には、選択された静止画データの撮影時刻（音声データの録音開始からの経過時間）を音声データの再生位置として設定し（ステップＳ３８）、該再生位置から音声データを再生する（ステップＳ４０）。一方、静止画データが選択されなかった場合には、ステップＳ３４で設定した音声データの先頭を再生位置として音声データを再生する（ステップＳ４０）。 If any of the still image data is selected by the user, the shooting time of the selected still image data (the elapsed time from the start of recording of the audio data) is set as the audio data reproduction position (step S38). ), Audio data is reproduced from the reproduction position (step S40). On the other hand, if still image data is not selected, the audio data is reproduced with the beginning of the audio data set in step S34 as the reproduction position (step S40).

次に、再生位置の直近にあるマーキング情報が変化したか否かを判断し（ステップＳ４２）、マーキング情報が変化した場合には、該マーキング情報に対応する静止画データを表示部２５の静止画表示領域２５２に表示する（ステップＳ４４）。次に、再生終了の指示があったか否かを判断し（ステップＳ４６）、再生終了の指示がなければ、ステップＳ３６へ戻り、上述した処理を繰り返す。 Next, it is determined whether or not the marking information closest to the reproduction position has changed (step S42). If the marking information has changed, the still image data corresponding to the marking information is displayed on the still image on the display unit 25. It displays on the display area 252 (step S44). Next, it is determined whether or not an instruction to end reproduction is given (step S46). If there is no instruction to end reproduction, the process returns to step S36 and the above-described processing is repeated.

すなわち、音声データの再生途中で、インデックス表示領域に一覧表示されている静止画データのうち、いずれかの静止画データが選択される度に、その静止画データの撮影時刻（音声データの録音開始からの経過時間）を音声データの再生位置に設定し、該再生位置から音声データを再生する、という一連の処理を実行する。そして、再生終了の指示があった場合には、音声データの再生を終了し、当該処理を終了する。 That is, each time still image data is selected from among the still image data listed in the index display area during the reproduction of the audio data, the shooting time of the still image data (recording of audio data starts) (Elapsed time from) is set as the reproduction position of the audio data, and a series of processes of reproducing the audio data from the reproduction position is executed. If there is an instruction to end playback, the playback of the audio data is ended and the processing is ended.

上述した第１実施形態によれば、音声データの録音中に、静止画の撮影条件を満たす音声変化があった場合に静止画を撮影し、その撮影時刻（音声データの録音開始からの経過時間）と静止画データとを、音声データと共に記録保存する一方、記録した音声データの再生時には、静止画データの撮影時刻（音声データの録音開始からの経過時間＝音声データの再生開始からの経過時間）に従って音声データの再生開始位置を制御するようにしたので、容易に、かつ柔軟に再生制御を行うことができるようになる。 According to the above-described first embodiment, during audio data recording, if there is a change in audio that satisfies the still image shooting condition, a still image is shot, and the shooting time (elapsed time from the start of audio data recording) ) And still image data are recorded and saved together with the audio data. When the recorded audio data is reproduced, the shooting time of the still image data (elapsed time from the start of recording of the audio data = elapsed time from the start of reproduction of the audio data) ), The playback start position of the audio data is controlled according to the above, so that playback control can be performed easily and flexibly.

Ｂ．第２実施形態
次に、本発明の第２実施形態について説明する。本第２実施形態は、後述するようにデジタルカメラにも適用可能であるが、特に、撮影方向を変える機構を備える監視カメラやウェブカメラに有用な技術である。本第２実施形態では、撮影パラメータ（撮影方向、ズーム倍率など）の変更条件を満たす音声変化を検知すると、該音声変化に応じた撮影パラメータに変更し、静止画の撮影条件を満たす音声変化があると、その時点の静止画を撮影パラメータに従って撮影するようになっている。 B. Second Embodiment Next, a second embodiment of the present invention will be described. The second embodiment can be applied to a digital camera as will be described later, but is a technique that is particularly useful for a surveillance camera or a web camera having a mechanism for changing a shooting direction. In the second embodiment, when a change in sound that satisfies the change condition of the shooting parameter (shooting direction, zoom magnification, etc.) is detected, the sound is changed to the shooting parameter corresponding to the change in sound, and the sound change that satisfies the still image shooting condition is detected. If there is, the still image at that time is shot according to the shooting parameters.

Ｂ−１．第２実施形態の構成
図６は、本第２実施形態による監視カメラ４０の構成を示すブロック図である。図において、画像取得部４１は、レンズ４２、ＬＰＦ４３からなる。レンズ４２は、通常の光学レンズであり、非球面レンズを重ねたレンズ群からなる。ＬＰＦ４３は、水晶ローパスフィルタであり、モアレの発生を防ぐために搭載されている。 B-1. Configuration of Second Embodiment FIG. 6 is a block diagram showing the configuration of the surveillance camera 40 according to the second embodiment. In the figure, the image acquisition unit 41 includes a lens 42 and an LPF 43. The lens 42 is a normal optical lens and includes a lens group in which aspherical lenses are stacked. The LPF 43 is a crystal low-pass filter and is mounted to prevent the occurrence of moire.

次に、アナログ信号処理部４５は、撮像センサ（ＣＣＤ，ＣＭＯＳ）４６、サンプリング／信号増幅処理部４７、Ａ／Ｄコンバータ４８からなる。撮像センサ４６は、被写体画像（イメージ）を結像し、ＲＧＢの各色の光の強さを、電流値に変換する。サンプリング／信号増幅処理部４７は、ノイズや色むらを抑えるための相関二重サンプリング処理や信号増幅処理を行う。Ａ／Ｄコンバータ４８は、アナログフロントエンドとも呼ばれ、サンプリング・増幅したアナログ信号をデジタル信号に変換する（ＲＧＢ，ＣＭＹＧ各色について１２ｂｉｔデータに変換してバスラインに出力する）。ドライバ４９は、制御部５０の制御の下、画像取得部４０およびアナログ信号処理部４５を駆動する。
メカニカルズーム Next, the analog signal processing unit 45 includes an image sensor (CCD, CMOS) 46, a sampling / signal amplification processing unit 47, and an A / D converter 48. The imaging sensor 46 forms a subject image (image) and converts the intensity of light of each color of RGB into a current value. The sampling / signal amplification processing unit 47 performs correlated double sampling processing and signal amplification processing for suppressing noise and color unevenness. The A / D converter 48 is also called an analog front end, and converts the sampled / amplified analog signal into a digital signal (converts each color of RGB and CMYG into 12-bit data and outputs it to the bus line). The driver 49 drives the image acquisition unit 40 and the analog signal processing unit 45 under the control of the control unit 50.
Mechanical zoom

次に、制御部（ＣＰＵ）５０は、後述するプログラムメモリ格納されるプログラムに従って監視カメラ４０（撮像装置）の全体を制御する。特に、本第２実施形態では、音声データの録音中に、撮影パラメータの変更条件を満たす音声変化があると、該音声変化に応じて撮影パラメータを変更し、かつ、静止画の撮影条件を満たす音声変化があると、撮影パラメータに従って静止画を撮影し、その撮影時刻（音声データの録音開始からの経過時間）と静止画データとを、音声データと共に記録保存する一方、記録した音声データの再生時には、静止画データの撮影時刻に従って、音声データの再生開始位置を制御するようになっている。 Next, the control unit (CPU) 50 controls the entire surveillance camera 40 (imaging device) according to a program stored in a program memory described later. In particular, in the second embodiment, if there is a change in sound that satisfies the shooting parameter change condition during recording of the sound data, the shooting parameter is changed according to the sound change and the still image shooting condition is satisfied. When there is a change in audio, a still image is taken according to the shooting parameters, and the shooting time (elapsed time from the start of recording of the audio data) and the still image data are recorded and saved together with the audio data, while the recorded audio data is played back. In some cases, the playback start position of the audio data is controlled according to the shooting time of the still image data.

プログラムメモリ５１は、制御部５０にロードされる各種プログラムなどを格納する。イメージバッファ５２は、アナログ信号処理部４５からの撮影直後のデジタルデータを、デジタル信号処理部５３に渡すまで一時的に格納する。デジタル信号処理部５３は、アナログ信号処理部４５から供給される撮像データに対して各種画像処理を施す。なお、該デジタル信号処理部５３による処理は、該監視カメラ４０が接続されるコンピュータ側で行うようにしてもよい。 The program memory 51 stores various programs loaded on the control unit 50. The image buffer 52 temporarily stores the digital data immediately after shooting from the analog signal processing unit 45 until it is passed to the digital signal processing unit 53. The digital signal processing unit 53 performs various types of image processing on the imaging data supplied from the analog signal processing unit 45. The processing by the digital signal processing unit 53 may be performed on the computer side to which the monitoring camera 40 is connected.

マイク５４は、画像取得部４１の近傍に配置され、外部音を集音する。音声処理部５５は、マイク５４により集音された音声信号をデジタルデータである音声データに変換する。ズーム駆動部５６は、制御部５０の制御の下、画像取得部４のレンズ４２を駆動し、メカニカルズームを行う。なお、ズームをデジタル処理のみで行うことも可能であり、この場合、レンズ部４２を駆動することなく、前述したデジタル信号処理部５３、もしくは該監視カメラ４０が接続されるコンピュータ側で行うようにしてもよい。 The microphone 54 is disposed in the vicinity of the image acquisition unit 41 and collects external sounds. The audio processing unit 55 converts the audio signal collected by the microphone 54 into audio data that is digital data. The zoom drive unit 56 drives the lens 42 of the image acquisition unit 4 under the control of the control unit 50 to perform mechanical zoom. Note that zooming can be performed only by digital processing. In this case, zooming is performed on the above-described digital signal processing unit 53 or the computer connected to the monitoring camera 40 without driving the lens unit 42. May be.

パーン駆動部５７は、制御部５０の制御の下、画像取得部４１を駆動して、水平方向または／および垂直方向へ回動する、いわゆるパーン駆動を行う。なお、パーンに関してもズームと同様に広角レンズで撮影した静止画データをトリミングすることで実現するようにしてもよい。外部接続用Ｉ／Ｆ５８は、ＵＳＢコネクター用スロットなどからなり、パーソナルコンピュータなどと接続され、撮影した画像データおよび音声データの転送に用いられる。 The pan drive unit 57 drives the image acquisition unit 41 under the control of the control unit 50 and performs so-called pan drive that rotates in the horizontal direction and / or the vertical direction. Note that Pann may also be realized by trimming still image data captured with a wide-angle lens, as with zooming. The external connection I / F 58 includes a USB connector slot and the like, is connected to a personal computer or the like, and is used to transfer captured image data and audio data.

ここで、図７は、本第２実施形態における撮影パラメータの変更条件、および撮影パラメータの変更内容の一例を示す概念図である。例えば、音声方向の変化が２０度以上であった場合には、音声方向にカメラ（画像取得部）をパーンしたり、音量が５０ｄＢ以上の周波数帯の音声方向が１０度以上変化した場合には、ズーム倍率を広角側に１段階変更したりする。 Here, FIG. 7 is a conceptual diagram showing an example of the shooting parameter changing condition and the shooting parameter changing contents in the second embodiment. For example, when the change in the voice direction is 20 degrees or more, when the camera (image acquisition unit) is panned in the voice direction, or when the voice direction in the frequency band with a volume of 50 dB or more is changed by 10 degrees or more. The zoom magnification is changed by one step to the wide angle side.

Ｂ−２．第２実施形態の動作
次に、上述した第２実施形態の動作について説明する。以下では、前述した第１実施形態と異なる部分についてのみ説明し、特に説明しない部分については前述した第１実施形態と同様の処理を実行する。ここで、図８は、本第２実施形態によるデジタル監視カメラ４０の録音処理における一部動作を説明するためのフローチャートである。 B-2. Operation of the Second Embodiment Next, the operation of the second embodiment described above will be described. In the following, only the parts different from the first embodiment will be described, and the same processing as that of the first embodiment will be executed for the parts that are not particularly described. Here, FIG. 8 is a flowchart for explaining a partial operation in the recording process of the digital surveillance camera 40 according to the second embodiment.

まず、録音開始の指示があると、マイク５４から新たに入力される音声データを記録する（ステップＳ１２）。次に、音声データの記録中に、撮影パラメータの変更条件を満たす音声変化があったか否かを判断する（ステップＳ５０）。そして、撮影パラメータの変更条件を満たす音声変化がない場合には、撮影パラメータを変更することなく、静止画の撮影条件を満たす音声変化があったか否かを判断し（ステップＳ１４）、以下、前述した第１実施形態と同様の処理を実行する。したがって、この場合、静止画の撮影条件を満たす音声変化があった場合には、同じ撮影パラメータで静止画を撮影し、その撮影時刻（音声データの録音開始からの経過時間）と静止画データとを、音声データと共に記録保存する。 First, when there is an instruction to start recording, voice data newly input from the microphone 54 is recorded (step S12). Next, it is determined whether or not there is a sound change that satisfies the shooting parameter change condition during recording of the sound data (step S50). If there is no sound change that satisfies the shooting parameter change condition, it is determined whether or not there is a sound change that satisfies the still image shooting condition without changing the shooting parameter (step S14). The same processing as in the first embodiment is executed. Therefore, in this case, if there is a change in the sound that satisfies the still image shooting conditions, the still image is shot with the same shooting parameters, the shooting time (the elapsed time from the start of recording the audio data), the still image data, Are recorded and saved together with the audio data.

一方、撮影パラメータの変更条件を満たす音声変化があった場合には、撮影パラメータを、該音声変化に応じた撮影パラメータに変更し（ステップＳ５２）、静止画の撮影条件を満たす音声変化があったか否かを判断し（ステップＳ１４）、以下、前述した第１実施形態と同様の処理を実行する。撮影パラメータの変更条件を満たす音声変化があった場合、例えば、音声方向が２０度以上変化した場合には、画像取得部４１（カメラ部分）が音声方向にパーンすることになる。 On the other hand, if there is a sound change that satisfies the shooting parameter change condition, the shooting parameter is changed to a shooting parameter corresponding to the sound change (step S52), and whether or not there is a sound change that satisfies the still image shooting condition. Is determined (step S14), and the same processing as in the first embodiment described above is executed. When there is a change in sound that satisfies the shooting parameter change condition, for example, when the sound direction changes by 20 degrees or more, the image acquisition unit 41 (camera part) pans in the sound direction.

そして、この状態で、前述した第１実施形態で説明したように、静止画の撮影条件を満たす音声変化があった場合には、パーンした方向における静止画を撮影することになる。これにより、話者などの音声の発生源方向の静止画が撮影されることになる。静止画の撮影後、その撮影時刻（音声データの録音開始からの経過時間）と静止画データとを、音声データと共に記録保存する。 In this state, as described in the first embodiment, when there is a change in sound that satisfies the still image shooting condition, the still image in the panned direction is shot. As a result, a still image in the direction of the sound source of the speaker or the like is taken. After shooting a still image, the shooting time (elapsed time from the start of recording audio data) and the still image data are recorded and saved together with the audio data.

再生動作については、前述した第１実施形態と同様であるので説明を省略する。 Since the reproduction operation is the same as that of the first embodiment described above, description thereof is omitted.

なお、上述した第２実施形態では、パーン機能やズーム機能を有する監視カメラとしたが、これに限らず、パーン機能については、前述したように、画像取得部を機械的にパーンせずに、広角レンズで撮影した静止画データから音声方向に対応する部分をトリミングすることで実現したり、ズーム機能については、撮影した静止画データに対してデジタル処理によりズーム倍率の変更を実現するようにしてもよい。これにより、本第２実施形態を、機械的なパーン機能やズーム機能を備えていないデジタルカメラにも適用することが可能となる。また、パーン、ズーム以外にも、絞りやシャッター速度などの撮影パラメータを変更するようにしてもよい。 In the second embodiment described above, the surveillance camera has a panning function and a zoom function. However, the panning function is not limited to this, and the panning function is not mechanically panned as described above. It can be realized by trimming the part corresponding to the audio direction from still image data shot with a wide-angle lens, and the zoom function can be changed digitally for the shot still image data. Also good. As a result, the second embodiment can be applied to a digital camera that does not have a mechanical panning function or zoom function. In addition to panning and zooming, shooting parameters such as aperture and shutter speed may be changed.

上述した第２実施形態によれば撮影パラメータ（撮影方向、ズーム倍率など）の変更条件を満たす音声変化を検知すると、該音声変化に応じた撮影パラメータに変更し、静止画の撮影条件を満たす音声変化があると、その時点の静止画を撮影パラメータに従って撮影するようにしたので、静止画データを音声データの発生源に関連するものとすることができ、音声データの再生位置をより容易に把握することができる。 According to the second embodiment described above, when a change in sound that satisfies a change condition of the shooting parameters (shooting direction, zoom magnification, etc.) is detected, the sound is changed to a shooting parameter corresponding to the change in sound, and the sound satisfies the still image shooting condition. When there is a change, the still image at that time is shot according to the shooting parameters, so the still image data can be related to the source of the audio data, and the playback position of the audio data can be grasped more easily can do.

Ｃ．第３実施形態
次に、本発明の第３実施形態について説明する。本第３実施形態では、静止画の撮影条件を満たす音声変化があった場合、該音声変化が音声方向の変化であり、かつその方向で過去に撮影された静止画データが既に存在する場合には、静止画を新たに撮影するのでなく、その時点の音声データの録音開始からの経過時間を撮影時刻として（実際に撮影しない）、該撮影時刻と既存の静止画データとを、音声データと共に記録保存するようになっている。 C. Third Embodiment Next, a third embodiment of the present invention will be described. In the third embodiment, when there is a sound change that satisfies the still image shooting condition, the sound change is a change in the sound direction, and still image data shot in the past in that direction already exists. Rather than taking a new still image, the elapsed time from the start of recording of the audio data at that time is taken as the shooting time (not actually shot), and the shooting time and the existing still image data are combined with the audio data. The record is to be saved.

本第３実施形態は、前述した第１実施形態によるデジタルカメラ、あるいは第２実施形態による監視カメラのいずれにも適用することが可能であり、その構成は図１あるいは図６と同様であるので説明を省略する。以下では、デジタルカメラに適用した例について説明する。 The third embodiment can be applied to either the digital camera according to the first embodiment described above or the surveillance camera according to the second embodiment, and the configuration thereof is the same as in FIG. 1 or FIG. Description is omitted. Below, the example applied to the digital camera is demonstrated.

Ｃ−１．第３実施形態の動作
次に、本第３実施形態の動作について説明する。以下では、前述した第１または第２実施形態と異なる部分についてのみ説明し、特に説明しない部分については前述した第１または第２実施形態と同様の処理を実行する。ここで、図９は、本第３実施形態によるデジタルカメラ１の録音処理における一部動作を説明するためのフローチャートである。 C-1. Operation of Third Embodiment Next, the operation of the third embodiment will be described. In the following, only the parts different from the first or second embodiment described above will be described, and the same processes as those of the first or second embodiment described above will be executed for parts not specifically described. Here, FIG. 9 is a flowchart for explaining a partial operation in the recording process of the digital camera 1 according to the third embodiment.

まず、録音開始の指示があると、マイク５４から新たに入力される音声データの記録を開始し、該音声データの記録中に、静止画の撮影条件を満たす音声変化があったか否かを判断し（ステップＳ１４）、静止画の撮影条件を満たす音声変化がない場合には、図１に示すステップＳ２０へ進み、前述した第１実施形態と同様の処理を実行する。 First, when there is an instruction to start recording, recording of audio data newly input from the microphone 54 is started, and it is determined whether or not there is an audio change that satisfies the still image shooting condition during recording of the audio data. (Step S14) If there is no audio change that satisfies the still image shooting condition, the process proceeds to Step S20 shown in FIG. 1, and the same processing as in the first embodiment described above is executed.

一方、静止画の撮影条件を満たす音声変化があった場合には、該音声変化が音声方向の変化であるか否かを判断し（ステップＳ６２）、音声方向の変化でない場合には、撮像素子から新たに得られた画像データを静止画として記録し（ステップＳ１６）、静止画データと撮影時刻（音声データの録音開始からの経過時間）とを関連付けるマーキング情報を記憶する（ステップＳ１８）。その後、図１に示すステップＳ２０へ進み、前述した第１実施形態と同様の処理を実行する。 On the other hand, if there is a sound change that satisfies the still image shooting condition, it is determined whether or not the sound change is a change in the sound direction (step S62). The image data newly obtained from (1) is recorded as a still image (step S16), and marking information for associating the still image data with the photographing time (elapsed time from the start of recording of audio data) is stored (step S18). Thereafter, the process proceeds to step S20 shown in FIG. 1, and the same processing as in the first embodiment described above is executed.

さらに、静止画の撮影条件を満たす音声変化があり、かつ該音声変化が音声方向の変化であった場合には、該音声方向で過去に撮影された既存の静止画データがあるか否かを判断し（ステップＳ６４）、同じ音声方向の既存の静止画データがない場合には、前述したように、撮像素子から新たに得られた画像データを静止画として記録し（ステップＳ１６）、静止画データと撮影時刻（音声データの録音開始からの経過時間）とを関連付けるマーキング情報を記憶する（ステップＳ１８）。その後、図１に示すステップＳ２０へ進み、前述した第１実施形態と同様の処理を実行する。 Furthermore, if there is a sound change that satisfies the still image shooting condition and the sound change is a change in the sound direction, it is determined whether or not there is existing still image data that has been shot in the past in the sound direction. If there is no existing still image data in the same audio direction (step S64), as described above, the image data newly obtained from the image sensor is recorded as a still image (step S16). Marking information that associates the data with the photographing time (the elapsed time from the start of recording of the audio data) is stored (step S18). Thereafter, the process proceeds to step S20 shown in FIG. 1, and the same processing as in the first embodiment described above is executed.

一方、静止画の撮影条件を満たす音声変化があり、かつ該音声変化が音声方向の変化であり、さらに同じ音声方向の既存の静止画データがあった場合には、当該既存の静止画データを撮影したときの音声と今回の音声とを比較し、撮影条件を満たす音声変化があるか否かを判断する（ステップＳ６６）。そして、双方を比較した結果、撮影条件を満たす音声変化があった場合、すなわち過去の撮影時の音声変化と今回の音声変化とが大きく変わった場合には、前述したように、撮像素子から新たに得られた画像データを静止画として記録し（ステップＳ１６）、静止画データと撮影時刻（音声データの録音開始からの経過時間）とを関連付けるマーキング情報を記憶する（ステップＳ１８）。その後、図１に示すステップＳ２０へ進み、前述した第１実施形態と同様の処理を実行する。 On the other hand, if there is an audio change that satisfies the still image shooting condition, the audio change is a change in the audio direction, and there is existing still image data in the same audio direction, the existing still image data is The voice at the time of shooting is compared with the current voice, and it is determined whether there is a voice change that satisfies the shooting conditions (step S66). As a result of comparing the two, if there is a change in sound that satisfies the shooting condition, that is, if the change in sound in the past shooting and the change in sound in this time change significantly, as described above, a new image is acquired from the image sensor. The obtained image data is recorded as a still image (step S16), and marking information for associating the still image data with the photographing time (elapsed time from the start of recording of audio data) is stored (step S18). Thereafter, the process proceeds to step S20 shown in FIG. 1, and the same processing as in the first embodiment described above is executed.

一方、双方を比較した結果、撮影条件を満たす音声変化がなかった場合、すなわち過去の撮影時の音声変化と今回の音声変化とが大きく変わらない場合には、当該既存の静止画データを今回の撮影すべき静止画データに代用し（ステップＳ６８）、新たに画像データを撮影することなく、上記既存の静止画データと撮影時刻（音声データの録音開始からの経過時間）とを関連付けるマーキング情報を記憶する（ステップＳ１８）。その後、図１に示すステップＳ２０へ進み、前述した第１実施形態と同様の処理を実行する。 On the other hand, as a result of comparing both, if there is no audio change that satisfies the shooting conditions, that is, if the audio change during the past shooting and the current audio change do not change significantly, the existing still image data is In place of the still image data to be photographed (step S68), marking information for associating the existing still image data with the photographing time (elapsed time from the start of recording of audio data) without newly photographing image data. Store (step S18). Thereafter, the process proceeds to step S20 shown in FIG. 1, and the same processing as in the first embodiment described above is executed.

なお、再生動作については、前述した第１実施形態と同様であるので説明を省略する。 Since the reproduction operation is the same as that in the first embodiment described above, the description thereof is omitted.

上述した第３実施形態によれば、静止画の撮影条件を満たす音声変化があった場合、該音声変化が音声方向の変化であり、かつその方向で過去に撮影された静止画データが既に存在する場合には、静止画を新たに撮影するのでなく、その時点の音声データの録音開始からの経過時間を撮影時刻として（実際に撮影しない）、該撮影時刻と既存の静止画データとを、音声データと共に記録保存するようにしたので、記憶容量を節約することができる。 According to the third embodiment described above, when there is a sound change that satisfies the still image shooting condition, the sound change is a change in the sound direction, and still image data previously captured in that direction already exists. In this case, instead of newly taking a still image, the elapsed time from the start of recording of the audio data at that time is set as the shooting time (not actually shot), and the shooting time and the existing still image data are Since it is recorded and saved together with the audio data, the storage capacity can be saved.

なお、上述した第１ないし第３実施形態においては、音声データの記録中に音声変化を検知して音声データの録音経過時間と関連付けて静止画データを記録するようにした。これは、音声付動画撮影では、データ容量が大きくなるため、長時間に渡っての記録が困難になるという前提があったためである。しかしながら、近年、記録媒体の大容量化、動画データの高圧縮化技術が実現されつつあるため、かなりの長時間に渡っての動画記録が可能となってきている。 In the first to third embodiments described above, a change in sound is detected during recording of audio data, and still image data is recorded in association with the recording elapsed time of the audio data. This is because video recording with sound has a premise that it is difficult to record for a long time because the data capacity increases. However, in recent years, a recording medium having a large capacity and a high compression technology for moving image data are being realized, and therefore it is possible to record a moving image for a considerably long time.

そこで、上述した第１ないし第３実施形態は、音声付動画撮影中に音声変化を検知した場合、音声付動画の録画経過時間と関連付けて静止画データを記録するようにしてもよい。この場合、動画から音声変化検知時の１フレームを取り出して静止画データとしてもよい。あるいは、記録時には、音声変化検知時にマーキング情報（タイムスタンプ）のみを記録しておき、再生時に、マーキング情報に従って、記録した音声付動画から静止画データを生成し、インデックス表示領域に一覧表示するようにしてもよい。 Therefore, in the first to third embodiments described above, when a change in sound is detected during moving image recording with sound, still image data may be recorded in association with the elapsed recording time of the moving image with sound. In this case, one frame at the time of detecting a voice change may be extracted from the moving image and used as still image data. Alternatively, at the time of recording, only the marking information (time stamp) is recorded at the time of detecting a voice change, and at the time of reproduction, still image data is generated from the recorded moving image with sound according to the marking information and displayed in a list in the index display area It may be.

本発明の第１実施形態によるデジタルカメラの構成を示すブロック図である。It is a block diagram which shows the structure of the digital camera by 1st Embodiment of this invention. 本第１実施形態によるデジタルカメラ１の録音処理における動作を説明するためのフローチャートである。It is a flowchart for demonstrating the operation | movement in the recording process of the digital camera 1 by this 1st Embodiment. 本第１実施形態によるデジタルカメラ１の再生処理における動作を説明するためのフローチャートである。4 is a flowchart for explaining an operation in a reproduction process of the digital camera 1 according to the first embodiment. 静止画の撮影条件の一例を示す概念図である。It is a conceptual diagram which shows an example of the imaging condition of a still image. 記録した音声データ再生時の表示画面例を示す模式図である。It is a schematic diagram which shows the example of a display screen at the time of recorded audio | voice data reproduction | regeneration. 本第２実施形態による監視カメラ４０の構成を示すブロック図である。It is a block diagram which shows the structure of the monitoring camera 40 by this 2nd Embodiment. 本第２実施形態における撮影パラメータの変更条件、および撮影パラメータの変更内容の一例を示す概念図である。It is a conceptual diagram which shows an example of the change condition of the imaging parameter in this 2nd Embodiment, and the change content of an imaging parameter. 本第２実施形態によるデジタル監視カメラ４０の録音処理における一部動作を説明するためのフローチャートである。It is a flowchart for demonstrating a partial operation | movement in the recording process of the digital surveillance camera 40 by this 2nd Embodiment. 本第３実施形態によるデジタルカメラ１の録音処理における一部動作を説明するためのフローチャートである。It is a flowchart for demonstrating a partial operation | movement in the recording process of the digital camera 1 by this 3rd Embodiment.

Explanation of symbols

１デジタルカメラ（撮像装置）
１０画像取得部
１１レンズ
１２シャッター
１３ＬＰＦ
１４ドライバ
１５アナログ信号処理部
１６撮像センサ（ＣＣＤ，ＣＭＯＳ）
１７サンプリング／信号増幅処理部
１８Ａ／Ｄコンバータ
２０制御部
２２プレビューエンジン
２３Ｄ／Ａコンバータ
２４ドライバ
２５表示部
２６イメージバッファ
２７キー操作部
２８デジタル信号処理部
２９画像圧縮／伸張処理部
３０プログラムメモリ
３１画像メモリ
３２カードＩ／Ｆ
３３外部記録媒体
３４外部接続用Ｉ／Ｆ
３５電池
４０監視カメラ（撮像装置）
４１画像取得部
４２レンズ
４３ＬＰＦ
４５アナログ信号処理部
４６撮像センサ（ＣＣＤ，ＣＭＯＳ）
４７サンプリング／信号増幅処理部
４８Ａ／Ｄコンバータ
４９ドライバ
５０制御部
５１プログラムメモリ
５２イメージバッファ
５３デジタル信号処理部
５４マイク
５５音声処理部
５６ズーム駆動部
５７パーン駆動部
５８外部接続用Ｉ／Ｆ 1 Digital camera (imaging device)
10 Image Acquisition Unit 11 Lens 12 Shutter 13 LPF
14 Driver 15 Analog signal processor 16 Image sensor (CCD, CMOS)
17 Sampling / Signal Amplification Processing Unit 18 A / D Converter 20 Control Unit 22 Preview Engine 23 D / A Converter 24 Driver 25 Display Unit 26 Image Buffer 27 Key Operation Unit 28 Digital Signal Processing Unit 29 Image Compression / Expansion Processing Unit 30 Program Memory 31 Image memory 32 Card I / F
33 External recording medium 34 I / F for external connection
35 Battery 40 Surveillance camera (imaging device)
41 Image acquisition unit 42 Lens 43 LPF
45 Analog signal processor 46 Image sensor (CCD, CMOS)
47 Sampling / Signal Amplification Processing Unit 48 A / D Converter 49 Driver 50 Control Unit 51 Program Memory 52 Image Buffer 53 Digital Signal Processing Unit 54 Microphone 55 Audio Processing Unit 56 Zoom Drive Unit 57 Pan Drive Unit 58 External Connection I / F

Claims

Recording means for recording voice and recording it as voice data,
A voice change detecting means for detecting a voice change during recording of voice data by the recording means;
Photographing means for photographing a still image and capturing it as still image data each time a sound change that satisfies a predetermined still image photographing condition is detected by the sound change detecting means;
Shooting time acquisition means for acquiring a shooting time at which a still image was shot by the shooting means;
Recording means for recording the still image data captured by the photographing means and the photographing time acquired by the photographing time acquisition means in association with the sound data captured by the recording means ;
When the sound change satisfying a predetermined still image shooting condition by the sound change detection means is a direction change of the sound generation source, it is determined whether or not still image data shot in the past in the direction of the sound generation source exists. Discriminating means, and
If it is determined by the determining means that there is still image data captured in the past in the direction of the sound source, the still image data captured in the past is used instead of the still image data to be captured by the image capturing means. A recording apparatus characterized in that it is still image data corresponding to the sound change .

The shooting time acquisition means measures the elapsed recording time from the start of recording by the recording means, and acquires the elapsed recording time when the still image was shot by the shooting means as the shooting time of the still image. The recording apparatus according to claim 1, wherein:

When the sound change detecting means detects that the sound change satisfying the predetermined still image shooting condition is at least one of a volume change, a frequency band change, and a sound direction change, The recording apparatus according to claim 1, wherein an image is captured and captured as still image data.

When the sound change detecting means detects a change in sound that satisfies the change condition of the shooting parameter of the shooting means, the camera further includes a shooting parameter changing means for changing the shooting parameter of the shooting means based on the change in sound.
When the sound change detecting unit detects a sound change that satisfies a predetermined still image shooting condition, the shooting unit takes a still image based on the shooting parameter changed by the shooting parameter changing unit, and generates still image data. The recording apparatus according to claim 1, wherein the recording apparatus is incorporated as a recording apparatus.

The photographing parameter changing means changes the photographing direction of the photographing means to the sound generating source when the sound change detecting means detects that the direction of the sound generating source has changed by a predetermined angle or more as the photographing parameter changing condition. Change the shooting parameters so that they point in the direction,
5. The recording apparatus according to claim 4, further comprising photographing direction changing means for directing a photographing direction by the photographing means to a direction of a sound source based on the photographing parameter.

The photographing parameter changing means, when the sound change detecting means detects that the sound direction of a frequency band having a predetermined volume or more has changed by a predetermined angle or more as the photographing parameter changing condition. Change the shooting parameters to change the direction,
The imaging direction changing means for directing the imaging direction of the imaging means to a sound direction in a frequency band having a volume equal to or higher than a predetermined volume based on the imaging parameters changed by the imaging parameter changing means. 4. The recording apparatus according to 4.

The photographing unit is configured to display a still image when there is a sound change that satisfies a predetermined still image photographing condition between the audio data corresponding to the still image data photographed in the past and the audio data captured by the recording unit. recording apparatus according to claim 1, wherein the incorporation as a new still image data by photographing a.

When a change in sound is detected during recording of sound data, and a sound change that satisfies the predetermined still image shooting conditions is detected, a still image is shot and captured as still image data, and the shooting time at which the still image was shot is set. The acquired still image data and the acquired shooting time are recorded in association with the audio data being recorded, and an audio change that satisfies the predetermined still image shooting direction is a direction of the audio source. If it is determined that there is still image data captured in the past in the direction of the sound source, the still image data captured in the past is used as the audio instead of the still image data to be captured. A recording method characterized in that it is still image data corresponding to a change .

And procedures to incorporate as voice data to record the voice,
A step of detecting a voice change of the audio data,
Each time a sound change that satisfies a predetermined still image shooting condition is detected, a procedure for shooting a still image and importing it as still image data;
A procedure for obtaining a shooting time when the still image was shot;
A procedure for recording the captured still image data and the acquired shooting time in association with the captured audio data ;
A procedure for determining whether there is still image data captured in the past in the direction of the sound source when the sound change that satisfies the predetermined still image shooting condition is a direction change of the sound source;
If it is determined that there is still image data captured in the past in the direction of the sound source, the still image data captured in the past is used instead of the still image data to be captured. And the procedure
A recording program for causing a computer to execute.