JPH09252453A - Digital still video camera

Info

Publication number: JPH09252453A
Application number: JP8058140A
Authority: JP
Inventors: Kenji Shiraishi; 賢二白石
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1996-03-14
Filing date: 1996-03-14
Publication date: 1997-09-22

Abstract

PROBLEM TO BE SOLVED: To provide a camera that is convenient for browsing and organizing captured images and that is capable of electronic mail and facsimile communication. SOLUTION: Voice-to-electric conversion means 111-113 convert a voice input into an electric signal, and a voice recognition means 116 recognizes the voice converted into the electric signal and outputs a character code corresponding to each word. A code character conversion means 117 then converts the character codes into a character string. The character string, which is the recognition result of the voice input, is passed to a character synthesis means 121, which combines it with an image captured at the same time as the voice input or with an image already captured. A data display means 122 displays the image, the recognition result of the voice input, or the result of combining the image and the character string.

Description

Detailed Description of the Invention

[0001]

[Field of the Invention] The present invention relates to a digital still video camera that takes in still images, moving images, voice, and the like, converts them into digital data, and records them on a storage medium such as a memory card. More particularly, it relates to a digital still video camera in which a comment can be added to a captured image by voice input and which is convenient for browsing and organizing captured images, and to a digital still video camera capable of electronic mail communication and facsimile communication.

[0002]

[Description of the Related Art] In general, a digital still video camera converts a captured image into digital image data, compresses it, and then in most cases stores it on an attached memory card or the like. In recent years, larger memory card capacities and advances in compression technology have increased the number of images that can be recorded on a single card. Memory cards that can store about 500 images, for example, have appeared, and it is easy to predict that memory card capacity will continue to grow.

[0003]

[Problems to Be Solved by the Invention] However, in the conventional digital still video camera described above, which stores captured images on such a memory card, a user who wants to retrieve a desired image from the images recorded on the card must find it by checking the recorded images one by one.

[0004] This is not a problem when few images are recorded on the memory card, but as the number of recorded images grows, checking them becomes a heavy burden on the user. Moreover, as noted above, the number of images that can be recorded on a memory card is likely to keep increasing, so finding a desired image can be expected to become even more difficult.

[0005] The present invention has been made in view of the above problems of the prior art. Its object is to provide a digital still video camera in which a comment can be added to a captured image by voice input, in which a desired image can easily be found among the images recorded on a storage medium such as a memory card, and which is convenient for browsing and organizing images.

[0006] Another object of the present invention is to provide a digital still video camera that functions as a communication terminal and is capable of electronic mail communication and facsimile communication.

[0007]

[Means for Solving the Problems] To solve the above problems, the digital still video camera according to claim 1 of the present invention comprises: voice-to-electric conversion means for converting a voice input into an electric signal; voice recognition means for recognizing the voice converted into the electric signal and outputting a character code corresponding to each word; code character conversion means for converting the character codes into a character string; character synthesis means for combining the character string, which is the recognition result of the voice input, with an image captured at the same time as the voice input or with an image already captured; and data display means for displaying the image, the recognition result of the voice input, or the result of combining the image and the character string.

[0008] The digital still video camera according to claim 2 is the digital still video camera according to claim 1, further comprising kana-kanji conversion means for converting the recognition result of the voice input into the codes of a character string of mixed kana and kanji.

[0009] The digital still video camera according to claim 3 comprises: voice-to-electric conversion means for converting a voice input into an electric signal; voice recognition means for recognizing the voice converted into the electric signal and outputting a character code corresponding to each word; kana-kanji conversion means for converting the recognition result of the voice input into the codes of a character string of mixed kana and kanji; and storage means for storing the recognition result of the voice input as character code data.

[0010] The digital still video camera according to claim 4 comprises: voice-to-electric conversion means for converting a voice input into an electric signal; voice recognition means for recognizing the voice converted into the electric signal and outputting a character code corresponding to each word; and communication control means for adding the header information required for electronic mail communication and for controlling communication operations and the like. The camera transmits the recognition result of the voice input or the character codes by electronic mail.

[0011] The digital still video camera according to claim 5 is the digital still video camera according to claim 4, further comprising kana-kanji conversion means for converting the recognition result of the voice input into the codes of a character string of mixed kana and kanji.

[0012] The digital still video camera according to claim 6 is the digital still video camera according to claim 5, further comprising code character conversion means for converting the character codes into a character string, and data display means for displaying the recognition result of the voice input or the kana-kanji conversion result.

[0013] The digital still video camera according to claim 7 is the digital still video camera according to claim 6, further comprising correction means for correcting an error when the kana-kanji conversion result displayed on the data display means contains one.

[0014] The digital still video camera according to claim 8 comprises: voice-to-electric conversion means for converting a voice input into an electric signal; voice recognition means for recognizing the voice converted into the electric signal and outputting a character code corresponding to each word; image conversion means for converting the recognized character code data into image data for facsimile communication; and facsimile control means for adding the control codes required for facsimile communication and for controlling communication operations. The camera transmits the image data by facsimile.

[0015] Furthermore, the digital still video camera according to claim 9 is the digital still video camera according to claim 8, further comprising: kana-kanji conversion means for converting the recognition result of the voice input into the codes of a character string of mixed kana and kanji; data display means for displaying the recognition result of the voice input or the kana-kanji conversion result; and correction means for correcting an error when the kana-kanji conversion result displayed on the data display means contains one.

[0016]

[Embodiments of the Invention] An outline of the digital still video camera of the present invention, followed by embodiments of the camera in the order [Embodiment 1], [Embodiment 2], and [Embodiment 3], is described in detail below with reference to the drawings.

[0017] [Outline of the Digital Still Video Camera of the Present Invention] In the digital still video camera according to claim 1 of the present invention, as shown in FIG. 1, the voice-to-electric conversion means 111 to 113 convert a voice input into an electric signal; the voice recognition means 116 recognizes the voice converted into the electric signal and outputs a character code corresponding to each word; the code character conversion means 117 converts the character codes into a character string; the character synthesis means 121 combines the character string, which is the recognition result of the voice input, with an image captured at the same time as the voice input or with an image already captured; and the data display means 122 displays the image, the recognition result of the voice input, or the result of combining the image and the character string. As a result, the voice input can not only be reproduced as sound but can also be used to add a comment such as a title to a captured image, so that a desired image can easily be found among the images recorded on the storage medium. This yields a digital still video camera that is convenient for browsing and organizing images.
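The data flow of claim 1 can be sketched as follows. This is a minimal illustration in Python, not the implementation described in the patent: the function names, the toy recognizer (a lookup from pre-labelled sound units to character codes), and the representation of an image as a small data class are all assumptions made for the example.

```python
from dataclasses import dataclass, field

# Hypothetical stand-in for the stored voice patterns: maps a recognized
# sound unit to a character code (here, a Unicode code point).
VOICE_PATTERNS = {"sa": ord("さ"), "ku": ord("く"), "ra": ord("ら")}


@dataclass
class Image:
    pixels: bytes                                  # placeholder for image data
    comments: list = field(default_factory=list)   # attached text comments


def recognize(sound_units):
    """Voice recognition means (116): sound units -> character codes."""
    return [VOICE_PATTERNS[u] for u in sound_units if u in VOICE_PATTERNS]


def codes_to_string(codes):
    """Code character conversion means (117): character codes -> string."""
    return "".join(chr(c) for c in codes)


def compose(image, text):
    """Character synthesis means (121): attach the string to an image."""
    return Image(image.pixels, image.comments + [text])


def caption_image(image, sound_units):
    """Run the whole chain: recognize, convert to text, combine with image."""
    codes = recognize(sound_units)
    return compose(image, codes_to_string(codes))


captioned = caption_image(Image(b"\x00" * 16), ["sa", "ku", "ra"])
print(captioned.comments)  # ['さくら']
```

Keeping the comment as text next to the image, rather than as recorded audio, is what makes it usable for later searching and display.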

[0018] In the digital still video camera according to claim 2, as shown in FIG. 1, the kana-kanji conversion means 119 converts the recognition result of the voice input into the codes of a character string of mixed kana and kanji. As a result, a character string such as Japanese text with mixed kana and kanji can be used as the voice-input comment added to a captured image.

[0019] In the digital still video camera according to claim 3, as shown in FIG. 1, the voice-to-electric conversion means 111 to 113 convert a voice input into an electric signal; the voice recognition means 116 recognizes the voice converted into the electric signal and outputs a character code corresponding to each word; the kana-kanji conversion means 119 converts the recognition result of the voice input into the codes of a character string of mixed kana and kanji; and the recognition result of the voice input is stored in the storage means 150 as character code data. This makes it possible to create text data by voice input and to store voice-input comments, memos, and the like in coded form, which reduces the storage capacity needed to keep them.
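The storage saving claimed here can be made concrete with rough arithmetic. The figures below (8 kHz, 8-bit mono audio, a 5-second utterance, a 20-character comment, 2 bytes per character) are assumptions chosen for illustration and do not come from the patent.

```python
def pcm_bytes(seconds, sample_rate_hz=8000, bytes_per_sample=1):
    """Size of uncompressed mono PCM audio."""
    return seconds * sample_rate_hz * bytes_per_sample


def text_bytes(num_chars, bytes_per_char=2):
    """Size of a comment stored as fixed-width double-byte character codes."""
    return num_chars * bytes_per_char


audio = pcm_bytes(5)     # a 5-second spoken comment kept as audio
text = text_bytes(20)    # the same comment kept as 20 character codes
print(audio, text, audio // text)  # 40000 40 1000
```

Under these assumptions the coded comment is about three orders of magnitude smaller than the raw recording.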

[0020] In the digital still video camera according to claim 4, as shown in FIG. 3, the voice-to-electric conversion means 111 to 113 convert a voice input into an electric signal; the voice recognition means 116 recognizes the voice converted into the electric signal and outputs a character code corresponding to each word; and the communication control means 331 adds the header information required for electronic mail communication and controls communication operations and the like, so that the recognition result of the voice input or the character codes are transmitted by electronic mail. This yields a digital still video camera that functions as a communication terminal and is capable of electronic mail communication.
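As a present-day illustration of "adding the header information required for electronic mail", the sketch below wraps recognized text in a message using Python's standard `email.message.EmailMessage`. The patent does not specify a mail format; the addresses, subject, and body are hypothetical, and the example only builds the message without sending it.

```python
from email.message import EmailMessage


def build_mail(body, sender, recipient, subject):
    """Wrap recognized text in a message with the usual mail headers."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    msg.set_content(body)  # also sets Content-Type and transfer encoding
    return msg


mail = build_mail(
    "Photo 12: cherry blossoms at the park",
    sender="camera@example.com",       # hypothetical address
    recipient="friend@example.com",    # hypothetical address
    subject="Voice memo",
)
print(mail["Subject"])
print(mail.get_content_type())
```

The body is the text produced by voice recognition; everything else is header information supplied by the communication control side.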

[0021] In the digital still video camera according to claim 5, as shown in FIG. 3, the kana-kanji conversion means 119 converts the recognition result of the voice input into the codes of a character string of mixed kana and kanji. This enables electronic mail communication using character strings such as Japanese text with mixed kana and kanji.

[0022] In the digital still video camera according to claim 6, as shown in FIG. 3, the code character conversion means 117 converts the character codes into a character string, and the data display means 322 displays the recognition result of the voice input or the kana-kanji conversion result. As a result, a message can be composed while checking the kana-kanji converted data and sent as electronic mail, and received mail can also be displayed.

[0023] In the digital still video camera according to claim 7, as shown in FIG. 3, when the kana-kanji conversion result displayed on the data display means 322 contains an error, the correction means 321 corrects it. As a result, electronic mail communication can be carried out while checking the kana-kanji converted data and correcting errors.

[0024] In the digital still video camera according to claim 8, as shown in FIG. 4, the voice-to-electric conversion means 111 to 113 convert a voice input into an electric signal; the voice recognition means 116 recognizes the voice converted into the electric signal and outputs a character code corresponding to each word; the image conversion means 425 converts the recognized character code data into image data for facsimile communication; and the facsimile control means 331, which adds the control codes required for facsimile communication and controls communication operations, transmits the image data by facsimile. This yields a digital still video camera that functions as a communication terminal and is capable of facsimile communication.
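Converting character codes into image data amounts to rendering each character as a pattern of black and white pixels. The sketch below is a toy version of that step, assuming a hypothetical 3x5 bitmap font with only three glyphs. A real device would use a full glyph set and would then encode the bitmap for facsimile transmission, which is not shown.

```python
# Hypothetical 3x5 bitmap font; a real device would hold a full glyph ROM.
FONT = {
    "H": ["X.X", "X.X", "XXX", "X.X", "X.X"],
    "I": ["XXX", ".X.", ".X.", ".X.", "XXX"],
    " ": ["...", "...", "...", "...", "..."],
}
GLYPH_ROWS = 5


def render_line(text):
    """Convert a string into rows of 0/1 pixels (1 = black)."""
    rows = []
    for r in range(GLYPH_ROWS):
        bits = []
        for ch in text:
            glyph = FONT.get(ch, FONT[" "])   # unknown characters -> blank
            bits.extend(1 if p == "X" else 0 for p in glyph[r])
            bits.append(0)                    # one-pixel gap between glyphs
        rows.append(bits)
    return rows


for row in render_line("HI"):
    print("".join("#" if b else " " for b in row))
```

Each output row corresponds to one scan line of the page image that the facsimile side would transmit.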

[0025] Furthermore, in the digital still video camera according to claim 9, as shown in FIG. 4, the kana-kanji conversion means 119 converts the recognition result of the voice input into the codes of a character string of mixed kana and kanji; the recognition result of the voice input or the kana-kanji conversion result is displayed on the data display means 322; and when the displayed kana-kanji conversion result contains an error, the correction means 421 corrects it. As a result, facsimile communication using Japanese text with mixed kana and kanji can be carried out while confirming that the kana-kanji conversion result is free of errors and correcting any that are found.

[0026] [Embodiment 1] FIG. 1 is a block diagram of a digital still video camera according to Embodiment 1 of the present invention. The camera of this embodiment broadly consists of a digital still video camera body 100 and a PC card 150 that records captured images and text data.

[0027] In FIG. 1, the digital still video camera body 100 comprises a lens unit 101, a CCD 102, a CDS circuit 103, an A/D converter 104, a digital image processing unit 105, an image compression/decompression unit 106, a FIFO 107, a card interface circuit 108, a PC card interface circuit 109, a voice-driven Japanese text creation unit 110, a code character conversion unit 117, a CPU 121, a display device 122, and an operation unit 123. The voice-driven Japanese text creation unit 110 comprises a microphone 111, a filter 112, an A/D converter 113, a DRAM 114, a voice pattern memory 115, a voice recognition unit 116, a dictionary 118, and a kana-kanji conversion unit 119.

[0028] The lens unit 101 consists of a lens and a mechanical assembly that includes autofocus (AF), aperture, and filter sections; the mechanical shutter of this assembly exposes two fields simultaneously. The CCD (charge-coupled device) 102 converts the image input through the lens unit into an electric signal (analog image data). The CDS (correlated double sampling) circuit 103 reduces noise for the CCD image sensor. The A/D converter 104 converts the analog image data received from the CCD 102 via the CDS circuit 103 into digital image data. That is, the output signal of the CCD 102 passes through the CDS circuit 103 and is converted into a digital signal by the A/D converter 104 at an optimum sampling frequency (for example, an integer multiple of the NTSC subcarrier frequency).
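For reference, the NTSC color subcarrier frequency is exactly 315/88 MHz (about 3.579545 MHz). The short calculation below lists a few integer multiples of it. The patent does not say which multiple is used, so the values of `multiple` shown are only examples.

```python
from fractions import Fraction

# NTSC colour subcarrier: exactly 315/88 MHz (about 3.579545 MHz).
NTSC_FSC_HZ = Fraction(315_000_000, 88)


def sampling_frequency_hz(multiple):
    """Sampling clock as an integer multiple of the NTSC subcarrier."""
    if multiple < 1:
        raise ValueError("multiple must be a positive integer")
    return NTSC_FSC_HZ * multiple


for n in (2, 3, 4):
    print(n, f"{float(sampling_frequency_hz(n)) / 1e6:.3f} MHz")
```

Four times the subcarrier, roughly 14.318 MHz, is a commonly used sampling clock for NTSC video.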

[0029] The digital image processing unit 105 separates the digital image data received from the A/D converter 104 into color-difference and luminance components and applies various processing and corrections, as well as the data processing needed for image compression and decompression. The image compression/decompression unit 106 performs, for example, the orthogonal transform and the Huffman encoding and decoding that are stages of JPEG-compliant image compression and decompression.
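The orthogonal transform used by baseline JPEG is the discrete cosine transform (DCT), applied to 8x8 blocks as a 1-D transform along rows and then along columns. The following is a minimal, unoptimized sketch of the 1-D transform only. It illustrates the mathematics and is not the circuit of unit 106.

```python
import math


def dct_1d(samples):
    """Orthonormal 1-D DCT-II, the transform JPEG applies along rows and columns."""
    n = len(samples)
    out = []
    for k in range(n):
        scale = math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)
        total = sum(
            x * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
            for i, x in enumerate(samples)
        )
        out.append(scale * total)
    return out


# A flat row of pixels has all its energy in the first (DC) coefficient.
coeffs = dct_1d([100] * 8)
print([round(c, 3) for c in coeffs])
```

Because smooth image regions concentrate their energy in a few low-frequency coefficients, the remaining coefficients can be quantized coarsely and entropy-coded compactly, which is where Huffman coding comes in.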

[0030] Voice, meanwhile, is converted into an electric signal by a voice-to-electric-signal transducer such as the microphone 111, amplified by the filter 112, which also cuts off frequency components outside the required band, and then converted into a digital signal by the A/D converter 113 at a sampling frequency of at least twice the required band. The digitized voice data is then sent to the voice recognition unit 116 via the DRAM 114. The voice recognition unit 116 extracts voice features and performs recognition by matching them against the voice patterns in the voice pattern memory 115; the voice data is then coded and held in the DRAM 114.
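Two ideas in this paragraph lend themselves to a short sketch: the sampling rate of at least twice the required band, and recognition by matching extracted features against stored patterns. The feature vectors and the nearest-pattern rule below are assumptions for illustration; the patent does not describe the matching method in this detail.

```python
import math

# Hypothetical contents of the voice pattern memory (115): one reference
# feature vector per sound unit, keyed by the character code to emit.
PATTERNS = {
    ord("あ"): (0.9, 0.1, 0.2),
    ord("い"): (0.2, 0.8, 0.1),
    ord("う"): (0.1, 0.3, 0.9),
}


def min_sampling_rate_hz(band_hz):
    """Sampling theorem: sample at no less than twice the required band."""
    return 2 * band_hz


def recognize(features):
    """Return the character code whose stored pattern is nearest."""
    return min(PATTERNS, key=lambda code: math.dist(PATTERNS[code], features))


print(min_sampling_rate_hz(4000))          # 8000
print(chr(recognize((0.85, 0.15, 0.25))))  # あ
```

The output of `recognize` is a character code, which matches the paragraph's statement that the voice data is coded before being held in memory.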

[0031] The kana-kanji conversion unit 119 performs kana-kanji conversion on the recognition result held in the DRAM 114, based on the operator's instruction given in response to the recognition result shown on the display device 122. That is, when a conversion instruction is given, the recognition result sent from the DRAM 114 to the kana-kanji conversion unit 119 is converted into the codes of a character string of mixed kana and kanji by matching it against the dictionary 118. The conversion result is converted back into characters, held in the DRAM 114 as text data, and shown on the display device 122.
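Dictionary-based kana-kanji conversion can be sketched with a greedy longest-match lookup. The dictionary entries and the greedy strategy are assumptions for illustration; practical converters also use grammar and rank several candidates, which is why the operator confirmation and correction steps exist.

```python
# Hypothetical dictionary (118): kana reading -> kanji spelling.
DICTIONARY = {
    "きょう": "今日",
    "は": "は",
    "はれ": "晴れ",
    "さくら": "桜",
}


def kana_to_kanji(kana):
    """Greedy longest-match conversion; unknown kana pass through unchanged."""
    out = []
    i = 0
    max_len = max(len(k) for k in DICTIONARY)
    while i < len(kana):
        for n in range(min(max_len, len(kana) - i), 0, -1):
            word = DICTIONARY.get(kana[i:i + n])
            if word is not None:
                out.append(word)
                i += n
                break
        else:
            out.append(kana[i])   # no entry starts here: keep the kana
            i += 1
    return "".join(out)


print(kana_to_kanji("きょうははれ"))  # 今日は晴れ
```

The result is a mixed kana and kanji string, which is then displayed so the operator can accept or correct it.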

[0032] The FIFO 107 is implemented with, for example, DRAM or flash memory, and temporarily stores the compressed image and the voice-input text data. The compressed image data and voice-input text data held in the FIFO 107 are read out through the card interface circuit 108 and recorded on a storage medium such as the PC card 150 connected via the PC card interface circuit 109.
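The buffering role of the FIFO can be sketched with a queue that is filled by the capture side and drained to the card in arrival order. `CardWriter` and the record layout are hypothetical stand-ins, not the PC card interface itself.

```python
from collections import deque


class CardWriter:
    """Hypothetical stand-in for the PC card (150) behind its interface."""

    def __init__(self):
        self.records = []

    def write(self, record):
        self.records.append(record)


def flush(fifo, card):
    """Drain the FIFO to the card in arrival order; return the count."""
    written = 0
    while fifo:
        card.write(fifo.popleft())   # oldest entry leaves first
        written += 1
    return written


fifo = deque()
fifo.append(("image", b"\xff\xd8...compressed..."))
fifo.append(("text", "桜".encode("utf-8")))

card = CardWriter()
print(flush(fifo, card), [kind for kind, _ in card.records])
```

Decoupling capture from card writes in this way lets the slower storage medium be written at its own pace.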

[0033] The CPU 121 controls the operation of the units described above in accordance with instructions from the operation unit 123 or external operating instructions from, for example, a remote control (not shown). Power for the camera is supplied from a battery, such as a NiCd, nickel-metal hydride, or lithium battery, to a DC-DC converter (not shown) and from there to the inside of the digital still video camera.

[0034] The display device 122 is implemented with an LCD, LEDs, an EL panel, or the like. It displays captured digital image data and decompressed recorded image data, as well as the coded voice-input data in the DRAM 114 and the text data after kana-kanji conversion. The operation unit 123 has buttons for function selection, shooting instructions, and various other settings made from outside.

[0035] In the configuration shown in FIG. 1, the microphone 111, the filter 112, and the A/D converter 113 implement the voice-to-electric conversion means; the voice recognition unit 116 and the voice pattern memory 115 implement the voice recognition means; the code character conversion unit 117 implements the code character conversion means; the kana-kanji conversion unit 119 and the dictionary 118 implement the kana-kanji conversion means; the display device 122 implements the data display means; and the PC card 150 implements the storage means. The character synthesis means and the correction means are implemented by the CPU 121 executing a control program stored in a memory such as a ROM (not shown).

[0036] Next, the operation of generating text data by voice input in the digital still video camera of this embodiment is described with reference to the flowchart shown in FIG. 2.

[0037] First, in step S201, the input voice is converted into an electric signal by a voice-to-electric-signal transducer such as the microphone 111. In step S202, the filter 112 filters the signal; that is, the electric signal is amplified and frequency components outside the required band are cut off. In step S203, the filtered electric signal is converted into a digital signal by the A/D converter 113 at a sampling frequency of at least twice the required band.

[0038] Next, the digitized voice data is stored in the DRAM 114 in step S204 and then sent to the voice recognition unit 116 in step S205, where voice features are extracted and recognition is performed by matching against the voice patterns in the voice pattern memory 115. In step S206, the recognition result is stored in the DRAM 114 as coded voice data.

[0039] Next, in step S208, the coded voice data corresponding to each character of the recognition result is converted into characters by the code character conversion unit 117 and shown on the display device 122. If the displayed recognition result matches what was spoken (that is, if it is judged in step S209 that there is no error), the process proceeds to step S210, where the operator presses the conversion button (for example, the release button) on the operation unit 123 to apply kana-kanji conversion to the recognition result. In this conversion, the recognition result sent from the DRAM 114 to the kana-kanji conversion unit 119 is converted into the codes of a character string of mixed kana and kanji by matching it against the dictionary 118. Then, in step S213, the conversion result is converted back into characters, held in the DRAM 114 as text data, and shown on the display device 122.

[0040] In step S214, the operator checks what is shown on the display device 122. If the conversion is correct, the operator confirms it with the confirm button (for example, the strobe button) on the operation unit 123. If the kana-kanji conversion contains an error, the process proceeds to step S215. Using, for example, the zoom lever of the operation unit 123 as a selection lever and the recording mode button as a selection button, the operator moves the pointer with the selection lever to the start and to the end of the span to be converted and fixes each position with the selection button. The operator then reconverts with the release button and confirms with the confirm button.
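The check-and-correct cycle of steps S214 and S215 can be sketched as a function that applies the operator's span selections and replacements before the text is confirmed. The tuple format and the example sentence are assumptions; on the camera the span is chosen with the lever and buttons, and the replacement comes from reconversion.

```python
def reconvert(text, start, end, replacement):
    """Replace the selected span [start, end) with another candidate."""
    return text[:start] + replacement + text[end:]


def confirm_loop(converted, corrections):
    """Apply operator corrections in order, then return the confirmed text.

    `corrections` is a list of (start, end, replacement) tuples standing in
    for the operator's lever/button selections in step S215.
    """
    for start, end, replacement in corrections:
        converted = reconvert(converted, start, end, replacement)
    return converted          # operator presses the confirm button


# "きしゃ" was converted to 汽車 (train) but 記者 (reporter) was intended.
print(confirm_loop("汽車が来た", [(0, 2, "記者")]))  # 記者が来た
```

Homophones such as this pair are the typical reason a kana-kanji conversion result needs manual correction.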

[0041] The processing described above is the operation of generating text data by voice input in this embodiment. In step S216, the completed text data is saved to the PC card 150, the storage medium, either combined with image data as character code data or as character code data alone, or it is used as text data for communication such as electronic mail, as in the other embodiments described later.

[0042] First, the case in which only the text data entered by voice is saved on the PC card 150 will be described. In this case, the recording mode of the operation unit 123 is set to the "voice input text mode". When the text data generation processing described above has been performed and the text data is complete, the use of the text data is selected in step S216. For example, the operator uses the selection lever and the selection button to choose whether to save the text data alone or to combine it with an image. If "save text data only" is selected, the text data is sent from the DRAM 114 to another storage element, the FIFO 107. The text data recorded in the FIFO 107 is read out through the card interface circuit 108 and output to the PC card 150 via the PC card interface 109.
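
The text-only save path (DRAM 114, then FIFO 107, then the card interfaces, then PC card 150) can be sketched as a small buffered transfer. The Python sketch below is an assumption-laden model: the function name `save_text_only`, the chunk size, and the use of Shift_JIS as the character code are illustrative choices, since the text does not name the character code or the transfer unit.

```python
# Hypothetical sketch of the "save text data only" path.
# Shift_JIS and the 4-byte chunk size are assumptions for illustration.
from collections import deque


def save_text_only(text: str, chunk_size: int = 4) -> bytes:
    """Move character code data through a FIFO and return the bytes written to the card."""
    encoded = text.encode("shift_jis")  # text data held in DRAM as character codes

    fifo = deque()  # stands in for FIFO 107
    for i in range(0, len(encoded), chunk_size):
        fifo.append(encoded[i:i + chunk_size])

    card = bytearray()  # stands in for PC card 150
    while fifo:
        # The card interface reads chunks in first-in, first-out order.
        card.extend(fifo.popleft())
    return bytes(card)
```

Under these assumptions a four-character comment such as "写真メモ" occupies 8 bytes on the card, which illustrates why storing character codes takes little storage capacity.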

[0043] Next, the case in which text data entered by voice is combined as a comment with image data that has already been captured, and the result is saved on the PC card 150, will be described. In this case, the recording mode of the operation unit 123 is set to the "voice input text mode". When the text data generation processing described above has been performed and the text data is complete, the use of the text data is selected in step S216. Here, the storage mode is set to the "image composition mode" with the storage mode button of the operation unit 123.

[0044] Next, the image data with which the text comment is to be combined is selected using the operation unit 123. The image data is sent from the PC card 150 through the card interface circuit 108 to the image compression/decompression unit 106 and decompressed. The decompressed image data is sent to the digital image processing unit 105 and displayed on the display device 122. When the image to be combined has been selected with the selection lever and the selection button, it is confirmed with the confirm button. Once the image is confirmed, composition with the text data begins.

[0045] The text data in the DRAM 114 is sent via the code character conversion unit 117 to the digital image processing unit 105, combined with the image data, and displayed on the display device 122. The commented image data composed in this way is compressed again by the image compression/decompression unit 106 and sent to the FIFO 107. The commented image data recorded in the FIFO 107 is read out through the card interface circuit 108 and output to the PC card 150 via the PC card interface circuit 109.
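
The composition of a text comment with image data can be pictured as drawing character bitmaps onto the decompressed image before it is compressed again. The Python sketch below is purely illustrative: the 3x5 glyph table, the grayscale list-of-rows image model, and the function names `render_text` and `composite` are assumptions, and the compression step is left out.

```python
# Hypothetical sketch: overlay a text comment onto a grayscale image.
# The tiny 3x5 glyphs stand in for the output of a character generator.

GLYPHS = {
    "H": ["101", "101", "111", "101", "101"],
    "I": ["111", "010", "010", "010", "111"],
}


def render_text(text):
    """Return a 5-row bitmap (rows of 0/1) for text, with one blank column between glyphs."""
    rows = [[] for _ in range(5)]
    for n, ch in enumerate(text):
        glyph = GLYPHS[ch]
        for r in range(5):
            if n:
                rows[r].append(0)  # spacing column between glyphs
            rows[r].extend(int(bit) for bit in glyph[r])
    return rows


def composite(image, text, top, left, ink=255):
    """Return a copy of image with text drawn at (top, left); the input is not modified."""
    out = [row[:] for row in image]
    for r, bits in enumerate(render_text(text)):
        for c, bit in enumerate(bits):
            if bit:
                out[top + r][left + c] = ink
    return out
```

The sketch assumes the text fits inside the image; a real implementation would clip or wrap the comment.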

[0046] Further, the case in which text data produced by voice input is combined as a comment with image data captured at the same time as the voice input, and the result is saved on the PC card 150, will be described. In this case, an image is captured first. Digital image processing by the digital image processing unit 105 and image compression by the image compression/decompression unit 106 are performed, and the image data is written once to the PC card 150.

[0047] Next, the voice data that is to become the comment is entered by voice input. This voice data is converted into text data by the text data generation processing described above and recorded in the DRAM 114. When the text data is complete, the use of the text data is selected.

[0048] When the storage mode is set to the "image composition mode" with the storage mode button of the operation unit 123, the captured image data is again sent from the PC card 150 through the card interface circuit 108 to the image compression/decompression unit 106 and decompressed. The decompressed image data is sent to the digital image processing unit 105. Meanwhile, the text data in the DRAM 114 is sent via the code character conversion unit 117 to the digital image processing unit 105, combined with the image data, and displayed on the display device 122 as commented image data.

[0049] The commented image data composed in this way is compressed again by the image compression/decompression unit 106 and sent to the FIFO 107. The commented image data recorded in the FIFO 107 is read out through the card interface circuit 108 and output to the PC card 150 via the PC card interface circuit 109.

[0050] [Second Embodiment] FIG. 3 is a configuration diagram of a digital still video camera according to a second embodiment of the present invention. The digital still video camera of this embodiment sends text data generated from voice input as e-mail, and also receives e-mail.

[0051] The main body 300 of the digital still video camera of this embodiment comprises a voice-based Japanese text creation unit 310, the code character conversion unit 117, a CPU 321, a display device 322, the operation unit 123, and a communication control unit 331. The voice-based Japanese text creation unit 310 comprises the microphone 111, the filter 112, the A/D converter 113, a DRAM 314, the voice pattern memory 115, the voice recognition unit 116, the dictionary 118, and the kana-kanji conversion unit 119.

[0052] As in the first embodiment, the camera also includes the lens unit 101, the CCD 102, the CDS circuit 103, the A/D converter 104, the digital image processing unit 105, the image compression/decompression unit 106, the FIFO 107, the card interface circuit 108, the PC card interface circuit 109, and the PC card 150. Because the feature of this embodiment lies in the function of sending and receiving text data by e-mail, these components are omitted from the figure.

[0053] The communication control unit 331, which is newly added to the configuration of the first embodiment, connects the digital still camera body 300 to a communication device such as a modem 330 and connects the digital still camera to the Internet by dial-up connection.

[0054] In the configuration shown in FIG. 3, the microphone 111, the filter 112, and the A/D converter 113 implement the voice-electric conversion means; the voice recognition unit 116 and the voice pattern memory 115 implement the voice recognition means; the code character conversion unit 117 implements the code character conversion means; the kana-kanji conversion unit 119 and the dictionary 118 implement the kana-kanji conversion means; the display device 322 implements the data display means; and a PC card (not shown) implements the storage means. The character synthesis means and the correction means are implemented by the CPU 321, which executes a control program stored in a memory such as a ROM (not shown). Further, the communication control means is implemented by the communication control unit 331.

[0055] Next, the operation of sending and receiving text data by e-mail in the digital still video camera of this embodiment will be described. The text data is generated in the same manner as in the first embodiment.

[0056] First, the case of sending text data generated from voice input as e-mail will be described. The digital still video camera body 300 is connected to a communication device such as the modem 330, and the camera is connected to the Internet by dial-up connection. After the connection is established, the mail text data created by voice input and stored in the DRAM 314 is converted into character codes by the code character conversion unit 117. The communication control unit 331 then adds the data necessary for e-mail communication and sends the e-mail in accordance with the TCP/IP protocol.
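
The step of adding the data necessary for e-mail communication to the text can be illustrated with a message builder. The Python sketch below uses the standard library `email.message.EmailMessage` class; the function name `build_mail`, the example addresses, and the choice of the ISO-2022-JP character set are assumptions for illustration, and the actual transmission over the dial-up connection is not modeled.

```python
# Hypothetical sketch: wrap voice-generated text in an e-mail message with headers.
# The addresses and the ISO-2022-JP charset are illustrative assumptions.
from email.message import EmailMessage


def build_mail(body: str, sender: str, recipient: str, subject: str) -> EmailMessage:
    """Return a message whose headers correspond to the data added before sending."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    # Encode the Japanese body as character codes in the assumed charset.
    msg.set_content(body, charset="iso-2022-jp")
    return msg


mail = build_mail(
    "今日の写真を送ります。",
    "camera@example.com",
    "friend@example.com",
    "Photo memo",
)
```

Calling `mail.as_string()` would give the header block followed by the encoded body, which is roughly the form in which a mail transfer program would hand the message to the network.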

[0057] E-mail is received as follows. First, the digital still video camera is connected to the Internet by dial-up connection. After the connection is established, if mail has arrived for the address name used when connecting to the Internet, the mail is received in accordance with the TCP/IP protocol. The received data is first stored in the DRAM 314, converted into text data by the code character conversion unit 117, and displayed on the display device 322. The display screen is scrolled with, for example, the zoom lever on the operation unit 123 of the digital still video camera.
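
Scrolling a received mail on a small display can be sketched as showing a window of wrapped lines that moves with the lever. In this Python sketch the function name `visible_lines`, the fixed character grid, and the clamping of the scroll offset are assumptions; the text only states that the zoom lever or a similar control scrolls the screen.

```python
# Hypothetical sketch: wrap received text to the display width and return the visible window.
# The character-grid display model and offset clamping are illustrative assumptions.

def visible_lines(text: str, columns: int, rows: int, offset: int) -> list[str]:
    """Return the lines shown on a display of rows x columns at the given scroll offset."""
    lines = []
    for paragraph in text.splitlines() or [""]:
        if not paragraph:
            lines.append("")  # keep blank lines of the mail
            continue
        for i in range(0, len(paragraph), columns):
            lines.append(paragraph[i:i + columns])
    # Clamp the offset so the window never runs past either end of the text.
    max_offset = max(0, len(lines) - rows)
    offset = min(max(offset, 0), max_offset)
    return lines[offset:offset + rows]
```

Each movement of the lever would then simply increase or decrease `offset` by one before the display is redrawn.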

[0058] [Third Embodiment] FIG. 4 is a configuration diagram of a digital still video camera according to a third embodiment of the present invention. The digital still video camera of this embodiment sends text data generated from voice input by facsimile.

[0059] The main body 400 of the digital still video camera of this embodiment comprises a voice-based Japanese text creation unit 410, the code character conversion unit 117, a CPU 421, the display device 322, the operation unit 123, the communication control unit 331, an image generation unit 425, and an image memory 426. The voice-based Japanese text creation unit 410 comprises the microphone 111, the filter 112, the A/D converter 113, a DRAM 414, the voice pattern memory 115, the voice recognition unit 116, the dictionary 118, and the kana-kanji conversion unit 119.

[0060] As in the first embodiment, the camera also includes the lens unit 101, the CCD 102, the CDS circuit 103, the A/D converter 104, the digital image processing unit 105, the image compression/decompression unit 106, the FIFO 107, the card interface circuit 108, the PC card interface circuit 109, and the PC card 150. Because the feature of this embodiment lies in the function of sending text data by facsimile, these components are omitted from the figure.

[0061] The image generation unit 425, which is newly added to the configuration of the second embodiment, reads a layout format from the image memory 426 to determine the arrangement of the text data, reads the image data of each character from the image memory 426, and generates image data for facsimile communication.

[0062] In the configuration shown in FIG. 4, the microphone 111, the filter 112, and the A/D converter 113 implement the voice-electric conversion means; the voice recognition unit 116 and the voice pattern memory 115 implement the voice recognition means; the code character conversion unit 117 implements the code character conversion means; the kana-kanji conversion unit 119 and the dictionary 118 implement the kana-kanji conversion means; the display device 322 implements the data display means; and a PC card (not shown) implements the storage means. The character synthesis means and the correction means are implemented by the CPU 421, which executes a control program stored in a memory such as a ROM (not shown). Further, the facsimile control means is implemented by the communication control unit 331, and the image conversion means is implemented by the image generation unit 425 and the image memory 426.

[0063] Next, the operation of sending text data by facsimile in the digital still video camera of this embodiment will be described. The text data is generated in the same manner as in the first embodiment.

[0064] First, the text data created by voice input and stored in the DRAM 414 is sent to the image generation unit 425. The image generation unit 425 reads the layout format from the image memory 426, which stores image data, and determines the arrangement of the text data. Next, it reads the image data of each character from the image memory 426 and converts the text into image data, which is sent to the DRAM 414 and then from the DRAM 414 to the communication control unit 331. The communication control unit 331 converts the generated image data into a facsimile signal and sends it out through the modem 330.
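
Determining the arrangement of the text data from a layout format can be sketched as computing a page position for each character cell. In the Python sketch below, the `LayoutFormat` fields (margins, cell size, characters per line) and the function name `place_characters` are assumptions; the text does not specify what the layout format stored in the image memory 426 contains.

```python
# Hypothetical sketch: place each character of the text on a fax page grid.
# The fields of LayoutFormat are illustrative assumptions.
from typing import NamedTuple


class LayoutFormat(NamedTuple):
    left: int      # left margin in pixels
    top: int       # top margin in pixels
    cell_w: int    # width of one character cell in pixels
    cell_h: int    # height of one character cell in pixels
    columns: int   # characters per line


def place_characters(text: str, layout: LayoutFormat) -> list[tuple[str, int, int]]:
    """Return (character, x, y) for each character, filling lines left to right."""
    placed = []
    for index, ch in enumerate(text):
        row, col = divmod(index, layout.columns)
        x = layout.left + col * layout.cell_w
        y = layout.top + row * layout.cell_h
        placed.append((ch, x, y))
    return placed
```

With the positions known, the image data of each character would be copied into the page image at its (x, y) position before the page is converted into a facsimile signal.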

[0065]

[Effects of the Invention] As described above, in the digital still video camera according to claim 1 of the present invention, the voice-electric conversion means converts a voice input into an electric signal; the voice recognition means recognizes the voice converted into the electric signal and outputs a character code corresponding to each word; the code character conversion means converts the character code into a character string; the character synthesis means combines the character string, which is the recognition result of the voice input, with an image captured at the same time as the voice input or with an image that has already been captured; and the data display means displays the image, the recognition result of the voice input, or the result of combining the image and the character string. A comment such as a title can therefore be added to a captured image by voice input, and a desired image can easily be found among the images recorded on the storage medium. This provides a digital still video camera that is convenient for viewing and organizing images.

[0066] In the digital still video camera according to claim 2, the kana-kanji conversion means converts the recognition result of the voice input into a code of a character string containing kana and kanji. A character string such as Japanese text with mixed kana and kanji can therefore be used as the voice-input comment added to a captured image.

[0067] In the digital still video camera according to claim 3, the voice-electric conversion means converts a voice input into an electric signal; the voice recognition means recognizes the voice converted into the electric signal and outputs a character code corresponding to each word; the kana-kanji conversion means converts the recognition result of the voice input into a code of a character string containing kana and kanji; and the recognition result of the voice input is saved in the storage means as character code data. Text data can therefore be created by voice input, and comments, memos, and the like entered by voice can be saved in coded form, which reduces the storage capacity used for saving them.

[0068] In the digital still video camera according to claim 4, the voice-electric conversion means converts a voice input into an electric signal; the voice recognition means recognizes the voice converted into the electric signal and outputs a character code corresponding to each word; and the communication control means controls the addition of the header information necessary for e-mail communication, the communication operation, and the like, so that the recognition result of the voice input or the character code is communicated by e-mail. This provides a digital still video camera that functions as a communication terminal and is capable of e-mail communication.

[0069] In the digital still video camera according to claim 5, the kana-kanji conversion means converts the recognition result of the voice input into a code of a character string containing kana and kanji. E-mail communication using character strings such as Japanese text with mixed kana and kanji is therefore possible.

[0070] In the digital still video camera according to claim 6, the code character conversion means converts the character code into a character string, and the data display means displays the recognition result of the voice input or the kana-kanji conversion result. The operator can therefore compose a mail while checking the kana-kanji converted data and send it as e-mail, and received mail can also be displayed.

[0071] In the digital still video camera according to claim 7, when the kana-kanji conversion result displayed on the data display means contains an error, the correction means corrects the error. The operator can therefore communicate by e-mail while checking the kana-kanji converted data and correcting any errors.

[0072] In the digital still video camera according to claim 8, the voice-electric conversion means converts a voice input into an electric signal; the voice recognition means recognizes the voice converted into the electric signal and outputs a character code corresponding to each word; the image conversion means converts the recognized character code data into image data for facsimile communication; and the facsimile control means, which controls the addition of the control codes necessary for facsimile communication and the communication operation, communicates the image data by facsimile. This provides a digital still video camera that functions as a communication terminal and is capable of facsimile communication.

[0073] Further, in the digital still video camera according to claim 9, the kana-kanji conversion means converts the recognition result of the voice input into a code of a character string containing kana and kanji; the recognition result of the voice input or the kana-kanji conversion result is displayed on the data display means; and when the displayed kana-kanji conversion result contains an error, the correction means corrects the error. Facsimile communication using Japanese text with mixed kana and kanji can therefore be performed while the operator confirms that the kana-kanji conversion result is free of errors and corrects any errors that are found.

[Brief description of drawings]

FIG. 1 is a configuration diagram of a digital still video camera according to a first embodiment of the present invention.

FIG. 2 is a flowchart illustrating the operation of text data generation processing by voice input in the digital still video camera of the embodiment.

FIG. 3 is a configuration diagram of a digital still video camera according to a second embodiment of the present invention.

FIG. 4 is a configuration diagram of a digital still video camera according to a third embodiment of the present invention.

[Explanation of symbols]

100, 300, 400: Digital still video camera body
101: Lens unit
102: CCD (charge-coupled device)
103: CDS (correlated double sampling) circuit
104: A/D converter
105: Digital image processing unit
106: Image compression/decompression unit
107: FIFO
108: Card interface circuit
109: PC card interface circuit
110, 310, 410: Voice-based Japanese text creation unit
111: Microphone
112: Filter
113: A/D converter
114, 314, 414: DRAM
115: Voice pattern memory
116: Voice recognition unit
117: Code character conversion unit
118: Dictionary
119: Kana-kanji conversion unit
121, 321, 421: CPU (character synthesis means, correction means)
122, 322: Display device (data display means)
123: Operation unit
150: PC card
330: Modem
331: Communication control unit (communication control means, facsimile control means)
425: Image generation unit (image conversion means)
426: Image memory

Claims

[Claims]

1. A digital still video camera comprising: voice-electric conversion means for converting a voice input into an electric signal; voice recognition means for recognizing the voice converted into the electric signal and outputting a character code corresponding to each word; code character conversion means for converting the character code into a character string; character synthesis means for combining the character string, which is the recognition result of the voice input, with an image captured at the same time as the voice input or with an image that has already been captured; and data display means for displaying the image, the recognition result of the voice input, or the result of combining the image and the character string.

2. The digital still video camera according to claim 1, further comprising kana-kanji conversion means for converting the recognition result of the voice input into a code of a character string containing kana and kanji.

3. A digital still video camera comprising: voice-electric conversion means for converting a voice input into an electric signal; voice recognition means for recognizing the voice converted into the electric signal and outputting a character code corresponding to each word; kana-kanji conversion means for converting the recognition result of the voice input into a code of a character string containing kana and kanji; and storage means for storing the recognition result of the voice input as character code data.

4. A digital still video camera comprising: voice-electric conversion means for converting a voice input into an electric signal; voice recognition means for recognizing the voice converted into the electric signal and outputting a character code corresponding to each word; and communication control means for controlling the addition of header information necessary for electronic mail communication, the communication operation, and the like, wherein the recognition result of the voice input or the character code is communicated by electronic mail.

5. The digital still video camera according to claim 4, further comprising kana-kanji conversion means for converting the recognition result of the voice input into a code of a character string containing kana and kanji.

6. The digital still video camera according to claim 5, further comprising: code character conversion means for converting the character code into a character string; and data display means for displaying the recognition result of the voice input or the kana-kanji conversion result.

7. The digital still video camera, wherein the digital still video camera comprises correction means for correcting an error in the kana-kanji conversion result displayed on the data display means.

8. A digital still video camera comprising: voice-electric conversion means for converting a voice input into an electric signal; voice recognition means for recognizing the voice converted into the electric signal and outputting a character code corresponding to each word; image conversion means for converting the recognized character code data into image data for facsimile communication; and facsimile control means for controlling the addition of control codes necessary for facsimile communication and the communication operation, wherein the image data is communicated by facsimile.

9. The digital still video camera according to claim 8, further comprising: kana-kanji conversion means for converting the recognition result of the voice input into a code of a character string containing kana and kanji; data display means for displaying the recognition result of the voice input or the kana-kanji conversion result; and correction means for correcting an error when the kana-kanji conversion result displayed on the data display means contains the error.