JP2001060875A - Embedding device, digital camera and recording medium

Info

Publication number: JP2001060875A
Application number: JP23389799A
Authority: JP
Inventors: Hiromatsu Aoki; 博松青木; Masao Hiramoto; 政夫平本
Original assignee: Matsushita Information Systems Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1999-08-20
Filing date: 1999-08-20
Publication date: 2001-03-06
Anticipated expiration: 2019-08-20
Also published as: JP3398343B2

Abstract

PROBLEM TO BE SOLVED: To provide an embedding device that causes little degradation of image quality and can embed as much audio data as possible. SOLUTION: An entropy decoding unit 105 entropy-decodes a JPEG image and outputs quantized DCT (discrete cosine transform) coefficient blocks. A selection unit 106 selects 16 AC coefficients from among the low-frequency AC coefficients in each quantized DCT coefficient block and the AC coefficients whose absolute value is not smaller than a threshold, i.e., AC coefficients whose modification has little effect on image quality. An embedding processing unit 107 replaces the least significant bits of the selected AC coefficients with 16 bits of partial compressed audio data input from a compressed audio input unit 104.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001] [Technical Field of the Invention] The present invention relates to an embedding device that embeds audio data in an image, and to a digital camera provided with an embedding device that embeds compressed audio data in a compressed image.

[0002] [Prior Art] There has long been demand for attaching audio data to images taken with digital cameras and the like. One response to this demand is FlashPix, a file format that handles an image and audio data as a single file. With FlashPix, however, the total amount of data grows with the amount of audio data added, and a larger amount of data makes storage and transmission less efficient.

[0003] An embedding device can therefore be considered as a way of avoiding any increase in the total amount of data. Because an embedding device embeds by replacing part of the image with other information, the data size of the image after embedding is the same as before embedding.

[0004] [Problem to Be Solved by the Invention] Conventional embedding devices, however, are designed to embed small amounts of data such as signatures or marks, and do not contemplate embedding large amounts of data such as audio data. In view of the above, an object of the present invention is to provide an embedding device that causes little degradation of image quality and can embed as much data as possible.

[0005] A further object of the present invention is to provide a digital camera that can embed several seconds of audio data in a captured image with little degradation of the image.

[0006] [Means for Solving the Problem] To solve the above problem, the embedding device of the present invention comprises: selection means for selecting a predetermined number of AC coefficients, from among the low-frequency AC coefficients and the AC coefficients whose absolute value is equal to or greater than a first threshold, in a quantized DCT coefficient block generated by applying a discrete cosine transform (DCT) and quantization to an image; and replacement means for replacing the least significant bits of the predetermined number of AC coefficients selected by the selection means with audio data.

[0007] The selection means comprises a first selection unit that selects AC coefficients whose absolute value is equal to or greater than the first threshold, and a second selection unit that, when the number of AC coefficients selected by the first selection unit is less than the predetermined number, selects AC coefficients below the first threshold in order from the lowest frequency until the predetermined number is reached. The selection means further comprises a third selection unit that selects AC coefficients whose absolute value is equal to or greater than a second threshold larger than the first threshold, and the replacement means is further configured to embed audio data in the second least significant bit of the AC coefficients selected by the third selection unit.
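The two-stage selection described in this paragraph can be sketched as follows. This is a minimal illustration, not the patented implementation. The names `zigzag_order` and `select_coefficients` are hypothetical, the JPEG zigzag scan is assumed as the "lowest frequency first" ordering, and keeping the lowest-frequency candidates when more than the predetermined number reach the threshold is also an assumption, since the text does not state a rule for that case. The third selection unit (second threshold) is left out.

```python
def zigzag_order(n=8):
    """Return (row, col) index pairs of an n x n block in JPEG zigzag order."""
    order = []
    for s in range(2 * n - 1):           # s = row + col indexes each anti-diagonal
        diag = [(u, s - u) for u in range(n) if 0 <= s - u < n]
        # alternate the traversal direction on successive diagonals
        order.extend(diag if s % 2 else reversed(diag))
    return order


def select_coefficients(block, threshold=2, count=16):
    """Pick `count` AC positions from an 8x8 quantized DCT coefficient block."""
    ac_positions = zigzag_order()[1:]    # skip the DC coefficient at (0, 0)
    # First selector: AC coefficients with |value| >= threshold.
    strong = [p for p in ac_positions if abs(block[p[0]][p[1]]) >= threshold]
    selected = strong[:count]            # assumption: keep the lowest-frequency ones
    # Second selector: pad with sub-threshold coefficients, low frequency first.
    if len(selected) < count:
        weak = [p for p in ac_positions if abs(block[p[0]][p[1]]) < threshold]
        selected += weak[:count - len(selected)]
    return selected
```

With a block that has only two coefficients at or above the threshold, the remaining fourteen positions are filled from the start of the zigzag scan.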

[0008] The embedding method of the present invention comprises: a selecting step of selecting a predetermined number of AC coefficients, from among the low-frequency AC coefficients and the AC coefficients whose absolute value is equal to or greater than a first threshold, in a quantized DCT coefficient block generated by applying a discrete cosine transform (DCT) and quantization to an image; and a replacing step of replacing the least significant bits of the predetermined number of AC coefficients selected in the selecting step with audio data.

[0009] The selecting step comprises a first selecting step of selecting AC coefficients whose absolute value is equal to or greater than the first threshold, and a second selecting step of, when the number of AC coefficients selected in the first selecting step is less than the predetermined number, selecting AC coefficients below the first threshold in order from the lowest frequency until the predetermined number is reached. The digital camera of the present invention is a digital camera that embeds, in a compressed image, compressed audio data corresponding to several seconds of audio data, and comprises: acquisition means for obtaining, from the compressed image, quantized DCT coefficient blocks to which a discrete cosine transform and quantization have been applied; division means for dividing the compressed audio data into partial compressed audio data of a predetermined number of bits; selection means for selecting the predetermined number of AC coefficients, from among the low-frequency AC coefficients and the AC coefficients whose absolute value is equal to or greater than a first threshold, in each obtained quantized DCT coefficient block; and replacement means for replacing the least significant bits of the predetermined number of AC coefficients selected by the selection means with the partial compressed audio data.

[0010] The recording medium of the present invention is a computer-readable recording medium on which is recorded a program for causing a computer to execute processing for embedding audio data in an image. The program causes the computer to execute: a selecting step of selecting a predetermined number of AC coefficients, from among the low-frequency AC coefficients and the AC coefficients whose absolute value is equal to or greater than a first threshold, in a quantized DCT coefficient block generated by applying a discrete cosine transform (DCT) and quantization to an image; and a replacing step of replacing the least significant bits of the predetermined number of AC coefficients selected in the selecting step with audio data.

[0011] The selecting step consists of a first selecting step of selecting AC coefficients whose absolute value is equal to or greater than the first threshold, and a second selecting step of, when the number of AC coefficients selected in the first selecting step is less than the predetermined number, selecting AC coefficients below the first threshold in order from the lowest frequency until the predetermined number is reached.

[0012] [Embodiments of the Invention] (Embodiment 1) A digital camera 1, which is one embodiment of the embedding device of the present invention, is described below with reference to the drawings. In this embodiment the embedding device is provided inside the digital camera 1 as an embedding unit.

(External Configuration of Digital Camera 1) FIGS. 1 and 2 are external views of the front and back of the digital camera 1.

[0013] As shown in these figures, the digital camera 1 has, on its front, a speaker 11 for reproducing audio data, a microphone 12 for recording audio data, and a lens 18; on its back, an image display unit 13, playback buttons 21a and 21b for instructing reproduction of images and audio data, a record button 22 for instructing recording of audio data, and a viewfinder 17; on its top, a shutter button 14 and a status display unit 15 that displays the shutter speed, aperture value, and the like; and on its side, a memory card slot 19 into which a memory card 20, a type of flash memory, is inserted.

[0014] An example of operating the digital camera 1 is briefly described below. When the user frames the shot using the viewfinder 17 or the image display unit 13 and presses the shutter button 14, the image captured through the lens 18 is encoded internally into a compressed image and stored on the memory card 20. The memory card 20 can store several tens of compressed images.

[0015] When the user presses the record button 22, audio data is collected from the microphone 12 for a predetermined time from that moment (about 10 seconds in this embodiment) and encoded into compressed audio data. The compressed audio data is embedded in whichever of the compressed images stored on the memory card 20 the user selects. Hereinafter, a compressed image in which compressed audio data is embedded is called a compressed image with audio, a compressed image in which none is embedded is called a compressed image without audio, and where the distinction is unnecessary both are simply called compressed images.

[0016] Each time the user presses the playback button 21a or 21b, the compressed images stored on the memory card 20 are decoded one at a time and the decoded image is displayed on the image display unit 13. When a compressed image with audio is decoded, the compressed audio data is extracted, decoded, and reproduced through the speaker 11 at the same time as the decoded image is displayed.

(Schematic Configuration of Digital Camera 1) FIG. 3 is a schematic configuration diagram of the digital camera 1.

[0017] As shown in the figure, the digital camera 1 comprises an image encoding unit 3, an encoding memory 35, a memory card input/output unit 36, an audio encoding unit 4, an embedding unit 37, an audio decoding unit 5, an image decoding unit 6, and an extraction unit 83. When the shutter button 14 is pressed, the image encoding unit 3 encodes the image captured through the lens 18 by the JPEG method to generate a compressed image without audio, and outputs it to the encoding memory 35.

[0018] The encoding memory 35 temporarily stores the compressed images passed among the image encoding unit 3, the memory card input/output unit 36, the embedding unit 37, and the image decoding unit 6. The memory card input/output unit 36 writes compressed images stored in the encoding memory 35 to the memory card 20, and reads compressed images stored on the memory card 20 into the encoding memory 35.

[0019] When the record button 22 is pressed, the audio encoding unit 4 collects about 10 seconds of external audio data through the microphone 12, encodes it by ADPCM (Adaptive Differential PCM) of the IMA (Interactive Multimedia Association) type, and stores the resulting compressed audio data in a memory inside the audio encoding unit 4 (the audio memory 44 described later).

[0020] The embedding unit 37 embeds the compressed audio data in a compressed image without audio stored in the encoding memory 35 to generate a compressed image with audio, and returns it to the encoding memory 35. The embedding unit 37 is the principal component of the present invention and is described in detail later. The image decoding unit 6 decodes an image from a compressed image stored in the encoding memory 35 by reversing the encoding performed by the image encoding unit 3, and displays it on the image display unit 13.

[0021] The extraction unit 83 extracts the compressed audio data from a compressed image with audio by reversing the embedding performed by the embedding unit 37, and outputs it to the audio decoding unit 5. The audio decoding unit 5 decodes audio data from the compressed audio data output by the extraction unit 83 by reversing the encoding performed by the audio encoding unit 4, and reproduces it through the speaker 11.

(Detailed Configuration of Image Encoding Unit 3, Audio Encoding Unit 4, Audio Decoding Unit 5, and Image Decoding Unit 6) FIG. 4 is a detailed configuration diagram of FIG. 3, and FIGS. 5 and 6 are detailed configuration diagrams showing parts of FIG. 4. The image encoding unit 3, audio encoding unit 4, audio decoding unit 5, and image decoding unit 6 are described below with reference to these figures.

(Detailed Configuration of Image Encoding Unit 3) In FIG. 4, the image encoding unit 3 comprises an imaging unit 31, a captured image memory 33, and an encoding unit 34.

(Imaging Unit 31) The imaging unit 31 comprises the lens 18, a CCD (not shown), a color converter (not shown), and the like. When the shutter button 14 is pressed, it converts the RGB signal obtained through the lens 18 and the CCD into an image consisting of YCrCb components using the color converter, and writes the image to the captured image memory 33.

[0022] One image consists of a luminance component Y of 1280 pixels × 960 lines (1,228,800 pixels in total) and color-difference components Cr and Cb, each of 640 pixels × 480 lines (or 640 pixels × 960 lines when subsampled only horizontally), that is, 307,200 pixels (or 614,400 pixels) in total.

(Captured Image Memory 33) The captured image memory 33 temporarily stores the image written by the imaging unit 31.

(Encoding Unit 34) The encoding unit 34 compression-encodes the image stored in the captured image memory 33 by the JPEG method in blocks of 8 pixels × 8 lines, and writes the resulting compressed code string to the encoding memory 35. The compressed code string for one frame corresponds to a compressed image without audio.

[0023] FIG. 7 shows the relationship between the luminance component Y of one frame and its blocks. The luminance component Y consists of 160 blocks horizontally by 120 blocks vertically, 19,200 blocks in total, and each block consists of 8 pixels × 8 lines, 64 pixels in total. For example, block 102 in the figure is one of the blocks in the luminance component Y of one frame. The color-difference components Cr and Cb of one frame likewise each consist of 80 blocks horizontally by 60 blocks vertically, 4,800 blocks in total (or 80 blocks by 120 blocks, 9,600 blocks in total, in the 640 pixel × 960 line case).
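The block counts quoted above follow directly from dividing each component's dimensions by the 8-pixel block size. A short check, in which the helper name `block_grid` is hypothetical:

```python
BLOCK = 8  # JPEG block edge length in pixels


def block_grid(width, height, block=BLOCK):
    """Return (blocks across, blocks down, total blocks) for one component."""
    across, down = width // block, height // block
    return across, down, across * down


print(block_grid(1280, 960))  # luminance Y            -> (160, 120, 19200)
print(block_grid(640, 480))   # Cr or Cb, both axes    -> (80, 60, 4800)
print(block_grid(640, 960))   # Cr or Cb, horizontal   -> (80, 120, 9600)
```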

[0024] FIG. 8(a) shows a specific example Yxy (x, y = 0 to 7, where x and y denote the pixel position within the block) of the pixels of one block of the luminance component Y. Yxy in the figure is the original signal value minus 128. This level shift brings the expected value of the DCT coefficients obtained by the subsequent discrete cosine transform (hereinafter DCT) to 0.

(Detailed Configuration of Encoding Unit 34) As shown in FIG. 5, the encoding unit 34 comprises a DCT unit 71, a quantization unit 72, and an entropy encoding unit 74.

(DCT Unit 71) The DCT unit 71 reads the luminance component Y and the color-difference components Cr and Cb block by block from the captured image memory 33, applies the DCT, generates a DCT coefficient block Suv (u, v = 0 to 7) consisting of 8 × 8 DCT coefficients, and outputs it to the quantization unit 72. In Suv, S00, which represents the direct-current component, is called the DC coefficient, and the DCT coefficients other than S00, which represent alternating-current components, are called AC coefficients. The larger the values of u and v, the higher the frequency component that Suv represents.
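The level shift and the block DCT can be sketched as below. This is a plain, unoptimized illustration using the DCT-II normalization customary for JPEG, which the text itself does not spell out. The function names and the choice of the first list index as `x` are assumptions.

```python
import math


def level_shift(pixels):
    """Subtract 128 from every sample, as described for Yxy."""
    return [[p - 128 for p in row] for row in pixels]


def dct_8x8(block):
    """2-D DCT-II of an 8x8 block with the scaling customary for JPEG."""
    def c(k):
        return 1 / math.sqrt(2) if k == 0 else 1.0

    out = [[0.0] * 8 for _ in range(8)]
    for u in range(8):
        for v in range(8):
            total = 0.0
            for x in range(8):
                for y in range(8):
                    total += (block[x][y]
                              * math.cos((2 * x + 1) * u * math.pi / 16)
                              * math.cos((2 * y + 1) * v * math.pi / 16))
            out[u][v] = 0.25 * c(u) * c(v) * total
    return out
```

For a flat block the result has only a DC term: with this scaling, a block whose shifted samples all equal 100 yields a DC coefficient of 800 and AC coefficients that are zero up to floating-point error.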

[0025] FIG. 8(b) shows the DCT coefficient block Suv obtained by applying the DCT to Yxy. In the figure, S00 = 823 is the DC coefficient and the rest are AC coefficients. It can be seen that the values become smaller as u and v increase, that is, toward higher frequencies. The specific formula for the DCT is well known and is not described here.

(Quantization Unit 72) The quantization unit 72 has a quantization table Quv (u, v = 0 to 7) consisting of 8 × 8 quantization coefficients, uses it to quantize the DCT coefficient block Suv, generates a quantized DCT coefficient block Ruv (u, v = 0 to 7) consisting of 8 × 8 quantized DCT coefficients, and outputs it to the entropy encoding unit 74.

[0026] The quantized DCT coefficient block Ruv is calculated as follows.

(Equation 1) Ruv = round(Suv / Quv)

Here, round() is a function that rounds the value in parentheses to the nearest integer.
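Equation 1 can be written out as a small sketch. The function names are hypothetical, and the handling of exact halves (rounded up here) is an assumption, since the text only says "nearest integer". Python's built-in `round` is avoided because it rounds halves to the nearest even integer.

```python
import math


def round_nearest(x):
    """Round to the nearest integer; exact halves round up (an assumption)."""
    return math.floor(x + 0.5)


def quantize(S, Q):
    """Equation 1 applied element-wise: R[u][v] = round(S[u][v] / Q[u][v])."""
    return [[round_nearest(s / q) for s, q in zip(s_row, q_row)]
            for s_row, q_row in zip(S, Q)]


print(round_nearest(-135 / 4))  # -34
```

The printed value matches the worked example for R10 that the description gives with FIG. 8(d).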

[0027] The JPEG method does not prescribe the values of the quantization coefficients, so they can be set freely for each application or each image. In general, larger quantization coefficients are assigned as u and v increase. High-frequency components are given larger quantization coefficients because degradation in them is visually less noticeable, so quantizing them coarsely improves compression efficiency while preserving image quality.

[0028] FIG. 8(c) shows a specific example of the quantization table Quv. FIG. 8(d) shows the quantized DCT coefficient block Ruv obtained when the DCT coefficient block Suv of FIG. 8(b) is quantized with the quantization table Quv of FIG. 8(c). In this example, R10 = round(S10 / Q10) = round(-135 / 4) = -34.

(Entropy Encoding Unit 74) The entropy encoding unit 74 entropy-encodes the quantized DCT coefficient block Ruv received from the quantization unit 72 to generate a compressed code string, and writes it to the encoding memory 35. The compressed code string for one frame corresponds to a compressed image. Entropy decoding is well known, so its description is omitted.

(Detailed Configuration of Audio Encoding Unit 4) The audio encoding unit 4 comprises a sound collection unit 41, an audio encoder 43, and an audio memory 44.

(Sound Collection Unit 41) The sound collection unit 41 comprises the microphone 12, an amplifier (not shown), an AD conversion circuit (not shown), a quantization circuit (not shown), and the like. It collects about 10 seconds of external analog audio data from the moment the user presses the record button 22, converts it to digital audio data by 11 kHz sampling, AD conversion, and so on, and outputs it to the audio encoder 43.

(Audio Encoder 43) The audio encoder 43 converts the digital audio data into compressed audio data based on IMA ADPCM and outputs it to the audio memory 44. IMA ADPCM is well known, so its description is omitted.

(Audio Memory 44) The audio memory 44 stores the compressed audio data output by the audio encoder 43.

(Detailed Configuration of Audio Decoding Unit 5) The audio decoding unit 5 comprises an audio memory 54, an audio decoder 53, and an audio reproduction unit 51.

(Audio Memory 54) The audio memory 54 temporarily stores the compressed audio data extracted from a compressed image by the extraction unit 83.

(Audio Decoder 53) The audio decoder 53 decodes digital audio data from the compressed audio data stored in the audio memory 54 based on IMA ADPCM, and outputs it to the audio reproduction unit 51.

(Audio Reproduction Unit 51) The audio reproduction unit 51 comprises a DA conversion circuit (not shown), the speaker 11, and the like, and converts the digital audio data decoded by the audio decoder 53 into analog audio data and reproduces it.

(Detailed Configuration of Image Decoding Unit 6) The image decoding unit 6 comprises a decoding unit 62, a display image memory 61, and the image display unit 13.

(Decoding Unit 62) The decoding unit 62 reads a compressed image without audio or a compressed image with audio stored in the encoding memory 35, decodes it by the JPEG method, and outputs the resulting image to the display image memory 61.

[0029] FIG. 6 is a more detailed configuration diagram of the decoding unit 62. In the figure, the decoding unit 62 comprises an entropy decoding unit 84, an inverse quantization unit 82, and an inverse DCT unit 81.

(Entropy Decoding Unit 84) The entropy decoding unit 84 entropy-decodes the compressed image stored in the encoding memory 35 to generate quantized DCT coefficient blocks Ruv (or R'uv), and outputs them to the extraction unit 83 and the inverse quantization unit 82. Here, a quantized DCT coefficient block that does not contain compressed audio data is denoted Ruv, and one that does is denoted R'uv.

(Inverse Quantization Unit 82) The inverse quantization unit 82 has the same quantization table Quv as the quantization unit 72, generates a DCT coefficient block S'uv from Quv and the quantized DCT coefficient block Ruv (or R'uv) by the inverse quantization shown in Equation 2, and outputs it to the inverse DCT unit 81. Here, the DCT coefficient block generated from Ruv or R'uv is denoted S'uv to distinguish it from the DCT coefficient block before quantization, Suv.

(Equation 2) S'uv = Ruv (or R'uv) × Quv

(Inverse DCT Unit 81) The inverse DCT unit 81 applies the inverse DCT to decode the luminance component Y and the color-difference components Cr and Cb from the DCT coefficient block S'uv block by block, and writes them to the display image memory 61. The inverse DCT is well known, so its description is omitted.

(Display Image Memory 61) The display image memory 61 temporarily stores the image consisting of the luminance component Y and the color-difference components Cr and Cb decoded by the decoding unit 62.

(Image Display Unit 13) The image display unit 13 comprises a liquid crystal display or the like, and displays the image stored in the display image memory 61.

(Detailed Configuration of Embedding Unit 37) The embedding unit 37 reads the compressed image without audio stored in the encoding memory 35 and entropy-decodes it back into quantized DCT coefficient blocks Ruv. Next, for each quantized DCT coefficient block Ruv of the luminance component Y, the embedding unit 37 selects a predetermined number N of quantized DCT coefficients for embedding from among the 63 quantized DCT coefficients (AC coefficients) that remain of the 64 once the DC coefficient (R00) is excluded. In this embodiment the predetermined number N is 16. The embedding unit 37 further divides the compressed audio data into pieces of partial compressed audio data of N bits (that is, 16 bits) each. Finally, the embedding unit 37 assigns one piece of partial compressed audio data to each block and embeds it, one bit at a time, in the least significant bits of the 16 quantized DCT coefficients selected for embedding. Here, embedding one bit of partial compressed audio data in the least significant bit of a quantized DCT coefficient means replacing the value of that least significant bit with one bit of the partial compressed audio data.
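The bit replacement described for the embedding unit 37 can be sketched as follows. The text does not say how negative coefficients are represented or in which order the 16 bits are written, so this sketch makes two assumptions: the least significant bit is taken from the coefficient's magnitude with the sign preserved, and bits are written most significant first. All function names are hypothetical, and `positions` stands for the 16 coefficient positions chosen by the selection step.

```python
def set_lsb(coeff, bit):
    """Replace the least significant bit of |coeff| with `bit`, keeping the sign."""
    magnitude = (abs(coeff) & ~1) | bit
    return -magnitude if coeff < 0 else magnitude


def embed_chunk(block, positions, chunk):
    """Embed the integer `chunk`, one bit per position, MSB first (an assumption)."""
    out = [row[:] for row in block]          # leave the input block untouched
    n = len(positions)
    for i, (u, v) in enumerate(positions):
        bit = (chunk >> (n - 1 - i)) & 1
        out[u][v] = set_lsb(out[u][v], bit)
    return out


def extract_chunk(block, positions):
    """Inverse operation: read the embedded bits back into an integer."""
    value = 0
    for u, v in positions:
        value = (value << 1) | (abs(block[u][v]) & 1)
    return value
```

In this sketch, changing the lowest magnitude bit never moves a coefficient across an even threshold such as 2, so a reader applying the same selection rule to the modified block would arrive at the same positions.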

[0030] Finally, the embedding unit 37 entropy-encodes the quantized DCT coefficient blocks R'uv in which embedding has been completed and returns them to the encoding memory 35. FIG. 9 is a detailed configuration diagram of the embedding unit 37. In the figure, the embedding unit 37 comprises a compressed image input unit 101, a decision value input unit 102, an embedding amount input unit 103, a compressed audio input unit 104, an entropy decoding unit 105, a selection unit 106, an embedding processing unit 107, and an output unit 108.

【００３１】The compressed image input unit 101 reads out the compressed code string stored in the encoding memory and outputs it to the entropy decoding unit 105. The determination value input unit 102 stores a determination value J in advance. The determination value J serves as the threshold used when the selection unit 106 decides which quantized DCT coefficients in the quantized DCT coefficient block Ruv are to be used for embedding. In the present embodiment, the determination value J is 2. Quantized DCT coefficients whose absolute value is equal to or greater than this determination value of 2 are candidates for embedding.

【００３２】The embedding amount input unit 103 stores an embedding amount N in advance. The embedding amount N is the value obtained by dividing 38400 bytes, the data amount of compressed audio data corresponding to about 10 seconds, by the total number of luminance-component blocks and converting the result into bits. In the present embodiment, N = 38400 bytes ÷ 19200 blocks = 16 bits. The compressed audio input unit 104 divides the compressed audio data stored in the audio memory 44 into pieces of the embedding amount N stored in the embedding amount input unit 103, and outputs them to the embedding processing unit 107.
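The arithmetic behind N, and the division of the audio into N-bit pieces, can be checked with a short Python sketch. The byte and block counts come from the text; the function names and the most-significant-bit-first bit order are assumptions made only for illustration.

```python
AUDIO_BYTES = 38400   # about 10 seconds of compressed audio (from the text)
LUMA_BLOCKS = 19200   # total number of luminance blocks (from the text)


def embedding_amount_bits(audio_bytes: int, blocks: int) -> int:
    """Bits of audio each block must carry so that the whole clip fits."""
    total_bits = audio_bytes * 8
    if total_bits % blocks:
        raise ValueError("audio does not divide evenly across the blocks")
    return total_bits // blocks


def split_audio(data: bytes, n_bits: int):
    """Split audio into per-block bit lists (MSB-first order is an assumption)."""
    bits = [(byte >> (7 - k)) & 1 for byte in data for k in range(8)]
    return [bits[i:i + n_bits] for i in range(0, len(bits), n_bits)]


N = embedding_amount_bits(AUDIO_BYTES, LUMA_BLOCKS)
pieces = split_audio(bytes(AUDIO_BYTES), N)
print(N, len(pieces))
```

With the figures given in the text, each block carries one 16-bit piece and the number of pieces equals the number of luminance blocks.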

【００３３】The entropy decoding unit 105 entropy-decodes the compressed code string output from the compressed image input unit 101, and outputs the quantized DCT coefficient blocks Ruv of the luminance component to the selection unit 106.
(Selection Unit 106) For each quantized DCT coefficient block Ruv output from the entropy decoding unit 105, the selection unit 106 selects a total of N (16) quantized DCT coefficients for embedding from among the 63 quantized DCT coefficients (AC coefficients) excluding the DC coefficient (R00), and outputs the selection result to the embedding processing unit 107.

【００３４】The selection unit 106 has 8 × 8 embedding flags Euv (u, v = 0 to 7) corresponding to the quantized DCT coefficient block Ruv, and records the selection result by setting the embedding flag Euv corresponding to each selected quantized DCT coefficient. More specifically, all embedding flags Euv are initially off, and the selection unit 106 turns on the flag corresponding to each quantized DCT coefficient selected for embedding. The flags corresponding to quantized DCT coefficients not selected for embedding remain off.

【００３５】The selection unit 106 selects the quantized DCT coefficients for embedding so that, when the image is later decoded, visual degradation relative to the original image before encoding is as small as possible. To this end, in the present embodiment the selection unit 106 selects the quantized DCT coefficients for embedding from among (1) those whose absolute value is equal to or greater than the determination value J and (2) those of low frequency, giving priority to condition (1).

【００３６】The reason for using condition (1) is that, when the value of one bit changes, a quantized DCT coefficient with a small absolute value suffers a larger error than one with a large absolute value. Embedding in quantized DCT coefficients with large absolute values therefore causes less degradation. The reason for using condition (2) is that, as can be seen from the quantization table Quv in FIG. 8(c), high-frequency quantized DCT coefficients are inversely quantized with larger values than low-frequency ones. Consequently, when embedding is performed in a high-frequency quantized DCT coefficient, the DCT coefficient obtained by inverse quantization deviates more from the DCT coefficient at the time of encoding than when embedding is performed in a low-frequency quantized DCT coefficient. Embedding in low-frequency quantized DCT coefficients therefore causes less degradation than embedding in high-frequency ones.

【００３７】Condition (1) is given priority over condition (2) because a change in value causes less degradation under condition (1) than under condition (2). FIG. 10 is a flowchart showing the selection processing performed by the selection unit 106. In the figure, the selection unit 106 reads out one quantized DCT coefficient (AC coefficient) and determines whether its absolute value is equal to or greater than the determination value J (= 2) (steps 11 and 12). This determination can be made by checking whether any bit at or above the second-lowest bit position of the quantized DCT coefficient is 1. That is, if at least one bit at or above the second-lowest position of the absolute value of the quantized DCT coefficient is 1, the selection unit 106 determines that the absolute value is 2 or more; if none is 1, it determines that the absolute value is less than 2.
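The bit test described above can be written as a one-line check. The following Python sketch is illustrative only; the function name is hypothetical, and it assumes the test is applied to the magnitude of the coefficient, as the text suggests.

```python
def meets_threshold(coeff: int) -> bool:
    """True if |coeff| >= 2, decided without looking at the lowest bit.

    Shifting the magnitude right by one discards the least significant bit;
    any remaining 1 bit means the magnitude is at least 2.
    """
    return (abs(coeff) >> 1) != 0


for c in (-3, -2, -1, 0, 1, 2, 3):
    print(c, meets_threshold(c))
```

Because the lowest bit of the magnitude is never consulted, replacing that bit later cannot change the outcome of this test, provided the replacement is also applied to the magnitude.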

【００３８】If the absolute value is found to be equal to or greater than the determination value J, the selection unit 106 sets the corresponding embedding flag and adds 1 to a variable C (steps 13 and 14). The variable C indicates the number of embedding flags that have been set, that is, the number of quantized DCT coefficients selected for embedding. The selection unit 106 repeats steps 11 to 14 for the 63 quantized DCT coefficients (AC coefficients) while scanning the quantized DCT coefficient block in zigzag order. This processing ends when the variable C becomes equal to or greater than the embedding amount N, or when steps 11 to 14 have been performed for all 63 quantized DCT coefficients (AC coefficients), and the processing then proceeds to step 16.

【００３９】In step 16, the selection unit 106 determines whether the variable C is smaller than the embedding amount N. If it is, the number of quantized DCT coefficients selected for embedding has not reached the data amount of the partial compressed audio data. The selection unit 106 therefore selects (embedding amount N - variable C) coefficients for embedding from among those whose embedding flags are off, in zigzag scan order starting from the low-frequency side, and sets their embedding flags (step 17).
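The selection procedure of steps 11 to 17 can be sketched in Python as follows. This is a minimal illustration, not the patent's implementation: it assumes the 63 AC coefficients are supplied as a flat list already in zigzag order, and the function name and the sample block are hypothetical.

```python
def select_for_embedding(ac, n=16, j=2):
    """Return one embedding flag per AC coefficient (zigzag order assumed)."""
    flags = [False] * len(ac)
    count = 0  # the variable C in the text

    # Steps 11 to 15: flag coefficients with |c| >= J, low frequencies first.
    for i, c in enumerate(ac):
        if count >= n:
            break
        if abs(c) >= j:
            flags[i] = True
            count += 1

    # Steps 16 and 17: if fewer than N were found, top up with the
    # lowest-frequency coefficients that are still unflagged.
    for i in range(len(ac)):
        if count >= n:
            break
        if not flags[i]:
            flags[i] = True
            count += 1
    return flags


# Hypothetical block: 13 coefficients reach the threshold, the rest are zero.
block = [10, -11, -12, 5, 5, 12, -7, -7, 4, 3, -2, 2, -3] + [0] * 50
flags = select_for_embedding(block)
print(sum(flags), [i for i, f in enumerate(flags) if f])
```

In this sample the first pass flags 13 coefficients and the second pass adds the three lowest-frequency unflagged positions, giving 16 flags in total.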

【００４０】By the above procedure, the selection unit 106 selects N quantized DCT coefficients for embedding.
(Selection Processing Example 1) FIG. 11(a) shows, enclosed in circles, the quantized DCT coefficients selected for embedding when the selection unit 106 performs the selection processing on the quantized DCT coefficient block Ruv of FIG. 8(d).

【００４１】The selection unit 106 determines whether each quantized DCT coefficient (AC coefficient) is equal to or greater than the determination value J while scanning the quantized DCT coefficient block Ruv in zigzag order. In the example of FIG. 11(a), the selection unit 106 selects -34, 26, -19, 30, ... in this order for embedding, and ends the selection processing upon selecting -2, at which point the number of selected quantized DCT coefficients has reached the embedding amount N.

【００４２】FIG. 11(b) shows the embedding flags Euv resulting from the selection processing performed on FIG. 11(a). In FIG. 11(b), 1 indicates that the flag is set, meaning that the quantized DCT coefficient at that position has been selected for embedding, and 0 indicates that the flag is not set, meaning that the quantized DCT coefficient at that position has not been selected for embedding.
(Selection Processing Example 2) FIG. 12(a) shows, enclosed in circles, the quantized DCT coefficients selected for embedding as a result of the selection unit 106 repeating steps 11 to 15 of FIG. 10 on a quantized DCT coefficient block different from that of FIG. 8(d). As shown in the figure, the selection unit 106 selects 10, -11, -12, 5, 5, 12, ... for embedding in zigzag scan order, and scans to the end of the quantized DCT coefficient block.

【００４３】FIG. 12(b) shows the embedding flags corresponding to FIG. 12(a). As shown in FIG. 12(b), 13 embedding flags are set, fewer than the embedding amount N (= 16). The selection unit 106 therefore performs the processing of step 17. Specifically, it scans the embedding flags of FIG. 12(b) in zigzag scan order and sets 16 - 13 = 3 flags that are not yet set, so that 16 embedding flags are set in total. The embedding flags enclosed in circles in FIG. 12(b) are those newly set by the processing of step 17.

【００４４】FIG. 12(c) shows the embedding flags of the final selection result of the selection processing by the selection unit 106.
(Embedding Processing Unit 107) For each quantized DCT coefficient block Ruv, the embedding processing unit 107 embeds the partial compressed audio data in the 16 quantized DCT coefficients selected for embedding by the selection unit 106.

【００４５】More specifically, the embedding processing unit 107 scans the embedding flags output from the selection unit 106 in zigzag scan order, looking for flags that are set. On finding a set embedding flag, the embedding processing unit 107 changes the least significant bit of the quantized DCT coefficient corresponding to that flag to one bit of the partial compressed audio data input from the compressed audio input unit 104. The embedding processing unit 107 repeats this operation for the 16 flags that are set. In this way, the embedding processing unit 107 embeds the partial compressed audio data, one bit at a time from its beginning, in the least significant bits of the quantized DCT coefficients selected for embedding, and outputs the resulting quantized DCT coefficient block to the output unit 108.
(Embedding Processing Example) FIG. 13(a) shows an example of partial compressed audio data. The embedding processing unit 107 embeds this partial compressed audio data, one bit at a time from its beginning, in the quantized DCT coefficients selected for embedding.
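The bit replacement described above might look like the following Python sketch. It assumes a sign-magnitude view of each coefficient (the bit is replaced in the absolute value and the sign is kept), which the text implies but does not state outright; the function names and sample values are hypothetical.

```python
def set_lsb(coeff: int, bit: int) -> int:
    """Replace the least significant bit of |coeff| with `bit`, keeping the sign."""
    magnitude = (abs(coeff) & ~1) | bit
    return -magnitude if coeff < 0 else magnitude


def embed_block(ac, flags, bits):
    """Embed `bits` in order into the flagged coefficients (zigzag order assumed)."""
    out = list(ac)
    k = 0  # index of the next audio bit to embed
    for i, flagged in enumerate(flags):
        if flagged:
            out[i] = set_lsb(out[i], bits[k])
            k += 1
    return out


coeffs = [10, -11, 0, 5]
flags = [True, True, False, True]
embedded = embed_block(coeffs, flags, [1, 0, 1])
print(embedded)
```

Here 10 becomes 11, -11 becomes -10, and 5 is unchanged because its lowest bit already equals the embedded bit; the unflagged coefficient is left alone.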

【００４６】FIG. 13(b) shows the result when the embedding processing unit 107 embeds the partial compressed audio data of FIG. 13(a) in the quantized DCT coefficient block of FIG. 12(a). The quantized DCT coefficients enclosed in circles in FIG. 13(b) are those in which partial compressed audio data has been embedded. The embedding processing unit 107 scans the embedding flags of FIG. 12(c) in zigzag scan order looking for flags that are set, and on finding one embeds one bit of the partial compressed audio data in the quantized DCT coefficient corresponding to that flag.
(Output Unit 108) The output unit 108 performs the same entropy encoding as the entropy encoding unit 74 on the quantized DCT coefficient block in which the embedding processing unit 107 has completed embedding, and outputs the resulting compressed code string to the encoding memory 35. This compressed code string is the compressed image with sound.

【００４７】As described above, the embedding unit 37 embeds approximately 10 seconds of compressed audio data in one compressed image without sound by repeating the following processing: embedding partial compressed audio data in a quantized DCT coefficient block obtained by decoding the compressed code string, encoding the block into a compressed code string again, and outputting it to the encoding memory. Because the embedding unit 37 thus embeds partial compressed audio data in the least significant bits of 16 quantized DCT coefficients per block that have little influence on image degradation, a total of 2 bytes × 19200 blocks = 38400 bytes is embedded, so that a larger amount of compressed audio data can be embedded with little image degradation.
(Detailed Configuration of the Extraction Unit 83) FIG. 14 is a detailed configuration diagram of the extraction unit 83.

【００４８】In the figure, the extraction unit 83 comprises an identification unit 831 and an extraction processing unit 832. When a quantized DCT coefficient block Ruv or R'uv is output from the entropy decoding unit 84, the identification unit 831 generates embedding flags by the same processing as the flowchart of FIG. 10. The identification unit 831 generates the embedding flags regardless of whether the quantized DCT coefficient block is Ruv or R'uv, that is, regardless of whether the compressed image is without sound or with sound. Thus, for a quantized DCT coefficient block R'uv of a compressed image with sound, the identification unit 831 restores the same embedding flags as those created by the selection unit 106, and for a quantized DCT coefficient block Ruv of a compressed image without sound, the identification unit 831 generates embedding flags that are all off.
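The following Python sketch illustrates why rerunning the FIG. 10 selection on an embedded block can reproduce the flags chosen at embedding time, so that the audio bits can be read back from the least significant bits. It is a simplified round trip under stated assumptions: coefficients are handled in sign-magnitude form, the AC coefficients are given as a flat list in zigzag order, and all names and sample values are hypothetical.

```python
def select_flags(ac, n=16, j=2):
    """FIG. 10 style selection: |c| >= j first, then low-frequency fill-up."""
    strong = [i for i, c in enumerate(ac) if abs(c) >= j][:n]
    fill = [i for i in range(len(ac)) if i not in strong][:n - len(strong)]
    chosen = set(strong) | set(fill)
    return [i in chosen for i in range(len(ac))]


def set_lsb(coeff, bit):
    """Replace the lowest bit of the magnitude, keeping the sign."""
    magnitude = (abs(coeff) & ~1) | bit
    return -magnitude if coeff < 0 else magnitude


def embed(ac, bits, n=16, j=2):
    flags, out, k = select_flags(ac, n, j), list(ac), 0
    for i, flagged in enumerate(flags):
        if flagged:
            out[i] = set_lsb(out[i], bits[k])
            k += 1
    return out


def extract(ac, n=16, j=2):
    """Regenerate the flags from the received block, then read the LSBs."""
    flags = select_flags(ac, n, j)
    return [abs(c) & 1 for c, flagged in zip(ac, flags) if flagged]


original = [10, -11, -12, 5, 5, 12, -7, -7, 4, 3, -2, 2, -3] + [0] * 50
payload = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 1, 0, 0, 1, 0, 1]
received = embed(original, payload)
print(extract(received) == payload)
```

The flags survive embedding in this sketch because the threshold test ignores the lowest bit of the magnitude, and the fill-up positions depend only on which coefficients passed that test.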

【００４９】The extraction processing unit 832 refers to the embedding flags restored by the identification unit 831 in zigzag scan order, extracts the least significant bit of each quantized DCT coefficient whose embedding flag is on, and outputs it to the audio memory 54. In this way, the extraction unit 83 extracts the partial compressed audio data from a quantized DCT coefficient block in which it is embedded by performing the reverse of the selection and embedding processing, and extracts approximately 10 seconds of compressed audio data by doing this for all quantized DCT coefficient blocks.
(Embodiment 2) A digital camera 2 according to Embodiment 2 of the present invention is described below.

【００５０】In addition to embedding compressed audio data in the least significant bits of quantized DCT coefficients by the same method as the digital camera 1, the digital camera 2 also embeds compressed audio data in the second and third bits from the bottom of quantized DCT coefficients, and can thereby embed more compressed audio data in one image than Embodiment 1. Its configuration differs from that of the digital camera 1 shown in FIG. 3 in that it includes an embedding unit 47 and an extraction unit 93 in place of the embedding unit 37 and the extraction unit 83.

【００５１】The embedding unit 47 and the extraction unit 93 are described below.
(Embedding Unit 47) FIG. 15 is a detailed configuration diagram of the embedding unit 47. In the figure, the embedding unit 47 comprises a compressed image input unit 101, a determination value input unit 202, an embedding amount input unit 103, a compressed audio input unit 204, an entropy decoding unit 105, a selection unit 206, an embedding processing unit 207, and an output unit 108.

【００５２】In the figure, components with the same reference numerals as in the embedding unit 37 of FIG. 9 have the same functions, so their description is omitted; the components with different reference numerals are described below. The determination value input unit 202 stores determination values J, J2, and J3 in advance. In the present embodiment, J, J2, and J3 are 2, 4, and 8, respectively. The compressed audio input unit 204 divides the compressed audio data stored in the audio memory 44 into units of the embedding amount and outputs them to the embedding processing unit 207. The embedding amount differs for each quantized DCT coefficient block and is indicated by the selection unit 206.

【００５３】As in Embodiment 1, the selection unit 206 selects the quantized DCT coefficients whose least significant bits are to be used for embedding by the selection processing shown in FIG. 10. This processing is the same as in Embodiment 1, so its description is omitted. In addition to this selection processing, the selection unit 206 selects quantized DCT coefficients whose second and third bits are to be used for embedding.

【００５４】The selection unit 206 classifies each quantized DCT coefficient according to the magnitude of its absolute value, and determines the number of quantized DCT coefficients to embed in and the bit positions to embed in according to the number of quantized DCT coefficients belonging to each class. FIG. 16 is a flowchart showing the logic by which the selection unit 206 determines the number of quantized DCT coefficients to embed in and the bit positions to embed in.

【００５５】In the figure, the selection unit 206 first counts, among the 63 AC coefficients in one quantized DCT coefficient block, the number C2 of quantized DCT coefficients whose absolute value is 4 or more and the number C3 of quantized DCT coefficients whose absolute value is 8 or more (step 21). The selection unit 206 then makes the following decision according to the values of C2 and C3. If C3 is 8 or more, that is, if the quantized DCT coefficient block contains eight or more quantized DCT coefficients whose absolute value is 8 or more (step 22), the selection unit 206 decides to use for embedding the third bit from the bottom of four quantized DCT coefficients whose absolute value is 8 or more, and the second bit from the bottom of eight quantized DCT coefficients whose absolute value is 4 or more (step 23).

【００５６】If C3 is less than 8 and C2 is 8 or more, that is, if the quantized DCT coefficient block contains fewer than eight quantized DCT coefficients whose absolute value is 8 or more but eight or more whose absolute value is 4 or more (step 24), the selection unit 206 decides to use for embedding the second bit from the bottom of eight quantized DCT coefficients whose absolute value is 4 or more (step 25).

【００５７】If C2 is 4 or more and less than 8, that is, if the quantized DCT coefficient block contains at least four but fewer than eight quantized DCT coefficients whose absolute value is 4 or more (step 26), the selection unit 206 decides to use for embedding the second bit from the bottom of four quantized DCT coefficients whose absolute value is 4 or more (step 27). In addition to the embedding flags Euv of Embodiment 1, the selection unit 206 has embedding flags E2uv (u, v = 0 to 7) for the second bit from the bottom and embedding flags E3uv (u, v = 0 to 7) for the third bit from the bottom, and sets the embedding flags E2uv and E3uv according to the above decision.
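The decision logic of steps 21 to 27 can be summarised in a few lines of Python. This is a sketch under assumptions: the function name is hypothetical, the return value is a pair (number of second-bit coefficients, number of third-bit coefficients), and the case where C2 is less than 4 is taken to mean no additional embedding, which the text does not state explicitly.

```python
def decide_extra_bits(ac, j2=4, j3=8):
    """Return (second_bit_count, third_bit_count) for one block of AC coefficients."""
    c2 = sum(1 for c in ac if abs(c) >= j2)  # step 21: count |c| >= 4
    c3 = sum(1 for c in ac if abs(c) >= j3)  # step 21: count |c| >= 8
    if c3 >= 8:        # step 22
        return 8, 4    # step 23
    if c2 >= 8:        # step 24
        return 8, 0    # step 25
    if c2 >= 4:        # step 26
        return 4, 0    # step 27
    return 0, 0        # assumption: no second- or third-bit embedding


print(decide_extra_bits([9] * 8 + [0] * 55))
print(decide_extra_bits([5] * 8 + [0] * 55))
print(decide_extra_bits([5] * 4 + [0] * 59))
```

The three sample blocks fall into the step 23, step 25, and step 27 cases respectively.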

【００５８】More specifically, if the decision is that of step 23, the selection unit 206 scans the quantized DCT coefficients other than the DC coefficient (the AC coefficients) in zigzag scan order, selects four quantized DCT coefficients whose absolute value is 8 or more, and sets the corresponding embedding flags E3uv. It likewise scans the AC coefficients in zigzag scan order, selects eight quantized DCT coefficients whose absolute value is 4 or more, and sets the corresponding embedding flags E2uv.

【００５９】If the decision is that of step 25, the selection unit 206 scans the quantized DCT coefficients other than the DC coefficient (the AC coefficients) in zigzag scan order, selects eight quantized DCT coefficients whose absolute value is 4 or more, and sets the corresponding embedding flags E2uv. It does not set any embedding flag E3uv. If the decision is that of step 27, the selection unit 206 scans the AC coefficients in zigzag scan order, selects four quantized DCT coefficients whose absolute value is 4 or more, and sets the corresponding embedding flags E2uv. It does not set any embedding flag E3uv.
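Setting the flags E2uv and E3uv amounts to taking the first k coefficients in zigzag order that reach a threshold. The Python sketch below shows one way to express this; the helper name and the sample block are hypothetical, and the coefficients are assumed to be given as a flat list in zigzag order.

```python
def first_k_flags(ac, threshold, k):
    """Flag the first k coefficients (zigzag order assumed) with |c| >= threshold."""
    flags = [False] * len(ac)
    remaining = k
    for i, c in enumerate(ac):
        if remaining == 0:
            break
        if abs(c) >= threshold:
            flags[i] = True
            remaining -= 1
    return flags


# Hypothetical block with nine coefficients of magnitude 4 or more.
block = [10, -11, -12, 5, 5, 12, -7, -7, 4] + [0] * 54

e2_step25 = first_k_flags(block, threshold=4, k=8)  # step 25: eight second-bit flags
e3_step25 = first_k_flags(block, threshold=8, k=0)  # step 25: no third-bit flags
print(sum(e2_step25), sum(e3_step25))
```

For a step 23 decision the same helper would be called with `threshold=8, k=4` for E3uv and `threshold=4, k=8` for E2uv.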

【００６０】For example, when the decision processing is performed on the quantized DCT coefficient block shown in FIG. 12(a), there are nine AC coefficients whose absolute value is 4 or more (10, -11, -12, 5, 5, 12, -7, -7, and 4), so C2 = 9, and four AC coefficients whose absolute value is 8 or more (10, -11, -12, and 12), so C3 = 4 (step 21). The decision therefore corresponds to step 25, and the second bit from the bottom of eight quantized DCT coefficients whose absolute value is 4 or more in the quantized DCT coefficient block of FIG. 12(a) is used for embedding. FIG. 17 shows the embedding flags E2uv set according to this decision. As shown in the figure, eight quantized DCT coefficients whose absolute value is 4 or more are selected in zigzag scan order, and the corresponding embedding flags E2uv are set. With the decision of step 25, no embedding flag E3uv is set.

【００６１】The selection unit 206 further notifies the compressed audio input unit 204 of the embedding amount, which is the sum of the amounts for the least significant bit, the second bit from the bottom, and the third bit from the bottom. Specifically, if the decision is that of step 23, 8 bits are embedded in the second bit from the bottom, 4 bits in the third bit from the bottom, and 16 bits in the least significant bit, so the selection unit 206 reports the sum of 12 and 16, namely 28, as the embedding amount.

【００６２】If the decision is that of step 25, 8 bits are embedded in the second bit from the bottom and 16 bits in the least significant bit, so the selection unit 206 reports the sum of 8 and 16, namely 24, as the embedding amount. If the decision is that of step 27, 4 bits are embedded in the second bit from the bottom and 16 bits in the least significant bit, so the selection unit 206 reports the sum of 4 and 16, namely 20, as the embedding amount.
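The per-block embedding amounts above, and the way the audio bit stream would be cut into pieces of varying length, can be sketched in Python as follows. The function names are hypothetical, and the slicing routine is only an illustration of what the compressed audio input unit 204 is described as doing.

```python
LSB_BITS = 16  # bits embedded in least significant bits per block (from the text)


def block_embedding_amount(second_bits: int, third_bits: int) -> int:
    """Total bits carried by one block: LSB bits plus second- and third-bit bits."""
    return LSB_BITS + second_bits + third_bits


def split_by_amounts(bits, amounts):
    """Cut a bit sequence into consecutive pieces of the given lengths."""
    pieces, pos = [], 0
    for amount in amounts:
        pieces.append(bits[pos:pos + amount])
        pos += amount
    return pieces


amounts = [
    block_embedding_amount(8, 4),  # step 23
    block_embedding_amount(8, 0),  # step 25
    block_embedding_amount(4, 0),  # step 27
]
pieces = split_by_amounts(list(range(sum(amounts))), amounts)
print(amounts, [len(p) for p in pieces])
```

The three amounts match the values 28, 24, and 20 given in the text.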

【００６３】The embedding processing unit 207 embeds partial compressed audio data in the least significant bit, the second bit from the bottom, and the third bit from the bottom of quantized DCT coefficients on the basis of the embedding flags Euv, E2uv, and E3uv. Specifically, the embedding processing unit 207 embeds in the least significant bits on the basis of the embedding flags Euv in the same manner as in Embodiment 1, so a description of this is omitted.

【００６４】Next, the embedding processing unit 207 scans the embedding flags E2uv in zigzag scan order looking for flags that are set, and on finding one changes the second bit from the bottom of the corresponding quantized DCT coefficient to one bit of the partial compressed audio data input from the compressed audio input unit 204. When the scan of the embedding flags E2uv is complete, the embedding processing unit 207 likewise scans the embedding flags E3uv in zigzag scan order and changes the third bit from the bottom of each quantized DCT coefficient whose embedding flag is set to one bit of the partial compressed audio data.
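Replacing the second or third bit from the bottom generalises the least-significant-bit replacement of Embodiment 1. The Python sketch below assumes a sign-magnitude view of the coefficient and uses a hypothetical zero-based `position` parameter (0 for the least significant bit, 1 for the second bit, 2 for the third bit).

```python
def set_bit(coeff: int, position: int, bit: int) -> int:
    """Replace one bit of |coeff| (position 0 = LSB) with `bit`, keeping the sign."""
    magnitude = abs(coeff)
    magnitude = (magnitude & ~(1 << position)) | (bit << position)
    return -magnitude if coeff < 0 else magnitude


def get_bit(coeff: int, position: int) -> int:
    """Read back one bit of |coeff|."""
    return (abs(coeff) >> position) & 1


print(set_bit(5, 1, 1))    # second bit of 5 (binary 101) set to 1
print(set_bit(-12, 2, 0))  # third bit of 12 (binary 1100) cleared, sign kept
```

In this sketch, changing the second bit of a coefficient of magnitude 4 or more, or the third bit of a coefficient of magnitude 8 or more, never moves the magnitude below its threshold, which is consistent with the extraction side being able to repeat the same classification.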

【００６５】以上のようにして埋め込み部４７は、各ブ
ロックについて画像の劣化に影響しない量子化ＤＣＴ係
数とそのビット位置とを選択して、部分圧縮音声データ
の埋め込みを行うので、全体で約４３〜４８Ｋバイトの
圧縮音声データが埋め込まれることとなる。抽出部９３
は、エントロピー復号化部８４より量子化ＤＣＴ係数ブ
ロックが出力されると、選択部２０６が行う処理と同様
の方法によって埋め込みフラグＥuv、Ｅ２uv、Ｅ３uvを
復元し、それらのフラグに基づいて量子化ＤＣＴ係数の
最下位ビット、下位から２ビット目、下位から３ビット
目より部分圧縮音声データを抽出し、オーディオメモリ
５４に出力する。As described above, for each block the embedding unit 47 selects quantized DCT coefficients, and bit positions within them, that do not affect image degradation, and embeds the partially compressed audio data there, so that about 43 to 48 Kbytes of compressed audio data are embedded in total. When the entropy decoding unit 84 outputs a quantized DCT coefficient block, the extraction unit 93 restores the embedding flags Euv, E2uv, and E3uv by the same method as the processing performed by the selection unit 206. Based on those flags, it extracts the partially compressed audio data from the least significant bit, the second least significant bit, and the third least significant bit of the quantized DCT coefficients and outputs it to the audio memory 54.
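A minimal sketch of the extraction side is given below. The names `get_bit` and `extract_bits` and all sample values are hypothetical. The flags are supplied directly here, whereas the extraction unit described above restores them by repeating the selection logic. Reading the bits from the magnitude of each coefficient is an assumption that mirrors a sign-and-magnitude embedding.

```python
def get_bit(coeff, position):
    """Read bit `position` (0 = least significant) of the magnitude of coeff."""
    return (abs(coeff) >> position) & 1


def extract_bits(coeffs, planes):
    """Collect the embedded bits plane by plane.

    planes is a list of (flags, position) pairs in the order in which the
    embedding was done: Euv (bit 0), then E2uv (bit 1), then E3uv (bit 2).
    coeffs and every flags list are in zigzag order.
    """
    bits = []
    for flags, position in planes:
        for coeff, flagged in zip(coeffs, flags):
            if flagged:
                bits.append(get_bit(coeff, position))
    return bits


# Hypothetical block after embedding, with its restored flags.
coeffs = [46, -9, 11, 5, 0, 2]
e1 = [1, 1, 1, 1, 0, 0]
e2 = [1, 1, 1, 0, 0, 0]
e3 = [1, 1, 0, 0, 0, 0]

recovered = extract_bits(coeffs, [(e1, 0), (e2, 1), (e3, 2)])
print(recovered)  # [0, 1, 1, 1, 1, 0, 1, 1, 0]
```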

【００６６】このように抽出部９３は、埋め込み部４７
とは逆の操作を行うことによって部分圧縮音声データを
抽出することができる。以上、本発明の実施形態１、２
について説明したが、本発明は実施形態１、２に限ら
ず、以下のようにしても良い。（１）実施形態１において埋め込み部３７はデジタルカ
メラ内部に備えられていたが、デジタルカメラ内部に備
えずに埋め込み部３７単体で構成してもよい。実施形態
２の埋め込み部４７についても同様である。In this way, the extraction unit 93 can extract the partially compressed audio data by performing the reverse of the operation of the embedding unit 47. Embodiments 1 and 2 of the present invention have been described above. However, the present invention is not limited to Embodiments 1 and 2 and may be modified as follows. (1) Although the embedding unit 37 is provided inside the digital camera in the first embodiment, the embedding unit 37 may be configured as a stand-alone unit without being provided inside the digital camera. The same applies to the embedding unit 47 of the second embodiment.

【００６７】また埋め込み部３７及び４７をパソコン等
の画像処理の可能な装置の内部に構成してもよい。（２）実施形態１において埋め込み部３７は、一度符号
化された圧縮画像をエントロピー復号によって量子化Ｄ
ＣＴ係数ブロックに戻してから部分圧縮音声データを埋
め込むという手順で埋め込みを行っていたが、符号化部
３４による符号化の段階で埋め込みを行うように構成し
ても良い。より詳しくは、埋め込み部３７は、符号化部
３４におけるＤＣＴ部７１、量子化部７２によってＤＣ
Ｔと量子化とが施された後であって、エントロピー符号
化部７４によって符号化される前の量子化ＤＣＴ係数ブ
ロックに対して埋め込みを行う。この場合埋め込み部３
７が有するエントロピー復号化部１０５と出力部１０８
とは不要となる。（３）圧縮画像のデータ量、判定値、埋め込み量、量子
化テーブルＱuv等は、実施形態１及び２に示す値に限ら
ない。The embedding units 37 and 47 may also be built into a device capable of image processing, such as a personal computer. (2) In the first embodiment, the embedding unit 37 performs embedding by first returning an already encoded compressed image to quantized DCT coefficient blocks through entropy decoding and then embedding the partially compressed audio data. However, the embedding may instead be performed at the encoding stage in the encoding unit 34. More specifically, the embedding unit 37 embeds into the quantized DCT coefficient block after the DCT unit 71 and the quantization unit 72 of the encoding unit 34 have applied the DCT and quantization and before the entropy encoding unit 74 encodes it. In this case, the entropy decoding unit 105 and the output unit 108 of the embedding unit 37 become unnecessary. (3) The data amount of the compressed image, the determination value, the embedding amount, the quantization table Quv, and the like are not limited to the values given in the first and second embodiments.

【００６８】例えば判定値Ｊは３や４でもよい。ただし
２ⁿ（ｎは自然数）を用いるのが望ましい。その理由
は、選択部１０６は、量子化ＤＣＴ係数の下位から（ｎ
＋１）番目以上のビット値に１があるか否かを判定する
ことにより、その係数が２ⁿ以上か否かを簡単に判定す
ることができるからである。また異なる量子化テーブル
を複数有するよう構成し、量子化テーブルに応じて判定
値や埋め込み量を変えても良い。For example, the determination value J may be 3 or 4. However, it is desirable to use 2ⁿ (n is a natural number). The reason is that the selection unit 106 can easily determine whether a coefficient is 2ⁿ or more by checking whether any bit at the (n+1)-th position from the least significant bit, or at a higher position, is 1. A plurality of different quantization tables may also be provided, and the determination value and the embedding amount may be changed according to the quantization table.
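The reason a power-of-two determination value is convenient can be illustrated with a short sketch. The function name `at_least_power_of_two` is hypothetical, and the sketch assumes the comparison is made on the absolute value of the coefficient.

```python
def at_least_power_of_two(coeff, n):
    """Return True when |coeff| >= 2**n.

    Shifting the magnitude right by n discards the n lowest bits, so the
    result is non-zero exactly when some bit at position n+1 or above
    (counting from 1 at the least significant bit) is set.
    """
    return (abs(coeff) >> n) != 0


# The bit test agrees with the arithmetic comparison.
for n in range(1, 6):
    for coeff in range(-64, 65):
        assert at_least_power_of_two(coeff, n) == (abs(coeff) >= 2 ** n)
```

With a determination value such as 3, which is not a power of two, a comparison is needed instead of a single shift.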

【００６９】画像の劣化と埋め込み量との兼ね合いを考
慮して量子化ＤＣＴ係数の下位から１ビット目、２ビッ
ト目、３ビット目への埋め込み量を実施形態２よりも多
くしても良い。例えば量子化テーブルの値を全体的に小
さくして圧縮率を低くすれば、１ビット目に３０ビッ
ト、２ビット目に１６ビット、３ビット目に８ビットと
いうように埋め込み量を多くすることも可能である。（４）エントロピー符号化部７４と出力部１０８、エン
トロピー復号化部１０５とエントロピー復号化部８４、
撮影画像メモリ３３と表示用画像メモリ６１、オーディ
オメモリ４４とオーディオメモリ５４等、デジタルカメ
ラ内部において同じ機能の２つの構成要素は、一方をな
くして１つだけで共用しても良い。（５）実施形態１においてはデジタルカメラ１は、各量
子化ＤＣＴ係数に部分圧縮音声データを一律に埋め込む
構成であった。この構成によれば、判定値Ｊ以上の量子
化ＤＣＴ係数の数が埋め込み量Ｎより少ないブロックに
ついては、判定値Ｊより小さい値の量子化ＤＣＴ係数に
も埋め込みが行われるので、そのブロックは、他のブロ
ックよりも劣化することになるという問題がある。この
ような問題に対して次のような方法で対処してもよい。
すなわち選択部１０６は、まず量子化ＤＣＴ係数毎に、
ＤＣ係数を除く６３個のＡＣ係数中に、その絶対値が判
定値Ｊ以上の値の係数の数を数え上げる。次に選択部１
０６は、数え上げた係数の数が埋め込み量Ｎの値以上で
あるか否かを判定する。判定の結果、埋め込み量Ｎの値
以上である場合には、その量子化ＤＣＴ係数ブロックに
埋め込みを行う、と決定し、ジグザグスキャン順に判定
値Ｊ以上の量子化ＤＣＴ係数ブロックを埋め込み用とし
て選択して埋め込みフラグをセットする。判定の結果、
埋め込み量Ｎの値よりも少ない場合には、その量子化Ｄ
ＣＴ係数ブロックには埋め込みを行わない、と決定し埋
め込みフラグのセットを行わない。埋め込み処理部１０
７は、選択部１０６によってセットされている埋め込み
フラグのあるブロックについてのみ部分圧縮音声データ
の埋め込みを行う。これによって、実施形態１の構成よ
りも埋め込み可能な圧縮音声データのデータ量は少なく
なるが、実施形態１の構成よりも画質を保護できるとい
う効果がある。（６）埋め込み部３７の各構成要素の機能をプログラム
化してＲＯＭに記録し、ＣＰＵ、ＲＡＭ、ＲＯＭからな
るマイクロコンピュータにより実現してもよい。より具
体的には、ＣＰＵはＲＯＭからプログラムを読み出して
実行することにより、量子化ＤＣＴ係数ブロックＲuvの
中から低周波のＡＣ係数と絶対値が判定値２以上のＡＣ
係数とから１６個のＡＣ係数を選択する選択ステップ
と、選択処理によって選択された１６個のＡＣ係数の最
下位ビットを１６ビットの部分圧縮音声データに置き換
える置き換えステップとを行う。選択ステップは図１０
に示すフローチャートをプログラム化したもので、ＣＰ
Ｕは量子化ＤＣＴ係数ＲuvのＡＣ係数についてジグザグ
スキャンし、判定値２以上の量子化ＤＣＴ係数があれば
それに対応する埋め込みフラグをオンする第1選択ステ
ップを行う。この埋め込みフラグはＲＡＭに記憶されて
いる。ＣＰＵは埋め込みフラグをオンにした個数Ｃが埋
め込み量１６に達したとき、または最後の量子化ＤＣＴ
係数までジグザグスキャンしたとき第1選択ステップを
終了する。ＣＰＵはフラグをオンにした個数Ｃが埋め込
み量１６より少ないか否かを判定し、少ない場合にはジ
グザグスキャン順にオフになっている埋め込みフラグ
（１６−Ｃ）個をオンにする第2選択ステップを行う。
ＣＰＵは、埋め込みフラグがオンになっている量子化Ｄ
ＣＴ係数について上記置き換えステップによる置き換え
を行う。In consideration of the balance between image degradation and the embedding amount, the amounts embedded in the first, second, and third bits from the least significant bit of the quantized DCT coefficients may be made larger than in the second embodiment. For example, if the values of the quantization table are reduced overall to lower the compression ratio, the embedding amount can be increased to, for example, 30 bits in the first bit, 16 bits in the second bit, and 8 bits in the third bit.
(4) Pairs of components in the digital camera that have the same function, such as the entropy encoding unit 74 and the output unit 108, the entropy decoding unit 105 and the entropy decoding unit 84, the photographed image memory 33 and the display image memory 61, and the audio memory 44 and the audio memory 54, may be merged so that a single component is shared.
(5) In the first embodiment, the digital camera 1 embeds the partially compressed audio data uniformly in every quantized DCT coefficient block. With this configuration, in a block in which the number of quantized DCT coefficients equal to or greater than the determination value J is smaller than the embedding amount N, embedding is also performed in quantized DCT coefficients smaller than the determination value J, so that block is degraded more than other blocks. This problem may be dealt with as follows. The selection unit 106 first counts, for each quantized DCT coefficient block, how many of the 63 AC coefficients (all coefficients except the DC coefficient) have an absolute value equal to or greater than the determination value J. Next, the selection unit 106 determines whether the count is equal to or greater than the embedding amount N. If it is, the selection unit 106 decides to embed in that quantized DCT coefficient block, selects for embedding, in zigzag scan order, the quantized DCT coefficients equal to or greater than the determination value J, and sets their embedding flags. If the count is smaller than the embedding amount N, the selection unit 106 decides not to embed in that quantized DCT coefficient block and sets no embedding flags. The embedding processing unit 107 embeds the partially compressed audio data only in blocks for which the selection unit 106 has set embedding flags. As a result, less compressed audio data can be embedded than with the configuration of the first embodiment, but the image quality is better protected.
(6) The functions of the components of the embedding unit 37 may be implemented as a program recorded in a ROM and realized by a microcomputer including a CPU, a RAM, and the ROM. More specifically, by reading the program from the ROM and executing it, the CPU performs a selection step of selecting 16 AC coefficients from the quantized DCT coefficient block Ruv, from among the low-frequency AC coefficients and the AC coefficients whose absolute value is equal to or greater than the determination value 2, and a replacement step of replacing the least significant bits of the 16 selected AC coefficients with 16 bits of partially compressed audio data. The selection step is a programmed version of the flowchart shown in FIG. 10. The CPU scans the AC coefficients of the quantized DCT coefficient block Ruv in zigzag order and performs a first selection step of turning on the embedding flag corresponding to each quantized DCT coefficient that is equal to or greater than the determination value 2. The embedding flags are stored in the RAM. The CPU ends the first selection step when the number C of embedding flags turned on reaches the embedding amount 16 or when the zigzag scan reaches the last quantized DCT coefficient. The CPU then determines whether the number C is smaller than the embedding amount 16 and, if so, performs a second selection step of turning on (16 - C) embedding flags that are still off, in zigzag scan order. Finally, the CPU performs the replacement step on the quantized DCT coefficients whose embedding flags are on.
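The selection and replacement steps of modification (6), together with the block-skipping variant of modification (5), can be sketched as follows. This is an illustration under stated assumptions rather than the program of the patent: the function names, the sample coefficients, and the sign-and-magnitude handling of the least significant bit are hypothetical, and in the variant of (5) the sketch assumes that exactly N qualifying coefficients are flagged per block.

```python
def select_two_step(ac, threshold=2, amount=16):
    """Return embedding flags for the AC coefficients (zigzag order)."""
    flags = [0] * len(ac)
    count = 0
    # First selection step: coefficients with |value| >= threshold.
    for i, coeff in enumerate(ac):
        if count == amount:
            break
        if abs(coeff) >= threshold:
            flags[i] = 1
            count += 1
    # Second selection step: fill up with the lowest-frequency flags still off.
    for i in range(len(ac)):
        if count == amount:
            break
        if not flags[i]:
            flags[i] = 1
            count += 1
    return flags


def select_or_skip(ac, threshold=2, amount=16):
    """Variant of (5): flag a block only if it has enough large coefficients."""
    large = [i for i, coeff in enumerate(ac) if abs(coeff) >= threshold]
    flags = [0] * len(ac)
    if len(large) >= amount:
        for i in large[:amount]:  # assumption: flag exactly `amount` of them
            flags[i] = 1
    return flags


def replace_lsb(ac, flags, bits):
    """Replace the least significant magnitude bit of each flagged coefficient."""
    out = list(ac)
    source = iter(bits)
    for i, flagged in enumerate(flags):
        if flagged:
            sign = -1 if out[i] < 0 else 1
            out[i] = sign * ((abs(out[i]) & ~1) | next(source))
    return out


# Hypothetical block: 63 AC coefficients in zigzag order.
ac = [5, -3, 0, 2, 1, 0, -4] + [0] * 56
flags = select_two_step(ac)
embedded = replace_lsb(ac, flags, [0, 1] * 8)
print(sum(flags), embedded[:8])  # 16 [4, -3, 0, 3, 0, 1, -4, 1]
```

In this sample only four coefficients reach the determination value, so the second selection step turns on twelve more flags at the low-frequency end, while `select_or_skip` would leave the whole block untouched.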

【００７０】[0070]

【発明の効果】本発明の埋め込み装置は、画像に離散コ
サイン（ＤＣＴ）変換と量子化とを施すことにより生成
される量子化ＤＣＴ係数ブロック中、低周波のＡＣ係数
と絶対値が第１しきい値以上のＡＣ係数とから所定個の
ＡＣ係数を選択する選択手段と、前記選択手段により選
択された所定個のＡＣ係数の最下位ビットを音声データ
に置き換える置換え手段とを備える。The embedding apparatus of the present invention comprises: selecting means for selecting a predetermined number of AC coefficients from among low-frequency AC coefficients and AC coefficients whose absolute value is equal to or greater than a first threshold value in a quantized DCT coefficient block generated by applying discrete cosine transform (DCT) and quantization to an image; and replacing means for replacing the least significant bits of the predetermined number of AC coefficients selected by the selecting means with audio data.

【００７１】この構成によれば埋め込み装置は、最下位
ビットの値を変化させた場合に画像の劣化がより少なく
なるようなＡＣ係数を選択して音声データを埋めこむの
で画像の劣化が少なくなるという効果がある。ここで画
像の劣化が少なくなるのは、以下の理由による。まず低
周波のＡＣ係数について説明すると、低周波のＡＣ係数
は、通常、高周波のＡＣ係数に比べて小さい量子化レベ
ルで量子化されるという特徴がある。これは人間の視覚
特性が高周波成分に鈍感で低周波成分に敏感であるた
め、高周波成分を粗く量子化するで圧縮率を高めている
からである。ＡＣ係数は、復号の際、量子化レベルと同
じ値で逆量子化されるので、低周波のＡＣ係数は小さい
レベルで逆量子化され、高周波のＡＣ係数は大きいレベ
ルで逆量子化されることとなる。よって、同じように最
下位ビットを０から１に置き換えたとしても低周波の方
が高周波よりも逆量子化後の誤差が小さく、つまり画質
の劣化が少ない。With this configuration, the embedding apparatus embeds the audio data in AC coefficients chosen so that changing the least significant bit degrades the image less, and image degradation is therefore reduced. The degradation is reduced for the following reasons. First, low-frequency AC coefficients are usually quantized with a smaller quantization level than high-frequency AC coefficients. This is because human vision is insensitive to high-frequency components and sensitive to low-frequency components, so the compression ratio is raised by quantizing the high-frequency components coarsely. On decoding, each AC coefficient is inversely quantized with the same value as its quantization level, so low-frequency AC coefficients are inversely quantized with a small level and high-frequency AC coefficients with a large level. Therefore, for the same change of the least significant bit from 0 to 1, the error after inverse quantization is smaller at low frequencies than at high frequencies, that is, the image quality is degraded less.
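The relation between the quantization level and the error after inverse quantization can be checked with a short sketch. The function names and the step sizes 11 and 99 are illustrative assumptions and are not taken from the quantization table of the embodiments. The sketch also assumes a non-negative quantized coefficient.

```python
def dequantize(quantized, step):
    """Inverse quantization: multiply by the quantization step."""
    return quantized * step


def lsb_flip_error(quantized, step):
    """Error after inverse quantization when the least significant bit flips."""
    flipped = quantized ^ 1  # assumes quantized >= 0
    return abs(dequantize(flipped, step) - dequantize(quantized, step))


# Illustrative step sizes: a small one (low frequency), a large one (high frequency).
low_frequency_error = lsb_flip_error(6, 11)
high_frequency_error = lsb_flip_error(6, 99)
print(low_frequency_error, high_frequency_error)  # 11 99
```

The error equals the quantization step itself, which is why the same one-bit change costs less at a position with a small step.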

【００７２】また絶対値がしきい値以上のＡＣ係数につ
いて説明すると、絶対値の大きいＡＣ係数は、絶対値が
小さいＡＣ係数と比較して、最下位ビットが変化した場
合の変化の割合が小さい。例えば１６のＡＣ係数の最下
位ビットが１７になった場合と、０のＡＣ係数が１にな
った場合とを比べると、１６から１７に変化した方が変
化の割合が小さい。また絶対値が大きいＡＣ係数は、低
周波であることが多く（なぜなら低周波の方が小さい量
子化レベルで量子化されているので値が大きい場合が多
いからである）、その分逆量子化後の誤差が小さい。こ
れらから絶対値がしきい値以上のＡＣ係数は、しきい値
より小さいＡＣ係数に比べて画像の劣化が少ない。Next, regarding AC coefficients whose absolute value is equal to or greater than the threshold value: when the least significant bit changes, the relative change of an AC coefficient with a large absolute value is smaller than that of an AC coefficient with a small absolute value. For example, comparing an AC coefficient of 16 that becomes 17 through a change in its least significant bit with an AC coefficient of 0 that becomes 1, the relative change is smaller for the change from 16 to 17. In addition, AC coefficients with a large absolute value are often low-frequency coefficients (because low frequencies are quantized with a smaller quantization level, their values are often large), and the error after inverse quantization is correspondingly small. For these reasons, AC coefficients whose absolute value is equal to or greater than the threshold value cause less image degradation than AC coefficients below the threshold value.

【００７３】また前記選択手段は、絶対値が第１しきい
値以上のＡＣ係数を選択する第１選択部と、第１選択手
段によって選択されたＡＣ係数の個数が所定個に満たな
い場合には、第１しきい値未満であってより低周波のＡ
Ｃ係数から順に所定個になるまでＡＣ係数を選択する第
２選択部とを備える。この構成によれば埋め込み装置
は、絶対値が大きいＡＣ係数を低周波のＡＣ係数よりも
優先させて選択することによって置換えを行った場合の
画質の劣化を少なくしている。これは絶対値が大きいＡ
Ｃ係数の方が低周波のＡＣ係数よりも、最下位ビットを
置き換えた場合の画質の劣化が少ないからである。The selecting means further includes a first selection unit that selects AC coefficients whose absolute value is equal to or greater than the first threshold value, and a second selection unit that, when the number of AC coefficients selected by the first selection unit is less than the predetermined number, selects AC coefficients below the first threshold value in order from the lowest frequency until the predetermined number is reached. With this configuration, the embedding apparatus selects AC coefficients with a large absolute value in preference to low-frequency AC coefficients and thereby reduces the degradation of image quality caused by the replacement. This is because replacing the least significant bit of an AC coefficient with a large absolute value degrades the image quality less than replacing that of a low-frequency AC coefficient.

【００７４】絶対値がしきい値未満の低周波のＡＣ係数
は、逆量子化レベルが小さいという１つの要因によって
最下位ビットを変化させた場合の誤差を小さくしてい
る。一方、絶対値がしきい値以上のＡＣ係数は、比較的
低周波側に分布するので、逆量子化レベルが小さい。こ
れに加えて絶対値がしきい値以上のＡＣ係数は、絶対値
がしきい値より小さいＡＣ係数に比べて最下位ビットが
変化した場合の変化率が小さい。このように絶対値がし
きい値以上のＡＣ係数は逆量子化レベルが小さいことと
最下位ビットが変化した場合の変化率が小さいこととの
２つの要因から誤差を小さくしている。このことから埋
め込み装置は、絶対値が大きいＡＣ係数を低周波のＡＣ
係数より優先的に選択することによって、より画質の劣
化を低減している。For a low-frequency AC coefficient whose absolute value is less than the threshold value, the error caused by changing the least significant bit is kept small by one factor only: the small inverse quantization level. AC coefficients whose absolute value is equal to or greater than the threshold value, on the other hand, are distributed toward relatively low frequencies, so their inverse quantization level is also small. In addition, their relative change when the least significant bit changes is smaller than that of AC coefficients whose absolute value is below the threshold value. Thus, for AC coefficients whose absolute value is equal to or greater than the threshold value, two factors keep the error small: the small inverse quantization level and the small relative change when the least significant bit changes. For this reason, the embedding apparatus further reduces the degradation of image quality by selecting AC coefficients with a large absolute value in preference to low-frequency AC coefficients.

【００７５】前記選択手段は、さらに、絶対値が第１し
きい値より大きい第２しきい値以上のＡＣ係数を選択す
る第３選択部を備え、前記置換え手段は、さらに前記第
３選択部により選択されたＡＣ係数の最下位から２ビッ
ト目に音声データを埋め込むよう構成される。この構成
によれば埋め込み装置は、最下位ビットに音声データを
埋め込むのに加えて、最下位から２ビット目にも音声デ
ータを埋め込むので、より多くの音声データを埋め込む
ことができるという効果がある。また絶対値が第２しき
い値以上のＡＣ係数、つまり最下位から２ビット目が変
化した場合の変化率の小さいＡＣ係数を埋め込み用とし
て選択するので、本埋め込み装置は、画像の劣化が少な
く、より多くの音声データを埋め込むことができる。The selecting means further includes a third selection unit that selects AC coefficients whose absolute value is equal to or greater than a second threshold value larger than the first threshold value, and the replacing means is further configured to embed audio data in the second least significant bit of the AC coefficients selected by the third selection unit. With this configuration, the embedding apparatus embeds audio data in the second least significant bit in addition to the least significant bit, so more audio data can be embedded. Moreover, because the AC coefficients selected for this embedding have an absolute value equal to or greater than the second threshold value, that is, their relative change is small when the second least significant bit changes, the embedding apparatus can embed more audio data with little image degradation.

【００７６】本発明のデジタルカメラは、圧縮画像に数
秒間の音声データに相当する圧縮音声データを埋め込む
デジタルカメラであって、圧縮画像から離散コサイン変
換と量子化とが施された量子化ＤＣＴ係数ブロックを得
る獲得手段と、前記圧縮音声データを分割して所定ビッ
トの部分圧縮音声データにする分割手段と、獲得される
量子化ＤＣＴ係数ブロック中、低周波のＡＣ係数と絶対
値が第１しきい値以上のＡＣ係数とから前記所定個のＡ
Ｃ係数を選択する選択手段と、前記選択手段により選択
された前記所定個のＡＣ係数の最下位ビットを前記部分
圧縮音声データに置き換える置換え手段とを備える。The digital camera of the present invention is a digital camera that embeds, in a compressed image, compressed audio data corresponding to several seconds of audio data, and comprises: acquisition means for obtaining, from the compressed image, a quantized DCT coefficient block to which discrete cosine transform and quantization have been applied; dividing means for dividing the compressed audio data into partially compressed audio data of a predetermined number of bits; selecting means for selecting the predetermined number of AC coefficients from among low-frequency AC coefficients and AC coefficients whose absolute value is equal to or greater than a first threshold value in the obtained quantized DCT coefficient block; and replacing means for replacing the least significant bits of the predetermined number of AC coefficients selected by the selecting means with the partially compressed audio data.

【００７７】この構成によればデジタルカメラは、画像
の劣化が少なくて済む所定個のＡＣ係数に所定ビットの
部分圧縮音声データを埋め込むという操作を各量子化Ｄ
ＣＴ係数ブロックに対して行うので、全体として所定ビ
ット×総ブロック数という多くの音声データを画像の劣
化少なく埋め込むことができる。With this configuration, the digital camera embeds partially compressed audio data of the predetermined number of bits in the predetermined number of AC coefficients that cause little image degradation, and does so for every quantized DCT coefficient block. As a whole, it can therefore embed a large amount of audio data, equal to the predetermined number of bits multiplied by the total number of blocks, with little degradation of the image.
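The capacity relation stated above (predetermined bits multiplied by the total number of blocks) can be worked through in a short sketch. The function name and the 640 x 480 image size are hypothetical and are not taken from the embodiments. The sketch assumes 8 x 8 blocks of a single component and 16 bits per block.

```python
def embedding_capacity_bytes(width, height, bits_per_block=16, block_size=8):
    """Capacity = bits per block x number of whole blocks, expressed in bytes."""
    blocks = (width // block_size) * (height // block_size)
    return blocks * bits_per_block // 8


# Hypothetical 640 x 480 component: 80 x 60 = 4800 blocks of 8 x 8 pixels.
capacity = embedding_capacity_bytes(640, 480)
print(capacity)  # 9600
```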

[Brief description of the drawings]

【図１】デジタルカメラ１の正面側の外観図である。FIG. 1 is an external view of the front side of the digital camera 1.

【図２】デジタルカメラ１の背面側の外観図である。FIG. 2 is an external view of the back side of the digital camera 1.

【図３】デジタルカメラ１の概略構成図である。FIG. 3 is a schematic configuration diagram of the digital camera 1.

【図４】図３の詳細構成図である。FIG. 4 is a detailed configuration diagram of FIG. 3.

【図５】符号化部３４のより詳細な構成図である。FIG. 5 is a more detailed configuration diagram of an encoding unit 34.

【図６】復号化部６２のより詳細な構成図である。FIG. 6 is a more detailed configuration diagram of a decoding unit 62.

【図７】１画面分の輝度成分Ｙとブロックとの関係を示
す。FIG. 7 shows a relationship between a luminance component Y for one screen and blocks.

【図８】（ａ）輝度成分Ｙの１ブロック分の画素の具体
例Ｙxy（x,y＝０〜７；x,yはブロック中の画素位置を表
わす）を示す。（ｂ）Ｙxyに対してＤＣＴを行うことにより得られるＤ
ＣＴ係数ブロックSuvを示す。（ｃ）量子化テーブルＱuvの具体例を示す。（ｄ）図８（ｂ）に示したＤＣＴ係数ブロックＳuvを図
８（ｃ）の量子化テーブルＱuvで量子化した場合の量子
化ＤＣＴ係数ブロックＲuvを示す。FIG. 8 (a) shows a specific example Yxy (x, y = 0 to 7; x and y denote the pixel position within the block) of the pixels of one block of the luminance component Y. (b) shows the DCT coefficient block Suv obtained by performing the DCT on Yxy. (c) shows a specific example of the quantization table Quv. (d) shows the quantized DCT coefficient block Ruv obtained when the DCT coefficient block Suv shown in FIG. 8(b) is quantized with the quantization table Quv of FIG. 8(c).

【図９】埋め込み部３７の詳細な構成図である。FIG. 9 is a detailed configuration diagram of the embedding unit 37.

【図１０】選択部１０６による、選択処理を示すフロー
チャートである。FIG. 10 is a flowchart showing the selection process performed by the selection unit 106.

【図１１】（ａ）図８（ｄ）の量子化ＤＣＴ係数ブロッ
クＲuvの中から埋め込み用として選択される量子化ＤＣ
Ｔ係数を丸印で囲って示す。（ｂ）選択部１０６が同図（ａ）について選択処理を行
った場合の選択結果の埋め込みフラグＥuvを示す。FIG. 11 (a) shows, circled, the quantized DCT coefficients selected for embedding from the quantized DCT coefficient block Ruv of FIG. 8(d). (b) shows the embedding flags Euv that result when the selection unit 106 performs the selection process on (a).

【図１２】（ａ）選択部１０６が図８（ｄ）とは別の量
子化ＤＣＴ係数ブロックの中から埋め込み用として選択
される量子化ＤＣＴ係数を丸印で囲って示す。（ｂ）（ａ）に対応する埋め込みフラグを示す。（ｃ）選択部１０６の選択処理による最終的な選択結果
の埋め込みフラグを示す。FIG. 12 (a) shows, circled, the quantized DCT coefficients that the selection unit 106 selects for embedding from a quantized DCT coefficient block different from that of FIG. 8(d). (b) shows the embedding flags corresponding to (a). (c) shows the embedding flags of the final selection result of the selection process performed by the selection unit 106.

【図１３】（ａ）部分圧縮音声データの一例を示す。（ｂ）埋め込み処理部１０７が同図（ａ）の部分圧縮音
声データを図１２（ａ）の量子化ＤＣＴ係数ブロックに
埋め込んだ場合の結果を示す。FIG. 13 (a) shows an example of partially compressed audio data. (b) shows the result when the embedding processing unit 107 embeds the partially compressed audio data of (a) in the quantized DCT coefficient block of FIG. 12(a).

【図１４】抽出部８３の詳細な構成図である。FIG. 14 is a detailed configuration diagram of an extraction unit 83.

【図１５】埋め込み部４７の詳細構成図である。FIG. 15 is a detailed configuration diagram of the embedding unit 47.

【図１６】選択部２０６が埋め込みを行う量子化ＤＣＴ
係数の数と埋め込みを行うビット位置とを決定するため
の論理を表わすフローチャートである。FIG. 16 is a flowchart showing the logic by which the selection unit 206 determines the number of quantized DCT coefficients in which to embed and the bit positions at which to embed.

【図１７】この決定結果に従ってセットされた埋め込み
フラグＥ２uvを示す。FIG. 17 shows an embedding flag E2uv set according to the result of this determination.

[Explanation of symbols]

３画像符号化部３５符号用メモリ３６メモリカード入出力部４音声符号化部３７埋め込み部５音声復号化部６画像復号化部８３抽出部３１撮像部３３撮影画像メモリ３４符号化部７１ＤＣＴ部７２量子化部７４エントロピー符号化部８４エントロピー復号化部８２逆量子化部８１逆ＤＣＴ部１０１圧縮画像入力部１０２判定値入力部１０３埋め込み量入力部１０４圧縮音声入力部１０５エントロピー復号化部１０６選択部１０７埋め込み処理部１０８出力部８３１識別部８３２抽出処理部
Reference Signs List
3 image encoding unit
35 encoding memory
36 memory card input/output unit
4 audio encoding unit
37 embedding unit
5 audio decoding unit
6 image decoding unit
83 extraction unit
31 imaging unit
33 photographed image memory
34 encoding unit
71 DCT unit
72 quantization unit
74 entropy encoding unit
84 entropy decoding unit
82 inverse quantization unit
81 inverse DCT unit
101 compressed image input unit
102 determination value input unit
103 embedding amount input unit
104 compressed audio input unit
105 entropy decoding unit
106 selection unit
107 embedding processing unit
108 output unit
831 identification unit
832 extraction processing unit

Continued from the front page
(51) Int.Cl.⁷: H04N 7/30
FI: H04N 7/133 Z
F-terms (reference):
5C022 AA13 AB00 AC01 AC32 AC42 AC71 AC72
5C053 FA07 GB07 GB11 GB22 GB34 GB36 HA33 JA03 JA07 JA12 KA04 KA05 LA01 LA06
5C059 KK02 KK08 LA01 MA00 MA23 MC04 MC14 MC23 MC26 MC32 MC34 PP01 RB06 RC32 RC38 SS15 TA36 TB07 TC04 TC06 TD12 UA02 UA05 UA38
5J064 AA01 BA16 BB11 BC25 BD03

Claims

[Claims]

1. An embedding apparatus comprising: selecting means for selecting a predetermined number of AC coefficients from among low-frequency AC coefficients and AC coefficients whose absolute value is equal to or greater than a first threshold value in a quantized DCT coefficient block generated by applying discrete cosine transform (DCT) and quantization to an image; and replacing means for replacing the least significant bits of the predetermined number of AC coefficients selected by the selecting means with audio data.

2. The embedding apparatus according to claim 1, wherein the selecting means comprises: a first selection unit that selects AC coefficients whose absolute value is equal to or greater than the first threshold value; and a second selection unit that, when the number of AC coefficients selected by the first selection unit is less than the predetermined number, selects AC coefficients that are less than the first threshold value, in order from the lowest frequency, until the predetermined number is reached.

3. The embedding apparatus according to claim 2, wherein the selecting means further comprises a third selection unit that selects AC coefficients whose absolute value is equal to or greater than a second threshold value larger than the first threshold value, and the replacing means further embeds audio data in the second least significant bit of the AC coefficients selected by the third selection unit.

4. An embedding method comprising: a selection step of selecting a predetermined number of AC coefficients from among low-frequency AC coefficients and AC coefficients whose absolute value is equal to or greater than a first threshold value in a quantized DCT coefficient block generated by applying discrete cosine transform (DCT) and quantization to an image; and a replacement step of replacing the least significant bits of the predetermined number of AC coefficients selected in the selection step with audio data.

5. The embedding method according to claim 4, wherein the selection step includes: a first selection step of selecting AC coefficients whose absolute value is equal to or greater than the first threshold value; and a second selection step of, when the number of AC coefficients selected in the first selection step is less than the predetermined number, selecting AC coefficients that are less than the first threshold value, in order from the lowest frequency, until the predetermined number is reached.

6. A digital camera that embeds, in a compressed image, compressed audio data corresponding to several seconds of audio data, comprising: acquisition means for obtaining, from the compressed image, a quantized DCT coefficient block to which discrete cosine transform and quantization have been applied; dividing means for dividing the compressed audio data into partially compressed audio data of a predetermined number of bits; selecting means for selecting the predetermined number of AC coefficients from among low-frequency AC coefficients and AC coefficients whose absolute value is equal to or greater than a first threshold value in the obtained quantized DCT coefficient block; and replacing means for replacing the least significant bits of the predetermined number of AC coefficients selected by the selecting means with the partially compressed audio data.

7. A computer-readable recording medium on which is recorded a program for causing a computer to execute processing for embedding audio data in an image, the program causing the computer to execute: a selection step of selecting a predetermined number of AC coefficients from among low-frequency AC coefficients and AC coefficients whose absolute value is equal to or greater than a first threshold value in a quantized DCT coefficient block generated by applying discrete cosine transform (DCT) and quantization to the image; and a replacement step of replacing the least significant bits of the predetermined number of AC coefficients selected in the selection step with audio data.

8. The recording medium according to claim 7, wherein the selection step includes: a first selection step of selecting AC coefficients whose absolute value is equal to or greater than the first threshold value; and a second selection step of, when the number of AC coefficients selected in the first selection step is less than the predetermined number, selecting AC coefficients that are less than the first threshold value, in order from the lowest frequency, until the predetermined number is reached.