JPH0998418A

JPH0998418A - Method and system for encoding and decoding picture

Info

Publication number: JPH0998418A
Application number: JP7254361A
Authority: JP
Inventors: Masaomi Nakajima; 正臣中嶋; Shunichiro Nonaka; 俊一郎野中; Yasuo Sanbe; 靖夫三部; Taichi Nakamura; 太一中村
Original assignee: N T T DATA TSUSHIN KK; NTT Data Communications Systems Corp
Current assignee: N T T DATA TSUSHIN KK; NTT Data Corp
Priority date: 1995-10-02
Filing date: 1995-10-02
Publication date: 1997-04-08

Abstract

PROBLEM TO BE SOLVED: To provide an image encoding/decoding method for preferentially handling an observation area while maintaining consistency with an international standard. SOLUTION: A position parameter for correcting a quantization scale code corresponding to observation density distribution in pictures is stored in a table 25. A contour extraction part divides an image supplied from a camera 11 into fine blocks and discriminates whether or not the contour of the image which is an observation part is present inside the respective blocks. A quantization part 22 corrects the quantization scale code based on the position parameter read from the table 25 and the presence/absence of the contour discriminated by the contour extraction part 24 corresponding to the position of the image to be compressed and quantizes the observation area by a small quantization scale code and a non-observation area by a large quantization scale code. Thereafter, a variable length encoding processing is executed and encoded data are generated. The quantization scale code used at the time of quantization is used for a decoding processing.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】この発明は画像符号化・復号
化技術に関し、特に、低ビットレートで高品質の画像を
復号化できる画像符号化・復号化技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image coding / decoding technique, and more particularly to an image coding / decoding technique capable of decoding a high quality image at a low bit rate.

【０００２】[0002]

【従来の技術】画像符号化（圧縮）・復号化技術は、画
像通信システムにおけるキーテクノロジーとして注目さ
れている。画像符号化（圧縮）・復号化技術において
は、高品質の画像の復号化を可能としつつ符号化率を高
めなければならないという課題が存在する。この課題を
解決する手法として、表示画像の注視領域を優先的に扱
い、画像の中央部から非線形に標本化（間引き）を行い
伝送化する技術が提案されている。2. Description of the Related Art Image coding (compression) / decoding techniques have received attention as key technologies in image communication systems. In the image coding (compression) / decoding technology, there is a problem that the coding rate must be increased while enabling the decoding of high quality images. As a method for solving this problem, a technique has been proposed in which a gaze area of a display image is preferentially treated, and non-linear sampling (decimation) is performed from the central portion of the image for transmission.

【０００３】[0003]

【発明が解決しようとする課題】しかし、画面の中央か
ら非線形に標本化して伝送する方法では、間引かれた領
域を復号側で補間する必要があり、制御が複雑になる。
また、画面の中央部から伝送するシーケンスとなり、動
画符号化・復号化方法の国際標準であるＭＰＥＧ方式の
シーケンスとの隔たりが大きく、既存製品との整合性を
図ることができない。However, in the method of nonlinearly sampling and transmitting from the center of the screen, it is necessary to interpolate the thinned region on the decoding side, which complicates control.
Further, since the sequence is transmitted from the center of the screen, there is a large gap from the sequence of the MPEG system, which is an international standard for moving picture encoding / decoding methods, and it is not possible to achieve consistency with existing products.

【０００４】この発明は、上記実状に鑑みてなされたも
ので、国際標準規格との整合性を維持しつつ注視領域を
優先的に扱う画像符号化・復号化方法及びシステムを提
供することを目的とする。また、この発明は、同一のビ
ットレートならばより高品質の画像を復号化できる画像
符号化・復号化方法及びシステムを提供することを目的
とする。The present invention has been made in view of the above situation, and an object thereof is to provide an image encoding / decoding method and system which preferentially treats a gaze area while maintaining consistency with an international standard. And It is another object of the present invention to provide an image coding / decoding method and system capable of decoding higher quality images at the same bit rate.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するた
め、この発明の第１の観点にかかる画像符号化システム
は、画像を入力する入力手段と、前記入力手段により入
力された画像の内の注視される領域を判別する判別手段
と、前記判別手段により判別された注視領域について第
１の量子化スケールコードを用い、非注視領域について
前記第１の量子化スケールコードよりも大きい第２の量
子化スケールコードを用いて前記画像を量子化する量子
化手段と、より構成されることを特徴とする。In order to achieve the above object, the image coding system according to the first aspect of the present invention includes an input unit for inputting an image and an image input by the input unit. A discriminator that discriminates a region to be gazed, and a first quantization scale code is used for the gaze region discriminated by the discriminator, and a second quantum that is larger than the first quantization scale code is used for a non-gaze region. It is characterized by comprising a quantizing means for quantizing the image using a quantization scale code.

【０００６】また、この発明の第２の観点にかかる画像
符号化システムは、量子化対象画像の微小領域毎に、注
視点の空間的な分布密度に基づく量子化スケールコード
の第１の補正量を設定する手段と、量子化対象画像の微
小領域毎に、注視時間に基づく量子化スケールコードの
第２の補正量を設定する手段と、量子化対象画像の量子
化スケールコードを前記第１の補正量と第２の補正量に
もとづいて補正し、補正された量子化スケールコードを
用いて量子化対象画像を量子化する量子化手段と、より
構成されることを特徴とする。Further, in the image coding system according to the second aspect of the present invention, the first correction amount of the quantization scale code based on the spatial distribution density of the gazing point is set for each minute region of the quantization target image. For setting the second correction amount of the quantization scale code based on the gaze time for each minute area of the quantization target image, and the quantization scale code of the quantization target image for the first area. It is characterized by comprising a quantizing means for performing a correction based on the correction amount and the second correction amount, and for quantizing the quantization target image using the corrected quantization scale code.

【０００７】また、この発明の第３の観点にかかる画像
符号化方法は、符号化対象画像上の注視点の空間分布密
度を判別し、符号化対象画像上の注視時間の分布を判別
し、判別された空間分布密度と注視時間の分布に基づい
て、符号化対象画像の注視度の高い領域を細かく量子化
し、符号化対象画像の注視度の低い領域を粗く量子化す
る、ことを特徴とする。In the image coding method according to the third aspect of the present invention, the spatial distribution density of the gazing points on the image to be coded is discriminated, and the distribution of the gaze time on the image to be coded is discriminated. Based on the determined spatial distribution density and the distribution of the gaze time, finely quantize a region with a high degree of gaze in the target image for encoding, and roughly quantize a region with a low degree of gaze in the target image for encoding. To do.

【０００８】注視点は、空間的には画面中央部を中心と
するほぼ正規分布に近い分布になる。また、画像の複雑
な部分や認識の困難な部分の注視時間は長くなる。これ
らの空間的及び時間的注視度から、注視度の高い部分を
小さい量子化スケールコードを用いて細かく量子化し、
注視度の低い部分を大きい量子化スケールコードを用い
て粗く量子化することにより、低ビットレートで従来と
実質的に同一品質の画像を符号化・復号化することがで
き、同一ビットレートならば従来より実質的に高品質の
画像を符号化・復号化することができる。The gazing point spatially has a distribution close to a normal distribution centered on the center of the screen. In addition, the gaze time of a complicated portion of an image or a portion that is difficult to recognize becomes long. From these spatial and temporal gaze degrees, the high gaze degree is finely quantized using a small quantization scale code,
By coarsely quantizing the low attention part using a large quantization scale code, it is possible to encode / decode an image of substantially the same quality as a conventional one at a low bit rate. It is possible to encode / decode an image of substantially higher quality than ever before.

【０００９】[0009]

【発明の実施の形態】以下、この発明を動画の符号化・
復号化技術の国際標準であるＭＰＥＧ−２のテストモデ
ルであるＴＭ５に適用した符号化・復号化システムの実
施の形態を図１〜図３を参照して説明する。なお、ＴＭ
５の詳細は、Test Model Editing Commitee: "Test Mod
el 5", ISO/IEC JTC1/SC29/WG11 N0400(Apr.1993)に定
義されている。BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, the present invention will be described with reference to FIG.
An embodiment of an encoding / decoding system applied to TM5 which is a test model of MPEG-2 which is an international standard of decoding technology will be described with reference to FIGS. In addition, TM
For details of 5, refer to Test Model Editing Commitee: "Test Mod
el 5 ", defined in ISO / IEC JTC1 / SC29 / WG11 N0400 (Apr.1993).

【００１０】この符号化・復号化システムは、図１に示
すように、カメラ１１と、符号化部１２と、復号化部１
３と、ディスプレイ１４とより構成されている。As shown in FIG. 1, this encoding / decoding system includes a camera 11, an encoding unit 12, and a decoding unit 1.
3 and a display 14.

【００１１】カメラ１１は、入力光量に対応した画像信
号を出力する。符号化部１２は、変換部２１と量子化部
２２と可変長符号化部２３と輪郭抽出部２４と位置パラ
メータテーブル２５を備える。この構成は、ＴＭ５方式
の符号化部に輪郭抽出部２４と位置パラメータテーブル
２５を追加した構成に相当する。The camera 11 outputs an image signal corresponding to the amount of input light. The coding unit 12 includes a conversion unit 21, a quantization unit 22, a variable length coding unit 23, a contour extraction unit 24, and a position parameter table 25. This configuration corresponds to a configuration in which the contour extraction unit 24 and the position parameter table 25 are added to the TM5 encoding unit.

【００１２】変換部２１は、カメラ１１から供給された
画像データに対し動き補償予測（必要に応じて）やＤＣ
Ｔ（離散コサイン変換）等の変換処理を施し、量子化対
象の画像データに変換する。量子化部２２は、輪郭抽出
部２４から供給される画像の輪郭部分を指示するデータ
と位置パラメータテーブル２５から供給される位置パラ
メータを用いて、量子化スケールコードを補正し、補正
された量子化スケールコードを用いてＭＰＥＧ−２の適
応量子化処理を実行する。量子化スケールコードを補正
する手法については後述する。可変長符号化部２３は、
量子化部２２により量子化された画像データに可変長符
号化処理を施し、被符号化データを得る。このようにし
て得られた被符号化データは伝送路を介して送信され、
或いは、蓄積装置に蓄積される。The conversion unit 21 performs motion compensation prediction (if necessary) and DC on the image data supplied from the camera 11.
A conversion process such as T (Discrete Cosine Transform) is performed to convert the image data to be quantized. The quantization unit 22 corrects the quantization scale code using the data indicating the contour portion of the image supplied from the contour extraction unit 24 and the position parameter supplied from the position parameter table 25, and the corrected quantization is performed. An adaptive quantization process of MPEG-2 is executed using the scale code. A method of correcting the quantization scale code will be described later. The variable length coding unit 23
The image data quantized by the quantization unit 22 is subjected to variable length coding processing to obtain coded data. The encoded data thus obtained is transmitted via a transmission line,
Alternatively, it is stored in the storage device.

【００１３】輪郭抽出部２４は、フレームメモリを備
え、入力された少なくとも１画面分の画像を記憶し、画
像中の輪郭の位置を判別し、量子化部２２に供給する。
輪郭の判別手法については後述する。位置パラメータテ
ーブル２５は、画像の位置毎に、量子化スケールコード
を補正するための補正値を記憶し、量子化部２２に供給
する。補正値の設定手法については後述する。The contour extraction unit 24 has a frame memory, stores at least one input image, determines the position of the contour in the image, and supplies it to the quantization unit 22.
A method of discriminating the contour will be described later. The position parameter table 25 stores a correction value for correcting the quantization scale code for each position of the image, and supplies it to the quantization unit 22. The method of setting the correction value will be described later.

【００１４】復号化部１３は、受信した又は蓄積装置か
ら読み出した被符号化データを復号化するための回路で
あり、従来の復号化部と実質的に同一の構成を有し、可
変長復号化部３１と、逆量子化部３２と、逆変換部３３
とを備える。The decoding unit 13 is a circuit for decoding the encoded data received or read from the storage device, has the substantially same configuration as the conventional decoding unit, and has a variable length decoding. Conversion section 31, inverse quantization section 32, and inverse conversion section 33
With.

【００１５】可変長復号化部３１は、被符号化データに
可変長符号化処理を施し、量子化された画像データ及び
量子化スケールコードを復号する。逆量子化部３２は、
復号された量子化スケールコードを判別し、該量子化ス
ケールコードを用いて受信画像を逆量子化する。逆変換
部３３は、逆量子化された画像データに対し、変換部２
１が施した変換の逆変換、即ち、逆ＤＣＴ変換処理を行
と動き補償予測の逆の予測（動き補償予測が行われた場
合）を行う。The variable length decoding unit 31 performs variable length coding processing on the coded data and decodes the quantized image data and the quantized scale code. The inverse quantizer 32
The decoded quantized scale code is discriminated, and the received image is dequantized using the quantized scale code. The inverse transforming unit 33 transforms the inversely quantized image data into the transforming unit 2
The inverse transform of the transform performed by 1 is performed, that is, the inverse DCT transform process is performed and the reverse prediction of the motion compensation prediction (when the motion compensation prediction is performed) is performed.

【００１６】ディスプレイ（表示装置）１４は、供給さ
れた画像データに対応する階調を表示するＣＲＴから構
成される。The display (display device) 14 is composed of a CRT which displays gradations corresponding to the supplied image data.

【００１７】量子化スケールコードの補正は、空間的な
注視点の分布（分布密度）に基づく補正と時間的な注視
点の分布（注視時間の分布）に基づく補正との２段階の
処理により実行される。空間的な注視点の分布密度は、
図２に網掛部に示すとおり、画面中央部を中心とする正
規分布に近い分布になる。そこで、画面の中央から離れ
るに従って徐々に小さくなる位置パラメータを設定し、
この位置パラメータを量子化スケールコードに乗算して
量子化スケールコードを補正する。これにより、分布密
度が高くなる画面中央部をより細かく量子化し、分布密
度が低くなる画面の端をより粗く量子化する。このよう
にして求められた位置パラメータが、位置パラメータテ
ーブル２５に設定され、量子化時に参照される。The correction of the quantization scale code is executed by a two-step process of correction based on the spatial distribution (distribution density) of the gazing points and correction based on the temporal distribution of the gazing points (gaze time distribution). To be done. The distribution density of spatial gazing points is
As shown by the shaded area in FIG. 2, the distribution is close to the normal distribution centered on the center of the screen. Therefore, set a position parameter that gradually decreases with distance from the center of the screen,
The position parameter is multiplied by the quantization scale code to correct the quantization scale code. As a result, the central portion of the screen where the distribution density is high is quantized more finely, and the edge of the screen where the distribution density is low is quantized more coarsely. The position parameter thus obtained is set in the position parameter table 25 and referred to at the time of quantization.

【００１８】アスペクト比が３：４のＮＴＳＣ方式の画
像を例に位置パラメータを設定する方法を具体的に説明
する。図２に示すように、画面を論理的に縦横１６×１
６ドットの微小ブロック（以下、マイクロブロック）に
分割する。次に、各マイクロブロックに画面の中心から
水平方向に０．６〜１．３、垂直方向に０．８〜１．３
の範囲でそれぞれ等間隔に値を配分する。次に、マイク
ロブロック毎に水平の設定値と垂直方向の設定値を平均
し、これを位置パラメータ（補正値）とする。A method of setting the position parameter will be specifically described by taking an image of the NTSC system having an aspect ratio of 3: 4 as an example. As shown in FIG. 2, the screen is logically 16 × 1 vertically and horizontally.
Divide into 6-dot minute blocks (hereinafter referred to as microblocks). Then, from the center of the screen to each microblock, 0.6 to 1.3 in the horizontal direction and 0.8 to 1.3 in the vertical direction.
Values are evenly distributed in each range. Next, the horizontal setting value and the vertical setting value are averaged for each microblock, and this is set as a position parameter (correction value).

【００１９】従って、例えば、図２の画面の右上端のマ
イクロブロックには、位置パラメータとして１．３（＝
（１．３＋１．３）／２）が割り付けられ、画面中央の
マイクロブロックには０．７（＝（０．６＋０．８）／
２）が割り付けられる。各μブロックに割り付けられた
位置パラメータの値を図２に示す。Therefore, for example, in the micro block at the upper right corner of the screen of FIG. 2, 1.3 (=
(1.3 + 1.3) / 2) is allocated, and 0.7 (= (0.6 + 0.8) / is assigned to the microblock in the center of the screen.
2) is assigned. The values of the positional parameters assigned to each μ block are shown in FIG.

【００２０】このようにして設定された位置パラメータ
を位置パラメータテーブル２５に設定し、量子化時に位
置パラメータテーブル２５より対応する位置パラメータ
を読み出し、量子化スケールコードに乗算し、補正後の
量子化スケールコードを用いて量子化を行う。The position parameter set in this way is set in the position parameter table 25, the corresponding position parameter is read from the position parameter table 25 at the time of quantization, multiplied by the quantization scale code, and the corrected quantization scale. Quantize using code.

【００２１】注視点は空間的な分布と共に時間的な側面
を併せ持つ。情報量が多い部分や、認識するのが困難な
部分に対する注視時間は長くなる。ここでは、そのよう
な要件から注視時間を長くする要因として、画像（オブ
ジェクト）の輪郭を取り上げ、輪郭部分では量子化スケ
ールコードを小さい値に補正し輪郭部分の画像を細かく
量子化し、非輪郭部分では量子化スケールコードを大き
い値に補正し、輪郭部分の画像を粗く量子化する。一般
に、輪郭部分はその周辺部分よりも情報量が多い。従っ
て、輪郭部分を判別し、輪郭部分の量子化スケールコー
ドを小さい値に補正することは、符号化対象画像の各部
の情報量を判別し、周辺部よりも情報量の多い領域の量
子化スケールコードを小さくして細かく量子化するとい
う意味も有する。The gazing point has both a spatial distribution and a temporal aspect. The gaze time for a portion having a large amount of information or a portion that is difficult to recognize becomes long. Here, the contour of the image (object) is taken up as a factor that prolongs the gaze time from such requirements, and the contour scale image is finely quantized by correcting the quantization scale code to a small value in the contour portion, and the non-contour portion is Then, the quantization scale code is corrected to a large value, and the outline image is roughly quantized. In general, the contour portion has more information than the peripheral portion. Therefore, determining the contour part and correcting the quantization scale code of the contour part to a small value determines the information amount of each part of the image to be coded, and the quantization scale of the region with more information amount than the peripheral part. It also means that the code is made smaller and finely quantized.

【００２２】次に、輪郭部分を抽出し、量子化スケール
コードを補正する手法を具体的に節説明する。画像（オ
ブジェクト）中の輪郭の抽出は輝度信号Ｙに基づいて行
う。先ず、輝度信号Ｙを二値化する。二値化の際のしき
い値は、画素値を横軸としたヒストグラムを生成し、白
画素と黒画素との比が例えば９：１となる値とする。こ
の比の値は、マイクロブロックの約半分に輪郭が含まれ
るように設定することが望ましい。Next, a method of extracting the contour portion and correcting the quantized scale code will be concretely described. The contour in the image (object) is extracted based on the luminance signal Y. First, the luminance signal Y is binarized. The threshold value for binarization is a value in which a histogram with the pixel value on the horizontal axis is generated and the ratio of white pixels to black pixels is, for example, 9: 1. The value of this ratio is preferably set so that the contour is included in about half of the microblock.

【００２３】二値化された画像データについて、縦横３
×３画素の画像データを順次抽出し、次の考え方の３×
３テーブルを作成する。（ａ）３×３画素の中央の画素が黒いものは、残りの
８画素の値にかかわらず出力を黒とする。（ｂ）３×３画素の中央の画素が白で、残りの８画素
も全て白の場合は出力を黒とする。（ｃ）（ａ）と（ｂ）以外の全ての場合、出力を白と
する。このような処理を行い、黒であると判別された画素が存
在するマイクロブロックに表示画像の輪郭部が位置する
と判別する。Regarding the binarized image data, the length and width are 3
× 3 pixel image data is sequentially extracted, and 3 ×
3 Create a table. (A) If the central pixel of 3 × 3 pixels is black, the output is black regardless of the values of the remaining 8 pixels. (B) If the central pixel of the 3 × 3 pixels is white and the remaining 8 pixels are all white, the output is black. (C) In all cases except (a) and (b), the output is white. By performing such processing, it is determined that the contour portion of the display image is located in the microblock in which the pixel determined to be black exists.

【００２４】このような処理により、標準画像であるＣ
ＩＦの"Miss Ameriaca"(352x288画素）の画像を処理し
た、処理結果を図３に示す。図３において、網線を付し
たマイクロブロックが輪郭を含むと判別されたブロック
である。図３より、適切に輪郭の判別ができていること
が確認できる。By such processing, the standard image C
The processing result of processing the image of IF "Miss Ameriaca" (352 × 288 pixels) is shown in FIG. In FIG. 3, a microblock with a halftone line is a block determined to include a contour. It can be confirmed from FIG. 3 that the contour can be properly discriminated.

【００２５】輪郭が抽出されたマイクロブロック内の画
像の量子化のための量子化スケールコードを０．８倍
に、輪郭が抽出されなかったマイクロブロックの画像の
量子化のために量子化スケールコードを１．２倍に補正
し、注視時間に基づく補正を実現する。The quantization scale code for quantizing the image in the contour-extracted microblock is multiplied by 0.8, and the quantization scale code for quantizing the image of the microblock in which the contour is not extracted is set to 1. .2 times correction, and correction based on the gaze time is realized.

【００２６】最終的に、位置パラメータと補正値０．８
又は１．２を量子化スケールコードに乗算して補正され
た量子化パラメータを求め、補正された量子化スケール
コードを用いて画像を量子化する。Finally, the position parameter and the correction value 0.8
Alternatively, the quantization scale code is multiplied by 1.2 to obtain the corrected quantization parameter, and the image is quantized using the corrected quantization scale code.

【００２７】次に、画像を取得し、符号化し、さらに復
号化して表示するまでの処理を説明する。先ず、カメラ
１１の撮像部で取得された画像は順次ディジタルデータ
に変換され、符号化部１２の変換部２１に供給され、動
き補償予測（必要なフレームについて）、ＤＣＴ変換処
理等が施される。一方、カメラ１１から供給された画像
データは輪郭抽出部２４にも供給され、フレームバッフ
ァに格納される。輪郭抽出部２４は、フレームバッファ
に格納された画像データに対して、前述のように３×３
テーブルを作成し、黒の画素を検出し、画像の輪郭が位
置するマイクロブロックを判別する。輪郭抽出部２４
は、輪郭が位置すると判別したマイクロブロックの位置
情報を量子化部２２に供給する。Next, the process of acquiring an image, encoding it, decoding it, and displaying it will be described. First, the image acquired by the image pickup unit of the camera 11 is sequentially converted into digital data, supplied to the conversion unit 21 of the encoding unit 12, and subjected to motion compensation prediction (for necessary frames), DCT conversion processing, and the like. . On the other hand, the image data supplied from the camera 11 is also supplied to the contour extraction unit 24 and stored in the frame buffer. The contour extraction unit 24 uses the 3 × 3 image data stored in the frame buffer as described above.
A table is created, black pixels are detected, and the microblock where the contour of the image is located is determined. Contour extraction unit 24
Supplies the position information of the microblock determined to have the contour to the quantizer 22.

【００２８】量子化部２２には、変換部２１より変換さ
れた画像データが供給され、量子化部２２は、通常の適
応量子化処理の処理ルーチンに従って、その画像データ
を量子化するための量子化スケールコードを求める。Ｔ
Ｍ５モードにおいては、視覚的に劣化の目立ちやすい平
坦部で細かく量子化し、劣化の比較的目立ちにくい絵柄
の複雑な部分で粗く量子化するように量子化スケールコ
ードを設定する（中島、尾高、田原：「ＭＰＥＧの符号
化方式−ビデオ圧縮」、テレビ誌、４９，４，ｐｐ４３
５−４６６（１９９１）参照）。The quantizing unit 22 is supplied with the image data converted by the converting unit 21, and the quantizing unit 22 carries out quantization for quantizing the image data according to a processing routine of a normal adaptive quantizing process. Calculate the code scale code. T
In the M5 mode, the quantizer scale code is set so that the flat part where visual deterioration is easily noticed is finely quantized, and the complicated part of the pattern where deterioration is relatively less noticeable is roughly quantized (Nakajima, Otaka, Tahara). : "MPEG coding method-video compression", TV magazine, 49, 4, pp43
5-466 (1991)).

【００２９】次に、量子化部２２は量子化対象の画像デ
ータが位置するマイクロブロックを判別し、該マイクロ
ブロックに割り当てられた位置パラメータの値（０．７
〜１．３）を位置パラメータテーブル２５より読み出
し、量子化スケールコードに乗算する。Next, the quantizing unit 22 discriminates the microblock in which the image data to be quantized is located, and the value (0.7) of the positional parameter assigned to the microblock.
~ 1.3) is read from the position parameter table 25 and is multiplied by the quantization scale code.

【００３０】さらに、輪郭抽出部２４の判別結果から、
量子化対象の画像データが位置するマイクロブロックに
輪郭が位置するか否かを判別し、位置する場合には量子
化スケールコードを０．８倍し、位置しない場合には量
子化スケールコードを１．２倍する。即ち、量子化部２
２は、通常の適応量子化処理の処理ルーチンに従って求
められた量子化スケールコードに位置パラメータと輪郭
の有無に基づく補正値を乗算し、量子化スケールコード
を注視度に基づいて補正する。Further, from the discrimination result of the contour extracting section 24,
It is determined whether or not the contour is located in the microblock where the image data to be quantized is located. If it is located, the quantization scale code is multiplied by 0.8. If it is not located, the quantization scale code is 1. Double. That is, the quantizer 2
In step 2, the quantized scale code obtained in accordance with the processing routine of the normal adaptive quantization processing is multiplied by the correction value based on the position parameter and the presence or absence of the contour, and the quantized scale code is corrected based on the gaze degree.

【００３１】量子化部２２は、このようにして得られた
量子化スケールコードを用いて、変換部２１による変換
後の画像を量子化する。The quantizing unit 22 quantizes the image converted by the converting unit 21 using the quantization scale code thus obtained.

【００３２】可変長符号化部２３は、量子化部２２より
供給される量子化された画像データと量子化に使用した
量子化スケールコードに可変長符号化処理を施す。この
ようにして生成された可変長符号化画像データは伝送路
を介して送信され、或いは、データ蓄積装置に格納され
る。The variable length coding unit 23 performs variable length coding processing on the quantized image data supplied from the quantization unit 22 and the quantized scale code used for quantization. The variable-length coded image data thus generated is transmitted via the transmission path or stored in the data storage device.

【００３３】受信した或いは蓄積装置より読み出した可
変長符号化画像データを復号化する場合には、先ず、可
変長復号化部３１により量子化画像データと量子化スケ
ールコードを復号し、逆量子化部３２に供給する。逆量
子化部３２は、復号化された量子化スケールコードを用
いて量子化画像データを逆量子化し、さらに、逆変換部
３３は逆量子化された画像データに逆ＤＣＴ変換を行
い、動き補償予測の逆の予測を行い、画像データを復号
する。復号された画像データはディスプレイ１４に供給
され、表示される。When decoding the variable length coded image data received or read from the storage device, first, the variable length decoding unit 31 decodes the quantized image data and the quantized scale code, and dequantizes them. Supply to the section 32. The inverse quantizer 32 inversely quantizes the quantized image data using the decoded quantization scale code, and the inverse transformer 33 further performs inverse DCT transform on the inversely quantized image data to perform motion compensation. The prediction opposite to the prediction is performed and the image data is decoded. The decoded image data is supplied to the display 14 and displayed.

【００３４】以上説明したように、この実施の形態によ
れば、他の条件が同一であるならば、観者により注視さ
れる領域の画像データを量子化するために小さな量子化
スケールコードが割り当てられて細かく量子化され、注
視されない領域の画像データを量子化するために大きな
量子化スケールコードが割り当てられて粗く量子化され
る。従って、注視領域に対して優先的に符号量を割り当
てることにより、主観評価によって得られる復号画像の
品質を向上することができる。また、注視領域へ割り当
てる符号量を変化させることなく、非注視領域に割り当
てる符号量を低減させれば、主観評価によって得られる
画像品質を維持したまま低ビットレートの符号化を実現
することができる。As described above, according to this embodiment, if the other conditions are the same, a small quantization scale code is assigned to quantize the image data of the area watched by the viewer. The image is finely quantized, and a large quantization scale code is assigned to quantize the image data in the unfocused area, and the image is coarsely quantized. Therefore, the quality of the decoded image obtained by the subjective evaluation can be improved by preferentially assigning the code amount to the gaze area. Further, if the code amount assigned to the non-gaze region is reduced without changing the code amount assigned to the gaze region, it is possible to realize the low bit rate encoding while maintaining the image quality obtained by the subjective evaluation. .

【００３５】しかも、人間の注視点の空間的な分布と時
間的な分布に基づいて量子化スケールコードを設定して
いるので、動画像を適切に符号化・復号化することがで
きる。Moreover, since the quantization scale code is set based on the spatial distribution and the temporal distribution of the human gazing point, the moving image can be appropriately encoded / decoded.

【００３６】なお、この発明は上記実施の形態に限定さ
れず、種々の変形及び応用が可能である。例えば、以上
の説明においては、動画像の符号化・復号化処理の国際
標準であるＭＰＥＧ−２のテストモデルであるＴＭ５を
例にこの発明を説明したが、他の任意の画像データの符
号化・復号化処理に適用することができる。また、動画
像に限定されず、静止画像の符号化・復号化処理にも適
用することが可能である。但し、静止画の場合は、空間
的な注視の影響を大きくし、注視時間による注視の影響
を小さくする。また、注視時間の長い領域として画像の
輪郭部分を選択したが、他の部分が選択してもよい。ま
た、輪郭を抽出する他の手法を使用してもよい。さら
に、マイクロブロックのサイズ等も任意である。The present invention is not limited to the above embodiment, and various modifications and applications are possible. For example, in the above description, the present invention has been described by taking TM5, which is a test model of MPEG-2, which is an international standard for encoding / decoding of moving images, as an example, but encoding any other image data. -Can be applied to decryption processing. Further, the present invention is not limited to moving images, and can be applied to still image encoding / decoding processing. However, in the case of a still image, the influence of the spatial gaze is increased and the influence of the gaze depending on the gaze time is reduced. Further, although the contour portion of the image is selected as the area having a long gaze time, another portion may be selected. Also, other methods of extracting the contour may be used. Further, the size of the microblock and the like are also arbitrary.

【００３７】また、ＮＴＳＣ方式の通常画像に限らず、
ＮＴＳＣ方式のワイドモード、ＰＡＬ方式の画像などに
もこの発明は同様に適用可能である。また、位置パラメ
ータはＮＴＳＣ方式の通常画像の場合の例であり、他の
任意の設定値を使用することができる。また、輪郭の検
出の有無に基づく補正値も０．８又は１．２に限定され
ず、他の値を使用することができる。Further, not only the normal image of the NTSC system,
The present invention is similarly applicable to the wide mode of the NTSC system and the image of the PAL system. Further, the position parameter is an example in the case of a normal image of the NTSC system, and any other set value can be used. Further, the correction value based on the presence or absence of contour detection is not limited to 0.8 or 1.2, and other values can be used.

【００３８】例えば、上記実施の形態においては、変換
部２１と逆変換部３３では、ＤＣＴ変換と逆ＤＣＴ変換
を実行したが、サブバンド符号化とサブバンド復号化、
ウェーブレッド変換と逆ウエーブレッド変換等の量子化
に適した画像データを生成する各種の変換法を使用でき
る。For example, in the above embodiment, the transform unit 21 and the inverse transform unit 33 perform the DCT transform and the inverse DCT transform. However, subband coding and subband decoding,
Various transform methods that generate image data suitable for quantization, such as wave red transform and inverse wave red transform, can be used.

【００３９】画像を取得してから可変長符号化を実行す
るまでの装置と、可変長符号化された画像をデータを受
信して表示するまでの装置とは、物理的に分離されてい
てもよい。同様に、カメラ１１で得られた画像を一旦蓄
積し、その後、符号化部１２に供給するようにしてもよ
く、また、カメラ１１を使用せずに、コンピュータ等に
より画像データを生成して符号化部１２に供給してもよ
い。Even though the device from the acquisition of the image to the variable length coding and the device from the data reception and display of the variable length coded image are physically separated, Good. Similarly, the image obtained by the camera 11 may be temporarily stored and then supplied to the encoding unit 12. Alternatively, the image data may be generated by a computer or the like and encoded without using the camera 11. You may supply to the conversion part 12.

【００４０】[0040]

【発明の効果】この発明によれば、注視領域の画像の符
号化に小さな量子化スケールコードを割り当て、注視領
域の画像の符号化に大きな量子化スケールコードを割り
当てているので、低ビットレートで従来と実質的に同一
品質の画像を符号化・復号化することができ、同一ビッ
トレートならば従来より実質的に高品質の画像を符号化
・復号化することができる。According to the present invention, a small quantization scale code is assigned to the image of the gaze area and a large quantization scale code is assigned to the image of the gaze area. Therefore, at a low bit rate. Images of substantially the same quality as in the past can be encoded / decoded, and if the bit rate is the same, images of substantially higher quality than in the past can be encoded / decoded.

[Brief description of drawings]

【図１】この発明の実施の形態にかかる画像符号化・復
号化システムの構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of an image encoding / decoding system according to an embodiment of the present invention.

【図２】注視領域の空間的分布とマイクロブロックに割
り当てられる位置パラメータの例を示す図である。FIG. 2 is a diagram showing an example of a spatial distribution of a gaze area and positional parameters assigned to microblocks.

【図３】表示画像内の輪郭の位置を示す図である。FIG. 3 is a diagram showing a position of a contour in a display image.

[Explanation of symbols]

１１カメラ１２符号化部１３復号化部１４ディスプレイ２１変換部２２量子化部２３可変長符号化部２４輪郭抽出部２５位置パラメータテーブル３１可変長符号化部３２逆量子化部３３逆変換部 11 camera 12 coding unit 13 decoding unit 14 display 21 conversion unit 22 quantization unit 23 variable length coding unit 24 contour extraction unit 25 position parameter table 31 variable length coding unit 32 dequantization unit 33 inverse transformation unit

───────────────────────────────────────────────────── フロントページの続き (72)発明者中村太一東京都江東区豊洲三丁目３番３号エヌ・ティ・ティ・データ通信株式会社内 ────────────────────────────────────────────────── ─── Continuing on the front page (72) Inventor Taichi Nakamura 3-3-3 Toyosu, Koto-ku, Tokyo NTT Data Communications Corporation

Claims

[Claims]

1. An input unit for inputting an image, a discriminating unit for discriminating a gaze region in the image input by the input unit, and a first quantization for the gaze region discriminated by the discriminating unit. And a quantizing means for quantizing the image using a second quantized scale code that is larger than the first quantized scale code for the non-gaze region and uses a scale code. Image coding system.

2. The discriminating means comprises means for discriminating a spatially focused area on the image and means for discriminating a temporally focused area according to a predetermined standard. The image coding system according to claim 1.

3. The means for discriminating the spatially gazed area includes means for discriminating a relative position in the image, and the means for judging the temporal gaze area is an outline position of the image. The image coding system according to claim 2, further comprising means for discriminating.

4. A means for setting a first correction amount of a quantization scale code based on a relative position in the quantization target image for each minute region of the quantization target image, and the quantization target image. Means for discriminating the amount of information in the micro region for each micro region, and setting a second correction amount of the quantization scale code according to the discrimination result; An image code characterized by comprising a quantizing means for performing correction based on the first correction amount and the second correction amount, and quantizing the image to be quantized using the corrected quantization scale code. System.

5. An image coding / decoding system, further comprising means for decoding an image coded by the image coding system according to any one of claims 1 to 4.

6. A spatial distribution density of gaze points on an encoding target image is discriminated, a distribution of gaze times on an encoding target image is discriminated, and based on the discriminated spatial distribution density and gaze time distribution,
Finely quantize the high-gaze area of the image to be encoded,
An image coding method characterized in that a region of a target image to be coded having a low degree of gaze is roughly quantized.

7. The image coding method according to claim 6, wherein the determination of the gaze time distribution is performed by extracting a contour in the image to be coded.

8. An image coding / decoding method, which decodes an image coded by the image coding method according to claim 6.