JPH10336653A

JPH10336653A - Image coder and image decoder

Info

Publication number: JPH10336653A
Application number: JP13971297A
Authority: JP
Inventors: Yoshiaki Shishikui; 善明鹿喰; Shinichi Sakaida; 慎一境田; Yutaka Kaneko; 金子　　豊; Yutaka Tanaka; 豊田中
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 1997-05-29
Filing date: 1997-05-29
Publication date: 1998-12-18
Anticipated expiration: 2017-05-29
Also published as: JP3933749B2

Abstract

PROBLEM TO BE SOLVED: To optimize decoding processing by scalar multiplying with a value proportional to the square root of a pixel number for each image area with a conversion coefficient and giving the product to a quantization means to minimize total distortion of a decoded image. SOLUTION: An image signal is given to a block processing circuit 2, its area information is fed to a block processing circuit 1, where they are divided into block signals and block area information with respect to a plurality of blocks of a prescribed size of N×N. The block area information subject to block processing by the block processing circuit 1 is fed to support area correction circuits 4, 5, a coefficient introduction circuit 6a, and a discrimination circuit 7. (N×N) conversion coefficients introduced by a region supported(RS)- discrete cosine transform(DCT) section 10 are fed to a multiplier 12, where all the conversion coefficients are multiplied by a scalar value of M<1/2> /N. The conversion coefficients subject to scalar multiple of a value M<1/2> /N proportional to the square root of coding object pixel number M are fed to a quantizer 14, from which a quantized index signal is outputted.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は画像符号化装置およ
び復号化装置に関し、特に、テレビジョン信号等の画像
信号を画像構成要素単位で処理することで高能率符号化
を行う画像符号化装置およびメディアを介してこの画像
符号化装置からの符号化データを復号する画像復号化装
置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image coding apparatus and a decoding apparatus, and more particularly to an image coding apparatus and a decoding apparatus for performing high-efficiency coding by processing an image signal such as a television signal in units of image components. The present invention relates to an image decoding device that decodes encoded data from the image encoding device via a medium.

【０００２】[0002]

【従来の技術】従来行われている画像信号の高能率符号
化は、画像信号の水平または垂直方向の相関を利用した
直交変換により特定の変換係数に信号電力を集中させ
て、必要ビット数の削減を図るものが多い。この直交変
換を用いた高能率符号化では１枚の画像を一定数の画素
からなる一定形状の複数の単位ブロックに分け、このブ
ロック毎に変換を行い、このブロック内の画素を基本と
なる基底ベクトルの重み付き線形和として表し、この重
みを量子化してこの対応番号をメディアを介して伝送し
ている。2. Description of the Related Art Conventionally, high-efficiency coding of an image signal is performed by concentrating signal power on a specific transform coefficient by orthogonal transform using a horizontal or vertical correlation of the image signal to reduce a required number of bits. There are many things to reduce. In the high-efficiency coding using the orthogonal transform, one image is divided into a plurality of unit blocks each having a fixed shape composed of a fixed number of pixels, and conversion is performed for each block. Vectors are represented as weighted linear sums, the weights are quantized, and the corresponding numbers are transmitted via media.

【０００３】ＭＰＥＧの標準化に代表される従来の画像
符号化においては、１枚の画像は画像内容とは関係なく
連続したデータ列としてのみ扱われ、より少ないビット
数でより高忠実に画像の波形（パターン）の再現を行う
ことが目標とされている。そして、このような従来の符
号化では、通常はＤＣＴ(Discrete Cosine Transform)
等の基本的に定常性を前提とした処理が用いられるた
め、前景と背景といった画像信号に本質的に内在する非
定常性が符号化効率や復号画質の低下につながることに
なる。また、従来、画像の信号処理やハンドリングは、
１枚の画像毎、動画像の場合には、ひと続きのフレーム
毎に一括して扱われている。従って、画像構成要素単位
等、より小さな単位で信号処理やハンドリングを行うこ
とは考慮されていなかった。In the conventional image coding represented by MPEG standardization, one image is treated only as a continuous data sequence irrespective of the image content, and the image waveform is more faithfully realized with a smaller number of bits. The goal is to reproduce (patterns). In such conventional encoding, usually, DCT (Discrete Cosine Transform) is used.
And the like, which basically assumes continuity, the non-stationaryness inherent in the image signal, such as the foreground and background, leads to a decrease in coding efficiency and decoding image quality. Conventionally, signal processing and handling of images are
In the case of a single image, or in the case of a moving image, it is handled collectively for each continuous frame. Therefore, performing signal processing and handling in smaller units such as image component units has not been considered.

【０００４】そこで、近年、画像信号を各画像構成要素
に対応する複数の領域に分割した上で処理・符号化を行
う符号化の研究が活発になってきている。この符号化で
は、上述のような非定常性を回避した上で、領域毎に最
適な処理手法を選択して施すことができるため、より高
性能な符号化が可能となる。また、画像構成要素単位の
処理が可能となり、よりハンドリング性の良好な映像記
述を実現できる。Therefore, in recent years, studies on encoding for processing and encoding after dividing an image signal into a plurality of regions corresponding to respective image components have been actively conducted. In this encoding, it is possible to select and apply an optimal processing method for each area while avoiding the above-described non-stationarity, thereby enabling higher-performance encoding. In addition, processing can be performed in units of image components, and a video description with better handleability can be realized.

【０００５】また、従来の画像信号の波形符号化では、
ＤＣＴ符号化に代表されるように画像を一定サイズの矩
形ブロックに分割して処理を行っていた。[0005] In the conventional image signal waveform encoding,
The image has been divided into rectangular blocks of a certain size and processed as represented by DCT coding.

【０００６】しかし、複数の領域に分割して領域毎に処
理する場合、領域形状は一般に矩形にはならないので、
画像を一定サイズの矩形ブロックに分割して処理する従
来の手法をそのまま用いることはできない。そこで、従
来のブロックベースの符号化手法を基本に、任意形状の
画素群の処理を行う拡張手法がいくつか提案されてい
る。However, in the case of dividing into a plurality of regions and processing each region, since the region shape is not generally rectangular,
A conventional method of dividing an image into rectangular blocks of a certain size and processing the image cannot be used as it is. Therefore, based on the conventional block-based encoding method, some extended methods for processing a pixel group having an arbitrary shape have been proposed.

【０００７】このような手法の一つとして、ＤＣＴ変換
基底に対して符号化対象画像の画像領域によってマスキ
ングし、かつスカラー倍して、画像領域毎に新たな変換
基底ベクトルを導出し、これを用いて変換係数を求める
という領域サポートＤＣＴ（ＲＳ−ＤＣＴ：Region Sup
port DCT）が本出願人らによって提案されている（鹿喰
善明「領域サポートＤＣＴを用いた任意形状符号化の検
討」信学技報ＩＥ９５−１０９（Ｄｅｃ．１９９５）、
特願平７−３２５９７１号）。これにより導出される変
換係数は、周知の２次元逆ＤＣＴにより復号することが
できる。As one of such techniques, a DCT transform base is masked by an image area of an image to be coded and multiplied by a scalar to derive a new transform base vector for each image area. Region support DCT (RS-DCT: Region Sup
port DCT) has been proposed by the present applicants (Yoshiaki Kaga, "Study of Arbitrary Shape Coding Using Region-Supported DCT," IEICE Technical Report IE 95-109 (Dec. 1995),
Japanese Patent Application No. 7-325971). The transform coefficient derived in this way can be decoded by a well-known two-dimensional inverse DCT.

【０００８】このＲＳ−ＤＣＴに類似の手法として、矩
形形状の２次元ＤＣＴを用いるために、矩形内で符号化
対象領域に属さない画素については領域内画素のレベル
の平均等で外挿を行う手法であるＲＦ−ＤＣＴ（Region
Filling DCT）も提案されている（木村淳一：「ブロッ
ク内領域分割による画像符号化の検討」Ｐｒｏｃ．ＰＣ
ＳＪ，’９５，３−７，ｐｐ．３９−４０（Ｏｃｔ．１
９９５），伊藤典男、堅田裕之、草尾寛：「任意形状領
域に対するＤＣＴ実現手法の検討」，Ｐｒｏｃ．ＰＣＳ
Ｊ，’９５，５‐２，ｐｐ．７７−７８（Ｏｃｔ．１９
９５））。As a method similar to the RS-DCT, in order to use a rectangular two-dimensional DCT, pixels that do not belong to the encoding target region in the rectangle are extrapolated by averaging the levels of the pixels in the region. RF-DCT (Region
Filling DCT) has also been proposed (Junichi Kimura: "Study of image coding by region division in block" Proc. PC
SJ, '95, 3-7, pp. 39-40 (Oct. 1)
995), Norio Ito, Hiroyuki Katata, Hiroshi Kusao: "Study of DCT Realization Method for Arbitrary Shape Area", Proc. PCS
J, '95, 5-2, pp. 77-78 (Oct. 19)
95)).

【０００９】ＲＳ−ＤＣＴ、ＲＦ−ＤＣＴでは、単位ブ
ロック内にたとえば２つの画像領域が混在するときに
は、両方の画像領域についてそれぞれブロックＤＣＴを
行う。このため、変換係数はそれぞれ画像領域毎にブロ
ックの大きさの分だけ（即ち、２つの画像領域を含んだ
ブロックの画素数だけ）導出される。従って、両方の画
像領域についての変換係数を合わせると、１ブロックに
対し２ブロック分の数の係数が導出され、これら２ブロ
ック分の変換係数を伝送する必要がある。このように、
ＲＳ−ＤＣＴ、ＲＦ−ＤＣＴでは、ブロック内に２つの
画像領域が混在するときには、見かけ上画素数の２倍の
変換係数を伝送する必要がある。しかし、ＤＣＴを用い
た通常の高能率符号化においては、視覚的に重要で、か
つ信号電力の大半を占める低周波成分に相当する変換係
数のみを伝送するので、伝送する変換係数が見かけ上増
えることは、伝送情報量の実効的な増加につながるもの
ではない。In the RS-DCT and RF-DCT, when, for example, two image areas are mixed in a unit block, block DCT is performed for both image areas. Therefore, the transform coefficients are derived for each image area by the size of the block (ie, by the number of pixels of the block including the two image areas). Therefore, when the transform coefficients for both image areas are combined, two blocks of coefficients are derived for one block, and it is necessary to transmit these two blocks of transform coefficients. in this way,
In the RS-DCT and the RF-DCT, when two image areas are mixed in a block, it is necessary to transmit a transform coefficient twice as many as the number of pixels apparently. However, in normal high-efficiency coding using DCT, only the transform coefficients that are visually important and correspond to low-frequency components that occupy most of the signal power are transmitted, so that the transform coefficients to be transmitted increase in appearance. This does not lead to an effective increase in the amount of transmitted information.

【００１０】これらの手法では、ブロック内に含まれる
符号化対象画素により変換係数が導出されるが、復号側
においては２次元逆ＤＣＴによってブロック形状の画像
が見かけ上復元され、別途伝送した符号化対象領域を示
すマスクパターンにより、該当領域の画素だけが選択さ
れ、復号画像に用いられる。In these methods, a transform coefficient is derived from a pixel to be coded included in a block. On the decoding side, a block-shaped image is apparently restored by two-dimensional inverse DCT, and the separately transmitted coding is performed. Only the pixels in the corresponding area are selected by the mask pattern indicating the target area and used for the decoded image.

【００１１】上記の手法は簡易であるが、低周波領域内
とその近傍での連続性はある程度保証されるものの、中
・高周波領域については不連続となり、挿入画素レベ
ル、ひいては領域内画素記述法としての最適性の保証が
ない。Although the above method is simple, continuity in and around the low-frequency region is guaranteed to some extent, but discontinuity occurs in the middle and high-frequency regions, and the inserted pixel level and, consequently, the pixel description method in the region There is no guarantee of optimality.

【００１２】[0012]

【発明が解決しようとする課題】ＲＳ−ＤＣＴ、ＲＦ−
ＤＣＴともに、変換基底の数は、符号化対象画素数（以
下Ｍとする）に係わらずブロック内の画素数（以下、Ｎ
×Ｎとする）と一致する。従って、見かけ上（Ｎ×Ｎ）
個の変換係数が得られる。ただし、変換はＭ個（Ｍ≦Ｎ
×Ｎ）の符号化対象画素の値に対して行われるので、得
られた変換係数のうちで実際に有意な（０でない）もの
は、高々Ｍ個にすぎない。SUMMARY OF THE INVENTION RS-DCT, RF-
In both DCTs, the number of transform bases is the number of pixels in a block (hereinafter N) regardless of the number of pixels to be encoded (hereinafter M).
× N). Therefore, apparently (N × N)
Are obtained. However, the number of conversions is M (M ≦ N
× N) is performed on the value of the pixel to be coded, so that at most M of the obtained transform coefficients are actually significant (non-zero).

【００１３】ＤＣＴを用いた高能率符号化では、変換係
数に対し量子化が行われて情報量が低減されると同時
に、量子化時の量子化誤差による符号化歪みが発生す
る。ただし、変換係数の値が０の画素については量子化
雑音は０になる。従って、Ｍ個の符号化対象画素を含む
ブロック全体では、量子化雑音はＭ個分となる。即ち、
符号化対象画素１画素についての量子化雑音電力をＰｅ
とすると、Ｍ個の画素に対してＭ個の有意な変換係数が
求められ、その結果、ブロックに対しＭ×Ｐｅの符号化
歪みをもたらす。In high-efficiency coding using DCT, transform coefficients are quantized to reduce the amount of information, and at the same time, coding distortion due to a quantization error at the time of quantization occurs. However, quantization noise is 0 for a pixel whose transform coefficient value is 0. Therefore, in the entire block including M pixels to be encoded, the quantization noise is equivalent to M pixels. That is,
The quantization noise power for one pixel to be encoded is Pe
Then, M significant transform coefficients are obtained for M pixels, resulting in M × Pe coding distortion for the block.

【００１４】このようなＤＣＴにより得られた符号化デ
ータから、２次元逆ＤＣＴによって一旦（Ｎ×Ｎ）個の
画素のブロック状の復号画像が得られた後、これらの画
素のうちＭ個のみがマスクパターンを用いて選択されて
残される。この処理に伴い、選択された画素以外に分配
された量子化雑音による歪みも除去される。After a block-like decoded image of (N × N) pixels is once obtained by two-dimensional inverse DCT from the encoded data obtained by such DCT, only M of these pixels are obtained. Are selected and left using the mask pattern. Along with this processing, distortion due to quantization noise distributed to other than the selected pixel is also removed.

【００１５】すなわち、ブロックに残される量子化雑音
による歪みはThat is, distortion due to quantization noise left in a block is

【００１６】[0016]

【数１】 (Equation 1)

【００１７】となり、画素単位ではIn the pixel unit,

【００１８】[0018]

【数２】 (Equation 2)

【００１９】となる。## EQU1 ##

【００２０】この式から、復号側で選択された画素単位
の歪み量は、その画素が属するブロック内の符号化対象
画素数Ｍに比例することになる。From this equation, the amount of distortion for each pixel selected on the decoding side is proportional to the number M of pixels to be encoded in the block to which the pixel belongs.

【００２１】伝送する情報量一定の条件で復号画像の総
歪み量が最小になるよう最適化するには、すなわち、伝
送情報と歪みに関して最も効率的な符号化を実現するに
は、画素単位の歪み量を均一にする必要がある。従っ
て、上記のように符号化された場合、画素単位の歪み量
がＭに依存してブロック毎、画像毎に異なり、最適化さ
れないことになる。In order to optimize the total amount of distortion of the decoded image under the condition that the amount of information to be transmitted is constant, that is, to realize the most efficient encoding of the transmission information and the distortion, it is necessary to use the pixel unit. It is necessary to make the amount of distortion uniform. Therefore, when the encoding is performed as described above, the distortion amount for each pixel differs for each block and each image depending on M, and is not optimized.

【００２２】そこで、本発明は上述の点に鑑みて成され
たもので、上記の課題を解決した画像符号化装置および
画像復号化装置を提供することを目的とする。Therefore, the present invention has been made in view of the above points, and has as its object to provide an image encoding apparatus and an image decoding apparatus which solve the above-mentioned problems.

【００２３】[0023]

【課題を解決するための手段】上記目的を達成するため
に、請求項１に記載の本発明の装置では、複数の構成要
素からなる画像を所定数の画素で構成される所定寸法の
複数のブロックに分割し、該ブロックに含まれる単一の
構成要素からなる少なくとも一つの画像領域に対応した
画素信号ベクトルを、前記構成要素の境界情報に基づい
て前記画像領域毎に導出された直交変換基底ベクトルの
重み付け線形和として変換出力する変換手段と、前記変
換手段からの信号を量子化する量子化手段とを備える画
像符号化装置において、前記変換手段からの重み付け変
換係数を、前記ブロックに含まれる前記画像領域毎にそ
の画素数の平方根に比例した値でスカラー倍して前記量
子化手段の入力とすることにより符号化処理を最適化す
る最適化手段を備えたことを特徴とする。In order to achieve the above object, in the apparatus according to the first aspect of the present invention, an image composed of a plurality of constituent elements is formed by a plurality of pixels of a predetermined size formed of a predetermined number of pixels. A pixel signal vector corresponding to at least one image region composed of a single component included in the block is divided into blocks, and an orthogonal transform base derived for each image region based on boundary information of the component. In an image encoding apparatus including a conversion unit that converts and outputs a vector as a weighted linear sum and a quantization unit that quantizes a signal from the conversion unit, the block includes a weighted conversion coefficient from the conversion unit. Optimizing means for optimizing the encoding process by scalar multiplying by a value proportional to the square root of the number of pixels for each image area and inputting the result to the quantizing means is provided. Characterized in that was.

【００２４】ここで、請求項２に記載の本発明の装置で
は、複数の構成要素からなる画像を所定数の画素で構成
される所定寸法の複数のブロックに分割し、該ブロック
に含まれる単一の構成要素からなる少なくとも一つの画
像領域に対応した画素信号ベクトルを、前記構成要素の
境界情報に基づいて前記画像領域毎に導出された直交変
換基底ベクトルの重み付け線形和として変換出力する変
換手段と、前記変換手段からの信号を量子化する量子化
手段とを備える画像符号化装置において、前記画素信号
ベクトルの信号レベルを、前記ブロックに含まれる前記
画像領域毎にその画素数の平方根に比例した値でスカラ
ー倍して前記変換手段の入力とすることにより、符号化
処理を最適化する最適化手段を備えたことを特徴とす
る。Here, in the apparatus according to the second aspect of the present invention, an image composed of a plurality of constituent elements is divided into a plurality of blocks of a predetermined size composed of a predetermined number of pixels, and a single block included in the block is divided. A conversion unit for converting and outputting a pixel signal vector corresponding to at least one image region composed of one component as a weighted linear sum of orthogonal transformation base vectors derived for each image region based on boundary information of the component; And a quantization means for quantizing a signal from the conversion means, wherein the signal level of the pixel signal vector is proportional to the square root of the number of pixels for each of the image regions included in the block. Optimizing means for optimizing the encoding process by scalar multiplying the obtained value by the scalar multiplication and inputting the input to the conversion means.

【００２５】ここで、請求項３に記載の本発明の装置で
は、複数の構成要素からなる画像を所定数の画素で構成
される所定寸法の複数のブロックに分割し、該ブロック
に含まれる単一の構成要素からなる少なくとも一つの画
像領域に対応した画素信号ベクトルを、前記構成要素の
境界情報に基づいて前記画像領域毎に導出された直交変
換基底ベクトルの重み付け線形和として変換出力する変
換手段と、前記変換手段からの信号を量子化する量子化
手段とを備える画像符号化装置において、前記量子化手
段は、前記ブロックに含まれる前記画像領域毎にその画
素数の平方根に逆比例した値で量子化ステップ幅をスカ
ラー倍して符号化処理を最適化する量子化手段であるこ
とを特徴とする。Here, in the apparatus according to the present invention, an image composed of a plurality of constituent elements is divided into a plurality of blocks of a predetermined size composed of a predetermined number of pixels, and a single block included in the block is divided. A conversion unit for converting and outputting a pixel signal vector corresponding to at least one image region composed of one component as a weighted linear sum of orthogonal transformation base vectors derived for each image region based on boundary information of the component; And a quantizing means for quantizing a signal from the transforming means, wherein the quantizing means has a value inversely proportional to the square root of the number of pixels for each of the image regions included in the block. And a quantization means for optimizing the encoding process by multiplying the quantization step width by a scalar.

【００２６】ここで、請求項４に記載の本発明の装置で
は、前記直交変換基底ベクトルは、離散コサイン変換基
底ベクトルの所定成分を前記構成要素の境界情報に基づ
いて前記画像領域毎にマスキングし、かつスカラー倍し
て前記画像領域毎に新たに導出した直交変換基底ベクト
ルとすることもできる。Here, in the apparatus according to the present invention, the orthogonal transform base vector masks a predetermined component of the discrete cosine transform base vector for each of the image regions based on the boundary information of the constituent elements. And a scalar multiplication to obtain a newly derived orthogonal transformation base vector for each image area.

【００２７】上記目的を達成するために、請求項５に記
載の本発明の装置では、請求項１に記載の画像符号化装
置から出力される、所定数の画素で構成され所定寸法で
分割されたブロックに含まれる単一の構成要素からなる
少なくとも一つの画像領域に対応した画素信号ベクトル
を符号化した符号化データを、逆量子化して重み付け係
数を求める逆量子化手段と、前記構成要素の境界情報に
基づいて前記画像領域毎に導出された直交変換基底ベク
トルを重み付け線形加算して、前記画像領域に対応した
画素信号ベクトルを復号する逆変換手段とを備える画像
復号化装置であって、前記逆量子化して求めた重み付け
係数を、前記ブロックに含まれる前記画像領域毎にその
画素数の平方根に逆比例した値でスカラー倍して前記直
交変換基底ベクトルを重み付け線形加算することにより
復号化処理を最適化する最適化手段を備えたことを特徴
とする。According to a fifth aspect of the present invention, there is provided an apparatus according to the present invention, comprising a predetermined number of pixels which are output from the image encoding apparatus according to the first aspect and are divided by a predetermined size. Inverse quantization means for inversely quantizing encoded data obtained by encoding a pixel signal vector corresponding to at least one image region composed of a single component included in a block to obtain a weighting coefficient; An inverse decoding unit for performing weighted linear addition of orthogonal transformation base vectors derived for each image region based on boundary information and decoding a pixel signal vector corresponding to the image region, The weighting coefficient obtained by the inverse quantization is scalar-multiplied by a value inversely proportional to the square root of the number of pixels for each of the image regions included in the block, and the orthogonal transform basis vector is calculated. Characterized by comprising an optimization means for optimizing a decoding process by the weighted linear addition of.

【００２８】ここで、請求項６に記載の本発明の装置で
は、請求項２に記載の画像符号化装置から出力される、
所定数の画素で構成され所定寸法で分割されたブロック
に含まれる単一の構成要素からなる少なくとも一つの画
像領域に対応した画素信号ベクトルを符号化した符号化
データを、逆量子化して重み付け係数を求める逆量子化
手段と、前記構成要素の境界情報に基づいて前記画像領
域毎に導出された直交変換基底ベクトルを重み付け線形
加算して、前記画像領域に対応した画素信号ベクトルを
復号する逆変換手段とを備える画像復号化装置であっ
て、前記逆変換手段出力を、前記ブロックに含まれる前
記画像領域毎にその画素数の平方根に逆比例した値でス
カラー倍して前記画像領域に対応した画素信号ベクトル
を復号することにより復号化処理を最適化する最適化手
段を備えたことを特徴とする。Here, in the apparatus of the present invention described in claim 6, the image output from the image coding apparatus described in claim 2 is:
A coded data obtained by coding a pixel signal vector corresponding to at least one image region consisting of a single component included in a block divided by a predetermined size and configured by a predetermined number of pixels is inversely quantized and weighted by a weighting coefficient. Inverse quantization means for calculating the weighted linear addition of orthogonal transformation basis vectors derived for each of the image regions based on the boundary information of the components, and decoding pixel signal vectors corresponding to the image regions. And an image decoding apparatus comprising: means for scalar multiplying the output of the inverse transform means by a value inversely proportional to the square root of the number of pixels for each of the image areas included in the block to correspond to the image area. Optimizing means for optimizing a decoding process by decoding a pixel signal vector is provided.

【００２９】ここで、請求項７に記載の本発明の装置で
は、請求項３に記載の画像符号化装置から出力される、
所定数の画素で構成され所定寸法で分割されたブロック
に含まれる単一の構成要素からなる少なくとも一つの画
像領域に対応した画素信号ベクトルを符号化した符号化
データを、逆量子化して重み付け係数を求める逆量子化
手段と、前記構成要素の境界情報に基づいて前記画像領
域毎に導出された直交変換基底ベクトルを重み付け線形
加算して、前記画像領域に対応した画素信号ベクトルを
復号する逆変換手段とを備える画像復号化装置であっ
て、前記逆量子化手段は、前記ブロックに含まれる前記
画像領域毎にその画素数の平方根に比例した値で量子化
ステップ幅をスカラー倍して復号化処理を最適化する逆
量子化手段を備えたことを特徴とする。Here, in the apparatus of the present invention described in claim 7, the image output from the image coding apparatus according to claim 3 is:
A coded data obtained by coding a pixel signal vector corresponding to at least one image region consisting of a single component included in a block divided by a predetermined size and configured by a predetermined number of pixels is inversely quantized and weighted by a weighting coefficient. Inverse quantization means for calculating the weighted linear addition of orthogonal transformation basis vectors derived for each of the image regions based on the boundary information of the components, and decoding pixel signal vectors corresponding to the image regions. Means, wherein said inverse quantization means performs scalar multiplication of a quantization step width by a value proportional to a square root of the number of pixels for each of said image regions included in said block. It is characterized by comprising an inverse quantization means for optimizing the processing.

【００３０】ここで、請求項８に記載の本発明の装置で
は、前記直交変換基底ベクトルは、離散コサイン変換基
底ベクトルの所定成分を前記構成要素の境界情報に基づ
いて前記画像領域毎にマスキングし、かつスカラー倍し
て前記画像領域毎に新たに導出した直交変換基底ベクト
ルとすることもできる。Here, in the apparatus according to the present invention, the orthogonal transform basis vector masks a predetermined component of the discrete cosine transform basis vector for each of the image areas based on the boundary information of the component. And a scalar multiplication to obtain a newly derived orthogonal transformation base vector for each image area.

【００３１】[0031]

【発明の実施の形態】以下、図面を参照しながら本発明
の実施の形態を詳細に説明する。Embodiments of the present invention will be described below in detail with reference to the drawings.

【００３２】（第１の実施の形態）図１は本発明を適用
した画像符号化装置の第１の実施の形態の構成を示すブ
ロック図である。(First Embodiment) FIG. 1 is a block diagram showing the configuration of a first embodiment of an image coding apparatus to which the present invention is applied.

【００３３】本実施の形態では、処理対象画像の画像構
成要素数は２であり、分割された各単位ブロック（以
下、ブロックと記す）内の画像領域は最大で２（最小は
１）とする。また、各画像構成要素が画面内で占める領
域を表す領域情報（境界情報）は既知であるものとす
る。In the present embodiment, the number of image components of the image to be processed is 2, and the image area in each divided unit block (hereinafter, referred to as a block) is 2 at the maximum (the minimum is 1). . Also, it is assumed that area information (boundary information) indicating an area occupied by each image component in the screen is known.

【００３４】図１に示す画像符号化装置は、領域サポー
トＤＣＴ部１０と乗算器１２と量子化器１４と可変長符
号化器１６を備えている。The image encoding apparatus shown in FIG. 1 includes an area support DCT unit 10, a multiplier 12, a quantizer 14, and a variable length encoder 16.

【００３５】領域サポートＤＣＴ部１０は、領域情報が
入力されるブロック化回路１，処理対象の画像信号が入
力されるブロック化回路２，ＤＣＴ(Discrete Cosine T
ransform；離散コサイン変換）基底供給回路３，サポー
ト領域修正回路４および５，係数導出回路６ａおよび６
ｂおよび６ｃ，組合せ回路９，判定回路７，並びに切替
え回路８により構成されている。サポート領域修正回路
は、一つの処理対象画像の画像構成要素数だけ必要であ
る。The area support DCT unit 10 includes a blocking circuit 1 for receiving area information, a blocking circuit 2 for receiving an image signal to be processed, and a DCT (Discrete Cosine T).
ransform; discrete cosine transform) basis supply circuit 3, support area correction circuits 4 and 5, coefficient derivation circuits 6a and 6
b and 6c, a combination circuit 9, a judgment circuit 7, and a switching circuit 8. The support area correction circuit is required for the number of image components of one processing target image.

【００３６】画像信号はブロック化回路２に、その領域
情報はブロック化回路１に供給され、Ｎ×Ｎの一定の寸
法の複数のブロックに対するブロック信号およびブロッ
ク領域情報に分割される。画素数Ｎの値は、一般的には
８または１６である。The image signal is supplied to a blocking circuit 2 and its area information is supplied to a blocking circuit 1, where it is divided into block signals and block area information for a plurality of blocks of a fixed size of N × N. The value of the number of pixels N is generally 8 or 16.

【００３７】入力画像がたとえば人と背景の画像構成要
素からなるとすると、ブロック化回路１から出力される
ブロック領域情報は、各ブロックが人のみ、または背景
のみを画像構成要素とすること、或いは人の一部と背景
の一部を画像構成要素とすることを表す。換言すれば、
これらのブロック領域情報は、ブロック内に画像構成要
素の境界がなく画像が矩形で構成されること、或いはブ
ロック内に画像構成要素の境界があり各画像が非矩形で
構成されることを表す。Assuming that the input image is composed of, for example, image components of a person and a background, the block area information output from the blocking circuit 1 indicates that each block includes only a person or only the background as an image component, or And a part of the background as image components. In other words,
The block area information indicates that the image has a rectangular shape without a boundary of the image component in the block, or that the image has a non-rectangular shape with the boundary of the image component in the block.

【００３８】このようにブロック化回路１によってブロ
ック化されたブロック領域情報は、サポート領域修正回
路４および５，係数導出回路６ａ，並びに判定回路７に
供給される。サポート領域修正回路４および５では、Ｄ
ＣＴ基底供給回路３から供給されたＤＣＴ基底のサポー
ト領域をブロック毎に画像領域形状に応じて修正する。
サポート領域修正回路４では２つの画像領域のうち１つ
の画像領域（たとえば人とする）の領域形状に、サポー
ト領域修正回路５ではもう一方の画像領域（背景とす
る）の領域形状に対応して修正が行われる。サポート領
域修正回路４および５によるサポート領域修正方法につ
いては後述する。The block area information thus blocked by the blocking circuit 1 is supplied to the support area correction circuits 4 and 5, the coefficient derivation circuit 6a, and the determination circuit 7. In the support area correction circuits 4 and 5, D
The support area of the DCT base supplied from the CT base supply circuit 3 is corrected for each block according to the image area shape.
The support region correction circuit 4 corresponds to the region shape of one image region (for example, a person) of the two image regions, and the support region correction circuit 5 corresponds to the region shape of the other image region (for the background). Modifications are made. The support area correction method by the support area correction circuits 4 and 5 will be described later.

【００３９】また、ブロック化回路２によってブロック
化された画像信号は、係数導出回路６ａおよび６ｂおよ
び６ｃに供給される。人のみで構成されるブロックおよ
び背景のみで構成されるブロックの画像信号に対して
は、係数導出回路６ａによりＤＣＴ基底供給回路３から
供給された２次元ＤＣＴ基底を修正せずに用いて、ＤＣ
Ｔ係数の導出およびその符号化を行う。係数導出回路６
ａからの符号化データは、切替え回路８に供給される。The image signals blocked by the blocking circuit 2 are supplied to coefficient deriving circuits 6a, 6b and 6c. For the image signal of the block composed only of the person and the block composed only of the background, the two-dimensional DCT base supplied from the DCT base supply circuit 3 by the coefficient derivation circuit 6a is used without correction,
Derivation of the T coefficient and its encoding are performed. Coefficient derivation circuit 6
The encoded data from a is supplied to the switching circuit 8.

【００４０】２つの画像構成要素のそれぞれの一部が混
在するブロックの画像信号に対しては、係数導出回路６
ｂおよび６ｃにより各々サポート領域修正回路４および
５から供給される修正されたＤＣＴ基底を用いてＤＣＴ
係数の導出およびその符号化を行う。このＤＣＴ係数の
導出およびその符号化については後述する。For an image signal of a block in which a part of each of the two image components is mixed, a coefficient deriving circuit 6
DCT using the modified DCT bases supplied from support region modifying circuits 4 and 5 by b and 6c, respectively.
Derivation of coefficients and encoding thereof. The derivation of this DCT coefficient and its encoding will be described later.

【００４１】係数導出回路６ｂおよび６ｃからの符号化
データは、それぞれ組み合わせ回路９に供給される。そ
して、１つのブロックを構成する２つの画像領域を記述
する符号化データが多重された符号化データとされて、
組み合わせ回路９から切替え回路８に供給される。組み
合わせ回路９による符号化データの多重は、切替え回路
８によりブロック単位の切替えを行なうために１つのブ
ロック内の２つの画像領域の符号化データをまとめるた
めの処理であり、係数導出回路６ｂおよび６ｃからの２
つの符号化データ列を単に直列に並べるものである。The encoded data from the coefficient deriving circuits 6b and 6c are supplied to a combination circuit 9, respectively. Then, encoded data describing two image regions constituting one block is multiplexed encoded data,
The signal is supplied from the combination circuit 9 to the switching circuit 8. The multiplexing of the coded data by the combination circuit 9 is a process for combining the coded data of two image areas in one block in order to perform switching on a block basis by the switching circuit 8, and the coefficient deriving circuits 6b and 6c 2 from
One coded data sequence is simply arranged in series.

【００４２】一方、判定回路７では、ブロック化回路１
から供給された各ブロックのブロック領域情報に基づい
て、各ブロック毎に１つの画像領域（人または背景）の
みが存在するか２つの画像領域（人および背景）が混在
するかを判定する。この判定回路７からの判定信号は、
切替え回路８に供給される。On the other hand, in the judgment circuit 7, the blocking circuit 1
It is determined whether there is only one image region (person or background) or two image regions (person and background) are mixed for each block based on the block region information of each block supplied from. The determination signal from the determination circuit 7 is
It is supplied to the switching circuit 8.

【００４３】切替え回路８は判定回路７からの判定信号
に基づき、１つの画像領域のみが存在するブロックに対
しては係数導出回路６ａからの符号化データを、２つの
画像領域が混在するブロックに対しては組み合わせ回路
９からの符号化データを選択的に切替えて圧縮された直
列データとして出力する。すなわち、２つの画像領域が
混在するブロックに対しては組み合わせ回路９からの直
列に多重された符号化データを出力し、その他の１つの
画像領域から構成されるブロックに対しては係数導出回
路６ａからの重み付け線形和で表される符号化データを
出力する。The switching circuit 8 converts the encoded data from the coefficient deriving circuit 6a into a block in which two image areas coexist based on the determination signal from the determination circuit 7 for a block in which only one image area exists. On the other hand, the coded data from the combination circuit 9 is selectively switched and output as compressed serial data. That is, serially multiplexed encoded data from the combination circuit 9 is output to a block in which two image areas are mixed, and the coefficient derivation circuit 6a is output to a block composed of another one image area. And outputs encoded data represented by a weighted linear sum.

【００４４】ここで、サポート領域修正回路４および５
によるサポート領域修正の方法と、係数導出回路６ｂお
よび６ｃにおける係数導出方法について簡単に説明す
る。Here, the support area correction circuits 4 and 5
The method of correcting the support region according to the above and the method of deriving the coefficients in the coefficient deriving circuits 6b and 6c will be briefly described.

【００４５】サポート領域修正では、ＤＣＴ基底供給回
路３から供給される予め定められた大きさＮ×ＮのIn the support area correction, a predetermined size N × N supplied from the DCT base supply circuit 3 is used.

【００４６】[0046]

【外１】 [Outside 1]

【００４７】が供給される。Is supplied.

【００４８】上記ＤＣＴ直交変換基底ベクトルのＮ×Ｎ
個の各々の成分のうち、符号化対象（サポート）画像領
域に対応するＭ個の成分を保持し、その他の成分が０と
なるようにN × N of the DCT orthogonal transform base vector
Of each of the components, M components corresponding to the encoding target (support) image area are held, and the other components are set to 0.

【００４９】[0049]

【外２】 [Outside 2]

【００５０】を設定し、符号化対象画像領域で修正す
る。そして、以下に示す式（１）の演算を行ってIs set, and correction is made in the image area to be encoded. Then, the following equation (1) is calculated.

【００５１】[0051]

【外３】 [Outside 3]

【００５２】を求め、マスキングされたベクトルをノル
ムの自乗で除算することで符号化対象画像構成要素の形
状に修正された新たな変換基底ベクトルを算出し、ノル
ム補正を行う。すなわち、Then, the masked vector is divided by the square of the norm to calculate a new transformed base vector corrected to the shape of the image component to be encoded, and the norm is corrected. That is,

【００５３】[0053]

【数３】 (Equation 3)

【００５４】によりBy

【００５５】[0055]

【外４】 [Outside 4]

【００５６】が求められる。ノルム補正後の画像構成要
素の形状に修正された新たな変換基底ベクトルは、マス
キングされたベクトルの符号化対象画素に対応する各成
分をスカラー倍したものとなっている。Is required. The new transformed basis vector corrected to the shape of the image component after the norm correction is a scalar multiplication of each component of the masked vector corresponding to the encoding target pixel.

【００５７】次に、係数導出回路７および８における係
数導出法について説明する。Next, a coefficient deriving method in the coefficient deriving circuits 7 and 8 will be described.

【００５８】ここで、Here,

【００５９】[0059]

【外５】 [Outside 5]

【００６０】を示すブロック化回路２からの画像信号ベ
クトル（要素数Ｎ×Ｎ）とする。Is an image signal vector (the number of elements N × N) from the blocking circuit 2.

【００６１】まず、第１の段階では、上記画像信号ベク
トルとノルム補正後の修正されたFirst, in the first stage, the image signal vector and the corrected

【００６２】[0062]

【外６】 [Outside 6]

【００６３】との内積を計算してスカラー関数The scalar function is calculated by calculating the inner product of

【００６４】[0064]

【数４】 (Equation 4)

【００６５】に変換して修正されたConverted to modified

【００６６】[0066]

【外７】 [Outside 7]

【００６７】を求め、これらの係数の中から次式により
絶対値が最大の係数ｃ_k を選択する。Then, the coefficient c _k having the maximum absolute value is selected from these coefficients by the following equation.

【００６８】[0068]

【数５】 (Equation 5)

【００６９】次に、第２の段階では、絶対値最大の係数
ｃ_k に対応するNext, in the second stage, the coefficient corresponding to the coefficient c _k having the maximum absolute value is determined.

【００７０】[0070]

【外８】 [Outside 8]

【００７１】次式のとおり絶対値が最大の係数ｃ_k と絶
対値が最大の係数ｃ_k に対応する上記ＤＣＴ直交変換基
底ベクトルとの積の成分を上記画像信号ベクトルから減
じて除去することで残差信号ベクトルを求め、これを画
像信号ベクトルと置き換える。[0071] The components of the product of the DCT orthogonal transform basis vector absolute value maximum coefficient c _k and the absolute value of the following equation corresponding to the maximum of the coefficient c _k by removing subtracted from the image signal vector A residual signal vector is obtained, and this is replaced with an image signal vector.

【００７２】[0072]

【数６】 (Equation 6)

【００７３】次に、第３の段階では、第２の段階におい
て求めた上記の残差信号ベクトルで置き換えられた画像
信号ベクトルと前記したノルム補正後の修正された新た
な変換基底ベクトルとの内積を第１の段階と同様に再び
計算してスカラー関数に変換して、この関数の係数のう
ちから第１の段階と同様に新たな絶対値最大の係数を選
択する。Next, in the third step, the inner product of the image signal vector replaced with the above-mentioned residual signal vector obtained in the second step and the new transformed base vector corrected after the above-mentioned norm correction is obtained. Is calculated again in the same manner as in the first step, is converted into a scalar function, and a new coefficient having a maximum absolute value is selected from the coefficients of this function as in the first step.

【００７４】次に、第４の段階では、第３の段階におい
て選択された絶対値が最大の係数ｃ_k とこの係数に対応
するＤＣＴ直交変換基底ベクトルとの積の成分を画像信
号ベクトルから減じて除去することで第２の段階と同様
に再び新たな残差信号ベクトルを求め、画像信号ベクト
ルと置き換える。Next, in the fourth stage, the component of the product of the coefficient c _k having the largest absolute value selected in the third stage and the DCT orthogonal transform base vector corresponding to this coefficient is subtracted from the image signal vector. Then, a new residual signal vector is obtained again in the same manner as in the second stage, and is replaced with an image signal vector.

【００７５】次に、第５の段階では、第４の段階におい
て求められた残差信号ベクトル、すなわち画像信号ベク
トルに基づいて、残差信号ベクトルによる残差信号電力
が予め十分小さな値に定められた閾値以下であるかを評
価し、閾値以下であればこのときの修正された新たな変
換基底ベクトルの係数を採用し、重み付け線形和として
符号化して変換出力する。ここで残差信号電力の評価は
符号化対象画像構成要素分だけとする。すなわち、第２
の段階での処理はＮ×Ｎの要素に対して施されるが、目
的は符号化対象のＭ個の画素のレベルを表現することに
あるので、誤差の評価は残差信号ベクトルの要素のう
ち、このＭ個に対応する要素の自乗和で行なう。Next, in the fifth stage, the residual signal power based on the residual signal vector is previously set to a sufficiently small value based on the residual signal vector obtained in the fourth stage, that is, the image signal vector. It evaluates whether it is equal to or less than the threshold value, and if it is equal to or less than the threshold value, adopts the corrected coefficient of the new transformed base vector at this time, encodes it as a weighted linear sum, and outputs the result. Here, the evaluation of the residual signal power is performed only for the image component to be coded. That is, the second
Is performed on N × N elements, but the purpose is to represent the level of M pixels to be coded. Therefore, the evaluation of the error is based on the elements of the residual signal vector. Of these, the sum of the squares of the elements corresponding to the M elements is used.

【００７６】一方、残差信号電力が十分小さな閾値以下
になっていないと判定されたときは再び第３の段階に戻
って処理を続行し、第５の段階において残差信号電力が
予め十分小さな値に定められた閾値以下であると評価さ
れるまで、第３の段階と第４の段階と第５の段階の処理
を繰り返し実行する。On the other hand, when it is determined that the residual signal power is not below the sufficiently small threshold value, the process returns to the third stage to continue the processing, and in the fifth stage, the residual signal power is previously sufficiently small. The processes of the third stage, the fourth stage, and the fifth stage are repeatedly executed until it is evaluated that the difference is equal to or smaller than the threshold value determined as the value.

【００７７】なお、各変換基底ベクトルは直交していな
いので、一度除かれた成分が他の成分の除去に伴って再
び現れることもある。この場合、同一変換基底の係数を
累積し、その結果を最終的に採用する。Note that since the transformed base vectors are not orthogonal, the component once removed may reappear with the removal of other components. In this case, the coefficients of the same transformation basis are accumulated, and the result is finally adopted.

【００７８】このように、ＲＳ−ＤＣＴ部１０によれ
ば、比較的小さなハードウェア規模で、高能率かつ画像
構成要素単位で画像信号のハンドリングを行うことがで
きる。As described above, according to the RS-DCT unit 10, it is possible to handle an image signal in units of image components with high efficiency on a relatively small hardware scale.

【００７９】ＲＳ−ＤＣＴ部１０により導出された（Ｎ
×Ｎ）個の変換係数は、乗算器１２に供給され、(N) derived by the RS-DCT unit 10
× N) transform coefficients are supplied to the multiplier 12,

【００８０】[0080]

【外９】 [Outside 9]

【００８１】量子化器１４に供給され、量子化インデッ
クス信号が出力される。出力されたインデックス信号は
可変長符号化器１６に供給されて２値の符号化データ列
に変換され、通信メディアや記録メディア等の伝送メデ
ィアを介して伝送される。The quantization index signal is supplied to the quantizer 14 and is output. The output index signal is supplied to the variable length encoder 16 and converted into a binary encoded data string, and transmitted via a transmission medium such as a communication medium or a recording medium.

【００８２】このように符号化すると、１個の符号化対
象画素の量子化雑音電力は量子化前のスカラー倍の係数
にかかわらずＰｅで表され、量子化雑音電力によるブロ
ック全体の符号化歪みはＭ×Ｐｅで表される。With this encoding, the quantization noise power of one pixel to be encoded is represented by Pe regardless of the scalar-multiplied coefficient before quantization, and the coding distortion of the entire block due to the quantization noise power is obtained. Is represented by M × Pe.

【００８３】伝送メディアからの符号化データ列は、図
２に示す画像復号化装置により復号される。The encoded data sequence from the transmission medium is decoded by the image decoding device shown in FIG.

【００８４】この復号処理において、符号化データ列が
可変長復号化器２０に供給され、復号量子化インデック
スが出力される。出力された復号量子化インデックスは
逆量子化器２２に供給され、逆量子化変換係数が出力さ
れる。この逆量子化変換係数はIn this decoding process, the coded data sequence is supplied to the variable length decoder 20, and the decoded quantization index is output. The output decoding quantization index is supplied to the inverse quantizer 22, and the inverse quantization transform coefficient is output. This inverse quantization transform coefficient is

【００８５】[0085]

【外１０】 [Outside 10]

【００８６】つまり、符号化画素数Ｍの平方根に逆比例
した値でスカラー倍される。That is, scalar multiplication is performed with a value inversely proportional to the square root of the number M of encoded pixels.

【００８７】ここで、各画像構成要素（オブジェクト）
が画面内で占める領域を表す領域情報（境界情報）は既
知であり、各ブロック内の符号化対象画素数ＭはＤＣＴ
係数を導出する際に既に求められているものを用いる。Here, each image component (object)
Is known, and the number M of pixels to be encoded in each block is a DCT.
When deriving the coefficients, those already obtained are used.

【００８８】スカラー倍された逆量子化変換係数はＮ×
Ｎの２次元逆ＤＣＴ部２６に供給され、（Ｎ×Ｎ）個の
画素信号レベルが復元される。このように、The inversely quantized transform coefficient multiplied by scalar is N ×
The signal is supplied to the N two-dimensional inverse DCT unit 26, and (N × N) pixel signal levels are restored. in this way,

【００８９】[0089]

【外１１】 [Outside 11]

【００９０】復元された画素信号のブロック全体での量
子化雑音電力による歪みは、The distortion due to the quantization noise power in the entire block of the restored pixel signal is as follows.

【００９１】[0091]

【数７】 (Equation 7)

【００９２】となる。Is obtained.

【００９３】復元された（Ｎ×Ｎ）個の画素信号は選択
器２８に供給される。選択器２８においては、別途伝送
された当該ブロックにおける画像領域を示すマスクパタ
ーン２９を用いて、（Ｎ×Ｎ）個のブロック内全画素か
らブロックの画像領域内に対応するＭ個の画素を選択
し、出力する。この結果、ブロック全体での最終的な歪
み量はThe restored (N × N) pixel signals are supplied to the selector 28. The selector 28 selects M pixels corresponding to the image area of the block from all the pixels in the (N × N) blocks using the mask pattern 29 indicating the image area in the block separately transmitted. And output. As a result, the final distortion amount for the entire block is

【００９４】[0094]

【数８】 (Equation 8)

【００９５】となる。Is obtained.

【００９６】従って、画素単位での量子化雑音による符
号化歪みの量は符号化対象画素数Ｍに逆比例するので、
画素単位での最終的な歪み量は、ＲＳ−ＤＣＴにおける
符号化対象画素数Ｍとの比例性を相殺し、Ｍに依存する
ことなく一律にＰｅとなり、画素単位の歪み量を均一に
最適化することができる。Therefore, the amount of coding distortion due to quantization noise in pixel units is inversely proportional to the number M of pixels to be coded.
The final distortion amount in pixel units cancels out the proportionality with the number M of encoding target pixels in RS-DCT, and becomes Pe independently of M, thereby uniformly optimizing the distortion amount in pixel units. can do.

【００９７】このように、画像符号化装置において量子
化器の前に乗算器１２を設けてAs described above, the multiplier 12 is provided before the quantizer in the image encoding apparatus.

【００９８】[0098]

【外１２】 [Outside 12]

【００９９】復号した画素単位の歪み量が均一となるよ
うに情報量割り当てを最適化して、伝送する情報量を一
定とした場合の復号画像の総歪み量を最小とすることが
でき、より高能率な符号化データの伝送を実現できる効
果がある。The information amount allocation is optimized so that the amount of distortion in the decoded pixel unit becomes uniform, and the total amount of distortion of the decoded image when the amount of information to be transmitted is constant can be minimized. There is an effect that efficient transmission of encoded data can be realized.

【０１００】（他の実施の形態）上記実施の形態におけ
る各乗算器１２，２４による処理はスカラー倍であり、
これは線形変換に対して不変である。従って、乗算器に
よるこのスカラー倍を、画素信号レベルに対して行って
も、上記実施の形態において変換係数に対して行ったの
と等価になる。(Other Embodiments) The processing by the multipliers 12 and 24 in the above embodiment is a scalar multiplication.
It is invariant to linear transformation. Therefore, even when the scalar multiplication by the multiplier is performed on the pixel signal level, it is equivalent to the scalar multiplication performed on the transform coefficient in the above embodiment.

【０１０１】そこで、符号化装置においてブロック化回
路２の入力または出力に乗算器を設け、ここでの画素信
号レベルに対して図１の乗算器１２と同様のスカラー倍
を行い、復号化装置では２次元逆ＤＣＴ部２６の出力に
乗算器を設け、ここでの画素信号レベルに対して図２の
乗算器２４と同様のスカラー倍を行っても第１の実施の
形態と同様の効果を得ることができる。Therefore, a multiplier is provided at the input or output of the blocking circuit 2 in the encoding device, and the pixel signal level is multiplied by the same scalar as that of the multiplier 12 in FIG. A multiplier is provided at the output of the two-dimensional inverse DCT unit 26, and the same effect as in the first embodiment can be obtained by performing a scalar multiplication on the pixel signal level in the same manner as the multiplier 24 in FIG. be able to.

【０１０２】また、乗算器を設けることなく、量子化雑
音電力が量子化ステップ幅の二乗に比例して増加するこ
とを利用して画素単位の歪み量を均一に最適化し、第１
の実施の形態と同様の効果を得るように構成することも
できる。Also, without providing a multiplier, the amount of distortion for each pixel is optimized uniformly by utilizing the fact that the quantization noise power increases in proportion to the square of the quantization step width.
It is also possible to configure so as to obtain the same effect as that of the embodiment.

【０１０３】すなわち、符号化装置において量子化器１
４の量子化ステップ幅をＲＳ−ＤＣＴ部１０における符
号化対象画素数Ｍの平方根に逆比例した値でスカラー倍
して修正し、さらに、復号化装置では逆量子化器２２の
量子化ステップ幅を符号化対象画素数Ｍの平方根に比例
した値でスカラー倍して修正することによっても等価な
効果が得られる。That is, in the encoding device, the quantizer 1
4 is corrected by scalar multiplication by a value inversely proportional to the square root of the number M of pixels to be encoded in the RS-DCT unit 10, and further, the quantization step width of the inverse quantizer 22 in the decoding device. An equivalent effect can also be obtained by correcting scalar by a value proportional to the square root of the number M of pixels to be encoded.

【０１０４】ただし、量子化処理は乗算とは逆の除算な
ので、符号化装置における量子化ステップ幅は、第１の
実施の形態においてHowever, since the quantization process is a division opposite to the multiplication, the quantization step width in the encoding device is the same as that in the first embodiment.

【０１０５】[0105]

【外１３】 [Outside 13]

【０１０６】を用いてスカラー倍する必要がある。It is necessary to perform scalar multiplication by using

【０１０７】[0107]

【発明の効果】以上説明してきたように、本発明によれ
ば、複数の構成要素からなる画像の境界情報に基づく符
号化処理を最適化した符号化装置から出力される符号化
データを復号化装置により復号化するときに、復号画像
の総歪み量を最小とするように復号化処理を最適化する
ことができる効果が得られる。As described above, according to the present invention, it is possible to decode encoded data output from an encoding device that has optimized encoding based on boundary information of an image composed of a plurality of components. When decoding is performed by the device, an effect is obtained that the decoding process can be optimized so as to minimize the total distortion amount of the decoded image.

[Brief description of the drawings]

【図１】本発明を適用した画像符号化装置の第１の実施
の形態の構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a first embodiment of an image encoding device to which the present invention has been applied.

【図２】本発明を適用した画像復号化装置の第１の実施
の形態の構成を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration of a first embodiment of an image decoding device to which the present invention has been applied.

[Explanation of symbols]

１，２ブロック化回路３ＤＣＴ基底供給回路４，５サポート領域修正回路６ａ，６ｂ，６ｃ係数導出回路７判定回路８切替え回路９組み合わせ回路１０ＲＳ−ＤＣＴ部１２，２４乗算器１４量子化器１６可変長符号化器２０可変長復号化器２２逆量子化器２６２次元逆ＤＣＴ部２８選択器２９マスクパターン 1, 2 Blocking circuit 3 DCT base supply circuit 4, 5 Support area correction circuit 6a, 6b, 6c Coefficient derivation circuit 7 Judgment circuit 8 Switching circuit 9 Combination circuit 10 RS-DCT unit 12, 24 Multiplier 14 Quantizer 16 Variable length encoder 20 Variable length decoder 22 Dequantizer 26 Two-dimensional inverse DCT unit 28 Selector 29 Mask pattern

───────────────────────────────────────────────────── フロントページの続き (72)発明者田中豊東京都世田谷区砧一丁目10番11号日本放送協会放送技術研究所内 ──────────────────────────────────────────────────続き Continued on front page (72) Inventor Yutaka Tanaka 1-10-11 Kinuta, Setagaya-ku, Tokyo Japan Broadcasting Corporation Broadcasting Research Institute

Claims

[Claims]

1. An image composed of a plurality of constituent elements is divided into a plurality of blocks each having a predetermined size and composed of a predetermined number of pixels.
A pixel signal vector corresponding to at least one image region composed of a single component included in the block is obtained by weighting a linear sum of orthogonal transformation base vectors derived for each image region based on boundary information of the component. In an image encoding apparatus comprising: a conversion unit that converts and outputs as; and a quantization unit that quantizes a signal from the conversion unit, a weighted conversion coefficient from the conversion unit is provided for each of the image regions included in the block. An image encoding apparatus, comprising: an optimizing means for optimizing an encoding process by multiplying a scalar by a value proportional to the square root of the number of pixels and inputting the result to the quantizing means.

2. An image composed of a plurality of constituent elements is divided into a plurality of blocks of a predetermined size composed of a predetermined number of pixels,
A pixel signal vector corresponding to at least one image region composed of a single component included in the block is obtained by weighting a linear sum of orthogonal transformation base vectors derived for each image region based on boundary information of the component. In an image encoding apparatus comprising: a conversion unit that converts and outputs a signal from the conversion unit; and a quantization unit that quantizes a signal from the conversion unit, the signal level of the pixel signal vector is set for each of the image regions included in the block. An image encoding apparatus, comprising: an optimizing means for optimizing an encoding process by multiplying a scalar by a value proportional to the square root of the number of pixels and inputting the result to the conversion means.

3. An image composed of a plurality of constituent elements is divided into a plurality of blocks of a predetermined size composed of a predetermined number of pixels,
A pixel signal vector corresponding to at least one image region composed of a single component included in the block is obtained by weighting a linear sum of orthogonal transformation base vectors derived for each image region based on boundary information of the component. In an image encoding apparatus comprising: a conversion unit configured to convert and output as; and a quantization unit configured to quantize a signal from the conversion unit, wherein the quantization unit determines the number of pixels for each of the image regions included in the block. An image coding apparatus characterized in that the image coding apparatus is a quantization means for optimizing a coding process by scalar multiplying a quantization step width by a value inversely proportional to a square root.

4. The orthogonal transformation basis vector masks a predetermined component of a discrete cosine transformation basis vector for each image region based on boundary information of the component, and performs scalar multiplication to newly generate a component for each image region. 4. The derived orthogonal transformation basis vector.
The image encoding device according to any one of the above.

5. An image output from the image encoding apparatus according to claim 1, wherein at least one image area composed of a single component included in a block composed of a predetermined number of pixels and divided by a predetermined size. Inverse quantization means for inversely quantizing the encoded data obtained by encoding the corresponding pixel signal vector to obtain a weighting coefficient, and an orthogonal transform base vector derived for each image region based on the boundary information of the components. An inverse decoding means for performing a weighted linear addition and decoding a pixel signal vector corresponding to the image area, wherein the weighting coefficient obtained by the inverse quantization is the image included in the block. For each region, the decoding process is optimized by scalar multiplication by a value inversely proportional to the square root of the number of pixels and weighted linear addition of the orthogonal transform base vectors. Image decoding apparatus comprising the means.

6. An image encoding apparatus according to claim 2, wherein at least one image area composed of a single component included in a block composed of a predetermined number of pixels and divided by a predetermined size is output. Inverse quantization means for inversely quantizing the encoded data obtained by encoding the corresponding pixel signal vector to obtain a weighting coefficient, and an orthogonal transform base vector derived for each image region based on the boundary information of the components. A weighted linear addition, and an inverse transform unit for decoding a pixel signal vector corresponding to the image region, wherein the output of the inverse transform unit is calculated for each image region included in the block. Optimizing means for optimizing a decoding process by scalar multiplying by a value inversely proportional to the square root of the number of pixels and decoding a pixel signal vector corresponding to the image area; Image decoding apparatus characterized by.

7. An image output from the image encoding apparatus according to claim 3, wherein at least one image area including a single component included in a block composed of a predetermined number of pixels and divided by a predetermined size is included. Inverse quantization means for inversely quantizing the encoded data obtained by encoding the corresponding pixel signal vector to obtain a weighting coefficient, and an orthogonal transform base vector derived for each image region based on the boundary information of the components. A weighted linear addition, and an inverse transform unit for decoding a pixel signal vector corresponding to the image region, wherein the inverse quantization unit is configured for each of the image regions included in the block. An image decoding device, characterized in that it is an inverse quantization means for scalar multiplying a quantization step width by a value proportional to the square root of the number of pixels to optimize decoding processing.

8. The orthogonal transform basis vector masks a predetermined component of a discrete cosine transform basis vector for each of the image regions based on boundary information of the components, and performs scalar multiplication to newly generate a discrete component for each of the image regions. 8. The derived orthogonal transformation basis vector, wherein:
The image decoding device according to any one of the above.