JP2009049969A

JP2009049969A - Device and method of coding moving image and device and method of decoding moving image

Info

Publication number: JP2009049969A
Application number: JP2007279783A
Authority: JP
Inventors: Masashi Takahashi; 昌史高橋; Tomokazu Murakami; 智一村上
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2007-07-25
Filing date: 2007-10-29
Publication date: 2009-03-05

Abstract

<P>PROBLEM TO BE SOLVED: To solve the problem that, in the intra-screen predictive coding according to the H.264/AVC, adjacent encoded pixels are used to perform prediction of a target block, so that the prediction needs to be performed by referencing spatially-separate pixels, as such, the prediction accuracy is not always sufficient. <P>SOLUTION: Adjacent pixels are always used to perform the intra-screen predictive coding and decoding. In the coding, the difference value (prediction residue) between the adjacent pixels is calculated in a certain direction (predicting direction) and encoded with the predicting direction. In the coding of each block, adjacent pixels may have not been encoded yet. In that case, the difference value between adjacent pixels in an original image is encoded. On the other hand, in the decoding, processing to add a value produced by decoding the previous pixel value and the prediction residue is repeated in a predicting direction to obtain a decoded image. This can perform highly-efficient coding processing using strong correlation between adjacent pixels. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は動画像を符号化及び復号化する動画像符号化技術に関する。 The present invention relates to a moving image encoding technique for encoding and decoding a moving image.

大容量の動画像情報をデジタルデータ化して記録、伝達する手法として、MPEG (Moving
Picture Experts Group)方式等の符号化方式が策定され、MPEG-1規格、MPEG-2規格、MPE
G-4規格、H.264/AVC(Advanced Video Coding)規格等として国際標準の符号化方式とな
っている。これらの方式はデジタル衛星放送やDVD、携帯電話やデジタルカメラなどにお
ける符号化方式として採用され、現在ますます利用の範囲が広がり、身近なものとなって
きている。 MPEG (Moving) is a method for recording and transmitting large volumes of moving image information as digital data.
Coding schemes such as the Picture Experts Group) were formulated, and the MPEG-1 standard, MPEG-2 standard, MPE
It is an international standard encoding method such as G-4 standard, H.264 / AVC (Advanced Video Coding) standard, and the like. These systems have been adopted as encoding systems for digital satellite broadcasting, DVDs, mobile phones, digital cameras, and the like, and the range of use is now expanding and becoming familiar.

これらの規格では、符号化処理が完了した画像情報を利用して符号化対象画像をブロッ
ク単位で予測し、原画像との予測差分を符号化することによって、動画像の持つ冗長性を
除いて符号量を減らしている。特にH.264/AVCでは、符号化対象ブロックの周辺画素を利
用する画面内予測符号化方式を採用することにより、劇的な圧縮率の向上を実現した。 In these standards, the encoding target image is predicted in block units using the image information that has been encoded, and the prediction difference from the original image is encoded, thereby eliminating the redundancy of the moving image. The code amount is reduced. In particular, H.264 / AVC achieved a dramatic improvement in compression rate by adopting an intra-screen predictive encoding method that uses peripheral pixels of the encoding target block.

しかし、H.264/AVCにおける画面内予測の場合、対象ブロックに隣接する画素情報しか
参照することができず、予測精度が十分であるとは言えなかった。そのため、画面内予測
の精度を向上させて圧縮率を上げるための画面内符号化技術が求められるようになった。
以上に述べた背景から、画面内予測に利用可能な画素の種類を増やすことによって画面
内予測の精度を向上させる技術が多数提案されている。 However, in the case of intra prediction in H.264 / AVC, only pixel information adjacent to the target block can be referred to, and it cannot be said that the prediction accuracy is sufficient. Therefore, an intra-screen coding technique for improving the accuracy of intra-screen prediction and increasing the compression rate has been demanded.
From the background described above, many techniques for improving the accuracy of intra prediction by increasing the types of pixels that can be used for intra prediction are proposed.

例えば、特許文献１では、画像全体を反転してから符号化することを可能にすることに
よって、符号化処理の順序を変更して画面内予測に利用可能な画素の種類を増やしている
。
また、特許文献２では、対象ブロックの予測に利用するブロックを同一画面内の符号化
済み領域の中から探索することによって、離れたブロックを参照して予測を行うことを可
能にしている。 For example, in Patent Document 1, the entire image can be inverted before encoding, thereby changing the encoding processing order and increasing the types of pixels that can be used for intra prediction.
Further, in Patent Document 2, it is possible to perform prediction with reference to a distant block by searching for a block used for prediction of a target block from an encoded region in the same screen.

また、特許文献３には、入力ブロックを少なくとも２つ以上の領域に分離して、分離さ
れた領域のうち、第１領域の画素に対して周辺ブロックの画素を利用してイントラ予測符
号化を行い、イントラ予測符号化された第１領域の画素を復元した後、復元された第１領
域の画素を利用して、複数個の予測モードのうち少なくとも一つ以上の予測モードによっ
て第２領域の画素を予測する段階を含むことを特徴とするイントラ予測符号化方法が開示
されている。 In Patent Document 3, the input block is separated into at least two regions, and intra prediction coding is performed using pixels in the peripheral block for pixels in the first region among the separated regions. Performing the intra-prediction encoded first region pixels, and using the restored first region pixels, the second region pixels may be stored in the second region according to at least one of the plurality of prediction modes. An intra-prediction encoding method including a step of predicting a pixel is disclosed.

また、特許文献４には、入力ブロックを少なくとも２つ以上の領域に分離し、分離され
た領域のうち、第１領域の画素に対して周辺ブロックの画素を利用してイントラ予測を行
った後、分離された領域のうち、第２領域の所定の画素周辺に位置した第１領域の画素の
空間的特性を決定し、決定された空間的特性に基づいて第２領域の画素の予測のための第
１領域の参照画素を決定し、決定された第１領域の参照画素に基づいて第２領域の画素の
予測を行う映像のイントラ予測符号化方法が開示されている。 Further, in Patent Document 4, after the input block is separated into at least two regions, and intra prediction is performed on the pixels in the first region among the separated regions using the pixels in the peripheral blocks. The spatial characteristics of the pixels in the first area located around the predetermined pixels in the second area of the separated areas are determined, and the pixels in the second area are predicted based on the determined spatial characteristics. A video intra prediction encoding method is disclosed in which a reference pixel of the first region is determined, and a pixel of the second region is predicted based on the determined reference pixel of the first region.

また、非特許文献１には、画質歪みと符号量の関係から最適な符号化モードを決定する
RD-Optimization方式が記載されている。
特開２００６−３５２１８１号公報特開２００７−４３６５１号公報特開２００７−７４７２５号公報特開２００７−７４７２５号公報 G. Sullivan and T.Wiegand:Rate-Distortion Optimization for VｉｄｅｏＣｏｍｐｒｅｓｓｉｏｎ”，ＩＥＥＥＳｉｇｎａｌＰｒｏｃｅｓｓｉｎｇＭａｇａｚｉｎｅ，ｖｏｌ．１５，ｎｏ．６，ｐｐ．７４−９０，１９９８． In Non-Patent Document 1, an optimal encoding mode is determined from the relationship between image quality distortion and code amount.
RD-Optimization method is described.
JP 2006-352181 A JP 2007-43651 A JP 2007-74725 A JP 2007-74725 A G. Sullivan and T. Wiegand: Rate-Distortion Optimization for Video Compression ", IEEE Signal Processing Magazine, vol.15, no.6, pp.74-90, 1998.

しかし、上記の文献を含む既存の画面内予測符号化技術は、どれも符号化済みの画素を
復号化したものを用いて予測を行っているため、ブロック単位で画面内予測符号化を行う
場合には、空間的に離れた画素を参照して予測する必要があった。この場合、予測対象画
素と被参照画素の距離が大きくなることにより、画素間の相関性が低下して予測精度が落
ちるといった問題があった。 However, since all the existing intra prediction encoding techniques including the above documents perform prediction using decoded pixels, the intra prediction encoding is performed on a block basis. Therefore, it was necessary to predict with reference to spatially separated pixels. In this case, since the distance between the prediction target pixel and the referenced pixel is increased, there is a problem that the correlation between the pixels is reduced and the prediction accuracy is lowered.

H.264/AVCにおける画面内予測符号化では、符号化済みの隣接画素を利用して対象ブロ
ックの予測を行うため、空間的に離れた画素を参照して予測を行う必要があり、予測精度
が十分であるとは言えなかった。 In the intra prediction encoding in H.264 / AVC, since the target block is predicted using the encoded adjacent pixels, it is necessary to perform prediction with reference to spatially separated pixels, and the prediction accuracy Was not enough.

そこで、本発明は常に隣接する画素を参照して予測を行うことによって、予測精度を改
善する技術を提供する。本発明の目的は、常に画素間相関の高い隣接画素を参照して予測
を行うことによって、画面内予測符号化の効率性を高めることにある。 Therefore, the present invention provides a technique for improving prediction accuracy by always performing prediction with reference to adjacent pixels. An object of the present invention is to improve the efficiency of intra prediction encoding by always performing prediction with reference to adjacent pixels having high inter-pixel correlation.

本発明では、画面内予測符号化および復号化を行う際には、常に隣接する画素を利用す
る。符号化時には、一定の方向（予測方向）に沿って隣接画素の差分値（予測差分）を計
算し、予測方向とともに符号化する。この時、ブロック単位で符号化する場合には、隣接
する画素が未符号化状態である可能性があるが、この場合は原画像における隣接画素間の
差分値を符号化する。一方、復号化時には、直前の画素値を復号した値に予測差分を加算
する処理を予測方向に沿って繰り返すことによって、復号画像を取得する。これにより、
隣接する画素間の強い相関性を利用して効率性の高い符号化処理を行うことができる。 In the present invention, adjacent pixels are always used when performing intra prediction encoding and decoding. At the time of encoding, a difference value (prediction difference) between adjacent pixels is calculated along a certain direction (prediction direction) and encoded together with the prediction direction. At this time, when encoding is performed in block units, adjacent pixels may be in an uncoded state. In this case, a difference value between adjacent pixels in the original image is encoded. On the other hand, at the time of decoding, a decoded image is acquired by repeating the process of adding the prediction difference to the value obtained by decoding the immediately preceding pixel value along the prediction direction. This
A highly efficient encoding process can be performed using a strong correlation between adjacent pixels.

本発明によれば、少ない符号量で高画質の映像を提供するための動画像符号化技術およ
び復号化技術を提供することができる。本発明は次世代の画像符号化規格に向けたもので
あり、例えば本技術による予測符号化技術を従来技術と組み合わせて利用することにより
、高い圧縮率を実現することができる。例えば、本技術とH.264/AVCによる画面内予測符
号化技術をブロック単位で使い分けることにより、画像の性質に適した符号化を行うこと
が可能になる。また、本発明は高ビットレートでの符号化、特にロスレス符号化に対して
特に高い効果がある。 ADVANTAGE OF THE INVENTION According to this invention, the moving image encoding technique and decoding technique for providing a high quality image | video with a small code amount can be provided. The present invention is directed to a next-generation image coding standard. For example, a high compression ratio can be realized by using a predictive coding technique according to the present technique in combination with a conventional technique. For example, by appropriately using the present technology and the intra-screen predictive coding technology based on H.264 / AVC in units of blocks, it is possible to perform coding suitable for the properties of the image. Further, the present invention is particularly effective for encoding at a high bit rate, particularly lossless encoding.

以下、本発明の実施例を、図面を参照して説明する。 Embodiments of the present invention will be described below with reference to the drawings.

図３は、従来技術であるH.264/AVCによる画面内予測符号化処理の動作について概念的
に示したものである。H.264/AVCでは、符号化対象画像に対してラスタースキャンの順序
に従って符号化を実行し(301)、符号化済み画像領域の復号化画像を用いて予測符号化を
行う。 FIG. 3 conceptually shows the operation of the intra-screen predictive encoding process based on H.264 / AVC, which is a conventional technique. In H.264 / AVC, encoding is performed on an encoding target image according to the order of raster scanning (301), and predictive encoding is performed using a decoded image in an encoded image region.

そのため、画面内予測符号化処理の動作例(302)では、予測方向ベクトルを傾きとする
同一直線上の画素はすべて同一画素から予測されることになり、例えば(303)に示すよう
に、符号化対象ブロックの画素B、C、D、Eはすべて同一画素Aを参照して予測符号化が行
われ、画素Aを復号化した値との差分(予測差分)が計算される。さらに、縦、横、斜めな
ど、数種類の予測方向候補の中から最適なものをブロック単位で選択することができ(304
)、予測差分と予測方向の値が符号化される。 Therefore, in the operation example (302) of the intra prediction encoding process, all the pixels on the same straight line with the prediction direction vector as the gradient are predicted from the same pixel. For example, as shown in (303) All the pixels B, C, D, and E of the conversion target block are subjected to predictive encoding with reference to the same pixel A, and a difference (prediction difference) from a value obtained by decoding the pixel A is calculated. In addition, the optimal prediction direction can be selected from several types of prediction direction candidates such as vertical, horizontal, and diagonal (block 304).
), The prediction difference and the value of the prediction direction are encoded.

一方、復号化処理は復号化済みの被参照画素と予測差分を用いて上記処理の逆手順を踏
めばよく、予測差分を予測方向に沿って非参照画素と足し合わせることによって復号化画
像を取得する(305)。例えば(306)では、復号化対象ブロック上の画素B、C、D、Eはすべて
復号化済みの被参照画素Aと予測差分の和によって復号化される。以上のように、H.264/A
VCによる画面内予測符号化処理では、被参照画素を符号化済みのものに限定するため、空
間的に離れた画素を参照する必要があり（例えば画素Eは離れた画素Aを参照して予測符号
化が行われている。）、離れた画素間では相関性が低下して予測精度が十分ではないとい
った問題があった。 On the other hand, in the decoding process, the reverse process of the above process may be performed using the decoded reference pixel and the prediction difference, and the decoded image is obtained by adding the prediction difference to the non-reference pixel along the prediction direction. (305). For example, in (306), all the pixels B, C, D, and E on the decoding target block are decoded by the sum of the decoded reference pixel A and the prediction difference. As above, H.264 / A
In the intra-screen predictive encoding process using VC, it is necessary to refer to pixels that are spatially separated in order to limit the referenced pixels to those that have already been encoded (for example, pixel E is predicted with reference to distant pixel A) Encoding has been performed.) However, there is a problem in that the correlation is lowered between distant pixels and the prediction accuracy is not sufficient.

図４は、本発明による画面内予測符号化処理の動作について概念的に示した図である。
この場合も符号化対象画像に対してラスタースキャンの順序に従って符号化を行う(401)
が、本発明では上記の問題を解決するために、符号化処理が完了していなくても空間的に
近い画素を参照して予測を行うことを可能にした。例えば(402)では、常に隣接する画素
を参照して縦方向に予測を行っている。 FIG. 4 is a diagram conceptually showing the operation of the intra prediction encoding process according to the present invention.
In this case as well, the encoding target image is encoded according to the raster scan order (401).
However, in the present invention, in order to solve the above problem, it is possible to perform prediction by referring to spatially close pixels even if the encoding process is not completed. For example, in (402), prediction is always performed in the vertical direction with reference to adjacent pixels.

この場合、未符号化領域を参照する際には、従来技術のように復号化済みの画素情報を
利用することができないため、同じ座標に位置する原画像の値を用いて予測を行う。例え
ば(403)では、画素C、D、Eはそれぞれ、未符号化の画素B、C、Dを用いて予測が行われて
おり、これら未符号化の画素に対しては原画像における上下に位置する画素間の差分（B-
A=b、C-B=ｃ、D-C=ｄ、E-D=e）を計算する。さらに本発明に対しても様々な予測方向を定
義することができ(404)、予測差分と予測方向の値を符号化する。 In this case, when referring to an unencoded area, the decoded pixel information cannot be used as in the prior art, and therefore prediction is performed using the value of the original image located at the same coordinates. For example, in (403), pixels C, D, and E are predicted using unencoded pixels B, C, and D, respectively. Difference between pixels in position (B-
A = b, CB = c, DC = d, ED = e). Furthermore, various prediction directions can be defined for the present invention (404), and the prediction difference and the value of the prediction direction are encoded.

復号化の際には符号化時とは異なり予測処理に復号化画像のみを利用し、予測方向に沿
って直前の座標に位置する復号画像に予測差分を足し合わせる処理を繰り返すことにより
復号画像を取得できる(405)。例えば(406)では、画素B、C、D、Eはそれぞれ、画素A、B、
C、Dの復号画像（A’、B’、C’、D’）に対して予測差分(b’、c’、d’、e’)を加算す
ることによって復号画像を取得できる。 When decoding, unlike the case of encoding, only the decoded image is used for the prediction process, and the decoded image is repeated by repeating the process of adding the prediction difference to the decoded image located at the immediately preceding coordinate along the prediction direction. Can be obtained (405). For example, in (406), pixels B, C, D, E are pixels A, B,
A decoded image can be acquired by adding the prediction difference (b ′, c ′, d ′, e ′) to the decoded image of C, D (A ′, B ′, C ′, D ′).

本発明の場合、符号化時に予測差分に対して量子化処理を施せば、量子化誤差が蓄積し
てしまう恐れがある。そのため、本発明を利用すれば予測精度が向上して予測差分は減少
するが、量子化誤差が増加するという事態が発生する場合がある。こういった場合、従来
技術を用いて予測符号化を行った方が、符号化効率が高くなる可能性がある。 In the case of the present invention, if the quantization process is performed on the prediction difference at the time of encoding, a quantization error may be accumulated. For this reason, if the present invention is used, the prediction accuracy is improved and the prediction difference is reduced, but there may be a case where the quantization error increases. In such a case, encoding efficiency may be higher when predictive encoding is performed using the conventional technique.

そのため、本発明による予測符号化技術を従来技術と組み合わせて利用することにより
、画像の性質に応じて高い圧縮率を実現することができる。例えば、本技術と従来技術を
ブロック単位で使い分けて予測符号化することにより、画像の性質に適した符号化が可能
になる。従来技術としては、例えば図３に示すH.264/AVCによる画面内予測符号化や、符
号化対象画像とは異なる画像（復号画像）を用いる画面間予測符号化などを利用すると効
果的である。 Therefore, by using the predictive coding technique according to the present invention in combination with the conventional technique, a high compression rate can be realized according to the property of the image. For example, predictive encoding using the present technology and the conventional technology separately in units of blocks enables encoding suitable for the properties of the image. As the prior art, for example, it is effective to use intra-screen predictive encoding using H.264 / AVC shown in FIG. 3 or inter-screen predictive encoding using an image (decoded image) different from the encoding target image. .

図１は、本発明による動画像符号化装置の一実施例を示したものである。動画像符号化
装置は、入力された原画像(101)を保持する入力画像メモリ(102)と、入力画像を小領域に
分割したブロック単位で動きを検出する動き探索部(103)と、同じくブロック単位で画面
内予測を行う画面内予測部(104)、検出された動き量を基に画面間予測を行う画面間予測
部(105)と、画像の性質に合った予測符号化手段を決定するモード選択部(106)と、予測差
分を生成するための減算部(107)と、予測差分に対して符号化を行う周波数変換部(108)、
および量子化部(109)と、記号の発生確率に応じた符号化を行うための可変長符号化部(11
0)と、一度符号化した予測差分を復号化するための逆量子化処理部(111)、および逆周波
数変換部(112)と、復号化された予測差分を用いて復号画像を生成するための加算部(113)
と、復号化画像を保持して後の予測に活用するための参照画像メモリ(114)を有する。 FIG. 1 shows an embodiment of a moving picture coding apparatus according to the present invention. Similarly, the moving image coding apparatus includes an input image memory (102) that holds an input original image (101), and a motion search unit (103) that detects motion in units of blocks obtained by dividing the input image into small regions. In-screen prediction unit (104) that performs intra-screen prediction on a block basis, inter-screen prediction unit (105) that performs inter-screen prediction based on the detected amount of motion, and predictive coding means that matches the nature of the image A mode selection unit (106), a subtraction unit (107) for generating a prediction difference, a frequency conversion unit (108) that performs encoding on the prediction difference,
And a quantizing unit (109) and a variable length coding unit (11) for performing coding according to the probability of occurrence of a symbol
0), an inverse quantization processing unit (111) for decoding a prediction difference once encoded, and an inverse frequency transform unit (112), and for generating a decoded image using the decoded prediction difference Addition part (113)
And a reference image memory (114) for holding the decoded image and using it for later prediction.

原画像メモリ(102)は原画像(101)の中から一枚の画像を符号化対象画像として保持しこ
れを細かなブロックに分割して動き探索部(103)、および画面内予測部(104)に渡す。 The original image memory (102) holds one image from the original image (101) as an encoding target image, and divides it into fine blocks to divide the image into a motion search unit (103) and an in-screen prediction unit (104 ).

動き探索部(103)では、参照画像メモリ(114)に格納されている復号化済み画像を用いて
該当ブロックの動き量を計算し、動きベクトルとして画面間予測部(105)に渡す。 The motion search unit (103) calculates the motion amount of the corresponding block using the decoded image stored in the reference image memory (114), and passes it to the inter-screen prediction unit (105) as a motion vector.

画面内予測部(104)、および画面間予測部(105)ではそれぞれ画面内予測処理と画面間予
測処理をいくつかの大きさのブロック単位で実行し、モード選択部(106)にて最適な予測
符号化手段を選ぶ。続いて加算部(107)では最適な予測符号化手段による予測差分を生成
し、周波数変換部(108)に渡す。 The in-screen prediction unit (104) and the inter-screen prediction unit (105) execute the intra-screen prediction process and the inter-screen prediction process in blocks of several sizes, respectively, and the mode selection unit (106) Select predictive encoding means. Subsequently, the adder (107) generates a prediction difference by the optimum predictive encoding means and passes it to the frequency converter (108).

画面内予測部(104)には、本発明の特徴である符号化対象となる画素を常に隣接する画
素を参照して予測符号化処理を行う改良型の画面内予測符号化手段と、例えばH.264/AVC
による従来型の画面内予測符号化手段が含まれる。 The intra-screen prediction unit (104) includes an improved intra-screen prediction encoding unit that performs predictive encoding processing with reference to a pixel that is always adjacent to an encoding target pixel that is a feature of the present invention, for example, H .264 / AVC
Includes a conventional intra-frame predictive encoding means.

周波数変換部(108)、および量子化処理部(109)では、送られてきた差分画像に対して指
定された大きさのブロック単位でそれぞれDCT(Discrete
Cosine Transformation：離散コサイン変換)などの周波数変換、および量子化処理を行い
、可変長符号化処理部(110)および逆量子化処理部(111)に渡す。 The frequency conversion unit (108) and the quantization processing unit (109) each receive a DCT (Discrete) in units of blocks of a specified size for the transmitted difference image.
Frequency conversion such as Cosine Transformation (discrete cosine transformation) and quantization processing are performed, and the result is passed to the variable length coding processing unit (110) and the inverse quantization processing unit (111).

さらに可変長符号化処理部(110)では、周波数変換係数によって表される予測差分情報
を、例えば画面内予測符号化における予測方向や画面間予測符号化における動きベクトル
など、予測復号化に必要な付加情報とともに、記号の発生確率に基づいて可変長符号化を
行って符号化ストリームを生成する。 Further, in the variable length coding processing unit (110), the prediction difference information represented by the frequency transform coefficient is necessary for predictive decoding such as a prediction direction in intra prediction encoding and a motion vector in inter prediction encoding. Along with the additional information, variable length coding is performed based on the occurrence probability of symbols to generate a coded stream.

また、逆量子化処理部(111)、逆周波数変換部(112)では、量子化後の周波数変換係数に
対して、それぞれ逆量子化、およびIDCT(Inverse DCT：逆DCT)などの逆周波数変換を施し
、予測差分を取得して加算部(113)に送る。続いて加算部(113)により復号画像を生成して
参照画像メモリ(114)に格納する。 In addition, the inverse quantization processing unit (111) and the inverse frequency conversion unit (112) respectively perform inverse frequency conversion such as inverse quantization and IDCT (Inverse DCT) on the frequency conversion coefficient after quantization. To obtain the prediction difference and send it to the adder (113). Subsequently, the adder (113) generates a decoded image and stores it in the reference image memory (114).

図２は、本発明による動画像復号化装置の一実施例を示したものである。動画像復号化
装置は、例えば図１に示す動画像符号化装置によって生成された符号化ストリーム(201)
に対して可変長符号化の逆の手順を踏む可変長復号化部(202)と、予測差分を復号化する
ための逆量子化処理部(203)、および逆周波数変換部(204)と、予測を行うための画面内予
測部(205)、および画面間予測部(206)と、復号画像を取得するための加算部(207)と、復
号画像を一時的に記憶しておくための参照画像メモリ(208)を有する。 FIG. 2 shows an embodiment of the moving picture decoding apparatus according to the present invention. For example, the moving picture decoding apparatus includes an encoded stream (201) generated by the moving picture encoding apparatus shown in FIG.
A variable length decoding unit (202) that follows the reverse procedure of variable length coding, an inverse quantization processing unit (203) for decoding a prediction difference, and an inverse frequency transform unit (204), Intra-screen prediction unit (205) and inter-screen prediction unit (206) for performing prediction, an addition unit (207) for acquiring a decoded image, and a reference for temporarily storing the decoded image An image memory (208) is included.

画面内予測部(205)には、本発明の特徴である復号対象となる画素を常に隣接する画素
を参照して予測する画面内予測復号化手段と、例えばH.264/AVCによる従来型の画面内予
測符号化手段が含まれる。 The intra-screen prediction unit (205) includes an intra-screen predictive decoding unit that always predicts a pixel to be decoded, which is a feature of the present invention with reference to adjacent pixels, and a conventional type based on, for example, H.264 / AVC. Intra-screen predictive encoding means is included.

可変長復号化部(202)では、符号化ストリーム(201)を復号化し、予測差分の周波数変換
係数成分と、予測方向や動きベクトルなどの予測処理に必要な付加情報を取得する。前者
の予測差分情報に対しては逆量子化処理部(203)に、後者の付加情報に対しては、予測手
段に応じて画面内予測部(205)または画面間予測部(206)に送られる。続いて、逆量子化処
理部(203)、逆周波数変換部(204)では、予測差分情報に対して逆量子化と逆周波数変換を
施して復号化を行い、一方で画面内予測部(205)または画面間予測部(206)において、付加
情報を基に予測処理を実行して加算部(207)で復号画像を生成するとともに、復号画像を
参照画像メモリ(208)に格納する。 The variable length decoding unit (202) decodes the encoded stream (201), and obtains additional information necessary for prediction processing, such as a frequency conversion coefficient component of a prediction difference and a prediction direction and a motion vector. The former prediction difference information is sent to the inverse quantization processing unit (203), and the latter additional information is sent to the intra prediction unit (205) or the inter prediction unit (206) depending on the prediction means. It is done. Subsequently, the inverse quantization processing unit (203) and the inverse frequency transform unit (204) perform inverse quantization and inverse frequency transform on the prediction difference information to perform decoding, while the intra prediction unit (205 ) Or the inter-screen prediction unit (206) executes a prediction process based on the additional information, and the addition unit (207) generates a decoded image and stores the decoded image in the reference image memory (208).

図５は、図１に示す動画像符号化装置の実施例における1フレームの符号化処理手順に
ついて示している。まず、符号化対象となるフレーム内に存在するすべてのブロックに対
して(501)、以下の処理を行う。すなわち、該当ブロックに対して一度すべての符号化モ
ード(予測方法とブロックサイズの組み合わせ)に対して予測符号化処理を行って予測差分
を計算し、その中から最も符号化効率の高いものを選択する。予測処理方法としては、本
発明による方法（以下、「改良型画面内予測符号化処理」(505)）の他に、例えばH.264/A
VCに採用されている画面内予測方法(以下、「従来型画面内予測符号化処理」(506)）や画
面間予測符号化処理(507)を実行し、その中から最適なものを選ぶことによって、画像の
性質に応じて効率良く符号化できる。さらに、上記多数の符号化モードの中から最も符号
化効率の高いものを選択する際には(508)、例えば画質歪みと符号量の関係から最適な符
号化モードを決定するRD-Optimization方式を利用することによって、効率良く符号化で
きる。RD-Optimization方式の詳細については、非特許文献１に記載されている。 FIG. 5 shows a procedure for encoding one frame in the embodiment of the moving picture encoding apparatus shown in FIG. First, the following processing is performed for all blocks existing in the frame to be encoded (501). In other words, the prediction coding process is performed once for all coding modes (combination of prediction method and block size) for the corresponding block, the prediction difference is calculated, and the one with the highest coding efficiency is selected from among them. To do. As a prediction processing method, in addition to the method according to the present invention (hereinafter referred to as “the improved intra prediction encoding process” (505)), for example, H.264 / A
Execute the intra-screen prediction method (hereinafter referred to as “conventional intra-screen predictive encoding process” (506)) and inter-screen predictive encoding process (507) used in VC, and select the optimal one from them. Thus, encoding can be performed efficiently according to the nature of the image. Furthermore, when selecting the one with the highest encoding efficiency from among the above-mentioned many encoding modes (508), for example, an RD-Optimization method for determining the optimal encoding mode from the relationship between image quality distortion and code amount is used. By using this, encoding can be performed efficiently. Details of the RD-Optimization method are described in Non-Patent Document 1.

続いて、選ばれた符号化モードで生成された予測差分に対して周波数変換(509)と量子
化処理(510)を施し、さらに可変長符号化を行うことによって符号化ストリームを生成す
る(511)。一方、量子化済みの周波数変換係数に対しては逆量子化処理(512)と逆周波数変
換処理(513)を施して予測差分を復号化し、復号画像を生成して参照画像メモリに格納す
る(514)。以上の処理をすべてのブロックに対して完了すれば、画像1フレーム分の符号化
は終了する(515)。 Subsequently, frequency conversion (509) and quantization processing (510) are performed on the prediction difference generated in the selected encoding mode, and further, variable length encoding is performed to generate an encoded stream (511). ). On the other hand, the quantized frequency transform coefficient is subjected to inverse quantization processing (512) and inverse frequency transform processing (513) to decode the prediction difference, generate a decoded image, and store it in the reference image memory ( 514). When the above processing is completed for all the blocks, the encoding for one frame of the image is completed (515).

図６は、上記改良型画面内予測符号化処理(505)の詳細な処理手順について示している
。ここでは、縦、横、斜めなど、あらかじめ定義されたすべての予測方向に対し(601)、
ブロック内に存在するすべての画素に対して予測方向に沿った順序で以下の処理を行う(6
02)。すなわち、該当する画素がブロック境界に属するなら(603)、直前座標の復号画像と
現座標の原画像の差分を計算し(604)、ブロック境界に属さないなら、直前座標の原画像
と現座標の原画像の差分を計算する(605)。以上の処理を、すべての予測方向に対して完
了すれば、1ブロック分の予測差分を取得することができ、予測符号化処理を終了する(60
6)。 FIG. 6 shows the detailed processing procedure of the improved intra prediction encoding process (505). Here, (601) for all predefined prediction directions, such as vertical, horizontal, diagonal,
The following processing is performed in the order along the prediction direction for all pixels present in the block (6
02). That is, if the corresponding pixel belongs to the block boundary (603), the difference between the decoded image of the previous coordinate and the original image of the current coordinate is calculated (604), and if not, the original image of the previous coordinate and the current coordinate The difference between the original images is calculated (605). If the above process is completed for all prediction directions, a prediction difference for one block can be acquired, and the prediction encoding process is terminated (60
6).

図７は、図２に示す動画像復号化装置の実施例における1フレームの復号化処理手順に
ついて示している。まず、1フレーム内のすべてのブロックに対して、以下の処理を行う(
701)。すなわち、入力ストリームに対して可変長復号化処理を施し(702)、逆量子化処理(
703)と逆周波数変換処理(704)を施して予測差分を復号化する。続いて、対象ブロックが
どの方法によって予測符号化されているかに応じて、改良型画面内予測復号化処理(707)
、従来型画面内予測復号化処理(708)、画面間予測復号化処理(709)を行い、復号画像を取
得して参照画像メモリに格納する。以上の処理をすべてのブロックに対して完了すれば、
画像1フレーム分の復号化が終了する(710)。 FIG. 7 shows a procedure for decoding one frame in the embodiment of the moving picture decoding apparatus shown in FIG. First, the following processing is performed for all blocks in one frame (
701). That is, variable length decoding processing is performed on the input stream (702), and inverse quantization processing (
703) and inverse frequency transform processing (704) are performed to decode the prediction difference. Subsequently, an improved intra prediction decoding process (707) depending on which method is used for predictive encoding of the target block
Then, conventional intra-screen predictive decoding processing (708) and inter-screen predictive decoding processing (709) are performed, and a decoded image is acquired and stored in the reference image memory. If the above process is completed for all blocks,
Decoding of one image frame is completed (710).

図８は、上記改良型画面内予測復号化処理(707)の詳細な処理手順について示している
。ここでは、ブロック内に存在するすべての画素に対して、予測方向に沿った順序で以下
の処理を行う(801)。すなわち、直前座標の復号画像と現座標の予測差分の和を計算する(
802)。以上の処理をすべての画素に対して完了すれば、1ブロック分の復号画像を取得す
ることができ、予測復号化処理を終了する(803)。 FIG. 8 shows a detailed processing procedure of the improved intra prediction decoding process (707). Here, the following processing is performed on all the pixels present in the block in the order along the prediction direction (801). That is, the sum of the decoded image of the previous coordinate and the prediction difference of the current coordinate is calculated (
802). If the above processing is completed for all the pixels, a decoded image for one block can be acquired, and the predictive decoding processing ends (803).

以上の実施例による改良型画面内予測符号化処理では、ブロック境界に位置する画素の
予測に符号化済み画素の復号画像を利用している。これは、対象ブロックにおける量子化
誤差が別のブロックに伝播するのを防ぐためである。しかし、この場合ブロックの境界部
分で予測誤差が大きくなる可能性がある。そのため、予測誤差を小さくするために、ブロ
ック境界に位置する画素に対しても、直前座標の原画像を利用して予測符号化してもよい
。 In the improved intra prediction encoding process according to the above embodiment, a decoded image of an encoded pixel is used for prediction of a pixel located at a block boundary. This is to prevent the quantization error in the target block from propagating to another block. However, in this case, there is a possibility that the prediction error becomes large at the block boundary. Therefore, in order to reduce the prediction error, prediction encoding may be performed on the pixels located at the block boundary using the original image of the immediately preceding coordinates.

また、本実施例による改良型画面内予測符号化処理では、符号化時と復号化時の参照画素の値が異なるため、量子化誤差が蓄積しやすい。そこで、この量子化誤差を低減するために、改良型画面内予測符号化処理の別の構成例として、予測を行う際に参照画素の画素値を平滑化しても良い。この場合、誤差が拡散されて予測精度が高くなる。この効果は、特に低ビットレートにおいて高くなる。以下にこの別の構成例について説明する。 Further, in the improved intra prediction encoding process according to the present embodiment, the value of the reference pixel at the time of encoding is different from that at the time of decoding, so that a quantization error is likely to be accumulated. Therefore, in order to reduce this quantization error, as another configuration example of the improved intra prediction encoding process, the pixel value of the reference pixel may be smoothed when performing prediction. In this case, the error is diffused and the prediction accuracy is increased. This effect is particularly high at low bit rates. Hereinafter, another example of the configuration will be described.

図１２は、図６に示す改良型画面内予測符号化処理において、上記平滑化処理を用いた場合の構成例である。図１２の構成例では、参照画素の画素値の平滑化を行っている。ここでは、縦、横、斜めなど、あらかじめ定義されたすべての予測方向に対し(1201)、ブロック内に存在するすべての画素に対して予測方向に沿った順序で以下の処理を行う(1202)。すなわち、該当する画素がブロック境界に属するなら(1203)、直前座標の復号画像を平滑化し(1204)、続いて平滑化後の復号画像と現座標の原画像の差分を計算する(1206)。逆にブロック境界に属さないなら、直前座標の原画像を平滑化し(1205)、平滑化後の原画像と現座標の原画像の差分を計算する(1207)。以上の処理を、すべての予測方向に対して完了すれば、1ブロック分の予測差分を取得することができ、予測符号化処理を終了する(1208)。 FIG. 12 is a configuration example when the above smoothing process is used in the improved intra prediction encoding process shown in FIG. In the configuration example of FIG. 12, the pixel value of the reference pixel is smoothed. Here, for all predefined prediction directions such as vertical, horizontal, and diagonal (1201), the following processing is performed in the order along the prediction direction for all pixels present in the block (1202). . That is, if the corresponding pixel belongs to the block boundary (1203), the decoded image at the immediately preceding coordinates is smoothed (1204), and then the difference between the decoded image after smoothing and the original image at the current coordinates is calculated (1206). On the contrary, if it does not belong to the block boundary, the original image at the immediately preceding coordinates is smoothed (1205), and the difference between the original image after smoothing and the original image at the current coordinates is calculated (1207). If the above processing is completed for all prediction directions, a prediction difference for one block can be acquired, and the prediction encoding processing is terminated (1208).

ここで、図１２のステップ1204及びステップ1205における平滑化処理ついて、図１１を用いて説明する。図１１は、参照画素の平滑化処理の一実施例について示している。 Here, the smoothing process in step 1204 and step 1205 in FIG. 12 will be described with reference to FIG. FIG. 11 shows an embodiment of the smoothing process of the reference pixel.

図１２のステップ1204における平滑化処理は、例えば、図１１においては、画素F’に対する平滑化処理として示される。すなわち、図１２のステップ1203において予測対象画素がブロックの境界に属する画素(この例では画素Ｊ)であると判断した場合は、直前座標の画素（画素F’）及びその周辺の画素の復号画像を平滑化する。この例では、平滑化対象となる画素F’の平滑化後の画素値S(F’)は、画素F’の復号画像の画素値と、画素F’の両隣に位置するそれぞれの画素(G’、Ｈ’)の復号画像の画素値との加重平均として算出している。ここで、S(X)は画素Xについて、複数の画素の復号画像の画素値を平滑化した画素値を算出するための関数の一例である。 The smoothing process in step 1204 of FIG. 12 is shown as a smoothing process for the pixel F ′ in FIG. 11, for example. That is, if it is determined in step 1203 of FIG. 12 that the pixel to be predicted is a pixel belonging to the block boundary (pixel J in this example), the decoded image of the pixel at the immediately previous coordinate (pixel F ′) and its surrounding pixels Is smoothed. In this example, the pixel value S (F ′) after smoothing of the pixel F ′ to be smoothed is the pixel value of the decoded image of the pixel F ′ and each pixel (G ', H') is calculated as a weighted average with the pixel value of the decoded image. Here, S (X) is an example of a function for calculating a pixel value obtained by smoothing pixel values of a decoded image of a plurality of pixels for the pixel X.

図１２のステップ1205における平滑化処理は、例えば、図１１においては、画素Jに対する平滑化処理として示される。すなわち、図１２のステップ1203において予測対象画素がブロックの境界に属しない画素(この例では画素L)であると判断した場合は、直前座標の画素（画素J）及びその周辺の画素の原画像を平滑化する。この例では、平滑化対象となる画素Ｊの平滑化後の画素値T(J)は、画素Ｊの原画像の画素値と、画素Ｊの両隣に位置するそれぞれの画素(I、K)の原画像の画素値との加重平均として算出している。ここで、T(X)は画素Xについて、複数の画素の原画像の画素値を平滑化した画素値を算出するための関数の一例である。 The smoothing process in step 1205 of FIG. 12 is shown as a smoothing process for the pixel J in FIG. That is, if it is determined in step 1203 of FIG. 12 that the prediction target pixel is a pixel that does not belong to the block boundary (in this example, pixel L), the original image of the pixel at the immediately previous coordinate (pixel J) and the surrounding pixels Is smoothed. In this example, the pixel value T (J) after smoothing of the pixel J to be smoothed is the pixel value of the original image of the pixel J and each pixel (I, K) located on both sides of the pixel J. It is calculated as a weighted average with the pixel value of the original image. Here, T (X) is an example of a function for calculating a pixel value obtained by smoothing pixel values of an original image of a plurality of pixels for the pixel X.

上記の二つの平滑化処理の例では、加重平均の一例を用いた。しかし、平滑化対象の画素の周辺(隣接していても、離れていてもかまわない)に位置する複数の画素の画素値（復号画像または原画像の画素値）を変数とする関数の値を用いて平滑化対象の画素を算出する方法であれば、従来の様々な平滑化方法を用いてもかまわない。 In the above two smoothing processing examples, an example of a weighted average is used. However, the value of the function whose variable is the pixel value (the decoded image or the original image) of multiple pixels located around the pixel to be smoothed (which may be adjacent or distant) Any conventional smoothing method may be used as long as the method is used to calculate the pixel to be smoothed.

以上説明した平滑化処理を用いた改良型画面内予測符号化処理を行う画像符号化装置及び画像符号化方法によれば、復号時においてより量子化誤差を低減することが可能となり、少ない符号量でより高画質な映像を復号することのできる符号化ストリームを生成することが可能となる。 According to the image encoding device and the image encoding method that perform the improved intra prediction encoding process using the smoothing process described above, it is possible to reduce the quantization error at the time of decoding, and to reduce the amount of codes Thus, it is possible to generate an encoded stream that can decode a higher quality video.

次に、図１３に図８に示す改良型画面内予測復号化処理において、上記平滑化処理を用いた場合の構成例を説明する。ここでは、ブロック内に存在するすべての画素に対して、予測方向に沿った順序で以下の処理を行う(1301)。すなわち、直前座標の復号画像を平滑化し(1302)、平滑化後の復号画像と現座標の予測差分の和を計算する(1303)。以上の処理をすべての画素に対して完了すれば、1ブロック分の復号画像を取得することができ、予測復号化処理を終了する(1304)。図１３のステップ1302に示す平滑化処理は、例えば、図１2のステップ1204における画素F’の平滑化処理において画素値S(F’)を算出する場合と同様の処理を行えばよい。 Next, a configuration example in the case where the smoothing process is used in the improved intra prediction decoding process shown in FIG. 8 will be described with reference to FIG. Here, the following processing is performed on all the pixels present in the block in the order along the prediction direction (1301). That is, the decoded image at the immediately preceding coordinates is smoothed (1302), and the sum of the smoothed decoded image and the prediction difference between the current coordinates is calculated (1303). If the above process is completed for all the pixels, a decoded image for one block can be acquired, and the predictive decoding process ends (1304). The smoothing process shown in Step 1302 of FIG. 13 may be performed, for example, in the same way as when the pixel value S (F ′) is calculated in the smoothing process of the pixel F ′ in Step 1204 of FIG.

以上説明した平滑化処理を用いた改良型画面内予測復号化処理を行う画像復号化装置及び画像復号化方法によれば、より量子化誤差を低減することが可能となり、より高画質な映像を復号することが可能となる。 According to the image decoding apparatus and the image decoding method that perform the improved in-screen predictive decoding process using the smoothing process described above, it becomes possible to further reduce the quantization error and to display a higher quality video. Decoding is possible.

また、実施例では周波数変換の一例としてDCTを挙げているが、DST(Discrete Sine T
ransformation：離散サイン変換)、WT(Wavelet Transformation：ウェーブレット変換)
、DFT(Discrete Fourier Transformation：離散フーリエ変換)、KLT(Karhunen-Loeve
Transformation：カルーネン-レーブ変換)など、画素間相関除去に利用する直交変換なら
どんなものでも構わないし、特に周波数変換を施さずに予測差分そのものを符号化しても
構わない。さらに、可変長符号化も特に行わなくて良い。 In the embodiment, DCT is cited as an example of frequency conversion, but DST (Discrete Sine T
ransformation (Discrete sine transformation), WT (Wavelet Transformation)
, DFT (Discrete Fourier Transformation), KLT (Karhunen-Loeve
Any transformation may be used as long as the orthogonal transformation is used to remove the correlation between pixels, such as Transformation (Karunen-Reeve transformation), and the prediction difference itself may be encoded without performing frequency transformation. Furthermore, variable length coding is not particularly required.

図９は、本発明を利用した場合に生成される符号化ストリームのうち、ブロック単位で
設定されるべき符号化パラメータに関する部分の構成例について示している。ここではH.
264/AVCにおける処理単位と同様に、固定長サイズのマクロブロック単位で符号化モード
を決定する場合について説明している。マクロブロックはさらに細かなブロックに分割す
ることができ、分割されたブロック単位で予測符号化が行われるものとする。この時、マ
クロブロックごとにその座標を特定するためのマクロブロック番号(901)と、予測方法と
ブロックサイズを表す符号化モード番号(902)、そして例えば画面間予測の際の動きベク
トルや画面内予測の際の予測方向など、予測を行う際に必要となる付加情報(903)、そし
て予測差分情報(904)が可変長符号化されて格納される。特に符号化モード番号(902)に対
しては、すべての予測手段に対して連番で割り当てても良いし、予測手段ごとに異なるビ
ットで表しても良い。 FIG. 9 shows a configuration example of a part related to encoding parameters to be set in units of blocks in an encoded stream generated when the present invention is used. Here H.
Similar to the processing unit in H.264 / AVC, a case has been described in which the encoding mode is determined in units of fixed-length macroblocks. The macroblock can be divided into finer blocks, and predictive coding is performed in units of the divided blocks. At this time, the macroblock number (901) for specifying the coordinates for each macroblock, the encoding mode number (902) indicating the prediction method and the block size, and the motion vector and intra-screen for the inter-screen prediction, for example Additional information (903) necessary for prediction, such as a prediction direction in prediction, and prediction difference information (904) are variable-length-coded and stored. In particular, the encoding mode number (902) may be assigned sequentially to all prediction means, or may be represented by different bits for each prediction means.

図１０は、ブロック単位で設定されるべき符号化パラメータに関する部分に対する別の
構成例について示している。この例では、マクロブロック番号(1001)と、予測方法(1002)
に対して、本発明による隣接画素を利用した予測を行うか否かを表すフラグ(1003)を付加
し、さらに付加情報(1004)と予測差分情報(1005)を格納する。 FIG. 10 shows another configuration example for the part related to the encoding parameter to be set in units of blocks. In this example, the macroblock number (1001) and the prediction method (1002)
On the other hand, a flag (1003) indicating whether or not to perform prediction using adjacent pixels according to the present invention is added, and additional information (1004) and prediction difference information (1005) are stored.

本実施例は動画像を符号化する場合について述べているが、本発明は静止画の符号化に
も有効である。すなわち、図１のブロック図から動き探索部(103)と画面間予測部(105)を
排除すれば、すなわち静止画像に特化した符号化装置のブロック図に相当する。 Although the present embodiment describes the case of encoding a moving image, the present invention is also effective for encoding a still image. That is, if the motion search unit (103) and the inter-screen prediction unit (105) are excluded from the block diagram of FIG. 1, this corresponds to a block diagram of an encoding device specialized for still images.

図１は、本実施例で用いる画像符号化装置のブロック図である。FIG. 1 is a block diagram of an image encoding apparatus used in this embodiment. 図２は、本実施例で用いる画像復号化装置のブロック図である。FIG. 2 is a block diagram of an image decoding apparatus used in this embodiment. 図３は、H.264/AVCで用いる画面内予測符号化処理の概念的な説明図である。FIG. 3 is a conceptual explanatory diagram of intra prediction encoding processing used in H.264 / AVC. 図４は、本実施例で用いる画面内予測符号化処理の概念的な説明図である。FIG. 4 is a conceptual explanatory diagram of the intra prediction encoding process used in the present embodiment. 図５は、本実施例で用いる画像符号化装置の流れ図である。FIG. 5 is a flowchart of the image encoding apparatus used in this embodiment. 図６は、本実施例で用いる画像符号化装置の詳細な流れ図である。FIG. 6 is a detailed flowchart of the image encoding apparatus used in this embodiment. 図７は、本実施例で用いる画像復号化装置の流れ図である。FIG. 7 is a flowchart of the image decoding apparatus used in this embodiment. 図８は、本実施例で用いる画像復号化装置の詳細な流れ図である。FIG. 8 is a detailed flowchart of the image decoding apparatus used in this embodiment. 図９は、本実施例で用いる符号化ストリームの構成例である。FIG. 9 is a configuration example of an encoded stream used in the present embodiment. 図１０は、本実施例で用いる符号化ストリームの構成例である。FIG. 10 is a configuration example of an encoded stream used in this embodiment. 図１１は、本実施例で用いる平滑化処理の概念的な説明図である。FIG. 11 is a conceptual explanatory diagram of the smoothing process used in the present embodiment. 図１２は、本実施例で用いる画像符号化装置の詳細な流れ図である。FIG. 12 is a detailed flowchart of the image encoding apparatus used in this embodiment. 図１３は、本実施例で用いる画像復号化装置の詳細な流れ図である。FIG. 13 is a detailed flowchart of the image decoding apparatus used in this embodiment.

Explanation of symbols

１０１入力画像
１０２現画像メモリ
１０３動き探索部
１０４画面内予測部
１０５画面間予測部
１０６モード選択部
１０７減算部
１０８周波数変換部
１０９量子化処理部
１１０可変長符号化部
１１１逆量子化処理部
１１２逆周波数変換部
１１３加算部
１１４参照画像メモリ
２０１符号化ストリーム
２０２可変長復号化部
２０３逆量子化処理部
２０４逆周波数変換部
２０５画面内予測部
２０６画面間予測部
２０７加算部
２０８参照画像メモリ
３０１〜３０６ H.264/AVCによる画面内予測符号化処理の説明図
４０１〜４０６本発明による画面内予測符号化処理の説明図
５０１〜５１５フローチャートのブロック
６０１〜６０６フローチャートのブロック
７０１〜７１０フローチャートのブロック
８０１〜８０３フローチャートのブロック
９０１〜９０４ストリームに格納する情報
１００１〜１００５ストリームに格納する情報
１２０１〜１２０８フローチャートのブロック
１３０１〜１３０４フローチャートのブロック DESCRIPTION OF SYMBOLS 101 Input image 102 Current image memory 103 Motion search part 104 Intra-screen prediction part 105 Inter-screen prediction part 106 Mode selection part 107 Subtraction part 108 Frequency conversion part 109 Quantization process part 110 Variable length encoding part 111 Inverse quantization process part 112 Inverse frequency conversion unit 113 Addition unit 114 Reference image memory 201 Encoded stream 202 Variable length decoding unit 203 Inverse quantization processing unit 204 Inverse frequency conversion unit 205 Intra-screen prediction unit 206 Inter-screen prediction unit 207 Addition unit 208 Reference image memory 301 306 Explanation diagrams of intra prediction encoding processing by H.264 / AVC 401 to 406 Explanation diagrams of intra prediction encoding processing according to the present invention 501 to 515 Flowchart blocks 601 to 606 Flowchart blocks 701 to 710 Flowchart blocks 801-803 Block chart block 901-904 Information stored in stream 1001-1005 Information stored in stream 1201-1208 Flowchart block 1301-1304 Flowchart block

Claims

In order to perform encoding according to the probability of occurrence of a symbol, an intra-screen prediction unit that performs intra-screen prediction in block units and calculates a prediction difference, a frequency conversion unit and a quantization unit that perform encoding on the prediction difference, and In the video encoding device having the variable length encoding unit,
The moving picture encoding apparatus, wherein the intra prediction unit includes an intra prediction encoding unit that performs predictive encoding processing with reference to a pixel that is always adjacent to a pixel to be encoded.

The moving image encoding device according to claim 1,
In the intra prediction unit, a plurality of prediction directions are defined, and a difference between adjacent pixels along a corresponding prediction direction is calculated at the time of predictive encoding.

In the moving image encoding device according to claim 1 or 2,
In the in-screen prediction unit, in addition to the in-screen predictive encoding means for performing predictive encoding processing with reference to pixels that are always adjacent to pixels to be encoded, in-screen predictive processing is executed. A predictive encoding unit, and an encoding mode selection unit that selects an optimal result from a plurality of intra-screen prediction encoding units of the intra-screen prediction unit, in units of blocks or images. A moving image encoding apparatus comprising:

Performs intra-frame prediction on a block basis, calculates the prediction difference, performs frequency conversion and quantization to encode the prediction difference, and performs variable length coding to perform coding according to the probability of occurrence of the symbol In the moving image encoding method to be performed,
An intra-frame prediction encoding method for performing intra-prediction encoding processing for performing predictive encoding processing with reference to a pixel that is always adjacent to a pixel to be encoded in the intra-frame prediction.

In the moving image encoding method according to claim 4,
A plurality of prediction directions are defined at the time of intra prediction, and a difference between adjacent pixels along the corresponding prediction direction is calculated at the time of prediction encoding. Method.

In the moving image encoding method according to claim 4 or 5,
In the case of the intra prediction, in addition to the intra prediction encoding for executing the predictive encoding process with reference to the pixels that are always adjacent to the pixel to be encoded, a screen for executing a different predictive encoding process A moving picture code characterized by performing intra prediction encoding and selecting an optimum one from a plurality of results encoded by intra prediction encoding at the time of the predictive encoding in units of blocks or images Method.

A variable-length decoding unit that performs the reverse procedure of variable-length coding, an inverse quantization processing unit and an inverse frequency transform unit for decoding a prediction difference, and intra-frame prediction for each block to obtain a decoded image In a video decoding device having an intra-screen prediction unit for
An intra-screen prediction unit includes an intra-screen prediction decoding unit that always predicts a pixel to be decoded with reference to adjacent pixels.

The moving picture decoding apparatus according to claim 7,
A plurality of prediction directions are defined in the in-screen prediction unit, and a moving image is calculated by calculating a sum of a decoded image of a pixel adjacent to the corresponding prediction direction and a prediction difference in predictive decoding. Image decoding device.

The moving picture decoding apparatus according to claim 7 or 8,
In addition to the intra-screen prediction decoding unit, the intra-screen prediction unit includes an intra-screen prediction decoding unit that executes different prediction decoding processes. In the plurality of intra-screen prediction decoding units, A moving picture decoding apparatus that decodes data that has been predictively encoded by a different method in units or images.

Performs variable-length decoding following the reverse procedure of variable-length coding, performs inverse quantization processing and inverse frequency transform to decode the prediction difference, and obtains a decoded image by performing intra prediction in units of blocks In the video decoding method for performing intra-screen prediction for
An intra-picture prediction method for performing intra-prediction decoding that predicts a pixel to be decoded with reference to adjacent pixels at the time of intra-prediction.

The moving picture decoding method according to claim 10, wherein
A plurality of prediction directions are defined when performing the intra-screen prediction, and at the time of predictive decoding, the prediction difference of the pixel to be decoded is added to the decoded image of the adjacent pixel along the corresponding prediction direction. A video decoding method characterized by the above.

In the moving image decoding method according to claim 10 or 11,
In addition to intra-frame predictive decoding in which the pixel to be decoded is always predicted with reference to adjacent pixels, intra-frame predictive decoding in which predictive decoding processing is performed by different methods is possible. A moving picture decoding method, comprising decoding data that has been predictively encoded by a method different in units.

The moving picture encoding apparatus according to claim 1, wherein the intra-screen predictive encoding unit includes a pixel value of a pixel adjacent to the pixel to be encoded or a plurality of pixels located around the adjacent pixel. A moving picture coding apparatus characterized by referring to a smoothing function value having a variable of a pixel value of any one of a plurality of decoded pictures or a pixel value of an original picture.

5. The moving image encoding method according to claim 4, wherein in the intra prediction, a pixel value of a pixel adjacent to the pixel to be encoded or a plurality of pixels located around the adjacent pixel is calculated. A moving picture coding method characterized by referring to a smoothing function value having a variable of a pixel value of any one of a plurality of decoded pictures or a pixel value of an original picture.

The moving picture decoding apparatus according to claim 7, wherein the intra prediction decoding means includes a pixel value of a pixel adjacent to the pixel to be decoded or a plurality of pixels located around the adjacent pixel. A moving picture decoding apparatus characterized by referring to a smoothing function value having a pixel value of any one of a plurality of decoded pictures as a variable.

The moving picture decoding method according to claim 10, wherein in the intra prediction, a pixel value of a pixel adjacent to the pixel to be decoded or a plurality of pixels located around the adjacent pixel is selected. A moving picture decoding method characterized by referring to a value of a smoothing function whose variable is a pixel value of any one of a plurality of decoded pictures.