JP2010509800A

JP2010509800A - Video encoding / decoding method and apparatus using motion vector tracking

Info

Publication number: JP2010509800A
Application number: JP2009535217A
Authority: JP
Inventors: リー，キョ−ヒョク; キム，ソ−ヨン
Original assignee: Samsung Electronics Co Ltd
Current assignee: Samsung Electronics Co Ltd
Priority date: 2006-11-03
Filing date: 2007-11-02
Publication date: 2010-03-25
Anticipated expiration: 2027-11-02
Also published as: CN101573982A; CN101573982B; JP5271271B2

Abstract

A video encoding method and apparatus using motion vector tracking are disclosed. The method and apparatus determine a plurality of reference pictures to be used for predicting a current picture through a motion vector tracking process, which determines a corresponding region of a reference picture using motion information of the current picture and then determines a corresponding region of another reference picture using motion information of the previously determined corresponding region. A prediction value of the current picture is generated as a weighted sum of the determined reference pictures.

Description

The present invention relates to predictive encoding and decoding of video, and more specifically to a video encoding method and apparatus that determine a plurality of reference pictures by continuously tracking the motion vector path of a current picture and predictively encode the current picture using the determined reference pictures, and to a corresponding decoding method and apparatus.

In video encoding, compression is achieved by removing spatial and temporal redundancy in the video sequence. To remove temporal redundancy, another picture located before or after the picture currently being encoded is used as a reference picture. A region of the reference picture similar to a region of the current picture is searched for, the amount of motion between the corresponding regions is detected, and the residual (residue) between the current picture and the predicted picture obtained by motion compensation based on that motion is encoded.

A video picture is coded in units of one or more slices. A slice may contain only one macroblock, or an entire picture may be encoded as a single slice. According to the H.264 standard, a video picture is coded in units of I (Intra) slices, which are encoded using only prediction within the picture, P (Predictive) slices, which are encoded by prediction using video samples of one reference picture, and B (Bi-predictive) slices, which are encoded by prediction using video samples of two reference pictures.

Conventionally, in MPEG-2 and similar standards, bi-directional prediction has been performed using the one picture immediately before the current picture and the one picture immediately after it as reference pictures. H.264/AVC (Advanced Video Coding) extends this concept of bi-directional prediction: the reference pictures are not limited to the pictures immediately before and after the current picture, and any two pictures can be used as reference pictures regardless of whether they precede or follow it. A picture predicted in this way, using any two pictures as reference pictures regardless of their temporal direction, is defined as a bi-predictive picture (hereinafter, "B picture").

FIG. 1 is a diagram for describing the prediction process of blocks in a current picture encoded as a B picture according to H.264/AVC. According to H.264/AVC, a block in a B picture is predicted using two reference pictures A and B in the same direction, as with MB1; using two reference pictures B and C in different directions, as with MB2; using two different regions sampled within the same reference picture A, as with MB3; or using only a single arbitrary reference picture B or D, as with MB4 and MB5.

In general, video data coded as a B picture achieves higher coding efficiency than video data coded as an I picture or a P picture. One reason is that a B picture, which uses two reference pictures, is more likely to produce prediction data similar to the current video data than a P picture, which uses one reference picture, or an I picture, which uses prediction within the picture. In addition, because a B picture uses the average of two reference pictures as prediction data, the temporally smoothed average serves as the predicted value even when errors occur between the temporally preceding and following pictures. As if a kind of low-pass filtering had been applied, visible coding distortion may then hardly occur.

Just as a B picture using two reference pictures achieves higher coding efficiency than a P picture, using still more reference pictures could improve coding efficiency further. However, because performing motion prediction and compensation for every reference picture increases the amount of computation, conventional video compression standards use at most two reference pictures.

The present invention has been devised to solve the above problem. Its object is to provide a video encoding method and apparatus, and a corresponding decoding method and apparatus, that improve coding efficiency by tracking the motion vector path of the reference picture of a current block so that more reference pictures can be used to predict the current block.

To solve the above technical problem, a video encoding method according to the present invention includes: determining corresponding regions of a plurality of reference pictures to be used for predicting a current block by tracking the motion vector path of the corresponding region of the reference picture referred to by the current block; generating a prediction block of the current block by obtaining a weighted sum of the corresponding regions of the plurality of reference pictures; and encoding the difference between the current block and the prediction block.

A video encoding apparatus according to the present invention includes: a reference picture determination unit that determines corresponding regions of a plurality of reference pictures to be used for predicting a current block by tracking the motion vector path of the corresponding region of the reference picture referred to by the current block; a weighted prediction unit that generates a prediction block of the current block by obtaining a weighted sum of the corresponding regions of the plurality of reference pictures; and an encoding unit that encodes the difference between the current block and the prediction block.

A video decoding method according to the present invention includes: reading prediction mode information contained in an input bitstream and identifying the prediction mode of a current block to be decoded; for a current block identified as having been predicted using corresponding regions of a plurality of reference pictures, determining the corresponding regions of the plurality of reference pictures to be used for predicting the current block by tracking the corresponding region of the reference picture indicated by the motion vector of the current block contained in the bitstream and the motion vector path of that corresponding region; generating a prediction block of the current block by obtaining a weighted sum of the corresponding regions of the plurality of reference pictures; and decoding the current block by adding the generated prediction block to the difference between the current block and the prediction block contained in the bitstream.

A video decoding apparatus according to the present invention includes: a prediction mode identification unit that reads prediction mode information contained in an input bitstream and identifies the prediction mode of a current block to be decoded; a reference picture determination unit that, for a current block identified as having been predicted using corresponding regions of a plurality of reference pictures, determines the corresponding regions of the plurality of reference pictures to be used for predicting the current block by tracking the corresponding region of the reference picture indicated by the motion vector of the current block contained in the bitstream and the motion vector path of that corresponding region; a weighted prediction unit that generates a prediction block of the current block by obtaining a weighted sum of the corresponding regions of the plurality of reference pictures; and a decoding unit that decodes the current block by adding the generated prediction block to the difference between the current block and the prediction block contained in the bitstream.

According to the present invention, prediction efficiency and coding efficiency can be improved by performing predictive encoding using a larger number of reference pictures.

FIG. 1 is a diagram for describing the prediction process of blocks in a current picture encoded as a B picture according to H.264/AVC.
FIG. 2 is a diagram illustrating an example of a process of determining a plurality of reference pictures used for prediction of a current picture by the video encoding method according to the present invention.
FIG. 3 is a diagram illustrating another example of a process of determining a plurality of reference pictures used for prediction of a current picture by the video encoding method according to the present invention.
FIG. 4 is a diagram illustrating still another example of a process of determining a plurality of reference pictures used for prediction of a current picture by the video encoding method according to the present invention.
FIG. 5 is a block diagram of a video encoding apparatus according to the present invention.
FIG. 6 is a block diagram showing the detailed configuration of the motion compensation unit 504 of FIG. 5.
FIG. 7 is a diagram showing the blocks of various sizes used in variable block size motion prediction in H.264/MPEG-4 AVC.
FIG. 8 is an image showing an example of a picture on which variable block size motion prediction has been performed.
FIG. 9 is a diagram for describing a process of determining the corresponding regions of other reference pictures referred to by the corresponding region of a reference picture divided along motion block boundaries, by the video encoding method according to the present invention.
FIG. 10 is a diagram for describing another example of a process of determining the corresponding regions of other reference pictures referred to by the corresponding region of a reference picture divided along motion block boundaries, by the video encoding method according to the present invention.
FIG. 11 is a diagram for describing a process of calculating the weights assigned to the corresponding regions of reference pictures in the video encoding method according to the present invention.
FIG. 12 is a flowchart of a video encoding method according to the present invention.
FIG. 13 is a block diagram showing the configuration of a video decoding apparatus according to the present invention.
FIG. 14 is a flowchart of a video decoding method according to the present invention.

Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.

The video encoding method according to the present invention determines a plurality of reference pictures used for prediction of a current picture by using the motion vector of the reference picture indicated by the motion vector of the current picture to continuously track the corresponding regions of other reference pictures, and generates a prediction value of the current picture as a weighted sum of the determined reference pictures. First, the process of determining a plurality of reference pictures, as applied in the video encoding method and apparatus according to the present invention, is described with reference to FIGS. 2 to 4.

FIG. 2 is a diagram illustrating an example of a process of determining a plurality of reference pictures used for prediction of a current picture by the video encoding method according to the present invention.

Assume that ordinary motion prediction has been performed on the block of the current picture to be encoded (hereinafter, "current block") 21, so that the corresponding region 22 of reference picture 1 most similar to the current block 21, and the motion vector MV_1 indicating the positional difference between the current block 21 and the corresponding region 22 of reference picture 1, have already been determined. Assume also that the current block 21 is a motion block in a P picture that refers to only one reference picture. The invention is not limited to a motion block with one motion vector, as shown in FIG. 2. It can also be applied to a motion block in a B picture with two motion vectors by tracking each of the motion vectors.

Referring to FIG. 2, the motion vector MV_1 generated as a result of motion prediction for the current block 21 indicates the region of reference picture 1 that has the smallest error with respect to the current block 21. According to the prior art, the value of the corresponding region 22 of reference picture 1 is taken as the predicted value of the current block 21, and the residual (residue), which is the difference between this predicted value and the original pixel values of the current block 21, is encoded.

The video encoding method according to the present invention uses, for prediction of the current block, not only the corresponding region of the reference picture indicated by the motion vector of the current block, as in the prior art, but also the corresponding regions of other reference pictures that were used to predict that corresponding region, which are found from the motion information of the corresponding region. Referring to FIG. 2, the motion vector MV_2 of the corresponding region 22 of reference picture 1, which corresponds to the current block 21, is used to determine the corresponding region 23 of reference picture 2 that was used to predict the corresponding region 22. Likewise, the motion vector MV_3 of the corresponding region 23 of reference picture 2 is used to determine the corresponding region 24 of reference picture 3 that was used to predict the corresponding region 23. The motion vector MV_n of the corresponding region 25 of reference picture n-1 is used to determine the corresponding region 26 of reference picture A that was used to predict the corresponding region 25. As described later, the process of tracking the corresponding region of the reference picture indicated by the motion vector of the current block, and then the corresponding regions of the other reference pictures indicated by the motion vectors of those regions, continues until a reference picture consisting only of intra-predicted blocks is reached, or until a reference picture is reached in which the part of the corresponding region lying within intra-predicted blocks is at or above a predetermined threshold.

In this way, the present invention uses not only the corresponding region 22 of reference picture 1 indicated by the motion vector MV_1 of the current block 21, as in the prior art, but also the corresponding region 23 of reference picture 2 indicated by the motion vector MV_2 of the corresponding region 22, the corresponding region 24 of reference picture 3 indicated by the motion vector MV_3 of the corresponding region 23, and so on. The corresponding regions of the plurality of reference pictures, determined by tracking the motion vector path of the corresponding region of the reference picture indicated by the motion vector of the current block 21, are each multiplied by a predetermined weight and then summed to generate the prediction block of the current block 21.
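The tracking and weighted sum described for FIG. 2 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the dictionary layout, the function names `track_regions` and `weighted_prediction`, the toy pictures, and the weights 0.5, 0.25, 0.25 are all hypothetical, and each corresponding region is assumed to carry a single motion vector.

```python
import numpy as np

def track_regions(start, motion):
    """Follow motion vectors from `start` and return the regions visited.

    start  : (picture_id, y, x) of the current block (hypothetical layout).
    motion : dict mapping a region key (picture_id, y, x) to the motion
             information (ref_picture_id, dy, dx) of that region.
             A region with no entry is treated as intra-predicted.
    """
    regions = []
    key = start
    while key in motion:                 # stop at a region without motion info
        ref, dy, dx = motion[key]
        _, y, x = key
        key = (ref, y + dy, x + dx)      # corresponding region in the next picture
        regions.append(key)
    return regions

def weighted_prediction(pictures, regions, weights, size):
    """Weighted sum of the size x size corresponding regions."""
    pred = np.zeros((size, size), dtype=np.float64)
    for (pic, y, x), w in zip(regions, weights):
        pred += w * pictures[pic][y:y + size, x:x + size]
    return pred

# Toy data: three flat reference pictures and a three-step motion vector path.
pictures = {
    "ref1": np.full((16, 16), 10.0),
    "ref2": np.full((16, 16), 20.0),
    "ref3": np.full((16, 16), 30.0),
}
motion = {
    ("cur", 4, 4): ("ref1", 1, 0),    # plays the role of MV_1
    ("ref1", 5, 4): ("ref2", 0, -1),  # plays the role of MV_2
    ("ref2", 5, 3): ("ref3", -1, 0),  # plays the role of MV_3
}
regions = track_regions(("cur", 4, 4), motion)
prediction = weighted_prediction(pictures, regions, [0.5, 0.25, 0.25], size=4)
```

The loop ends when it reaches a region with no motion entry, which stands in for the intra-predicted stopping condition. The threshold-based stopping rule is not modeled here.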

FIG. 3 is a diagram illustrating another example of a process of determining a plurality of reference pictures used for prediction of a current picture by the video encoding method according to the present invention. In FIG. 3, I_0 is assumed to be an I picture, P_1 and P_5 are assumed to be P pictures, and B_2 and B_3 are assumed to be B pictures. The following describes how, according to the present invention, the corresponding regions of the plurality of reference pictures used for prediction of the current block 31 in the B_2 picture are determined.

Referring to FIG. 3, assume that the current block 31 in the B_2 picture has two motion vectors, MV_1 and MV_2, as a result of ordinary motion prediction. When the block currently being encoded has two motion vectors, as the current block 31 in the B_2 picture does, the corresponding regions of the plurality of reference pictures are determined by tracking the motion vector path of the corresponding region of the reference picture for each motion vector. That is, the motion information of the corresponding region 33 of the P_1 picture, indicated by the first motion vector MV_1 of the current block 31, is used to determine the corresponding region 34 of another reference picture, I_0, that was used to predict the corresponding region 33. Because I_0 is an I picture consisting only of intra-predicted blocks, the corresponding region 34 has no motion information, and the tracking process stops there.

Similarly, the corresponding region 32 of the B_3 picture, indicated by the second motion vector MV_2 of the current block 31, also has two motion vectors because B_3 is a B picture. In this case too, the left motion vector of the two motion vectors of the corresponding region 32 is tracked to determine the corresponding region 41 of the P_1 picture used to predict the corresponding region 32, and the corresponding region 42 of the I_0 picture used to predict the corresponding region 41. The right motion vector of the corresponding region 32 is tracked to determine the corresponding region 38 of the P_5 picture used to predict the corresponding region 32. As described above, the tracking that follows the right motion vector continues until an intra-predicted picture that has no motion information, such as the I_0 picture, is reached, or until a reference picture is reached in which the area of the corresponding region lying within intra-predicted blocks is at or above a predetermined size.

The corresponding regions 32, 33, 34, and 38 of the plurality of reference pictures, determined in this way by tracking the motion vectors of the current block 31, are each multiplied by a predetermined weight and then summed to generate the prediction block of the current block 31.
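When a block or corresponding region has two motion vectors, the tracking branches. The following minimal sketch enumerates every region reachable along both branches, using region labels taken from the FIG. 3 description. The string keys, the `motion` table, and the function name `collect_regions` are hypothetical. The table encodes only the links stated in the text, so region 38 of the P_5 picture is left without further links, and regions 41 and 42 from the preceding paragraph also appear in the result.

```python
def collect_regions(region, motion):
    """Return all corresponding regions reachable from `region`.

    motion maps a region label to the list of regions indicated by its
    motion vectors (two entries for a bi-predicted region, one for a
    uni-predicted region). A label with no entry is treated as having
    no motion information, which ends that branch.
    """
    found = []
    for target in motion.get(region, []):
        found.append(target)
        found.extend(collect_regions(target, motion))  # keep tracking this branch
    return found

# Links described for FIG. 3 (labels are "picture:region number").
motion = {
    "B2:31": ["P1:33", "B3:32"],   # MV_1 and MV_2 of the current block 31
    "P1:33": ["I0:34"],            # region 33 was predicted from region 34
    "B3:32": ["P1:41", "P5:38"],   # left and right motion vectors of region 32
    "P1:41": ["I0:42"],            # region 41 was predicted from region 42
}
regions = collect_regions("B2:31", motion)
```

The recursion terminates because each reference picture was encoded before the picture that refers to it, so the links cannot form a cycle.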

FIG. 4 is a diagram illustrating still another example of a process of determining a plurality of reference pictures used for prediction of a current picture by the video encoding method according to the present invention. The process in FIG. 4 is similar to the motion vector tracking process described with reference to FIG. 3, except that only pictures encoded before the current picture are used in order to reduce the delay in video encoding. Because H.264/AVC allows two reference pictures in any direction, without restricting them to the pictures immediately before or after the current picture, predictive encoding may be performed using only forward pictures, as shown in FIG. 4.

Referring to FIG. 4, by tracking each of the two motion vectors MV_1 and MV_2 of the current block 43, the corresponding regions 44, 45, 46, 47, 48, 49, 50, and 51 of the plurality of reference pictures used for prediction of the current block 43 can be determined.

As described above, the video encoding method and apparatus according to the present invention use, for prediction of the current block, not only the corresponding region of the reference picture indicated by the motion vector of the current block but also the corresponding regions of other reference pictures that were used to predict that region, found from the motion information of the region. When the current block or a corresponding region of a reference picture has two motion vectors, the corresponding regions of the reference pictures are determined by tracking each motion vector.

FIG. 5 is a block diagram of a video encoding apparatus according to the present invention. For convenience, the description below focuses on a video encoding apparatus based on H.264/AVC, but the video encoding apparatus according to the present invention can also be applied to other video coding schemes that use motion prediction and compensation. Referring to FIG. 5, the video encoding apparatus 500 includes a motion prediction unit 502, a motion compensation unit 504, an intra prediction unit 506, a transform unit 508, a quantization unit 510, a reordering unit 512, an entropy coding unit 514, an inverse quantization unit 516, an inverse transform unit 518, a filter 520, a frame memory 522, and a control unit 525.

The motion prediction unit 502 divides the current picture into blocks of a predetermined size and performs motion prediction, searching within a predetermined search range of a reference picture (a previously encoded picture that has been reconstructed and stored in the frame memory 522) for the region most similar to the current block. It outputs a motion vector, which is the positional difference between the current block and the corresponding region of the reference picture.
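A minimal sketch of such a search is shown below, assuming an exhaustive (full) search with SAD as the similarity measure. The source does not specify the search strategy or the measure used by the motion prediction unit 502, so both are illustrative choices, and the function name `full_search` and its parameters are hypothetical.

```python
import numpy as np

def full_search(current, reference, y, x, size, search_range):
    """Find the motion vector (dy, dx) minimizing SAD for one block.

    The block is the size x size area of `current` at (y, x). Candidates
    are all positions within +/- search_range in `reference` that lie
    fully inside the picture.
    """
    block = current[y:y + size, x:x + size].astype(np.int64)
    best = None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            ry, rx = y + dy, x + dx
            if ry < 0 or rx < 0:
                continue  # candidate starts outside the picture
            if ry + size > reference.shape[0] or rx + size > reference.shape[1]:
                continue  # candidate ends outside the picture
            cand = reference[ry:ry + size, rx:rx + size].astype(np.int64)
            sad = int(np.abs(block - cand).sum())
            if best is None or sad < best[0]:
                best = (sad, dy, dx)
    sad, dy, dx = best
    return dy, dx, sad

# Toy check: the current picture is the reference shifted by (2, -1),
# so the block at (6, 6) matches the reference at offset (-2, +1).
rng = np.random.default_rng(0)
reference = rng.integers(0, 256, size=(16, 16))
current = np.roll(reference, shift=(2, -1), axis=(0, 1))
dy, dx, sad = full_search(current, reference, y=6, x=6, size=4, search_range=3)
```

Practical encoders usually replace the exhaustive loop with faster search patterns and sub-pixel refinement, which this sketch omits.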

The motion compensation unit 504 generates the predicted value of the current block using the corresponding region of the reference picture indicated by the motion vector. In particular, as described above, the motion compensation unit 504 according to the present invention determines the corresponding regions of a plurality of reference pictures by continuously tracking the motion vector of the current block, and generates the predicted value of the current block as a weighted sum of the determined corresponding regions. The detailed configuration and operation of the motion compensation unit 504 according to the present invention are described later.

The intra prediction unit 506 performs intra prediction, which searches for the predicted value of the current block within the current picture.

Once the prediction block of the current block has been generated by inter prediction, by intra prediction, or by the prediction scheme according to the present invention that uses the corresponding regions of a plurality of reference pictures, the residual, which is the error between the current block and the prediction block, is generated. The residual is transformed into the frequency domain by the transform unit 508 and quantized by the quantization unit 510. The entropy coding unit 514 encodes the quantized residual and outputs a bitstream.

To obtain a reference picture, the quantized picture is reconstructed by the inverse quantization unit 516 and the inverse transform unit 518. The reconstructed current picture passes through the filter 520, which performs deblocking filtering, and is then stored in the frame memory 522 for use in predicting the next picture.

The control unit 525 controls each component of the video encoding apparatus 500 and determines the prediction mode of the current block. Specifically, the control unit 525 compares the cost between the current block and a block predicted by ordinary inter or intra prediction with the cost between the current block and a block predicted using corresponding regions of a plurality of reference pictures according to the present invention, and selects the prediction mode with the minimum cost. The cost may be calculated in various ways. Cost functions that may be used include the SAD (Sum of Absolute Difference), SATD (Sum of Absolute Transformed Difference), SSD (Sum of Squared Difference), MAD (Mean of Absolute Difference), and a Lagrangian function. The SAD is the sum of the absolute values of the prediction errors (residuals) of each 4×4 block. The SATD is the sum of the absolute values of the coefficients obtained by applying a Hadamard transform to the prediction errors of each 4×4 block. The SSD is the sum of the squared prediction errors of the prediction samples of each 4×4 block, and the MAD is the mean of the absolute prediction errors of the prediction samples of each 4×4 block. The Lagrangian function is a new function formed by including bitstream length information in the cost function.
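The cost functions listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, blocks are 4×4 lists of integer samples, the Hadamard transform is left unnormalized, and the Lagrangian cost is taken in the common form of distortion plus a multiplier times the bit count, which the text does not spell out.

```python
# Illustrative sketch only; all names here are hypothetical, not from the patent.

def residual(cur, pred):
    """Per-sample prediction error between the current and predicted 4x4 block."""
    return [[c - p for c, p in zip(cr, pr)] for cr, pr in zip(cur, pred)]

def sad(cur, pred):
    """Sum of absolute prediction errors."""
    return sum(abs(e) for row in residual(cur, pred) for e in row)

def ssd(cur, pred):
    """Sum of squared prediction errors."""
    return sum(e * e for row in residual(cur, pred) for e in row)

def mad(cur, pred):
    """Mean of absolute prediction errors."""
    errs = [abs(e) for row in residual(cur, pred) for e in row]
    return sum(errs) / len(errs)

# 4x4 Hadamard matrix (symmetric, so it equals its own transpose).
H4 = [[1, 1, 1, 1],
      [1, 1, -1, -1],
      [1, -1, -1, 1],
      [1, -1, 1, -1]]

def matmul4(a, b):
    """Product of two 4x4 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def satd(cur, pred):
    """Sum of absolute Hadamard-transformed errors (unnormalized: H * E * H)."""
    coeffs = matmul4(matmul4(H4, residual(cur, pred)), H4)
    return sum(abs(c) for row in coeffs for c in row)

def lagrangian_cost(distortion, bits, lam):
    """Assumed form J = D + lambda * R, where R is the bit count."""
    return distortion + lam * bits

cur = [[10] * 4 for _ in range(4)]
pred = [[8] * 4 for _ in range(4)]
print(sad(cur, pred), ssd(cur, pred), mad(cur, pred), satd(cur, pred))
```

A mode decision in the spirit of the control unit 525 would evaluate one of these costs for each candidate prediction and keep the candidate with the smallest value.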

FIG. 6 is a block diagram showing a specific configuration of the motion compensation unit 504 of FIG. 5. Referring to FIG. 6, the motion compensation unit 600 includes a reference picture determination unit 610 and a weighted prediction unit 620.

The reference picture determination unit 610 determines the corresponding region of a reference picture using the motion vector of the current block generated by the motion prediction unit 502. It then determines the corresponding regions of the plurality of reference pictures used to predict the current block by tracking the path of the motion vectors held by each determined corresponding region.

The weighted prediction unit 620 generates the prediction block of the current block by computing a weighted sum of the determined corresponding regions of the plurality of reference pictures. Specifically, the weighted prediction unit 620 includes a weight calculation unit 621, which determines the weights of the corresponding regions, and a prediction block generation unit 622, which generates the prediction block for the current block by multiplying the corresponding regions by their weights and summing the results.

The process by which the reference picture determination unit 610 determines the corresponding regions of the plurality of reference pictures used to predict the current block is described in detail below.

FIG. 7 shows the blocks of various sizes used in variable block size motion prediction in H.264/MPEG-4 AVC, and FIG. 8 shows an example of a picture predicted with variable block size motion prediction. As shown in FIG. 7, a macroblock may be partitioned in four ways: as one 16×16 macroblock partition, two 16×8 partitions, two 8×16 partitions, or four 8×8 partitions. If the 8×8 mode is selected, each of the four 8×8 sub-macroblocks in the macroblock may in turn be partitioned in four ways: as one 8×8 sub-macroblock partition, two 8×4 sub-macroblock partitions, two 4×8 sub-macroblock partitions, or four 4×4 sub-macroblock partitions. A very large number of combinations of these partitions and sub-macroblock partitions is possible within each macroblock. This method of dividing a macroblock into sub-blocks of various sizes is called tree structured motion compensation.
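To give a feel for how many partition layouts a single macroblock admits, the following Python sketch enumerates them. It counts partition shapes only (not motion vectors or reference indices), and the mode labels and function name are hypothetical.

```python
from itertools import product

# Hypothetical labels for the macroblock and sub-macroblock partition modes.
MB_MODES = ["16x16", "16x8", "8x16", "8x8"]
SUB_MODES = ["8x8", "8x4", "4x8", "4x4"]

def macroblock_layouts():
    """Yield every partition layout of one macroblock as (mb_mode, sub_modes)."""
    for mode in MB_MODES:
        if mode != "8x8":
            # The three large modes have no further subdivision.
            yield (mode, ())
        else:
            # Each of the four 8x8 sub-macroblocks picks its own sub-mode.
            for subs in product(SUB_MODES, repeat=4):
                yield (mode, subs)

layouts = list(macroblock_layouts())
print(len(layouts))  # 3 + 4**4 = 259
```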

Referring to FIG. 8, low-energy blocks in the picture are motion predicted with large partitions, and high-energy blocks with small partitions. In describing the present invention, the boundaries between the motion blocks into which the current picture is divided by such tree structured motion compensation are defined as motion block boundaries.

As described above, the video encoding method according to the present invention tracks the motion vectors held by the corresponding region of a reference picture in order to determine the corresponding regions of the reference pictures used to predict the current block. However, as shown in FIG. 8, a reference picture is divided into motion blocks of various sizes, so the corresponding region for the current block may not coincide exactly with a single motion block and may instead span several motion blocks. In that case, the corresponding region of the reference picture contains a plurality of motion vectors. The process of tracking the motion vector path in this situation is described next.

FIG. 9 is a diagram for explaining the process, in the video encoding method according to the present invention, of determining the corresponding regions of other reference pictures that are referenced by a corresponding region of a reference picture divided along motion block boundaries.

Referring to FIG. 9, the corresponding region 91 of reference picture 1, indicated by the motion vector MV_1 of the current block 90, spans a plurality of motion blocks. That is, the corresponding region 91 does not match any single motion block but extends over block A, block B, block C, and block D. In this case, the reference picture determination unit 610 divides the corresponding region 91 along the motion block boundaries of reference picture 1, and determines the corresponding regions in reference picture 2 and reference picture 3 indicated by the motion vectors of the motion blocks of reference picture 1 that contain the resulting sub-regions a, b, c, and d. Specifically, the reference picture determination unit 610 determines the corresponding region a' 93 of reference picture 2 using the motion vector MV_a of block A, to which sub-region a belongs; the corresponding region b' 94 of reference picture 2 using the motion vector MV_b of block B, to which sub-region b belongs; the corresponding region c' 96 of reference picture 3 using the motion vector MV_c of block C, to which sub-region c belongs; and the corresponding region d' 95 of reference picture 3 using the motion vector MV_d of block D, to which sub-region d belongs.
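The splitting step of FIG. 9 can be sketched as a rectangle intersection followed by a displacement. The Python below is a minimal illustration under stated assumptions: `Rect`, `MotionBlock`, and `track_one_step` are hypothetical names, regions are axis-aligned rectangles at integer-pel positions, and a motion vector (dx, dy) is taken to mean that the referenced region lies at the sub-region position plus (dx, dy).

```python
from collections import namedtuple

# Hypothetical data types for this sketch.
Rect = namedtuple("Rect", "x y w h")
MotionBlock = namedtuple("MotionBlock", "rect mv ref")  # mv = (dx, dy), ref = picture id

def intersect(a, b):
    """Return the overlap of two rectangles, or None if they do not overlap."""
    x0, y0 = max(a.x, b.x), max(a.y, b.y)
    x1 = min(a.x + a.w, b.x + b.w)
    y1 = min(a.y + a.h, b.y + b.h)
    if x1 <= x0 or y1 <= y0:
        return None
    return Rect(x0, y0, x1 - x0, y1 - y0)

def track_one_step(region, motion_blocks):
    """Split `region` along motion block boundaries.

    Returns a list of (sub_region, ref, displaced_region), where
    displaced_region is the sub-region shifted by the motion vector of
    the motion block that contains it.
    """
    result = []
    for blk in motion_blocks:
        sub = intersect(region, blk.rect)
        if sub is not None:
            moved = Rect(sub.x + blk.mv[0], sub.y + blk.mv[1], sub.w, sub.h)
            result.append((sub, blk.ref, moved))
    return result

# Four 8x8 motion blocks A, B, C, D; A and B refer to picture 2, C and D to picture 3.
blocks = [
    MotionBlock(Rect(0, 0, 8, 8), (-1, 0), 2),   # A
    MotionBlock(Rect(8, 0, 8, 8), (2, 1), 2),    # B
    MotionBlock(Rect(0, 8, 8, 8), (0, -3), 3),   # C
    MotionBlock(Rect(8, 8, 8, 8), (1, 1), 3),    # D
]
steps = track_one_step(Rect(4, 4, 8, 8), blocks)
for sub, ref, moved in steps:
    print(sub, "-> picture", ref, moved)
```

Applying `track_one_step` again to each displaced region, against the motion blocks of the picture it lands in, would continue the tracking to further reference pictures.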

FIG. 9 illustrates the case in which blocks A, B, C, and D, each of which partially contains the corresponding region 91 of reference picture 1 for the current block 90, refer to reference picture 2 and reference picture 3. Even when blocks A, B, C, and D refer to different reference pictures, the corresponding regions of the other reference pictures can be determined in the same way from the motion field information of blocks A, B, C, and D, that is, their motion vectors and reference picture information.

FIG. 10 is a diagram for explaining another example of the process, in the video encoding method according to the present invention, of determining the corresponding regions of other reference pictures that are referenced by a corresponding region of a reference picture divided along motion block boundaries. Referring to FIG. 10, when the corresponding region 100 of the reference picture for the current block is partially contained in blocks A, B1, B2, C, and D, the corresponding region 100 is divided along the motion block boundaries of the reference picture as described above, and the corresponding regions of other reference pictures are determined using the motion field information of the block to which each of the sub-regions a, b1, b2, c, and d belongs. That is, the reference picture determination unit 610 determines the corresponding region in another reference picture for sub-region a using the motion field information of block A, for sub-region b1 using that of block B1, for sub-region b2 using that of block B2, for sub-region c using that of block C, and for sub-region d using that of block D.

The reference picture determination process described above applies not only when determining the corresponding region of another reference picture from the corresponding region indicated by the motion vector of the current block, but equally when determining a further reference picture from that newly determined corresponding region. In other words, tracking can continue as long as the corresponding region includes motion blocks that have motion vector information. However, if the corresponding region lies entirely within intra predicted blocks, or if the area of the corresponding region that lies within intra predicted blocks is equal to or greater than a predetermined threshold, tracking stops at the reference picture reached so far. For example, referring again to FIG. 9, if blocks A, B, C, and D, to which the corresponding region 91 of reference picture 1 belongs, are all intra predicted blocks, no further tracking is performed and only the corresponding region 91 of reference picture 1 is used to predict the current block 90. If blocks A, B, and C are motion blocks with motion vectors and only block D is an intra predicted block, and the area of sub-region d belonging to block D is equal to or greater than the predetermined threshold, the corresponding region 91 of reference picture 1 multiplied by a predetermined weight is used to predict the current block 90. The same decision on whether to continue tracking is made for the corresponding regions of the other reference pictures determined from a reference picture.
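The stopping rule can be written as a small predicate. The sketch below is an assumption-laden illustration: the function name is hypothetical, each sub-region is summarized as an (area, is_intra) pair, and the threshold is expressed as an absolute area in samples, since the text only calls it a predetermined threshold.

```python
def should_continue_tracking(sub_regions, intra_area_threshold):
    """Decide whether motion vector tracking may proceed one more step.

    sub_regions: list of (area, is_intra) pairs, one per sub-region of the
    corresponding region. Tracking stops when the whole region is intra
    predicted, or when the intra predicted area reaches the threshold.
    """
    total_area = sum(area for area, _ in sub_regions)
    intra_area = sum(area for area, is_intra in sub_regions if is_intra)
    if intra_area == total_area:
        return False  # entirely intra predicted: no motion vectors to follow
    return intra_area < intra_area_threshold

# Sub-regions a, b, c are inter predicted; d is intra predicted (16 samples each).
region = [(16, False), (16, False), (16, False), (16, True)]
print(should_continue_tracking(region, intra_area_threshold=32))  # True
print(should_continue_tracking(region, intra_area_threshold=16))  # False
```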

On the other hand, if part of the corresponding region lies within an intra predicted block but the area of that part is less than the predetermined threshold, the tracking process continues and determines the corresponding regions of other reference pictures. In this case, a virtual motion vector can be assigned to the intra predicted block using the motion vectors of the motion blocks surrounding it, and the corresponding region of another reference picture indicated by the virtual motion vector can be determined. In the example above, suppose that blocks A, B, and C are motion blocks with motion vectors MV_a, MV_b, and MV_c, that only block D is an intra predicted block, and that the area of sub-region d belonging to block D is less than the predetermined threshold. Tracking then continues without interruption. For sub-regions a, b, and c, belonging to blocks A, B, and C, the corresponding regions of other reference pictures are determined using MV_a, MV_b, and MV_c as described above. For sub-region d, belonging to block D, the median or mean of the motion vectors MV_a, MV_b, and MV_c of the neighboring motion blocks A, B, and C is assigned to block D as a virtual motion vector, and the corresponding region of another reference picture indicated by the virtual motion vector is determined.
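A virtual motion vector of this kind might be computed as follows. This is a sketch, not the patented procedure: the function name is hypothetical, and taking the median separately on the horizontal and vertical components is an assumption, since the text does not say how the median of several vectors is formed.

```python
from statistics import median

def virtual_motion_vector(neighbor_mvs, use_median=True):
    """Derive a virtual motion vector for an intra predicted block.

    neighbor_mvs: list of (dx, dy) motion vectors of surrounding motion blocks.
    The median (assumed component-wise) or the mean is returned.
    """
    xs = [mv[0] for mv in neighbor_mvs]
    ys = [mv[1] for mv in neighbor_mvs]
    if use_median:
        return (median(xs), median(ys))
    return (sum(xs) / len(xs), sum(ys) / len(ys))

# MV_a, MV_b, MV_c of the neighboring motion blocks A, B, C (example values).
mvs = [(-1, 0), (2, 1), (0, -3)]
print(virtual_motion_vector(mvs))                    # component-wise median
print(virtual_motion_vector(mvs, use_median=False))  # mean
```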

Referring again to FIG. 6, once the reference picture determination unit 610 has tracked the motion vector path of the current block and determined the corresponding regions of the plurality of reference pictures, the weight calculation unit 621 calculates the weight to be assigned to each corresponding region.

The weight calculation unit 621 uses previously processed pixels of the blocks neighboring the current block, together with the neighboring pixels of the corresponding regions of the reference pictures that correspond to them. It selects as the weights the values that minimize the difference between the neighboring pixels of the current block as predicted by a weighted sum of the neighboring pixels of the corresponding regions and the actual values of the neighboring pixels of the current block.

FIG. 11 is a diagram for explaining the process of calculating the weights assigned to the corresponding regions of the reference pictures in the video encoding method according to the present invention. In FIG. 11, let D_t be the current block, D_{t-1} the corresponding region of reference picture (t-1) for the current block D_t, and D_{t-2,a}, D_{t-2,b}, D_{t-2,c}, D_{t-2,d} (collectively "D_{t-2}") the corresponding regions of reference picture (t-2) for the sub-regions a, b, c, and d of D_{t-1}, and let P_t be the prediction block of the current block D_t.

The weight calculation unit 621 assigns weights per reference picture; that is, it assigns the same weight to corresponding regions belonging to the same reference picture. In FIG. 11, if α is the weight assigned to the corresponding region D_{t-1} of reference picture (t-1) and β is the weight assigned to the corresponding region D_{t-2} of reference picture (t-2), the prediction block P_t of the current block D_t is calculated as the weighted sum of D_{t-1} and D_{t-2}, as in Equation (1).
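Equation (1) is not reproduced in this text. Based on the definitions just given, it would read as follows (a reconstruction, with the products applied sample by sample):

```latex
\[
P_t = \alpha \, D_{t-1} + \beta \, D_{t-2} \tag{1}
\]
```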

The weights α and β assigned to the corresponding regions of the reference pictures can be determined by various algorithms. The present invention uses the weights that minimize the error between the current block D_t and the prediction block P_t. This error, expressed as the SSE (Sum of Squared Error), is given by Equation (2).
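Equation (2) is likewise missing from this text. Assuming the sum runs over all samples of the block and P_t is the weighted sum of D_{t-1} and D_{t-2} with weights α and β, a consistent reconstruction is:

```latex
\[
SSE = \sum \left( D_t - P_t \right)^2
    = \sum \left[ D_t - \left( \alpha D_{t-1} + \beta D_{t-2} \right) \right]^2 \tag{2}
\]
```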

The weights α and β can be determined by solving the partial differential equations of Equation (3), which set the partial derivatives of the SSE with respect to α and β to zero.
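Equation (3) does not appear in this text. Assuming the SSE is the sum over the block of the squared difference between D_t and αD_{t-1} + βD_{t-2}, the conditions it expresses, together with their expanded form, can be reconstructed as:

```latex
\begin{align}
\frac{\partial SSE}{\partial \alpha}
  &= -2 \sum D_{t-1} \left[ D_t - \left( \alpha D_{t-1} + \beta D_{t-2} \right) \right] = 0 \notag \\
\frac{\partial SSE}{\partial \beta}
  &= -2 \sum D_{t-2} \left[ D_t - \left( \alpha D_{t-1} + \beta D_{t-2} \right) \right] = 0 \tag{3}
\end{align}
```

These are two linear equations in the two unknowns α and β.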

To solve Equation (3), the pixels of the blocks neighboring the current block and the neighboring pixels of the corresponding regions of the reference pictures that correspond to them are used. Because the decoder can then determine the weights from neighboring pixel information of the current block that it has already decoded, the weights used to predict the current block need not be transmitted separately. In other words, the encoder and the decoder both compute the weights from previously processed data, so the weights assigned to the corresponding regions of the reference pictures do not have to be signaled.

Referring again to FIG. 11, just as the prediction block P_t of the current block D_t is calculated from the corresponding region D_{t-1} of reference picture (t-1) and the corresponding region D_{t-2} of reference picture (t-2), the pixels N_t of the blocks neighboring the current block D_t can be calculated, taking into account their spatial position relative to D_t, from the neighboring pixels N_{t-1,a}, N_{t-1,b}, N_{t-1,c} of the corresponding region D_{t-1} and the neighboring pixels N_{t-2,a}, N_{t-2,b}, N_{t-2,c} of the corresponding region D_{t-2}. In this case, the SSE between the pixels N_t of the neighboring blocks of the current block D_t and their predicted values N_t', which are predicted from the neighboring pixels N_{t-1} of D_{t-1} and the neighboring pixels N_{t-2} of D_{t-2}, is given by Equation (4).
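Equation (4) is not reproduced in this text. Assuming the predicted neighboring pixels are formed with the same weights α and β that are applied to the corresponding regions, a consistent reconstruction is:

```latex
\[
SSE = \sum \left( N_t - N_t' \right)^2
    = \sum \left[ N_t - \left( \alpha N_{t-1} + \beta N_{t-2} \right) \right]^2 \tag{4}
\]
```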

The weight calculation unit 621 partially differentiates the SSE of Equation (4) with respect to α and β and determines the values of α and β at which the partial derivatives become zero.
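Setting both partial derivatives of the neighboring-pixel SSE to zero yields a 2×2 linear system, which the following Python sketch solves with Cramer's rule. The function name, the flat-list representation of the neighboring pixels, and the equal-weight fallback for a singular system are all assumptions made for illustration.

```python
def solve_weights(n_t, n_t1, n_t2):
    """Least-squares weights (alpha, beta) such that
    alpha * n_t1 + beta * n_t2 best approximates n_t.

    n_t:  neighboring pixels of the current block
    n_t1: matching neighboring pixels of the region in picture (t-1)
    n_t2: matching neighboring pixels of the region in picture (t-2)
    """
    s11 = sum(a * a for a in n_t1)
    s22 = sum(b * b for b in n_t2)
    s12 = sum(a * b for a, b in zip(n_t1, n_t2))
    s1t = sum(a * t for a, t in zip(n_t1, n_t))
    s2t = sum(b * t for b, t in zip(n_t2, n_t))
    # Normal equations:
    #   alpha * s11 + beta * s12 = s1t
    #   alpha * s12 + beta * s22 = s2t
    det = s11 * s22 - s12 * s12
    if det == 0:
        return 0.5, 0.5  # assumed fallback when the system is singular
    alpha = (s1t * s22 - s2t * s12) / det
    beta = (s11 * s2t - s12 * s1t) / det
    return alpha, beta

n_t1 = [10, 20, 30, 50]
n_t2 = [20, 10, 40, 30]
n_t = [0.75 * a + 0.25 * b for a, b in zip(n_t1, n_t2)]
alpha, beta = solve_weights(n_t, n_t1, n_t2)
print(alpha, beta)
```

Because only previously processed neighboring pixels enter the computation, a decoder could run the same routine and arrive at the same weights without any side information.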

If, on the other hand, the values of α and β in Equation (1) are normalized so that α+β=1, then β=1−α. Substituting this into Equation (1) gives Equation (5), and the SSE in this case is given by Equation (6).
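Equations (5) and (6) are missing from this text. Assuming Equation (1) is the weighted sum of D_{t-1} and D_{t-2} with weights α and β, substituting β = 1 − α gives the following reconstruction:

```latex
\begin{align}
P_t &= \alpha D_{t-1} + (1 - \alpha) D_{t-2} \tag{5} \\
SSE &= \sum \left[ D_t - \left( \alpha D_{t-1} + (1 - \alpha) D_{t-2} \right) \right]^2 \tag{6}
\end{align}
```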

Partially differentiating the SSE of Equation (6) with respect to α and setting the derivative to zero, ∂SSE/∂α = 0, yields the value of α given by Equation (7).
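Equation (7) is not reproduced in this text. Assuming the SSE has the form of a sum of squared differences between D_t and αD_{t-1} + (1−α)D_{t-2}, rewriting the bracketed term as (D_t − D_{t-2}) − α(D_{t-1} − D_{t-2}) leads to the following reconstruction:

```latex
\begin{align}
\frac{\partial SSE}{\partial \alpha}
  &= -2 \sum \left( D_{t-1} - D_{t-2} \right)
     \left[ \left( D_t - D_{t-2} \right) - \alpha \left( D_{t-1} - D_{t-2} \right) \right] = 0 \notag \\
\alpha &= \frac{\sum \left( D_t - D_{t-2} \right) \left( D_{t-1} - D_{t-2} \right)}
               {\sum \left( D_{t-1} - D_{t-2} \right)^2} \tag{7}
\end{align}
```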

As described above, so that the weights can be determined without separately transmitting the weight of each corresponding region, the pixels N_t of the previously processed neighboring blocks are used in place of the current block D_t, N_{t-2} in place of D_{t-2}, and N_{t-1} in place of D_{t-1}.
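With the normalization α+β=1 and the neighboring pixels substituted for the blocks, the closed-form weight can be sketched in Python as below. The function name, the flat-list input format, and the fallback value for a zero denominator are assumptions; the formula itself follows from minimizing the sum of squared differences between N_t and αN_{t-1} + (1−α)N_{t-2}.

```python
def alpha_from_neighbors(n_t, n_t1, n_t2):
    """Closed-form alpha (with beta = 1 - alpha) from neighboring pixels.

    n_t:  previously processed neighboring pixels of the current block
    n_t1: matching neighboring pixels of the region in picture (t-1)
    n_t2: matching neighboring pixels of the region in picture (t-2)
    """
    num = sum((t - b) * (a - b) for t, a, b in zip(n_t, n_t1, n_t2))
    den = sum((a - b) ** 2 for a, b in zip(n_t1, n_t2))
    if den == 0:
        return 0.5  # assumed fallback: both reference neighborhoods are identical
    return num / den

n_t1 = [10, 20, 30, 50]
n_t2 = [20, 10, 40, 30]
n_t = [12.5, 17.5, 32.5, 45.0]  # equals 0.75 * n_t1 + 0.25 * n_t2
alpha = alpha_from_neighbors(n_t, n_t1, n_t2)
beta = 1 - alpha
print(alpha, beta)
```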

The weight determination process described above can also be carried out when the corresponding regions of more reference pictures are used, by assigning a weight to each reference picture and determining the weights that minimize the error between the current block and the prediction block.

That is, let D_1, D_2, D_3, ..., D_n be the corresponding regions of the n reference pictures (n is an integer) used to predict the current block D_t, and let W1, W2, W3, ..., Wn be the weights assigned to them. The prediction block P_t of the current block D_t is then calculated as P_t = W1*D_1 + W2*D_2 + W3*D_3 + ... + Wn*D_n. The weights W1, W2, ..., Wn are set to the values at which the partial derivatives of the SSE, the squared error between the current block D_t and the prediction block P_t, with respect to each weight become zero. As described above, the partial differential equations are solved using the pixels of the blocks neighboring the current block and the corresponding neighboring pixels of the corresponding regions of the reference pictures.
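For n reference pictures, the zero-derivative conditions form a system of n linear equations. The derivation below is a sketch in notation that is not in the original: N_i denotes the neighboring pixels of the corresponding region D_i, N_t the neighboring pixels of the current block, and W_i the weight of the i-th region.

```latex
\begin{align}
SSE &= \sum \Bigl( N_t - \sum_{i=1}^{n} W_i N_i \Bigr)^2 \notag \\
\frac{\partial SSE}{\partial W_k}
  &= -2 \sum N_k \Bigl( N_t - \sum_{i=1}^{n} W_i N_i \Bigr) = 0,
  \qquad k = 1, \dots, n \notag \\
\sum_{i=1}^{n} W_i \Bigl( \sum N_k N_i \Bigr) &= \sum N_k N_t,
  \qquad k = 1, \dots, n \notag
\end{align}
```

The last line can be solved for W_1, ..., W_n with any standard linear solver, provided the system is non-singular.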

Referring again to FIG. 6, the prediction block generation unit 622 generates the prediction block of the current block by multiplying the corresponding regions of the reference pictures by the determined weights and summing the results, as in Equation (1).
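The multiply-and-sum step might look like the following Python sketch. The function name is hypothetical, and the rounding to the nearest integer and clipping to the sample range are assumptions that the text does not specify.

```python
def weighted_prediction(regions, weights, bit_depth=8):
    """Form a prediction block as the weighted sum of corresponding regions.

    regions: list of equally sized 2-D sample arrays (lists of rows)
    weights: one weight per region
    Rounding and clipping to [0, 2**bit_depth - 1] are assumed conventions.
    """
    max_val = (1 << bit_depth) - 1
    rows, cols = len(regions[0]), len(regions[0][0])
    pred = []
    for r in range(rows):
        row = []
        for c in range(cols):
            value = sum(w * reg[r][c] for w, reg in zip(weights, regions))
            row.append(min(max_val, max(0, int(value + 0.5))))
        pred.append(row)
    return pred

d_t1 = [[100, 200], [50, 60]]   # corresponding region in picture (t-1)
d_t2 = [[60, 100], [50, 100]]   # corresponding region in picture (t-2)
print(weighted_prediction([d_t1, d_t2], [0.75, 0.25]))
```

The same routine accepts more than two regions, which matches the general case of n reference pictures with weights W1 to Wn.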

The residual, which is the difference between the current block and the prediction block predicted by the motion compensation unit 504 according to the present invention using the corresponding regions of a plurality of reference pictures, is output as a bitstream after transformation, quantization, and entropy encoding.

The header of a bitstream encoded by the video encoding method according to the present invention may include, for each block, a 1-bit flag indicating that the block was predicted using corresponding regions of a plurality of reference pictures. For example, "0" may indicate a bitstream encoded according to the prior art, and "1" a bitstream encoded according to the present invention.

FIG. 12 is a flowchart of the video encoding method according to the present invention. Referring to FIG. 12, in operation 1210, the corresponding regions of the plurality of reference pictures used to predict the current block are determined by tracking the motion vector path of the corresponding region of the reference picture referenced by the current block. As described above, when the corresponding region of a reference picture is divided by motion block boundaries, the corresponding regions of other reference pictures are determined using the motion vector of the motion block to which each sub-region belongs.

In operation 1220, the weights to be applied to the corresponding regions of the plurality of reference pictures are determined. As described above, the weights are set, using the neighboring pixels of the current block and the corresponding neighboring pixels of the corresponding regions of the reference pictures, to the values that minimize the difference between the neighboring pixels of the current block predicted from the neighboring pixels of the corresponding regions and the original neighboring pixels.

In operation 1230, the corresponding regions of the reference pictures are multiplied by the calculated weights and the products are summed to generate the prediction block of the current block.

In operation 1240, the residual, which is the difference between the current block and the prediction block, is transformed, quantized, and entropy encoded to generate a bitstream.

FIG. 13 is a block diagram showing the configuration of a video decoding apparatus according to the present invention. Referring to FIG. 13, the video decoding apparatus 1300 according to the present invention includes an entropy decoder 1310, a reordering unit 1320, an inverse quantization unit 1330, an inverse transform unit 1340, a motion compensation unit 1350, an intra prediction unit 1360, and a filter 1370.

The entropy decoder 1310 and the reordering unit 1320 receive the compressed bitstream and perform entropy decoding to generate quantized coefficients. The inverse quantization unit 1330 and the inverse transform unit 1340 perform inverse quantization and inverse transformation on the quantized coefficients to extract transform coding coefficients, motion vector information, prediction mode information, and the like. The prediction mode information may include a flag indicating whether the current block was encoded using a weighted sum of corresponding regions of a plurality of reference pictures by the video encoding method according to the present invention. As described above, the corresponding regions of the plurality of reference pictures used to decode the current block can be determined, in the same way as in the encoding method, from the motion vector information of the current block being decoded, so information on the corresponding regions need not be transmitted separately.

For a current block that was intra prediction encoded, the intra prediction unit 1360 generates a prediction block using previously decoded neighboring blocks of the current block.

The motion compensation unit 1350 has the same configuration and operation as the motion compensation unit 504 of FIG. 5 described above. That is, when the current block to be decoded was predictively encoded using the weighted sum of the corresponding regions of a plurality of reference pictures, the motion compensation unit 1350 uses the motion vector of the current block included in the bitstream to track the corresponding regions of previously decoded reference pictures and thereby determines the corresponding regions of the reference pictures. It then calculates the weight to be assigned to the corresponding region of each reference picture, multiplies each corresponding region by its weight, and sums the results to generate the prediction block of the current block. As described above, the weights of the corresponding regions of the reference pictures are determined using previously decoded neighboring pixels of the current block and the neighboring pixels of the corresponding regions of the reference pictures that correspond to those neighboring pixels.
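The multiply-and-sum step performed by the motion compensation unit can be illustrated with a minimal Python sketch. The function name `weighted_prediction`, the representation of a region as a list of pixel rows, and the sample values are assumptions made for illustration; they do not come from the patent.

```python
def weighted_prediction(regions, weights):
    """Return the per-pixel weighted sum of equally sized 2-D regions.

    regions: list of corresponding regions, each a list of pixel rows
    weights: one weight per corresponding region (hypothetical values)
    """
    if len(regions) != len(weights):
        raise ValueError("one weight per corresponding region is required")
    height, width = len(regions[0]), len(regions[0][0])
    pred = [[0.0] * width for _ in range(height)]
    for region, w in zip(regions, weights):
        for y in range(height):
            for x in range(width):
                # Multiply each pixel by the weight of its region and accumulate.
                pred[y][x] += w * region[y][x]
    return pred


# Two hypothetical 2x2 corresponding regions from two reference pictures.
region_a = [[100, 104], [96, 100]]
region_b = [[110, 108], [104, 100]]
prediction = weighted_prediction([region_a, region_b], [0.5, 0.5])
```

With equal weights the result is simply the pixel-wise average of the two regions; in the scheme described above the weights would instead be derived from the neighboring pixels.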

The prediction block generated by the motion compensation unit 1350 or the intra prediction unit 1360 is added to the error value D'_n between the current block and the prediction block, which is extracted from the bitstream, to produce restored video data uF'_n. The restored data uF'_n then passes through the filter 1370, which completes the decoding of the current block.

FIG. 14 is a flowchart illustrating a video decoding method according to the present invention. Referring to FIG. 14, in step 1410, the prediction mode information included in the input bitstream is read to determine the prediction mode of the current block to be decoded.

In step 1420, if it is determined that the current block to be decoded was predicted using the corresponding regions of a plurality of reference pictures, the corresponding regions of the plurality of reference pictures to be used for prediction of the current block are determined by tracking the corresponding region of the reference picture indicated by the motion vector of the current block included in the bitstream, together with the motion vector path of that corresponding region.
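The tracking in step 1420 can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the patented procedure itself: it assumes a fixed motion block size (`BLOCK`), a hypothetical `motion_fields` dictionary that stores one motion vector and reference picture per inter block, and it simply stops at intra blocks (blocks with no entry) rather than handling them further. All names are hypothetical.

```python
BLOCK = 4  # assumed motion block size in pixels


def split_on_block_grid(x, y, w, h, block=BLOCK):
    """Split rectangle (x, y, w, h) along multiples of `block`."""
    parts = []
    cy = y
    while cy < y + h:
        ny = min((cy // block + 1) * block, y + h)
        cx = x
        while cx < x + w:
            nx = min((cx // block + 1) * block, x + w)
            parts.append((cx, cy, nx - cx, ny - cy))
            cx = nx
        cy = ny
    return parts


def track(region, picture, motion_fields, max_depth):
    """Follow motion vectors from `region` in `picture` to earlier pictures.

    motion_fields[picture][(bx, by)] is ((dx, dy), ref_picture) for an inter
    block; a missing entry stands for an intra block, where tracking stops.
    Returns a list of (picture, (x, y, w, h)) for every region visited.
    """
    visited = [(picture, region)]
    if max_depth == 0:
        return visited
    for (x, y, w, h) in split_on_block_grid(*region):
        entry = motion_fields.get(picture, {}).get((x // BLOCK, y // BLOCK))
        if entry is None:
            continue  # intra block or no motion data: stop on this branch
        (dx, dy), ref = entry
        # Each sub-region follows the motion vector of the block containing it.
        visited += track((x + dx, y + dy, w, h), ref, motion_fields, max_depth - 1)
    return visited


# Hypothetical motion data: picture "t-1" has two inter blocks that refer to "t-2".
motion_fields = {
    "t-1": {(0, 0): ((1, 0), "t-2"), (1, 0): ((0, 1), "t-2")},
    "t-2": {},
}
# The current block's motion vector is assumed to point at (2, 0, 4, 4) in "t-1".
path = track((2, 0, 4, 4), "t-1", motion_fields, max_depth=2)
```

Here the 4x4 region straddles two motion blocks of picture "t-1", so it is split into two 2x4 parts, and each part is carried into "t-2" by the motion vector of the block that contains it.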

In step 1430, the weights to be applied to the corresponding regions of the plurality of reference pictures are calculated using previously decoded neighboring pixels of the current block and the neighboring pixels of the corresponding regions of the reference pictures that correspond to those neighboring pixels. The prediction block of the current block is then generated by obtaining the weighted sum of the corresponding regions of the plurality of reference pictures.
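One way to picture the weight calculation in step 1430 is as a small least-squares fit. The sketch below assumes that the difference to be minimized is the sum of squared errors between the neighboring pixels of the current block and the weighted sum of the neighboring pixels of the reference regions; that error criterion, the function names, and the sample pixel values are assumptions for illustration only.

```python
def solve_linear(a, b):
    """Solve a*x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("neighbouring pixel sets are linearly dependent")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [v - f * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def estimate_weights(current_neighbours, reference_neighbours):
    """Least-squares weights so that sum_k w_k * ref_k approximates current.

    current_neighbours: previously decoded pixels next to the current block
    reference_neighbours: one list per reference picture holding the pixels
        next to its corresponding region, in the same order
    """
    def dot(u, v):
        return sum(p * q for p, q in zip(u, v))

    # Normal equations of the squared-error minimisation (assumed criterion).
    a = [[dot(ri, rj) for rj in reference_neighbours] for ri in reference_neighbours]
    b = [dot(ri, current_neighbours) for ri in reference_neighbours]
    return solve_linear(a, b)


# Hypothetical neighbouring pixels; current = 0.75 * ref1 + 0.25 * ref2.
ref1 = [100, 120, 80, 90]
ref2 = [110, 100, 90, 70]
current = [102.5, 115.0, 82.5, 85.0]
weights = estimate_weights(current, [ref1, ref2])  # close to [0.75, 0.25]
```

Because the weights are computed only from pixels that are already available at the decoder, the same calculation can be repeated on the decoding side without the weights being transmitted.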

In step 1440, the current block is decoded by adding the generated prediction block to the difference value between the current block and the prediction block that is included in the bitstream.
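The addition in step 1440 can be sketched as below. Rounding to integers and clipping to the range of an 8-bit sample are assumptions added for illustration; the function name and sample values are hypothetical.

```python
def reconstruct_block(prediction, residual, bit_depth=8):
    """Add the decoded difference values to the prediction block.

    The result is rounded and clipped to [0, 2**bit_depth - 1]; both the
    rounding and the clipping are assumptions of this sketch.
    """
    max_value = (1 << bit_depth) - 1
    return [
        [min(max(int(round(p + d)), 0), max_value) for p, d in zip(prow, drow)]
        for prow, drow in zip(prediction, residual)
    ]


# Hypothetical 2x2 prediction block and difference values from the bitstream.
prediction = [[105.0, 106.0], [100.0, 250.0]]
residual = [[-3, 2], [0, 10]]
block = reconstruct_block(prediction, residual)
```

In this example the last sample would be 260 before clipping and is limited to 255.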

The present invention can also be embodied as computer-readable code on a computer-readable recording medium. The computer-readable recording medium includes every kind of recording device that stores data readable by a computer system. Examples of the computer-readable recording medium include ROM (Read-Only Memory), RAM (Random-Access Memory), CD-ROM, magnetic tape, floppy (registered trademark) disks, and optical data storage devices, and also include media embodied in the form of a carrier wave (for example, transmission via the Internet). The computer-readable recording medium can also be distributed over computer systems connected through a network, so that the computer-readable code is stored and executed in a distributed manner.

The present invention has been described above with a focus on its preferred embodiments. Those skilled in the art to which the present invention pertains will understand that the present invention can be embodied in modified forms without departing from its essential characteristics. Accordingly, the disclosed embodiments should be considered from an illustrative rather than a limiting point of view. The scope of the present invention is defined by the claims rather than by the foregoing description, and all differences within a scope equivalent to the claims should be construed as being included in the present invention.

Claims

A video encoding method comprising:
determining corresponding regions of a plurality of reference pictures to be used for prediction of a current block of a current picture by tracking a motion vector path of a corresponding region of a reference picture referred to by the current block;
generating a prediction block of the current block by obtaining a weighted sum of the corresponding regions of the plurality of reference pictures; and
encoding a difference between the current block and the prediction block.

The video encoding method according to claim 1, wherein determining the corresponding regions of the plurality of reference pictures comprises:
performing motion prediction on the current block and determining a corresponding region of a first reference picture corresponding to the current block;
dividing the corresponding region of the first reference picture along motion block boundaries of the first reference picture; and
determining a corresponding region of a second reference picture indicated by a motion vector of a motion block of the first reference picture that contains a divided part of the corresponding region of the first reference picture.

The video encoding method according to claim 2, wherein determining the corresponding region of the second reference picture comprises:
for a part of the corresponding region of the first reference picture, divided along the motion block boundaries of the first reference picture, that is included in an intra prediction block, determining a virtual motion vector of the intra prediction block using motion vectors of neighboring motion blocks of the intra prediction block, and determining the corresponding region of the second reference picture indicated by the determined virtual motion vector.

The video encoding method according to claim 3, wherein the virtual motion vector of the intra prediction block is one of an average value and a median value of the motion vectors of the neighboring motion blocks.

The video encoding method according to claim 1, wherein tracking the motion vector path of the corresponding region of the reference picture comprises determining corresponding regions from the corresponding region of the second reference picture indicated by the motion vector of the current block up to the corresponding region of an n-th reference picture indicated by a motion vector of an (n-1)th reference picture, where n is 3 or more and the n-th reference picture is a reference picture of the (n-1)th reference picture.

The video encoding method according to claim 5, wherein tracking the motion vector path of the corresponding region of the reference picture is performed up to a reference picture in which, within the corresponding region of the n-th reference picture, the size of the part included in an intra-predicted block is greater than or equal to a predetermined threshold value.

The video encoding method according to claim 1, wherein generating the prediction block of the current block comprises:
determining weights of the corresponding regions of the plurality of reference pictures; and
generating the prediction block of the current block by multiplying each corresponding region of the plurality of reference pictures by its weight and adding the results of the multiplications.

The video encoding method according to claim 7, wherein the weights are determined, using previously processed pixels of neighboring blocks of the current block and neighboring pixels of the corresponding regions of the reference pictures that correspond to the pixels of the neighboring blocks of the current block, as values that minimize the difference between predicted values of the neighboring pixels of the current block, predicted through a weighted sum of the neighboring pixels of the corresponding regions of the reference pictures, and the pixel values of the neighboring pixels of the current block.

The video encoding method according to claim 1, further comprising inserting a predetermined flag, indicating a block that has been predictively encoded using the plurality of reference pictures, into a predetermined region of the bitstream generated as a result of the encoding.

The video encoding method according to claim 1, wherein determining the corresponding regions of the plurality of reference pictures comprises:
when, within the corresponding region of the first reference picture divided along the motion block boundaries of the first reference picture, the size of the part included in an intra prediction block is greater than or equal to a predetermined threshold value, determining only the corresponding region of the first reference picture as the corresponding region of the reference picture to be used for prediction of the current block; and
wherein generating the prediction block of the current block comprises determining, as the prediction block of the current block, a value obtained by multiplying the corresponding region of the first reference picture by a predetermined weight.

A video encoding apparatus comprising:
a reference picture determination unit that determines corresponding regions of a plurality of reference pictures to be used for prediction of a current block by tracking a motion vector path of a corresponding region of a reference picture referred to by the current block;
a weighted prediction unit that generates a prediction block of the current block by obtaining a weighted sum of the corresponding regions of the plurality of reference pictures; and
an encoding unit that encodes a difference between the current block and the prediction block.

The video encoding apparatus according to claim 11, wherein the reference picture determination unit divides the corresponding region of a first reference picture indicated by the motion vector of the current block along motion block boundaries of the first reference picture, and determines a corresponding region of a second reference picture indicated by a motion vector of a motion block of the first reference picture that contains a divided part of the corresponding region of the first reference picture.

The video encoding apparatus according to claim 12, wherein, for a part of the corresponding region of the first reference picture, divided along the motion block boundaries of the first reference picture, that is included in an intra prediction block, the reference picture determination unit determines a virtual motion vector of the intra prediction block using motion vectors of neighboring motion blocks of the intra prediction block, and determines the corresponding region of the second reference picture indicated by the determined virtual motion vector.

The video encoding apparatus according to claim 13, wherein the virtual motion vector of the intra prediction block is one of an average value and a median value of the motion vectors of the neighboring motion blocks.

The video encoding apparatus according to claim 11, wherein the reference picture determination unit determines corresponding regions from the corresponding region of the second reference picture indicated by the motion vector of the current block up to the corresponding region of an n-th reference picture indicated by a motion vector of an (n-1)th reference picture, where n is 3 or more and the n-th reference picture is a reference picture of the (n-1)th reference picture.

The video encoding apparatus according to claim 15, wherein the reference picture determination unit performs the tracking up to a reference picture in which, within the corresponding region of the n-th reference picture, the size of the part included in an intra-predicted block is greater than or equal to a predetermined threshold value.

The video encoding apparatus as described above, wherein the weighted prediction unit includes:
a weight calculation unit that determines weights of the corresponding regions of the plurality of reference pictures; and
a prediction block generation unit that generates the prediction block of the current block by multiplying each corresponding region of the plurality of reference pictures by its weight and adding the results of the multiplications.

The video encoding apparatus as described above, wherein the weight calculation unit determines the weights, using previously processed pixels of neighboring blocks of the current block and neighboring pixels of the corresponding regions of the reference pictures that correspond to the pixels of the neighboring blocks of the current block, as values that minimize the difference between predicted values of the neighboring pixels of the current block, predicted through a weighted sum of the neighboring pixels of the corresponding regions of the reference pictures, and the pixel values of the neighboring pixels of the current block.

The video encoding apparatus as described above, wherein the encoding unit inserts a predetermined flag, indicating a block that has been predictively encoded using the plurality of reference pictures, into a predetermined region of the bitstream generated as a result of the encoding.

The video encoding apparatus according to claim 11, wherein, when, within the corresponding region of the first reference picture divided along the motion block boundaries of the first reference picture, the size of the part included in an intra prediction block is greater than or equal to a predetermined threshold value, the reference picture determination unit determines only the corresponding region of the first reference picture as the corresponding region of the reference picture to be used for prediction of the current block, and
the weighted prediction unit determines, as the prediction block of the current block, a value obtained by multiplying the corresponding region of the first reference picture by a predetermined weight.

A video decoding method comprising:
reading prediction mode information included in an input bitstream and determining a prediction mode of a current block to be decoded;
when the determination shows that the current block was predicted using corresponding regions of a plurality of reference pictures, determining the corresponding regions of the plurality of reference pictures to be used for prediction of the current block by tracking the corresponding region of a reference picture indicated by a motion vector of the current block included in the bitstream and a motion vector path of the corresponding region of the reference picture;
generating a prediction block of the current block by obtaining a weighted sum of the corresponding regions of the plurality of reference pictures; and
decoding the current block by adding the generated prediction block to a difference value between the current block and the prediction block that is included in the bitstream.

The video decoding method according to claim 21, wherein determining the corresponding regions of the plurality of reference pictures comprises:
dividing the corresponding region of a first reference picture indicated by the motion vector of the current block along motion block boundaries of the first reference picture; and
determining a corresponding region of a second reference picture indicated by a motion vector of a motion block of the first reference picture that contains a divided part of the corresponding region of the first reference picture.

The video decoding method according to claim 21, wherein tracking the motion vector path of the corresponding region of the reference picture comprises determining corresponding regions from the corresponding region of the second reference picture indicated by the motion vector of the current block up to the corresponding region of an n-th reference picture indicated by a motion vector of an (n-1)th reference picture, where n is 3 or more and the n-th reference picture is a reference picture of the (n-1)th reference picture.

The video decoding method according to claim 21, wherein generating the prediction block of the current block comprises:
determining, as the weights of the corresponding regions of the plurality of reference pictures, using previously processed pixels of neighboring blocks of the current block and neighboring pixels of the corresponding regions of the reference pictures that correspond to the pixels of the neighboring blocks of the current block, values that minimize the difference between predicted values of the neighboring pixels of the current block, predicted through a weighted sum of the neighboring pixels of the corresponding regions of the reference pictures, and the pixel values of the neighboring pixels of the current block; and
generating the prediction block of the current block by multiplying the corresponding regions of the plurality of reference pictures by the weights and adding the results.

A video decoding apparatus comprising:
a prediction mode determination unit that reads prediction mode information included in an input bitstream and determines a prediction mode of a current block to be decoded;
a reference picture determination unit that, when the determination shows that the current block was predicted using corresponding regions of a plurality of reference pictures, determines the corresponding regions of the plurality of reference pictures to be used for prediction of the current block by tracking the corresponding region of a reference picture indicated by a motion vector of the current block included in the bitstream and a motion vector path of the corresponding region of the reference picture;
a weighted prediction unit that generates a prediction block of the current block by obtaining a weighted sum of the corresponding regions of the plurality of reference pictures; and
a decoding unit that decodes the current block by adding the generated prediction block to a difference value between the current block and the prediction block that is included in the bitstream.