JP2008539646A

JP2008539646A - Video coding method and apparatus for providing high-speed FGS

Info

Publication number: JP2008539646A
Application number: JP2008508745A
Authority: JP
Inventors: ウ−ジン・ハン; キョ−ヒュク・イ; サン−チャン・チャ
Original assignee: Samsung Electronics Co Ltd
Current assignee: Samsung Electronics Co Ltd
Priority date: 2005-04-29
Filing date: 2006-04-20
Publication date: 2008-11-13
Also published as: AU2006241637A1; CA2609648A1; WO2006118383A1; RU2340115C1; EP1878261A1

Abstract

本発明は、多階層基盤のＰＦＧＳアルゴリズムの演算量を減少させる方法、その方法を用いたビデオコーディング方法及び装置に関する。
ビデオエンコーディング方法は、所定の精度で推定された動きベクトルを用いて現在フレームに対する予測イメージを求めるステップと、現在フレームと予測イメージ間の残差を量子化した後、逆量子化することによって現在フレームの復元イメージを生成するステップと、推定された動きベクトルを用いてＦＧＳ階層の参照フレームと基礎階層の参照フレームを動き補償するステップと、動き補償されたＦＧＳ階層の参照フレームと動き補償された基礎階層の参照フレームとの差分を求めるステップと、現在フレームから復元イメージ及び差分を差し引くステップと、差し引き結果を符号化するステップとからなる。The present invention relates to a method for reducing the amount of computation of a multi-layer based PFGS algorithm, and a video coding method and apparatus using the method.
The video encoding method includes a step of obtaining a predicted image for a current frame using a motion vector estimated with a predetermined accuracy, and quantizing a residual between the current frame and the predicted image, and then inversely quantizing the current frame. Generating a reconstructed image of the FGS layer, using the estimated motion vector to motion compensate the reference frame of the FGS layer and the reference frame of the base layer, and the motion compensated reference frame of the FGS layer and the motion compensated base The step includes a step of obtaining a difference from the reference frame of the hierarchy, a step of subtracting the restored image and the difference from the current frame, and a step of encoding the subtraction result.

Description

本発明は、ビデオコーディング技術に関し、より詳しくは、多階層基盤のＰＦＧＳアルゴリズムの演算量を減少させる方法、その方法を用いたビデオコーディング方法及び装置に関する。 The present invention relates to a video coding technique, and more particularly to a method for reducing the amount of computation of a multi-layer based PFGS algorithm, and a video coding method and apparatus using the method.

インターネットを含め情報通信技術の発達により、文字、音声だけでなく、画像通信も増加しつつある。既存の文字中心の通信方式は消費者の多様な欲求を充足させられず、文字、映像、音楽など多様な形態の情報を受け入れることができるマルチメディアサービスが増加している。マルチメディアデータは、その量が膨大で大容量の格納媒体を必要とし、伝送時に広い帯域幅を必要とする。したがって、文字、映像、オーディオを含むマルチメディアデータを伝送するためには、圧縮コーディング技法の使用が必須である。 With the development of information communication technology including the Internet, not only text and voice but also image communication is increasing. Existing character-centric communication methods cannot satisfy the diverse needs of consumers, and multimedia services that can accept various forms of information such as characters, video, and music are increasing. Multimedia data requires an enormous amount and a large capacity storage medium, and requires a wide bandwidth during transmission. Therefore, in order to transmit multimedia data including characters, video, and audio, it is essential to use a compression coding technique.

データを圧縮する基本的な原理は、データの重複要素を除去する過程である。イメージで同じ色やオブジェクトが繰り返されるような空間的重複、動画フレームで隣接フレームがほとんど変化しない場合やオーディオで同じ音が繰り返されるような時間的重複、人間の視覚及び知覚能力が高い周波数に鈍感なことを考慮して知覚的重複を除去することによって、データを圧縮することができる。一般的なビデオコーディング方法において、時間的重複は動き補償に基づく時間的フィルタリングで除去し、空間的重複は空間的変換で除去する。 The basic principle of compressing data is a process of removing duplicate elements of data. Spatial overlap in which the same color and object are repeated in the image, temporal overlap in which the adjacent frame is hardly changed in the video frame and the same sound is repeated in the audio, insensitive to frequencies with high human visual and perceptual ability The data can be compressed by removing perceptual duplication taking account of this. In a general video coding method, temporal overlap is removed by temporal filtering based on motion compensation, and spatial overlap is removed by spatial transformation.

データの重複を除去した後、生成されるマルチメディアを伝送するためには伝送媒体が必要となるが、その性能は伝送媒体別に差がある。現在使われている伝送媒体は、１秒当たり数十メガビットのデータを伝送し得る超高速通信網及び１秒当たり３８４ｋｂｉｔの伝送速度を有する移動通信網など多様な伝送速度を有する。多様な速度の伝送媒体を提供したり、または伝送環境に応じて適する伝送率でマルチメディアを伝送したりすることが可能な、拡張性を持ったデータコーディング方法がマルチメディア環境にふさわしいのかもしれない。 After removing data duplication, a transmission medium is required to transmit the generated multimedia, but the performance varies depending on the transmission medium. Currently used transmission media have various transmission speeds such as an ultrahigh-speed communication network capable of transmitting data of several tens of megabits per second and a mobile communication network having a transmission speed of 384 kbits per second. A scalable data coding method that can provide transmission media of various speeds or transmit multimedia at a transmission rate suitable for the transmission environment may be suitable for the multimedia environment. Absent.

拡張性とは、ビットストリームの復号を完全には行えない可能性も意味する。また、ビデオの解像度を意味する空間的拡張や、ビデオの質のレベルに対するＳＮＲ拡張性、フレーム率に対する時間的拡張性などを含む。 Extensibility also means that the bitstream cannot be completely decoded. It also includes spatial expansion, meaning video resolution, SNR scalability for video quality level, temporal scalability for frame rate, and the like.

現在、ＭＰＥＧとＩＴＵの共同作業グループであるＪＶＴでは、Ｈ．２６４を基本に多階層状に拡張性を実現するための標準化作業を進行している。ＪＶＴでは、ＳＮＲ拡張性を提供するために既存のＦＧＳ技術を採択している。 Currently, JVT, a joint working group of MPEG and ITU, Standardization work for realizing expandability in multiple layers based on H.264 is in progress. JVT has adopted existing FGS technology to provide SNR extensibility.

図１は、従来のＦＧＳ技術を説明する図面である。ＦＧＳ基盤のコーデックは、基礎階層とＦＧＳ階層とに分けてコーディングを行う。本明細書においてプライム（’）符号は、元イメージでなく、量子化／逆量子化を経て生成された、すなわち復元されたイメージを示す。具体的に、現在オリジナルフレーム１２で、あるブロックＯは、動きベクトルによって左側の復元された基礎階層フレーム１１の対応するブロックＭ_Ｂ’、及び右側の復元された基礎階層フレーム１２の対応するブロックＮ_Ｂ’から予測されたブロックＰ_Ｂと差し引かれて差分ブロックＲ_Ｂになる。したがって、Ｒ_Ｂは下記数式（１）で表される。 FIG. 1 is a diagram for explaining a conventional FGS technique. The FGS-based codec performs coding in a base layer and an FGS layer. In this specification, a prime (') code indicates an image generated through quantization / inverse quantization, that is, a restored image, not an original image. Specifically, in the current original frame 12, a certain block O includes a corresponding block M _B ′ of the restored base layer frame 11 on the left side by a motion vector and a corresponding block N of the restored base layer frame 12 on the right side. It becomes difference block R _B is subtracted the predicted block P _B from _{B '.} Thus, _{R B} is represented by the following equation (1).

Ｒ_Ｂ＝Ｏ−Ｐ_Ｂ＝Ｏ−（Ｍ_Ｂ’＋Ｎ_Ｂ’）／２・・・（１） _{_{R B = O-P B =}} O- (M B '+ N B') / 2 ··· (1)

差分ブロックＲ_Ｂは、基礎階層の量子化ステップＱＰ_Ｂによって量子化された後Ｒ_Ｂ ^Ｑ、また逆量子化過程を経て復元された差分ブロックＲ_Ｂ’になる。この後、ＦＧＳ階層では前記量子化されない差分ブロックＲ_Ｂと前記復元された差分ブロックＲ_Ｂ’とを差し引き、差し引き結果のブロック△を基礎階層の量子化ステップよりも小さい量子化ステップＱＰ_Ｆによって量子化する（量子化ステップが小さいほど圧縮率が低い）。量子化された△は△^Ｑで表される。結局、デコーダ段に伝送されるデータは基礎階層のＲ_Ｂ ^ＱとＦＧＳ階層の△^Ｑである。 Difference block R _B is, R _{B ^Q,} also becomes difference block R _{B 'restored} through the inverse quantization process after being quantized by the quantization step QP _B base layer. Thereafter, in the FGS layer, the difference block R _B that is not quantized and the restored difference block R _B ′ are subtracted, and the resulting block Δ is quantized by a quantization step QP _F that is smaller than the quantization step of the base layer. (The smaller the quantization step, the lower the compression ratio). The quantized △ is represented by △ ^Q. After all, data to be transmitted to the decoder stage is △ ^Q of the base layer R _B ^Q and FGS layers.

図２は、従来のＰＦＧＳ技術を説明する図面である。既存のＦＧＳ技術が復元された基礎階層の量子化された差分Ｒ_Ｂ’を用いてＦＧＳ階層のデータの量を減らすのに比べ、ＰＦＧＳ技術はＦＧＳ階層で、左右参照フレームもＦＧＳ技術によってその品質が向上していることを用いて、新しく更新された左右参照フレームを用いて新しい差分ブロックＲ_Ｆを計算し、これと基礎階層の量子化されたブロックＲ_Ｂ’との差分を量子化することによって性能を高める。前記Ｒ_Ｆは下記数式（２）で表される。 FIG. 2 is a diagram for explaining a conventional PFGS technique. Compared to reducing the amount of data in the FGS layer using the quantized difference R _B ′ of the base layer where the existing FGS technology is restored, the PFGS technology is the FGS layer, and the right and left reference frames are also improved by the FGS technology. There using that improved, the newly updated using the left and right reference frames to calculate a new difference block R _F, quantizes the difference between the block R _{B 'which} has been quantized in this and the base layer that To increase performance. The _RF is represented by the following formula (2).

Ｒ_Ｆ＝Ｏ−Ｐ_Ｆ＝Ｏ−（Ｍ_Ｆ’＋Ｎ_Ｆ’）／２・・・（２） R _F = O-P _F = O- (M _F '+ N _F ') / 2 (2)

ここで、Ｍ_Ｆ’はＦＧＳ階層の復元された左側参照フレーム２１のうち、動きベクトルによって対応する領域であり、Ｎ_Ｆ’はＦＧＳ階層の復元された右側参照フレーム２３のうち、動きベクトルによって対応する領域である。 Here, M _F ′ is a region corresponding to the motion vector in the restored left reference frame 21 of the FGS layer, and N _F ′ corresponds to the motion vector of the restored right reference frame 23 in the FGS layer. It is an area to do.

ＦＧＳ技術に比べてＰＦＧＳ技術が有する長所は、左右参照フレームの品質が高まることによって、ＦＧＳ階層のデータ量が小さくなり得るという点である。ただし、ＦＧＳ階層でも動き補償が別途に必要であるため、演算量が増加する短所もある。このように、ＰＦＧＳは既存ＦＧＳに比べて性能が向上する長所があるが、ＦＧＳ階層ごとに動き補償によって予測信号を生成し、これに対する残差信号を生成しなければならないため演算量が増加する。最近のビデオコーデックは、１／２ピクセルあるいは１／４ピクセル単位まで補間をして動き補償を行う。もし１／４ピクセル単位に動き補償行う場合、該当解像度の４倍の大きさのイメージを生成しなければならない。 The advantage of the PFGS technology compared to the FGS technology is that the amount of data in the FGS layer can be reduced by increasing the quality of the left and right reference frames. However, since motion compensation is separately required even in the FGS layer, there is a disadvantage that the amount of calculation increases. As described above, the PFGS has an advantage that the performance is improved as compared with the existing FGS. However, the prediction signal must be generated by motion compensation for each FGS layer, and a residual signal corresponding to this must be generated. . Recent video codecs perform motion compensation by interpolating to 1/2 pixel or 1/4 pixel unit. If motion compensation is performed in units of 1/4 pixel, an image having a size four times the corresponding resolution must be generated.

さらに、Ｈ．２６４に基づくＨ．２６４ＳＥの場合、１／２ピクセル補間フィルタは６タップフィルタで、その計算が非常に複雑で動き補償の大部分演算量を占める。したがって、エンコーディングプロセス及びデコーディングプロセスが複雑になるため、より高いシステム資源を必要とし、リアルタイム放送、画像会議などのようにリアルタイムエンコーディング及びデコーディングが要求される分野では特に問題になる。 Further, H.C. H.264 based on H.264. In the case of H.264SE, the ½ pixel interpolation filter is a 6-tap filter, and its calculation is very complicated and occupies most of the amount of motion compensation. Therefore, the encoding process and the decoding process become complicated, which requires higher system resources, and is particularly problematic in fields where real-time encoding and decoding are required, such as real-time broadcasting and image conferencing.

本発明の目的は、ＰＦＧＳアルゴリズムの性能を維持しつつ、動き補償時に要求される演算量を減少し得る方法及びその方法を用いた装置を提供することにある。 An object of the present invention is to provide a method capable of reducing the amount of calculation required at the time of motion compensation while maintaining the performance of the PFGS algorithm, and an apparatus using the method.

また、本発明は前記目的に制限されず、言及していないさらなる目的は下記によって当業者に明確に理解できる。 Further, the present invention is not limited to the above objects, and further objects not mentioned can be clearly understood by those skilled in the art by the following.

上記目的を達成するためのＦＧＳ基盤のビデオエンコーディング方法は、所定の精度で推定された動きベクトルを用いて現在フレームに対する予測イメージを求めるステップと、前記現在フレームと前記予測イメージ間の残差を量子化した後、逆量子化することによって現在フレームの復元イメージを生成するステップと、前記推定された動きベクトルを用いてＦＧＳ階層の参照フレームと基礎階層の参照フレームを動き補償するステップと、前記動き補償されたＦＧＳ階層の参照フレームと前記動き補償された基礎階層の参照フレームとの差分を求めるステップと、前記現在フレームから前記復元イメージ及び前記差分を差し引くステップと、前記差し引き結果を符号化するステップとを含む。 An FGS-based video encoding method for achieving the above object includes a step of obtaining a prediction image for a current frame using a motion vector estimated with a predetermined accuracy, and a residual between the current frame and the prediction image is quantized. And generating a restored image of the current frame by inverse quantization, motion-compensating the reference frame of the FGS layer and the reference frame of the base layer using the estimated motion vector, and the motion Determining a difference between a compensated FGS layer reference frame and the motion compensated base layer reference frame, subtracting the restored image and the difference from the current frame, and encoding the subtraction result Including.

上記目的を達成するためのＦＧＳ基盤のビデオエンコーディング方法は、所定の精度で推定された動きベクトルを用いて現在フレームに対する予測イメージを求めるステップと、前記現在フレームと前記予測イメージ間の残差を量子化した後、逆量子化することによって現在フレームの復元イメージを生成するステップと、前記推定された動きベクトルを用いてＦＧＳ階層の参照フレーム及び基礎階層の参照フレームを動き補償することによって、ＦＧＳ階層の予測フレーム及び基礎階層の予測フレームを生成するステップと、前記ＦＧＳ階層の予測フレームと前記基礎階層の予測フレームとの差分を求めるステップと、前記現在フレームから前記復元イメージ及び前記差分を差し引くステップと、前記差し引き結果を符号化するステップとを含む。 An FGS-based video encoding method for achieving the above object includes a step of obtaining a prediction image for a current frame using a motion vector estimated with a predetermined accuracy, and a residual between the current frame and the prediction image is quantized. And generating a restored image of the current frame by inverse quantization, and performing motion compensation on the reference frame of the FGS layer and the reference frame of the base layer using the estimated motion vector, Generating a prediction frame and a base layer prediction frame, obtaining a difference between the FGS layer prediction frame and the base layer prediction frame, and subtracting the restored image and the difference from the current frame. Encoding the subtraction result; Including.

上記目的を達成するためのＦＧＳ基盤のビデオエンコーディング方法は、所定の精度で推定された動きベクトルを用いて現在フレームに対する予測イメージを求めるステップと、前記現在フレームと前記予測イメージ間の残差を量子化した後、逆量子化することによって現在フレームの復元イメージを生成するステップと、ＦＧＳ階層の参照フレームと基礎階層の参照フレームとの差分を求めるステップと、前記推定された動きベクトルを用いて前記差分を動き補償するステップと、前記現在フレームから前記復元イメージ及び前記動き補償された結果を差し引くステップと、前記差し引き結果を符号化するステップとを含む。 An FGS-based video encoding method for achieving the above object includes a step of obtaining a prediction image for a current frame using a motion vector estimated with a predetermined accuracy, and a residual between the current frame and the prediction image is quantized. And generating a restored image of the current frame by inverse quantization, determining a difference between a reference frame of the FGS layer and a reference frame of the base layer, and using the estimated motion vector Subtracting the difference from motion, subtracting the restored image and the motion compensated result from the current frame, and encoding the subtraction result.

上記目的を達成するためのＦＧＳ基盤のビデオデコーディング方法は、入力されたビットストリームから基礎階層のテクスチャデータと、ＦＧＳ階層のテクスチャデータと、動きベクトルとを抽出するステップと、前記基礎階層のテクスチャデータから基礎階層フレームを復元するステップと、前記動きベクトルを用いてＦＧＳ階層の参照フレームと基礎階層の参照フレームを動き補償するステップと、前記動き補償されたＦＧＳ階層の参照フレームと前記動き補償された基礎階層の参照フレームとの差分を求めるステップと、前記基礎階層フレーム、前記ＦＧＳ階層のテクスチャデータ、前記差分を加算するステップと、を含む。 In order to achieve the above object, an FGS-based video decoding method includes a step of extracting base layer texture data, FGS layer texture data, and a motion vector from an input bitstream, and the base layer texture. Reconstructing a base layer frame from data, using the motion vector to perform motion compensation of a reference frame of an FGS layer and a reference frame of the base layer, and the motion compensated reference frame of the FGS layer and the motion compensated. A step of obtaining a difference from the reference frame of the base layer, and a step of adding the base layer frame, the texture data of the FGS layer, and the difference.

上記目的を達成するためのＦＧＳ基盤のビデオデコーディング方法は、入力されたビットストリームから基礎階層のテクスチャデータと、ＦＧＳ階層のテクスチャデータと、動きベクトルとを抽出するステップと、前記基礎階層のテクスチャデータから基礎階層フレームを復元するステップと、前記動きベクトルを用いてＦＧＳ階層の参照フレーム及び基礎階層の参照フレームを動き補償することによって、ＦＧＳ階層の予測フレーム及び基礎階層の予測フレームを生成するステップと、前記ＦＧＳ階層の予測フレームと前記基礎階層の予測フレームとの差分を求めるステップと、前記テクスチャデータと、前記復元された基礎階層フレーム及び前記差分を加算するステップとを含む。 In order to achieve the above object, an FGS-based video decoding method comprises: extracting base layer texture data, FGS layer texture data, and motion vectors from an input bitstream; and A step of restoring a base layer frame from the data, and a step of generating a prediction frame of the FGS layer and a base layer prediction frame by performing motion compensation on the reference frame of the FGS layer and the reference frame of the base layer using the motion vector And calculating the difference between the predicted frame of the FGS layer and the predicted frame of the base layer, and adding the texture data, the restored base layer frame and the difference.

上記目的を達成するためのＦＧＳ基盤のビデオデコーディング方法は、入力されたビットストリームから基礎階層のテクスチャデータと、ＦＧＳ階層のテクスチャデータと、動きベクトルとを抽出するステップと、前記基礎階層のテクスチャデータから基礎階層フレームを復元するステップと、ＦＧＳ階層の参照フレームと基礎階層の参照フレームとの差分を求めるステップと、前記動きベクトルを用いて前記差分を動き補償するステップと、前記ＦＧＳ階層のテクスチャデータ、前記復元された基礎階層フレーム及び前記動き補償された結果を加算するステップと、を含む。 In order to achieve the above object, an FGS-based video decoding method comprises: extracting base layer texture data, FGS layer texture data, and motion vectors from an input bitstream; and Restoring a base layer frame from data; obtaining a difference between a reference frame of the FGS layer and a reference frame of the base layer; motion compensating the difference using the motion vector; and texture of the FGS layer Adding data, the restored base layer frame and the motion compensated result.

その他、実施形態の具体的な事項は詳細な説明及び図面に含まれている。 In addition, the specific matter of embodiment is contained in detailed description and drawing.

本発明によれば、ＰＦＧＳの実現において演算量を大幅に減少し得、これによってデコーディング過程も変更されるのでＨ．２６４ＳＥ標準化文書にも適用し得る。 According to the present invention, the amount of computation can be greatly reduced in the implementation of PFGS, and the decoding process is changed accordingly. It can also be applied to H.264 SE standardized documents.

本発明の利点及び特徴、そしてそれらを達成する方法は、添付する図面とともに詳述する実施形態を参照すれば明確になる。しかし、本発明は以下に開示する実施形態に限定されず、相異なる多様な形態で実現できる。本実施形態は、本発明の開示を完全なものにし、本発明の属する技術分野における通常の知識を有する者に発明の範疇を知らせるために提供するものであって、本発明は請求項の範疇によってのみ定義される。また、明細書全体において同じ参照符号は同じ構成要素を示す。 Advantages and features of the present invention and methods for achieving them will be apparent with reference to the embodiments described in detail in conjunction with the accompanying drawings. However, the present invention is not limited to the embodiments disclosed below, and can be realized in various different forms. This embodiment is provided to complete the disclosure of the present invention and to inform those having ordinary knowledge in the technical field to which the present invention pertains the scope of the invention. Defined only by. The same reference numerals denote the same components throughout the specification.

［第１実施形態］
図３は、本発明の第１実施形態による高速ＰＦＧＳ方法を説明する図面である。 [First Embodiment]
FIG. 3 illustrates a high-speed PFGS method according to the first embodiment of the present invention.

図２と同様に、ＰＦＧＳ方法よるＦＧＳ階層で量子化される値は△であり、これは下記数式（３）で表される。 As in FIG. 2, the value quantized in the FGS hierarchy by the PFGS method is Δ, which is expressed by the following mathematical formula (3).

△＝Ｒ_Ｆ−Ｒ_Ｂ’ ・・・（３） Δ = R _F −R _B ′ (3)

ここで、Ｒ_Ｆは上記数式（２）であり、Ｒ_Ｂ’は下記数式（４）で表される。 Here, R _F is the above equation (2), and R _B ′ is represented by the following equation (4).

Ｒ_Ｂ’＝Ｏ’−Ｐ_Ｂ＝Ｏ’−（Ｍ_Ｂ’＋Ｎ_Ｂ’）／２・・・（４） _{_{R B '= O'-P B}} = O' - (M B '+ N B') / 2 ··· (4)

ここで、Ｏ’はオリジナルイメージＯを基礎階層の量子化ステップＱＰ_Ｂによって量子化した後、逆量子化して復元されたイメージを意味する。 Here, O ′ means an image restored by performing inverse quantization after quantizing the original image O by the quantization step QP _B of the base layer.

上記数式（２）で表されるＲ_Ｆと上記数式（４）で表されるＲ_Ｂ’を用いて上記数式（３）を整理すれば、△は下記数式（５）で表される。 If the above formula (3) is arranged using R _F represented by the above formula (2) and R _B ′ represented by the above formula (4), Δ is represented by the following formula (5).

△＝Ｏ−（Ｍ_Ｆ’＋Ｎ_Ｆ’）／２−［Ｏ’−（Ｍ_Ｂ’＋Ｎ_Ｂ’）／２］・・・（５） _{△ = O- (M F '+} N F') / 2- [O '- (M B' + N B ') / 2] ··· (5)

一方、図３を参照すれば、階層間参照フレームの差分である△_Ｍ及び△_Ｎは下記数式（６）で表される。 On the other hand, referring to FIG. 3, is the difference in the hierarchy between the reference frames △ _M and △ _N is represented by the following equation (6).

△_Ｍ＝Ｍ_Ｆ’−Ｍ_Ｂ’
△_Ｎ＝Ｎ_Ｆ’−Ｎ_Ｂ’ ・・・（６） _{_{_{△ M = M F '-M B}}} '
_{_{_{△ N = N F '-N B}}} ' ··· (6)

数式（６）を用いて上記数式（５）を整理すれば、△は下記数式（７）で表される。 If the above formula (5) is arranged using the formula (6), Δ is expressed by the following formula (7).

△＝Ｏ−Ｏ’−（△_Ｍ＋△_Ｎ）／２・・・（７） △ = O-O '- ( △ M + △ N) / 2 ··· (7)

数式（７）によれば、エンコーダ段ではオリジナルイメージＯから、ＱＰ_Ｂで量子化された後、逆量子化された基礎階層イメージ、すなわち基礎階層の復元されたイメージＯ’と、階層間参照フレームの差分の平均（（△_Ｍ＋△_Ｎ）／２）とを差し引くことによって、△を求め得ることが分かる。同様に、デコーダ段ではイメージＯは、基礎階層の復元されたイメージＯ’、△及び階層間参照フレームの差分の平均を加算することによって復元できる。 According to Equation (7), the encoder stage is quantized with QP _B from the original image O, and then the inversely quantized base layer image, that is, the restored base layer image O ′ and the inter-layer reference frame. It can be seen that Δ can be obtained by subtracting the average of the differences ((Δ _M + Δ _N ) / 2). Similarly, in the decoder stage, the image O can be restored by adding the restored images O ′ and Δ of the base layer and the average of the differences of the inter-layer reference frames.

既存のＰＦＧＳは、動きベクトル探索によって生成されたピクセルまたはサブピクセル（１／２ピクセル、１／４ピクセルなど）精度を有する動きベクトルによって動き補償を行う。最近では圧縮効率を高めるために、１／２ピクセルまたは１／４ピクセルなどの高い精度によって動きベクトル探索及び動き補償を行うのが一般的である。既存のＰＦＧＳは、例えば１／４ピクセル精度で動き補償して生成した予測イメージを整数ピクセル単位でパッキングした後、オリジナルイメージと予測イメージを差し引き、これを量子化するものである。ここでパッキングとは、１／４ピクセル単位に動きベクトル探索をするとき、４倍に補間された参照イメージを元の大きさに戻す過程で、例えば４つのピクセルごとに１つのピクセルを選択する方式で行われる。 In the existing PFGS, motion compensation is performed using a motion vector having a pixel or sub-pixel accuracy (1/2 pixel, 1/4 pixel, etc.) generated by motion vector search. Recently, in order to increase the compression efficiency, it is common to perform motion vector search and motion compensation with high accuracy such as 1/2 pixel or 1/4 pixel. In the existing PFGS, for example, after a predicted image generated by motion compensation with 1/4 pixel accuracy is packed in units of integer pixels, the original image and the predicted image are subtracted and quantized. Packing is a method of selecting one pixel for every four pixels, for example, in the process of returning the reference image interpolated four times to the original size when performing a motion vector search in units of 1/4 pixel. Done in

ところが、本発明による高速ＰＦＧＳで量子化するＦＧＳ階層のデータ△は、上記数式（７）のように表されるため、高い精度で動きベクトル探索を行わなくても圧縮性能にそれほど影響を与えない。上記数式（７）の右辺の１番目項Ｏと２番目項Ｏ’は、動きベクトル探索及び動き補償が適用されない部分であるので問題ない。ただし、３番目項（（△_Ｍ＋△_Ｍ）／２）にだけ動きベクトル探索及び動き補償が適用されるが、この項は階層間の差分で表されているため、高い精度で動きベクトル探索及び動き補償を行うことはそれほど効果はない。それは、基礎階層にて所定ピクセル精度で動き補償したイメージと向上階層にて前記ピクセル精度で動き補償したイメージとを差し引くため、前記差し引き結果、イメージは相対的にピクセル精度に鈍感になるからである。したがって、既存のＰＦＧＳに比べて低いピクセル精度で動きベクトル探索及び動き補償を行うことができる。 However, since the data Δ in the FGS layer quantized by the high-speed PFGS according to the present invention is expressed as the above equation (7), the compression performance is not greatly affected even if the motion vector search is not performed with high accuracy. . The first term O and the second term O ′ on the right side of the equation (7) are portions to which motion vector search and motion compensation are not applied, so that there is no problem. However, although motion vector search and motion compensation are applied only to the third term ((Δ _M + Δ _M ) / 2), since this term is represented by a difference between layers, motion vector search is performed with high accuracy. And motion compensation is not very effective. This is because the image is relatively insensitive to pixel accuracy because the image compensated for motion with a predetermined pixel accuracy in the basic layer is subtracted from the image compensated for motion with the pixel accuracy in the enhancement layer. . Therefore, motion vector search and motion compensation can be performed with lower pixel accuracy than existing PFGS.

［第２実施形態］
第１実施形態での上記数式（５）は下記数式（８）のように、予測信号間の差分で説明することもできる。ここで、Ｐ_Ｆは（Ｍ_Ｆ’＋Ｎ_Ｆ’）／２であり、Ｐ_Ｂは（Ｍ_Ｂ’＋Ｎ_Ｂ’）／２である。 [Second Embodiment]
The above formula (5) in the first embodiment can also be described by the difference between the prediction signals as in the following formula (8). Here, _{P F} is the _{_{(M F '+ N F'}} ) / 2, P B is the _{_{(M B '+ N B'}} ) / 2.

△＝Ｏ−Ｏ’−（Ｐ_Ｆ−Ｐ_Ｂ）・・・（８） Δ = O−O ′ − (P _F −P _B ) (8)

第１実施形態と第２実施形態には、以下のような違いがある。第１実施形態では参照イメージの階層間の差分△_Ｍ，△_Ｎを先に計算した後、これを２で割る。第２実施形態では各階層での予測イメージＰ_Ｆ−Ｐ_Ｂを先に計算した後、予測イメージ間の差分を求める。ただし、これはアルゴリズム実現上の差であり、両者の計算結果△は同一である。 There are the following differences between the first embodiment and the second embodiment. After calculating the difference △ _M between layers of the reference _image, the △ _N previously in the first embodiment, dividing it in two. After calculating the predicted image P _F -P _B in each layer above the second embodiment obtains a difference between the prediction image. However, this is a difference in the realization of the algorithm, and the calculation results Δ are the same.

［第３実施形態］
前記第１実施形態と第２実施形態では動き補償を先に行った後、イメージ間の差分を求めた。しかし、この順序を変えて参照イメージの階層間の差分を先に計算した後、動き補償を行うことも可能である。このように第３実施形態によれば、差分信号に対する動き補償を行うため、境界パディングの影響が微小である。したがって、境界パディング過程を省略することができる。境界パディングとは、動きベクトル探索時にフレームの境界部分でブロックマッチングが制限されるのを考慮して、境界部分のピクセルを境界周辺にコピーすることを意味する。 [Third Embodiment]
In the first embodiment and the second embodiment, after performing motion compensation first, a difference between images is obtained. However, it is also possible to perform motion compensation after changing the order and calculating the difference between the layers of the reference image first. Thus, according to the third embodiment, since motion compensation is performed on the differential signal, the influence of boundary padding is very small. Therefore, the boundary padding process can be omitted. Boundary padding means that pixels in the boundary portion are copied around the boundary in consideration of block matching being limited at the boundary portion of the frame during motion vector search.

第３実施形態による差分△は下記数式（９）で表される。ここで、ｍｃ（．）は動き補償を行う関数である。 The difference Δ according to the third embodiment is expressed by the following mathematical formula (9). Here, mc (.) Is a function for performing motion compensation.

△＝Ｏ−Ｏ’−［（ｍｃ（Ｍ_Ｆ’−Ｍ_Ｂ’）＋ｍｃ（Ｎ_Ｆ’−Ｎ_Ｂ’）］／２
・・・（９） △ = O-O '- [ (mc (M F' -M B ') + mc (N F' -N B ')] / 2
... (9)

既存のＰＦＧＳでは上記数式（３）のＲ_ＦまたはＲ_Ｂを求めるとき、直接予測（動きベクトル探索及び動き補償）を行うのに比べ、上記３つの実施形態では予測結果を差し引いたり、差し引き結果を予測したりするため、動きベクトルの精度を高めるための補間によってもその性能がそれほど大きく変わらない、すなわち補間に鈍感な特徴を有する。 In the existing PFGS, when obtaining R _F or R _B in the above formula (3), the prediction results are subtracted or subtracted in the above three embodiments compared to direct prediction (motion vector search and motion compensation). In order to predict, the performance does not change so much even by the interpolation for improving the accuracy of the motion vector, that is, it has a feature insensitive to the interpolation.

したがって、１／４ピクセル補間あるいは１／２ピクセル補間を省略することもできる。また、高い演算量を要求するＨ．２６４の１／２ピクセル補間フィルタの代わりに、相対的に演算量が少ないバイリニアフィルタを使用することもできる。例えば、上記数式（７）、（８）、及び（９）の３番目項にバイリニアフィルタを適用する。その結果、既存のＰＦＧＳのようにＲ_Ｆ及びＲ_Ｂを求めるための予測信号に直接バイリニアフィルタを適用する場合に比べて性能低下が減少する。 Therefore, 1/4 pixel interpolation or 1/2 pixel interpolation can be omitted. In addition, H.M. In place of the H.264 1/2 pixel interpolation filter, a bilinear filter having a relatively small amount of calculation can be used. For example, a bilinear filter is applied to the third term of the mathematical formulas (7), (8), and (9). As a result, the performance degradation is reduced as compared with the case where the bilinear filter is directly applied to the prediction signal for obtaining R _F and R _B as in the existing PFGS.

［第４実施形態］
前記第１実施形態ないし第３実施形態は上記数式（３）に基づいている。言い換えれば、コーディングされるべき値がＦＧＳ階層から得た差分Ｒ_Ｆと基礎階層から得た差分Ｒ_Ｂとをさらに差し引いた値であることを基本仮定にしている。しかし、ＦＧＳ階層から得た差分が非常に少ない場合、すなわち時間的連関性が非常に大きい場合は、このような接近方法が却ってコーディング性能を低下する場合がある。この場合には、却ってＦＧＳ階層から得た差分だけをコーディングするのがより良いコーディング性能を示す。すなわち、上記数式（３）でＲ_Ｆだけをコーディングするのである。 [Fourth Embodiment]
The first to third embodiments are based on the mathematical formula (3). In other words, is the basic assumption that the values to be coded is further subtracted value and the difference R _B obtained from the difference R _F and base layer obtained from FGS layer. However, when the difference obtained from the FGS layer is very small, that is, when the temporal relevance is very large, such an approach may decrease coding performance. In this case, it is better to code only the difference obtained from the FGS layer. That is, only R _F is coded in the above equation (3).

この場合、上記数式（７）ないし（９）は下記数式（１０）ないし（１２）のように変更できる。 In this case, the above formulas (7) to (9) can be changed to the following formulas (10) to (12).

△＝Ｏ−Ｐ_Ｂ−（△_Ｍ＋△_Ｎ）／２・・・（１０） _{△ = O-P B - (} △ M + △ N) / 2 ··· (10)

△＝Ｏ−Ｐ_Ｂ−（Ｐ_Ｆ−Ｐ_Ｂ）・・・（１１） Δ = O−P _B − (P _F −P _B ) (11)

△＝Ｏ−Ｐ_Ｂ−［（ｍｃ（Ｍ_Ｆ’−Ｍ_Ｂ’）＋ｍｃ（Ｎ_Ｆ’−Ｎ_Ｂ’）］／２
・・・（１２） _{△ = O-P B - [} (mc (M F '-M B') + mc (N F '-N B')] / 2
(12)

結局、上記数式（１０）から（１２）は、上記数式７から９で復元された基礎階層イメージＯ’が前記基礎階層イメージに対する予測イメージＰ_Ｂに代えられることが分かる。もちろん、上記数式（１０）から（１２）の３番目項にも補間自体を省略したり、演算量が相対的に少ないバイリニアフィルタによる補間を適用したりすることができる。 Eventually, Equations (10) to (12) show that the base layer image O ′ restored by Equations 7 to 9 is replaced with the predicted image P _B for the base layer image. Of course, the interpolation itself can be omitted for the third term of the above formulas (10) to (12), or the interpolation by the bilinear filter having a relatively small amount of calculation can be applied.

上記数式（１１）にはＰ_Ｂが２つあるが、本発明によれば、その２つは完全に同じ値ではない。１番目Ｐ_Ｂを生成するための動き補償過程では推定された動きベクトルをそのまま使用するが、２番目Ｐ_Ｂ及びＰ_Ｆを生成するための動き補償過程では前記推定された動きベクトルよりも低い精度の動きベクトルを使用することができる。または、演算量が少なくかかるフィルタ（例えば、バイリニアフィルタ）を適用することができる。 Although there are two P _B in the above formula (11), according to the present invention, the two are not exactly the same value. Although it accepts the motion vector estimated in the motion compensation process to generate a first P _B, 2 th P _B and lower accuracy than the motion vectors in the motion compensation process is the estimation for generating a P _F Motion vectors can be used. Alternatively, a filter that requires a small amount of calculation (for example, a bilinear filter) can be applied.

［第５実施形態］
ＰＦＧＳでは両方の復元された参照フレームを使用して現在フレームを復元するため、両参照フレームの画質の低下が現在フレームに累積反映されるドリフト現象が発生する。これを減少させるために使われるのがｌｅａｋｙ予測であるが、これは両参照フレームから得た予測イメージと基礎階層から得た予測イメージ間の加重合で生成された予測イメージを用いる方法である。 [Fifth Embodiment]
In PFGS, both restored reference frames are used to restore the current frame, so that a drift phenomenon occurs in which the degradation of the image quality of both reference frames is cumulatively reflected in the current frame. Leaky prediction is used to reduce this, and this is a method using a prediction image generated by polymerization between prediction images obtained from both reference frames and prediction images obtained from the base layer.

既存のＰＦＧＳで使用するｌｅａｋｙ予測によれば、ＦＧＳ階層でコーディングされる値は下記数式（１３）で表される。 According to leaky prediction used in the existing PFGS, a value coded in the FGS layer is expressed by the following equation (13).

△＝Ｏ−［αＰ_Ｆ＋（１−α）Ｐ_Ｂ］・・・（１３） Δ = O− [αP _F + (1−α) P _B ] (13)

この式を第５実施形態によって整理すれば、下記数式（１４）で表される。 If this formula is arranged by the fifth embodiment, it is expressed by the following formula (14).

△＝Ｏ−Ｐ_Ｂ−α（Ｐ_Ｆ−Ｐ_Ｂ）・・・（１４） _{_{△ = O-P B -α (}} P F -P B) ··· (14)

数式（１４）によれば、上記数式（１１）で単に予測間の差分に加重因子（α）を適用するだけで良いことが分かる。したがって、本発明はｌｅａｋｙ予測にも適用し得る。すなわち、（Ｐ_Ｆ−Ｐ_Ｂ）に補間自体を省略したり、バイリニアフィルタによる補間を適用したりしてその結果にαを掛ければ良い。 According to the equation (14), it can be understood from the equation (11) that the weighting factor (α) may be simply applied to the difference between predictions. Therefore, the present invention can also be applied to leaky prediction. That is, the interpolation itself may be omitted from (P _F −P _B ) or the bilinear filter may be applied, and the result may be multiplied by α.

図４は、本発明の第１実施形態によるビデオエンコーダ１００の構成を示すブロック図である。図１ないし図３の説明では動きベクトル探索の単位であるブロックを基準にしたが、以下では前記ブロックが含まれるフレーム単位で説明する。表現の統一のために、前記ブロックの識別子はフレームを表す「Ｆ」文字を添字で示した。例えば、Ｒ_Ｂというブロックを含むフレームはＦ_ＲＢで表される。もちろん、以下でもプライム（’）表示は量子化／逆量子化を経て復元されたデータであることを示す。 FIG. 4 is a block diagram showing a configuration of the video encoder 100 according to the first embodiment of the present invention. The description of FIGS. 1 to 3 is based on a block which is a unit of motion vector search. However, the following description will be made on a frame basis including the block. In order to unify the expression, the identifier of the block is indicated by a subscript “F” character representing a frame. For example, a frame including a block of _{R B} is represented by _{F RB.} Of course, in the following, the prime (') display indicates data restored through quantization / inverse quantization.

入力される現在フレームＦ_Ｏは、動きベクトル探索部１０５、差分器１１５、及び差分計算部１７０で入力される。 The input current frame _FO is input by the motion vector search unit 105, the difference unit 115, and the difference calculation unit 170.

動きベクトル探索部１０５は、周辺フレームを参照して現在フレームに対する動きベクトル探索を行うことによって動きベクトルＭＶを求める。このように参照される周辺フレームを「参照フレーム」という。一般に、このような動きベクトル探索のためにブロックマッチングアルゴリズムが広く使われている。すなわち、与えられたブロックを参照フレームの特定探索領域内でピクセルまたはサブピクセル（１／２ピクセル、１／４ピクセルなど）単位に動かしながら、そのエラーが最低になる場合の変位を動きベクトルとして推定するものである。動きベクトル探索のために固定されたブロックを用いることもできるが、階層的可変サイズブロックマッチング法（ＨＶＳＢＭ）による階層的な方法を用いることもできる。 The motion vector search unit 105 obtains a motion vector MV by performing a motion vector search for the current frame with reference to surrounding frames. The peripheral frame referred to in this way is referred to as a “reference frame”. In general, a block matching algorithm is widely used for such motion vector search. That is, while moving a given block in units of pixels or sub-pixels (1/2 pixel, 1/4 pixel, etc.) within a specific search region of the reference frame, the displacement when the error is minimized is estimated as a motion vector. To do. Although a fixed block can be used for motion vector search, a hierarchical method based on a hierarchical variable size block matching method (HVSBM) can also be used.

仮に前記動きベクトル探索過程がサブピクセル単位で行われれば、参照フレームはアップサンプリングないし補間されなければならない。１／２ピクセル単位で行われる場合は２倍のアップサンプリングないし補間が必要であり、１／４ピクセル単位で行われる場合は４倍のアップサンプリングないし補間が必要である。 If the motion vector search process is performed in units of subpixels, the reference frame must be upsampled or interpolated. When it is performed in units of 1/2 pixel, double upsampling or interpolation is required, and when it is performed in units of 1/4 pixel, upsampling or interpolation is required 4 times.

ところが、エンコーダ１００が開ループコーデック状になっていれば、前記参照フレームではオリジナル周辺フレームＦ_Ｍ，Ｆ_Ｎをそのまま用いるが、閉ループコーデック状になっていれば、前記参照フレームでは復元された基礎階層の周辺フレームＦ_ＭＢ’，Ｆ_ＮＢ’を用いるようになる。以下では閉ループコーデックを中心に説明するが、本発明はこれに限定されない。 However, if the encoder 100 is in the open-loop codec like, the reference original peripheral frame F _M in the _frame, the F _N is used as it is, if turned closed loop codec shape, reconstructed base layer in the reference frame Peripheral frames F _MB ′ and F _NB ′ are used. Although the following description will focus on a closed loop codec, the present invention is not limited to this.

動きベクトル探索部１０５で求めた動きベクトルＭＶは、動き補償部１１０に提供される。動き補償部１１０は前記動きベクトルＭＶを用いて前記参照フレームＦ_ＭＢ’，Ｆ_ＮＢ’を動き補償し、前記現在フレームに対する予測イメージＦ_ＰＢを生成する。両方向参照が使われる場合、前記予測イメージは動き補償された参照フレームの平均で計算できる。そして、単方向参照が使われる場合、前記予測イメージは動き補償された参照フレームと同じものであり得る。以下では動きベクトル探索及び動き補償において両方向参照を使う場合を説明するが、単方向参照についても本発明が適用できるのは当業者には自明である。 The motion vector MV obtained by the motion vector search unit 105 is provided to the motion compensation unit 110. The motion compensation unit 110 performs motion compensation on the reference frames F _MB ′ and F _NB ′ using the motion vector MV, and generates a predicted image _FPB for the current frame. If bi-directional reference is used, the predicted image can be calculated by averaging motion compensated reference frames. And if unidirectional reference is used, the predicted image may be the same as the motion compensated reference frame. In the following, the case of using a bi-directional reference in motion vector search and motion compensation will be described.

そして、差分器１１５は、前記現在フレームから前記予測イメージを差し引いて計算される残差信号Ｆ_ＲＢを変換部１２０に提供する。 The differentiator 115 provides a residual signal F _RB which is calculated by subtracting the predicted image from the current frame to the converter 120.

変換部１２０は前記残差信号Ｆ_ＲＢに対して空間的変換を行い、変換係数Ｆ_ＲＢ ^Ｔを生成する。このような空間的変換方法としては、ＤＣＴ、ウェーブレット変換などが使われる。ＤＣＴを使用する場合、前記変換係数はＤＣＴ係数になり、ウェーブレット変換を使用する場合、前記変換係数はウェーブレット係数になる。 The transform unit 120 performs a spatial transform on the residual signal F _RB to generate a transform coefficient F _RB ^T. As such a spatial transformation method, DCT, wavelet transformation or the like is used. When using DCT, the transform coefficient is a DCT coefficient, and when using wavelet transform, the transform coefficient is a wavelet coefficient.

量子化部１２５は、前記変換係数を量子化する。前記量子化は、任意の実数値で表される前記変換係数を不連続的な値で表す過程を意味する。例えば、量子化部１２５は、任意の実数値で表される前記変換係数を所定の量子化ステップで割って、その結果を整数値に四捨五入する方法で量子化を行うことができる。前記量子化ステップは、基礎階層に適用されるもので、一般にＦＧＳ階層に比べてその値が大きい。 The quantization unit 125 quantizes the transform coefficient. The quantization means a process of expressing the transform coefficient represented by an arbitrary real value as a discontinuous value. For example, the quantization unit 125 can perform quantization by dividing the transform coefficient represented by an arbitrary real value by a predetermined quantization step and rounding the result to an integer value. The quantization step is applied to the base layer and generally has a larger value than the FGS layer.

量子化部１２５によって量子化された結果、すなわち量子化係数Ｆ_ＲＢ ^Ｑは、エントロピー符号化部１４０及び逆量子化部１３０に提供される。 The result of quantization by the quantization unit 125, that is, the quantization coefficient F _RB ^Q is provided to the entropy encoding unit 140 and the inverse quantization unit 130.

逆量子化部１３０は、前記量子化係数を逆量子化する。このような逆量子化過程は、量子化過程で使われたものと同じ量子化ステップを用いて、量子化過程で生成されたインデックスからそれにマッチングされる値を復元する過程である。 The inverse quantization unit 130 inversely quantizes the quantization coefficient. Such an inverse quantization process is a process of restoring a value matched with an index generated in the quantization process using the same quantization step as that used in the quantization process.

逆変換部１３５は、前記逆量子化された結果を受信して逆変換を行う。このような逆変換は、変換部１２０の変換過程の逆過程で行われ、具体的には逆ＤＣＴ変換、逆ウェーブレット変換などが使われる。加算器１４０は、前記逆変換された結果と前記動き補償部１１０の動き補償過程で使われた予測イメージＦ_ＰＢを加算することによって、現在フレームの復元イメージＦ_Ｏ’を生成する。 The inverse transform unit 135 receives the inversely quantized result and performs inverse transform. Such an inverse transform is performed in the inverse process of the transform process of the transform unit 120, and specifically, an inverse DCT transform, an inverse wavelet transform, or the like is used. The adder 140 adds the result of the inverse transformation and the predicted image _FPB used in the motion compensation process of the motion compensation unit 110 to generate a restored image F _O ′ of the current frame.

バッファ１４５は、加算器１４０から提供される結果を格納する。したがって、バッファ１４５には現在フレームの復元イメージＦ_Ｏ’だけでなく、予め復元された基礎階層の参照フレームＦ_ＭＢ’，Ｆ_ＮＢ’も格納することができる。 The buffer 145 stores the result provided from the adder 140. Therefore, the buffer 145 can store not only the restored image F _O ′ of the current frame but also the reference frames F _MB ′ and F _NB ′ of the base layer restored in advance.

動きベクトル変更部１５５は、前記動きベクトルＭＶを受信して動きベクトルの精度を変更する。例えば、前記動きベクトルＭＶの精度が１／４ピクセル単位であれば、前記動きベクトルＭＶは小数位の値として、０、０．２５、０．５、及び０．７５のうち１つを有することができる。本発明の実施形態によれば、ＦＧＳ階層での動き補償時には基礎階層で求めた高い精度の動きベクトルの高い精度をそのまま維持しなくても性能には大きい差がないということは上述した通りである。したがって、動きベクトル変更部１５５は、前記１／４ピクセル単位の動きベクトルを１／２ピクセル単位、ピクセル単位などより低い精度の動きベクトルＭＶ_１に変更する。このような変更過程は、元の動きベクトルで変更される精度単位を超える部分を切り捨てたり、四捨五入したりする簡単な方法で行われる。 The motion vector changing unit 155 receives the motion vector MV and changes the accuracy of the motion vector. For example, if the accuracy of the motion vector MV is 1/4 pixel unit, the motion vector MV has one of 0, 0.25, 0.5, and 0.75 as a decimal value. Can do. As described above, according to the embodiment of the present invention, there is no significant difference in performance even if the high accuracy of the high-precision motion vector obtained in the base layer is not maintained at the time of motion compensation in the FGS layer. is there. Accordingly, the motion vector changing unit 155 changes the 1/4 pixel unit motion vector to a motion vector MV ₁ with a lower accuracy such as 1/2 pixel unit or pixel unit. Such a change process is performed by a simple method of truncating or rounding a portion exceeding the accuracy unit changed by the original motion vector.

バッファ１６５は、ＦＧＳ階層の参照フレームを一時格納する。詳細に示していないが、ＦＧＳ階層の参照フレームとしてはＦＧＳ階層の復元されたフレームＦ_ＭＦ’，Ｆ_ＮＦ’が用いられたり、現在フレーム周辺のオリジナルフレームが用いられたりする。 The buffer 165 temporarily stores the reference frame of the FGS layer. Although not shown in detail, as the reference frame of the FGS layer, the restored frames F _MF ′ and F _NF ′ of the FGS layer are used, or original frames around the current frame are used.

動き補償部１６０は、前記変更された動きベクトルＭＶ_１を用いて、バッファ１４５から提供される基礎階層の復元された参照フレームＦ_ＭＢ’，Ｆ_ＮＢ’及びバッファ１６５から提供されるＦＧＳ階層の参照フレームＦ_ＭＦ’，Ｆ_ＮＦ’を動き補償し、その結果ｍｃ（Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＢ’），ｍｃ（Ｆ_ＭＦ’），ｍｃ（Ｆ_ＮＦ’）を差分計算部１７０に提供する。ここで、Ｆ_ＭＦ’はＦＧＳ階層の順方向参照フレーム、Ｆ_ＮＦ’はＦＧＳ階層の逆方向参照フレーム、Ｆ_ＭＢ’は基礎階層の順方向参照フレーム、Ｆ_ＮＢ’は基礎階層の逆方向参照フレームをそれぞれ表す。 The motion compensation unit 160 uses the changed motion vector MV ₁ to restore the reference frames F _MB ′ and F _NB ′ of the base layer provided from the buffer 145 and the reference of the FGS layer provided from the buffer 165. The frames F _MF ′ and F _NF ′ are motion-compensated, and as a result, the mc (F _MB ′), mc (F _NB ′), mc (F _MF ′), and mc (F _NF ′) are provided to the difference calculation unit 170. . Here, F _MF ′ is a forward reference frame in the FGS layer, F _NF ′ is a backward reference frame in the FGS layer, F _MB ′ is a forward reference frame in the base layer, and F _NB ′ is a backward reference frame in the base layer. Respectively.

動き補償部１６０の動き補償のために補間が必要な場合に、動きベクトル探索部１０５や動き補償部１１０で使われた補間フィルタと異なる形態の補間フィルタを使用することができる。例えば、前記動き補償時に１／２ピクセル単位の動きベクトルＭＶ_１が使われる場合に、前記補間のためにＨ．２６４の６タップフィルタの代わりに、演算量が少ないバイリニアフィルタを使用することもでき、それでもその後動き補償されたフレーム間の階層間の差分が求められるため圧縮効率にはそれほど大きい影響を与えない。 When interpolation is necessary for motion compensation of the motion compensation unit 160, an interpolation filter of a form different from the interpolation filter used in the motion vector search unit 105 or the motion compensation unit 110 can be used. For example, when a motion vector MV _{1 in} units of 1/2 pixel is used during the motion compensation, H.264 is used for the interpolation. Instead of the H.264 6-tap filter, a bilinear filter with a small amount of calculation can be used. However, since the difference between the layers between the frames after motion compensation is obtained, the compression efficiency is not greatly affected.

差分計算部１７０は、前記動き補償されたＦＧＳ階層の参照フレームｍｃ（Ｆ_ＭＦ’），ｍｃ（Ｆ_ＮＦ’）と前記動き補償された基礎階層の参照フレームｍｃ（Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＢ’）との差分を求める。すなわち、△_Ｍ（＝ｍｃ（Ｆ_ＭＦ’）−ｍｃ（Ｆ_ＭＢ’））、及び△_Ｎ（＝ｍｃ（Ｆ_ＮＦ’）−ｍｃ（Ｆ_ＮＢ’））を求める。もちろん、単方向参照である場合には１つの差分だけ求めることができる。 The difference calculation unit 170 performs the motion-compensated FGS layer reference frames mc (F _MF ′) and mc (F _NF ′) and the motion-compensated base layer reference frames mc (F _MB ′) and mc (F _Find the difference from _NB '). _{_{That, △ M (= mc (F}} MF ') -mc (F MB')), and _{_{△ N (= mc (F NF}} ') -mc (F NB')) determined. Of course, in the case of unidirectional reference, only one difference can be obtained.

そして、差分計算部１７０は、前記差分△_Ｍ，△_Ｎの平均を求め、前記現在フレームＦ_Ｏから前記復元イメージＦ_Ｏ’及び前記差分の平均を差し引く。もちろん、単方向参照である場合には前記平均を求める過程を必要としない。 Then, the difference calculation unit 170, the difference △ _M, △ Average look of _N, the current from the frame F _O subtracting the mean of the restored image F _{O 'and} the differential. Of course, in the case of unidirectional reference, the process of obtaining the average is not required.

差分計算部１７０から差し引かれた結果Ｆ_△は変換部１７５によって空間的変換Ｆ_△ ^Ｔされ、量子化部１８０を経て量子化され、量子化された結果Ｆ_△ ^Ｑはエントロピー符号化部１５０に伝達される。量子化部１８０で使われる量子化ステップは、一般に量子化部１２５で使われる量子化ステップに比べて小さい値が使われる。 The result F _Δ subtracted from the difference calculation unit 170 is spatially transformed F _Δ ^T by the transformation unit 175, quantized through the quantization unit 180, and the quantized result F _Δ ^Q is transmitted to the entropy coding unit 150. Is done. The quantization step used in the quantization unit 180 generally has a smaller value than the quantization step used in the quantization unit 125.

エントロピー符号化部１５０は、動きベクトル探索部１０５で推定された動きベクトルＭＶと、量子化部１２５から提供されるＦ_ＲＢ ^Ｑと、量子化部１８０から提供されるＦ_△ ^Ｑを無損失符号化してビットストリームを生成する。このような無損失符号化方法としては、ハフマン符号化、算術符号化、可変長符号化、その他多様な方法が用いられる。 The entropy encoding unit 150 losslessly encodes the motion vector MV estimated by the motion vector search unit 105, the F _RB ^Q provided from the quantization unit 125, and the F _Δ ^Q provided from the quantization unit 180. To generate a bitstream. As such a lossless coding method, Huffman coding, arithmetic coding, variable length coding, and other various methods are used.

一方、本発明の第２実施形態によるビデオエンコーダの構成も図４と同様に示すことができる。ただし、第２実施形態では階層間の差分を求める前に各階層別に予測フレームを先に計算するという点だけ差がある。すなわち、差分計算部１７０の動作だけ差がある。 On the other hand, the configuration of the video encoder according to the second embodiment of the present invention can also be shown as in FIG. However, in the second embodiment, there is a difference in that a prediction frame is calculated first for each layer before obtaining a difference between layers. That is, there is a difference only in the operation of the difference calculation unit 170.

第２実施形態による場合、差分計算部１７０は、前記動き補償されたＦＧＳ階層の参照フレームｍｃ（Ｆ_ＭＦ’），ｍｃ（Ｆ_ＮＦ’）からＦＧＳ階層の予測フレームＦ_ＰＦを生成し、前記動き補償された基礎階層の参照フレームｍｃ（Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＢ’）から基礎階層の予測フレームＦ_ＢＦを生成する。予測フレームを生成する過程は、２つの動き補償された参照フレームを平均することによって簡単に求めることができる。もちろん、単方向参照である場合には動き補償されたフレームがそのまま予測フレームになる。 If according to the second embodiment, the difference calculation unit 170, the motion compensated FGS layer reference frames mc _{(F MF} ') to produce a predicted frame _{F PF} of FGS layer from, mc _{(F NF'),} the motion A base layer prediction frame F _BF is generated from the compensated base layer reference frames mc (F _MB ′) and mc (F _NB ′). The process of generating a prediction frame can be easily determined by averaging two motion compensated reference frames. Of course, in the case of unidirectional reference, the motion-compensated frame becomes the prediction frame as it is.

そして、差分計算部１７０は、予測フレームＦ_ＰＦ，Ｆ_ＰＢから階層間の差分Ｆ_ＰＦ−Ｆ_ＰＢを求め、前記現在フレームＦ_Ｏから前記復元イメージＦ_Ｏ’及び前記差分Ｆ_ＰＦ−Ｆ_ＰＢを差し引く。 Then, the difference calculation unit 170, predicted frame _F _PF, obtains the difference _F PF _{-F PB} between hierarchy from _{F PB,} subtracting the restored image _{F O} 'and the difference _F PF _{-F PB} the current from the frame _{F O} .

図５は、本発明の第３実施形態によるビデオエンコーダ３００の構成を示すブロック図である。前記第１実施形態と第２実施形態では動き補償を先に行った後、イメージ間の差分を求めたが、第３実施形態ではこの順序を変えて、参照イメージの階層間の差分を先に計算した後、動き補償を行う。図４と重複する説明を避けるために相違する部分を中心に説明する。 FIG. 5 is a block diagram showing a configuration of a video encoder 300 according to the third embodiment of the present invention. In the first embodiment and the second embodiment, after performing motion compensation first, the difference between images is obtained. However, in the third embodiment, this order is changed, and the difference between the layers of the reference image is determined first. After the calculation, motion compensation is performed. In order to avoid an overlapping description with FIG.

差分器３９０は、バッファ３６５から提供されるＦＧＳ階層の参照フレームＦ_ＭＦ’，Ｆ_ＮＦ’から、バッファ３４５から提供される基礎階層の復元された参照フレームＦ_ＭＢ’，Ｆ_ＮＢ’を差し引き、その結果Ｆ_ＭＦ’−Ｆ_ＭＢ’，Ｆ_ＮＦ’−Ｆ_ＮＢ’を動き補償部３６０に提供する。もちろん、単方向参照の場合には１つの差分だけ存在する。 The subtractor 390 subtracts the base layer restored reference frames F _MB ′ and F _NB ′ provided from the buffer 345 from the reference frames F _MF ′ and F _NF ′ of the FGS layer provided from the buffer 365, and The results F _MF ′ −F _MB ′ and F _NF ′ −F _NB ′ are provided to the motion compensation unit 360. Of course, there is only one difference in the case of unidirectional reference.

動き補償部３６０は、動きベクトル変更部３５５から提供される変更された動きベクトルＭＶ_１を用いて、差分器３９０から提供される階層間参照フレームの差分Ｆ_ＭＦ’−Ｆ_ＭＢ’，Ｆ_ＮＦ’−Ｆ_ＮＢ’を動き補償する。前記動き補償時に１／２ピクセル単位の動きベクトルＭＶ_１が使われる場合、前記補間のためにＨ．２６４の６タップフィルタの代わりに、演算量が少ないバイリニアフィルタが使われ、それでも圧縮効率にはそれほど大きい影響は与えない。 The motion compensation unit 360 uses the changed motion vector MV ₁ provided from the motion vector change unit 355 and uses the difference F _MF ′ −F _MB ′, F _NF ′ of the inter-layer reference frame provided from the differentiator 390. -F _NB 'is motion compensated. When a motion vector MV _{1 in} units of 1/2 pixel is used during the motion compensation, H.264 is used for the interpolation. Instead of the H.264 6-tap filter, a bilinear filter with a small amount of calculation is used, and the compression efficiency is not so much affected.

差分計算部３７０は、動き補償された差分ｍｃ（Ｆ_ＭＦ’−Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＦ’−Ｆ_ＮＢ’）の平均を求め、前記現在フレームＦ_Ｏから前記復元イメージＦ_Ｏ’及び前記差分の平均を差し引く。もちろん、単方向参照である場合には前記平均を求める過程を必要としない。 The difference calculation unit 370 calculates an average of the motion-compensated differences mc (F _MF ′ −F _MB ′) and mc (F _NF ′ −F _NB ′), and calculates the restored image F _O ′ from the current frame F _O and Subtract the average of the differences. Of course, in the case of unidirectional reference, the process of obtaining the average is not required.

図６及び図７は、本発明の第４実施形態によるビデオエンコーダ４００，６００の構成を示すブロック図である。第４実施形態と前記第１ないし第３実施形態との差は、単に差分計算部で現在フレームＦ_Ｏから基礎階層の復元されたフレームＦ_Ｏ’ではなく、基礎階層の予測フレームＦ_ＰＢが差し引かれるという点である。 6 and 7 are block diagrams showing the configuration of the video encoders 400 and 600 according to the fourth embodiment of the present invention. The difference between the fourth embodiment and the first to third embodiments are simply the current frame F _O from reconstructed frame F _{O 'rather} than the base layer by the difference calculation section, predicted frame F _PB of the base layer is subtracted It is a point.

図６は図４（第１実施形態）に対応し、図７は図５（第３実施形態）に対応する。図６を参照すれば、差分計算部４７０には図４の現在フレームＦ_Ｏで基礎階層の復元されたイメージＦ_Ｏ’の代わりに、基礎階層の参照イメージＦ_ＰＢが動き補償部４１０から提供されるものと示されている。したがって、差分計算部４７０は、現在フレームＦ_Ｏから前記予測イメージＦ_ＰＢ及び階層間の差分△_Ｍ，△_Ｎの平均を差し引くことによってＦ_△を求める。 6 corresponds to FIG. 4 (first embodiment), and FIG. 7 corresponds to FIG. 5 (third embodiment). Referring to FIG. 6, the difference calculation unit 470 is provided with a reference image _FPB of the base layer from the motion compensation unit 410 instead of the image F _O ′ of the base layer restored in the current frame F _O of FIG. Is shown. Therefore, the difference calculation unit 470 calculates the F _△ by subtracting the average of the difference △ _M, △ _N between the predicted image _{F PB} and hierarchy currently from the frame _{F O.}

同様に、図７において差分計算部６７０は、現在フレームＦ_Ｏから前記予測イメージＦ_ＰＢ及び動き補償された差分ｍｃ（Ｆ_ＭＦ’−Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＦ’−Ｆ_ＮＢ’）の平均を差し引くことによってＦ_△を求める。 Similarly, the difference calculation unit 670 in FIG. 7, the predicted current from the frame _{F O} image _{F PB} and motion compensated differential _{_{mc (F MF '-F MB'}} ), mc of _{(F NF} _{'-F NB')} Find F _Δ by subtracting the average.

一方、第２実施形態に対応する第４実施形態は図示していないが、図６の構成図と同じ構成を有する。ただし、差分計算部４７０の動作で多少の差がある。第２実施形態に対応する第４実施形態において差分計算部４７０は、前記動き補償されたＦＧＳ階層の参照フレームｍｃ（Ｆ_ＭＦ’），ｍｃ（Ｆ_ＮＦ’）からＦＧＳ階層の予測フレームＦ_ＰＦを生成し、前記動き補償された基礎階層の参照フレームｍｃ（Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＢ’）から基礎階層の予測フレームＦ_ＢＦを生成する。そして、差分計算部１７０は、予測フレームＦ_ＰＦ，Ｆ_ＰＢから階層間の差分Ｆ_ＰＦ−Ｆ_ＰＢを求め、前記現在フレームＦ_Ｏから前記復元イメージＦ_Ｏ’及び前記差分Ｆ_ＰＦ−Ｆ_ＰＢを差し引くことによってＦ_△を求める。 On the other hand, the fourth embodiment corresponding to the second embodiment is not shown, but has the same configuration as the configuration diagram of FIG. However, there are some differences in the operation of the difference calculation unit 470. In the fourth embodiment corresponding to the second embodiment, the difference calculation unit 470 calculates the FGS layer predicted frame F _PF from the motion-compensated FGS layer reference frames mc (F _MF ′) and mc (F _NF ′). The base layer prediction frame F _BF is generated from the motion compensated base layer reference frames mc (F _MB ′) and mc (F _NB ′). Then, the difference calculation unit 170, predicted frame _F _PF, obtains the difference _F PF _{-F PB} between hierarchy from _{F PB,} subtracting the restored image _{F O} 'and the difference _F PF _{-F PB} the current from the frame _{F O} determine the F _△ by.

仮に、ここにｌｅａｋｙ予測（第５実施形態）を適用すれば、差分計算部１７０は、前記階層間の差分Ｆ_ＰＦ−Ｆ_ＰＢに加重因子（α）を掛け、前記現在フレームＦ_Ｏから前記復元イメージＦ_Ｏ’及び前記掛けた結果α×（Ｆ_ＰＦ−Ｆ_ＰＢ）を差し引くことによってＦ_△を求める。 If, by applying the herein leaky prediction (fifth exemplary embodiment), the difference calculation unit 170, the multiplying weighting factor to the difference _F PF _{-F PB} between layers (alpha), the said recovery current from the frame _{F O} F _Δ is obtained by subtracting the image F _O ′ and the multiplied result α × (F _PF −F _PB ).

図８は、本発明の第１実施形態によるビデオデコーダ７００の構成を示すブロック図である。 FIG. 8 is a block diagram showing a configuration of the video decoder 700 according to the first embodiment of the present invention.

エントロピー復号化部７０１は、入力されたビットストリームに対して無損失復号化を行い、基礎階層のテクスチャデータと、ＦＧＳ階層のテクスチャデータと、動きベクトルとを抽出する。無損失復号化は、エンコーダ段での無損失符号化過程の逆に進行される過程である。 The entropy decoding unit 701 performs lossless decoding on the input bitstream, and extracts base layer texture data, FGS layer texture data, and motion vectors. Lossless decoding is a process that is performed in reverse of the lossless encoding process in the encoder stage.

前記抽出された基礎階層のテクスチャデータＦ_ＰＢ ^Ｑは、逆量子化部７０５に提供され、前記抽出されたＦＧＳ階層のテクスチャデータＦ_△ ^Ｑは、逆量子化部１０４５に提供され、動きベクトルＭＶは、動き補償部７２０及び動きベクトル変更部７３０に提供される。 The extracted base layer texture data F _PB ^Q is provided to an inverse quantization unit 705, the extracted FGS layer texture data F _Δ ^Q is provided to an inverse quantization unit 1045, and a motion vector MV is The motion compensation unit 720 and the motion vector change unit 730 are provided.

逆量子化部７０５は、エントロピー復号化部７０１から出力される基礎階層のテクスチャデータＦ_ＰＢ ^Ｑを逆量子化する。このような逆量子化過程は量子化過程で使われたものと同じ量子化テーブルを用いて、量子化過程で生成されたインデックスからそれにマッチングされる値を復元する過程である。 The inverse quantization unit 705 inversely quantizes the base layer texture data F _PB ^Q output from the entropy decoding unit 701. Such an inverse quantization process is a process of restoring a value matched with an index generated in the quantization process using the same quantization table used in the quantization process.

逆変換部７１０は、前記逆量子化された結果に対して逆変換を行う。このような逆変換はエンコーダ段の変換過程を逆に行い、具体的に、逆ＤＣＴ変換、逆ウェーブレット変換などが使われる。 The inverse transform unit 710 performs inverse transform on the inversely quantized result. Such inverse transformation reverses the conversion process of the encoder stage, and specifically, inverse DCT transformation, inverse wavelet transformation or the like is used.

前記逆変換結果、復元された残差信号Ｆ_ＲＢ’は加算器７１５に提供される。 As a result of the inverse transformation, the restored residual signal F _RB ′ is provided to the adder 715.

動き補償部７２０は、抽出された動きベクトルＭＶによって予め復元され、バッファ７２５に格納された基礎階層の復元された参照フレームＦ_ＭＢ’，Ｆ_ＮＢ’を動き補償することによって予測イメージＦ_ＰＢを生成し、これを加算器７１５に提供する。 The motion compensation unit 720 generates a prediction image _FPB by performing motion compensation on the reference frames F _MB ′ and F _NB ′ restored in advance using the extracted motion vector MV and stored in the buffer 725. This is provided to the adder 715.

両方向予測の場合、予測イメージＦ_ＰＢは動き補償された参照フレームＦ_ＭＢ’，Ｆ_ＮＢ’の平均で計算でき、単方向予測の場合は、動き補償された参照フレームがそのまま予測イメージＦ_ＰＢになり得る。 In the case of bi-directional prediction, the prediction image F _PB can be calculated by the average of the motion-compensated reference frames F _MB ′ and F _NB ′. In the case of uni-directional prediction, the motion-compensated reference frame becomes the prediction image F _PB as it is. obtain.

加算器７１５は、入力されたＦ_ＲＢ’及びＦ_ＰＢを加算することによって基礎階層の復元されたイメージＦ_Ｏ’を出力し、バッファ７２５は前記復元されたイメージＦ_Ｏ’を格納する。 The adder 715 outputs the restored image F _O ′ of the base layer by adding the input F _RB ′ and _{FP B} , and the buffer 725 stores the restored image F _O ′.

一方、逆量子化部７４５はＦＧＳ階層のテクスチャデータＦ_△ ^Ｑを逆量子化し、逆変換部７５０は前記逆量子化された結果Ｆ_△ ^Ｔ’に対して逆変換を行うことによって、復元されたＦ_△（Ｆ_△’）を求めてフレーム復元部７５５に提供する。 Meanwhile, the inverse quantization unit 745 inversely quantizes the texture data F _△ ^Q of FGS layer, by the inverse transform unit 750 performs inverse transform on the inversely quantized result F _△ ^{T ',} it was restored F _Δ (F _Δ ′) is obtained and provided to the frame restoration unit 755.

動きベクトル変更部７３０は、前記抽出された動きベクトルＭＶを受信して動きベクトルの精度を下げる。例えば、前記動きベクトルＭＶの精度が１／４ピクセル単位であれば、前記動きベクトルＭＶは小数位の値として、０、０．２５、０．５、及び０．７５のうち１つを有することができる。動きベクトル変更部１５５は、前記１／４ピクセル単位の動きベクトルを１／２ピクセル単位、ピクセル単位などより低い精度の動きベクトルＭＶ_１に変更する。 The motion vector changing unit 730 receives the extracted motion vector MV and reduces the accuracy of the motion vector. For example, if the accuracy of the motion vector MV is 1/4 pixel unit, the motion vector MV has one of 0, 0.25, 0.5, and 0.75 as a decimal value. Can do. The motion vector changing unit 155 changes the motion vector in ¼ pixel units to a motion vector MV ₁ having a lower accuracy than ½ pixel unit, pixel unit, or the like.

動き補償部７３５は、前記変更された動きベクトルＭＶ_１を用いて、バッファ７２５から提供される基礎階層の復元された参照フレームＦ_ＭＢ’，Ｆ_ＮＢ’及びバッファ７４０から提供されるＦＧＳ階層の参照フレームＦ_ＭＦ’，Ｆ_ＮＦ’を動き補償し、その結果ｍｃ（Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＢ’），ｍｃ（Ｆ_ＭＦ’），ｍｃ（Ｆ_ＮＦ’）をフレーム復元部７５５に提供する。 The motion compensation unit 735 uses the changed motion vector MV ₁ to restore the reference frames F _MB ′ and F _NB ′ of the base layer provided from the buffer 725 and the reference of the FGS layer provided from the buffer 740. The frames F _MF ′ and F _NF ′ are motion-compensated, and as a result, mc (F _MB ′), mc (F _NB ′), mc (F _MF ′), mc (F _NF ′) are provided to the frame restoration unit 755. .

前記動き補償時に１／２ピクセル単位の動きベクトルＭＶ_１が使われる場合に、前記補間のためにＨ．２６４の６タップフィルタの代わりに、演算量が少ないバイリニアフィルタが使われ、それでも圧縮効率にそれほど大きい影響は与えない。 When the motion vector MV _{1 in} units of 1/2 pixel is used at the time of the motion compensation, H.264 is used for the interpolation. Instead of the H.264 6-tap filter, a bilinear filter with a small amount of calculation is used, and it still does not have a great influence on the compression efficiency.

フレーム復元部７５５は、前記動き補償されたＦＧＳ階層の参照フレームｍｃ（Ｆ_ＭＦ’），ｍｃ（Ｆ_ＮＦ’）と、前記動き補償された基礎階層の参照フレームｍｃ（Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＢ’）との差分を求める。すなわち、△_Ｍ（＝ｍｃ（Ｆ_ＭＦ’）−ｍｃ（Ｆ_ＭＢ’））、及び△_Ｎ（＝ｍｃ（Ｆ_ＮＦ’）−ｍｃ（Ｆ_ＮＢ’））を求める。もちろん、単方向参照である場合には１つの差分だけ求められる。 The frame restoration unit 755 includes the motion-compensated FGS layer reference frames mc (F _MF ′) and mc (F _NF ′) and the motion-compensated base layer reference frames mc (F _MB ′) and mc ( The difference from F _NB ′) is obtained. _{_{That, △ M (= mc (F}} MF ') -mc (F MB')), and _{_{△ N (= mc (F NF}} ') -mc (F NB')) determined. Of course, in the case of unidirectional reference, only one difference is obtained.

そして、フレーム復元部７５５は前記差分△_Ｍ，△_Ｎの平均を求め、前記Ｆ_△’と、基礎階層の復元されたイメージＦ_Ｏ’と、前記差分の平均とを加算する。その結果、ＦＧＳ階層の復元されたイメージＦ_ＯＦ’が生成される。もちろん単方向参照である場合には前記平均を求める過程を必要としない。 The frame restoration unit 755 obtains an average of the difference △ _M, △ _N, the F _△ 'and reconstructed image F _O base _layer' and adds the average of the difference. As a result, a restored image F _OF ′ of the FGS hierarchy is generated. Of course, in the case of unidirectional reference, the process of obtaining the average is not required.

バッファ７４０は復元されたイメージＦ_ＯＦ’を格納する。もちろん、バッファ７４０には予め復元されたイメージＦ_ＭＦ’，Ｆ_ＢＦ’も格納され得る。 Buffer 740 stores the restored image F _OF '. Of course, the image F _MF ′, F _BF ′ restored in advance can also be stored in the buffer 740.

一方、本発明の第２実施形態によるビデオデコーダの構成も図８と同様に示すことができる。ただし、第２実施形態では階層間の差分を求める前に各階層別に予測フレームを先に計算するという点だけ差がある。すなわち、フレーム復元部７５５の動作だけ差がある。 On the other hand, the configuration of the video decoder according to the second embodiment of the present invention can also be shown as in FIG. However, in the second embodiment, there is a difference in that a prediction frame is calculated first for each layer before obtaining a difference between layers. That is, there is a difference only in the operation of the frame restoration unit 755.

第２実施形態による場合、フレーム復元部７５５は前記動き補償されたＦＧＳ階層の参照フレームｍｃ（Ｆ_ＭＦ’），ｍｃ（Ｆ_ＮＦ’）からＦＧＳ階層の予測フレームＦ_ＰＦを生成し、前記動き補償された基礎階層の参照フレームｍｃ（Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＢ’）から基礎階層の予測フレームＦ_ＢＦを生成する。予測フレームを生成する過程は２つの動き補償された参照フレームを平均することによって簡単に求められる。もちろん、単方向参照である場合には動き補償されたフレームがそのまま予測フレームになる。 If according to the second embodiment, a frame restoration unit 755 generates a predicted frame _{F PF} of the reference frame mc _{(F MF} ') of the motion compensated FGS layer FGS layer from, mc _{(F NF'),} the motion compensation The base layer prediction frame F _BF is generated from the base layer reference frames mc (F _MB ′) and mc (F _NB ′). The process of generating a prediction frame is easily determined by averaging two motion compensated reference frames. Of course, in the case of unidirectional reference, the motion-compensated frame becomes the prediction frame as it is.

そして、フレーム復元部７５５は予測フレームＦ_ＰＦ，Ｆ_ＰＢから階層間の差分Ｆ_ＰＦ−Ｆ_ＰＢを求め、前記Ｆ_△’と、前記基礎階層の復元イメージＦ_Ｏ’と、前記差分Ｆ_ＰＦ−Ｆ_ＰＢとを加算する。 Then, the frame restoration unit 755 obtains a difference F _PF -F _PB between layers from the prediction frames F _PF and F _PB , and calculates the F _Δ ′, the restored image F _O ′ of the base layer, and the difference F _PF -F. Add _PB .

図９は、本発明の第３実施形態によるビデオデコーダ９００の構成を示すブロック図である。前記第１実施形態と第２実施形態によるビデオデコーダでは動き補償を先に行った後、イメージ間の差分を求めたが、第３実施形態ではこの順序を変えて参照イメージの階層間の差分を先に計算した後、動き補償を行う。図４と重複する説明を避けるために相違する部分を中心に説明する。 FIG. 9 is a block diagram showing a configuration of a video decoder 900 according to the third embodiment of the present invention. In the video decoders according to the first and second embodiments, after performing motion compensation, the difference between the images is obtained. In the third embodiment, the difference between the layers of the reference image is changed by changing the order. After the calculation, motion compensation is performed. In order to avoid an overlapping description with FIG.

差分器９６０はバッファ９４０から提供されるＦＧＳ階層の参照フレームＦ_ＭＦ’，Ｆ_ＮＦ’から、バッファ９２５から提供される基礎階層の復元された参照フレームＦ_ＭＢ’，Ｆ_ＮＢ’を差し引き、その結果Ｆ_ＭＦ’−Ｆ_ＭＢ’，Ｆ_ＮＦ’−Ｆ_ＮＢ’を動き補償部９３５に提供する。もちろん、単方向参照の場合には１つの差分だけ存在する。 The subtractor 960 subtracts the base layer restored reference frames F _MB ′ and F _NB ′ provided from the buffer 925 from the FGS layer reference frames F _MF ′ and F _NF ′ provided from the buffer 940, and the result F _MF ′ −F _MB ′ and F _NF ′ −F _NB ′ are provided to the motion compensation unit 935. Of course, there is only one difference in the case of unidirectional reference.

動き補償部９３５は動きベクトル変更部９３０から提供される、変更された動きベクトルＭＶ_１を用いて差分器３９０から提供される階層間参照フレームの差分Ｆ_ＭＦ’−Ｆ_ＭＢ’，Ｆ_ＮＦ’−Ｆ_ＮＢ’を動き補償する。前記動き補償時に１／２ピクセル単位の動きベクトルＭＶ_１が使われる場合、前記補間のためにＨ．２６４の６タップフィルタの代わりに、演算量が少ないバイリニアフィルタが使われ、それでも圧縮効率にそれほど大きい影響は与えない。 The motion compensation unit 935 provides the difference F _MF ′ −F _MB ′, F _NF ′ − of the inter-layer reference frame provided from the differentiator 390 using the changed motion vector MV ₁ provided from the motion vector change unit 930. F _NB 'is motion compensated. When a motion vector MV _{1 in} units of 1/2 pixel is used during the motion compensation, H.264 is used for the interpolation. Instead of the H.264 6-tap filter, a bilinear filter with a small amount of calculation is used, and it still does not have a great influence on the compression efficiency.

フレーム復元部９５５は動き補償された差分ｍｃ（Ｆ_ＭＦ’−Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＦ’−Ｆ_ＮＢ’）の平均を求め、逆変換部９５０から提供されるＦ_△’と、基礎階層の復元イメージＦ_Ｏ’と、前記差分の平均とを加算する。もちろん単方向参照である場合には前記平均を求める過程を必要としない。 The frame restoration unit 955 obtains the average of the motion-compensated differences mc (F _MF ′ −F _MB ′) and mc (F _NF ′ −F _NB ′), and F _Δ ′ provided from the inverse transform unit 950 and the basis The restored image F _O ′ of the hierarchy is added to the average of the differences. Of course, in the case of unidirectional reference, the process of obtaining the average is not required.

図１０ないし図１１は、本発明の第４実施形態によるビデオデコーダ１０００，１２００の構成を示すブロック図である。 10 to 11 are block diagrams showing configurations of video decoders 1000 and 1200 according to the fourth embodiment of the present invention.

第４実施形態のビデオデコーダと前記第１ないし第３実施形態のビデオデコーダとの差は、単にフレーム復元部の加算過程で基礎階層の復元されたフレームＦ_Ｏ’の代わりに、基礎階層の予測フレームＦ_ＰＢが使われるという点でだけである。 The difference between the video decoder of the fourth embodiment and the video decoders of the first to third embodiments is that the prediction of the base layer is simply performed instead of the frame F _O ′ of the base layer restored in the addition process of the frame restoration unit. Only in that the frame _FPB is used.

図１０は図８（第１実施形態）に対応し、図１１は図９（第３実施形態）に対応する。図１０を参照すれば、フレーム復元部１０５５には図８の基礎階層の復元されたイメージＦ_Ｏ’の代わりに、基礎階層の参照イメージＦ_ＰＢが動き補償部１０２０から提供されるものと示されている。したがって、フレーム復元部１０５５は逆変換部１０５０から提供されるＦ_△’と、前記予測イメージＦ_ＰＢと、階層間の差分△_Ｍ，△_Ｎの平均とを加算することによって、ＦＧＳ階層の復元されたイメージＦ_ＯＦ’を求めることができる。 10 corresponds to FIG. 8 (first embodiment), and FIG. 11 corresponds to FIG. 9 (third embodiment). Referring to FIG. 10, the frame restoration unit 1055 indicates that the base layer reference image _FPB is provided from the motion compensation unit 1020 instead of the base layer restored image F _O ′ of FIG. 8. ing. Accordingly, the frame restoration unit 1055 restores the FGS layer by adding F _Δ 'provided from the inverse transformation unit 1050, the predicted image F _PB, and the average of the differences Δ _M and Δ _N between layers. Image F _OF 'can be obtained.

同様に、図１１において、フレーム復元部１２５５は逆変換部１２５０から提供されるＦ_△’と、動き補償部１２２０から提供される予測イメージＦ_ＰＢと、動き補償された差分ｍｃ（Ｆ_ＭＦ’−Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＦ’−Ｆ_ＮＢ’）の平均とを加算することによって、ＦＧＳ階層の復元されたイメージＦ_ＯＦ’を求めることができる。 Similarly, in FIG. 11, the frame recovery unit 1255 F _△ provided from the inverse transformation unit 1250 'and the prediction image _{F PB} is provided from the motion compensation unit 1220, a motion-compensated difference mc _{(F MF'} - The restored image F _OF ′ of the FGS hierarchy can be obtained by adding the average of F _MB ′) and mc (F _NF ′ −F _NB ′).

一方、第２実施形態に対応する第４実施形態は図示していないが、図８の構成図と同じ構成を有する。ただし、フレーム復元部１２５５の動作で多少の差があるだけである。第２実施形態に対応する第４実施形態で、フレーム復元部１２５５は動き補償されたＦＧＳ階層の参照フレームｍｃ（Ｆ_ＭＦ’），ｍｃ（Ｆ_ＮＦ’）からＦＧＳ階層の予測フレームＦ_ＰＦを生成し、動き補償された基礎階層の参照フレームｍｃ（Ｆ_ＭＢ’），ｍｃ（Ｆ_ＮＢ’）から基礎階層の予測フレームＦ_ＢＦを生成する。そして、フレーム復元部１２５５は予測フレームＦ_ＰＦ，Ｆ_ＰＢから階層間の差分Ｆ_ＰＦ−Ｆ_ＰＢを求め、逆変換部１２５０から提供されるＦ_△’と、動き補償部１２２０から提供される予測イメージＦ_ＰＢと、予測フレーム間の差分Ｆ_ＰＦ−Ｆ_ＰＢとを加算することによって、ＦＧＳ階層の復元されたイメージＦ_ＯＦ’を求めることができる。 On the other hand, the fourth embodiment corresponding to the second embodiment is not shown, but has the same configuration as the configuration diagram of FIG. However, there is only a slight difference in the operation of the frame restoration unit 1255. In the fourth embodiment corresponding to the second embodiment, generates a predicted frame _{F PF} of FGS layer frame restoration unit 1255 reference frame mc of the motion compensated FGS layer _{(F MF} ') from, mc _{(F NF')} Then, the base layer predicted frame F _BF is generated from the motion compensated base layer reference frames mc (F _MB ′) and mc (F _NB ′). Then, the frame restoration unit 1255 obtains a difference F _PF −F _PB between layers from the prediction frames F _PF and F _PB, and F _Δ ′ provided from the inverse conversion unit 1250 and the prediction image provided from the motion compensation unit 1220. and F _PB, by adding the difference _F PF _{-F PB} between a predicted frame, it is possible to obtain an image _{F oF} 'restored the FGS layer.

仮に、ここにｌｅａｋｙ予測（第５実施形態）を適用すれば、フレーム復元部１２５５は前記階層間の差分Ｆ_ＰＦ−Ｆ_ＰＢに加重因子（α）を掛け、前記Ｆ_△’と、前記復元イメージＦ_Ｏ’と、前記掛けられた結果α×（Ｆ_ＰＦ−Ｆ_ＰＢ）とを加算することによってＦ_ＯＦ’を求める。 If leaky prediction (fifth embodiment) is applied here, the frame restoration unit 1255 multiplies the difference F _PF -F _PB between the layers by a weighting factor (α), and the F _Δ ′ and the restored image F _OF ′ is obtained by adding F _O ′ and the multiplied result α × (F _PF −F _PB ).

図１２は、本発明の一実施形態によるビデオエンコーダ１００，３００，４００，６００、またはビデオデコーダ７００，９００，１０００，１２００を実現するためのシステムの構成図である。前記システムは、例えばＴＶ、セットトップボックス、デスクトップ、ラップトップコンピュータ、パームトップコンピュータ、ＰＤＡ、ビデオまたはイメージ格納装置（例えば、ＶＣＲ、ＤＶＲなど）を示す。また、前記システムは前記装置を組み合わせたもの、または前記装置が他の装置の一部分として含まれたものであり得る。前記システムは少なくとも１つ以上のビデオソース１３１０、１つ以上の入出力装置１３２０、プロセッサ１３４０、メモリ１３５０、そしてディスプレイ装置１３３０を含んで構成され得る。 FIG. 12 is a configuration diagram of a system for realizing the video encoder 100, 300, 400, 600 or the video decoder 700, 900, 1000, 1200 according to an embodiment of the present invention. The system represents, for example, a TV, set-top box, desktop, laptop computer, palmtop computer, PDA, video or image storage device (eg, VCR, DVR, etc.). The system may be a combination of the devices, or the device may be included as part of another device. The system may be configured to include at least one or more video sources 1310, one or more input / output devices 1320, a processor 1340, a memory 1350, and a display device 1330.

ビデオソース１３１０は、ＴＶ受信機、ＶＣＲ、または他のビデオ格納装置を示す。また、前記ソース１３１０は、インターネット、ＷＡＮ、ＬＡＮ、地上波放送システム、ケーブルネットワーク、衛星通信ネットワーク、無線ネットワーク、電話ネットワークなどを用いてサーバーからビデオを受信するための１つ以上のネットワーク連結を示す。また、前記ソースは前記ネットワークを組み合わせたもの、または前記ネットワークが他のネットワークの一部分として含まれたものを示す。 Video source 1310 represents a TV receiver, VCR, or other video storage device. Also, the source 1310 indicates one or more network connections for receiving video from a server using the Internet, WAN, LAN, terrestrial broadcasting system, cable network, satellite communication network, wireless network, telephone network, etc. . In addition, the source indicates a combination of the networks, or the network included as a part of another network.

入出力装置１３２０、プロセッサ１３４０、そしてメモリ１３５０は通信媒体１３６０を介して通信する。前記通信媒体１３６０には通信バス、通信ネットワーク、または１つ以上の内部連結回路を示す。前記ソース１３１０から受信される入力ビデオデータは、メモリ１３５０に格納された１つ以上のソフトウェアプログラムによってプロセッサ１３４０で処理され得、ディスプレイ装置１３３０に提供される出力ビデオを生成するためにプロセッサ１３４０で実行され得る。 The input / output device 1320, the processor 1340, and the memory 1350 communicate via a communication medium 1360. The communication medium 1360 represents a communication bus, a communication network, or one or more internal connection circuits. Input video data received from the source 1310 may be processed by the processor 1340 by one or more software programs stored in the memory 1350 and executed by the processor 1340 to generate output video provided to the display device 1330. Can be done.

特に、メモリ１３５０に格納されたソフトウェアプログラムは、本発明による方法を行うウェーブレット変換に基づいた拡張性のあるコーデックを含むことができる。前記エンコーダまたは前記コーデックは、メモリ１３５０に格納されたり、ＣＤ−ＲＯＭやフロッピー（登録商標）ディスクといった格納媒体で読み取られたり、各種ネットワークを介して所定のサーバーからダウンロードしたものであり得る。 In particular, the software program stored in the memory 1350 can include a scalable codec based on wavelet transforms that perform the method according to the invention. The encoder or the codec may be stored in the memory 1350, read by a storage medium such as a CD-ROM or a floppy (registered trademark) disk, or downloaded from a predetermined server via various networks.

以上、添付する図面を参照して本発明の実施形態を説明したが、本発明の属する技術分野における通常の知識を有する者は本発明がその技術的思想や必須的な特徴を変更せずに他の具体的な形態によって実施できることを理解することができる。したがって前述した実施形態はすべての面で例示的なものであって、限定的なものではないことを理解しなければならない。 The embodiments of the present invention have been described above with reference to the accompanying drawings. However, those skilled in the art to which the present invention pertains have ordinary skill in the art without changing the technical idea or essential features. It can be understood that it can be implemented in other specific forms. Accordingly, it should be understood that the above-described embodiments are illustrative in all aspects and not limiting.

従来のＦＧＳ技術を説明する図面である。It is drawing explaining the conventional FGS technique. 従来のＰＦＧＳ技術を説明する図面である。1 is a diagram illustrating a conventional PFGS technique. 本発明の第１実施形態による高速ＰＦＧＳ方法を説明する図面である。1 is a diagram illustrating a high-speed PFGS method according to a first embodiment of the present invention. 本発明の第１実施形態によるビデオエンコーダの構成を示すブロック図である。It is a block diagram which shows the structure of the video encoder by 1st Embodiment of this invention. 本発明の第３実施形態によるビデオエンコーダの構成を示すブロック図である。It is a block diagram which shows the structure of the video encoder by 3rd Embodiment of this invention. 本発明の第４実施形態によるビデオエンコーダの構成を示すブロック図である。It is a block diagram which shows the structure of the video encoder by 4th Embodiment of this invention. 本発明の第４実施形態によるビデオエンコーダの構成を示すブロック図である。It is a block diagram which shows the structure of the video encoder by 4th Embodiment of this invention. 本発明の第１実施形態によるビデオデコーダの構成を示すブロック図である。It is a block diagram which shows the structure of the video decoder by 1st Embodiment of this invention. 本発明の第３実施形態によるビデオデコーダの構成を示すブロック図である。It is a block diagram which shows the structure of the video decoder by 3rd Embodiment of this invention. 本発明の第４実施形態によるビデオデコーダの構成を示すブロック図である。It is a block diagram which shows the structure of the video decoder by 4th Embodiment of this invention. 本発明の第４実施形態によるビデオデコーダの構成を示すブロック図である。It is a block diagram which shows the structure of the video decoder by 4th Embodiment of this invention. 本発明の一実施形態によるビデオエンコーダまたはビデオデコーダを実現するためのシステムの構成図である。1 is a configuration diagram of a system for realizing a video encoder or a video decoder according to an embodiment of the present invention.

Explanation of symbols

１００、３００、４００、６００ビデオエンコーダ
７００、９００、１０００、１２００ビデオデコーダ
１０５動きベクトル探索部
１１０、１６０動き補償部
１２０変換部
１２５量子化部
１３０逆量子化部
１３５逆変換部
１５０エントロピー符号化部
１５５動きベクトル変更部
１７０差分計算部
１７５変換部
１８０量子化部
７０１エントロピー復号化部
７０５、７４５逆量子化部
７１０、７５０逆変換部
７２０、７３５動き補償部
７３０動きベクトル変更部
７５５フレーム復元部 100, 300, 400, 600 Video encoder 700, 900, 1000, 1200 Video decoder 105 Motion vector search unit 110, 160 Motion compensation unit 120 Conversion unit 125 Quantization unit 130 Inverse quantization unit 135 Inverse conversion unit 150 Entropy encoding unit 155 Motion vector change unit 170 Difference calculation unit 175 Conversion unit 180 Quantization unit 701 Entropy decoding unit 705, 745 Inverse quantization unit 710, 750 Inverse conversion unit 720, 735 Motion compensation unit 730 Motion vector change unit 755 Frame restoration unit

Claims

Obtaining a predicted image for the current frame using a motion vector estimated with a predetermined accuracy;
Generating a restored image of the current frame by quantizing a residual between the current frame and the predicted image and then inverse-quantizing;
Motion-compensating a reference frame of the FGS layer and a reference frame of the base layer using the estimated motion vector;
Obtaining a difference between the reference frame of the motion compensated FGS layer and the reference frame of the motion compensated base layer;
Subtracting the restored image and the difference from the current frame;
Encoding the subtraction result;
A video encoding method based on FGS.

The motion vector used in the step of motion compensation of the reference frame of the FGS layer and the reference frame of the base layer using the estimated motion vector has lower accuracy than the estimated motion vector. Item 4. The FGS-based video encoding method according to Item 1.

The obtained difference is an average of the first difference between the forward reference frame in the FGS layer and the forward reference frame in the base layer, and the second difference between the reverse reference frame in the FGS layer and the reverse reference frame in the base layer. The FGS-based video encoding method according to claim 1, wherein:

3. The FGS-based video according to claim 2, wherein an interpolation filter having a different form from an interpolation filter used for obtaining a prediction image for the current frame is used when interpolation is necessary for the motion compensation. Encoding method.

The step of encoding the subtraction result includes:
Generating a conversion coefficient by converting the subtraction result;
Quantizing the transform coefficient to generate a quantized coefficient;
Lossless encoding the quantized coefficients;
The FGS-based video encoding method according to claim 1, further comprising:

Obtaining a predicted image for the current frame;
Estimating a motion vector using the current frame and the restored frame of at least one base layer as a reference frame;
Motion compensating the reference frame with the estimated motion vector;
Determining the predicted image by averaging the motion compensated reference frames;
The FGS-based video encoding method according to claim 1, further comprising:

Obtaining a predicted image for the current frame;
Estimating a motion vector using the current frame and an original frame around the current frame as a reference frame;
Motion compensating the reference frame with the estimated motion vector;
Determining the predicted image by averaging the motion compensated reference frames;
The FGS-based video encoding method according to claim 1, further comprising:

The FGS-based video encoding method according to claim 1, wherein the reference frame of the FGS layer is an original frame around the current frame, and the reference frame of the base layer is a peripheral frame restored in the base layer. .

The FGS-based video of claim 1, wherein the reference frame of the FGS layer is a peripheral frame restored in the FGS layer, and the reference frame of the base layer is a peripheral frame restored in the base layer. Encoding method.

6. The FGS base according to claim 5, wherein the magnitude of the quantization step used to quantize the transform coefficient is smaller than the magnitude of the quantization step used to quantize the residual. Video encoding method.

Obtaining a predicted image for the current frame using a motion vector estimated with a predetermined accuracy;
Generating a restored image of the current frame by quantizing a residual between the current frame and the predicted image and then inverse-quantizing;
Generating an FGS layer prediction frame and a base layer prediction frame by performing motion compensation on an FGS layer reference frame and a base layer reference frame using the estimated motion vector;
Obtaining a difference between the predicted frame of the FGS layer and the predicted frame of the base layer;
Subtracting the restored image and the difference from the current frame;
Encoding the subtraction result;
A video encoding method based on FGS.

The motion vector used in the step of generating the prediction frame of the FGS layer and the prediction frame of the base layer by performing motion compensation on the reference frame of the FGS layer and the reference frame of the base layer using the estimated motion vector, The method of claim 11, wherein the FGS-based video encoding method has a lower accuracy than the estimated motion vector.

The FGS layer prediction frame is an average of the motion compensated FGS layer reference frames, and the base layer prediction frame is an average of the motion compensated base layer reference frames. The FGS-based video encoding method according to claim 11.

13. The FGS-based video according to claim 12, wherein an interpolation filter having a different form from the interpolation filter used in the step of obtaining a prediction image for the current frame is used when interpolation is required for the motion compensation. Encoding method.

The step of encoding the subtraction result includes:
Generating a conversion coefficient by converting the subtraction result;
Quantizing the transform coefficient to generate a quantized coefficient;
Lossless encoding the quantized coefficients;
The FGS-based video encoding method according to claim 11, further comprising:

The FGS according to claim 15, wherein the magnitude of the quantization step used to quantize the transform coefficient is smaller than the magnitude of the quantization step used to quantize the residual. Base video encoding method.

Obtaining a predicted image for the current frame using a motion vector estimated with a predetermined accuracy;
Generating a restored image of the current frame by quantizing a residual between the current frame and the predicted image and then inverse-quantizing;
Obtaining a difference between the reference frame of the FGS layer and the reference frame of the base layer;
Motion compensating the difference using the estimated motion vector;
Subtracting the restored image and the motion compensated result from the current frame;
Encoding the subtraction result;
A video encoding method based on FGS.

The method of claim 17, wherein a motion vector used in the motion compensation step has a lower accuracy than the estimated motion vector.

The method of claim 17, wherein the subtracted motion compensated result is an average of the motion compensated differences in the motion compensation step.

19. The FGS-based video of claim 18, wherein an interpolation filter having a different form from the interpolation filter used in the step of obtaining a prediction image for the current frame is used when interpolation is required for the motion compensation. Encoding method.

The step of encoding the subtraction result includes:
Generating a conversion coefficient by converting the subtraction result;
Quantizing the transform coefficient to generate a quantized coefficient;
Lossless encoding the quantized coefficients;
The FGS-based video encoding method according to claim 17, further comprising:

The FGS of claim 21, wherein the magnitude of the quantization step used to quantize the transform coefficient is smaller than the magnitude of the quantization step used to quantize the residual. Base video encoding method.

Obtaining a predicted image for the current frame using a motion vector estimated with a predetermined accuracy;
Motion-compensating the reference frame of the FGS layer and the reference frame of the base layer with a motion vector having a lower accuracy than the motion vector;
Obtaining a difference between the reference frame of the motion compensated FGS layer and the reference frame of the motion compensated base layer;
Subtracting the predicted image and the difference from the current frame;
Encoding the subtraction result;
A video encoding method based on FGS.

Obtaining a predicted image for the current frame using a motion vector estimated with a predetermined accuracy;
Generating a FGS layer predicted frame and a base layer predicted frame by motion compensating the FGS layer reference frame and the base layer reference frame with a motion vector having a lower accuracy than the motion vector;
Obtaining a difference between the predicted frame of the FGS layer and the predicted frame of the base layer;
Subtracting the predicted image and the difference from the current frame;
Encoding the subtraction result;
A video encoding method based on FGS.

The method of claim 24, further comprising multiplying the obtained difference by a weighting factor (α).

Obtaining a predicted image for the current frame using a motion vector estimated with a predetermined accuracy;
Obtaining a difference between the reference frame of the FGS layer and the reference frame of the base layer;
Motion-compensating the difference with a motion vector that is less accurate than the motion vector;
Subtracting the restored image and the motion compensated result from the current frame;
Encoding the subtraction result;
A video encoding method based on FGS.

Extracting base layer texture data, FGS layer texture data, and motion vectors from the input bitstream;
Restoring a base layer frame from the base layer texture data;
Using the motion vector to perform motion compensation of a reference frame of an FGS layer and a reference frame of a base layer;
Obtaining a difference between the reference frame of the motion compensated FGS layer and the reference frame of the motion compensated base layer;
Adding the base layer frame, the texture data of the FGS layer, and the difference;
An FGS-based video decoding method comprising:

The method of claim 27, wherein the motion vector used in the motion compensation step has a lower accuracy than the extracted motion vector.

The difference is an average of the first difference between the forward reference frame in the FGS layer and the forward reference frame in the base layer, and the second difference between the reverse reference frame in the FGS layer and the reverse reference frame in the base layer. 28. The FGS-based video decoding method according to claim 27.

29. The FGS-based video data according to claim 28, wherein an interpolation filter having a different form from the interpolation filter used in the step of restoring the base layer frame is used when interpolation is required for the motion compensation. Coding method.

The FGS base according to claim 27, wherein the texture data of the FGS layer to be added is a result of performing an inverse quantization process and an inverse transform process on the extracted texture data of the FGS layer. Video decoding method.

Restoring the base layer frame comprises:
Dequantizing the base layer texture information;
Inverse transforming the inverse quantization result;
Generating a predicted image from a reference frame of a base layer previously restored using the motion vector;
Adding the prediction image and the inverse transformation result;
32. The FGS-based video decoding method according to claim 31, further comprising:

The size of the quantization step used in the inverse quantization applied to the texture data of the FGS layer is smaller than the size of the quantization step used in the inverse quantization of the step of restoring the base layer frame. 33. The FGS-based video decoding method according to claim 32, wherein:

Extracting base layer texture data, FGS layer texture data, and motion vectors from the input bitstream;
Restoring a base layer frame from the base layer texture data;
Generating a FGS layer prediction frame and a base layer prediction frame by motion-compensating an FGS layer reference frame and a base layer reference frame using the motion vector;
Obtaining a difference between the predicted frame of the FGS layer and the predicted frame of the base layer;
Adding the texture data, the restored base layer frame and the difference;
An FGS-based video decoding method comprising:

The method of claim 34, wherein the motion vector used in the motion compensation has a lower accuracy than the extracted motion vector.

The FGS-based video data according to claim 35, wherein an interpolation filter having a different form from the interpolation filter used in the step of restoring the base layer frame is used when interpolation is required for the motion compensation. Coding method.

The FGS base according to claim 34, wherein the texture data of the FGS layer to be added is a result of performing an inverse quantization process and an inverse transform process on the extracted texture data of the FGS layer. Video decoding method.

Extracting base layer texture data, FGS layer texture data, and motion vectors from the input bitstream;
Restoring a base layer frame from the base layer texture data;
Obtaining a difference between the reference frame of the FGS layer and the reference frame of the base layer;
Motion compensating the difference using the motion vector;
Adding the FGS layer texture data, the restored base layer frame, and the motion compensated result;
An FGS-based video decoding method comprising:

The method of claim 38, wherein the added motion compensated result is an average of the motion compensated differences.

The FGS-based video decoding method of claim 38, wherein a motion vector used in the motion compensation of the difference has a lower accuracy than the extracted motion vector.

The FGS-based video data according to claim 40, wherein an interpolation filter having a different form from the interpolation filter used in the step of restoring the base layer frame is used when interpolation is necessary for the motion compensation. Coding method.

The FGS base according to claim 38, wherein the texture data of the added FGS layer is a result of performing an inverse quantization process and an inverse transform process on the extracted texture data of the FGS layer. Video decoding method.

Extracting base layer texture data, FGS layer texture data, and motion vectors from the input bitstream;
Restoring the predicted image of the base layer frame from the texture data of the base layer using the extracted motion vector;
Motion-compensating the reference frame of the FGS layer and the reference frame of the base layer with a motion vector having a lower accuracy than the motion vector;
Obtaining a difference between the reference frame of the motion compensated FGS layer and the reference frame of the motion compensated base layer;
Adding the FGS layer texture data, the predicted image, and the difference;
An FGS-based video decoding method comprising:

Extracting base layer texture data, FGS layer texture data, and motion vectors from the input bitstream;
Restoring the predicted image of the base layer frame from the texture data of the base layer using the extracted motion vector;
Generating a FGS layer predicted frame and a base layer predicted frame by motion compensating the FGS layer reference frame and the base layer reference frame with a motion vector having a lower accuracy than the motion vector;
Obtaining a difference between the predicted frame of the FGS layer and the predicted frame of the base layer;
Adding the FGS layer texture data, the predicted image, and the difference;
An FGS-based video decoding method comprising:

45. The FGS-based video decoding method according to claim 44, further comprising a step of multiplying the obtained difference by a weighting factor ([alpha]).

Extracting base layer texture data, FGS layer texture data, and motion vectors from the input bitstream;
Restoring the predicted image of the base layer frame from the texture data of the base layer using the extracted motion vector;
Obtaining a difference between the reference frame of the FGS layer and the reference frame of the base layer;
Motion-compensating the difference with a motion vector that is less accurate than the motion vector;
Adding the FGS layer texture data, the predicted image, and the difference;
An FGS-based video decoding method comprising:

Means for obtaining a predicted image for the current frame using a motion vector estimated with a predetermined accuracy;
Means for generating a reconstructed image of the current frame by dequantizing the residual between the current frame and the predicted image and then dequantizing;
Means for generating a prediction frame of the FGS layer and a prediction frame of the base layer by performing motion compensation on the reference frame of the FGS layer and the reference frame of the base layer using the estimated motion vector;
Means for obtaining a difference between the prediction frame of the FGS layer and the prediction frame of the base layer;
Means for subtracting the restored image and the difference from the current frame;
Means for encoding the subtraction result;
A video encoder based on FGS, comprising:

The FGS-based video of claim 47, wherein a motion vector used in the means for generating the FGS layer prediction frame and the base layer prediction frame has lower accuracy than the estimated motion vector. Encoder.

Means for extracting texture data of the base layer, texture data of the FGS layer, and a motion vector from the input bitstream;
Means for restoring a base layer frame from the base layer texture data;
Means for generating a FGS layer prediction frame and a base layer prediction frame by motion-compensating an FGS layer reference frame and a base layer reference frame using the motion vector;
Means for obtaining a difference between the prediction frame of the FGS layer and the prediction frame of the base layer;
Means for adding the texture data, the restored base layer frame and the difference;
A video decoder based on FGS.

50. The FGS-based video decoder of claim 49, wherein a motion vector used in the motion compensation has a lower accuracy than the extracted motion vector.