JP2006211304A

JP2006211304A - Device, method, and program for video image coding and decoding

Info

Publication number: JP2006211304A
Application number: JP2005020750A
Authority: JP
Inventors: Daijiro Ichimura; 大治郎市村; Yoshimasa Honda; 義雅本田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2005-01-28
Filing date: 2005-01-28
Publication date: 2006-08-10

Abstract

PROBLEM TO BE SOLVED: To solve a problem that, if information showing a scanning order is stored in a video image stream, a code amount increases and coding efficiency worsens. SOLUTION: The device creates a determination formula for determining scanning order from a reference image for coding images, and determines scanning order for each image block using the determination formula. Moreover, it is possible to obtain scanning order for each image block determined by a coding device also in a decoding device by storing the determination formula in the image stream. COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、映像を符号化し映像ストリームを生成する映像符号化装置および方法と、映像ストリームを復号化して復号化映像を生成する映像復号化装置および方法に関するものである。 The present invention relates to a video encoding apparatus and method for encoding video and generating a video stream, and a video decoding apparatus and method for generating decoded video by decoding the video stream.

映像は、もはや我々の生活とは切り離せない関係にあり、インターネットや携帯電話網、放送波、蓄積メディアなどの伝送手段を通じ、パソコンや携帯端末、テレビ、ハイビジョンテレビなどの多様な表示端末において、視覚的な情報を享受させてくれる重要な存在となった。伝送手段を通じて伝送される映像は、効率良く情報を伝達するために、映像符号化技術を用いてより少ないデータ量を持つ映像ストリームに圧縮される。 Video is no longer inseparable from our daily lives, and it can be viewed on various display terminals such as personal computers, mobile terminals, TVs, and high-definition TVs through transmission methods such as the Internet, mobile phone networks, broadcast waves, and storage media. It became an important existence to let you enjoy the information. The video transmitted through the transmission means is compressed into a video stream having a smaller amount of data using a video encoding technique in order to efficiently transmit information.

また、１つの映像を異なる速度を持つ伝送手段に接続された複数のユーザに伝送することが求められており、幾つかの階層からなるデータ構造を持ち、符号化後も必要に応じて映像ストリームの伝送量を変更することのできる階層符号化方式の研究開発が進んでいる。階層符号化方式では、映像を、単体で復号化可能な基本レイヤストリームと、基本レイヤストリームの画質を向上させるための拡張レイヤストリームに階層化して符号化する。 Also, it is required to transmit one video to a plurality of users connected to transmission means having different speeds, and it has a data structure consisting of several layers, and a video stream as needed after encoding. Research and development of a hierarchical coding method that can change the transmission amount of the video is progressing. In the hierarchical coding scheme, video is layered and coded into a base layer stream that can be decoded alone and an enhancement layer stream for improving the image quality of the base layer stream.

しかし、近年、カメラや映像ディスプレイの高解像度化による映像の持つ情報は膨大となり、これら映像符号化方式をさらに高効率化する技術が求められている。 However, in recent years, the amount of information held by video due to higher resolution of cameras and video displays has become enormous, and a technology for further increasing the efficiency of these video encoding methods is required.

ここで、映像とは連続した画像、すなわち動画像のことを指す。 Here, the video refers to continuous images, that is, moving images.

映像符号化技術では、入力した画像を例えば８×８画素ごとの画像ブロックに分割してＤＣＴ変換（ＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍａｔｉｏｎ：離散コサイン変換）を行い、周波数を表すＤＣＴ係数に変換する。 In the video encoding technique, an input image is divided into, for example, image blocks for every 8 × 8 pixels, and DCT conversion (Discrete Cosine Transformation) is performed to convert the input image into DCT coefficients representing a frequency.

これは映像が統計的に低周波を多く含み高周波をあまり含まないという特性を利用したもので、各係数を適当な数値で除算する量子化により人間の視覚的に重要でない周波数を０にして情報量を減らすことが可能である。 This uses the characteristic that the video statistically contains a lot of low frequencies and does not contain high frequencies, and the frequency that is not visually important to humans is set to 0 by quantization that divides each coefficient by an appropriate value. It is possible to reduce the amount.

図１２（ａ）は量子化する前の８×８画素の画像ブロックのＤＣＴ係数を表し、図１２（ｂ）は量子化後のＤＣＴ係数を表す。図１２（ｂ）では高周波ほど大きい値で除算して量子化している。 FIG. 12A shows DCT coefficients of an image block of 8 × 8 pixels before quantization, and FIG. 12B shows DCT coefficients after quantization. In FIG. 12 (b), quantization is performed by dividing by a larger value as the frequency becomes higher.

量子化したＤＣＴ係数を並べ替えることをスキャンといい、図１３（ａ）に示すジグザグスキャンの順序で低周波成分から高周波成分にＤＣＴ係数を並べ替えることにより、量子化により０になった統計的に小さい絶対値をもつ高周波係数を順序の後半に偏らせることが可能である。 Rearranging the quantized DCT coefficients is referred to as scanning, and the statistical values that are zeroed by quantization by rearranging the DCT coefficients from low frequency components to high frequency components in the zigzag scan order shown in FIG. It is possible to bias the high frequency coefficients having small absolute values to the second half of the order.

さらにスキャンによって並べ替えたＤＣＴ係数を「以降の係数は全て０である」ことを表す「Ｅｎｄ−ｏｆ−Ｂｌｏｃｋ信号」すなわちＥｏＢを用いて符号化ことにより、後半の０を効率よく１つの信号で表現することが可能である。スキャン順序の先頭から最後の０以外のＤＣＴ係数が現れるまでの順番をスキャン長といい、スキャン長が短くなれば符号量が減り符号化効率が向上する。 Further, the DCT coefficients rearranged by scanning are encoded using an “End-of-Block signal” that indicates that “the subsequent coefficients are all 0”, that is, EoB, so that the latter half of 0 can be efficiently converted into one signal. It is possible to express. The order from the beginning of the scan order until the last DCT coefficient other than 0 appears is called the scan length. If the scan length is shortened, the code amount is reduced and the coding efficiency is improved.

ここで、全く同じ復号化画像を得ることが可能な２つの映像ストリームがある場合、より符号量の少ない映像ストリームを、符号化効率が良いという。また、全く同じ符号量の映像ストリームが２つある場合、復号化画像がより綺麗な映像ストリームを、符号化効率が良いという。 Here, when there are two video streams from which the same decoded image can be obtained, a video stream with a smaller code amount is said to have good coding efficiency. In addition, when there are two video streams with the same code amount, a video stream with a cleaner decoded image is said to have good encoding efficiency.

ここで、より綺麗な画像とは種々の定義があるが、人間が見てより綺麗と感じる画像や、原画像との誤差が少ない画像を言う。 Here, there are various definitions of a cleaner image, but an image that is more beautiful when viewed by humans or an image with less error from the original image.

特許文献１では、画像ブロックのエッジを調べ、スキャン順序を適用的に選択することにより、スキャンしたＤＣＴ係数の後半により多くの０を偏らせ符号化効率を向上させる。ここで、エッジとは種々の定義が存在するが、ここでは、水平方向のエッジとは左右に隣接する２画素の差であらわし、垂直方向のエッジとは上下に隣接する２画素の差で表すものとする。 In Patent Document 1, the edge of an image block is examined, and the scan order is appropriately selected, so that more zeros are biased in the latter half of the scanned DCT coefficient to improve the coding efficiency. Here, there are various definitions of the edge. Here, the edge in the horizontal direction is represented by a difference between two pixels adjacent to the left and right, and the edge in the vertical direction is represented by a difference between two pixels adjacent in the vertical direction. Shall.

図１４は、特許文献１を適用した符号化装置の構成を示す。エッジ抽出部１４０１によりＤＣＴ変換する前の画像ブロックごとに水平方向と垂直方向のエッジの量を計算する。ブロック分類部１４０２がそのエッジの量に応じて例えば図１３（ａ）（ｂ）（ｃ）の中から画像ブロックごとに適するスキャン順序を選択する。例えば、画像ブロックが水平方向のエッジを多く含むならば水平周波数が多く含まれているので、水平周波数を優先してスキャンする図１３（ｂ）の水平スキャンを選択する。垂直方向のエッジを多く含むならば図１３（ｃ）の垂直スキャンを選択する。適応型ＤＣＴ符号化部１４０３がＤＣＴ変換や画像ブロックごとに選択したスキャン順序に従ったスキャンなどにより符号化する。 FIG. 14 shows a configuration of an encoding apparatus to which Patent Document 1 is applied. The edge extraction unit 1401 calculates the amount of edges in the horizontal and vertical directions for each image block before DCT conversion. The block classification unit 1402 selects a scan order suitable for each image block from FIGS. 13A, 13B, and 13C, for example, according to the amount of the edge. For example, if the image block includes many horizontal edges, the horizontal frequency is included. Therefore, the horizontal scan shown in FIG. If many vertical edges are included, the vertical scan shown in FIG. 13C is selected. The adaptive DCT encoding unit 1403 performs encoding by DCT conversion or scanning according to the scanning order selected for each image block.

ここで、画像ブロックごとにどのスキャン順序を選択したかを示すフラグを映像ストリーム中に格納することにより、復号化装置において画像ブロックごとに異なるスキャン順序を特定することが可能である。 Here, by storing in the video stream a flag indicating which scan order is selected for each image block, it is possible to specify a different scan order for each image block in the decoding apparatus.

特許文献２では、符号化する画像ブロックと時間的・空間的に隣接した画像ブロックのＤＣＴ係数の分布を利用し、スキャン順序を適用的に選択することにより、スキャンしたＤＣＴ係数の後半により多くの０を偏らせ符号化効率を向上させる。 In Patent Document 2, the distribution of DCT coefficients of an image block temporally and spatially adjacent to an image block to be encoded is used, and the scanning order is appropriately selected, so that the latter half of the scanned DCT coefficient is increased. The encoding efficiency is improved by biasing 0.

図１６（ａ）（ｂ）は画像を８×８画素ごとに区切った画像ブロックを表す。図１６（ａ）は符号化する画像ブロック１６０１とその周囲の画像ブロック１６０２を示す。図１６（ｂ）は符号化する画像ブロックの１フレーム前の同位置の画像ブロック１６０３とそれに隣接する画像ブロック１６０４を示す。特許文献２の方法では、これらの画像ブロック１６０２、１６０３、１６０４のＤＣＴ係数の分布を利用する。 FIGS. 16A and 16B show image blocks obtained by dividing an image every 8 × 8 pixels. FIG. 16A shows an image block 1601 to be encoded and an image block 1602 around it. FIG. 16B shows an image block 1603 at the same position one frame before the image block to be encoded and an image block 1604 adjacent thereto. In the method of Patent Document 2, the distribution of DCT coefficients of these image blocks 1602, 1603, and 1604 is used.

図１５は、特許文献２を適用した符号化装置の構成を示す。走査方式選択部１５０４が、符号化する画像ブロックや時間的・空間的に隣接した画像ブロックのＤＣＴ係数を最も短いスキャン長になるよう並べ替えることができるスキャン順序を選択する。 FIG. 15 shows a configuration of an encoding apparatus to which Patent Document 2 is applied. The scanning method selection unit 1504 selects a scan order in which the DCT coefficients of image blocks to be encoded and temporally and spatially adjacent image blocks can be rearranged to have the shortest scan length.

ここで、参照する画像ブロックの選択方法を符号化装置と復号化装置で一致させておけば、画像ブロックごとにどのスキャン順序を選択したかを示すフラグを格納することなく、復号化装置において画像ブロックごとに異なるスキャン順序を特定することが可能である。
特開平１０−１２６７７５号公報特開２００３−２５０１５７号公報 Here, if the selection method of the image block to be referenced is matched between the encoding device and the decoding device, the decoding device can store the image without storing a flag indicating which scan order is selected for each image block. It is possible to specify a different scan order for each block.
JP-A-10-126775 JP 2003-250157 A

しかしながら、特許文献１の方法では、スキャン順序を示すフラグの符号量だけ映像ストリームが増大し、符号化効率が悪くなる。 However, in the method of Patent Document 1, the video stream increases by the code amount of the flag indicating the scan order, and the encoding efficiency deteriorates.

特許文献１の方法を階層符号化の拡張レイヤに適用した場合も同様に、画像ブロックごとのスキャン順序を示すフラグを映像ストリームに格納せねばならず、符号化効率が悪くなる。 Similarly, when the method of Patent Document 1 is applied to an enhancement layer of hierarchical encoding, a flag indicating the scan order for each image block must be stored in the video stream, resulting in poor encoding efficiency.

また、特許文献２の方法は、符号化する画像ブロックと周囲の画像ブロックおよび別の画像の同位置の画像ブロックのＤＣＴ係数の分布が似ているとは限らず、スキャン長が長くなり符号化効率が悪くなる。 In addition, the method of Patent Document 2 does not necessarily mean that the distribution of DCT coefficients of the image block to be encoded, the surrounding image block, and the image block at the same position in another image are similar, and the scan length becomes long. Inefficiency.

また、符号化画像ブロックが動き予測補償符号化などの予測符号化の場合、不具合が生じる。図１６（ｃ）の画像ブロック１６０５は図１６（ａ）の符号化する画像ブロック１６０１の参照画像ブロックを表す。一般的に、予測符号化では画像ブロック１６０１と１６０５の差分が小さくなるように画像ブロック１６０５を選択するので、画像ブロック１６０１のＤＣＴ係数の分布は画像ブロック１６０５に似る。しかしながら、特許文献２では、画像をちょうど８×８画素ごとに区切った位置にある画像ブロックのＤＣＴ係数しか利用することはできず、ちょうど８×８画素ごとに区切った位置にない画像ブロック１６０５の情報は利用できない。画像ブロック１６０２、１６０３、１６０４のＤＣＴ係数の分布が画像ブロック１６０１と似ているとは限らず、符号化効率が向上するとは限らない。 Further, when the encoded image block is predictive encoding such as motion prediction compensation encoding, a problem occurs. An image block 1605 in FIG. 16C represents a reference image block of the image block 1601 to be encoded in FIG. Generally, in the predictive coding, the image block 1605 is selected so that the difference between the image blocks 1601 and 1605 is small, so that the DCT coefficient distribution of the image block 1601 is similar to that of the image block 1605. However, in Patent Document 2, only the DCT coefficient of the image block located at the position where the image is divided every 8 × 8 pixels can be used, and the image block 1605 not located at the position divided every 8 × 8 pixels is used. Information is not available. The DCT coefficient distribution of the image blocks 1602, 1603, and 1604 is not necessarily similar to that of the image block 1601, and the encoding efficiency is not necessarily improved.

特許文献２の方法を階層符号化の拡張レイヤに適用した場合も同様に、拡張レイヤの符号化する画像ブロックの周囲の画像ブロックや時間的に前後するフレームの同位置や周囲の画像ブロックとＤＣＴ係数の分布が似ているとは限らず、符号化効率が向上するとは限らない。 Similarly, when the method of Patent Document 2 is applied to an enhancement layer of hierarchical coding, the image block around the image block to be coded in the enhancement layer, the same position of the frame before and after the time, the surrounding image block, and the DCT The coefficient distribution is not always similar, and the coding efficiency is not necessarily improved.

本発明の映像符号化装置は、符号化する画像の中の幾つかの画素の集合である符号化画像ブロックを符号化して映像ストリームに格納する順序であるスキャン順序を、符号化の際に参照する参照画像ブロックに対するスキャン判定式によって決定するスキャン決定手段と、符号化の際に参照する参照画像を用いて前記スキャン判定式を生成するスキャン判定式生成手段と、前記スキャン判定式を映像ストリームに格納するスキャン判定式格納手段を有する構成をとる。 The video encoding apparatus of the present invention refers to a scan order, which is an order in which an encoded image block, which is a set of several pixels in an image to be encoded, is encoded and stored in a video stream. A scan determination unit that determines a scan determination formula for a reference image block to be generated, a scan determination formula generation unit that generates the scan determination formula using a reference image that is referred to during encoding, and the scan determination formula into a video stream A configuration having a scan judgment expression storing means for storing is adopted.

この構成により、画像ブロックごとにスキャン順序を記述する必要がなく、前記スキャン判定式のみで全ての前記符号化画像ブロックの前記スキャン順序を表現でき、また、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, it is not necessary to describe the scan order for each image block, the scan order of all the encoded image blocks can be expressed only by the scan determination formula, and a short scan length can be used for each encoded image block. The scan order that can be encoded can be given, and the encoding efficiency can be improved.

また、本発明の映像復号化装置は上記の構成において、前記スキャン判定式格納手段が、符号化画像ブロックの符号より前の映像ストリームの位置に前記スキャン判定式の情報を格納することを特徴とする。 Further, the video decoding apparatus according to the present invention is characterized in that, in the above configuration, the scan determination formula storage means stores the information of the scan determination formula at a position of the video stream before the code of the encoded image block. To do.

この構成により、映像ストリームが途中で途切れているときも途中の符号化画像ブロックまでは復号化可能である。 With this configuration, even when the video stream is interrupted, it is possible to decode up to an encoded image block in the middle.

また、本発明の符号化順序決定装置は上記の構成において、前記スキャン判定式生成手段が、前記参照画像の水平方向のエッジと垂直方向のエッジを比較して、水平方向のエッジを垂直方向のエッジより多く含めば水平方向を優先し、垂直方向のエッジを水平方向のエッジより多く含めば垂直方向を優先してスキャン順序を前記スキャン決定手段に決定させる前記スキャン判定式を生成する特長を有する。 In the coding order determining apparatus according to the present invention, in the above configuration, the scan determination formula generation unit compares a horizontal edge and a vertical edge of the reference image, and compares the horizontal edge with the vertical edge. It has a feature of generating the scan determination formula that causes the scan determining means to determine the scan order in preference to the horizontal direction if more than the edges are included, and in priority to the vertical direction if more vertical edges are included than the horizontal edges. .

この構成により、符号化画像ブロックと相関の強い参照画像のエッジの強度を用いて前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, it is possible to generate the scan determination formula using the edge strength of the reference image having a strong correlation with the encoded image block, and to give the scan order that can be encoded with a short scan length for each encoded image block. It is possible to improve the encoding efficiency.

また、本発明の符号化順序決定装置は上記の構成において、前記スキャン判定式生成手段が、前記参照画像のＤＣＴ係数の水平高周波係数と垂直高周波係数を比較して、水平高周波係数を垂直高周波係数より多く含めば水平方向を優先し、垂直高周波係数を水平高周波係数より多く含めば垂直方向を優先してスキャン順序を前記スキャン決定手段に決定させる前記スキャン判定式を生成する特長を有する。 In the coding order determining apparatus according to the present invention, in the above configuration, the scan determination formula generation unit compares the horizontal high-frequency coefficient of the DCT coefficient of the reference image with the vertical high-frequency coefficient, and converts the horizontal high-frequency coefficient into the vertical high-frequency coefficient. If more are included, the horizontal direction is prioritized, and if more vertical high-frequency coefficients are included than the horizontal high-frequency coefficient, the scan determination formula is generated to cause the scan determination unit to determine the scan order with priority to the vertical direction.

この構成により、エッジを表すＤＣＴ係数を用いて前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, it is possible to generate the scan determination formula using DCT coefficients representing edges, and to provide the scan order that can be encoded with a short scan length for each encoded image block, thereby improving encoding efficiency. Is possible.

また、本発明の符号化順序決定装置は上記の構成において、前記スキャン判定式生成手段が、前記参照画像ブロックごとの情報の統計から前記符号化画像ブロックの特徴を予測して前記スキャン判定式を生成する特長を有する。 In the coding order determining apparatus according to the present invention, in the above configuration, the scan determination formula generation unit predicts the characteristics of the encoded image block from the statistics of information for each reference image block, and calculates the scan determination formula. It has the feature to generate.

この構成により、前記参照画像ブロックの情報と前記参照画像全体の統計を用いて前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, it is possible to generate the scan determination formula using the information of the reference image block and the statistics of the entire reference image, and to give the scan order that can be encoded with a short scan length for each encoded image block, It is possible to improve the encoding efficiency.

また、本発明の符号化順序決定装置は上記の構成において、前記スキャン判定式生成手段が、複数の前記符号化画像ブロックごとに前記スキャン判定式を生成する特長を有する。 The coding order determining apparatus according to the present invention is characterized in that, in the above-described configuration, the scan determination expression generation unit generates the scan determination expression for each of the plurality of encoded image blocks.

この構成により、画像のフレームごと、または、画像内の領域ごとの特徴に応じた前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, it is possible to generate the scan determination formula according to the characteristics of each frame of an image or each region in the image, and to give the scan order that can be encoded with a short scan length for each encoded image block. It is possible to improve the encoding efficiency.

また、本発明の符号化順序決定装置は上記の構成において、前記スキャン判定式生成手段が、過去の前記スキャン判定式を用いて前記スキャン判定式に含まれるパラメータの値を変更する特長を有する。 Further, the coding order determining apparatus of the present invention has the feature that, in the above configuration, the scan determination formula generation means changes the value of the parameter included in the scan determination formula using the past scan determination formula.

この構成により、過去の画像や過去の前記参照画像の特長から符号化中の画像の特徴を予測して前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, the scan determination formula is generated by predicting the feature of the image being encoded from the features of the past image and the past reference image, and the scan that can be encoded with a short scan length for each encoded image block An order can be given and encoding efficiency can be improved.

また、本発明の符号化順序決定装置は上記の構成において、前記スキャン判定式生成手段が、前記符号化画像ブロックをスキャンした結果を用いて前記スキャン判定式を生成する特長を有する。 The coding order determining apparatus according to the present invention is characterized in that, in the above-described configuration, the scan determination expression generation unit generates the scan determination expression using a result of scanning the encoded image block.

この構成により、前記スキャン判定式によるスキャン順序での符号化効率を調べて前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, it is possible to check the encoding efficiency in the scan order according to the scan determination formula to generate the scan determination formula, and to give the scan order that can be encoded with a short scan length for each encoded image block. It is possible to improve the efficiency.

また、本発明の符号化順序決定装置は上記の構成において、前記スキャン判定式生成手段が、予め前記スキャン判定式を複数用意しておき切り替える特徴を有する。 Further, the coding order determining apparatus according to the present invention is characterized in that, in the above configuration, the scan determination formula generation means prepares and switches a plurality of the scan determination formulas in advance.

これにより、画像の特長に応じて、よりスキャン長の短くなる前記スキャン判定式に適応的に切り替え、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 Thereby, according to the feature of the image, it is possible to adaptively switch to the scan determination formula that shortens the scan length, and to give the scan order that can be encoded with a short scan length for each encoded image block. Efficiency can be improved.

また、本発明の符号化順序決定装置は上記の構成において、映像を単体で復号化可能な基本レイヤと基本レイヤの復号化画像を画質向上させる拡張レイヤに階層化して符号化する階層符号化において、前記スキャン判定式生成手段が、基本レイヤ復号化画像を前記参照画像として、基本レイヤの前記参照画像ブロックに応じた拡張レイヤの符号化ブロックの前記スキャン順序を決定する前記スキャン判定式を生成することを特徴とする。 The coding order determining apparatus according to the present invention has the above-described configuration, in the hierarchical coding in which the base layer capable of decoding a single video and the decoded image of the base layer are hierarchized into an enhancement layer for improving the image quality. The scan determination formula generation unit generates the scan determination formula for determining the scan order of the encoded blocks of the enhancement layer corresponding to the reference image block of the base layer, using the base layer decoded image as the reference image. It is characterized by that.

この構成により、拡張レイヤと相関の強い基本レイヤの復号化画像を用いて前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, it is possible to generate the scan determination formula using the decoded image of the base layer having a strong correlation with the enhancement layer, and to give the scan order that can be encoded with a short scan length for each encoded image block. It is possible to improve the efficiency.

また、本発明の符号化順序決定装置は上記の構成において、前記スキャン判定式生成手段が、映像の予測符号化において、前記予測符号化で参照する画像を前記参照画像として、前記スキャン判定式を生成する特長を有する。 The coding order determining apparatus according to the present invention has the above-described configuration, wherein the scan determination formula generation unit uses the image referred to in the predictive coding as the reference image in the predictive coding of the video, and the scan determination formula is It has the feature to generate.

この構成により、符号化画像ブロックと相関の強い予測符号化の前記参照画像を用いて前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, the scan determination formula is generated using the reference image of predictive encoding having a strong correlation with the encoded image block, and the scan order that can be encoded with a short scan length is provided for each encoded image block. It is possible to improve the encoding efficiency.

また、本発明の符号化順序決定装置は上記の構成において、前記予測符号化が、画面間予測符号化である特長を有する。 In addition, the coding order determination apparatus according to the present invention is characterized in that, in the above configuration, the predictive coding is inter-screen predictive coding.

この構成により、符号化画像ブロックと相関の強い画面間予測符号化の前記参照画像を用いて前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, the scan determination formula is generated using the reference image of inter-picture predictive coding having a strong correlation with the coded image block, and the scan order that can be coded with a short scan length is provided for each coded image block It is possible to improve the coding efficiency.

また、本発明の符号化順序決定装置は上記の構成において、前記予測符号化が、画面内予測符号化である特長を有する。 In addition, the coding order determination apparatus of the present invention has the feature that, in the above configuration, the predictive coding is intra-screen predictive coding.

この構成により、符号化画像ブロックと相関の強い画面内予測符号化の前記参照画像を用いて前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, the scan determination formula is generated using the reference image of intra prediction encoding having a strong correlation with the encoded image block, and the scan order that can be encoded with a short scan length is provided for each encoded image block It is possible to improve the coding efficiency.

また、本発明の符号化順序決定装置は上記の構成において、前記予測符号化が、前記予測符号化の前記参照画像ブロックの候補のうち、前記符号化画像ブロックの符号量が最も小さくなると前記スキャン判定式が予想する前記参照画像ブロックを予測符号化に用いる特徴と有する。 In the coding order determining apparatus according to the present invention, in the above configuration, the predictive coding is performed when the code amount of the coded image block is the smallest among the reference image block candidates of the predictive coding. The reference image block predicted by the determination formula is used for predictive coding.

この構成により、前記スキャン判定式で決定する符号化画像ブロックの前記スキャン順序が与えるスキャン長が短くなり符号化効率が向上するように、予測符号化の参照画像ブロックを選出することができ、符号化効率を向上することが可能である。 With this configuration, a reference image block for predictive coding can be selected so that the scan length given by the scan order of the coded image block determined by the scan determination formula is shortened and coding efficiency is improved. It is possible to improve the efficiency.

また、本発明の符号化順序決定装置は上記の構成において、前記スキャン判定式生成手段が、前記予測符号化が予測に用いた情報を用いて、前記参照画像ブロックに応じた前記スキャン判定式を生成する特長を有する。 In the coding order determining apparatus according to the present invention, in the configuration described above, the scan determination formula generation unit uses the information used for prediction by the prediction encoding to calculate the scan determination formula corresponding to the reference image block. It has the feature to generate.

この構成により、前記参照画像のより詳しい情報を用いて前記スキャン判定式を生成し、符号化画像ブロックごとに短いスキャン長で符号化できる前記スキャン順序を与えることができ、符号化効率を向上することが可能である。 With this configuration, it is possible to generate the scan determination formula using more detailed information of the reference image and to give the scan order that can be encoded with a short scan length for each encoded image block, thereby improving encoding efficiency. It is possible.

本発明の映像復号化装置は、符号化した画像の中の幾つかの画素の集合である符号化画像ブロックを符号化して映像ストリームに格納した順序であるスキャン順序を、復号化の際に参照する参照画像ブロックに対するスキャン判定式によって決定するスキャン判定手段と、前記スキャン判定式を映像ストリームから取得するスキャン判定式取得手段と、を有する構成をとる。 The video decoding apparatus according to the present invention refers to a scan order, which is an order in which an encoded image block, which is a set of several pixels in an encoded image, is encoded and stored in a video stream. A scan determination unit that determines a scan determination formula for a reference image block to be performed, and a scan determination formula acquisition unit that acquires the scan determination formula from a video stream.

この構成により、画像ブロックごとにスキャン順序を映像ストリームに記述する必要がなく、前記スキャン判定式のみで全ての前記符号化画像ブロックの前記スキャン順序を表現でき、符号化効率の高い映像ストリームを復号化することが可能である。 With this configuration, it is not necessary to describe the scan order for each image block in the video stream, and the scan order of all the encoded image blocks can be expressed only by the scan determination formula, and a video stream with high encoding efficiency is decoded. It is possible to

本発明は、映像の符号化を実行する映像符号化方法であって、映像符号化装置が実行するところの、符号化する画像の中の幾つかの画素の集合である符号化画像ブロックを符号化して映像ストリームに格納する順序であるスキャン順序を、符号化の際に参照する他の参照画像ブロックに対するスキャン判定式によって決定するスキャン決定処理ステップと、符号化の際に参照する参照画像を用いて前記スキャン判定式を生成するスキャン判定式生成処理ステップと、前記スキャン判定式を映像ストリームに格納するスキャン判定式格納処理ステップと、を有する。 The present invention relates to a video coding method for performing video coding, which encodes a coded image block, which is a set of several pixels in an image to be coded, executed by a video coding device. A scan determination processing step for determining a scan order, which is an order to be converted into a video stream and stored in a video stream, using a scan determination formula for other reference image blocks to be referred to at the time of encoding, and a reference image to be referred to at the time of encoding A scan determination expression generation processing step for generating the scan determination expression, and a scan determination expression storage processing step for storing the scan determination expression in a video stream.

本発明は、映像ストリームの復号化を実行する映像復号化方法であって、映像復号化装置が実行するところの、符号化した画像の中の幾つかの画素の集合である符号化画像ブロックを符号化して映像ストリームに格納した順序であるスキャン順序を、復号化の際に参照する参照画像ブロックに対するスキャン判定式によって決定するスキャン決定処理ステップと、前記スキャン判定式を映像ストリームから取得するスキャン判定式取得処理ステップと、を有する構成をとる。 The present invention is a video decoding method for decoding a video stream, and an encoded image block which is a set of several pixels in an encoded image, which is executed by a video decoding device. A scan determination processing step for determining a scan order, which is an order of encoding and storing in a video stream, by a scan determination formula for a reference image block to be referred to at the time of decoding, and a scan determination for acquiring the scan determination formula from the video stream A formula acquisition processing step.

本発明は、映像の符号化を実行するためにコンピュータを、符号化する画像の中の幾つかの画素の集合である符号化画像ブロックを符号化して映像ストリームに格納する順序であるスキャン順序を、符号化の際に参照する他の参照画像ブロックに対するスキャン判定式によって決定するスキャン決定処理ステップと、符号化の際に参照する参照画像を用いて前記スキャン判定式を生成するスキャン判定式生成処理ステップと、前記スキャン判定式を映像ストリームに格納するスキャン判定式格納処理ステップと、として動作させる。 The present invention relates to a scan order, which is an order in which an encoded image block, which is a set of several pixels in an image to be encoded, is encoded and stored in a video stream in order to perform video encoding. A scan determination processing step that determines a scan determination formula for other reference image blocks that are referred to in encoding, and a scan determination formula generation process that generates the scan determination formula using a reference image that is referred to in encoding And a scan determination formula storage processing step for storing the scan determination formula in a video stream.

本発明は、映像ストリームの復号化を実行するためにコンピュータを、前記スキャン判定式を映像ストリームから取得するスキャン判定式取得処理ステップと、前記スキャン判定式と復号化の際に参照する参照画像を用いてスキャン順序を決定するスキャン決定処理ステップと、として動作させる。 The present invention provides a computer for executing decoding of a video stream, a scan determination expression acquisition processing step for acquiring the scan determination expression from a video stream, and a reference image to be referred to when the scan determination expression and decoding are performed. And a scan determination processing step for determining the scan order.

本発明では、映像符号化において符号化効率を向上することが可能である。 In the present invention, it is possible to improve encoding efficiency in video encoding.

以下、本発明の実施の形態について、図面を参照して詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

（実施の形態１）
第１の実施の形態では、映像を基本レイヤと拡張レイヤに階層化して符号化する階層符号化において、拡張レイヤの画像ブロックのＤＣＴ係数のスキャン順序を生成するためのスキャン判定式を基本レイヤの復号化画像から生成し、拡張レイヤの画像ブロックのＤＣＴ係数を前記スキャン判定式により生成したスキャン順序により符号化する。 (Embodiment 1)
In the first embodiment, in hierarchical coding in which video is hierarchized into a base layer and an enhancement layer and encoded, a scan determination formula for generating the scan order of the DCT coefficients of the image blocks of the enhancement layer is used for the base layer. It is generated from the decoded image, and the DCT coefficients of the enhancement layer image block are encoded by the scan order generated by the scan determination formula.

本実施の形態は、映像すなわち連続する画像の符号化に関して説明するが、静止画像の符号化に用いても構わない。 Although this embodiment will be described with respect to encoding of video, that is, continuous images, it may be used for encoding still images.

図１は、本発明の実施の形態１に関わる映像符号化方法を適用した映像符号化装置１００の構成を表す。 FIG. 1 shows a configuration of a video encoding apparatus 100 to which a video encoding method according to Embodiment 1 of the present invention is applied.

図１において、映像符号化装置１００は、映像信号入力部１０１、基本レイヤ符号化部１０２、基本レイヤ出力部１０３、拡張レイヤ符号化部１１０を有する。拡張レイヤ符号化部１１０は、差分部１１１、ＤＣＴ部１１２、スキャン部１１３、可変長符号化部１１４、拡張レイヤ出力部１１５、スキャン判定式生成部１１６、スキャン決定部１１７を有する。 In FIG. 1, the video encoding device 100 includes a video signal input unit 101, a base layer encoding unit 102, a base layer output unit 103, and an enhancement layer encoding unit 110. The enhancement layer coding unit 110 includes a difference unit 111, a DCT unit 112, a scanning unit 113, a variable length coding unit 114, an enhancement layer output unit 115, a scan determination formula generation unit 116, and a scan determination unit 117.

映像信号入力部１０１は、映像符号化装置１１０の外部から映像を１フレームずつ原画像として入力し、基本レイヤ符号化部１０２と拡張レイヤ符号化部１１０に出力する。また、映像符号化装置１１０の外部から入力される映像の有無を判定し、映像の入力がなければ処理を終了する。 The video signal input unit 101 inputs a video as an original image frame by frame from the outside of the video encoding device 110 and outputs it to the base layer encoding unit 102 and the enhancement layer encoding unit 110. Also, the presence / absence of a video input from the outside of the video encoding device 110 is determined. If there is no video input, the processing is terminated.

基本レイヤ符号化部１０２は、映像信号入力部１０１から入力した原画像を符号化して基本レイヤストリームを生成し、基本レイヤ出力部１１６に出力する。また、基本レイヤストリームを復号化して基本レイヤ復号化画像を生成し、拡張レイヤ符号化部１１０に出力する。 The base layer encoding unit 102 encodes the original image input from the video signal input unit 101 to generate a base layer stream, and outputs the base layer stream to the base layer output unit 116. Also, the base layer stream is decoded to generate a base layer decoded image, which is output to the enhancement layer encoding unit 110.

基本レイヤ出力部１０３は、基本レイヤ符号化部１０２から入力した基本レイヤストリームを映像符号化装置１００の外部に出力する。 Base layer output section 103 outputs the base layer stream input from base layer encoding section 102 to the outside of video encoding apparatus 100.

差分部１１１は、映像信号入力部１０１から入力した原画像と基本レイヤ符号化部１０２から入力した基本レイヤ復号化画像の差分をとって差分画像を生成し、ＤＣＴ部１１２に出力する。 The difference unit 111 generates a difference image by taking the difference between the original image input from the video signal input unit 101 and the base layer decoded image input from the base layer encoding unit 102 and outputs the difference image to the DCT unit 112.

なお、基本レイヤ復号化画像と原画像の解像度が異なる場合は、基本レイヤ復号化画像を原画像と同じ解像度に変更する。 When the resolution of the base layer decoded image and the original image are different, the base layer decoded image is changed to the same resolution as the original image.

ＤＣＴ部１１２は、差分部１１１から入力した差分画像を８×８画素ごとの画像ブロックに分割し、画像ブロックごとにＤＣＴ変換を施してＤＣＴ係数を生成し、スキャン部１１３に出力する。 The DCT unit 112 divides the difference image input from the difference unit 111 into image blocks of 8 × 8 pixels, performs DCT conversion for each image block, generates a DCT coefficient, and outputs the DCT coefficient to the scan unit 113.

ここで、本実施の形態では説明の簡単のため画像ブロックを８×８画素とするが、特に画像ブロックの大きさに限定はない。 In this embodiment, the image block is 8 × 8 pixels for simplicity of explanation, but the size of the image block is not particularly limited.

スキャン部１１３は、ＤＣＴ部１１２から入力したＤＣＴ係数に対してスキャン決定部１１７から入力したスキャン順序を用いて並べ替えてスキャン済ＤＣＴ係数を生成し、可変長符号化部１１４に出力する。 The scan unit 113 rearranges the DCT coefficient input from the DCT unit 112 using the scan order input from the scan determination unit 117 to generate a scanned DCT coefficient, and outputs the generated DCT coefficient to the variable length encoding unit 114.

可変長符号化部１１４は、スキャン部１１３から入力したスキャン済ＤＣＴ係数を可変長符号化して拡張レイヤストリームを生成し、拡張レイヤ出力部１１５に出力する。 The variable length coding unit 114 performs variable length coding on the scanned DCT coefficient input from the scanning unit 113 to generate an enhancement layer stream, and outputs the enhancement layer stream to the enhancement layer output unit 115.

ここで、可変長符号化部１１４ではＩＳＯ／ＩＥＣ１４４９６−１０に記載のＭＰＥＧ−４ＡＶＣ方式のような量子化を用いた符号化でもよく、ＩＳＯ／ＩＥＣ１４４９６−２Ａｍｅｎｄｍｅｎｔ２に記載のＭＰＥＧ−４ＦＧＳのようなビットプレーンを用いた符号化でもよい。本発明は、ＤＣＴ係数をスキャンによって並べ替えスキャン順序の後半に０を偏らせてＥｏＢを用いて符号化する方法をとるあらゆる符号化方式に適用可能である。 Here, the variable length encoding unit 114 may be encoding using quantization such as the MPEG-4 AVC method described in ISO / IEC 14496-10, or MPEG-4 described in ISO / IEC 14496-2 Amendment 2. Encoding using a bit plane such as FGS may be used. The present invention can be applied to any coding scheme in which DCT coefficients are reordered by scanning and coded using EoB with 0 being biased to the second half of the scanning order.

なお、量子化を行う符号化の場合、可変長符号化部１１４が、可変長符号化を行う前に量子化の処理を行う。 In the case of encoding that performs quantization, the variable length encoding unit 114 performs quantization processing before performing variable length encoding.

また、可変長符号化の方法は、ハフマン符号化や算術符号化など、その方法を問わない。 The variable length coding method may be any method such as Huffman coding or arithmetic coding.

拡張レイヤ出力部１１５は、可変長符号化部１１４から入力した拡張レイヤストリームにスキャン判定式生成部１１６から入力したスキャン判定式を格納し、映像符号化装置１００の外部に出力する。 The enhancement layer output unit 115 stores the scan determination formula input from the scan determination formula generation unit 116 in the enhancement layer stream input from the variable length encoding unit 114, and outputs the stored scan determination formula to the outside of the video encoding device 100.

スキャン判定式の格納については後述する。 The storage of the scan determination formula will be described later.

スキャン判定式生成部１１６は、基本レイヤ符号化部１０２から入力した基本レイヤ復号化画像を用いてスキャン判定式を生成し、拡張レイヤ出力部１１５とスキャン決定部１１７に出力する。 The scan determination formula generation unit 116 generates a scan determination formula using the base layer decoded image input from the base layer encoding unit 102 and outputs the scan determination formula to the enhancement layer output unit 115 and the scan determination unit 117.

なお、本実施の形態では１フレームの拡張レイヤストリームごとにスキャン判定式を格納するものとする。これに基づきスキャン順序を生成することにより、１フレームごとの画像特性に適用した短いスキャン長とすることが可能である。しかしながら、スキャン判定式の格納の頻度は特に限定しない。 In the present embodiment, it is assumed that a scan determination formula is stored for each enhancement layer stream of one frame. By generating the scan order based on this, a short scan length applied to the image characteristics for each frame can be obtained. However, the frequency of storing the scan determination formula is not particularly limited.

スキャン判定式の生成方法については後述する。 A method for generating the scan determination formula will be described later.

スキャン決定部１１７は、基本レイヤ復号化部１０２から入力した基本レイヤ復号化画像とスキャン判定式生成部１１６から入力したスキャン判定式を用いて画像ブロックごとのスキャン順序を生成し、スキャン部１１３に出力する。 The scan determination unit 117 generates a scan order for each image block using the base layer decoded image input from the base layer decoding unit 102 and the scan determination formula input from the scan determination formula generation unit 116, and sends it to the scan unit 113. Output.

なお、スキャン判定式生成部１１６が本発明のスキャン判定式生成手段に相当し、スキャン決定部１１７がスキャン決定手段に相当し、拡張レイヤ出力部１１５がスキャン判定式格納手段に総合する。 The scan determination formula generation unit 116 corresponds to the scan determination formula generation unit of the present invention, the scan determination unit 117 corresponds to the scan determination unit, and the enhancement layer output unit 115 is integrated into the scan determination formula storage unit.

以下に、スキャン判定式生成部１１６が行うスキャン判定式生成処理を詳しく説明する。 The scan determination formula generation process performed by the scan determination formula generation unit 116 will be described in detail below.

拡張レイヤ符号化部１１０では、基本レイヤ復号化画像では表現しきれない原画像の詳細情報を符号化する。差分画像では絶対値の大きい画素が物体の輪郭周辺に集中しており、差分画像は基本レイヤ復号化画像の輪郭を精細化することを示している。また、基本レイヤ復号化画像の強いエッジは物体の輪郭周辺に集中しており、差分画像と基本レイヤ復号化画像のエッジは強い相関があることが分かる。このことを利用して本実施の形態では、エッジの特性からスキャン順序を生成する方法を生成する。 The enhancement layer encoding unit 110 encodes detailed information of the original image that cannot be expressed in the base layer decoded image. In the difference image, pixels having a large absolute value are concentrated around the contour of the object, and the difference image indicates that the contour of the base layer decoded image is refined. In addition, it can be seen that strong edges of the base layer decoded image are concentrated around the contour of the object, and the difference image and the edges of the base layer decoded image have a strong correlation. In this embodiment, a method for generating the scan order from the edge characteristics is generated by utilizing this fact.

拡張レイヤの符号化画像ブロックに対応する、基本レイヤ復号画像の同位置の画像ブロックを参照画像ブロックとする。 The image block at the same position of the base layer decoded image corresponding to the encoded image block of the enhancement layer is set as a reference image block.

ＤＣＴ係数は周波数成分の強度を表しており、強いエッジを含んだ画像ブロックをＤＣＴ変換すると高周波に絶対値の大きいＤＣＴ係数が生じる。また、エッジの種類によってもＤＣＴ係数の分布は変化し、水平方向のエッジを多く含んだ符号化画像ブロックは水平高周波ＤＣＴ係数に大きい絶対値が偏り、垂直方向のエッジを多く含んだ符号化画像ブロックは垂直高周波ＤＣＴ係数に大きい絶対値が偏る。 The DCT coefficient represents the intensity of the frequency component. When an image block including a strong edge is subjected to DCT conversion, a DCT coefficient having a large absolute value is generated at a high frequency. Also, the distribution of DCT coefficients varies depending on the type of edge, and a coded image block including many horizontal edges has a large absolute value in the horizontal high-frequency DCT coefficient, and a coded image including many vertical edges. The block has a large absolute value biased to the vertical high frequency DCT coefficient.

よって、例えば、水平方向のエッジを多く含む参照画像ブロックに対応する符号化画像ブロックには水平方向を優先した図１３（ｂ）の水平スキャンを、垂直方向のエッジを多く含む参照画像ブロックに対応する符号化画像ブロックには垂直方向を優先した図１３（ｃ）の垂直スキャンを選択することにより、ジグザグスキャンを適用した場合より短いスキャン長となり符号化効率が向上する。水平方向および垂直方向のどちらのエッジの方が多いともいえない場合は、図１３（ａ）のジグザグスキャンを選択する。 Therefore, for example, for the encoded image block corresponding to the reference image block including many horizontal edges, the horizontal scan in FIG. 13B giving priority to the horizontal direction corresponds to the reference image block including many vertical edges. By selecting the vertical scan of FIG. 13C giving priority to the vertical direction for the encoded image block to be performed, the scan length becomes shorter than when the zigzag scan is applied, and the encoding efficiency is improved. If it cannot be said that there are more edges in either the horizontal direction or the vertical direction, the zigzag scan in FIG. 13A is selected.

なお、説明の簡単のためスキャン順序は図１３（ａ）（ｂ）（ｃ）の何れかを選択するものとするがこれに限らない。 For the sake of simplicity, it is assumed that one of FIGS. 13A, 13B, and 13C is selected as the scan order, but the present invention is not limited to this.

スキャン判定式生成部１１６は、参照画像ブロックのエッジから符号化画像ブロックを効率良く符号化できるスキャン順序を与えるスキャン判定式を生成する。スキャン判定式の決定いかんによっては、符号化画像ブロックを効率悪く符号化するスキャン順序を与えてしまう。参照画像ブロックから例えば（数式１）〜（数式４）のスキャン判定式を生成する。 The scan determination formula generation unit 116 generates a scan determination formula that gives a scan order in which the encoded image block can be efficiently encoded from the edge of the reference image block. Depending on the determination of the scan determination formula, a scan order for encoding the encoded image block inefficiently is given. For example, the scan determination formulas of (Formula 1) to (Formula 4) are generated from the reference image block.

スキャン判定式の生成方法の前に、スキャン判定式を用いたスキャン順序の生成について説明する。 Prior to the method for generating the scan determination formula, generation of the scan order using the scan determination formula will be described.

スキャン決定部１１８は、（数式１）と（数式２）により参照画像ブロックのエッジの強度を計算し、参照画像ブロックに対する（数式３）と（数式４）の判定式により符号化画像ブロックのスキャン順序を選択する。（数式３）の判定式が真となれば水平スキャンを選択し、（数式４）の判定式が真となれば垂直スキャンを選択し、どちらも偽であるならばジグザグスキャンを選択する。 The scan determination unit 118 calculates the strength of the edge of the reference image block using (Equation 1) and (Equation 2), and scans the encoded image block using the determination equations (Equation 3) and (Equation 4) for the reference image block. Select the order. If the determination formula (Equation 3) is true, the horizontal scan is selected, if the determination equation (Equation 4) is true, the vertical scan is selected, and if both are false, the zigzag scan is selected.

（数式１）は参照画像ブロックの水平方向のエッジの計算式である。Ｅｄｇｅはエッジの強度を現す値を表し、Ｅｄｇｅの左上の添え字Ｈは水平方向のエッジであることを現し、Ｅｄｇｅの右下の添え字ｉは画像ブロックの通し番号を表す。また、ｐは画素の持つ値を表し、ｘとｙは画像ブロック内での画素の座標を示す。λは０より大きい値である。よって（数式１）は、横隣の画素との差分絶対値のλ乗を８×８画像ブロック内で合計した値の計算式を表す。 (Formula 1) is a formula for calculating the horizontal edge of the reference image block. Edge represents a value representing the strength of the edge, the subscript H at the upper left of Edge represents the edge in the horizontal direction, and the subscript i at the lower right of Edge represents the serial number of the image block. Further, p represents a value of the pixel, and x and y represent the coordinates of the pixel in the image block. λ is a value greater than zero. Therefore, (Formula 1) represents a calculation formula of a value obtained by summing up the λ power of the absolute difference value between the adjacent pixels in the 8 × 8 image block.

（数式２）は参照画像ブロックの垂直方向のエッジの計算式である。Ｅｄｇｅの左上の添え字Ｖは垂直方向のエッジであることを現す。よって（数式２）は（数式１）と同様に、縦隣の画素との差分絶対値のλ乗を８×８画像ブロック内で合計した値の計算式を表す。 (Formula 2) is a formula for calculating the vertical edge of the reference image block. The subscript V at the upper left of Edge represents a vertical edge. Therefore, (Formula 2) represents a formula for calculating the sum of the λ power of the absolute value of the difference from the vertically adjacent pixels in the 8 × 8 image block, as in (Formula 1).

（数式３）は符号化画像ブロックに対して、水平スキャンを選択する場合の判定式である。α＿１、α＿２は１以上の適当な値をとる。上線を引いたＥｄｇｅ＿Ｈは、基本レイヤ復号化画像の水平方向のエッジの平均値を現す。よって（数式３）は、参照画像ブロックにおいて、水平方向のエッジの強度が垂直方向のエッジの強度をある程度上回り、かつ、平均の水平方向のエッジの強度を上回ることを表す。 (Formula 3) is a determination formula in the case of selecting the horizontal scan for the encoded image block. α_1 and α_2 take appropriate values of 1 or more. Edge_H with an overline represents an average value of edges in the horizontal direction of the base layer decoded image. Therefore, (Equation 3) represents that in the reference image block, the strength of the edge in the horizontal direction exceeds the strength of the edge in the vertical direction to some extent and exceeds the strength of the edge in the horizontal direction in average.

（数式４）は符号化画像ブロックに対して、垂直スキャンを選択する場合の判定式である。β＿１、β＿２は１以上の適当な値をとる。上線を引いたＥｄｇｅ＿Ｖは、基本レイヤ復号化画像の垂直方向のエッジの全画像ブロックの平均値を現す。よって（数式３）は、参照画像ブロックにおいて、垂直方向のエッジの強度が垂直方向のエッジの強度をある程度上回り、かつ、平均の垂直方向のエッジの強度を上回ることを表す。 (Formula 4) is a determination formula in the case of selecting the vertical scan for the encoded image block. β_1 and β_2 take appropriate values of 1 or more. Edge_V with an overline represents the average value of all image blocks at the edge in the vertical direction of the base layer decoded image. Therefore, (Equation 3) represents that, in the reference image block, the strength of the vertical edge exceeds the strength of the vertical edge to some extent and exceeds the average vertical edge strength.

次に、スキャン判定式の生成方法について説明する。説明の簡単のため、（数式１）と（数式２）のλは固定値、例えば、２であるとする。 Next, a method for generating a scan determination formula will be described. For simplicity of explanation, it is assumed that λ in (Expression 1) and (Expression 2) is a fixed value, for example, 2.

スキャン判定式生成部１１６は、（数式１）と（数式２）で計算するエッジの強度を全参照画像ブロックについて計算する。そして、平均や分散などの統計値を求め、（数式３）と（数式４）の中のα＿１、α＿２、β＿１、β＿２の値を決定する。エッジの強度の統計値からα＿１、α＿２、β＿１、β＿２の値を決定する方法は、予め実験によって得た対応表などを用いる。 The scan determination formula generation unit 116 calculates the edge strength calculated by (Formula 1) and (Formula 2) for all reference image blocks. Then, statistical values such as average and variance are obtained, and the values of α_1, α_2, β_1, and β_2 in (Equation 3) and (Equation 4) are determined. As a method for determining the values of α_1, α_2, β_1, and β_2 from the statistical values of the edge strength, a correspondence table or the like obtained in advance through experiments is used.

例えば、α＿１は水平のエッジの強度の平均値を垂直のエッジの強度の平均値で除した数値の２倍と決定する。水平のエッジの強度の平均値が１０００以下であるならばα＿２は２で、そうでないならα＿２は３と決定する。β＿１とβ＿２に関しても同様に決定する。 For example, α_1 is determined to be twice the value obtained by dividing the average value of the horizontal edge intensity by the average value of the vertical edge intensity. If the average value of the horizontal edge intensity is 1000 or less, α_2 is determined to be 2, and if not, α_2 is determined to be 3. β_1 and β_2 are similarly determined.

上記のように決定したスキャン判定式の情報α＿１、α＿２、β＿１、β＿２をスキャン決定部１１７に出力し、拡張レイヤ出力部１１５が拡張レイヤストリームに格納する。 The information α_1, α_2, β_1, and β_2 of the scan determination formula determined as described above is output to the scan determination unit 117, and the enhancement layer output unit 115 stores the information in the enhancement layer stream.

図２は、拡張レイヤストリームの構造の例を示す。拡張レイヤストリームの先頭部分の２０１は、この映像ストリームが拡張レイヤの情報を表すことを示すフラグや、画像の解像度などの情報を含むヘッダである。２０３の部分は、可変長符号化部１１４で生成した画像ブロックの符号である。スキャン判定式の情報は２０２の部分に格納する。例えば、α＿１、α＿２、β＿１、β＿２は１〜１６までの値だけを取るものとし、それら値を４ビットで表現する。００００は１を表し、１１１１が１６を表す。 FIG. 2 shows an example of the structure of the enhancement layer stream. A head portion 201 of the enhancement layer stream is a header including a flag indicating that this video stream represents enhancement layer information and information such as image resolution. A portion 203 is a code of the image block generated by the variable length encoding unit 114. The information of the scan determination formula is stored in the portion 202. For example, α_1, α_2, β_1, and β_2 only take values from 1 to 16, and these values are expressed by 4 bits. 0000 represents 1 and 1111 represents 16.

スキャン判定式を格納する映像ストリームの中の位置は、画像ブロックの符号よりも前とする。復号化の処理の際にスキャン判定式がなければ画像ブロックのスキャン順序が特定できず、もし画像ブロックの後にスキャン判定式があるならば映像ストリームが途中で途切れたとき全画像ブロックを復号化することができない。スキャン判定式が画像ブロックの符号の前に格納されているならば、映像ストリームが途中で途切れているときも途中の画像ブロックまでは復号化できる。 The position in the video stream in which the scan determination formula is stored is assumed to be before the code of the image block. If there is no scan judgment formula in the decoding process, the scan order of the image blocks cannot be specified. If there is a scan judgment formula after the image block, all the image blocks are decoded when the video stream is interrupted in the middle. I can't. If the scan determination formula is stored before the code of the image block, even when the video stream is interrupted in the middle, the image block in the middle can be decoded.

本発明の特徴は、スキャン生成に必要な計算式やパラメータを映像ストリームに格納して、符号化と復号化の処理で画像ブロックのスキャン順序を一致させることであり、スキャン判定式の決定方法は重要ではなく上記の方法以外でも良い。 A feature of the present invention is that calculation formulas and parameters necessary for scan generation are stored in a video stream, and the scan order of image blocks is matched in the encoding and decoding processes. It is not important and may be other than the above method.

なお、エッジの強度の計算式とスキャン判定式の条件は複数用意しておくことも可能で、その場合、どのエッジの強度の計算式とスキャン判定式の条件を用いたかを示すパラメータを拡張レイヤストリームに格納する。 Note that it is possible to prepare multiple conditions for edge strength calculation formulas and scan judgment formulas, in which case the parameters indicating which edge strength formulas and scan judgment formulas are used are extended layers. Store in stream.

これにより、画像の特長に応じて、よりスキャン長の短くなるスキャン判定式に適応的に切り替えることが可能である。画像の特長に応じた切り替え方法は、予め実験により求めておいても良い。例えば、基本レイヤストリームの符号量に応じて切り替えるなどである。 Accordingly, it is possible to adaptively switch to a scan determination formula that shortens the scan length according to the feature of the image. A switching method according to the feature of the image may be obtained in advance by experiments. For example, switching according to the code amount of the base layer stream.

λの値は固定値でなくても良い。例えば、あるフレームではλの値を２に決定して符号化を行い、そのフレームにおいて水平スキャンまたは垂直スキャンを選択した数が全画像ブロック数の１０％以下であれば次のフレームではλを３に決定し直し、そうでなければ２のままとする。 The value of λ may not be a fixed value. For example, in a certain frame, the value of λ is determined to be 2 and encoding is performed. If the number of horizontal scans or vertical scans selected in the frame is 10% or less of the total number of image blocks, λ is set to 3 in the next frame. If not, leave it at 2 otherwise.

これにより、符号化する画像に適したエッジの強度の計算式を選ぶことにより、効率良く符号化できるスキャン順序を与えるスキャン判定式を生成する。 Thus, by selecting an edge strength calculation formula suitable for the image to be encoded, a scan determination formula that gives a scan order that can be encoded efficiently is generated.

エッジの強度の計算式は（数式１）と（数式２）に限らない。 The formula for calculating the edge strength is not limited to (Formula 1) and (Formula 2).

例えば、参照画像ブロックの水平方向のエッジの強度を、水平方向の８画素の分散を垂直方向に８列合計した値としても良い。また、垂直方向のエッジの強度も同様である。 For example, the intensity of the horizontal edge of the reference image block may be a value obtained by totaling 8 columns in the vertical direction for the dispersion of 8 pixels in the horizontal direction. The vertical edge strength is also the same.

例えば、参照画像ブロックをＤＣＴ変換して図３に示すようなＤＣＴ係数を生成し、水平方向のエッジの強度は破線で囲まれる水平高周波ＤＣＴ係数３０１の絶対和、垂直方向のエッジの強度は破線で囲まれる垂直高周波ＤＣＴ係数３０２の絶対和で計算しても良い。 For example, DCT conversion is performed on the reference image block to generate DCT coefficients as shown in FIG. 3, and the horizontal edge strength is the absolute sum of horizontal high-frequency DCT coefficients 301 surrounded by a broken line, and the vertical edge strength is a broken line. The absolute sum of the vertical high-frequency DCT coefficients 302 surrounded by

これにより、参照画像ブロックのＤＣＴ係数を既に計算している場合、エッジの強度の算出の処理負荷が軽減する。 Thereby, when the DCT coefficient of the reference image block has already been calculated, the processing load for calculating the edge strength is reduced.

スキャン生成の判定式は（数式３）と（数式４）に限らない。 The determination formula for scan generation is not limited to (Equation 3) and (Equation 4).

例えば、（数式３）に垂直方向のエッジの強度が平均値以下であることや、（数式４）に水平方向のエッジの強度が平均値以下であることを条件に追加しても良い。また、（数式３）や（数式４）に閾値での判定条件を追加しても良い。 For example, (Equation 3) may be added on condition that the vertical edge strength is equal to or lower than the average value, and (Equation 4) is added on the condition that the horizontal edge strength is equal to or lower than the average value. Further, a determination condition with a threshold value may be added to (Expression 3) or (Expression 4).

本発明の骨子は、スキャン生成に必要な計算式やパラメータを示すスキャン判定式を映像ストリームに格納して、符号化と復号化の処理で画像ブロックのスキャン順序を一致させることであり、スキャン判定式そのものは上記の方法以外でも良い。 The gist of the present invention is to store a scan judgment formula indicating calculation formulas and parameters necessary for scan generation in a video stream, and to match the scan order of image blocks in encoding and decoding processing. The expression itself may be other than the above method.

スキャン判定式生成部１１６は、ＤＣＴ部１１２から符号化画像ブロックのＤＣＴ係数を入力しても良い。 The scan determination formula generation unit 116 may input the DCT coefficient of the encoded image block from the DCT unit 112.

これにより、例えば、幾つかのα＿１、α＿２、β＿１、β＿２、λの組合せで得られるスキャン順序で全符号化画像ブロックのＤＣＴ係数を実際にスキャンして、最短となるスキャン順序を調べることにより、符号化効率の良いα＿１、α＿２、β＿１、β＿２、λを決定することが可能である。 Thereby, for example, by actually scanning the DCT coefficients of all the coded image blocks in the scan order obtained by a combination of several α_1, α_2, β_1, β_2, λ, and examining the shortest scan order, It is possible to determine α_1, α_2, β_1, β_2, and λ with good coding efficiency.

スキャン判定式生成部１１６は、スキャン判定式を生成する際に生成した中間情報をスキャン決定部１１７に出力しても良い。 The scan determination formula generation unit 116 may output the intermediate information generated when generating the scan determination formula to the scan determination unit 117.

これにより、スキャン判定式の生成とスキャン順序の選択とで重複する処理を省くことが可能である。 As a result, it is possible to omit duplicate processing between the generation of the scan determination formula and the selection of the scan order.

次に、以上のように構成された映像符号化装置１００の動作を説明する。図４は図１に示す第１の実施の形態の映像符号化装置１００の動作の一例を示すフローチャートである。なお、図４に示すフローチャートは、図示しない記憶装置（例えばＲＯＭやフラッシュメモリなど）に格納された制御プログラムを、同じく図示しないＣＰＵが実行することにより、プログラムの実行によりソフトウエア的に実行されるようにすることも可能である。 Next, the operation of the video encoding apparatus 100 configured as described above will be described. FIG. 4 is a flowchart showing an example of the operation of the video encoding apparatus 100 according to the first embodiment shown in FIG. The flowchart shown in FIG. 4 is executed by software by executing a control program stored in a storage device (not shown) such as a ROM or a flash memory by a CPU (not shown). It is also possible to do so.

まず、ステップＳ４０１において、映像信号入力部１０１が、映像符号化装置１１０の外部から映像を１フレームずつ原画像として入力し、基本レイヤ符号化部１０２と拡張レイヤ符号化部１１０に出力する。 First, in step S 401, the video signal input unit 101 inputs a video as an original image frame by frame from the outside of the video encoding device 110, and outputs it to the base layer encoding unit 102 and the enhancement layer encoding unit 110.

次に、ステップＳ４０２において、基本レイヤ符号化部１０２が、映像信号入力部１０１から入力した原画像を符号化して基本レイヤストリームを生成し、基本レイヤ出力部１１６に出力する。また、基本レイヤストリームを復号化して基本レイヤ復号化画像を生成し、拡張レイヤ符号化部１１０に出力する。 Next, in step S 402, the base layer encoding unit 102 encodes the original image input from the video signal input unit 101 to generate a base layer stream, and outputs the base layer stream to the base layer output unit 116. Also, the base layer stream is decoded to generate a base layer decoded image, which is output to the enhancement layer encoding unit 110.

次に、ステップＳ４０３において、差分部１１１が、映像信号入力部１０１から入力した原画像と基本レイヤ符号化部１０２から入力した基本レイヤ復号化画像の差分をとって差分画像を生成し、ＤＣＴ部１１２に出力する。 Next, in step S403, the difference unit 111 generates a difference image by taking the difference between the original image input from the video signal input unit 101 and the base layer decoded image input from the base layer encoding unit 102, and a DCT unit. To 112.

次に、ステップＳ４０４において、ＤＣＴ部１１２が、差分部１１１から入力した差分画像を８×８画素ごとの画像ブロックに分割し、画像ブロックごとにＤＣＴ変換を施してＤＣＴ係数を生成し、スキャン部１１３に出力する。 Next, in step S404, the DCT unit 112 divides the difference image input from the difference unit 111 into image blocks of 8 × 8 pixels, performs DCT conversion for each image block, generates a DCT coefficient, and scan unit It outputs to 113.

次に、ステップＳ４０５において、スキャン判定式生成部１１６が、基本レイヤ符号化部１０２から入力した基本レイヤ復号化画像を用いてスキャン判定式を生成し、拡張レイヤ出力部１１５とスキャン決定部１１７に出力する。 Next, in step S 405, the scan determination formula generation unit 116 generates a scan determination formula using the base layer decoded image input from the base layer encoding unit 102, and sends it to the enhancement layer output unit 115 and the scan determination unit 117. Output.

次に、ステップＳ４０６において、スキャン決定部１１７が、基本レイヤ復号化部１０２から入力した基本レイヤ復号化画像とスキャン判定式生成部１１６から入力したスキャン判定式を用いて画像ブロックごとのスキャン順序を生成し、スキャン部１１３に出力する。 Next, in step S 406, the scan determination unit 117 uses the base layer decoded image input from the base layer decoding unit 102 and the scan determination formula input from the scan determination formula generation unit 116 to set the scan order for each image block. Generate and output to the scan unit 113.

次に、ステップＳ４０７において、スキャン部１１３が、ＤＣＴ部１１２から入力したＤＣＴ係数をスキャン決定部１１７から入力したスキャン順序を用いて並べ替えてスキャン済ＤＣＴ係数を生成し、可変長符号化部１１４に出力する。 In step S407, the scan unit 113 rearranges the DCT coefficients input from the DCT unit 112 using the scan order input from the scan determination unit 117 to generate scanned DCT coefficients, and the variable length encoding unit 114 Output to.

次に、ステップＳ４０８において、可変長符号化部１１４が、スキャン部１１３から入力したスキャン済ＤＣＴ係数を可変長符号化して拡張レイヤストリームを生成し、拡張レイヤ出力部１１５に出力する。 Next, in step S 408, the variable length coding unit 114 performs variable length coding on the scanned DCT coefficient input from the scanning unit 113 to generate an enhancement layer stream, and outputs the enhancement layer stream to the enhancement layer output unit 115.

次に、ステップＳ４０９において、拡張レイヤ出力部１１５が、可変長符号化部１１４から入力した拡張レイヤストリームにスキャン判定式生成部１１６から入力したスキャン判定式を格納し、映像符号化装置１００の外部に出力する。また、基本レイヤ出力部１０３が、基本レイヤ符号化部１０２から入力した基本レイヤストリームを映像符号化装置１００の外部に出力する。 Next, in step S409, the enhancement layer output unit 115 stores the scan judgment formula input from the scan judgment formula generation unit 116 in the enhancement layer stream input from the variable length coding unit 114, and the video coding apparatus 100 externally. Output to. Also, the base layer output unit 103 outputs the base layer stream input from the base layer encoding unit 102 to the outside of the video encoding device 100.

最後に、ステップＳ４１０において、映像信号入力部１０１が、映像符号化装置１１０の外部から入力される映像の有無を判定し、映像の入力がなければ処理を終了する。そうでなければ、ステップＳ４０１に戻る。 Finally, in step S410, the video signal input unit 101 determines the presence / absence of a video input from the outside of the video encoding device 110. If there is no video input, the processing ends. Otherwise, the process returns to step S401.

なお、ステップＳ４０５が本発明のスキャン判定式生成処理ステップに相当し、ステップＳ４０６がスキャン決定処理ステップに相当し、ステップＳ４０９がスキャン判定式格納処理ステップに相当する。 Note that step S405 corresponds to the scan determination formula generation processing step of the present invention, step S406 corresponds to the scan determination processing step, and step S409 corresponds to the scan determination formula storage processing step.

以上のように、本実施の形態１によれば、映像符号化装置１００は、スキャン判定式生成部１１６が差分画像と相関の高い基本レイヤ復号化画像を用いて、基本レイヤ復号化画像ブロックのエッジから符号化画像ブロックのスキャン順序を特定するスキャン判定式を生成し、スキャン決定部１１７が基本レイヤ復号化画像に対してスキャン判定式を用いることにより、画像ブロックごとにスキャン順序の情報を映像ストリームに格納することなく、拡張レイヤの画像ブロックに対してスキャン長が短いスキャン順序を特定することができ、階層符号化の符号化効率を向上することが可能である。 As described above, according to the first embodiment, the video encoding apparatus 100 uses the base layer decoded image whose scan determination formula generation unit 116 has a high correlation with the difference image to generate the base layer decoded image block. A scan determination formula for specifying the scan order of the encoded image block is generated from the edge, and the scan determination unit 117 uses the scan determination formula for the base layer decoded image, so that the information on the scan order is displayed for each image block. It is possible to specify a scan order having a short scan length for an enhancement layer image block without storing it in a stream, and to improve the coding efficiency of hierarchical coding.

また、本実施の形態１によれば、映像符号化装置１００は、拡張レイヤ符号化部１１５がスキャン判定式を映像ストリームに格納することにより、画像ブロックごとのスキャン順序の情報を映像ストリームに格納することなく、スキャン判定式のみで全符号化画像ブロックのスキャン順序を表現することができ、符号化効率を向上することが可能である。 Further, according to the first embodiment, the video encoding device 100 stores the scan order information for each image block in the video stream by the enhancement layer encoding unit 115 storing the scan determination formula in the video stream. Without this, the scan order of all the encoded image blocks can be expressed only by the scan determination formula, and the encoding efficiency can be improved.

（実施の形態２）
第２の実施の形態では、第1の実施の形態を適用した映像符号化装置１００が出力した映像ストリームの復号化において、拡張レイヤの画像ブロックのＤＣＴ係数の逆スキャン順序を生成するためのスキャン判定式を映像ストリームから取得し、拡張レイヤの画像ブロックのＤＣＴ係数を前記スキャン判定式により生成したスキャン順序により復号化する。 (Embodiment 2)
In the second embodiment, in decoding of the video stream output from the video encoding apparatus 100 to which the first embodiment is applied, a scan for generating the reverse scan order of the DCT coefficients of the image block of the enhancement layer The determination formula is acquired from the video stream, and the DCT coefficients of the enhancement layer image block are decoded by the scan order generated by the scan determination formula.

図５は、本発明の実施の形態２に関わる映像復号化方法を適用した映像復号化装置５００の構成を表す。 FIG. 5 shows a configuration of a video decoding apparatus 500 to which the video decoding method according to Embodiment 2 of the present invention is applied.

図５において、映像復号化装置５００は、映像信号出力部５０１、基本レイヤ復号化部５０２、基本レイヤ入力部５０３、拡張レイヤ復号化部５１０を有する。拡張レイヤ復号化部５１０は、差分部５１１、ＩＤＣＴ部５１２、逆スキャン部５１３、可変長復号化部５１４、拡張レイヤ入力部５１５、スキャン判定式取得部５１６、スキャン決定部５１７を有する。 In FIG. 5, the video decoding apparatus 500 includes a video signal output unit 501, a base layer decoding unit 502, a base layer input unit 503, and an enhancement layer decoding unit 510. The enhancement layer decoding unit 510 includes a difference unit 511, an IDCT unit 512, an inverse scan unit 513, a variable length decoding unit 514, an enhancement layer input unit 515, a scan determination formula acquisition unit 516, and a scan determination unit 517.

基本レイヤ入力部５０３は、映像復号化装置５００の外部から入力した基本レイヤストリームを基本レイヤ復号化部５０２に出力する。また、外部から入力される映像ストリームの有無を判定し、映像ストリームの入力がなければ処理を終了する。 Base layer input section 503 outputs a base layer stream input from the outside of video decoding apparatus 500 to base layer decoding section 502. Also, the presence / absence of a video stream input from the outside is determined. If there is no video stream input, the process ends.

基本レイヤ復号化部５０２は、基本レイヤ入力部５０３から入力した基本レイヤストリームを復号化し基本レイヤ復号化画像を生成し、拡張レイヤ復号化部５１０に出力する。 Base layer decoding section 502 decodes the base layer stream input from base layer input section 503, generates a base layer decoded image, and outputs the base layer decoded image to enhancement layer decoding section 510.

ここで、基本レイヤ復号化画像は映像符号化装置１００で生成した基本レイヤ復号化画像に一致する。 Here, the base layer decoded image matches the base layer decoded image generated by the video encoding device 100.

拡張レイヤ入力部５１５は、映像復号化装置５００の外部から入力した拡張レイヤストリームを可変長復号化部５１４とスキャン判定式取得部５１６に出力する。 The enhancement layer input unit 515 outputs the enhancement layer stream input from the outside of the video decoding device 500 to the variable length decoding unit 514 and the scan determination formula acquisition unit 516.

可変長復号化部５１４は、拡張レイヤ入力部５１５から入力した拡張レイヤストリームを可変長復号化してスキャン済ＤＣＴ係数を生成し、逆スキャン部５１３に出力する。 The variable length decoding unit 514 performs variable length decoding on the enhancement layer stream input from the enhancement layer input unit 515 to generate a scanned DCT coefficient, and outputs the generated DCT coefficient to the inverse scanning unit 513.

なお、映像符号化装置１００が量子化を行う符号化の場合、可変長復号化部５１４が、可変長復号化をいった後に逆量子化の処理を行う。逆量子化とはＤＣＴ係数を除した数値で乗算する処理である。 Note that in the case of encoding in which the video encoding device 100 performs quantization, the variable length decoding unit 514 performs inverse quantization processing after performing variable length decoding. Inverse quantization is a process of multiplying by a numerical value obtained by dividing the DCT coefficient.

逆スキャン部５１３は、可変長復号化部５１４から入力したスキャン済ＤＣＴ係数を、スキャン決定部５１７から入力したスキャン順序の逆の順序でスキャンしてＤＣＴ係数を生成し、ＩＤＣＴ部５１２に出力する。 The inverse scan unit 513 generates the DCT coefficient by scanning the scanned DCT coefficient input from the variable length decoding unit 514 in the reverse order of the scan order input from the scan determination unit 517, and outputs the DCT coefficient to the IDCT unit 512. .

ＩＤＣＴ部５１２は、逆スキャン部５１２から入力したＤＣＴ係数をＩＤＣＴ変換して復号化画像ブロックを生成する。複数の領域に分割していた復号化画像ブロックを合成し復号化差分画像を生成し、加算部５１１に出力する。 The IDCT unit 512 performs IDCT conversion on the DCT coefficient input from the inverse scan unit 512 to generate a decoded image block. Decoded image blocks that have been divided into a plurality of regions are combined to generate a decoded difference image, which is output to the adder 511.

ここでＩＤＣＴ（ＩｎｖｅｒｓｅＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ）変換とはＤＣＴ変換した係数をもとに戻す逆の処理である。 Here, the IDCT (Inverse Discrete Cosine Transform) conversion is an inverse process for returning the DCT converted coefficient to the original.

スキャン判定式取得部５１６は、拡張レイヤ入力部５１５から入力した拡張レイヤストリームからスキャン判定式を取得し、スキャン決定部５１７に出力する。 The scan determination formula acquisition unit 516 acquires a scan determination formula from the enhancement layer stream input from the enhancement layer input unit 515 and outputs the scan determination formula to the scan determination unit 517.

スキャン決定部５１７は、基本レイヤ復号化部５０２から入力した基本レイヤ復号化画像に対して、スキャン判定式取得部５１６から入力したスキャン判定式を用いて画像ブロックごとのスキャン順序を生成し、逆スキャン部５１３に出力する。 The scan determination unit 517 generates a scan order for each image block with respect to the base layer decoded image input from the base layer decoding unit 502 using the scan determination formula input from the scan determination formula acquisition unit 516, and performs reverse processing. The data is output to the scan unit 513.

加算部５１１は、基本レイヤ復号化部５０２から入力した基本レイヤ復号化画像と、ＩＤＣＴ部５１２から入力した復号化差分画像を加算して復号化画像を生成し、映像信号出力部５０１に出力する。 The addition unit 511 generates a decoded image by adding the base layer decoded image input from the base layer decoding unit 502 and the decoded differential image input from the IDCT unit 512, and outputs the decoded image to the video signal output unit 501. .

映像信号出力部５０１は、加算部５１１から入力した復号化画像を映像復号化装置５００の外部に出力する。 The video signal output unit 501 outputs the decoded image input from the adding unit 511 to the outside of the video decoding device 500.

なお、スキャン判定式取得部５１６が本発明のスキャン判定式取得手段に相当し、スキャン決定部５１７がスキャン決定手段に相当する。 The scan determination formula acquisition unit 516 corresponds to the scan determination formula acquisition unit of the present invention, and the scan determination unit 517 corresponds to the scan determination unit.

次に、以上のように構成された映像復号化装置５００の動作を説明する。図６は図５に示す第２の実施の形態の映像復号化装置５００の動作の一例を示すフローチャートである。なお、図６に示すフローチャートは、図示しない記憶装置（例えばＲＯＭやフラッシュメモリなど）に格納された制御プログラムを、同じく図示しないＣＰＵが実行することにより、プログラムの実行によりソフトウエア的に実行されるようにすることも可能である。 Next, the operation of the video decoding apparatus 500 configured as described above will be described. FIG. 6 is a flowchart showing an example of the operation of the video decoding apparatus 500 according to the second embodiment shown in FIG. Note that the flowchart shown in FIG. 6 is executed by software by executing a control program stored in a storage device (not shown) such as a ROM or a flash memory by a CPU (not shown). It is also possible to do so.

まず、ステップＳ６０１において、基本レイヤ入力部５０３が、映像復号化装置５００の外部から入力した基本レイヤストリームを基本レイヤ復号化部５０２に出力する。また、拡張レイヤ入力部５１５が、映像復号化装置５００の外部から入力した拡張レイヤストリームを可変長復号化部５１４とスキャン判定式取得部５１６に出力する。 First, in step S 601, the base layer input unit 503 outputs the base layer stream input from the outside of the video decoding apparatus 500 to the base layer decoding unit 502. Also, the enhancement layer input unit 515 outputs the enhancement layer stream input from the outside of the video decoding apparatus 500 to the variable length decoding unit 514 and the scan determination formula acquisition unit 516.

次に、ステップＳ６０２において、基本レイヤ復号化部５０２が、基本レイヤ入力部５０３から入力した基本レイヤストリームを復号化し基本レイヤ復号化画像を生成し、拡張レイヤ復号化部５１０に出力する。 Next, in step S602, base layer decoding section 502 decodes the base layer stream input from base layer input section 503, generates a base layer decoded image, and outputs the base layer decoded image to enhancement layer decoding section 510.

次に、ステップＳ６０３において、可変長復号化部５１４が、拡張レイヤ入力部５１５から入力した拡張レイヤストリームを可変長復号化してスキャン済ＤＣＴ係数を生成し、逆スキャン部５１３に出力する。 Next, in step S603, the variable length decoding unit 514 performs variable length decoding on the enhancement layer stream input from the enhancement layer input unit 515 to generate a scanned DCT coefficient, and outputs the generated DCT coefficient to the inverse scan unit 513.

次に、ステップＳ６０４において、スキャン判定式取得部５１６が、拡張レイヤ入力部５１５から入力した拡張レイヤストリームからスキャン判定式を取得し、スキャン決定部５１７に出力する。 Next, in step S 604, the scan determination formula acquisition unit 516 acquires a scan determination formula from the enhancement layer stream input from the enhancement layer input unit 515 and outputs the scan determination formula to the scan determination unit 517.

次に、ステップＳ６０５において、スキャン決定部５１７が、基本レイヤ復号化部５０２から入力した基本レイヤ復号化画像に対して、スキャン判定式取得部５１６から入力したスキャン判定式を用いて画像ブロックごとのスキャン順序を生成し、逆スキャン部５１３に出力する。 In step S605, the scan determination unit 517 uses the scan determination formula input from the scan determination formula acquisition unit 516 for the base layer decoded image input from the base layer decoding unit 502 for each image block. A scan order is generated and output to the reverse scan unit 513.

次に、ステップＳ６０６において、ＩＤＣＴ部５１２が、逆スキャン部５１２から入力したＤＣＴ係数をＩＤＣＴ変換して復号化画像ブロックを生成して、分割していた画像ブロックを合成し復号化差分画像を生成し、加算部５１１に出力する。 Next, in step S606, the IDCT unit 512 generates a decoded image block by performing IDCT conversion on the DCT coefficient input from the inverse scan unit 512, and generates a decoded differential image by combining the divided image blocks. And output to the adding unit 511.

次に、ステップＳ６０７において、逆スキャン部５１３が、可変長復号化部５１４から入力したスキャン済ＤＣＴ係数を、スキャン決定部５１７から入力したスキャン順序の逆の順序でスキャンしてＤＣＴ係数を生成し、ＩＤＣＴ部５１２に出力する。 Next, in step S 607, the inverse scan unit 513 scans the scanned DCT coefficients input from the variable length decoding unit 514 in the reverse order of the scan order input from the scan determination unit 517 to generate DCT coefficients. , Output to the IDCT unit 512.

次に、ステップＳ６０８において、加算部５１１が、基本レイヤ復号化部５０２から入力した基本レイヤ復号化画像と、ＩＤＣＴ部５１２から入力した復号化差分画像を加算して復号化画像を生成し、映像信号出力部５０１に出力する。 Next, in step S608, the addition unit 511 generates a decoded image by adding the base layer decoded image input from the base layer decoding unit 502 and the decoded differential image input from the IDCT unit 512 to generate a video image. The signal is output to the signal output unit 501.

次に、ステップＳ６０９において、映像信号出力部５０１が、加算部５１１から入力した復号化画像を映像復号化装置５００の外部に出力する。 Next, in step S 609, the video signal output unit 501 outputs the decoded image input from the addition unit 511 to the outside of the video decoding device 500.

最後に、ステップＳ６１０において、基本レイヤ復号化部５０２が、外部から入力される映像ストリームの有無を判定し、映像ストリームの入力がなければ処理を終了する。そうでなければ、ステップＳ６０１に戻る。 Finally, in step S610, the base layer decoding unit 502 determines whether or not there is a video stream input from the outside. If there is no video stream input, the process ends. Otherwise, the process returns to step S601.

なお、ステップＳ６０４が本発明のスキャン判定式取得処理ステップに相当し、ステップＳ６０５がスキャン決定処理ステップに相当する。 Note that step S604 corresponds to the scan determination formula acquisition processing step of the present invention, and step S605 corresponds to the scan determination processing step.

以上のように、本実施の形態２によれば、映像復号化装置５００は、拡張レイヤ復号化部５１５がスキャン判定式を映像ストリームから取得し、スキャン決定部５１７が基本レイヤ復号化画像に対してスキャン判定式を用いることにより、拡張レイヤの全画像ブロックのスキャン順序を映像ストリームに含めることなくスキャン順序を特定することができ、符号化効率が向上する。 As described above, according to the second embodiment, in video decoding apparatus 500, enhancement layer decoding section 515 acquires a scan determination formula from a video stream, and scan determination section 517 applies to a base layer decoded image. By using the scan determination formula, it is possible to specify the scan order without including the scan order of all the image blocks of the enhancement layer in the video stream, and the encoding efficiency is improved.

（実施の形態３）
第３の実施の形態では、予測を用いた映像符号化において、符号化画像ブロックのＤＣＴ係数のスキャン順序を生成するためのスキャン判定式を、符号化画像ブロックの予測符号化に用いる参照画像ブロックから生成し、ＤＣＴ係数を前記スキャン判定式により生成したスキャン順序により符号化する。 (Embodiment 3)
In the third embodiment, in video coding using prediction, a scan determination formula for generating a scan order of DCT coefficients of a coded image block is used as a reference image block for predictive coding of the coded image block. And the DCT coefficients are encoded according to the scan order generated by the scan determination formula.

本実施の形態は、第１と第２の実施の形態と異なり階層符号化ではない映像ストリームを対象とする。 Unlike the first and second embodiments, this embodiment targets a video stream that is not hierarchically encoded.

また、本実施の形態は、映像すなわち連続する画像の符号化に関して説明するが、静止画像の符号化に用いても構わない。 Further, although the present embodiment will be described with respect to encoding of video, that is, continuous images, it may be used for encoding of still images.

図７は、本発明の実施の形態３に関わる映像符号化方法を適用した映像符号化装置７００の構成を表す。 FIG. 7 shows a configuration of a video encoding apparatus 700 to which the video encoding method according to Embodiment 3 of the present invention is applied.

図７において、映像符号化装置７００は、映像信号入力部７０１、予測符号化部７０２、ＤＣＴ部７０３、スキャン部７０４、可変長符号化部７０５、ストリーム出力部７０６、復号化部７０７、スキャン判定式生成部７０８、スキャン決定部７０９を有する。 In FIG. 7, a video encoding apparatus 700 includes a video signal input unit 701, a predictive encoding unit 702, a DCT unit 703, a scanning unit 704, a variable length encoding unit 705, a stream output unit 706, a decoding unit 707, a scan determination. An expression generation unit 708 and a scan determination unit 709 are included.

映像信号入力部７０１は、映像符号化装置７１０の外部から映像を１フレームずつ原画像として入力し、予測符号化部７０２に出力する。また、映像符号化装置７１０の外部から入力される映像の有無を判定し、映像の入力がなければ処理を終了する。 The video signal input unit 701 inputs video from the outside of the video encoding device 710 frame by frame as an original image and outputs it to the prediction encoding unit 702. Also, the presence / absence of a video input from the outside of the video encoding device 710 is determined. If there is no video input, the process is terminated.

予測符号化部７０２は、映像信号入力部７０１から入力した原画像と復号化部７０７から入力した復号化画像ブロックを用いて予測符号化により差分画像を生成して、ＤＣＴ部７０３に出力する。また、予測符号化に用いた参照画像をスキャン判定式生成部７０８に出力する。また、予測符号化により予測情報を生成し、ストリーム出力部７０６に出力する。 The prediction encoding unit 702 generates a difference image by prediction encoding using the original image input from the video signal input unit 701 and the decoded image block input from the decoding unit 707, and outputs the difference image to the DCT unit 703. In addition, the reference image used for predictive encoding is output to the scan determination formula generation unit 708. Also, prediction information is generated by predictive encoding and output to the stream output unit 706.

予測符号化については後述する。 Prediction coding will be described later.

ＤＣＴ部７０３は、予測符号化部７０２から入力した差分画像を４×４画素ごとの画像ブロックに分割し、画像ブロックごとにＤＣＴ変換と量子化を施して量子化ＤＣＴ係数を生成し、スキャン部７０４と復号化部７０７に出力する。 The DCT unit 703 divides the difference image input from the predictive encoding unit 702 into 4 × 4 pixel image blocks, performs DCT transformation and quantization for each image block, generates quantized DCT coefficients, and scan unit 704 and the decryption unit 707.

ここで、実施の形態１では画像ブロックを８×８画素としていたが、本実施の形態では説明の簡単のため画像ブロックを４×４画素とする。しかしながら、特に画像ブロックの大きさに限定はない。 Here, the image block is 8 × 8 pixels in the first embodiment, but in this embodiment, the image block is 4 × 4 pixels for the sake of simplicity of explanation. However, the size of the image block is not particularly limited.

スキャン部７０４は、ＤＣＴ部７０３から入力したＤＣＴ係数をスキャン決定部７０９から入力したスキャン順序を用いて並べ替えてスキャン済ＤＣＴ係数を生成し、可変長符号化部７０５に出力する。 The scan unit 704 rearranges the DCT coefficients input from the DCT unit 703 using the scan order input from the scan determination unit 709 to generate a scanned DCT coefficient, and outputs the generated DCT coefficient to the variable length encoding unit 705.

可変長符号化部７０５は、スキャン部７０４から入力したスキャン済ＤＣＴ係数を可変長符号化して映像ストリームを生成し、ストリーム出力部７０６に出力する。 The variable length coding unit 705 generates a video stream by variable length coding the scanned DCT coefficient input from the scanning unit 704 and outputs the video stream to the stream output unit 706.

ストリーム出力部７０６は、可変長符号化部７０５から入力した映像ストリームに、予測符号化部７０２から入力した予測情報とスキャン判定式生成部７０８から入力したスキャン判定式を格納し、映像符号化装置７００の外部に出力する。 The stream output unit 706 stores the prediction information input from the prediction encoding unit 702 and the scan determination expression input from the scan determination expression generation unit 708 in the video stream input from the variable length encoding unit 705, and the video encoding device Output outside 700.

復号化部７０７は、ＤＣＴ部７０３から入力した量子化ＤＣＴ係数をＩＤＣＴ変換し復号化画像ブロックを生成し、予測符号化部７０２に出力する。 The decoding unit 707 performs IDCT conversion on the quantized DCT coefficient input from the DCT unit 703 to generate a decoded image block, and outputs the decoded image block to the prediction encoding unit 702.

スキャン判定式生成部７０８は、予測符号化部７０２から入力した参照画像を用いてスキャン判定式を生成し、ストリーム出力部７０６とスキャン決定部７０９に出力する。 The scan determination formula generation unit 708 generates a scan determination formula using the reference image input from the prediction encoding unit 702 and outputs the scan determination formula to the stream output unit 706 and the scan determination unit 709.

スキャン決定部７０９は、基本レイヤ復号化部７０２から入力した参照画像とスキャン判定式生成部７０８から入力したスキャン判定式を用いて符号化画像ブロックごとのスキャン順序を生成し、スキャン部７０４に出力する。 The scan determination unit 709 generates a scan order for each encoded image block using the reference image input from the base layer decoding unit 702 and the scan determination formula input from the scan determination formula generation unit 708, and outputs the scan order to the scan unit 704. To do.

なお、スキャン判定式生成部７０８が本発明のスキャン判定式生成手段に相当し、スキャン決定部７０９がスキャン決定手段に相当し、ストリーム出力部７０６がスキャン判定式格納手段に相当する。 The scan determination formula generation unit 708 corresponds to the scan determination formula generation unit of the present invention, the scan determination unit 709 corresponds to the scan determination unit, and the stream output unit 706 corresponds to the scan determination formula storage unit.

以下に、本実施の形態の予測符号化部７０２が行う予測符号化と、スキャン判定式生成部７０８が行うスキャン判定式の生成方法について説明する。 The predictive encoding performed by the predictive encoding unit 702 according to the present embodiment and the scan determination formula generation method performed by the scan determination formula generation unit 708 will be described below.

予測符号化部７０２では、公知の技術であるＩＳＯ／ＩＥＣ１４４９６−１０に記載のＭＰＥＧ−４ＡＶＣに用いられている画面間予測符号化や画面内予測符号化などを行う。 The predictive coding unit 702 performs inter-screen predictive coding and intra-screen predictive coding used in MPEG-4 AVC described in ISO / IEC 14496-10, which is a known technique.

図１６を用いて画面間予測符号化を説明する。 The inter-picture prediction encoding will be described with reference to FIG.

図１６（ａ）の符号化画像ブロック１６０１と類似した領域を既に映像ストリームに格納した過去や未来の復号化画像の中から図１６（ｃ）の画像ブロック１６０５のように選出し、参照画像ブロック１６０５とする。 An area similar to the encoded image block 1601 in FIG. 16A is selected from past and future decoded images already stored in the video stream as an image block 1605 in FIG. 1605.

特許文献２では図１６（ａ）の画像ブロック１６０２や図１６（ｂ）の画像ブロック１６０３、１６０４で表すＤＣＴ変換を行う画像ブロック単位でしか参照する画像ブロックを選出することが出来ないが、本発明では図１６（ｃ）の画像ブロック１６０５のように任意の位置の画素を含む画像ブロックを選出することが可能である。 In Patent Document 2, it is possible to select an image block to be referred to only in units of image blocks to be subjected to DCT conversion represented by the image block 1602 in FIG. 16A and the image blocks 1603 and 1604 in FIG. 16B. In the invention, it is possible to select an image block including a pixel at an arbitrary position, such as an image block 1605 in FIG.

映像ストリームに、参照画像ブロック１６０５を持つ画像や参照画像ブロック１６０５の位置などを示す予測情報と、参照画像ブロック１６０５と符号化画像ブロック１６０２の差分のＤＣＴ係数を符号化して格納する。一般に、参照画像ブロック１６０５は符号化画像ブロック１６０１との差分が小さくなるように選出するので、画面間予測符号化はそれを用いない場合よりも少ない符号量で符号化画像ブロック１６０１を映像ストリームに格納することが可能である。 In the video stream, the prediction information indicating the image having the reference image block 1605, the position of the reference image block 1605, and the like, and the DCT coefficient of the difference between the reference image block 1605 and the encoded image block 1602 are encoded and stored. In general, since the reference image block 1605 is selected so that the difference from the encoded image block 1601 is small, inter-picture predictive encoding is performed by using the encoded image block 1601 as a video stream with a smaller code amount than when not using it. Can be stored.

予測符号化部７０２は、予測符号化に用いるための過去や未来の復号化画像を蓄積するための記憶領域を有するものとする。 The predictive encoding unit 702 has a storage area for storing past and future decoded images for use in predictive encoding.

画面間予測符号化において、参照画像ブロックは符号化画像ブロックとの差分が小さいように選ぶ。例えば、参照画像内において符号化画像ブロックとの画素ごとの差分の絶対２乗和が最小となる参照画像ブロックを選ぶ。よって、それら２つの画像ブロックのエッジの特性は似ており、実施の形態１で述べた判定式の生成方法を同様に用いることが可能である。 In inter-picture predictive encoding, the reference image block is selected so that the difference from the encoded image block is small. For example, in the reference image, a reference image block is selected that minimizes the absolute sum of squares of differences for each pixel from the encoded image block. Therefore, the edge characteristics of the two image blocks are similar, and the determination formula generation method described in the first embodiment can be used in the same manner.

また、予測符号化の参照画像ブロックの選出方法にスキャン判定式に含めても良い。 In addition, the scan determination formula may include a reference image block selection method for predictive coding.

これにより、例えば、参照画像ブロックの水平方向のエッジの強度がある閾値以上であれば、符号化画像ブロックとの差分のＤＣＴ係数の水平スキャンが最短スキャンとなる参照画像ブロックの選出をするというルールを作ることにより、参照画像ブロックからスキャン順序を特定することが可能となる。 Thereby, for example, if the intensity of the horizontal edge of the reference image block is equal to or greater than a certain threshold value, the rule is to select a reference image block in which the horizontal scan of the difference DCT coefficient from the encoded image block is the shortest scan. This makes it possible to specify the scan order from the reference image block.

図８を用いて画面内予測符号化を説明する。 The intra prediction encoding will be described with reference to FIG.

図８（ａ）の８０１は符号化する４×４画素の画像ブロックを表す。その周りのマスは既に符号化し映像ストリームに格納している画素を復号化したものを表す。画面内予測符号化では、符号化画像ブロックの周りの復号化した画素を図８（ｂ）の予測方向のうちの一つに従って画像ブロック８０１の位置にコピーしたものを参照画像ブロックとする。 Reference numeral 801 in FIG. 8A denotes an image block of 4 × 4 pixels to be encoded. The surrounding cells represent the decoded pixels that have already been encoded and stored in the video stream. In intra prediction encoding, a reference image block is obtained by copying a decoded pixel around an encoded image block to the position of an image block 801 according to one of the prediction directions in FIG. 8B.

映像ストリームに予測方向を示す予測情報と、画像ブロック８０１と参照画像ブロックの差分のＤＣＴ係数を符号化して格納する。一般に、参照画像ブロックは画像ブロック８０１との差分が小さくなるように生成するので、画面内予測符号化はそれを用いない場合よりも少ない符号量で符号化画像ブロック１６０２を映像ストリームに格納することが可能である。 The prediction information indicating the prediction direction and the DCT coefficient of the difference between the image block 801 and the reference image block are encoded and stored in the video stream. In general, since the reference image block is generated so that the difference from the image block 801 is small, the intra-frame predictive coding stores the coded image block 1602 in the video stream with a smaller code amount than when not using it. Is possible.

画面内予測符号化において、参照画像ブロックは符号化画像ブロックとの差分が小さいように選ぶ。例えば、全予測方向から符号化画像ブロックとの画素ごとの差分の絶対２乗和が最小となる予測方向を選ぶ。よって、それら２つの画像ブロックのエッジの特性は似ており、実施の形態１で述べた判定式の生成方法を同様に用いることが可能である。 In the intra prediction encoding, the reference image block is selected so that the difference from the encoded image block is small. For example, a prediction direction in which the absolute sum of squares of differences for each pixel from the encoded image block is selected from all prediction directions is selected. Therefore, the edge characteristics of the two image blocks are similar, and the determination formula generation method described in the first embodiment can be used in the same manner.

また、スキャン判定式の条件の中に、図８（ｂ）に示す画面内予測の予測方向を含めても良い。 Further, the prediction direction of the intra prediction shown in FIG. 8B may be included in the conditions of the scan determination formula.

これにより、例えば、予測方向が真横であるならば水平方向のエッジはなく、水平方向のエッジの強度を計算を省略することにより処理負荷が軽減する。 Thereby, for example, if the prediction direction is just beside, there is no edge in the horizontal direction, and the processing load is reduced by omitting the calculation of the strength of the edge in the horizontal direction.

次に、以上のように構成された映像符号化装置７００の動作を説明する。図９は図７に示す第３の実施の形態の映像符号化装置７００の動作の一例を示すフローチャートである。なお、図９に示すフローチャートは、図示しない記憶装置（例えばＲＯＭやフラッシュメモリなど）に格納された制御プログラムを、同じく図示しないＣＰＵが実行することにより、プログラムの実行によりソフトウエア的に実行されるようにすることも可能である。 Next, the operation of the video encoding apparatus 700 configured as described above will be described. FIG. 9 is a flowchart showing an example of the operation of the video encoding apparatus 700 according to the third embodiment shown in FIG. The flowchart shown in FIG. 9 is executed by software by executing a control program stored in a storage device (not shown) such as a ROM or a flash memory by a CPU (not shown). It is also possible to do so.

まず、ステップＳ９０１において、映像信号入力部７０１が、映像符号化装置７１０の外部から映像を１フレームずつ原画像として入力し、予測符号化部７０２に出力する。 First, in step S 901, the video signal input unit 701 inputs a video as an original image frame by frame from the outside of the video encoding device 710 and outputs it to the predictive encoding unit 702.

次に、ステップＳ９０２において、予測符号化部７０２が、映像信号入力部７０１から入力した原画像と復号化部７０７から入力した復号化画像ブロックを用いて予測符号化により差分画像を生成して、ＤＣＴ部７０３に出力する。また、予測符号化により予測情報を生成し、ストリーム出力部７０６に出力する。 Next, in step S902, the prediction encoding unit 702 generates a difference image by predictive encoding using the original image input from the video signal input unit 701 and the decoded image block input from the decoding unit 707, and The data is output to the DCT unit 703. Also, prediction information is generated by predictive encoding and output to the stream output unit 706.

次に、ステップＳ９０３において、ＤＣＴ部７０３が、予測符号化部７０２から入力した差分画像を４×４画素ごとの画像ブロックに分割し、画像ブロックごとにＤＣＴ変換と量子化を施して量子化ＤＣＴ係数を生成し、スキャン部７０４と復号化部７０７に出力する。 Next, in step S903, the DCT unit 703 divides the difference image input from the predictive encoding unit 702 into 4 × 4 pixel image blocks, performs DCT transformation and quantization for each image block, and performs quantization DCT. A coefficient is generated and output to the scanning unit 704 and the decoding unit 707.

次に、ステップＳ９０４において、復号化部７０７が、ＤＣＴ部７０３から入力した量子化ＤＣＴ係数をＩＤＣＴ変換し復号化画像ブロックを生成し、予測符号化部７０２に出力する。 Next, in step S904, the decoding unit 707 generates a decoded image block by performing IDCT conversion on the quantized DCT coefficient input from the DCT unit 703, and outputs the decoded image block to the prediction encoding unit 702.

次に、ステップＳ９０５において、スキャン判定式生成部７０８が、予測符号化部７０２から入力した参照画像を用いてスキャン判定式を生成し、ストリーム出力部７０６とスキャン決定部７０９に出力する。 In step S 905, the scan determination formula generation unit 708 generates a scan determination formula using the reference image input from the prediction encoding unit 702 and outputs the scan determination formula to the stream output unit 706 and the scan determination unit 709.

次に、ステップＳ９０６において、スキャン決定部７０９が、基本レイヤ復号化部７０２から入力した参照画像とスキャン判定式生成部７０８から入力したスキャン判定式を用いて符号化画像ブロックごとのスキャン順序を生成し、スキャン部７０４に出力する。 In step S906, the scan determination unit 709 generates a scan order for each encoded image block using the reference image input from the base layer decoding unit 702 and the scan determination formula input from the scan determination formula generation unit 708. And output to the scan unit 704.

次に、ステップＳ９０７において、スキャン部７０４が、ＤＣＴ部７０３から入力したＤＣＴ係数をスキャン決定部７０９から入力したスキャン順序を用いて並べ替えてスキャン済ＤＣＴ係数を生成し、可変長符号化部７０５に出力する。 Next, in step S907, the scan unit 704 generates a scanned DCT coefficient by rearranging the DCT coefficients input from the DCT unit 703 using the scan order input from the scan determination unit 709, and the variable length encoding unit 705. Output to.

次に、ステップＳ９０８において、可変長符号化部７０５が、スキャン部７０４から入力したスキャン済ＤＣＴ係数を可変長符号化して映像ストリームを生成し、ストリーム出力部７０６に出力する。 In step S 908, the variable length coding unit 705 generates a video stream by variable length coding the scanned DCT coefficient input from the scanning unit 704, and outputs the video stream to the stream output unit 706.

次に、ステップＳ９０９において、ストリーム出力部７０６が、可変長符号化部７０５から入力した映像ストリームに、予測符号化部７０２から入力した予測情報とスキャン判定式生成部７０８から入力したスキャン判定式を格納し、映像符号化装置７００の外部に出力する。 Next, in step S909, the stream output unit 706 adds the prediction information input from the prediction encoding unit 702 and the scan determination expression input from the scan determination expression generation unit 708 to the video stream input from the variable length encoding unit 705. Store and output to the outside of the video encoding device 700.

最後に、ステップＳ９１０において、映像信号入力部７０１が、映像符号化装置７１０の外部から入力される映像の有無を判定し、映像の入力がなければ処理を終了する。そうでなければ、ステップＳ９０１に戻る。 Finally, in step S910, the video signal input unit 701 determines whether there is a video input from the outside of the video encoding device 710. If there is no video input, the process ends. Otherwise, the process returns to step S901.

なお、ステップＳ９０５が本発明のスキャン判定式生成処理ステップに相当し、ステップＳ９０６がスキャン決定処理ステップに相当し、ステップＳ９０９がスキャン判定式格納処理ステップに相当する。 Note that step S905 corresponds to the scan determination formula generation processing step of the present invention, step S906 corresponds to the scan determination processing step, and step S909 corresponds to the scan determination formula storage processing step.

以上のように、本実施の形態３によれば、映像符号化装置７００は、スキャン判定式生成部７０８が予測符号化の参照画像を用いてスキャン判定式を生成し、スキャン決定部７０９が予測符号化の参照画像に対してスキャン判定式を用いることにより、全ての画像ブロックに対してスキャン長が短いスキャン順序を生成することができ、予測符号化の符号化効率を向上することが可能である。 As described above, according to the third embodiment, in video encoding apparatus 700, scan determination expression generation section 708 generates a scan determination expression using a reference image for predictive encoding, and scan determination section 709 predicts. By using the scan determination formula for the reference image for encoding, it is possible to generate a scan order with a short scan length for all image blocks, and to improve the encoding efficiency of predictive encoding. is there.

また、本実施の形態３によれば、映像符号化装置７００は、スキャン判定式生成部７０８が予測情報を用いてスキャン判定式を生成することや、予測符号化部７０２がスキャン判定式の情報を用いて予測することにより、より短いスキャン長となるスキャン順序が生成でき、符号化効率を向上することが可能である。 Further, according to the third embodiment, in video encoding apparatus 700, scan determination formula generation section 708 generates a scan determination formula using prediction information, and predictive encoding section 702 uses scan determination formula information. By using the prediction, it is possible to generate a scan order having a shorter scan length, and to improve the encoding efficiency.

（実施の形態４）
第４の実施の形態では、第３の実施の形態を適用した映像符号化装置７００が出力した映像ストリームの復号化において、復号化する画像ブロックのＤＣＴ係数の逆スキャン順序を生成するためのスキャン判定式を映像ストリームから取得し、画像ブロックのＤＣＴ係数を前記スキャン判定式により生成したスキャン順序により復号化する。 (Embodiment 4)
In the fourth embodiment, in decoding a video stream output from the video encoding apparatus 700 to which the third embodiment is applied, a scan for generating a reverse scan order of DCT coefficients of an image block to be decoded The determination formula is acquired from the video stream, and the DCT coefficient of the image block is decoded by the scan order generated by the scan determination formula.

図１０は、本発明の実施の形態４に関わる映像復号化方法を適用した映像復号化装置１０００の構成を表す。 FIG. 10 shows the configuration of a video decoding apparatus 1000 to which the video decoding method according to Embodiment 4 of the present invention is applied.

図１０において、映像復号化装置１０００は、映像信号出力部１００１、予測復号化部１００２、ＩＤＣＴ部１００３、逆スキャン部１００４、可変長復号化部１００５、ストリーム入力部１００６、スキャン判定式取得部１００７、スキャン決定部１００８を有する。 In FIG. 10, the video decoding apparatus 1000 includes a video signal output unit 1001, a predictive decoding unit 1002, an IDCT unit 1003, an inverse scan unit 1004, a variable length decoding unit 1005, a stream input unit 1006, and a scan determination formula acquisition unit 1007. And a scan determination unit 1008.

ストリーム入力部１００６は、映像復号化装置１０００の外部から映像ストリームを入力し可変長復号化部１００５と予測復号化部１００２とスキャン判定式取得部１００７に出力する。また、外部から入力される映像ストリームの有無を判定し、映像ストリームの入力がなければ処理を終了する。 The stream input unit 1006 inputs a video stream from the outside of the video decoding apparatus 1000 and outputs the video stream to the variable length decoding unit 1005, the prediction decoding unit 1002, and the scan determination formula acquisition unit 1007. Also, the presence / absence of a video stream input from the outside is determined. If there is no video stream input, the process ends.

可変長復号化部１００５は、ストリーム入力部１００６から入力した映像ストリームを可変長復号化してスキャン済ＤＣＴ係数を生成し、逆スキャン部１００４に出力する。 The variable length decoding unit 1005 performs variable length decoding on the video stream input from the stream input unit 1006 to generate a scanned DCT coefficient, and outputs the generated DCT coefficient to the inverse scanning unit 1004.

逆スキャン部１００４は、可変長復号化部１００５から入力したスキャン済ＤＣＴ係数を、スキャン決定部１００８から入力したスキャン順序の逆の順序でスキャンして量子化ＤＣＴ係数を生成し、ＩＤＣＴ部５Ｊ０３に出力する。 The inverse scan unit 1004 scans the scanned DCT coefficients input from the variable length decoding unit 1005 in the reverse order of the scan order input from the scan determination unit 1008 to generate quantized DCT coefficients, and sends them to the IDCT unit 5J03. Output.

ＩＤＣＴ部１００３は、逆スキャン部１００４から入力した量子化ＤＣＴ係数を逆量子化とＩＤＣＴ変換を施して復号化画像ブロックを生成し、予測復号化部１００２に出力する。 The IDCT unit 1003 performs inverse quantization and IDCT conversion on the quantized DCT coefficient input from the inverse scan unit 1004 to generate a decoded image block, and outputs the decoded image block to the predictive decoding unit 1002.

スキャン判定式取得部１００７は、ストリーム入力部１００６から入力した映像ストリームからスキャン判定式を取得し、スキャン決定部１００８に出力する。 The scan determination formula acquisition unit 1007 acquires the scan determination formula from the video stream input from the stream input unit 1006 and outputs the scan determination formula to the scan determination unit 1008.

スキャン決定部１００８は、予測復号化部１００２から入力した参照画像に対して、スキャン判定式取得部１００７から入力したスキャン判定式を用いて画像ブロックごとのスキャン順序を生成し、逆スキャン部１００４に出力する。 The scan determination unit 1008 generates a scan order for each image block using the scan determination formula input from the scan determination formula acquisition unit 1007 for the reference image input from the predictive decoding unit 1002, and sends the scan order to the inverse scan unit 1004. Output.

予測復号化部１００２は、ストリーム入力部１００６から入力した映像ストリームから予測情報を取得し、ＩＤＣＴ部１００３から入力した復号化画像ブロックを用いて予測復号化により復号化画像を生成し、映像信号出力部１００１に出力する。また、予測復号化に用いた参照画像をスキャン決定部７０８に出力する。 The predictive decoding unit 1002 acquires prediction information from the video stream input from the stream input unit 1006, generates a decoded image by predictive decoding using the decoded image block input from the IDCT unit 1003, and outputs a video signal. Output to the unit 1001. In addition, the reference image used for predictive decoding is output to the scan determination unit 708.

映像信号出力部１００１は、予測復号化部１００２から入力した復号化画像を映像復号化装置１０００の外部に出力する。 The video signal output unit 1001 outputs the decoded image input from the predictive decoding unit 1002 to the outside of the video decoding apparatus 1000.

なお、スキャン判定式取得部１００７が本発明のスキャン判定式取得手段に相当し、スキャン決定部１００８がスキャン決定手段に相当する。 The scan determination formula acquisition unit 1007 corresponds to the scan determination formula acquisition unit of the present invention, and the scan determination unit 1008 corresponds to the scan determination unit.

次に、以上のように構成された映像復号化装置１０００の動作を説明する。図１１は図１０に示す第４の実施の形態の映像復号化装置１０００の動作の一例を示すフローチャートである。なお、図１１に示すフローチャートは、図示しない記憶装置（例えばＲＯＭやフラッシュメモリなど）に格納された制御プログラムを、同じく図示しないＣＰＵが実行することにより、プログラムの実行によりソフトウエア的に実行されるようにすることも可能である。 Next, the operation of the video decoding apparatus 1000 configured as described above will be described. FIG. 11 is a flowchart showing an example of the operation of the video decoding apparatus 1000 according to the fourth embodiment shown in FIG. The flowchart shown in FIG. 11 is executed by software by executing a control program stored in a storage device (not shown) such as a ROM or a flash memory by a CPU (not shown). It is also possible to do so.

まず、ステップＳ１１０１では、ストリーム入力部１００６が、映像復号化装置１０００の外部から映像ストリームを入力し可変長復号化部１００５と予測復号化部１００２とスキャン判定式取得部１００７に出力する。 First, in step S1101, the stream input unit 1006 inputs a video stream from the outside of the video decoding apparatus 1000, and outputs the video stream to the variable length decoding unit 1005, the predictive decoding unit 1002, and the scan determination formula acquisition unit 1007.

次に、ステップＳ１１０２では、可変長復号化部１００５が、ストリーム入力部１００６から入力した映像ストリームを可変長復号化してスキャン済ＤＣＴ係数を生成し、逆スキャン部１００４に出力する。 Next, in step S1102, the variable length decoding unit 1005 performs variable length decoding on the video stream input from the stream input unit 1006, generates a scanned DCT coefficient, and outputs the generated DCT coefficient to the inverse scanning unit 1004.

次に、ステップＳ１１０３では、スキャン判定式取得部１００７が、ストリーム入力部１００６から入力した映像ストリームからスキャン判定式を取得し、スキャン決定部１００８に出力する。 Next, in step S 1103, the scan determination formula acquisition unit 1007 acquires a scan determination formula from the video stream input from the stream input unit 1006 and outputs the scan determination formula to the scan determination unit 1008.

次に、ステップＳ１１０４では、スキャン決定部１００８が、予測復号化部１００２から入力した参照画像に対して、スキャン判定式取得部１００７から入力したスキャン判定式を用いて画像ブロックごとのスキャン順序を生成し、逆スキャン部１００４に出力する。 In step S1104, the scan determination unit 1008 generates a scan order for each image block using the scan determination formula input from the scan determination formula acquisition unit 1007 for the reference image input from the predictive decoding unit 1002. And output to the reverse scan unit 1004.

次に、ステップＳ１１０５では、逆スキャン部１００４が、可変長復号化部１００５から入力したスキャン済ＤＣＴ係数を、スキャン決定部１００８から入力したスキャン順序の逆の順序でスキャンして量子化ＤＣＴ係数を生成し、ＩＤＣＴ部５Ｊ０３に出力する。 Next, in step S1105, the inverse scan unit 1004 scans the scanned DCT coefficients input from the variable length decoding unit 1005 in the reverse order of the scan order input from the scan determination unit 1008 to obtain the quantized DCT coefficients. Generate and output to the IDCT unit 5J03.

次に、ステップＳ１１０６では、ＩＤＣＴ部１００３が、逆スキャン部１００４から入力した量子化ＤＣＴ係数を逆量子化とＩＤＣＴ変換を施して復号化画像ブロックを生成し、予測復号化部１００２に出力する。 Next, in step S 1106, the IDCT unit 1003 performs inverse quantization and IDCT transformation on the quantized DCT coefficient input from the inverse scan unit 1004, generates a decoded image block, and outputs the decoded image block to the predictive decoding unit 1002.

次に、ステップＳ１１０７では、予測復号化部１００２が、ストリーム入力部１００６から入力した映像ストリームから予測情報を取得し、ＩＤＣＴ部１００３から入力した復号化画像ブロックを用いて予測復号化により復号化画像を生成し、映像信号出力部１００１に出力する。また、予測復号化に用いた参照画像をスキャン決定部７０８に出力する。 Next, in step S1107, the predictive decoding unit 1002 acquires prediction information from the video stream input from the stream input unit 1006, and uses the decoded image block input from the IDCT unit 1003 to decode a decoded image by predictive decoding. Is output to the video signal output unit 1001. In addition, the reference image used for predictive decoding is output to the scan determination unit 708.

次に、ステップＳ１１０８では、映像信号出力部１００１が、予測復号化部１００２から入力した復号化画像を映像復号化装置１０００の外部に出力する。 Next, in step S1108, the video signal output unit 1001 outputs the decoded image input from the predictive decoding unit 1002 to the outside of the video decoding apparatus 1000.

最後に、ステップＳ１１０９では、ストリーム入力部１００６が、外部から入力される映像ストリームの有無を判定し、映像ストリームの入力がなければ処理を終了する。そうでなければ、ステップＳ１１０１に戻る。 Finally, in step S1109, the stream input unit 1006 determines whether there is a video stream input from the outside. If there is no video stream input, the process ends. Otherwise, the process returns to step S1101.

なお、ステップＳ１１０３が本発明のスキャン判定式取得処理ステップに相当し、ステップＳ１１０４がスキャン決定処理ステップに相当する。 Note that step S1103 corresponds to the scan determination formula acquisition processing step of the present invention, and step S1104 corresponds to the scan determination processing step.

以上のように、本実施の形態４によれば、映像復号化装置１０００は、スキャン判定式取得部１００７がスキャン判定式を映像ストリームから取得し、スキャン決定部５１７が予測符号化の参照画像に対してスキャン判定式を用いることにより、全画像ブロックのスキャン順序を映像ストリームに含めることなくスキャン順序を特定することができ、符号化効率が向上する。 As described above, according to the fourth embodiment, in the video decoding apparatus 1000, the scan determination formula acquisition unit 1007 acquires the scan determination formula from the video stream, and the scan determination unit 517 converts it into a reference image for prediction encoding. On the other hand, by using the scan determination formula, it is possible to specify the scan order without including the scan order of all image blocks in the video stream, and the encoding efficiency is improved.

本発明は、画像ブロックごとに適したスキャン順序を生成することにより、符号化効率を向上することが可能であり、有用である。 INDUSTRIAL APPLICABILITY The present invention is useful because it is possible to improve encoding efficiency by generating a scan order suitable for each image block.

また、本発明は、特に階層符号化方式に適用することが可能で、拡張レイヤの符号化効率を向上することができ有用である。 In addition, the present invention is particularly applicable to the hierarchical coding scheme, and is useful because it can improve the coding efficiency of the enhancement layer.

よって、映像ストリームを符号化した後、符号量を調整して復号化する用途に適している。 Therefore, after encoding a video stream, it is suitable for the purpose of decoding by adjusting the code amount.

また、本発明は、特に予測符号化方式に適用することが可能で、予測符号化の符号化効率を向上することができ有用である。 In addition, the present invention is particularly applicable to a predictive coding scheme, and is useful because it can improve the coding efficiency of predictive coding.

よって、映像すなわち連続する画像の符号化に適している。すなわち、通信速度の変動するネットワークを介して映像ストリームの量を動的に変化させながら映像を送受信するシステムの映像符号化方式に適している。また、１つの映像コンテンツを通信速度や端末の処理能力の異なる複数のユーザに、それぞれ異なる画質や通信速度で配信するシステムの映像符号化方式に適している。また、映像ストリームを蓄積した後に、復号化して再度符号化することなく、蓄積容量を変更する映像蓄積装置の映像符号化方式に適している。 Therefore, it is suitable for encoding video, that is, continuous images. That is, it is suitable for a video encoding system of a system that transmits and receives video while dynamically changing the amount of the video stream via a network whose communication speed varies. Further, the present invention is suitable for a video encoding system of a system that distributes one video content to a plurality of users having different communication speeds and terminal processing capabilities at different image quality and communication speeds. Further, the present invention is suitable for a video encoding method of a video storage apparatus that changes the storage capacity without decoding and re-encoding after storing the video stream.

本発明の実施の形態１による映像符号化装置の構成を示す図The figure which shows the structure of the video coding apparatus by Embodiment 1 of this invention. 拡張レイヤストリームの構造の例を示す図Diagram showing an example of the structure of the enhancement layer stream ＤＣＴ係数の水平高周波と垂直高周波の例を示す図The figure which shows the example of the horizontal high frequency of a DCT coefficient, and a vertical high frequency 本発明の実施の形態１による映像符号化装置の動作を示すフローチャートThe flowchart which shows operation | movement of the video coding apparatus by Embodiment 1 of this invention. 本発明の実施の形態２による映像復号化装置の構成を示す図The figure which shows the structure of the video decoding apparatus by Embodiment 2 of this invention. 本発明の実施の形態２による映像復号化装置の動作を示すフローチャートThe flowchart which shows operation | movement of the video decoding apparatus by Embodiment 2 of this invention. 本発明の実施の形態３による映像符号化装置の構成を示す図The figure which shows the structure of the video coding apparatus by Embodiment 3 of this invention. 画面内予測の例を示す図Diagram showing an example of in-screen prediction 本発明の実施の形態３による映像符号化装置の動作を示すフローチャートThe flowchart which shows operation | movement of the video coding apparatus by Embodiment 3 of this invention. 本発明の実施の形態４による映像復号化装置の構成を示す図The figure which shows the structure of the video decoding apparatus by Embodiment 4 of this invention. 本発明の実施の形態４による映像復号化装置の動作を示すフローチャートThe flowchart which shows operation | movement of the video decoding apparatus by Embodiment 4 of this invention. 量子化の例を示す図Diagram showing an example of quantization スキャン順序の例を示す図Diagram showing an example of scan order 特許文献１による映像符号化装置の構成を示す図The figure which shows the structure of the video coding apparatus by patent document 1 特許文献２による映像符号化装置の構成を示す図The figure which shows the structure of the video coding apparatus by patent document 2 特許文献２の概念図Conceptual diagram of Patent Document 2

Explanation of symbols

１０１，７０１映像信号入力部
１０２基本レイヤ符号化部
１０３基本レイヤ出力部
１１０拡張レイヤ符号化部
１１１差分部
１１２，７０３ＤＣＴ部
１１３，７０４スキャン部
１１４，７０５可変長符号化部
１１５拡張レイヤ出力部
１１６，５１６，７０８，９０７スキャン判定式生成部
１１７，５１７，７０９，９０８スキャン決定部
５０１，９０１映像信号出力部
５０２基本レイヤ復号化部
５０３基本レイヤ入力部
５１０拡張レイヤ復号化部
５１１加算部
５１２，９０３ＩＤＣＴ部
５１３，９０４逆スキャン部
５１４，９０５可変長復号部
５１５拡張レイヤ入力部
７０２予測符号化部
７０６ストリーム出力部
７０７復号化部
９０２予測復号化部
９０６ストリーム入力部
２０１拡張レイヤストリームヘッダ
２０２スキャン判定式
２０３画像ブロックの符号 101, 701 Video signal input unit 102 Base layer encoding unit 103 Base layer output unit 110 Enhancement layer encoding unit 111 Difference unit 112, 703 DCT unit 113, 704 Scan unit 114, 705 Variable length encoding unit 115 Enhancement layer output unit 116, 516, 708, 907 Scan determination formula generation unit 117, 517, 709, 908 Scan determination unit 501, 901 Video signal output unit 502 Base layer decoding unit 503 Base layer input unit 510 Enhancement layer decoding unit 511 Addition unit 512 903 IDCT unit 513 904 Reverse scan unit 514 905 Variable length decoding unit 515 Enhancement layer input unit 702 Predictive coding unit 706 Stream output unit 707 Decoding unit 902 Predictive decoding unit 906 Stream input unit 201 Enhancement layer stream head 202 code scan judgment formula 203 image blocks

Claims

The scan order, which is the order in which an encoded image block, which is a set of several pixels in the image to be encoded, is encoded and stored in the video stream is determined by the scan judgment formula for the reference image block to be referred to at the time of encoding. A scan determination means for determining; a scan determination expression generation means for generating the scan determination expression using a reference image referred to in encoding; a scan determination expression storage means for storing the scan determination expression in a video stream; A video encoding device.

The video encoding apparatus according to claim 1, wherein the scan determination formula storage unit stores the information of the scan determination formula at a position of the video stream before the code of the encoded image block.

The scan determination formula generation means compares the horizontal edge and the vertical edge of the reference image, and if the horizontal edge is included more than the vertical edge, the horizontal direction is given priority, and the vertical edge is determined. 2. The video encoding apparatus according to claim 1, wherein if there are more edges than in the horizontal direction, the scan determination formula is generated to cause the scan determination unit to determine the scan order in preference to the vertical direction.

The scan determination formula generation unit compares the horizontal high-frequency coefficient and the vertical high-frequency coefficient of the DCT coefficient of the reference image, and if the horizontal high-frequency coefficient is included more than the vertical high-frequency coefficient, the horizontal direction is given priority, and the vertical high-frequency coefficient is set as the horizontal high-frequency coefficient. 4. The video encoding apparatus according to claim 3, wherein when the number of coefficients is greater than the number of coefficients, the scan determination formula is generated to cause the scan determination unit to determine a scan order in preference to the vertical direction.

The video encoding apparatus according to claim 1, wherein the scan determination formula generation unit generates the scan determination formula by predicting a feature of the encoded image block from statistics of information for each reference image block.

The video encoding apparatus according to claim 1, wherein the scan determination formula generation unit generates the scan determination formula for each of the plurality of encoded image blocks.

The video encoding apparatus according to claim 1, wherein the scan determination formula generation unit changes a parameter value included in the scan determination formula using the past scan determination formula.

The video encoding apparatus according to claim 1, wherein the scan determination expression generation unit generates the scan determination expression using a result of scanning the encoded image block.

The video encoding apparatus according to claim 1, wherein the scan determination formula generation unit prepares and switches a plurality of scan determination formulas in advance.

In the hierarchical coding for hierarchizing and encoding a base layer capable of decoding video alone and a decoded image of the basic layer into an enhancement layer for improving the image quality, the scan determination formula generating means converts the base layer decoded image into the base layer decoded image The video encoding apparatus according to claim 1, wherein the scan determination formula for determining the scan order of the encoded block of the enhancement layer corresponding to the reference image block of the base layer is generated as a reference image.

The video encoding apparatus according to claim 1, wherein the scan determination expression generation unit generates the scan determination expression using an image referred to in the prediction encoding as the reference image in predictive encoding of a video.

The video encoding apparatus according to claim 11, wherein the predictive encoding is inter-picture predictive encoding.

The video encoding device according to claim 11, wherein the predictive encoding is intra-screen predictive encoding.

The predictive coding uses the reference image block predicted by the scan determination formula for predictive coding when the code amount of the coded image block is the smallest among the reference image block candidates of the predictive coding. The video encoding device according to claim 11.

12. The video encoding apparatus according to claim 11, wherein the scan determination formula generation unit generates the scan determination formula corresponding to the reference image block using information used by the prediction encoding for prediction.

The scan order, which is the order in which the encoded image block, which is a set of several pixels in the encoded image, is encoded and stored in the video stream is determined by the scan determination formula for the reference image block that is referred to at the time of decoding. A video decoding apparatus configured to include: a scan determination unit that determines; and a scan determination formula acquisition unit that acquires the scan determination formula from a video stream.

A video encoding method for encoding video, wherein the video encoding device executes
Scan determination formula for other reference image blocks that are referred to when encoding the scan order, which is the order in which an encoded image block that is a set of several pixels in the image to be encoded is encoded and stored in the video stream Scanning determination processing step determined by:
A scan determination expression generation processing step for generating the scan determination expression using a reference image referred to in encoding;
And a scan determination expression storing processing step of storing the scan determination expression in a video stream.

A video decoding method for decoding a video stream, which is executed by a video decoding device,
The scan order, which is the order in which an encoded image block, which is a set of several pixels in the encoded image, is encoded and stored in the video stream is determined by the scan determination formula for the reference image block that is referred to at the time of decoding. A video decoding method comprising: a scan determination processing step for determining; and a scan determination formula acquisition processing step for acquiring the scan determination formula from a video stream.

Computer to perform video encoding,
Scan determination formula for other reference image blocks that are referred to when encoding the scan order, which is the order in which an encoded image block that is a set of several pixels in the image to be encoded is encoded and stored in the video stream Scanning determination processing step determined by:
A scan determination expression generation processing step for generating the scan determination expression using a reference image referred to in encoding;
A video encoding program that operates as a scan determination formula storage processing step for storing the scan determination formula in a video stream.

A computer to perform the decoding of the video stream,
The scan order, which is the order in which the encoded image block, which is a set of several pixels in the encoded image, is encoded and stored in the video stream is determined by the scan determination formula for the reference image block that is referred to at the time of decoding. A video decoding program that operates as a scan determination processing step to determine and a scan determination formula acquisition processing step to acquire the scan determination formula from a video stream.