JP2016106444A

JP2016106444A - Moving image encoding apparatus and moving image encoding method

Info

Publication number: JP2016106444A
Application number: JP2013063456A
Authority: JP
Inventors: 安倍　清史; Seishi Abe; 清史安倍; 一仁木村; Kazuhito Kimura; 秀之大古瀬; Hideyuki Okose; 荒川　博; Hiroshi Arakawa; 博荒川; 耕治有村; Koji Arimura; 和真榊原; Kazuma Sakakibara
Original assignee: Panasonic Corp
Current assignee: Panasonic Corp
Priority date: 2013-03-26
Filing date: 2013-03-26
Publication date: 2016-06-16
Also published as: WO2014155454A1

Abstract

PROBLEM TO BE SOLVED: To provide a moving image encoding apparatus that performs encoding by selecting a combination of plural block sizes to be subjected to inter-image prediction processing and plural block sizes to be subjected to orthogonal transform processing and that is capable of properly performing block size determination processing with a small amount of processing.SOLUTION: A moving image encoding apparatus 100 includes: an inter-image prediction processing unit 107 that generates a prediction image for each of blocks; a difference calculation unit 109 that generates a difference image for each of the blocks; and an orthogonal transform processing block size determination unit 103 that determines the block size to be subjected to orthogonal transform processing according to the magnitude of amplitude of pixel values of difference images associated with the plural blocks.SELECTED DRAWING: Figure 1

Description

本開示は、入力された画像をブロックに分割して符号化する動画像符号化装置に関する。 The present disclosure relates to a moving image encoding apparatus that divides an input image into blocks and encodes them.

近年、マルチメディアアプリケーションの発展に伴い、画像、音声及びテキストなど、あらゆるメディアの情報を統一的に扱うことが一般的になってきた。また、ディジタル化された画像は膨大なデータ量を持つため、蓄積及び伝送のためには、画像の情報圧縮技術が不可欠である。一方で、圧縮した画像データを相互運用するためには、圧縮技術の標準化も重要である。例えば、画像圧縮技術の標準規格としては、ＩＴＵ−Ｔ（国際電気通信連合電気通信標準化部門）のＨ．２６１、Ｈ．２６３、Ｈ．２６４、ＩＳＯ／ＩＥＣ（国際標準化機構）のＭＰＥＧ−１、ＭＰＥＧ−３、ＭＰＥＧ−４、ＭＰＥＧ−４ＡＶＣなどがある。また、現在は、ＩＴＵ−ＴとＩＳＯ／ＩＥＣとの共同によるＨＥＶＣと呼ばれる次世代動画像符号化方式の標準化活動が進んでいる。 In recent years, with the development of multimedia applications, it has become common to handle all media information such as images, sounds and texts in a unified manner. Also, since a digitized image has a huge amount of data, an image information compression technique is indispensable for storage and transmission. On the other hand, in order to interoperate compressed image data, standardization of compression technology is also important. For example, as a standard for image compression technology, ITU-T (International Telecommunication Union Telecommunication Standardization Sector) 261, H.H. 263, H.M. H.264, ISO / IEC (International Organization for Standardization) MPEG-1, MPEG-3, MPEG-4, MPEG-4AVC, and the like. At present, standardization activities for a next-generation video coding method called HEVC in cooperation with ITU-T and ISO / IEC are in progress.

このような動画像の符号化では、符号化対象の各ピクチャを符号化単位ブロックに分割し、ブロック毎に時間方向および空間方向の冗長性を削減することによって情報量の圧縮を行う。時間的な冗長性の削減を目的とする画面間予測符号化では、前方または後方のピクチャを参照してブロック単位で動きの検出および予測画像の生成を行う。そして、得られた予測画像と符号化対象のブロックとの差分画像を取得する。また空間的な冗長性の削減を目的とする画面内予測符号化では、周辺の符号化済みブロックの画素情報をから予測画像の生成を行い、得られた予測画像と符号化対象のブロックとの差分画像を取得する。さらに得られた差分画像に対して離散コサイン変換等の直交変換処理および量子化処理を行い、可変長符号化および算術符号化を用いて符号列を生成することで情報量が圧縮される。 In such moving picture coding, each picture to be coded is divided into coding unit blocks, and the amount of information is compressed by reducing redundancy in the time direction and the spatial direction for each block. In inter-frame predictive coding for the purpose of reducing temporal redundancy, motion detection and prediction image generation are performed in units of blocks with reference to forward or backward pictures. Then, a difference image between the obtained predicted image and the block to be encoded is acquired. In addition, in the intra prediction encoding for the purpose of reducing spatial redundancy, a prediction image is generated from pixel information of surrounding encoded blocks, and the obtained prediction image and a block to be encoded are obtained. Get the difference image. Further, the amount of information is compressed by performing orthogonal transform processing such as discrete cosine transform and quantization processing on the obtained difference image and generating a code string using variable length coding and arithmetic coding.

ＨＥＶＣ（非特許文献１）では、前述の画面間予測符号化において予測画像を生成する単位として、３２×３２画素、１６×３２画素、１６×１６画素等の８種類のブロックサイズの中から任意のサイズを選択して使用することができる。撮像物の動きが複雑な画像では小さなブロックサイズを使用し、撮像物の動きが単純な画像では大きなブロックサイズを使用することで高い符号化効率を実現している。 In HEVC (Non-Patent Document 1), as a unit for generating a prediction image in the above-described inter-screen prediction coding, any of eight block sizes such as 32 × 32 pixels, 16 × 32 pixels, and 16 × 16 pixels can be arbitrarily selected. You can select and use the size. A high block efficiency is realized by using a small block size for an image with a complex motion of the imaged object and a large block size for an image with a simple motion of the imaged object.

また、前述の直交変換処理および量子化処理を行う単位として、３２×３２画素、１６×１６画素、８×８画素、４×４画素の４種類のブロックサイズの中から任意のサイズを選択して使用することができる。細かい範囲で特徴が異なる画像では小さなブロックサイズを使用し、広い範囲で特徴が同じような画像では大きなブロックサイズを使用することで高い符号化効率を実現している。 In addition, an arbitrary size is selected from four types of block sizes of 32 × 32 pixels, 16 × 16 pixels, 8 × 8 pixels, and 4 × 4 pixels as a unit for performing the above-described orthogonal transform processing and quantization processing. Can be used. High encoding efficiency is achieved by using a small block size for images with different features in a fine range and using a large block size for images with similar features in a wide range.

ＪＣＴＶＣ-Ｌ１００３: ＨｉｇｈＥｆｆｉｃｉｅｎｃｙＶｉｄｅｏＣｏｄｉｎｇ (ＨＥＶＣ) ｔｅｘｔｓｐｅｃｉｆｉｃａｔｉｏｎｄｒａｆｔ１０ (０１/２０１３)JCTVC-L1003: High Efficiency Video Coding (HEVC) text specification draft 10 (01/2013)

ＨＥＶＣでは前述の通り、画面間予測符号化において予測画像を生成する単位として８種類のブロックサイズを持つ。さらに、直交変換処理および量子化処理を行う単位として４種類のブロックサイズを持つ。そのため、符号化処理においてその中から最適なブロックサイズの組み合わせを決定するために、膨大な処理量が必要となってしまう。 As described above, HEVC has eight types of block sizes as a unit for generating a predicted image in inter-screen predictive coding. Furthermore, there are four types of block sizes as units for performing orthogonal transform processing and quantization processing. Therefore, an enormous amount of processing is required to determine an optimal combination of block sizes from among the encoding processing.

本開示は、例えばＨＥＶＣを用いた動画像符号化装置において、画面間予測符号化において予測画像を生成する単位であるブロックのサイズと、直交変換処理および量子化処理を行う単位であるブロックのサイズとの組み合わせを、少ない処理量で効率的に決定できる動画像符号化装置を提供する。 The present disclosure relates to a block size that is a unit for generating a prediction image in inter-frame prediction encoding and a block size that is a unit for performing orthogonal transform processing and quantization processing, for example, in a moving image encoding device using HEVC. A moving picture encoding apparatus capable of efficiently determining a combination with the above is provided with a small processing amount.

本開示における動画像符号化装置は、符号化対象ピクチャを画面間予測処理が含まれた所定の符号化規格にて符号化し、符号列を生成する動画像符号化装置であって、符号化対象ピクチャを構成する画素を、予め設定される複数のブロックサイズのうちいずれかのサイズを有する出力用のブロック毎に出力するブロック分割部と、出力用のブロックに属する画素に対して、出力用のブロックサイズ以下のブロックサイズを有する予測処理用のブロックを設定し、この予測処理用のブロック毎に予測画像を生成する画面間予測処理部と、予測処理用のブロック毎に生成された予測画像と、当該予測画像のそれぞれに対応する出力用のブロックに属する画素とを差分演算し、予測処理用のブロック単位で差分画像を生成する差分演算部と、予測処理用のブロックのうち隣接するブロックに属する差分画像に基づいて、（１）隣接する予測処理用のブロックを統合し得られる正方形状のブロックを直交変換用のブロックとして設定する第１動作と、（２）予測処理用のブロックを正方形状に分割し、分割して得られたブロックを直交変換用のブロックとして設定する第２動作を実行する直交変換ブロックサイズ決定部と、設定した直交変換用のブロック毎に、直交変換および量子化処理し、残差係数を生成する残差係数符号化部とを備える。 A moving image encoding apparatus according to the present disclosure is a moving image encoding apparatus that generates a code string by encoding a coding target picture using a predetermined coding standard including inter-screen prediction processing, and A block dividing unit that outputs pixels constituting a picture for each output block having any one of a plurality of block sizes set in advance, and an output unit for pixels belonging to the output block An inter-screen prediction processing unit that sets a block for prediction processing having a block size equal to or smaller than the block size and generates a prediction image for each block for prediction processing; a prediction image generated for each block for prediction processing; A difference calculation unit that calculates a difference between pixels belonging to an output block corresponding to each of the prediction images and generates a difference image in units of blocks for prediction processing; (1) a first operation for setting a square block obtained by integrating adjacent prediction processing blocks as an orthogonal transform block based on a difference image belonging to an adjacent block among the blocks for 2) Dividing the block for prediction processing into a square shape, an orthogonal transform block size determining unit that executes a second operation of setting the block obtained by the division as a block for orthogonal transform, and the set orthogonal transform Each block includes a residual coefficient encoding unit that performs orthogonal transform and quantization processing to generate a residual coefficient.

なお、本発明は、このような動画像符号化装置として実現することができるだけでなく、このような動画像符号化装置に含まれる各手段と同等の処理をプログラムや集積回路としても実現することもできる。 It should be noted that the present invention can be realized not only as such a moving picture coding apparatus, but also as a program or an integrated circuit that realizes processing equivalent to each means included in such a moving picture coding apparatus. You can also.

本開示における動画像符号化装置は、例えばＨＥＶＣを用いた動画像符号化装置において、画面間予測符号化において予測画像を生成する単位であるブロックのサイズと、直交変換処理および量子化処理を行う単位であるブロックのサイズとの組み合わせを、少ない処理量で効率的に決定できる。 The moving image encoding apparatus according to the present disclosure performs, for example, a block size that is a unit for generating a predicted image in inter-frame predictive encoding, orthogonal transform processing, and quantization processing in a moving image encoding device using HEVC. A combination with the unit block size can be determined efficiently with a small amount of processing.

本発明の実施の形態に係る動画像符号化装置を示すブロック図The block diagram which shows the moving image encoder which concerns on embodiment of this invention 処理単位となるブロックサイズの組み合わせを説明するための概念を示す図The figure which shows the concept for demonstrating the combination of the block size used as a processing unit 本発明の実施の形態に係る直交変換ブロックサイズ決定処理および残差係数符号化処理を示すフローチャートThe flowchart which shows the orthogonal transformation block size determination process and residual coefficient encoding process which concern on embodiment of this invention 直交変換ブロックサイズ決定処理において差分画像の画素値を用いた判定を行う方法の一例を説明するための概念をしめす図The figure which shows the concept for demonstrating an example of the method of performing the determination using the pixel value of a difference image in orthogonal transformation block size determination processing 本発明の実施の形態において決定される直交変換ブロックサイズの一例を説明するための概念を示す図The figure which shows the concept for demonstrating an example of the orthogonal transformation block size determined in embodiment of this invention 本発明の実施の形態に係る別の直交変換ブロックサイズ決定処理および残差係数符号化処理を示すフローチャートThe flowchart which shows another orthogonal transformation block size determination process and residual coefficient encoding process which concern on embodiment of this invention 本発明の実施の形態に係るまた別の直交変換ブロックサイズ決定処理および残差係数符号化処理を示すフローチャートThe flowchart which shows another orthogonal transformation block size determination process which concerns on embodiment of this invention, and a residual coefficient encoding process 直交変換ブロックサイズ決定処理において差分画像の画素値を用いた判定を行う方法の別の一例を説明するための概念を示す図The figure which shows the concept for demonstrating another example of the method of performing the determination using the pixel value of a difference image in orthogonal transformation block size determination processing. 本発明の実施の形態に係るまた別の直交変換ブロックサイズ決定処理および残差係数符号化処理を示すフローチャートThe flowchart which shows another orthogonal transformation block size determination process which concerns on embodiment of this invention, and a residual coefficient encoding process 直交変換ブロックサイズ決定処理において差分画像の画素値を用いた判定を行う方法のまた別の一例を説明するための概念を示す図The figure which shows the concept for demonstrating another example of the method of performing the determination using the pixel value of a difference image in orthogonal transformation block size determination processing.

以下、適宜図面を参照しながら、実施の形態を詳細に説明する。但し、必要以上に詳細な説明は省略する場合がある。例えば、既によく知られた事項の詳細説明や実質的に同一の構成に対する重複説明を省略する場合がある。これは、以下の説明が不必要に冗長になるのを避け、当業者の理解を容易にするためである。 Hereinafter, embodiments will be described in detail with reference to the drawings as appropriate. However, more detailed description than necessary may be omitted. For example, detailed descriptions of already well-known matters and repeated descriptions for substantially the same configuration may be omitted. This is to avoid the following description from becoming unnecessarily redundant and to facilitate understanding by those skilled in the art.

なお、発明者（ら）は、当業者が本開示を十分に理解するために添付図面および以下の説明を提供するのであって、これらによって特許請求の範囲に記載の主題を限定することを意図するものではない。 The inventor (s) provides the accompanying drawings and the following description in order for those skilled in the art to fully understand the present disclosure, and is intended to limit the subject matter described in the claims. Not what you want.

（実施の形態１）
以下、図１〜１０を用いて、実施の形態１を説明する。 (Embodiment 1)
Hereinafter, Embodiment 1 will be described with reference to FIGS.

（符号化装置全体の処理説明）
図１は、本発明の実施の形態に係る動画像符号化装置１００のブロック図である。動画像符号化装置１００は、ピクチャ単位で入力された画像をブロックに分割し、ブロック単位で符号化処理し、符号列を生成する。 (Description of processing of the entire encoding device)
FIG. 1 is a block diagram of a video encoding apparatus 100 according to an embodiment of the present invention. The moving image encoding apparatus 100 divides an image input in units of pictures into blocks, performs encoding processing in units of blocks, and generates a code string.

この動画像符号化装置１００は、ピクチャメモリ１０１と、ブロック分割部１０２と、直交変換ブロックサイズ決定部１０３と、残差係数符号化部１０４と、残差係数復号化部１０５と、ピクチャバッファ１０６と、画面間予測処理部１０７と、符号列生成部１０８とを備えている。 The moving picture encoding apparatus 100 includes a picture memory 101, a block dividing unit 102, an orthogonal transform block size determining unit 103, a residual coefficient encoding unit 104, a residual coefficient decoding unit 105, and a picture buffer 106. And an inter-screen prediction processing unit 107 and a code string generation unit 108.

ピクチャメモリ１０１は、表示を行う順にピクチャ単位で入力される入力画像信号を、符号化を行う順にピクチャの並び替えを行って格納する。そしてピクチャメモリ１０１は、ブロック分割部１０２からの読出し命令を受け付けると当該読出し命令に係る入力画像信号を出力する。 The picture memory 101 stores the input image signals input in units of pictures in the order of display by rearranging the pictures in the order of encoding. When the picture memory 101 receives a read command from the block dividing unit 102, the picture memory 101 outputs an input image signal related to the read command.

ブロック分割部１０２は、符号化対象ピクチャを構成する画素を、予め設定される複数のブロックサイズのうちいずれかのサイズを有する出力用のブロック毎に出力する。 The block division unit 102 outputs the pixels constituting the encoding target picture for each output block having one of a plurality of preset block sizes.

具体的にブロック分割部１０２は，ピクチャメモリ１０１から入力される入力画像信号を、符号化処理単位であるコーディングユニット（ＣＵ）と呼ばれるブロックに分割する。図２は選択可能なブロックのサイズを示す図である。 Specifically, the block division unit 102 divides the input image signal input from the picture memory 101 into blocks called coding units (CUs) that are encoding processing units. FIG. 2 is a diagram showing selectable block sizes.

図２に示されているように、ＣＵは６４×６４画素から８×８画素の４種類のブロックサイズの中から選択することができる。一般的に、入力画像の画素値や対象物の動きが複雑な領域では小さなブロックサイズを選択し、入力画像の画素値や対象物の動きが単純な領域では大きなブロックサイズを選択する。以降の処理はこのＣＵのブロック単位で処理が行われる。 As shown in FIG. 2, the CU can be selected from four block sizes ranging from 64 × 64 pixels to 8 × 8 pixels. In general, a small block size is selected in a region where the pixel value of the input image and the movement of the object are complex, and a large block size is selected in a region where the pixel value of the input image and the movement of the object are simple. Subsequent processing is performed in units of blocks of this CU.

画面間予測処理部１０７は、出力用のブロックに属する画素に対して、出力用のブロックサイズ以下のブロックサイズを有する予測処理用のブロックを設定し、この予測処理用のブロック毎に予測画像を生成する。 The inter-screen prediction processing unit 107 sets a prediction processing block having a block size equal to or smaller than the output block size for the pixels belonging to the output block, and generates a prediction image for each prediction processing block. Generate.

具体的に画面間予測処理部１０７は、ブロック分割部１０２から入力される入力画像信号を基に、ピクチャバッファ１０６に格納されている既に符号化済みの過去のピクチャの再構成画像信号を用いて画面間予測処理する。画面間予測処理では、前記入力画像の画素構成と最も似ている画素構成を持った領域を前記再構成画像の中から探索（動き探索）し、探索された前記再構成画像内の領域を予測画像として生成（動き補償）する。このとき、前記ＣＵ単位のブロックをさらに分割したプレディクションユニット（ＰＵ）と呼ばれるブロック単位で動き補償を行う。図２は選択可能なブロックのサイズを示す図である。図２に示されているように、例えばＣＵが３２×３２画素のブロックサイズの場合、ＰＵは３２×３２画素から２４×３２画素の８種類のブロックサイズの中から選択することができる。一般的に、撮像物の動きが複雑な画像では小さなブロックサイズを選択し、撮像物の動きが単純な画像では大きなブロックサイズを選択する。 Specifically, the inter-screen prediction processing unit 107 uses a reconstructed image signal of a past picture that has been encoded and stored in the picture buffer 106 based on the input image signal input from the block dividing unit 102. Inter-screen prediction processing. In the inter-screen prediction process, an area having a pixel configuration most similar to the pixel configuration of the input image is searched from the reconstructed image (motion search), and an area in the searched reconstructed image is predicted. Generate (motion compensation) as an image. At this time, motion compensation is performed in block units called prediction units (PUs) obtained by further dividing the CU unit block. FIG. 2 is a diagram showing selectable block sizes. As shown in FIG. 2, for example, when the CU has a block size of 32 × 32 pixels, the PU can be selected from eight block sizes of 32 × 32 pixels to 24 × 32 pixels. In general, a small block size is selected for an image with a complicated movement of the imaged object, and a large block size is selected for an image with a simple movement of the imaged object.

差分演算部１０９は、ブロック分割部１０２から入力されるＰＵ単位の入力画像信号と、画面間予測処理部１０７から入力されるＰＵ単位の予測画像信号との差分値である差分画像信号を生成する。生成した差分画像信号は直交変換ブロックサイズ決定部１０３に出力する。 The difference calculation unit 109 generates a difference image signal that is a difference value between the PU unit input image signal input from the block division unit 102 and the PU unit prediction image signal input from the inter-screen prediction processing unit 107. . The generated difference image signal is output to the orthogonal transform block size determination unit 103.

直交変換ブロックサイズ決定部１０３は、予測処理用のブロックのうち隣接するブロックに属する差分画像に基づいて、（１）隣接する予測処理用のブロックを統合し得られる正方形状のブロックを直交変換用のブロックとして設定する第１動作と、（２）予測処理用のブロックを正方形状に分割し、分割して得られたブロックを直交変換用のブロックとして設定する第２動作を実行する。 The orthogonal transform block size determination unit 103 performs orthogonal transform on a square block obtained by integrating (1) adjacent prediction processing blocks based on a difference image belonging to an adjacent block among prediction processing blocks. And (2) a second operation for dividing a block for prediction processing into a square shape and setting a block obtained by the division as a block for orthogonal transformation.

具体的に直交変換ブロックサイズ決定部１０３は、差分演算部１０９から入力されるＰＵ単位の差分画像信号を用いて、残差係数符号化部１０４における直交変換処理を行う単位となるトランスフォームユニット（ＴＵ）と呼ばれるブロックのサイズを決定する。図２は選択可能なブロックのサイズを示す図である。図２に示されているように、例えばＣＵが６４×６４画素のブロックサイズの場合、ＴＵは３２×３２画素から４×４画素の４種類のブロックサイズの中から選択することができる。 Specifically, the orthogonal transform block size determination unit 103 uses a difference image signal in units of PUs input from the difference calculation unit 109 to perform a transform unit (a unit for performing orthogonal transform processing in the residual coefficient encoding unit 104 ( The size of a block called TU) is determined. FIG. 2 is a diagram showing selectable block sizes. As shown in FIG. 2, for example, when the CU has a block size of 64 × 64 pixels, the TU can be selected from four types of block sizes of 32 × 32 pixels to 4 × 4 pixels.

残差係数符号化部１０４は、設定した直交変換用のブロック毎に、直交変換および量子化処理し、残差係数を生成する。 The residual coefficient encoding unit 104 performs orthogonal transform and quantization processing for each set orthogonal transform block, and generates a residual coefficient.

具体的に残差係数符号化部１０４は、直交変換ブロックサイズ決定部１０３において決定されたブロックサイズのＴＵを処理単位として、差分演算部１０９において生成された差分画像信号に対して直交変換処理する。そして残差係数符号化部１０４は、得られた各周波数成分の直交変換係数に対し量子化処理を行うことで残差係数信号を生成する。 Specifically, the residual coefficient encoding unit 104 performs orthogonal transform processing on the difference image signal generated by the difference calculation unit 109 using the TU having the block size determined by the orthogonal transform block size determination unit 103 as a processing unit. . Then, the residual coefficient encoding unit 104 generates a residual coefficient signal by performing quantization processing on the obtained orthogonal transform coefficient of each frequency component.

残差係数復号化部１０５は、直交変換ブロックサイズ決定部１０３において決定されたブロックサイズのＴＵを処理単位として、残差係数符号化部１０４から入力される残差係数信号に対して逆量子化処理する。さらに残差係数復号化部１０５は、逆直交変換処理を行うことで再構成差分画像信号を生成する。 The residual coefficient decoding unit 105 performs inverse quantization on the residual coefficient signal input from the residual coefficient encoding unit 104 using the TU having the block size determined by the orthogonal transform block size determining unit 103 as a processing unit. Process. Further, the residual coefficient decoding unit 105 generates a reconstructed difference image signal by performing an inverse orthogonal transform process.

加算演算部１１０は、残差係数復号化部１０５から入力される再構成差分画像信号と、画面間予測処理部１０７から入力される予測画像信号とをＰＵ単位で加算することにより再構成画像信号を生成する。 The addition operation unit 110 adds the reconstructed difference image signal input from the residual coefficient decoding unit 105 and the predicted image signal input from the inter-screen prediction processing unit 107 in units of PUs, thereby adding a reconstructed image signal. Is generated.

ピクチャバッファ１０６は、現在の符号化対象ピクチャより時間的に後に符号化するピクチャの画面間予測処理で参照するために、加算演算部１１０から入力される再構成画像信号を格納する。 The picture buffer 106 stores the reconstructed image signal input from the addition operation unit 110 for reference in inter-picture prediction processing of a picture to be encoded temporally after the current encoding target picture.

符号列生成部１０８は、残差係数符号化部１０４から入力される残差係数信号、およびその他の復号化処理時に必要となる符号化情報信号に対して、可変長符号化および算術符号化を行うことで符号列を生成する。 The code string generation unit 108 performs variable length encoding and arithmetic encoding on the residual coefficient signal input from the residual coefficient encoding unit 104 and an encoded information signal necessary for other decoding processes. By doing so, a code string is generated.

（直交変換ブロックサイズ決定部および残差係数符号化部の具体的な説明）
ここで、直交変換ブロックサイズ決定部１０３および残差係数符号化部１０４において、ＴＵのブロックサイズを決定し、直交変換処理および量子化処理を行って残差係数信号を生成する方法について、図３のフローチャートを用いて具体的に説明する。 (Specific description of orthogonal transform block size determination unit and residual coefficient encoding unit)
Here, an orthogonal transform block size determination unit 103 and a residual coefficient encoding unit 104 determine a TU block size and perform orthogonal transform processing and quantization processing to generate a residual coefficient signal. This will be specifically described with reference to the flowchart of FIG.

まず、直交変換ブロックサイズ決定部１０３は、差分演算部１０９が生成するＰＵ単位の差分画像信号を入力として、符号化対象ＣＵ内の全ＰＵの差分画像の画素値の集計を行う（Ｓ３０１）。さらに直交変換ブロックサイズ決定部１０３は、集計結果より全画素値が閾値以下かどうかを判定する（Ｓ３０２）。 First, the orthogonal transform block size determination unit 103 receives the PU-specific difference image signal generated by the difference calculation unit 109, and totals the pixel values of the difference images of all PUs in the encoding target CU (S301). Further, the orthogonal transform block size determination unit 103 determines whether or not all pixel values are equal to or smaller than a threshold value based on the aggregation result (S302).

図４は、ステップＳ３０１およびステップＳ３０２における判定の例を示す図である。 FIG. 4 is a diagram illustrating an example of determination in step S301 and step S302.

図４（ａ）の例は、符号化対象ＣＵ内の各々の画素位置における差分画像の画素値をグラフにしたものであり、符号化対象ＣＵ内の１つめのＰＵをＰＵ１、２つめのＰＵをＰＵ２としている。ＰＵ１およびＰＵ２に属する全ての画素値が閾値以下の値であるため、ステップＳ３０２の判定の結果として全画素値が閾値以下であると判定される。 The example of FIG. 4A is a graph of the pixel values of the difference image at each pixel position in the encoding target CU. The first PU in the encoding target CU is represented by PU1, the second PU. Is PU2. Since all pixel values belonging to PU1 and PU2 are values equal to or smaller than the threshold value, it is determined that all pixel values are equal to or smaller than the threshold value as a result of the determination in step S302.

一方、図４（ｂ）の例では、ＰＵ１およびＰＵ２に属する画素値の一部の値が閾値よりも大きな値となっているため、ステップＳ３０２の判定の結果として全画素値が閾値以下でないと判定される。 On the other hand, in the example of FIG. 4B, since some of the pixel values belonging to PU1 and PU2 are larger than the threshold value, all pixel values are not less than or equal to the threshold value as a result of the determination in step S302. Determined.

ステップＳ３０２において全画素値が閾値以下でないと判定された場合、ステップＳ３０３において符号化対象ＣＵ内の各ＰＵを正方形状に分割したサイズをＴＵブロックサイズとして決定する。一方、ステップＳ３０２において全画素値が閾値以下であると判定された場合、ステップＳ３０４において符号化対象ＣＵ内の全ＰＵを正方形状に統合したサイズをＴＵブロックサイズとして決定する。 When it is determined in step S302 that all the pixel values are not equal to or smaller than the threshold value, in step S303, a size obtained by dividing each PU in the encoding target CU into a square shape is determined as a TU block size. On the other hand, when it is determined in step S302 that all pixel values are equal to or smaller than the threshold value, a size obtained by integrating all PUs in the encoding target CU into a square shape is determined as a TU block size in step S304.

図５は、ステップＳ３０３およびステップＳ３０４によってＴＵブロックサイズが決定される例を示す図である。図５の例では、符号化対象ＣＵ内のＰＵは、３２×８画素のＰＵと３２×２４画素のＰＵとの２つである。 FIG. 5 is a diagram illustrating an example in which the TU block size is determined in steps S303 and S304. In the example of FIG. 5, there are two PUs in the encoding target CU, a 32 × 8 pixel PU and a 32 × 24 pixel PU.

ステップＳ３０３においてＰＵを分割する場合、図５（ａ）のように、各々のＰＵ内で正方形状のＴＵが隙間なく敷き詰められるようにＴＵブロックサイズが決定される。図５（ａ）では、３２×８画素のＰＵは４つの８×８画素のＴＵに分割され、３２×２４画素のＰＵは４つの８×８画素のＴＵと２つの１６×１６画素のＴＵに分割されている。 When the PU is divided in step S303, as shown in FIG. 5A, the TU block size is determined so that square TUs are spread without gaps in each PU. In FIG. 5A, a 32 × 8 pixel PU is divided into four 8 × 8 pixel TUs, and a 32 × 24 pixel PU is divided into four 8 × 8 pixel TUs and two 16 × 16 pixel TUs. It is divided into

一方、ステップＳ３０４においてＰＵを統合する場合、図５（ｂ）のように、全てのＰＵを１つに統合した正方形状としてＴＵブロックサイズが決定される。図５（ｂ）では、３２×８画素のＰＵと３２×２４画素のＰＵが１つの３２×３２画素のＴＵに統合されている。 On the other hand, when PUs are integrated in step S304, the TU block size is determined as a square shape in which all PUs are integrated into one as shown in FIG. In FIG. 5B, a 32 × 8 pixel PU and a 32 × 24 pixel PU are integrated into one 32 × 32 pixel TU.

次に、残差係数符号化部１０４は、ステップ３０３もしくはステップＳ３０４によって決定された各々のＴＵに対してＴＵ単位で直交変換処理を行う（Ｓ３０５）。そして残差係数符号化部１０４は、得られた各周波数成分の直交変換係数に対しＴＵ単位で量子化処理する（Ｓ３０６）ことで残差係数信号を生成する。 Next, the residual coefficient encoding unit 104 performs orthogonal transform processing for each TU determined in step 303 or step S304 in units of TUs (S305). Then, the residual coefficient encoding unit 104 generates a residual coefficient signal by performing quantization processing on the obtained orthogonal transform coefficient of each frequency component in units of TU (S306).

直交変換処理を行う単位であるＴＵ内の全ての差分画像の画素値が同じような値である場合、直交変換処理によって生成される直交変換係数は低周波数成分の係数のみに値が集中する。そのため、量子化処理を行うことで高い効率で情報を圧縮することができる。 When the pixel values of all the difference images in the TU, which is a unit for performing the orthogonal transform process, have the same value, the orthogonal transform coefficients generated by the orthogonal transform process are concentrated only on the low frequency component coefficients. Therefore, information can be compressed with high efficiency by performing quantization processing.

本来、画面間予測処理はＰＵ単位で行われるため、ＰＵ毎に別々の領域から動き補償が行われて予測画像が生成され、その結果生成される差分画像もＰＵ毎に異なる画素構造を持つ。そのため１つのＰＵの中で閉じた小さなブロック単位で直交変換処理を行うことが一般的である。しかし、直交変換処理を行うブロックの数が増えると、ブロック毎に符号列に記載する情報が発生するため、発生符号量が大きくなってしまう。 Originally, since the inter-screen prediction process is performed in units of PUs, motion compensation is performed from different regions for each PU to generate a predicted image, and the resulting difference image also has a different pixel structure for each PU. Therefore, it is common to perform orthogonal transform processing in units of small blocks closed in one PU. However, when the number of blocks to be subjected to orthogonal transform processing increases, information to be described in the code string is generated for each block, so that the generated code amount increases.

ところが、画面間予測が効率良く行われ、符号化対象内の全てのＰＵの差分画像の画素値を極めて小さくすることができた場合、ＰＵを統合した大きなブロック単位で直交変換処理を行っても、低周波数成分の係数のみに値が集中するため画質を劣化させることなく発生符号量を小さくすることができる。 However, when inter-screen prediction is performed efficiently and the pixel values of the difference images of all PUs in the encoding target can be made extremely small, even if orthogonal transform processing is performed in units of large blocks that integrate PUs Since the values are concentrated only on the low frequency component coefficients, the generated code amount can be reduced without degrading the image quality.

本実施の形態では、符号化対象ＣＵ内の全ＰＵの差分画像の画素値が閾値以下のときのみＰＵを統合した大きなブロック単位で直交変換処理する。そのため、画面間予測が効率良く行われなかった場合は従来相当の符号化効率でありながら、画面間予測が効率良く行われた場合は極めて高い符号化効率で符号化することが可能となる。これにより、高画質化と発生符号量の削減を同時に実現することができる。 In the present embodiment, orthogonal transform processing is performed in units of large blocks in which PUs are integrated only when the pixel values of the difference images of all PUs in the encoding target CU are equal to or less than a threshold value. For this reason, when the inter-screen prediction is not performed efficiently, the encoding efficiency is equivalent to the conventional one. However, when the inter-screen prediction is performed efficiently, the encoding can be performed with extremely high encoding efficiency. As a result, high image quality and a reduction in the amount of generated codes can be realized at the same time.

（直交変換ブロックサイズ決定部および残差係数符号化部の別の形態１）
ここで、直交変換ブロックサイズ決定部１０３および残差係数符号化部１０４において、ＴＵのブロックサイズを決定し、直交変換処理および量子化処理を行って残差係数信号を生成する別の方法について、図６のフローチャートを用いて説明する。 (Another form 1 of the orthogonal transform block size determination unit and the residual coefficient encoding unit)
Here, another method for determining the block size of the TU in the orthogonal transform block size determination unit 103 and the residual coefficient encoding unit 104, and performing orthogonal transform processing and quantization processing to generate a residual coefficient signal, This will be described with reference to the flowchart of FIG.

ステップＳ３０１からステップＳ３０６における処理は図３を用いて説明した処理と全く同様であるためここでは説明を省略し、新たに追加されたステップＳ６０７に係る処理についてのみ詳しく説明する。 Since the processing from step S301 to step S306 is exactly the same as the processing described with reference to FIG. 3, the description thereof will be omitted here, and only the processing related to the newly added step S607 will be described in detail.

ステップＳ３０２において全画素値が閾値以下であると判定され、ステップＳ３０４において符号化対象ＣＵ内の全ＰＵを正方形状に統合したサイズをＴＵブロックサイズとして決定した場合、残差係数符号化部１０４において、ステップＳ３０５およびステップＳ３０６の処理を行わず、替わりにステップＳ６０７の処理が行われる。 If it is determined in step S302 that all pixel values are equal to or less than the threshold value, and the size obtained by integrating all the PUs in the CU to be encoded in a square shape is determined as the TU block size in step S304, the residual coefficient encoding unit 104 Instead of performing the processing of step S305 and step S306, the processing of step S607 is performed instead.

ステップＳ６０７では決定されたＴＵに対して、直交変換処理および量子化処理を行うことなく、残差係数符号化部１０４が生成する残差係数信号を全て値が０のものとする。 In step S607, the residual coefficient signals generated by the residual coefficient encoding unit 104 are all set to 0 without performing orthogonal transform processing and quantization processing on the determined TU.

ステップＳ６０７の処理が適用されるＴＵは、ステップＳ３０２において全ての差分画像の画素値が小さいことが分かっている。そのため、残差係数信号の値を全て０としても画質を劣化させることなく発生符号量を削減することが可能となる。さらに直交変換処理と量子化処理を行う必要がないため処理量を削減することが可能となる。 It is known that the TU to which the process of step S607 is applied has small pixel values of all the difference images in step S302. For this reason, it is possible to reduce the amount of generated codes without degrading the image quality even if the values of the residual coefficient signals are all zero. Furthermore, since it is not necessary to perform orthogonal transform processing and quantization processing, the processing amount can be reduced.

なお、ステップＳ６０７において、ステップＳ３０５の直交変換処理とステップＳ３０６の量子化処理と同様の処理を行って生成される残差係数信号に対して、後から全ての値を０に変更するという処理を行ってもよい。これにより、ステップＳ３０２の判定結果に依存することなく処理フローを共通化することが可能となり、処理量の削減効果はなくなるが処理フローの単純化を実現することが可能となる。 In step S607, a process of changing all values to 0 later is performed on the residual coefficient signal generated by performing the same process as the orthogonal transform process in step S305 and the quantization process in step S306. You may go. As a result, it is possible to share the processing flow without depending on the determination result in step S302, and it is possible to achieve simplification of the processing flow although the effect of reducing the processing amount is lost.

なお、上記においては残差係数の信号をすべて０とする例を示したが、０と近似できる値であればどのような値を利用しても構わない。 In the above description, the residual coefficient signals are all set to 0. However, any value may be used as long as the value can be approximated to 0.

（直交変換ブロックサイズ決定部および残差係数符号化部の別の形態２）
ここで、直交変換ブロックサイズ決定部１０３および残差係数符号化部１０４において、ＴＵのブロックサイズを決定し、直交変換処理および量子化処理を行って残差係数信号を生成するまた別の方法について、図７のフローチャートを用いて説明する。 (Another form 2 of the orthogonal transform block size determination unit and the residual coefficient encoding unit)
Here, another method for generating the residual coefficient signal by determining the block size of the TU in the orthogonal transform block size determining unit 103 and the residual coefficient encoding unit 104 and performing orthogonal transform processing and quantization processing. This will be described with reference to the flowchart of FIG.

ステップＳ３０３からステップＳ３０６における処理は図３を用いて説明した処理と全く同様であるためここでは説明を省略する。そして、図３におけるステップＳ３０１およびステップＳ３０２に替わって新たに追加されたステップＳ７０１およびステップＳ７０２に係る処理についてのみ詳しく説明する。 Since the processing from step S303 to step S306 is exactly the same as the processing described with reference to FIG. 3, the description thereof is omitted here. Only the processing relating to steps S701 and S702 newly added in place of steps S301 and S302 in FIG. 3 will be described in detail.

直交変換ブロックサイズ決定部１０３は、差分演算部１０９が生成するＰＵ単位の差分画像信号を入力として、符号化対象ＣＵ内の全ＰＵの差分画像の画素値の最大値と最小値を抽出する（Ｓ７０１）。さらに直交変換ブロックサイズ決定部１０３は、抽出された最大値と最小値の幅が閾値以下かどうかを判定する（Ｓ７０２）。 The orthogonal transform block size determination unit 103 receives the difference image signal for each PU generated by the difference calculation unit 109 and extracts the maximum value and the minimum value of the pixel values of the difference images of all PUs in the encoding target CU ( S701). Further, the orthogonal transform block size determination unit 103 determines whether or not the width of the extracted maximum value and minimum value is equal to or smaller than a threshold value (S702).

図８は、ステップＳ７０１およびステップＳ７０２における判定の例を示す図である。 FIG. 8 is a diagram illustrating an example of determination in step S701 and step S702.

図８（ａ）の例は、符号化対象ＣＵ内の各々の画素位置における差分画像の画素値をグラフにしたものであり、符号化対象ＣＵ内の１つめのＰＵをＰＵ１、２つめのＰＵをＰＵ２としている。ＰＵ１およびＰＵ２に属する全ての画素値の最大値と最小値の幅が閾値以下の値であるため、ステップＳ７０２の判定の結果として最大値と最小値の幅が閾値以下であると判定される。 In the example of FIG. 8A, the pixel values of the difference image at each pixel position in the encoding target CU are graphed, and the first PU in the encoding target CU is represented by PU1, the second PU. Is PU2. Since the widths of the maximum value and the minimum value of all the pixel values belonging to PU1 and PU2 are equal to or less than the threshold value, it is determined that the width of the maximum value and the minimum value is equal to or less than the threshold value as a result of the determination in step S702.

一方、図４（ｂ）の例では、ＰＵ１およびＰＵ２に属する全ての画素値の最大値と最小値の幅が閾値より大きい値であるため、ステップＳ７０２の判定の結果として最大値と最小値の幅が閾値以下でないと判定される。 On the other hand, in the example of FIG. 4B, since the maximum value and the minimum value width of all the pixel values belonging to PU1 and PU2 are larger than the threshold value, as a result of the determination in step S702, the maximum value and the minimum value are set. It is determined that the width is not less than the threshold value.

ステップＳ７０２において最大値と最小値の幅が閾値以下でないと判定された場合、ステップＳ３０３において符号化対象ＣＵ内の各ＰＵを正方形状に分割したサイズをＴＵブロックサイズとして決定する。一方、ステップＳ７０２において最大値と最小値の幅が閾値以下であると判定された場合、ステップＳ３０４において符号化対象ＣＵ内の全ＰＵを正方形状に統合したサイズをＴＵブロックサイズとして決定する。 If it is determined in step S702 that the width between the maximum value and the minimum value is not equal to or smaller than the threshold value, a size obtained by dividing each PU in the encoding target CU into a square shape is determined as a TU block size in step S303. On the other hand, if it is determined in step S702 that the width between the maximum value and the minimum value is equal to or smaller than the threshold value, a size obtained by integrating all PUs in the encoding target CU into a square shape is determined as a TU block size in step S304.

図４（ａ）の例と図８（ａ）の例とを比較すると、図８（ａ）の例の方が、各々の画素値は大きな値を持っている。しかし、ばらつきを比較した場合、図８（ａ）に示す例は、図４（ａ）と同様に小さいため、直交変換処理によって生成される直交変換係数は低周波数成分の係数のみに値が集中する。そのため、ＰＵを統合した大きなブロック単位で直交変換処理を行っても、画質を劣化させることなく発生符号量を小さくすることができる。 Comparing the example of FIG. 4A and the example of FIG. 8A, each pixel value has a larger value in the example of FIG. 8A. However, when variations are compared, the example shown in FIG. 8A is as small as FIG. 4A, and the orthogonal transform coefficients generated by the orthogonal transform process are concentrated only in the low-frequency component coefficients. To do. For this reason, even if orthogonal transform processing is performed in units of large blocks in which PUs are integrated, the amount of generated codes can be reduced without degrading image quality.

（直交変換ブロックサイズ決定部および残差係数符号化部の別の形態３）
ここで、直交変換ブロックサイズ決定部１０３および残差係数符号化部１０４において、ＴＵのブロックサイズを決定し、直交変換処理および量子化処理を行って残差係数信号を生成するまた別の方法について、図９のフローチャートを用いて説明する。 (Another form 3 of the orthogonal transform block size determination unit and the residual coefficient encoding unit)
Here, another method for generating the residual coefficient signal by determining the block size of the TU in the orthogonal transform block size determining unit 103 and the residual coefficient encoding unit 104 and performing orthogonal transform processing and quantization processing. This will be described with reference to the flowchart of FIG.

ステップＳ３０３からステップＳ３０６における処理は図３を用いて説明した処理と全く同様であるためここでは説明を省略する。そして、図３におけるステップＳ３０１およびステップＳ３０２に替わって新たに追加されたステップＳ９０１およびステップＳ９０２に係る処理についてのみ詳しく説明する。 Since the processing from step S303 to step S306 is exactly the same as the processing described with reference to FIG. 3, the description thereof is omitted here. Only the processing relating to steps S901 and S902 newly added in place of steps S301 and S302 in FIG. 3 will be described in detail.

直交変換ブロックサイズ決定部１０３は、差分演算部１０９が生成するＰＵ単位の差分画像信号を入力として、符号化対象ＣＵ内のＰＵとＰＵの境界の差分画像の画素値の差である境界差分値を抽出する（Ｓ９０１）。さらに直交変換ブロックサイズ決定部１０３は、抽出された全境界差分値が閾値以下かどうかを判定する（Ｓ９０２）。 The orthogonal transform block size determination unit 103 receives a difference image signal in units of PUs generated by the difference calculation unit 109 as an input, and a boundary difference value that is a difference between pixel values of a difference image between the PU and the PU in the encoding target CU. Is extracted (S901). Further, the orthogonal transform block size determination unit 103 determines whether or not the extracted total boundary difference value is equal to or smaller than a threshold value (S902).

図１０は、ステップＳ９０１およびステップＳ９０２における判定の例を示す図である。 FIG. 10 is a diagram illustrating an example of determination in step S901 and step S902.

図１０（ａ）の例は、符号化対象ＣＵ内の各々の画素位置における差分画像の画素値をグラフにしたものであり、符号化対象ＣＵ内の１つめのＰＵをＰＵ１、２つめのＰＵをＰＵ２としている。ＰＵ１とＰＵ２の境界に隣接する画素値の差（境界差分値）が閾値以下となっていることを示している。図１０（ａ）の例では画素位置を１次元のグラフで表現している。そのため、境界差分値が１つしかないが、本来、処理対象とする差分画像は２次元の画像である。よって、境界差分値はＰＵとＰＵの境界に隣接する画素のペアの数だけ存在する。それらの全ての境界差分値が同様に閾値以下であると、ステップＳ９０２の判定の結果として全境界差分値が閾値以下であると判定される。 The example of FIG. 10A is a graph of the pixel value of the difference image at each pixel position in the encoding target CU. The first PU in the encoding target CU is represented by PU1, the second PU. Is PU2. It shows that the difference between pixel values adjacent to the boundary between PU1 and PU2 (boundary difference value) is less than or equal to the threshold value. In the example of FIG. 10A, the pixel position is represented by a one-dimensional graph. Therefore, although there is only one boundary difference value, the difference image to be processed is originally a two-dimensional image. Therefore, there are boundary difference values as many as the number of pixel pairs adjacent to the boundary between PU and PU. If all the boundary difference values are similarly equal to or less than the threshold value, it is determined that all the boundary difference values are equal to or less than the threshold value as a result of the determination in step S902.

一方、図１０（ｂ）の例では、ＰＵ１とＰＵ２の境界に隣接する画素値の差（境界差分値）が閾値より大きい値であるため、ステップＳ９０２の判定の結果として全境界差分値が閾値以下でないと判定される。 On the other hand, in the example of FIG. 10B, since the difference (boundary difference value) between the pixel values adjacent to the boundary between PU1 and PU2 is larger than the threshold value, as a result of the determination in step S902, the entire boundary difference value is the threshold value. It is determined that it is not below.

ステップＳ９０２において全境界差分値が閾値以下でないと判定された場合、ステップＳ３０３において符号化対象ＣＵ内の各ＰＵを正方形状に分割したサイズをＴＵブロックサイズとして決定する。一方、ステップＳ９０２において全境界差分値が閾値以下であると判定された場合、ステップＳ３０４において符号化対象ＣＵ内の全ＰＵを正方形状に統合したサイズをＴＵブロックサイズとして決定する。 If it is determined in step S902 that the total boundary difference value is not equal to or smaller than the threshold value, the size obtained by dividing each PU in the encoding target CU into a square shape is determined as a TU block size in step S303. On the other hand, if it is determined in step S902 that the total boundary difference value is equal to or smaller than the threshold value, a size obtained by integrating all PUs in the encoding target CU into a square shape is determined as a TU block size in step S304.

図１０（ｂ）の例のように、ＰＵ１とＰＵ２の境界に隣接する画素値の差が大きな値であると、ＰＵ１とＰＵ２を統合した大きなブロック単位で直交変換処理を行った場合に、直交変換係数の高周波数成分の係数にも情報が分散されてしまう。そのため量子化処理を行っても生成される残差係数が多く発生してしまい、高い効率で情報を圧縮することができなくなる。 If the difference between the pixel values adjacent to the boundary between PU1 and PU2 is a large value as in the example of FIG. 10B, when orthogonal transformation processing is performed in units of large blocks in which PU1 and PU2 are integrated, Information is also dispersed in the coefficient of the high frequency component of the conversion coefficient. For this reason, many residual coefficients are generated even if the quantization process is performed, and information cannot be compressed with high efficiency.

一方、図１０（ａ）の例のように、ＰＵ１とＰＵ２の境界に隣接する画素値の差が小さな値であると、ＰＵ１とＰＵ２を統合した大きなブロック単位で直交変換処理を行った場合でも、直交変換係数の高周波数成分の係数に情報が分散される可能性が低くなる。そのため、画質を劣化させることなく発生符号量を小さくすることができる。 On the other hand, as in the example of FIG. 10A, when the difference between the pixel values adjacent to the boundary between PU1 and PU2 is small, even when orthogonal transformation processing is performed in units of large blocks in which PU1 and PU2 are integrated. Therefore, the possibility that information is dispersed in the coefficient of the high frequency component of the orthogonal transform coefficient is reduced. Therefore, the generated code amount can be reduced without degrading the image quality.

（まとめ）
本実施形態における動画像符号化装置１００は、符号化対象ピクチャを画面間予測処理が含まれた所定の符号化規格にて符号化し、符号列を生成する動画像符号化装置であって、符号化対象ピクチャを構成する画素を、予め設定される複数のブロックサイズのうちいずれかのサイズを有する出力用のブロック毎に出力するブロック分割部１０２と、出力用のブロックに属する画素に対して、出力用のブロックサイズ以下のブロックサイズを有する予測処理用のブロックを設定し、この予測処理用のブロック毎に予測画像を生成する画面間予測処理部１０７と、予測処理用のブロック毎に生成された予測画像と、当該予測画像のそれぞれに対応する出力用のブロックに属する画素とを差分演算し、予測処理用のブロック単位で差分画像を生成する差分演算部１０９と、予測処理用のブロックのうち隣接するブロックに属する差分画像に基づいて、（１）隣接する予測処理用のブロックを統合し得られる正方形状のブロックを直交変換用のブロックとして設定する第１動作と、（２）予測処理用のブロックを正方形状に分割し、分割して得られたブロックを直交変換用のブロックとして設定する第２動作を実行する直交変換ブロックサイズ決定部１０３と、設定した直交変換用のブロック毎に、直交変換および量子化処理し、残差係数を生成する残差係数符号化部１０４と、を備える。 (Summary)
A moving image encoding apparatus 100 according to the present embodiment is a moving image encoding apparatus that generates a code string by encoding an encoding target picture according to a predetermined encoding standard including inter-screen prediction processing. A block dividing unit 102 for outputting a pixel constituting a picture to be processed for each output block having any one of a plurality of block sizes set in advance, and pixels belonging to the output block, A prediction processing block having a block size equal to or smaller than the output block size is set, and an inter-screen prediction processing unit 107 that generates a prediction image for each prediction processing block, and is generated for each prediction processing block. The prediction image and the pixels belonging to the output block corresponding to each of the prediction images are subjected to a difference calculation, and a difference image is generated in units of prediction processing blocks. Based on the difference calculation unit 109 and a difference image belonging to an adjacent block among the prediction processing blocks, (1) a square block obtained by integrating adjacent prediction processing blocks is used as an orthogonal transform block. An orthogonal transform block size determination unit that executes a first operation to be set and (2) a second operation to divide a block for prediction processing into a square shape and set a block obtained by the division as a block for orthogonal transform 103 and a residual coefficient encoding unit 104 that performs orthogonal transform and quantization processing for each set orthogonal transform block and generates a residual coefficient.

また本実施形態における直交変換ブロックサイズ決定部１０３は、属するすべての差分画像の画素値が所定の閾値以下である予測処理用のブロックが隣接する場合、この隣接する予測処理用のブロックを統合し正方形状の直交変換用のブロックとして生成する。 Also, the orthogonal transform block size determination unit 103 in the present embodiment integrates the adjacent prediction processing blocks when the prediction processing blocks whose pixel values of all the difference images belong to the predetermined threshold value are adjacent to each other. It is generated as a square-shaped orthogonal transform block.

また本実施形態における残差係数符号化部１０４は、直交変換ブロックサイズ決定部１０３において、隣接する予測処理用のブロックを統合し正方形状の直交変換用のブロックとして生成した場合、統合後における直交変換用のブロックに属するすべての残差係数を０として生成する。 In addition, when the orthogonal transform block size determining unit 103 integrates adjacent prediction processing blocks and generates a square-shaped orthogonal transform block, the residual coefficient encoding unit 104 according to the present embodiment performs orthogonality after integration. All residual coefficients belonging to the block for conversion are generated as 0.

本実施形態における直交変換ブロックサイズ決定部１０３は、属する差分画像の画素値の最小値と最大値の幅が所定の範囲内の値である予測処理用のブロックが隣接する場合、この隣接する予測処理用のブロックを統合し正方形状の直交変換用のブロックとして生成する。 In the present embodiment, the orthogonal transform block size determination unit 103, when the blocks for prediction processing in which the width of the minimum value and the maximum value of the difference image to which the difference image belongs are values within a predetermined range are adjacent to each other, The processing blocks are integrated to generate a square orthogonal transform block.

本実施形態における直交変換ブロックサイズ決定部１０３は、隣接する予測処理用のブロックに属する差分画像の画素値のうち、隣接境界を挟む画素値の差が所定の閾値以下の場合、この隣接する予測処理用のブロックを統合し正方形状の直交変換用のブロックとして生成する。
（その他の実施形態）
さらに、上記実施の形態で示した動画像符号化装置に含まれる各手段と同等の機能を備えるプログラムを、フレキシブルディスク等の記録媒体に記録するようにすることにより、上記実施の形態で示した処理を、独立したコンピュータシステムにおいて簡単に実施することが可能となる。なお、記録媒体としてはフレキシブルディスクに限らず、光ディスク、ＩＣカード、ＲＯＭカセット等、プログラムを記録できるものであれば同様に実施することができる。 The orthogonal transform block size determination unit 103 according to the present embodiment, when the difference between the pixel values sandwiching the adjacent boundary among the pixel values of the difference image belonging to the adjacent prediction processing block is equal to or less than a predetermined threshold, The processing blocks are integrated to generate a square orthogonal transform block.
(Other embodiments)
Further, the program having the same function as each unit included in the moving picture encoding apparatus shown in the above embodiment is recorded on a recording medium such as a flexible disk, thereby showing the above embodiment. Processing can be easily performed in an independent computer system. The recording medium is not limited to a flexible disk, and can be similarly implemented as long as it can record a program, such as an optical disk, an IC card, and a ROM cassette.

また、上記実施の形態で示した動画像符号化装置に含まれる各手段と同等の機能を集積回路であるＬＳＩとして実現してもよい。これらは一部または全てを含むように１チップ化されてもよい。またＬＳＩは集積度の違いにより、ＩＣ、システムＬＳＩ、スーパーＬＳＩ、ウルトラＬＳＩと称されることもある。 In addition, a function equivalent to each unit included in the moving picture coding apparatus described in the above embodiment may be realized as an LSI which is an integrated circuit. These may be integrated into one chip so as to include a part or all of them. An LSI may also be called an IC, a system LSI, a super LSI, or an ultra LSI depending on the degree of integration.

また、集積回路化の手法はＬＳＩに限るものではなく、専用回路または汎用プロセッサで実現しても良い。ＬＳＩ製造後に、プログラムすることが可能なＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）や、ＬＳＩ内部の回路セルの接続や設定を再構成可能なリコンフィギュラブル・プロセッサを利用しても良い。 Further, the method of circuit integration is not limited to LSI's, and implementation using dedicated circuitry or general purpose processors is also possible. An FPGA (Field Programmable Gate Array) that can be programmed after manufacturing the LSI, or a reconfigurable processor that can reconfigure the connection and setting of circuit cells inside the LSI may be used.

さらには、半導体技術の進歩または派生する別技術によりＬＳＩなどに置き換わる集積回路の技術が登場すれば、当然、その技術を用いて機能ブロックの集積化を行ってもよい。 Furthermore, if integrated circuit technology that replaces LSI or the like appears as a result of progress in semiconductor technology or other derived technology, it is naturally also possible to carry out function block integration using this technology.

また、上記実施の形態に係る、動画像符号化装置、またはその変形例の機能のうち少なくとも一部を組み合わせてもよい。 Moreover, you may combine at least one part among the functions of the moving image encoder which concerns on the said embodiment, or its modification.

本発明は、例えば、ビデオカメラ、デジタルカメラ、ビデオレコーダ、携帯電話、及びパーソナルコンピューター等における、入力画像を構成する各ピクチャを符号化して動画像符号化データとして出力する動画像符号化装置として有用である。 INDUSTRIAL APPLICABILITY The present invention is useful as a moving image encoding apparatus that encodes each picture constituting an input image and outputs it as moving image encoded data in, for example, a video camera, a digital camera, a video recorder, a mobile phone, and a personal computer. It is.

１００動画像符号化装置
１０１ピクチャメモリ
１０２ブロック分割部
１０３直交変換ブロックサイズ決定部
１０４残差係数符号化部
１０５残差係数復号化部
１０６ピクチャバッファ
１０７画面間予測処理部
１０８符号列生成部
１０９差分演算部
１１０加算演算部 DESCRIPTION OF SYMBOLS 100 Moving image encoding apparatus 101 Picture memory 102 Block division part 103 Orthogonal transformation block size determination part 104 Residual coefficient encoding part 105 Residual coefficient decoding part 106 Picture buffer 107 Inter-screen prediction process part 108 Code stream generation part 109 Difference Calculation unit 110 Addition calculation unit

Claims

A video encoding device that encodes a picture to be encoded with a predetermined encoding standard including inter-screen prediction processing and generates a code string,
A block dividing unit that outputs pixels constituting the encoding target picture for each output block having any one of a plurality of block sizes set in advance;
Inter-screen prediction processing for setting a prediction processing block having a block size equal to or smaller than the output block size for pixels belonging to the output block and generating a prediction image for each block for the prediction processing And
The difference calculation is performed between the prediction image generated for each block for prediction processing and the pixels belonging to the output block corresponding to each of the prediction images, and a difference image is generated in units of blocks for the prediction processing. A difference calculation unit;
Based on the difference image belonging to an adjacent block among the prediction processing blocks, (1) a square block obtained by integrating the adjacent prediction processing blocks is set as a block for orthogonal transformation. (2) an orthogonal transform block size determining unit that executes a second operation of dividing the block for prediction processing into a square shape and setting the block obtained by the division as a block for orthogonal transform;
For each of the set orthogonal transform blocks, a residual coefficient encoding unit that performs orthogonal transform and quantization processing to generate a residual coefficient, and
Video encoding device.

The orthogonal transform block size determination unit integrates the adjacent prediction processing blocks when the pixel values of all the difference images to which the pixel values are equal to or less than a predetermined threshold are adjacent to each other, and forms a square orthogonal transform. The moving picture coding apparatus according to claim 1, wherein the moving picture coding apparatus is generated as a block for use.

In the orthogonal transform block size determination unit, the residual coefficient encoding unit integrates adjacent prediction processing blocks and generates a square orthogonal transform block. The moving picture encoding apparatus according to claim 2, wherein all the residual coefficients belonging thereto are generated as zero.

The orthogonal transform block size determination unit, when a block for prediction processing in which the width of the minimum value and the maximum value of the difference image to which the image belongs belongs is a value within a predetermined range is adjacent to the block for prediction processing The moving picture coding apparatus according to claim 1, wherein the blocks are integrated and generated as a square-shaped orthogonal transform block.

The orthogonal transform block size determination unit, when the difference between the pixel values sandwiching the adjacent boundary among the pixel values of the difference image belonging to the adjacent prediction processing block is equal to or smaller than a predetermined threshold, the adjacent prediction processing block The moving picture coding apparatus according to claim 1, wherein the blocks are integrated and generated as a square-shaped orthogonal transform block.

A video encoding method for encoding a picture to be encoded with a predetermined encoding standard including inter-screen prediction processing and generating a code string,
Output pixels constituting the encoding target picture for each output block having any one of a plurality of preset block sizes,
For a pixel belonging to the output block, set a block for prediction processing having a block size equal to or smaller than the block size for output, and generate a prediction image for each block for prediction processing,
The difference calculation is performed between the prediction image generated for each block for prediction processing and the pixel belonging to the output block corresponding to each prediction image, and a difference image is generated in units of the block for prediction processing. ,
Based on the difference image belonging to an adjacent block among the prediction processing blocks, (1) a square block obtained by integrating the adjacent prediction processing blocks is set as a block for orthogonal transformation. And (2) performing a second operation of dividing the block for prediction processing into a square shape and setting a block obtained by the division as a block for orthogonal transformation,
For each of the set orthogonal transform blocks, orthogonal transform and quantization processing are performed to generate a residual coefficient.
Video encoding method.