JP6339977B2

JP6339977B2 - Video encoding apparatus and video encoding program

Info

Publication number: JP6339977B2
Application number: JP2015141395A
Authority: JP
Inventors: 優也大森; 卓佐野; 大西　隆之; 隆之大西; 清水　淳; 淳清水
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2015-07-15
Filing date: 2015-07-15
Publication date: 2018-06-06
Anticipated expiration: 2035-07-15
Also published as: JP2017028337A

Description

本発明は、動き予測を用いて映像符号化を行う映像符号化装置及び映像符号化プログラムに関する。 The present invention relates to a video encoding device and a video encoding program that perform video encoding using motion prediction.

映像符号化技術は、ＭＰＥＧ−２、ＭＰＥＧ−４、ＭＰＥＧ−４／ＡＶＣが多く用いられており、最近では次世代の映像符号化規格であるＨＥＶＣ（High Efficiency Video Coding）が規格化され、今後の普及が見込まれる。映像符号化規格では、１つのピクチャ内に閉じた情報を用いて符号化を行う画面内符号化と、時間的に連続した複数のピクチャを用いて符号化を行う画面間符号化が用いられている。画面間符号化には画面間の差分値を削減するために動き予測処理を行い、差分値と動きベクトル情報を符号化することで情報量を削減している。 MPEG-2, MPEG-4, and MPEG-4 / AVC are often used as video encoding technologies. Recently, the next generation video encoding standard, HEVC (High Efficiency Video Coding), has been standardized. Is expected to spread. In the video coding standard, intra-frame coding that performs coding using information closed in one picture and inter-screen coding that performs coding using a plurality of temporally continuous pictures are used. Yes. In inter-frame coding, motion prediction processing is performed to reduce the difference value between the screens, and the information amount is reduced by encoding the difference value and the motion vector information.

画面間符号化における動き予測では、映像の正しい動きを捉えて動き予測を行い符号化すべき情報を小さくするため、動きベクトルの検出処理に膨大な演算量を必要とする。少ない演算量で高精度な動きベクトルを得る手法としては、階層的動きベクトル検出法等が用いられる。階層的動きベクトル検出法では、符号化対象ブロックの大まかな動きを捕える事前動き予測処理を行い、事前動き予測処理から得られた探索結果を元にして再度より高精度な動き予測を行うことで、動きベクトル探索に要する演算量を削減している。 In motion prediction in inter-frame coding, a motion vector is detected and motion prediction is performed to reduce the information to be encoded, so that a large amount of calculation is required for motion vector detection processing. As a technique for obtaining a highly accurate motion vector with a small amount of calculation, a hierarchical motion vector detection method or the like is used. In the hierarchical motion vector detection method, a preliminary motion prediction process that captures the rough motion of the encoding target block is performed, and a higher-precision motion prediction is performed again based on the search result obtained from the preliminary motion prediction process. The amount of calculation required for motion vector search is reduced.

また、動き予測処理では符号化ブロックごとにピクチャ間の動きベクトルを検出しているが、ＭＰＥＧ−４／ＡＶＣ、ＨＥＶＣでは各符号化ブロックのサイズを複数のブロックサイズ候補の中から選択することができる。ＭＰＥＧ−４／ＡＶＣでは、符号化処理はマクロブロックという１６×１６画素単位で行われ、予測処理単位は１６×１６、８×８、４×４の３種類のブロックサイズ候補が選択できる。ＨＥＶＣでは、符号化処理はコーディングユニット（ＣＵ）という単位で６４×６４、３２×３２、１６×１６、８×８の４種類のブロックサイズ候補で行われ、画面間予測においてはＣＵをさらに８種類の形状に分割してプレディクションユニット（ＰＵ）という単位で処理を行う。各ブロックサイズで動きベクトル探索を行うことで、より効率の高い画面間符号化が可能できる。 In the motion prediction process, a motion vector between pictures is detected for each coding block. In MPEG-4 / AVC and HEVC, the size of each coding block can be selected from a plurality of block size candidates. it can. In MPEG-4 / AVC, encoding processing is performed in units of 16 × 16 pixels called macroblocks, and three types of block size candidates of 16 × 16, 8 × 8, and 4 × 4 can be selected as prediction processing units. In HEVC, encoding processing is performed with four types of block size candidates of 64 × 64, 32 × 32, 16 × 16, and 8 × 8 in units of coding units (CU), and an additional CU is used for inter-screen prediction. The process is divided into different shapes and processed in units called prediction units (PU). By performing motion vector search with each block size, more efficient inter-frame coding can be performed.

一方で、複数のブロックサイズ候補の全てを考慮して動きベクトル探索を行い最も高効率な画面間予測方法を決定する処理は、単一のブロックサイズに限定された動き予測処理の場合に比較し、演算処理量が大幅に増大するという問題がある。これに対し、小さいブロックサイズから順次統合してコストを比較していくのではなく、任意の２点のブロックサイズのコストを算出して比較することでブロックサイズを早期に決定し、動き予測に要する演算量を削減する手法が考案されている（例えば、特許文献１参照）。 On the other hand, the process of determining the most efficient inter-frame prediction method by performing a motion vector search considering all of the plurality of block size candidates is compared with the case of the motion prediction process limited to a single block size. There is a problem that the amount of calculation processing increases significantly. On the other hand, instead of sequentially integrating costs starting from a small block size, the cost of any two block sizes is calculated and compared to determine the block size at an early stage for motion prediction. A technique for reducing the amount of calculation required has been devised (see, for example, Patent Document 1).

特開２０１４−１２７８９１号公報JP 2014-127891 A

しかしながら、特許文献１に記載の手法はＲＤ（Rate Distortion）コストを用いるためコスト値取得に対象ブロックの符号化結果が必要であり、また、階層的動きベクトル検出法と併用する場合最後の階層の動き探索時までブロックサイズを絞込まないため、演算量が膨大となる。さらに、特許文献１に記載の映像符号化装置は符号化対象ブロックによって動き予測に要する演算量が変化するため、ハードウェア実装の観点では、ハードウェア規模増大やハードウェアの使用効率低下につながるという問題がある。 However, since the method described in Patent Document 1 uses an RD (Rate Distortion) cost, an encoding result of the target block is necessary for cost value acquisition, and when used in combination with the hierarchical motion vector detection method, Since the block size is not narrowed down until motion search, the amount of calculation becomes enormous. Furthermore, since the amount of calculation required for motion prediction varies depending on the encoding target block in the video encoding device described in Patent Document 1, from the viewpoint of hardware implementation, it leads to an increase in hardware scale and a decrease in hardware usage efficiency. There's a problem.

本発明は、このような事情に鑑みてなされたもので、符号化効率の低下を抑えつつ演算量の削減を行うことができる映像符号化装置及び映像符号化プログラムを提供することを目的とする。 The present invention has been made in view of such circumstances, and an object of the present invention is to provide a video encoding device and a video encoding program capable of reducing the amount of calculation while suppressing a decrease in encoding efficiency. .

本発明の一態様は、入力映像信号のピクチャの時間的相関を利用し、符号化対象ピクチャについて符号化ブロック単位に動き予測を行って差分信号の符号化処理を行う映像符号化装置であって、前記符号化対象ピクチャと動き予測先の復号映像信号の参照ピクチャを用いて、対象符号化ブロックの大まかな動きを捕える事前動き予測処理を全符号化ブロックサイズそれぞれについて行い、各符号化ブロックサイズにおける事前動き予測処理コストを算出する事前動き予測処理手段と、前記事前動き予測処理コストに基づいて、第１の符号化ブロックサイズ候補におけるブロックサイズ候補コストと第２の符号化ブロックサイズ候補におけるブロックサイズ候補コストを決定するブロックサイズ候補コスト手段と、前記入力映像信号の符号化済みピクチャの動きベクトル情報に基づいて、動き統計パラメータを決定するパラメータ決定手段と、前記動き統計パラメータと、前記符号化対象ピクチャの参照構造における階層の深さとに基づいて、コスト比較オフセット値を設定するオフセット値設定手段と、前記第１の符号化ブロックサイズ候補におけるブロックサイズ候補コストと、前記第２の符号化ブロックサイズ候補におけるブロックサイズ候補コストと、前記コスト比較オフセット値とに基づいて、前記符号化対象ピクチャの符号化ブロックのサイズ候補を全ブロックサイズ候補の中から第１のブロックサイズ候補または第２のブロックサイズ候補のいずれかに決定するブロックサイズ候補決定手段と、決定された符号化ブロックサイズ候補を用いて、前記符号化対象ピクチャの動き予測処理を行いブロックサイズ決定及び動きベクトル決定を行う動き探索手段とを備える映像符号化装置である。 One aspect of the present invention is a video encoding apparatus that performs temporal prediction of a picture to be encoded, performs motion prediction for each encoding block, and encodes a differential signal using a temporal correlation of a picture of an input video signal. Then, using the reference picture of the encoding target picture and the decoded picture signal of the motion prediction destination, a pre-motion prediction process for capturing a rough motion of the target encoding block is performed for each of all encoding block sizes, and each encoding block size is determined. A pre-motion prediction processing means for calculating a pre-motion prediction processing cost in the block, and a block size candidate cost in the first coding block size candidate and a second coding block size candidate based on the pre-motion prediction processing cost Block size candidate cost means for determining a block size candidate cost, and the encoded input video signal A cost comparison offset value is set based on parameter determination means for determining a motion statistical parameter based on the motion vector information of the picture, the motion statistical parameter, and the depth of the hierarchy in the reference structure of the encoding target picture. Based on the offset value setting means, the block size candidate cost in the first encoded block size candidate, the block size candidate cost in the second encoded block size candidate, and the cost comparison offset value, the code Block size candidate determining means for determining a coding block size candidate of the encoding target picture as one of the first block size candidate and the second block size candidate from all block size candidates, and the determined coding block Using the size candidates, the encoding target picture A video encoding apparatus and a motion search unit to perform block size determination and motion vector decision performs motion prediction processing.

本発明の一態様は、前記映像符号化装置であって、前記パラメータ決定手段は、前記入力映像信号の指定された符号化済みピクチャにおける全ての動きベクトルを算出し、前記動きベクトルの絶対値和を前記符号化対象ピクチャの動き統計パラメータとして決定する。 One aspect of the present invention is the video encoding device, wherein the parameter determination unit calculates all motion vectors in a specified encoded picture of the input video signal, and sums absolute values of the motion vectors. Are determined as motion statistics parameters of the picture to be encoded.

本発明の一態様は、前記映像符号化装置であって、前記パラメータ決定手段は、前記入力映像信号の指定された符号化済みピクチャにおける全ての動きベクトルを算出し、前記動きベクトルの分散値を前記符号化対象ピクチャの動き統計パラメータとして決定する。 One aspect of the present invention is the video encoding device, wherein the parameter determining unit calculates all motion vectors in a specified encoded picture of the input video signal, and calculates a variance value of the motion vector. It is determined as a motion statistical parameter of the picture to be encoded.

本発明の一態様は、前記映像符号化装置であって、前記ブロックサイズ候補決定手段は、前記第２のブロックサイズ候補における動きベクトルのコストと前記コスト比較オフセット値との和を算出し、算出した前記和と前記第１のブロックサイズ候補における動きベクトルのコストとを比較した結果、前記第１のブロックサイズ候補におけるコストの方が大きいと判定された場合、前記符号化対象ピクチャのブロックサイズ候補を第２のブロックサイズ候補とし、前記第１のブロックサイズ候補におけるコストの方が小さいと判定された場合、前記符号化対象ピクチャのブロックサイズ候補を第１のブロックサイズ候補とする。 One aspect of the present invention is the video encoding device, wherein the block size candidate determining unit calculates a sum of a cost of a motion vector and the cost comparison offset value in the second block size candidate. If the cost of the first block size candidate is determined to be larger as a result of comparing the sum and the cost of the motion vector in the first block size candidate, the block size candidate of the encoding target picture Is the second block size candidate, and if it is determined that the cost of the first block size candidate is smaller, the block size candidate of the coding target picture is set as the first block size candidate.

本発明の一態様は、コンピュータを、前記映像符号化装置として機能させるための映像符号化プログラムである。 One aspect of the present invention is a video encoding program for causing a computer to function as the video encoding device.

本発明によれば、符号化ブロックサイズ候補が複数存在するような動き予測部を有する映像符号化方式において、符号化対象ピクチャの参照構造における階層の深さと符号化済みピクチャの動きベクトル情報に基づいてブロックサイズ候補を適切に決定することで、符号化効率の低下を抑えつつ演算量を削減することができるという効果が得られる。 According to the present invention, in a video encoding system having a motion prediction unit in which there are a plurality of encoding block size candidates, based on the layer depth in the reference structure of the encoding target picture and the motion vector information of the encoded picture. By appropriately determining the block size candidates, it is possible to reduce the amount of calculation while suppressing a decrease in encoding efficiency.

本発明の一実施形態による映像符号化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the video coding apparatus by one Embodiment of this invention. 図１に示すインター予測処理部１０２の構成の示すブロック図である。It is a block diagram which shows the structure of the inter prediction process part 102 shown in FIG. 図２に示すインター予測処理部１０２の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the inter prediction process part 102 shown in FIG. ランダム・アクセス符号化モードにおける画面間参照構造を示す図である。It is a figure which shows the reference structure between screens in random access encoding mode. 図２に示すブロックサイズ候補決定部２０２の構成を示すブロック図である。It is a block diagram which shows the structure of the block size candidate determination part 202 shown in FIG. 図５に示すブロックサイズ候補決定部２０２の動作を示すフローチャートである。6 is a flowchart showing an operation of a block size candidate determination unit 202 shown in FIG. 5. 図５に示す候補コスト生成部３０１の詳細な動作を示すフローチャートである。It is a flowchart which shows the detailed operation | movement of the candidate cost production | generation part 301 shown in FIG. 図５に示すオフセット決定部３０２の詳細な動作を示すフローチャートである。It is a flowchart which shows the detailed operation | movement of the offset determination part 302 shown in FIG. 図５に示す候補コスト比較部３０３の詳細な動作を示すフローチャートである。It is a flowchart which shows the detailed operation | movement of the candidate cost comparison part 303 shown in FIG.

以下、図面を参照して、本発明の一実施形態による映像符号化装置を説明する。以下で用いる「符号化ブロック」については、ＭＰＥＧ−２やＨ．２６４／ＡＶＣ規格ではマクロブロックのことを示し、ＨＥＶＣについてはコーディングユニット（ＣＵ）又はプレディクションユニット（ＰＵ）のことを指し示す。図１は同実施形態による映像符号化装置の構成を示すブロック図である。図１に示す映像符号化装置１００において、インター予測処理１０２が従来技術と異なる部分であり、他の部分はＨ．２６４／ＡＶＣやＨＥＶＣ等の映像符号化装置として用いられている従来の一般的な構成と同様である。 Hereinafter, a video encoding apparatus according to an embodiment of the present invention will be described with reference to the drawings. The “encoded block” used below is MPEG-2 or H.264. The H.264 / AVC standard indicates a macro block, and HEVC indicates a coding unit (CU) or a prediction unit (PU). FIG. 1 is a block diagram showing a configuration of a video encoding apparatus according to the embodiment. In the video encoding apparatus 100 shown in FIG. 1, the inter prediction process 102 is a part different from the prior art, and the other part is H.264. This is the same as the conventional general configuration used as a video encoding device such as H.264 / AVC or HEVC.

映像符号化装置１００は、符号化対象の映像信号（原画像）を入力とし、入力映像信号のピクチャをブロックに分割してブロックごとに符号化し、そのビットストリームを符号化ストリームとして出力する。この符号化のため、予測残差信号生成部１０３は、入力映像信号とイントラ予測処理部あるいはインター予測処理部の出力である予測信号との差分を求め、それを予測残差信号として出力する。 The video encoding apparatus 100 receives a video signal (original image) to be encoded as an input, divides a picture of the input video signal into blocks, encodes each block, and outputs the bit stream as an encoded stream. For this encoding, the prediction residual signal generation unit 103 obtains a difference between the input video signal and the prediction signal output from the intra prediction processing unit or the inter prediction processing unit, and outputs it as a prediction residual signal.

変換・量子化処理部１０４は、予測残差信号を入力とし、入力された予測残差信号に対して離散コサイン変換等の直交変換を行い、変換係数を量子化し、その量子化された変換係数を出力する。エントロピー符号化処理部１０５は、量子化された変換係数を入力とし、入力された量子化された変換係数をエントロピー符号化し、符号化ストリームとして出力する。 The transform / quantization processing unit 104 receives the prediction residual signal, performs orthogonal transform such as discrete cosine transform on the input prediction residual signal, quantizes the transform coefficient, and the quantized transform coefficient Is output. The entropy encoding processing unit 105 receives the quantized transform coefficient as input, entropy-encodes the input quantized transform coefficient, and outputs it as an encoded stream.

一方、量子化された変換係数は、逆量子化・逆変換処理部１０６にも入力され、ここで逆量子化と逆直交変換され、予測残差復号信号を出力する。復号信号生成部１０７は、逆量子化・逆変換処理部１０６の出力である予測残差復号信号とイントラ予測処理部１０１またはインター予測処理部１０２の出力である予測信号とを加算し、符号化した符号化対象ブロックの復号信号を生成する。この復号信号は、インター予測処理部１０２で参照画像として用いるために、ループフィルタ処理部１０８に入力される。ループフィルタ処理部１０８では復号信号に対して符号化歪みを低減するフィルタリング処理を行い、このフィルタリング処理後の画像を参照画像としてインター予測処理部１０２に入力する。 On the other hand, the quantized transform coefficient is also input to the inverse quantization / inverse transform processing unit 106, where inverse quantization and inverse orthogonal transform are performed, and a prediction residual decoded signal is output. The decoded signal generation unit 107 adds the prediction residual decoded signal output from the inverse quantization / inverse conversion processing unit 106 and the prediction signal output from the intra prediction processing unit 101 or the inter prediction processing unit 102 to perform encoding. The decoded signal of the encoding target block is generated. This decoded signal is input to the loop filter processing unit 108 for use as a reference image by the inter prediction processing unit 102. The loop filter processing unit 108 performs filtering processing for reducing the coding distortion on the decoded signal, and inputs the image after the filtering processing to the inter prediction processing unit 102 as a reference image.

本実施形態は、図１に示すインター予測処理部１０２において、ブロックサイズ候補を適切に絞り込んで動き探索処理を行うことで、符号化効率を低下させずにインター予測処理に要する処理量を削減するものである。 In the present embodiment, the inter prediction processing unit 102 shown in FIG. 1 performs motion search processing by appropriately narrowing down block size candidates, thereby reducing the processing amount required for the inter prediction processing without reducing the coding efficiency. Is.

以下の説明は、ＨＥＶＣ規格に基づいた実施形態として説明する。以下の実施形態は、ＨＥＶＣにおける６４×６４、３２×３２、１６×１６、８×８の４種類のブロックサイズ候補から、３種類のブロックサイズ候補に絞り込む手法である。実施形態における処理は６４×６４の領域単位で実行される。以下の実施形態の中で用いているコスト値とは、原画像と動きベクトルが指し示す参照画像との差分絶対値和（ＳＡＤ）もしくは差分値に２次元アダマール変換を行った値の差分絶対値和（ＳＡＴＤ）と、動きベクトルを符号化した際に生じる符号量を簡易的に見積もった動きベクトルコスト値の和で表される値を表している。 The following description will be described as an embodiment based on the HEVC standard. The following embodiment is a method of narrowing down from four types of block size candidates of 64 × 64, 32 × 32, 16 × 16, and 8 × 8 in HEVC to three types of block size candidates. The processing in the embodiment is executed in units of 64 × 64 areas. The cost value used in the following embodiments refers to the sum of absolute differences (SAD) between the original image and the reference image indicated by the motion vector, or the sum of absolute differences obtained by performing two-dimensional Hadamard transformation on the difference values. (SATD) and a value represented by the sum of motion vector cost values obtained by simply estimating the amount of code generated when the motion vector is encoded.

次に、図１に示すインター予測処理部１０２の構成と動作を説明する。図２は、図１に示すインター予測処理部１０２の構成の示すブロック図である。インター予測処理部１０２は、事前動きを予測する事前動き予測処理部２０１と、ブロックサイズ候補を決定するブロックサイズ候補決定部２０２と、動き探索を行う動き探索処理部２０３とを備える。 Next, the configuration and operation of the inter prediction processing unit 102 illustrated in FIG. 1 will be described. FIG. 2 is a block diagram illustrating a configuration of the inter prediction processing unit 102 illustrated in FIG. 1. The inter prediction processing unit 102 includes a pre-motion prediction processing unit 201 that predicts pre-motion, a block size candidate determination unit 202 that determines block size candidates, and a motion search processing unit 203 that performs motion search.

次に、図３を参照して、図２に示すインター予測処理部１０２の動作を説明する。図３は、図２に示すインター予測処理部１０２の動作を示すフローチャートである。まず、事前動き予測処理部２０１は、入力映像信号（原画像）とループフィルタ処理部１０８より入力される復号信号を入力とし、入力映像信号の大まかな動きを捕える事前動き探索を行う（ステップＳ１）。 Next, the operation of the inter prediction processing unit 102 shown in FIG. 2 will be described with reference to FIG. FIG. 3 is a flowchart showing the operation of the inter prediction processing unit 102 shown in FIG. First, the prior motion prediction processing unit 201 receives an input video signal (original image) and a decoded signal input from the loop filter processing unit 108 as input, and performs a prior motion search for capturing a rough motion of the input video signal (step S1). ).

事前動き予測処理部２０１は、入力映像信号中の対象となる６４×６４領域における各ブロックサイズそれぞれで動き探索を行い、各ブロックサイズごとの動きベクトル及びそれに伴うコスト値を算出し、出力する。６４×６４ブロックサイズのコスト値とは、対象の６４×６４領域を分割せずに６４×６４ブロックとして処理した場合のコスト値である。３２×３２ブロックサイズのコスト値とは、対象の６４×６４領域を縦横それぞれ２分割した際の、４つの３２×３２ブロックにおける各コスト値である。１６×１６ブロックサイズのコスト値とは、対象の６４×６４領域を縦横それぞれ４分割した際の、１６つの１６×１６ブロックにおける各コスト値である。８×８ブロックサイズのコスト値とは、対象の６４×６４領域を縦横それぞれ８分割した際の、６４つの８×８ブロックにおける各コスト値である。また、事前動き予測処理部２０１は、入力映像信号（原画像）のフレーム番号から、ＴｅｍｐｏｒａｌＩＤを算出して符号化情報とし、この符号化情報をブロックサイズ候補決定部２０２に出力する。 The prior motion prediction processing unit 201 performs a motion search for each block size in a target 64 × 64 region in the input video signal, calculates a motion vector for each block size, and a cost value associated therewith, and outputs the motion vector. The cost value of the 64 × 64 block size is a cost value when the target 64 × 64 region is processed as 64 × 64 blocks without being divided. The cost value of the 32 × 32 block size is a cost value in four 32 × 32 blocks when the target 64 × 64 region is divided into two parts in the vertical and horizontal directions. The cost value of the 16 × 16 block size is each cost value in 16 16 × 16 blocks when the target 64 × 64 region is divided into four in each of the vertical and horizontal directions. The cost value of the 8 × 8 block size is each cost value in 64 8 × 8 blocks when the target 64 × 64 region is divided into 8 parts in the vertical and horizontal directions. Further, the prior motion prediction processing unit 201 calculates TemporalID from the frame number of the input video signal (original image) as encoded information, and outputs the encoded information to the block size candidate determining unit 202.

次に、ブロックサイズ候補決定部２０２は、事前動き予測処理部２０１で算出された各ブロックサイズのコスト値と、動き探索処理部２０３で算出される符号化済みの過去のフレームである符号化済みピクチャにおける特定の場合の動きベクトル統計値、及び当該ピクチャの符号化情報を入力とし、ブロックサイズ候補の絞り込み処理を行う（ステップＳ２）。絞り込み処理とは、候補数を減らす処理のことである。例えば、後述するように複数のブロックサイズ候補を２つの候補のグループにし、いずれかの候補に決定することを指す。後述する説明では２つの候補のグループにし、いずれかの候補に決定する例で説明するが、候補数を減らす処理であればよく、これに限るものではない。事前動き予測処理部２０１で探索した結果である、各ブロックサイズごとの動きベクトル、及びブロックサイズ候補決定部２０２で絞り込まれたブロックサイズ候補は、動き探索処理部２０３に対してそれぞれ出力する。 Next, the block size candidate determination unit 202 encodes the cost value of each block size calculated by the prior motion prediction processing unit 201 and the encoded past frame calculated by the motion search processing unit 203. The motion vector statistical value in a specific case in the picture and the coding information of the picture are input, and the block size candidate narrowing process is performed (step S2). The narrowing-down process is a process for reducing the number of candidates. For example, as will be described later, this refers to making a plurality of block size candidates into a group of two candidates and determining one of the candidates. In the description to be described later, an example in which two candidates are grouped and determined as one of the candidates will be described. However, the processing is not limited to this as long as the number of candidates is reduced. The motion vector for each block size and the block size candidates narrowed down by the block size candidate determination unit 202, which are the results of searching by the prior motion prediction processing unit 201, are output to the motion search processing unit 203, respectively.

次に、動き探索処理部２０３は事前動き予測処理部２０１で探索した結果を基に、ブロックサイズ候補決定部２０２で絞り込まれた候補内のサイズで動き探索処理を行う（ステップＳ３）。最終的に決定した動きベクトル情報と予測差分画像は予測残差信号生成部１０３へと出力される。さらに当該符号化対象ピクチャが特定の場合のみ、動き探索処理部２０３は決定した動きベクトルの統計値を算出しブロックサイズ候補決定部２０２へ出力する。 Next, the motion search processing unit 203 performs a motion search process with the size within the candidates narrowed down by the block size candidate determination unit 202 based on the result of the search by the prior motion prediction processing unit 201 (step S3). The finally determined motion vector information and the prediction difference image are output to the prediction residual signal generation unit 103. Further, only when the picture to be encoded is specific, the motion search processing unit 203 calculates a statistical value of the determined motion vector, and outputs it to the block size candidate determination unit 202.

以下、動き探索処理部２０３が決定した動きベクトルの統計値を算出しブロックサイズ候補決定部２０２へ出力する、特定の場合について説明する。ＨＥＶＣにおけるランダム・アクセス符号化モードでは、時間軸上過去のピクチャだけでなく未来のピクチャからも予測可能とすることで、高い画面間符号化効率を得ることが可能である。図４は、ランダム・アクセス符号化モードにおける画面間参照構造を示す図である。図４では、各ピクチャが参照のパターンによって４つの階層に分類され、それぞれの階層にＴｅｍｐｏｒａｌＩＤ（時間識別子）とよばれる値を割り振っている。ＴｅｍｐｏｒａｌＩＤ＝０のピクチャから次のＴｅｍｐｏｒａｌＩＤ＝０のピクチャまでをＧＯＰ（Group of Picture）と呼ぶ。 Hereinafter, a specific case where the statistical value of the motion vector determined by the motion search processing unit 203 is calculated and output to the block size candidate determination unit 202 will be described. In the random access coding mode in HEVC, it is possible to obtain high inter-frame coding efficiency by making prediction possible not only from past pictures on the time axis but also from future pictures. FIG. 4 is a diagram showing an inter-screen reference structure in the random access encoding mode. In FIG. 4, each picture is classified into four layers according to the reference pattern, and a value called TemporalID (time identifier) is assigned to each layer. A picture from the picture with TemporalID = 0 to the next picture with TemporalID = 0 is called GOP (Group of Picture).

図４は、ＧＯＰのサイズが８の場合のランダム・アクセス符号化モードの例を表している。動き探索処理部２０３では、当該符号化対象ピクチャの符号化順序がＴｅｍｐｏｒａｌＩＤ＝０のピクチャの１つ前の場合のみ、当該符号化対象ピクチャにおける全ての動きベクトルの分散を算出し、動きベクトル統計値としてブロックサイズ候補決定部２０２へ出力する。 FIG. 4 shows an example of the random access encoding mode when the GOP size is 8. The motion search processing unit 203 calculates the variance of all motion vectors in the current encoding target picture only when the encoding order of the current encoding target picture is one immediately before the picture with TemporalID = 0, and calculates motion vector statistics Is output to the block size candidate determination unit 202.

ここでは動きベクトル統計値として動きベクトルの分散を算出したが、動きベクトルの算術平均や二乗和や絶対値和を動きベクトル統計値として出力する方法であってもよい。動きベクトル統計値として動きベクトルの分散を用いる場合は、画面間の動きが１ピクチャ内で均一的かばらけているかによって特定のブロックサイズ候補を選ばれやすくする効果が見込める。動きベクトル統計値として動きベクトルの絶対値和を用いる場合は、画面間の動きが全体的に大きいか小さいかによって特定のブロックサイズ候補が選ばれやすくする効果が見込める。 Here, the variance of the motion vector is calculated as the motion vector statistical value. However, a method of outputting the arithmetic mean, square sum, or absolute value sum of the motion vector as the motion vector statistical value may be used. When motion vector dispersion is used as a motion vector statistic value, an effect of facilitating selection of a specific block size candidate can be expected depending on whether the motion between screens is uniformly distributed within one picture. When the sum of absolute values of motion vectors is used as the motion vector statistic value, an effect of easily selecting a specific block size candidate can be expected depending on whether the motion between screens is large or small as a whole.

次に、図５を参照して、図２に示すブロックサイズ候補決定部２０２の構成を説明する。図５は、図２に示すブロックサイズ候補決定部２０２の構成を示すブロック図である。ブロックサイズ候補決定部２０２は、候補コスト生成部３０１、オフセット決定部３０２、候補コスト比較部３０３を備えている。 Next, the configuration of the block size candidate determination unit 202 shown in FIG. 2 will be described with reference to FIG. FIG. 5 is a block diagram showing a configuration of the block size candidate determination unit 202 shown in FIG. The block size candidate determination unit 202 includes a candidate cost generation unit 301, an offset determination unit 302, and a candidate cost comparison unit 303.

次に、図６を参照して、図５に示すブロックサイズ候補決定部２０２の動作を説明する。図６は、図５に示すブロックサイズ候補決定部２０２の動作を示すフローチャートである。まず、候補コスト生成部３０１は、各ブロックサイズのコストを入力とし、ブロックサイズ候補のコストを算出する（ステップＳ１１）。候補コスト生成部３０１は、算出したコスト（候補コスト１、候補コスト２）を候補コスト比較部３０３へ出力する。 Next, the operation of the block size candidate determination unit 202 shown in FIG. 5 will be described with reference to FIG. FIG. 6 is a flowchart showing the operation of the block size candidate determination unit 202 shown in FIG. First, the candidate cost generation unit 301 receives the cost of each block size as input, and calculates the cost of the block size candidate (step S11). The candidate cost generation unit 301 outputs the calculated costs (candidate cost 1 and candidate cost 2) to the candidate cost comparison unit 303.

一方、オフセット決定部３０２は、動き探索処理部２０３で算出された符号化済みピクチャの動きベクトル統計値と、符号化対象ピクチャの符号化情報とを入力として、符号化済みピクチャの動きベクトル統計値と、符号化対象ピクチャの符号化情報に基づいて、コスト比較のオフセット値を算出する（ステップＳ１２）。オフセット決定部３０２は、算出したコスト比較オフセット値を候補コスト比較部３０３へ出力する。 On the other hand, the offset determination unit 302 receives the motion vector statistical value of the encoded picture calculated by the motion search processing unit 203 and the encoding information of the encoding target picture, and receives the motion vector statistical value of the encoded picture. Then, an offset value for cost comparison is calculated based on the encoding information of the encoding target picture (step S12). The offset determination unit 302 outputs the calculated cost comparison offset value to the candidate cost comparison unit 303.

次に、候補コスト比較部３０３は、２つのコスト（候補コスト１、候補コスト２）とコスト比較オフセット値を入力とし、２つのコストに基づいて比較を行い、ブロックサイズ候補を決定する（ステップＳ１３）。候補コスト比較部３０３は、決定したブロックサイズ候補を動き探索処理部２０３へ出力する。 Next, the candidate cost comparison unit 303 receives two costs (candidate cost 1 and candidate cost 2) and a cost comparison offset value as inputs, and performs comparison based on the two costs to determine a block size candidate (step S13). ). Candidate cost comparison unit 303 outputs the determined block size candidates to motion search processing unit 203.

次に、図７を参照して、図５に示す候補コスト生成部３０１の詳細な動作を説明する。図７は、図５に示す候補コスト生成部３０１の詳細な動作を示すフローチャートである。まず、候補コスト生成部３０１は、事前動き予測処理部２０１で算出された各ブロックサイズのコスト値を入力する（ステップＳ２１）。そして、候補コスト生成部３０１は、ブロックサイズ候補１のコスト値（候補コスト１）を算出する（ステップＳ２２）。ブロックサイズ候補１は６４×６４のブロックサイズのみとする。ブロックサイズ候補１のコスト値は、事前動き予測処理部２０１で算出された６４×６４ブロックサイズのコスト値とする。 Next, detailed operation of the candidate cost generation unit 301 shown in FIG. 5 will be described with reference to FIG. FIG. 7 is a flowchart showing a detailed operation of the candidate cost generation unit 301 shown in FIG. First, the candidate cost generation unit 301 inputs the cost value of each block size calculated by the prior motion prediction processing unit 201 (step S21). Then, the candidate cost generation unit 301 calculates the cost value (candidate cost 1) of the block size candidate 1 (step S22). The block size candidate 1 is only a block size of 64 × 64. The cost value of the block size candidate 1 is a cost value of 64 × 64 block size calculated by the prior motion prediction processing unit 201.

続いて、候補コスト生成部３０１は、ブロックサイズ候補２のコスト値（候補コスト２）を算出する（ステップＳ２３）。ブロックサイズ候補２は３２×３２、１６×１６、８×８の３つのブロックサイズとする。６４×６４領域を縦横４分割した際の１６×１６領域において、１６×１６ブロックサイズのコスト値と、１６×１６領域を縦横２分割した４つの８×８ブロックサイズのコスト値の和を比較し、１６×１６ブロックサイズのコスト値が小さい場合は１６×１６領域のコスト値を１６×１６ブロックサイズのコスト値とし、１６×１６ブロックサイズのコスト値が大きい場合は１６×１６領域のコスト値を４つの８×８ブロックサイズのコスト値の和とする。以上の１６×１６領域コスト算出処理を１６つの１６×１６領域で行う。 Subsequently, the candidate cost generation unit 301 calculates the cost value (candidate cost 2) of the block size candidate 2 (step S23). Block size candidate 2 has three block sizes of 32 × 32, 16 × 16, and 8 × 8. Compare the cost value of the 16x16 block size and the sum of the cost values of the four 8x8 block sizes obtained by dividing the 16x16 area into 2 parts vertically and horizontally in the 16x16 area when the 64x64 area is divided into 4 parts. When the cost value of the 16 × 16 block size is small, the cost value of the 16 × 16 area is set as the cost value of the 16 × 16 block size, and when the cost value of the 16 × 16 block size is large, the cost of the 16 × 16 area is Let the value be the sum of the cost values of four 8 × 8 block sizes. The above 16 × 16 area cost calculation processing is performed on 16 16 × 16 areas.

６４×６４領域を縦横２分割した際の３２×３２領域において、３２×３２ブロックサイズのコスト値と、３２×３２領域を縦横２分割した４つの１６×１６領域コストの和を比較し、３２×３２ブロックサイズのコスト値が小さい場合は３２×３２領域のコスト値を３２×３２ブロックサイズのコスト値とし、３２×３２ブロックサイズのコスト値が大きい場合は３２×３２領域のコスト値を４つの１６×１６領域コストの和とする。対象の６４×６４領域を縦横２分割した４つの３２×３２領域全てで上記の３２×３２領域コスト算出手順を行い、４つの３２×３２領域コスト値の和をブロックサイズ候補２のコスト値とする。 In the 32 × 32 area when the 64 × 64 area is divided into 2 parts vertically and horizontally, the cost value of the 32 × 32 block size is compared with the sum of the four 16 × 16 area costs obtained by dividing the 32 × 32 area into 2 parts vertically and horizontally. When the cost value of the × 32 block size is small, the cost value of the 32 × 32 region is set as the cost value of the 32 × 32 block size, and when the cost value of the 32 × 32 block size is large, the cost value of the 32 × 32 region is set to 4. The sum of the two 16 × 16 area costs. The above 32 × 32 region cost calculation procedure is performed for all four 32 × 32 regions obtained by dividing the target 64 × 64 region into two vertically and horizontally, and the sum of the four 32 × 32 region cost values is used as the cost value of block size candidate 2. To do.

最後に、候補コスト生成部３０１は、算出したブロックサイズ候補１、２のコスト値を出力する（ステップＳ２４）。 Finally, the candidate cost generation unit 301 outputs the calculated cost values of the block size candidates 1 and 2 (step S24).

次に、図８を参照して、図５に示すオフセット決定部３０２の詳細な動作を説明する。図８は、図５に示すオフセット決定部３０２の詳細な動作を示すフローチャートである。まず、動き探索処理部２０３が算出したピクチャ単位の動きベクトル分散の最新値σ_ＭＶ ^２と、閾値ＴＨとを比較し、動きベクトル分散の最新値σ_ＭＶ ^２が閾値ＴＨより小さいか否かを判定する（ステップＳ３１）。この判定の結果、動きベクトル分散σ_ＭＶ ^２が閾値ＴＨより小さい場合、すなわち物体の動きが１ピクチャ内で均一的な場合、コスト比較オフセット値λ_{ｏｆｆｓｅｔ}を０に設定する（ステップＳ３２）。 Next, the detailed operation of the offset determination unit 302 shown in FIG. 5 will be described with reference to FIG. FIG. 8 is a flowchart showing a detailed operation of the offset determination unit 302 shown in FIG. First, the latest value σ _MV ² of the motion vector variance for each picture calculated by the motion search processing unit 203 is compared with the threshold value TH, and it is determined whether or not the latest value σ _MV ² of the motion vector variance is smaller than the threshold value TH. (Step S31). As a result of this determination, if the motion vector variance σ _MV ² is smaller than the threshold value TH, that is, if the motion of the object is uniform within one picture, the cost comparison offset value λ _offset is set to 0 (step S32).

一方、動きベクトル分散σ_ＭＶ ^２が閾値ＴＨより大きい場合、すなわち物体の動きが１ピクチャ内でばらけている場合、コスト比較オフセット値λ_{ｏｆｆｓｅｔ}を０でない実数（例えば、α・ＴｅｍｐｏｒａｌＩＤ）に設定する（ステップＳ３３）。この場合のλ_{ｏｆｆｓｅｔ}は負の実数を用いるとよい。これにより動きがばらけているピクチャではブロックサイズ候補１、すなわち６４×６４のブロックサイズが選ばれやすくなり、この結果としてブロックサイズが動きのばらけによって小さくなりすぎることを防ぎ、符号量削減が望める。 On the other hand, when the motion vector variance σ _MV ² is larger than the threshold value TH, that is, when the motion of the object varies within one picture, the cost comparison offset value λ _offset is set to a non-zero real number (for example, α · TemporalID). (Step S33). In this case, a negative real number may be used for λ _offset . This makes it easy to select block size candidate 1, that is, a block size of 64 × 64 for a picture with scattered motion, and as a result, the block size is prevented from becoming too small due to motion variation, and the code amount can be reduced. I can hope.

また、この場合のλ_{ｏｆｆｓｅｔ}の決定方法としては、符号化情報より当該符号化対象ピクチャのＴｅｍｐｏｒａｌＩＤを取得し、コスト比較オフセット値λ_{ｏｆｆｓｅｔ}をＴｅｍｐｏｒａｌＩＤの定数倍α・ＴｅｍｐｏｒａｌＩＤに設定するとよい。αは負の実数定数がよい。ＴｅｍｐｏｒａｌＩＤが大きいピクチャでは動き予測の精度が高い傾向があるが、コスト比較オフセット値λ_{ｏｆｆｓｅｔ}をＴｅｍｐｏｒａｌＩＤの定数倍α・ＴｅｍｐｏｒａｌＩＤに設定することで、ＴｅｍｐｏｒａｌＩＤが大きいピクチャではブロックサイズ候補１、すなわち６４×６４のブロックサイズが選ばれやすくなり、より大きなブロックで動き予測が行われることで符号量削減が望める。 Also, as a method for determining λ _offset in this case, it is preferable to obtain TemporalID of the picture to be encoded from the encoding information and set the cost comparison offset value λ _offset to a constant multiple α · TemporalID of TemporalID. α is preferably a negative real constant. Although the accuracy of motion prediction tends to be high for a picture with a large Temporal ID, the cost comparison offset value λ _offset is set to a constant multiple α · Temporal ID of the Temporal ID. The block size can be easily selected, and the amount of code can be reduced by performing motion prediction on a larger block.

ここでは、動き探索処理部２０３で算出された動きベクトル統計値が閾値ＴＨより大きい場合のコスト比較オフセット値λ_{ｏｆｆｓｅｔ}をＴｅｍｐｏｒａｌＩＤの定数倍α・ＴｅｍｐｏｒａｌＩＤとしたが、ＴｅｍｐｏｒａｌＩＤごとに適切なλ_{ｏｆｆｓｅｔ}を固定値として予め用意しておき、符号化対象ピクチャのＴｅｍｐｏｒａｌＩＤに応じてλ_{ｏｆｆｓｅｔ}を切り替えるようにしてもよい。 Here, the cost comparison offset value λ _offset when the motion vector statistical value calculated by the motion search processing unit 203 is larger than the threshold value TH is set to a constant multiple α · TemporalID of TemporalID, but an appropriate λ _offset is fixed for each TemporalID. A value may be prepared in advance, and λ _offset may be switched according to the Temporal ID of the encoding target picture.

また、ここでは動きベクトル統計値と閾値ＴＨの大小を比較してコスト比較オフセット値λ_{ｏｆｆｓｅｔ}の計算式の切り替えを行ったが、ＧＯＰ構造を考慮するために動きベクトル統計値をＧＯＰサイズで割った値と閾値ＴＨを比較してλ_{ｏｆｆｓｅｔ}の計算式を切り替えるようにしてもよい。これにより、動きベクトル統計値がＧＯＰサイズと比較されたスケーリング済みの値となり、ＧＯＰサイズの大小によって閾値ＴＨを変化させる必要がなくなる。 Also, here, the calculation formula of the cost comparison offset value λ _offset is switched by comparing the motion vector statistical value with the threshold TH, but the motion vector statistical value is divided by the GOP size in order to consider the GOP structure. The calculation formula of λ _offset may be switched by comparing the value with the threshold value TH. As a result, the motion vector statistical value becomes a scaled value compared with the GOP size, and there is no need to change the threshold TH depending on the size of the GOP size.

候補コスト生成部３０１で算出された２つの候補コスト値とオフセット決定部３０２で算出されたコスト比較オフセット値は、候補コスト比較部３０３へ入力され、符号化ブロックのサイズ候補を２つのブロックサイズ候補の内のいずれかに決定する。 The two candidate cost values calculated by the candidate cost generation unit 301 and the cost comparison offset value calculated by the offset determination unit 302 are input to the candidate cost comparison unit 303, and the encoded block size candidates are set as two block size candidates. Determine one of the following.

次に、図９を参照して、図５に示す候補コスト比較部３０３の詳細な動作を説明する。図９は、図５に示す候補コスト比較部３０３の詳細な動作を示すフローチャートである。まず、候補コスト比較部３０３は、ブロックサイズ候補１、２のコスト値とコスト比較オフセット値とを入力とする（ステップＳ４１）。 Next, detailed operation of the candidate cost comparison unit 303 shown in FIG. 5 will be described with reference to FIG. FIG. 9 is a flowchart showing a detailed operation of the candidate cost comparison unit 303 shown in FIG. First, the candidate cost comparison unit 303 receives the cost value and the cost comparison offset value of the block size candidates 1 and 2 (step S41).

次に、候補コスト比較部３０３は、ブロックサイズ候補１の候補コスト値とオフセット決定部３０２で決定されたコスト比較オフセット値の和と、ブロックサイズ候補２の候補コスト値とを比較し、ブロックサイズ候補１コスト値＋コスト比較オフセット値＜ブロックサイズ候補２コスト値であるか否かを判定する（ステップＳ４２）。この判定の結果、ブロックサイズ候補２の候補コスト値が大きい場合はブロックサイズ候補を候補１に決定し（ステップＳ４３）、ブロックサイズ候補２の候補コスト値が小さい場合はブロックサイズ候補を候補２に決定する（ステップＳ４４）。 Next, the candidate cost comparison unit 303 compares the sum of the candidate cost value of the block size candidate 1 and the cost comparison offset value determined by the offset determination unit 302 with the candidate cost value of the block size candidate 2 to determine the block size. It is determined whether or not candidate 1 cost value + cost comparison offset value <block size candidate 2 cost value (step S42). As a result of this determination, if the candidate cost value of block size candidate 2 is large, the block size candidate is determined as candidate 1 (step S43), and if the candidate cost value of block size candidate 2 is small, the block size candidate is set as candidate 2. Determine (step S44).

以上説明したように、符号化ブロックサイズ候補が複数存在するような動き予測部を有する映像符号化方式において、符号化対象ピクチャの参照構造における階層の深さ、すなわちＴｅｍｐｏｒａｌＩＤと、符号化済みピクチャの動きベクトルの統計値情報に基づき、ブロックサイズ候補のコスト比較におけるオフセット値を決定し、コスト比較によりブロックサイズ候補を適切に絞り込むことで、動き探索に要する演算量を削減することが可能となる。 As described above, in the video encoding system having a motion prediction unit in which there are a plurality of encoding block size candidates, the depth of the hierarchy in the reference structure of the encoding target picture, that is, TemporalID, and the encoded picture By determining an offset value in cost comparison of block size candidates based on statistical value information of motion vectors and appropriately narrowing down the block size candidates by cost comparison, it is possible to reduce the amount of calculation required for motion search.

前述した実施形態における映像符号化装置１００の全部または一部をコンピュータで実現するようにしてもよい。その場合、この機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することによって実現してもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含んでもよい。また上記プログラムは、前述した機能の一部を実現するためのものであってもよく、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであってもよく、ＰＬＤ（Programmable Logic Device）やＦＰＧＡ（Field Programmable Gate Array）等のハードウェアを用いて実現されるものであってもよい。 You may make it implement | achieve all or one part of the video encoding apparatus 100 in embodiment mentioned above with a computer. In that case, a program for realizing this function may be recorded on a computer-readable recording medium, and the program recorded on this recording medium may be read into a computer system and executed. Here, the “computer system” includes an OS and hardware such as peripheral devices. The “computer-readable recording medium” refers to a storage device such as a flexible medium, a magneto-optical disk, a portable medium such as a ROM and a CD-ROM, and a hard disk incorporated in a computer system. Furthermore, the “computer-readable recording medium” dynamically holds a program for a short time like a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, a volatile memory inside a computer system serving as a server or a client in that case may be included and a program held for a certain period of time. Further, the program may be a program for realizing a part of the above-described functions, and may be a program capable of realizing the functions described above in combination with a program already recorded in a computer system. It may be realized using hardware such as PLD (Programmable Logic Device) or FPGA (Field Programmable Gate Array).

以上、図面を参照して本発明の実施の形態を説明してきたが、上記実施の形態は本発明の例示に過ぎず、本発明が上記実施の形態に限定されるものではないことは明らかである。したがって、本発明の技術思想及び範囲を逸脱しない範囲で構成要素の追加、省略、置換、その他の変更を行ってもよい。 As mentioned above, although embodiment of this invention has been described with reference to drawings, the said embodiment is only the illustration of this invention, and it is clear that this invention is not limited to the said embodiment. is there. Therefore, additions, omissions, substitutions, and other modifications of the components may be made without departing from the technical idea and scope of the present invention.

動き探索処理に要する演算量が限られている映像符号化装置及び映像符号化プログラムに適用できる。 The present invention can be applied to a video encoding device and a video encoding program in which the amount of computation required for motion search processing is limited.

１０２・・・インター予測処理部、２０１・・・事前動き予測処理部、２０２・・・ブロックサイズ候補決定部、２０３・・・動き探索処理部、３０１・・・候補コスト生成部、３０２・・・オフセット決定部、３０３・・・候補コスト比較部 DESCRIPTION OF SYMBOLS 102 ... Inter prediction process part, 201 ... Prior motion prediction process part, 202 ... Block size candidate determination part, 203 ... Motion search process part, 301 ... Candidate cost generation part, 302 ... Offset determination unit, 303 ... Candidate cost comparison unit

Claims

A video encoding device that performs a temporal prediction of a picture of an input video signal, performs a motion prediction for each coding block for a coding target picture, and performs a coding process of a differential signal,
Using the encoding target picture and a reference picture of the decoded video signal of the motion prediction destination, a pre-motion prediction process for capturing a rough motion of the target encoding block is performed for each of all the encoding block sizes. A prior motion prediction processing means for calculating a prior motion prediction processing cost;
Block size candidate cost means for determining a block size candidate cost in the first encoded block size candidate and a block size candidate cost in the second encoded block size candidate based on the prior motion prediction processing cost;
Parameter determining means for determining a motion statistical parameter based on motion vector information of a coded picture of the input video signal;
Offset value setting means for setting a cost comparison offset value based on the motion statistic parameter and the depth of the hierarchy in the reference structure of the encoding target picture;
Based on the block size candidate cost in the first encoded block size candidate, the block size candidate cost in the second encoded block size candidate, and the cost comparison offset value, encoding of the encoding target picture A block size candidate determining means for determining a block size candidate as a first block size candidate or a second block size candidate from among all block size candidates;
A video encoding device comprising: motion search means for performing motion prediction processing of the encoding target picture and determining a block size and a motion vector by using the determined encoding block size candidate.

The parameter determination means includes
The video encoding according to claim 1, wherein all motion vectors in a specified encoded picture of the input video signal are calculated, and an absolute value sum of the motion vectors is determined as a motion statistical parameter of the encoding target picture. apparatus.

The parameter determination means includes
2. The video encoding device according to claim 1, wherein all motion vectors in a specified encoded picture of the input video signal are calculated, and a variance value of the motion vector is determined as a motion statistical parameter of the encoding target picture. .

The block size candidate determination means includes
As a result of calculating the sum of the cost of the motion vector in the second block size candidate and the cost comparison offset value, and comparing the calculated sum with the cost of the motion vector in the first block size candidate, When it is determined that the cost of one block size candidate is higher, the block size candidate of the encoding target picture is set as a second block size candidate, and it is determined that the cost of the first block size candidate is lower 4. The video encoding device according to claim 1, wherein if the block size is selected, the block size candidate of the encoding target picture is set as a first block size candidate. 5.

A video encoding program for causing a computer to function as the video encoding device according to any one of claims 1 to 4.