JP6663191B2

JP6663191B2 - Encoding setting information determination device and program

Info

Publication number: JP6663191B2
Application number: JP2015172853A
Authority: JP
Inventors: 康孝松尾; 慎平根本; 境田　慎一; 慎一境田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2015-09-02
Filing date: 2015-09-02
Publication date: 2020-03-11
Anticipated expiration: 2035-09-02
Also published as: JP2017050713A

Description

本発明は、映像を符号化する際の設定情報を決定する符号化設定情報決定装置及びプログラムに関する。 The present invention relates to an encoding setting information determining device and a program for determining setting information when encoding a video.

映像の符号化方式として、ＩＴＵ−ＴＨ．２６４／ＭＰＥＧ−４ＡＶＣ（以下、単に「ＡＶＣ」と称する。）や、ＩＴＵ−ＴＨ．２６５／ＭＰＥＧ−ＨＨＥＶＣ（以下、単に「ＨＥＶＣ」と称する。）などが知られている。これらの符号化方式では、インター予測（フレーム間予測、画面間予測）、及びイントラ予測（フレーム内予測、画面内予測）が用いられる。インター予測は時間的に前後する参照フレームから予測画像を生成し、原画像との差分を符号化する予測符号化方式である。イントラ予測はフレーム内符号化されるブロックに対して、符号化済みの隣接ブロックの画素から予測画像を生成し、原画像との差分を符号化する予測符号化方式である。 As a video encoding method, ITU-TH. H.264 / MPEG-4 AVC (hereinafter, simply referred to as “AVC”) and ITU-T H.264. H.265 / MPEG-H HEVC (hereinafter simply referred to as “HEVC”) and the like are known. In these coding systems, inter prediction (inter-frame prediction, inter-screen prediction) and intra prediction (intra-frame prediction, intra-screen prediction) are used. The inter prediction is a prediction encoding method that generates a prediction image from a temporally preceding and succeeding reference frame and encodes a difference from an original image. Intra prediction is a predictive coding method for generating a predictive image from pixels of an adjacent block that has been coded for a block to be intra-coded, and coding a difference from the original image.

非特許文献１には、ＨＥＶＣでは、符号化ツリーユニット（ＣＴＵ：Coding Tree Unit）を階層的に４分木分割することで可変サイズの符号化ユニット（ＣＵ：Coding Unit）を生成することが開示されている。ＣＵのサイズは、６４×６４画素、３２×３２画素、１６×１６画素、及び８×８画素の４種類である。インター予測を行う場合には、各ＣＵを更にインター予測用の予測ユニット（ＰＵ：Prediction Unit）に分割し、該ＰＵごとに予測処理を行う。また、イントラ予測を行う場合には、各ＣＵを更にイントラ予測用のＰＵに分割し、該ＰＵごとに予測処理を行う。 Non-Patent Document 1 discloses that HEVC generates a variable-size coding unit (CU: Coding Unit) by hierarchically dividing a coding tree unit (CTU: Coding Tree Unit) into quadtrees. Have been. The CU has four sizes of 64 × 64 pixels, 32 × 32 pixels, 16 × 16 pixels, and 8 × 8 pixels. When performing inter prediction, each CU is further divided into prediction units (PU: Prediction Unit) for inter prediction, and prediction processing is performed for each PU. When performing intra prediction, each CU is further divided into PUs for intra prediction, and prediction processing is performed for each PU.

また、特許文献１には、複数カメラ撮像映像の符号化演算量を削減するために、複数カメラ撮影映像の各々においてＳＵＲＦ特徴量を用いた特徴点抽出を行い、撮影範囲が重複する領域のマクロブロックについては隣接カメラベクトルに基づいて探索範囲が狭くなるように補正してブロックマッチングを行うことが開示されている。 Japanese Patent Application Laid-Open No. H11-163873 discloses a technique for extracting feature points using SURF feature amounts in each of a plurality of camera-captured videos in order to reduce the amount of coding operation of the plurality of camera-captured videos. It is disclosed that block matching is performed for blocks so that the search range is corrected based on the adjacent camera vector so that the search range becomes narrower.

特開２０１４−１２０９１７号公報JP 2014-120917 A

大久保榮監修、「インプレス標準教科書シリーズＨ．２６５／ＨＥＶＣ教科書」、株式会社インプレスジャパン、2013年10月21日Supervised by Okubo Sakae, "Impress Standard Textbook Series H.265 / HEVC Textbook", Impress Japan, October 21, 2013

しかしながら、従来の符号化方法では、「複数の電車が高速走行する」、「多数のピンポン玉が跳ねる」というように、多数のオブジェクトが大きい動きをするような映像では、インター予測の精度が低いため、符号化画質が低くなるおそれがある。 However, according to the conventional encoding method, the accuracy of inter prediction is low in a video in which a large number of objects move greatly, such as “a plurality of trains run at high speed” and “a large number of ping-pong balls bounce”. Therefore, the encoded image quality may be reduced.

また、特許文献１に記載の技術は、複数カメラ撮影映像の符号化演算量を削減するためのものであって、ＳＵＲＦ特徴点の複数フレーム間マッチングを用いて動き推定を行って符号化効率を向上させるものではない。 Further, the technique described in Patent Document 1 is for reducing the amount of encoding calculation of images captured by a plurality of cameras, and performs motion estimation using matching between a plurality of frames of SURF feature points to reduce the coding efficiency. It does not improve it.

かかる事情に鑑みてなされた本発明の目的は、多数のオブジェクトが大きい動きをする映像の符号化効率を向上させるための符号化設定情報を決定することが可能な符号化設定情報決定装置及びプログラムを提供することにある。 An object of the present invention made in view of such circumstances is an encoding setting information determining apparatus and a program capable of determining encoding setting information for improving encoding efficiency of a video in which a large number of objects move greatly. Is to provide.

上記課題を解決するため、本発明に係る符号化設定情報決定装置は、映像を符号化する際の設定情報を決定する符号化設定情報決定装置であって、映像を入力して、ＳＵＲＦ（Speeded-Up Robust Features）特徴点のフレーム間マッチングにより、ＳＵＲＦ特徴点ごとに動きベクトルを生成する動き推定部と、符号化ブロックごとにＳＵＲＦ特徴点を計数し、計数したＳＵＲＦ特徴点の数に基づき、前記符号化ブロックと同一又は前記符号化ブロックを分割したサイズの予測ブロックを決定する予測ブロックサイズ設定部と、前記予測ブロックごとに、前記ＳＵＲＦ特徴点ごとの動きベクトルの平均値又は中央値を算出し、前記予測ブロックごとの動き探索の初期変位ベクトルと決定する初期変位ベクトル設定部と、前記予測ブロックごとに、前記ＳＵＲＦ特徴点ごとの動きベクトルを用いて、動き探索範囲を決定する動き探索範囲設定部と、を備え、前記予測ブロックサイズ設定部は、前記予測ブロックのサイズを、ＳＵＲＦ特徴点の数が多い符号化ブロックほど小さくなるように決定し、前記動き探索範囲設定部は、前記初期変位ベクトルを前記動きベクトルの平均値とした場合は、予測ブロックに対応する全ての動きベクトルの始点を予測ブロックの中心点に移動させたときの、該動きベクトルの終点を含む範囲を、該予測ブロックの初期変位ベクトルに対する動き探索範囲とし、前記初期変位ベクトルを前記動きベクトルの中央値とした場合は、予測ブロックに対応する動きベクトルのうち大きさが第１四分位数から第３四分位数の動きベクトルの始点を予測ブロックの中心点に移動させたときの、該動きベクトルの終点を含む範囲を、該予測ブロックの初期変位ベクトルに対する動き探索範囲とすることを特徴とする。 In order to solve the above-mentioned problem, an encoding setting information determining apparatus according to the present invention is an encoding setting information determining apparatus that determines setting information when encoding a video. -Up Robust Features) A motion estimator that generates a motion vector for each SURF feature point by inter-frame matching of feature points, and a SURF feature point is counted for each coding block, and based on the counted number of SURF feature points, A prediction block size setting unit that determines a prediction block having the same size as the coding block or a size obtained by dividing the coding block; and calculating an average value or a median value of the motion vectors for each SURF feature point for each prediction block. An initial displacement vector setting unit that determines an initial displacement vector for motion search for each of the prediction blocks; Using the motion vector for each RF feature points, comprising a motion estimation range setting unit for determining a motion estimation range, and the prediction block size setting unit, the size of the prediction block, the number of SURF feature points is often code When the initial displacement vector is the average value of the motion vectors, the motion search range setting unit sets the start points of all the motion vectors corresponding to the prediction block to the center of the prediction block. When moving to a point, the range including the end point of the motion vector is set as a motion search range for the initial displacement vector of the prediction block, and when the initial displacement vector is set as the median value of the motion vector, Move the start point of the motion vector whose size is the first quartile to the third quartile among the corresponding motion vectors to the center point of the prediction block The range including the end point of the motion vector when the motion vector is set is a motion search range for the initial displacement vector of the prediction block .

また、上記課題を解決するため、本発明に係るプログラムは、コンピュータを、上記符号化設定情報決定装置として機能させることを特徴とする。 According to another aspect of the present invention, there is provided a program for causing a computer to function as the encoding setting information determination device.

本発明によれば、多数のオブジェクトが大きい動きをする映像の符号化効率を向上させるための符号化設定情報を決定することが可能になる。 ADVANTAGE OF THE INVENTION According to this invention, it becomes possible to determine the encoding setting information for improving the encoding efficiency of the video in which many objects move greatly.

本発明の第１の実施形態に係る符号化設定情報決定装置の構成例を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration example of an encoding setting information determination device according to the first embodiment of the present invention. 本発明の第１の実施形態に係る符号化設定情報決定装置の動き推定部における処理を説明する図である。FIG. 3 is a diagram illustrating processing in a motion estimating unit of the encoding setting information determination device according to the first embodiment of the present invention. 本発明の第１の実施形態に係る符号化設定情報決定装置の動き推定部における参照フレーム間の予測構造例を示す図である。FIG. 4 is a diagram illustrating an example of a prediction structure between reference frames in a motion estimator of the encoding setting information determination device according to the first embodiment of the present invention. 本発明の第２の実施形態に係る符号化設定情報決定装置の構成例を示すブロック図である。FIG. 11 is a block diagram illustrating a configuration example of an encoding setting information determination device according to a second embodiment of the present invention. 本発明の第２の実施形態に係る符号化設定情報決定装置の初期変位ベクトル設定部における処理を説明する図である。It is a figure explaining processing in the initial displacement vector setting part of the coding setting information deciding device concerning a 2nd embodiment of the present invention.

以下、本発明の実施形態について、図面を参照して詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

（第１の実施形態）
本発明の第１の実施形態に係る符号化設定情報決定装置について、以下に説明する。図１は、本発明の第１の実施形態に係る符号化設定情報決定装置の構成例を示すブロック図である。図１に示す例では、符号化設定情報決定装置１は、動き推定部１１と、予測ブロックサイズ設定部１２とを備える。 (First embodiment)
The encoding setting information determination device according to the first embodiment of the present invention will be described below. FIG. 1 is a block diagram illustrating a configuration example of the encoding setting information determination device according to the first embodiment of the present invention. In the example illustrated in FIG. 1, the coding setting information determination device 1 includes a motion estimating unit 11 and a predicted block size setting unit 12.

動き推定部１１は、映像（原画像群）を入力して、フレーム間（符号化対象フレームと参照フレームとの間）でＳＵＲＦ特徴点のマッチングを行うことにより、ＳＵＲＦ（Speeded-Up Robust Features）特徴点ごとに、ＳＵＲＦ特徴点を始点とする動きベクトルを生成する。そして、ＳＵＲＦ特徴点の位置及び動きベクトルの方向（水平及び垂直方向の位置ずれ量）を示す動きベクトル情報を予測ブロックサイズ設定部１２に出力する。なお、予測ブロックサイズ設定部１２では動きベクトルの方向を示す情報は不要であるため、予測ブロックサイズ設定部１２に対してはＳＵＲＦ特徴点の位置を示す情報のみを出力するようにしてもよい。 The motion estimating unit 11 receives a video (original image group) and performs SURF feature point matching between frames (between an encoding target frame and a reference frame), thereby obtaining a SURF (Speeded-Up Robust Features). A motion vector starting from the SURF feature point is generated for each feature point. Then, motion vector information indicating the position of the SURF feature point and the direction of the motion vector (the amount of displacement in the horizontal and vertical directions) is output to the prediction block size setting unit 12. Since the information indicating the direction of the motion vector is unnecessary in the predicted block size setting unit 12, only the information indicating the position of the SURF feature point may be output to the predicted block size setting unit 12.

動き推定部１１は、ＳＵＲＦ特徴量を用いた動き推定を行うことにより、拡大・縮小、回転、照明変動などを検出でき、「火花が飛び散る映像」のような動きがランダムな映像であっても高精度に動き推定が可能となり、ランダムな動きが多い映像におけるインター予測精度（動き推定精度）を向上させることが可能となる。 The motion estimating unit 11 can detect the enlargement / reduction, rotation, illumination fluctuation, and the like by performing the motion estimation using the SURF feature amount. Even if the motion such as “the image in which sparks scatter” is a random image, High-precision motion estimation becomes possible, and it is possible to improve inter prediction accuracy (motion estimation accuracy) in a video having many random motions.

図２は、動き推定部１１におけるＳＵＲＦ特徴量の算出方法を説明する図である。ＳＵＲＦ特徴量は、原画像に分散値を可変としたガウシアンフィルタを適用した画像において、ヘッセ（Hessian）行列値の極大点を探すことで複数のＳＵＲＦ特徴点を抽出する。図２では左上図に示すように、８×８画素のブロックの中心点をＳＵＲＦ特徴点としている。つぎに、各ＳＵＲＦ特徴点においてハールウェーブレット（Haar wavelet）を用いて強度の強い角度を３／π間隔で求める。そして、強度の強い方向を原方向として、図２の右下図に示すように２×２画素の領域ごとに、ハールウェーブレットを用いて水平方向の微分値ｄｘ及び垂直方向の微分値ｄｙを求めることにより、Σdx、Σ|dx|、Σdy、及びΣ|dy|の４つの特徴量を算出する。 FIG. 2 is a diagram illustrating a method of calculating a SURF feature amount in the motion estimating unit 11. The SURF feature quantity is obtained by extracting a plurality of SURF feature points by searching for a maximum point of a Hessian matrix value in an image obtained by applying a Gaussian filter with a variable variance value to an original image. In FIG. 2, as shown in the upper left diagram, the center point of a block of 8 × 8 pixels is set as a SURF feature point. Next, at each SURF feature point, a strong angle is obtained at an interval of 3 / π using a Haar wavelet. Then, using the direction of strong intensity as the original direction, the horizontal differential value dx and the vertical differential value dy are obtained using the Haar wavelet for each 2 × 2 pixel region as shown in the lower right diagram of FIG. , Four characteristic amounts of Σdx, Σ | dx |, Σdy, and Σ | dy | are calculated.

図２に示すようにＳＵＲＦ特徴点を中心とする８×８画素のブロック単位で特徴量を算出すると、８×８画素のブロックには２×２画素の領域が４×４個存在するので、１つのＳＵＲＦ特徴点において６４（＝４×４×４）次元の特徴量が得られる。なお、ＳＵＲＦの詳細については、例えば、H. Bay, A. Ess, T. Tuytelaars, and L. V. Gool, "Speeded-Up Robust Features (SURF)", Elsevier, vol. 110, Issue 3, p. 346-359, Jun. 2008を参照されたい。 As shown in FIG. 2, when the feature amount is calculated in units of 8 × 8 pixels with the SURF feature point at the center, there are 4 × 4 2 × 2 pixel regions in the 8 × 8 pixel block. For one SURF feature point, 64 (= 4 × 4 × 4) -dimensional feature amounts are obtained. For details of SURF, see, for example, H. Bay, A. Ess, T. Tuytelaars, and LV Gool, "Speeded-Up Robust Features (SURF)", Elsevier, vol. 110, Issue 3, p. 346- 359, Jun. 2008.

ＳＵＲＦ特徴量を用いる場合、動き推定部１１は、符号化対象フレームと参照フレームとの間でＳＵＲＦ特徴量のマッチングを行うことにより、ＳＵＲＦ特徴点ごとに動きベクトルを求める。具体的には、符号化対象フレーム内の任意のＳＵＲＦ特徴点Ａにおける特徴量に対して６４次元のユークリッド距離が最小となる参照フレーム内のＳＵＲＦ特徴点Ｂをマッチング点として抽出し、ＳＵＲＦ特徴点ＡＢ間の水平方向及び垂直方向の位置ずれ量を動きベクトルとして算出する。 When using the SURF feature, the motion estimating unit 11 obtains a motion vector for each SURF feature point by performing matching of the SURF feature between the encoding target frame and the reference frame. Specifically, a SURF feature point B in a reference frame in which a 64-dimensional Euclidean distance is minimum with respect to a feature amount at an arbitrary SURF feature point A in the encoding target frame is extracted as a matching point, and the SURF feature point is extracted. The amount of horizontal and vertical displacement between AB is calculated as a motion vector.

図３は、参照フレーム間の予測構造例を示す図である。動き推定部１１は、インター予測の参照フレームを、参照フレーム間の予測構造情報を基に判定する。図３に示す例では、Ｐ₁ピクチャを符号化対象フレームとすると、Ｉ₀ピクチャが参照フレームとなるため、Ｐ₁ピクチャとＩ₀ピクチャ位置の原画像間でＳＵＲＦ特徴量によるマッチングを行う。また、Ｂ₂ピクチャを符号化対象フレームとすると、Ｉ₀ピクチャとＰ₁ピクチャが参照フレームとなるため、Ｂ₂ピクチャとＩ₀ピクチャ位置、及びＢ₂ピクチャとＰ₁ピクチャ位置の原画像間で動き推定を行う。 FIG. 3 is a diagram illustrating an example of a prediction structure between reference frames. The motion estimating unit 11 determines a reference frame for inter prediction based on prediction structure information between the reference frames. In the example shown in FIG. 3, when the coding target frame P ₁ picture, because the I ₀ picture is a reference frame, performing matching by SURF feature quantity between the original image of the P ₁ picture and I ₀ picture position. Further, the B ₂ picture when the encoding target frame, since the I ₀ picture and P ₁ picture is a reference frame, B ₂ picture and I ₀ picture-position, and B ₂ between the picture and P ₁ picture-position of the original image Perform motion estimation.

なお、参照フレームの予測構造情報の入力は必須ではなく、参照フレームの予測構造情報を入力しないで、常に符号化対象フレームの１フレーム前（過去）の時間位置との原画像間で動き推定を行うようにしてもよい。 The input of the prediction structure information of the reference frame is not essential, and the motion estimation is always performed between the original image and the time position one frame before (past) the encoding target frame without inputting the prediction structure information of the reference frame. It may be performed.

図１に戻ると、予測ブロックサイズ設定部１２は、フレームを分割した符号化ブロックのサイズを示す符号化ブロックサイズ情報を入力する。そして、予測ブロックサイズ設定部１２は、所定サイズの符号化ブロックごとにＳＵＲＦ特徴点を計数し、計数したＳＵＲＦ特徴点の数に基づき予測ブロックのサイズを決定し、決定した予測ブロックのサイズを示す予測ブロックサイズ情報を外部の符号化装置（図示せず）に設定情報として出力する。予測ブロックのサイズは符号化ブロックと同一又は符号化ブロックを分割したサイズである。 Returning to FIG. 1, the predicted block size setting unit 12 inputs encoded block size information indicating the size of the encoded block obtained by dividing the frame. Then, the prediction block size setting unit 12 counts the SURF feature points for each coded block of a predetermined size, determines the size of the prediction block based on the counted number of SURF feature points, and indicates the determined size of the prediction block. The prediction block size information is output as setting information to an external encoding device (not shown). The size of the prediction block is the same as the coding block or the size obtained by dividing the coding block.

符号化装置は、符号化設定情報決定装置１により決定された予測ブロックサイズを設定し、予測ブロックごとに予測処理を行う。ＳＵＲＦ特徴点の数が多いほど当該符号化ブロックは空間高周波成分が多いと考えられるが、予測ブロックサイズ設定部１２はＳＵＲＦ特徴点の数が多い符号化ブロックほど予測ブロックのサイズを小さくするため、微小動きを高精度かつ高確度に検出することができる。 The encoding device sets the prediction block size determined by the encoding setting information determination device 1, and performs a prediction process for each prediction block. Although it is considered that the larger the number of SURF feature points is, the larger the number of SURF feature points is in the coding block, the higher the number of SURF feature points is. However, the prediction block size setting unit 12 reduces the size of the prediction block as the number of SURF feature points increases. A minute movement can be detected with high accuracy and high accuracy.

符号化方式がＡＶＣである場合、例えば符号化ブロックサイズを１６×１６画素とする。ＳＵＲＦ特徴点の総数をＮ_S、符号化ブロックの数をＮ_Bとすると、符号化ブロック内のＳＵＲＦ特徴点数が（Ｎ_S／Ｎ_B）×２以上であれば予測ブロックサイズ（マクロブロックサイズ）を４×４画素と設定し、（Ｎ_S／Ｎ_B）×２未満で（Ｎ_S／Ｎ_B）×１以上であれば予測ブロックサイズを８×８画素と設定し、（Ｎ_S／Ｎ_B）×１未満であれば予測ブロックサイズを１６×１６画素と設定する。 When the coding method is AVC, for example, the coding block size is set to 16 × 16 pixels. Assuming that the total number of SURF feature points is N _S and the number of coding blocks is N _B , if the number of SURF feature points in the coding block is (N _S / N _B ) × 2 or more, the prediction block size (macro block size) Is set to 4 × 4 pixels, and if less than (N _S / N _B ) × 2 and not less than (N _S / N _B ) × 1, the prediction block size is set to 8 × 8 pixels, and (N _S / N) _B ) If less than × 1, the prediction block size is set to 16 × 16 pixels.

また、符号化方式がＨＥＶＣである場合、符号化ブロックサイズをＣＵのサイズとする。例えばＳＵＲＦ特徴点の総数をＮ_S、ＣＵブロックの数をＮ_Bとすると、ＣＵ内のＳＵＲＦ特徴点数が（Ｎ_S／Ｎ_B）×２以上であれば予測ブロックサイズ（ＰＵサイズ）を４×４画素と設定し、（Ｎ_S／Ｎ_B）×２未満で（Ｎ_S／Ｎ_B）×１．５以上であれば予測ブロックサイズを８×８画素と設定し、（Ｎ_S／Ｎ_B）×１．５未満で（Ｎ_S／Ｎ_B）×１以上であれば予測ブロックサイズを１６×１６画素と設定し、（Ｎ_S／Ｎ_B）×１未満で（Ｎ_S／Ｎ_B）×０．５以上であれば予測ブロックサイズを３２×３２画素と設定し、（Ｎ_S／Ｎ_B）×０．５未満であれば予測ブロックサイズを６４×６４画素と設定する。なおＰＵサイズがＣＵサイズを超える場合は、ＰＵとＣＵを同じサイズとする。 When the encoding method is HEVC, the encoding block size is set to the size of the CU. For example, assuming that the total number of SURF feature points is N _S and the number of CU blocks is N _B , if the number of SURF feature points in the CU is (N _S / N _B ) × 2 or more, the prediction block size (PU size) is 4 × If it is set to 4 pixels, and if it is less than (N _S / N _B ) × 2 and (N _S / N _B ) × 1.5 or more, the prediction block size is set to 8 × 8 pixels, and (N _S / N _B) ) × 1.5 and (N _S / N _B ) × 1 or more, the prediction block size is set to 16 × 16 pixels, and if it is less than (N _S / N _B ) × 1, (N _S / N _B ) If it is not less than × 0.5, the prediction block size is set to 32 × 32 pixels, and if it is less than (N _S / N _B ) × 0.5, the prediction block size is set to 64 × 64 pixels. If the PU size exceeds the CU size, the PU and the CU have the same size.

上述したように、符号化設定情報決定装置１は、符号化ブロックごとにＳＵＲＦ特徴点を計数する。そして、ＳＵＲＦ特徴点の数が多い符号化ブロックほど空間高周波成分が多いとみなせるため、ＳＵＲＦ特徴点の数が多い符号化ブロックほど予測ブロックのサイズが小さくなるように決定する。したがって、符号化設定情報決定装置１により決定されたサイズの予測ブロックを用いて符号化することにより、多数のオブジェクトが大きい動きをする映像の符号化効率を向上させることが可能となる。 As described above, the coding setting information determination device 1 counts SURF feature points for each coding block. Then, it can be considered that a coded block having a larger number of SURF feature points has more spatial high-frequency components. Therefore, the size of the prediction block is determined to be smaller as a coded block has a larger number of SURF feature points. Therefore, by performing encoding using the prediction block having the size determined by the encoding setting information determination device 1, it is possible to improve the encoding efficiency of a video in which many objects move greatly.

（第２の実施形態）
つぎに、本発明の第２の実施形態に係る符号化設定情報決定装置について説明する。図４は、第２の実施形態に係る符号化設定情報決定装置の構成例を示すブロック図である。図４に示す例では、符号化設定情報決定装置２は、動き推定部１１と、予測ブロックサイズ設定部１２と、初期変位ベクトル設定部１３と、動き探索範囲設定部１４とを備える。第２の実施形態の符号化設定情報決定装置２は第１の実施形態の符号化設定情報決定装置１と比較して、初期変位ベクトル設定部１３及び動き探索範囲設定部１４を更に備える点が相違する。なお、本実施形態では符号化設定情報決定装置２は初期変位ベクトル設定部１３及び動き探索範囲設定部１４を備えるが、いずれか一方のみを備える構成としてもよい。 (Second embodiment)
Next, an encoding setting information determination device according to a second embodiment of the present invention will be described. FIG. 4 is a block diagram illustrating a configuration example of an encoding setting information determination device according to the second embodiment. In the example illustrated in FIG. 4, the encoding setting information determination device 2 includes a motion estimation unit 11, a prediction block size setting unit 12, an initial displacement vector setting unit 13, and a motion search range setting unit 14. The encoding setting information determining device 2 of the second embodiment is different from the encoding setting information determining device 1 of the first embodiment in that it further includes an initial displacement vector setting unit 13 and a motion search range setting unit 14. Different. In the present embodiment, the encoding setting information determination device 2 includes the initial displacement vector setting unit 13 and the motion search range setting unit 14, but may have a configuration including only one of them.

動き推定部１１は、第１の実施形態と同様に、映像を入力してフレーム間でＳＵＲＦ特徴点のマッチングを行うことにより、ＳＵＲＦ特徴点ごとに動きベクトルを生成する。そして動きベクトル情報を予測ブロックサイズ設定部１２、初期変位ベクトル設定部１３、及び動き探索範囲設定部１４に出力する。 As in the first embodiment, the motion estimating unit 11 generates a motion vector for each SURF feature point by inputting an image and performing matching of the SURF feature points between frames. Then, the motion vector information is output to the prediction block size setting unit 12, the initial displacement vector setting unit 13, and the motion search range setting unit 14.

予測ブロックサイズ設定部１２は、第１の実施形態と同様に、所定サイズの符号化ブロックごとにＳＵＲＦ特徴点を計数し、計数したＳＵＲＦ特徴点の数に基づき、符号化ブロック以下のサイズの予測ブロックサイズを決定する。そして予測ブロックサイズ情報を外部の符号化装置に設定情報として出力するとともに、初期変位ベクトル設定部１３、及び動き探索範囲設定部１４に出力する。 As in the first embodiment, the prediction block size setting unit 12 counts SURF feature points for each coding block of a predetermined size, and based on the counted number of SURF feature points, predicts a size of the coding block or less. Determine the block size. Then, the prediction block size information is output as setting information to an external encoding device, and is also output to the initial displacement vector setting unit 13 and the motion search range setting unit 14.

初期変位ベクトル設定部１３は、動き推定部１１により生成された動きベクトル情報、及び予測ブロックサイズ設定部１２により生成された予測ブロックサイズ情報を入力して、各予測ブロック内に含まれる全ての動きベクトルの平均値又は中央値を算出し、動き探索の初期変位ベクトルとする。そして、初期変位ベクトル設定部１３は、初期変位ベクトルを示す情報を外部の符号化装置に設定情報として出力する。なお、予測ブロックに対応する動きベクトルが存在しない（すなわち、予測ブロック内にＳＵＲＦ特徴点が存在しない）場合には、その旨を示す情報を外部に出力する。 The initial displacement vector setting unit 13 receives the motion vector information generated by the motion estimating unit 11 and the predicted block size information generated by the predicted block size setting unit 12, and inputs all the motions included in each predicted block. An average value or a median value of the vectors is calculated and used as an initial displacement vector for motion search. Then, the initial displacement vector setting unit 13 outputs information indicating the initial displacement vector to an external encoding device as setting information. If there is no motion vector corresponding to the prediction block (that is, no SURF feature point exists in the prediction block), information indicating that fact is output to the outside.

図５は、初期変位ベクトル設定部１３の処理を説明する図である。図５に示すように予測ブロックＰ内に３つのＳＵＲＦ特徴点Ａ，Ｂ，Ｃが存在した場合、初期変位ベクトル設定部１３は、ＳＵＲＦ特徴点Ａを始点とする動きベクトル、ＳＵＲＦ特徴点Ｂを始点とする動きベクトル、及びＳＵＲＦ特徴点Ｃを始点とする動きベクトルの平均値又は中央値を初期変位ベクトルとして算出する。 FIG. 5 is a diagram illustrating the process of the initial displacement vector setting unit 13. As shown in FIG. 5, when three SURF feature points A, B, and C exist in the prediction block P, the initial displacement vector setting unit 13 sets the motion vector starting from the SURF feature point A and the SURF feature point B An average value or a median value of the motion vector starting from the starting point and the motion vector starting from the SURF feature point C is calculated as the initial displacement vector.

動き探索範囲設定部１４は、動き推定部１１により生成された動きベクトル情報、及び予測ブロックサイズ設定部１２により生成された予測ブロックサイズ情報を入力して、予測ブロックごとに、対応する動きベクトル（予測ブロックに含まれる特徴点を始点とする動きベクトル）を用いて動き探索範囲を決定する。そして、動き探索範囲設定部１４は、動き探索範囲を示す情報を外部の符号化装置に設定情報として出力する。なお、予測ブロックに対応する動きベクトルが存在しない（すなわち、予測ブロック内にＳＵＲＦ特徴点が存在しない）場合には、その旨を示す情報を外部に出力する。 The motion search range setting unit 14 inputs the motion vector information generated by the motion estimating unit 11 and the predicted block size information generated by the predicted block size setting unit 12, and for each predicted block, A motion search range is determined using a motion vector starting from a feature point included in the prediction block). Then, the motion search range setting unit 14 outputs information indicating the motion search range as setting information to an external encoding device. If there is no motion vector corresponding to the prediction block (that is, no SURF feature point exists in the prediction block), information indicating that fact is output to the outside.

具体的には、動き探索範囲設定部１４は、初期変位ベクトル設定部１３が初期変位ベクトルを動きベクトルの平均値とした場合は、予測ブロックに対応する全ての動きベクトルの始点を予測ブロックの中心点に移動させたときの、該動きベクトルの終点を含む範囲を、当該予測ブロックの初期変位ベクトルに対する動き探索範囲とする。また、動き探索範囲設定部１４は、初期変位ベクトル設定部１３が初期変位ベクトルを動きベクトルの中央値とした場合は、予測ブロックに対応する動きベクトルのうち大きさが第１四分位数から第３四分位数の動きベクトルの始点を予測ブロックの中心点に移動させたときの、該動きベクトルの終点を含む範囲を、当該予測ブロックの初期変位ベクトルに対する動き探索範囲情報とする。ここで、第１四分位数は、データを小さい順に並べたときの、小さいほうから数えて全体の１／４の順位の値であり、第３四分位数は、データを小さい順に並べたときの、小さいほうから数えて全体の３／４の順位の値である。 Specifically, when the initial displacement vector setting unit 13 sets the initial displacement vector as the average value of the motion vectors, the motion search range setting unit 14 sets the start points of all the motion vectors corresponding to the predicted block to the center of the predicted block. The range including the end point of the motion vector when moved to the point is set as the motion search range for the initial displacement vector of the prediction block. In addition, when the initial displacement vector setting unit 13 sets the initial displacement vector as the median of the motion vectors, the motion search range setting unit 14 sets the size of the motion vector corresponding to the prediction block from the first quartile to the first quartile. The range including the end point of the motion vector when the start point of the motion vector of the third quartile is moved to the center point of the prediction block is defined as motion search range information for the initial displacement vector of the prediction block. Here, the first quartile is the value of the order of 1/4 of the whole when counting the data in ascending order, and the third quartile is the value of sorting the data in ascending order. Is the value of the order of 3/4 of the whole, counting from the smaller one.

上述したように、符号化設定情報決定装置２は、符号化設定情報決定装置１と同様に、符号化ブロックごとにＳＵＲＦ特徴点を計数し、ＳＵＲＦ特徴点の数が多い符号化ブロックほど予測ブロックのサイズが小さくなるように決定するため、符号化設定情報決定装置２により決定されたサイズの予測ブロックを用いて符号化することにより、多数のオブジェクトが大きい動きをする映像の符号化効率を向上させることが可能となる。 As described above, the coding setting information determination device 2 counts the SURF feature points for each coding block, as in the case of the coding setting information determination device 1. Is encoded using a prediction block of the size determined by the encoding setting information determination device 2 in order to reduce the size of the image, thereby improving the encoding efficiency of a video in which a large number of objects move greatly. It is possible to do.

また、符号化設定情報決定装置２は、予測ブロック内に含まれる動きベクトルを用いて動き探索の初期変位ベクトルを決定する。したがって、符号化設定情報決定装置２により決定された初期変位ベクトルを用いることにより、大きな動きであってもブロックマッチングによる動き検出を行うことができ、動き推定の精度を向上させることが可能となる。 In addition, the coding setting information determination device 2 determines an initial displacement vector for motion search using a motion vector included in a prediction block. Therefore, by using the initial displacement vector determined by the encoding setting information determination device 2, even if the motion is large, motion detection can be performed by block matching, and the accuracy of motion estimation can be improved. .

また、符号化設定情報決定装置２は、予測ブロック内に含まれる動きベクトルを用いて動き探索範囲を決定する。したがって、符号化設定情報決定装置２により決定された動き探索範囲を用いてブロックマッチングを行うことにより、動き探索範囲を絞ることができ、動き検出の速度を向上させることが可能となる。 Further, the coding setting information determination device 2 determines a motion search range using a motion vector included in the prediction block. Therefore, by performing block matching using the motion search range determined by the coding setting information determination device 2, the motion search range can be narrowed, and the speed of motion detection can be improved.

なお、上述した符号化設定情報決定装置１又は２として機能させるためにコンピュータを好適に用いることができ、そのようなコンピュータは、符号化設定情報決定装置１又は２の各機能を実現する処理内容を記述したプログラムを該コンピュータの記憶部に格納しておき、該コンピュータのＣＰＵによってこのプログラムを読み出して実行させることで実現することができる。なお、このプログラムは、コンピュータ読取り可能な記録媒体に記録可能である。 Note that a computer can be suitably used to function as the above-described encoding setting information determining device 1 or 2, and such a computer performs processing contents for realizing each function of the encoding setting information determining device 1 or 2. Is stored in the storage unit of the computer, and the program is read out and executed by the CPU of the computer. This program can be recorded on a computer-readable recording medium.

上述の実施形態は代表的な例として説明したが、本発明の趣旨及び範囲内で、多くの変更及び置換ができることは当業者に明らかである。したがって、本発明は、上述の実施形態によって制限するものと解するべきではなく、特許請求の範囲から逸脱することなく、種々の変形や変更が可能である。例えば、実施形態に記載の複数の構成ブロックを１つに組み合わせたり、あるいは１つの構成ブロックを分割したりすることが可能である。 Although the above embodiments have been described as representative examples, it will be apparent to those skilled in the art that many changes and substitutions can be made within the spirit and scope of the present invention. Therefore, the present invention should not be construed as being limited by the above-described embodiment, and various modifications and changes can be made without departing from the scope of the claims. For example, a plurality of constituent blocks described in the embodiment can be combined into one, or one constituent block can be divided.

１，２符号化設定情報決定装置
１１動き推定部
１２予測ブロックサイズ設定部
１３初期変位ベクトル設定部
１４動き探索範囲設定部 1, 2 coding setting information determination device 11 motion estimation unit 12 prediction block size setting unit 13 initial displacement vector setting unit 14 motion search range setting unit

Claims

An encoding setting information determination device that determines setting information when encoding a video,
A motion estimator that receives a video and generates a motion vector for each SURF feature point by performing inter-frame matching of SURF (Speeded-Up Robust Features) feature points;
A prediction block size setting unit that counts SURF feature points for each coding block and determines a prediction block having the same size as the coding block or a size obtained by dividing the coding block based on the counted number of SURF feature points,
An initial displacement vector setting unit that calculates an average value or a median value of motion vectors for each SURF feature point for each prediction block, and determines an initial displacement vector for motion search for each prediction block;
A motion search range setting unit that determines a motion search range using a motion vector for each SURF feature point for each of the prediction blocks;
With
The prediction block size setting unit determines the size of the prediction block to be smaller for a coded block having a larger number of SURF feature points ,
The motion search range setting unit,
When the initial displacement vector is the average value of the motion vectors, when moving the start points of all the motion vectors corresponding to the prediction block to the center point of the prediction block, the range including the end point of the motion vector, The motion search range for the initial displacement vector of the prediction block,
When the initial displacement vector is the median of the motion vectors, the start point of the motion vector whose magnitude is from the first quartile to the third quartile among the motion vectors corresponding to the prediction block is An encoding setting information determining device , wherein a range including an end point of the motion vector when the motion vector is moved to a center point is set as a motion search range for an initial displacement vector of the prediction block .

A program for causing a computer to function as the encoding setting information determination device according to claim 1 .