JP2009207071A

JP2009207071A - Method, apparatus and program for estimating motion estimation accuracy and computer-readable recording medium with the program recorded thereon

Info

Publication number: JP2009207071A
Application number: JP2008049750A
Authority: JP
Inventors: Yukihiro Bando; 幸浩坂東; Kazuya Hayase; 和也早瀬; Masayuki Takamura; 誠之高村; Kazuto Kamikura; 一人上倉; Yoshiyuki Yajima; 由幸八島
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2008-02-29
Filing date: 2008-02-29
Publication date: 2009-09-10
Anticipated expiration: 2028-02-29
Also published as: JP4824712B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a new technology for achieving efficient encoding to a high frame rate video. <P>SOLUTION: Four model parameters are input which are described in a function to be used for estimating prediction error power in inter-frame prediction when performing encoding at a certain frame rate and on the basis of these model parameters, a secondary function is determined, with the inverse of the frame rate as a parameter, that is used for estimating motion estimation accuracy to minimize the amount of codes to be generated by encoding for fixing a distortion amount. A value of the frame rate to be processed is then substituted to the secondary function, thereby estimating motion estimation accuracy. An analytic theoretical model is constructed which represents a relationship between the frame rate and motion estimation accuracy, and suitable motion estimation accuracy for each frame rate is estimated using the model, thereby achieving efficient encoding to a high frame rate video. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、高フレームレート映像に対する効率的な符号化を実現する映像符号化で用いられる動き推定精度推定方法およびその装置と、その動き推定精度推定方法の実現に用いられる動き推定精度推定プログラムおよびそのプログラムを記録したコンピュータ読み取り可能な記録媒体とに関する。 The present invention relates to a motion estimation accuracy estimation method and apparatus used in video encoding that realizes efficient encoding for high frame rate video, a motion estimation accuracy estimation program used to realize the motion estimation accuracy estimation method, and The present invention relates to a computer-readable recording medium on which the program is recorded.

近年、臨場感あふれる大画面のスポーツ映像やデジタルシネマに代表される超高画質映像への期待が高まっている。これを受けて、映像の高画質化に関する研究が精力的に行われている。 In recent years, there are growing expectations for super-high-quality images such as large-screen sports images and digital cinema that are full of realism. In response to this, research on high-quality video has been vigorously conducted.

超高画質映像の実現には次の四要素が必要である。すなわち、空間解像度、画素値深度、色再現性、時間解像度である。これを受けて、前者の三要素については、デジタルシネマ等の応用およびナチュラルビジョンプロジェクトにおいて検討が進められている。 The following four elements are necessary to realize super high-quality video. That is, spatial resolution, pixel value depth, color reproducibility, and temporal resolution. In response, the former three elements are being studied in applications such as digital cinema and natural vision projects.

しかし、被写体の自然な動きを表現するために不可欠な時間解像度の向上、即ち、映像の高フレームレート化については、充分な検討がなされていない。 However, sufficient studies have not been made on improving the temporal resolution, that is, increasing the frame rate of an image, which is indispensable for expressing the natural movement of the subject.

Ｓpillmannらによれば、網膜の出力細胞である神経節細胞が出力するパルス数の上限は毎秒３００〜４００個程度であるとする生理学的な知見が示されている。このため、人の視覚系は最も短くて１／１５０〜１／２００秒程度までの発光の違いを知覚できると推察される。これは、知覚可能なフレームレートの検知限が１５０〜２００［フレーム／秒］であることを意味する。現行映像のフレームレートである３０，６０［フレーム／秒］は、フリッカーの検知限から定められたものであり、自然な動きを表現するに十分な値ではない。 According to Spillmann et al., Physiological knowledge is shown that the upper limit of the number of pulses output by ganglion cells, which are output cells of the retina, is about 300 to 400 pulses per second. For this reason, it is presumed that the human visual system is the shortest and can perceive the difference in light emission up to 1/150 to 1/200 second. This means that the perceivable frame rate detection limit is 150 to 200 [frames / second]. The current video frame rate of 30, 60 [frame / second] is determined from the flicker detection limit, and is not a sufficient value to express natural motion.

一方、映像の超高画質化はデータ量の爆発的な増加を招くため、ネットワークの帯域制限等の理由により符号量が制限される場合には、効率的な符号化方法が必要となる。 On the other hand, since the super high image quality of the video causes an explosive increase in the amount of data, an efficient encoding method is required when the code amount is limited due to network bandwidth limitations or the like.

高フレームレート映像の場合、フレーム間相関が増加することから、動き補償によるフレーム間予測が効果的である。動き補償のパラメータの一つに動き推定精度がある。例えば、ＭＰＥＧ−２準拠の符号化器における動き推定精度は１／２画素精度となる。 In the case of a high frame rate video, since inter-frame correlation increases, inter-frame prediction by motion compensation is effective. One parameter of motion compensation is motion estimation accuracy. For example, the motion estimation accuracy in an MPEG-2 compliant encoder is ½ pixel accuracy.

高フレームレート映像の場合、フレーム間隔が短くなることから、変移量に高い精度が求められるため、フレームレートと動き推定精度との関係を把握することが重要となる。直感的には、フレームレートの増加に伴い、高い動き推定精度が必要となることが予想される。 In the case of a high frame rate video, since the frame interval is shortened, high accuracy is required for the shift amount. Therefore, it is important to grasp the relationship between the frame rate and the motion estimation accuracy. Intuitively, it is expected that higher motion estimation accuracy is required as the frame rate increases.

しかし、従来の研究では、フレームレートの影響は考慮されておらず、フレームレートと動き推定精度との関係についての定量的な性質は把握されていない（例えば、非特許文献１参照）。このため、アドホックな方法で動き推定精度を設定することになり、動き推定精度が不足した場合、符号化効率が低下する。 However, in the conventional research, the influence of the frame rate is not considered, and the quantitative property regarding the relationship between the frame rate and the motion estimation accuracy is not grasped (for example, refer to Non-Patent Document 1). For this reason, the motion estimation accuracy is set by an ad hoc method. When the motion estimation accuracy is insufficient, the coding efficiency is lowered.

一方、設定可能な範囲における動き推定精度に対して、総当りで符号化を行い、最も符号化効率の高い動き推定精度を選択する方法もある。しかし、この場合には、符号化処理の時間が増大してしまう。
J.Ribas-Corbera and D.L.Neuhoff. Optimizing motion-vector accuracy in block-based video coding. IEEE CSVT, Vol.11, No.4, pp.497-511, 2001. On the other hand, there is also a method in which the motion estimation accuracy in a settable range is encoded with brute force and the motion estimation accuracy with the highest encoding efficiency is selected. However, in this case, the encoding processing time increases.
J.Ribas-Corbera and DLNeuhoff.Optimizing motion-vector accuracy in block-based video coding.IEEE CSVT, Vol.11, No.4, pp.497-511, 2001.

前述したように、被写体の自然な動きを表現するためには、映像の高フレームレート化を実現する必要がある。 As described above, in order to express the natural movement of the subject, it is necessary to realize a high frame rate of the video.

映像の高フレームレート化を実現するためには、フレーム間隔が短くなることから、変移量に高い精度が求められるため、フレームレートと動き推定精度との関係を把握することが重要となる。 In order to realize a high frame rate of video, since the frame interval is shortened, high accuracy is required for the shift amount, and therefore it is important to grasp the relationship between the frame rate and the motion estimation accuracy.

しかし、従来の研究では、フレームレートと動き推定精度との関係についての定量的な性質は把握されておらず、アドホックな方法で動き推定精度を設定するようにしているのが実情である。 However, in the conventional research, the quantitative property about the relationship between the frame rate and the motion estimation accuracy is not grasped, and the actual situation is that the motion estimation accuracy is set by an ad hoc method.

これから、従来技術に従っていると、高フレームレート映像を符号化するときに、動き推定精度が不足した場合には、符号化効率が低下するという問題があった。 Thus, according to the prior art, when encoding a high frame rate video, if the motion estimation accuracy is insufficient, there is a problem that the encoding efficiency decreases.

本発明はかかる事情に鑑みてなされたものであって、高フレームレート映像に対する符号化処理において、フレームレートと動き推定精度との関係を表す解析的な理論モデルを構築して、その理論モデルを利用してフレームレート毎の適切な動き推定精度を推定することで、高フレームレート映像に対する効率的な符号化を実現する新たな動き推定精度推定技術を確立することを目的とする。 The present invention has been made in view of such circumstances. In an encoding process for a high frame rate video, an analytical theoretical model representing the relationship between the frame rate and motion estimation accuracy is constructed, and the theoretical model is It is an object of the present invention to establish a new motion estimation accuracy estimation technique that realizes efficient encoding for a high frame rate video by estimating an appropriate motion estimation accuracy for each frame rate.

この目的を達成するために、本発明の動き推定精度推定装置は、非整数画素精度の動き推定を行うフレーム間予測を伴う映像符号化で用いられて、映像を符号化する際に、歪み量を一定とする符号化の条件下において、その符号化で発生することになる符号量を最小化する動き推定精度を推定することを実現するために、（イ）フレームレートを入力するフレームレート入力手段と、（ロ）あるフレームレートで符号化を行う場合におけるフレーム間予測の予測誤差電力の推定に用いられる関数に記述される４個のモデルパラメータを入力するモデルパラメータ入力手段と、（ハ）モデルパラメータ入力手段の入力した４個のモデルパラメータに基づいて、歪み量を一定とする符号化の条件下において、その符号化で発生することになる符号量を最小化する動き推定精度の推定に用いられる、フレームレートの逆数を変数とする二次関数を決定する決定手段と、（ニ）決定手段の決定した二次関数に対して、フレームレート入力手段の入力したフレームレートの値を代入することで、フレームレート入力手段の入力したフレームで符号化を行う場合に発生することになる符号量を最小化する動き推定精度を推定する推定手段と、（ホ）推定手段の推定した動き推定精度を出力する出力手段とを備えるように構成する。 In order to achieve this object, the motion estimation accuracy estimation apparatus of the present invention is used in video encoding with inter-frame prediction that performs motion estimation with non-integer pixel accuracy. In order to realize the estimation of motion estimation accuracy that minimizes the amount of code that will be generated in the encoding under the condition that the encoding is constant, (a) the frame rate input to input the frame rate And (b) model parameter input means for inputting four model parameters described in a function used for estimation of prediction error power of inter-frame prediction when encoding is performed at a certain frame rate, and (c) Based on the four model parameters input by the model parameter input means, a code that is generated by the encoding under the condition of the encoding with a constant distortion amount Determining means for determining a quadratic function using a reciprocal of the frame rate as a variable, and (d) a frame rate input means for the quadratic function determined by the determining means. Substituting the value of the input frame rate to estimate the motion estimation accuracy for minimizing the amount of code that will occur when encoding is performed on the frame input by the frame rate input means; And (e) output means for outputting the motion estimation accuracy estimated by the estimation means.

以上の各処理手段が動作することで実現される本発明の動き推定精度推定方法はコンピュータプログラムでも実現できるものであり、このコンピュータプログラムは、適当なコンピュータ読み取り可能な記録媒体に記録して提供されたり、ネットワークを介して提供され、本発明を実施する際にインストールされてＣＰＵなどの制御手段上で動作することにより本発明を実現することになる。 The motion estimation accuracy estimation method of the present invention realized by the operation of each of the above processing means can also be realized by a computer program, and this computer program is provided by being recorded on an appropriate computer-readable recording medium. Alternatively, the present invention is realized by being provided via a network, installed when the present invention is carried out, and operating on a control means such as a CPU.

このように構成される本発明の動き推定精度推定装置では、処理対象となるフレームレートを入力すると、続いて、例えば、（１）参照フレームにおける隣接画素間の差分値の二乗和から導出されるモデルパラメータα₁と、（２）フレーム間予測誤差に対するテイラー展開の近似誤差と参照フレームにおける隣接画素間差分値との積の和から導出されるモデルパラメータα₂と、（３）原信号に重畳している雑音と参照フレームにおける補間誤差との和の二乗和の近似値から導出されるモデルパラメータα₃と、（４）フレーム間予測誤差に対するテイラー展開の近似誤差の二乗和から導出されるモデルパラメータα₄という４個のモデルパラメータを入力する。 In the motion estimation accuracy estimation apparatus of the present invention configured as described above, when the frame rate to be processed is input, subsequently, for example, (1) derived from the sum of squares of the difference values between adjacent pixels in the reference frame. Model parameter α ₁ , (2) model parameter α ₂ derived from the sum of the product of the approximation error of Taylor expansion for inter-frame prediction error and the difference value between adjacent pixels in the reference frame, and (3) superposition on the original signal Model parameter α ₃ derived from the approximate value of the sum of squares of the sum of the noise and the interpolation error in the reference frame, and (4) a model derived from the approximate sum of squares of the Taylor expansion approximation error for the inter-frame prediction error to enter the four model parameters of that parameter α _4.

ここで、４個のモデルパラメータを入力してから、処理対象となるフレームレートを入力することもある。 Here, after the four model parameters are input, the frame rate to be processed may be input.

続いて、フレームレートをＦで表し、動き補償の単位となるセグメントの数をＭで表し、係数をκで表すならば、入力した４個のモデルパラメータα_1,α_2,α_3,α₄に基づいて、例えば、
（α₁Ｆ^-2＋α₂Ｆ^-1＋α₃＋α₄）／κα₁(Ｍ−１）
という関数を決定することで、歪み量を一定とする符号化の条件下において、その符号化で発生することになる符号量を最小化する動き推定精度の推定に用いられる、フレームレートの逆数を変数とする二次関数を決定する。 Subsequently, if the frame rate is represented by F, the number of segments as a unit of motion compensation is represented by M, and the coefficient is represented by κ, the input four model parameters α _1, α _2, α _3, α ₄ For example, based on
(Α ₁ F ⁻² + α ₂ F ⁻¹ + α ₃ + α ₄ ) / κα ₁ (M−1)
By determining the function, the reciprocal of the frame rate used to estimate the motion estimation accuracy that minimizes the amount of code that will be generated by the coding under the condition of coding with a constant amount of distortion. Determine a quadratic function as a variable.

この二次関数が、フレームレートとそのフレームレートで符号化を行う場合に発生することになる符号量を最小化する動き推定精度との関係について記述するモデルとなる。 This quadratic function is a model that describes the relationship between the frame rate and the motion estimation accuracy that minimizes the amount of code that occurs when encoding is performed at that frame rate.

これから、続いて、決定した二次関数に対して、入力したフレームレートの値を代入することで、前記入力したフレームで符号化を行う場合に発生することになる符号量を最小化する動き推定精度を推定して、その推定した動き推定精度を出力する。 Next, motion estimation that minimizes the amount of code that will occur when encoding is performed on the input frame by substituting the value of the input frame rate for the determined quadratic function. The accuracy is estimated, and the estimated motion estimation accuracy is output.

本発明によれば、フレームレートが与えられた場合に、歪み量を一定（量子化ステップ幅を固定）とした場合の符号量を最小化する動き推定精度を推定することができるようになる。 According to the present invention, it is possible to estimate the motion estimation accuracy that minimizes the code amount when the distortion amount is constant (quantization step width is fixed) when the frame rate is given.

すなわち、本発明によれば、フレームレートに応じた適切な動き推定精度を設定することが可能となり、動き推定精度の不足に伴う符号化効率の低下を回避できるようになる。また、一般に、高い動き推定精度が必要とされる高フレームレート映像の符号化において、適切な動き推定精度を予め見積もることが可能となり、符号化処理を高速化できることが見込める。 That is, according to the present invention, it is possible to set an appropriate motion estimation accuracy according to the frame rate, and it is possible to avoid a decrease in coding efficiency due to a lack of motion estimation accuracy. In general, in encoding of a high frame rate video that requires high motion estimation accuracy, it is possible to estimate appropriate motion estimation accuracy in advance, and it is expected that the encoding process can be speeded up.

以下、実施の形態に従って本発明を詳細に説明する。 Hereinafter, the present invention will be described in detail according to embodiments.

前述したように、本発明では、高フレームレート映像に対する符号化処理において、フレームレートと動き推定精度との関係を表す解析的な理論モデルを構築して、その理論モデルを利用してフレームレート毎の適切な動き推定精度を推定することで、高フレームレート映像に対する効率的な符号化を実現するものである。 As described above, in the present invention, in an encoding process for a high frame rate video, an analytical theoretical model representing the relationship between the frame rate and the motion estimation accuracy is constructed, and the frame model is used for each frame rate using the theoretical model. By estimating the appropriate motion estimation accuracy, efficient coding for a high frame rate video is realized.

次に、まず最初に、『［１］フレームレートとフレーム間予測誤差電力との関係』で、フレームレートとフレーム間予測誤差電力との関係について説明し、続いて、『［２］パラメータ推定』で、フレームレートとフレーム間予測誤差電力との関係に記述されるパラメータの推定について説明し、続いて、『［３］変移量推定誤差とフレームレートとの関係』で、フレームレートとフレーム間予測誤差電力との関係の導出にあたって必要となる変移量推定誤差とフレームレートとの関係について説明し、最後に、『［４］動き推定精度の最適化』で、フレームレートと動き推定精度との関係を表す解析的な理論モデルを構築して、その理論モデルを利用してフレームレート毎の適切な動き推定精度を推定する構成について説明する。 Next, first, “[1] Relationship between frame rate and inter-frame prediction error power” explains the relationship between frame rate and inter-frame prediction error power, followed by “[2] Parameter estimation”. The parameter estimation described in the relationship between the frame rate and the inter-frame prediction error power will be described, and then, “[3] Relationship between the displacement estimation error and the frame rate” will be described. The relationship between the shift amount estimation error and the frame rate necessary for deriving the relationship with the error power will be explained. Finally, in [4] Optimization of motion estimation accuracy, the relationship between the frame rate and motion estimation accuracy. A configuration is described in which an analytical theoretical model representing the above is constructed and appropriate motion estimation accuracy for each frame rate is estimated using the theoretical model.

［１］フレームレートとフレーム間予測誤差電力との関係
フレームレートとフレーム間予測誤差電力との関係を解析的に導出する。ここでは、簡単のため１次元信号を例にとり説明する。 [1] Relationship between frame rate and inter-frame prediction error power The relationship between frame rate and inter-frame prediction error power is analytically derived. Here, for simplicity, a one-dimensional signal will be described as an example.

Ｘサンプルを有する時刻ｔの位置ｘにおける観測信号をｇ_t（ｘ）（０≦ｘ≦Ｘ−１）として、ＬサンプルからなるセグメントＢ［ｉ］（ｉ＝１，２，... ，Ｘ／Ｌ）を単位として動き補償（推定変位量＾ｄ［ｉ］）を行い、そのセグメント内の動き補償後の予測誤差を評価する。 An observation signal at position x at time t having X samples is represented by g _t (x) (0 ≦ x ≦ X−1), and segment B [i] (i = 1, 2,. / L) as a unit, motion compensation (estimated displacement ^ d [i]) is performed, and the prediction error after motion compensation in the segment is evaluated.

ここで、観測信号ｇ_t（ｘ）は、画像信号ｆ_t（ｘ）に雑音ｎ_t（ｘ）が重畳して構成されているものとする。すなわち、
ｇ_t（ｘ）＝ｆ_t（ｘ）＋ｎ_t（ｘ）
とする。 Here, it is assumed that the observation signal g _t (x) is configured by superimposing noise n _t (x) on the image signal f _t (x). That is,
g _t (x) = _ft (x) + _nt (x)
And

なお、以下の表記において、「＾Ｘ」（Ｘは文字）における記号＾は、「Ｘ」の上に付く記号を示している。また、以下では、位置ｘにおける真の変位量をｄ（ｘ）としている。 In the following notation, the symbol ^ in “^ X” (where X is a letter) indicates a symbol attached to “X”. In the following, the true displacement amount at the position x is d (x).

推定変移量＾ｄ［ｉ］が小数画素精度の場合、参照画素値を補間処理により生成する。補間対象の信号をｓ_t（ｘ）とすると、補間処理は下記の式（１）で表されるフィルタリングとなる。なお、以下では、下記の式（１）式の左辺の演算子を「Ｉ」で表記することがある。 When the estimated shift amount ^ d [i] has decimal pixel accuracy, a reference pixel value is generated by interpolation processing. When a signal to be interpolated and s _t (x), the interpolation processing is filtering represented by the following formula (1). In the following, the operator on the left side of the following formula (1) may be expressed as “I”.

ここで、δは動き推定の精度である。例えば、ＭＰＥＧ−２準拠の符号化器の場合、δ＝１／２に設定される。また、補間係数ψ_j［ｋ］は下記の式（２）を満たすものとする。 Here, δ is the accuracy of motion estimation. For example, in the case of an MPEG-2 compliant encoder, δ = 1/2 is set. The interpolation coefficient ψ _j [k] satisfies the following expression (2).

前述の補間フィルタを用いて、参照位置の画素値は、Ｉ（〜ｇ_t-1（ｘ＋＾ｄ［ｉ］）^(q)）として近似される。 Using the above-described interpolation filter, the pixel value at the reference position is approximated as I (˜gt ₁ (x + ^ d [i]) ^(q) ).

なお、以下の表記において、「〜Ｘ」（Ｘは文字）における記号〜は、「Ｘ」の上に付く記号を示している。 In the following notation, the symbol “˜X” (where X is a letter) indicates a symbol attached to “X”.

ここで、〜ｇ_t-1（ｘ＋＾ｄ［ｉ］）^(q)は、ｇ_t-1（ｘ＋＾ｄ［ｉ］）に対して、量子化ステップ幅ｑでの量子化を行った場合の復号信号であり、下記の式（３）のように表せる。 Here, ~ gt _-1 (x + ^ d [i]) ^(q) is a case where quantization with a quantization step width q is performed on gt _-1 (x + ^ d [i]). And is expressed as the following equation (3).

この式（３）におけるｖ_t-1（ｘ＋＾ｄ［ｉ］）ｑは、量子化に伴い発生する誤差を表しており、ｖ_t-1（ｘ＋＾ｄ［ｉ］）は信号ｆ_t-1（ｘ＋＾ｄ［ｉ］）に依存するパラメータである。 In this equation (3), v _t-1 (x + ^ d [i]) q represents an error caused by quantization, and v _t-1 (x + ^ d [i]) represents the signal f _{t- 1 This} parameter depends on (x + ^ d [i]).

参照位置の画素値ｆ_t-1（ｘ＋＾ｄ［ｉ］）と補間により生成した値Ι（〜ｇ_t-1（ｘ＋＾ｄ［ｉ］）^(q)）との差は、前述の式（３）を使い、下記の式（４）のように表せる。 The difference between the pixel value f _t-1 (x + ^ d [i]) at the reference position and the value Ι (˜gt ₁ (x + ^ d [i]) ^(q) ) generated by interpolation is the above-described equation. Using (3), it can be expressed as the following equation (4).

ここで、ｅ_t-1 ^int（ｘ＋＾ｄ［ｉ］）は、
ｅ_t-1 ^int（ｘ＋＾ｄ［ｉ］）
＝ｆ_t-1（ｘ＋＾ｄ［ｉ］）−Ｉ（ｆ_t-1（ｘ＋＾ｄ［ｉ］））
の通りである。 Here, e _t-1 ^int (x + ^ d [i]) is
e _t-1 ^int (x + ^ d [i])
= _Ft-1 (x + ^ d [i])-I ( _ft-1 (x + ^ d [i]))
It is as follows.

このとき、セグメント内の動き補償後の予測誤差は、“ｇ_t（ｘ）＝ｆ_t（ｘ）＋ｎ_t（ｘ）”と前述の式（３）とを使うことで、下記の式（５）のように表せる。 In this case, the prediction error after motion compensation in a _{segment, "g t (x) =} f t (x) + n t (x)" and the above equations (3) and by using the equation below (5 ).

ここで、φ（ｘ）は、テイラー展開の２次以降の項である。 Here, φ (x) is the second and subsequent terms of Taylor expansion.

また、Ｎ（ｘ）は、参照フレームにおける補間誤差Ｎ_t-1 ^ref（ｘ）と原信号に重畳している雑音ｎ_t（ｘ）との和であり、
Ｎ（ｘ）＝ｎ_t（ｘ）−Ｉ（ｎ_t-1（ｘ＋＾ｄ［ｉ］））
＋ｅ_t-1 ^int（ｘ＋＾ｄ［ｉ］）−Ｉ（ｖ_t-1（ｘ＋＾ｄ［ｉ］））ｑ
のように表せる。 N (x) is the sum of the interpolation error N _t-1 ^ref (x) in the reference frame and the noise n _t (x) superimposed on the original signal,
N (x) = _nt (x) -I ( _nt-1 (x + ^ d [i]))
+ _Et-1 ^int (x + ^ d [i])-I (vt _-1 (x + ^ d [i])) q
It can be expressed as

また、ζ［ｉ］（ｘ）は、推定変移量＾ｄ［ｉ］とｘにおける真の変移量ｄ［ｉ］（ｘ）との推定誤差であり、
ζ［ｉ］（ｘ）＝ｄ［ｉ］（ｘ）−＾ｄ［ｉ］
のように表せる。 Further, ζ [i] (x) is an estimation error between the estimated shift amount ^ d [i] and the true shift amount d [i] (x) at x.
ζ [i] (x) = d [i] (x) − ^ d [i]
It can be expressed as

後述する『［３］変移量推定誤差とフレームレートとの関係』で説明するように、変移量推定誤差をフレームレートＦの関数として、下記の式（６)(７）ように近似できる。 As will be described later in “[3] Relationship between shift amount estimation error and frame rate”, the shift amount estimation error can be approximated as a function of frame rate F as shown in the following equations (6) and (7).

ここで、κ₁，κ₁’およびκ₂は映像信号に依存する定数である。また、δは動き推定の精度である。例えば、ＭＰＥＧ−２準拠の符号化器の場合、δ＝１／２に設定される。 Here, κ ₁ , κ ₁ ′ and κ ₂ are constants depending on the video signal. Also, δ is the accuracy of motion estimation. For example, in the case of an MPEG-2 compliant encoder, δ = 1/2 is set.

全セグメント（Ｂ［ｉ］（ｉ＝１，２，... ，Ｘ／Ｌ））に対するσ［ｉ］²の和を算出する。このとき、Ｎ（ｘ）の各項の独立性を仮定することで、下記の式（８）に示すような近似式を得る。 The sum of σ [i] ² is calculated for all segments (B [i] (i = 1, 2,..., X / L)). At this time, by assuming the independence of each term of N (x), an approximate expression as shown in the following expression (8) is obtained.

ここで、〜Ｎ（ｘ）²は下記の式（９）に示す通りである。 Here, ~ N (x) ² is as shown in the following formula (9).

さらに、変移量推定誤差ζ［ｉ］（ｘ）と画素値の微分値とが統計的に独立だと仮定して、式（６）および式（７）を式（８）に代入すると、予測誤差電力について下記の式（１０）に示す近似式を得る。 Further, assuming that the displacement estimation error ζ [i] (x) and the differential value of the pixel value are statistically independent, substituting Equation (6) and Equation (7) into Equation (8) gives the prediction For the error power, an approximate expression shown in the following expression (10) is obtained.

ここで、モデルパラメータα_1,α_2,α_3,α_4,α₅は、それぞれ下記の式（１１）の通りである。 Here, the model parameters α _1, α _2, α _3, α _{4, and} α ₅ are respectively expressed by the following formula (11).

この式から分かるように、モデルパラメータα₁は、参照フレームにおける隣接画素間の差分値の二乗和から導出されることになる。また、モデルパラメータα₂は、フレーム間予測誤差に対するテイラー展開の近似誤差と参照フレームにおける隣接画素間差分値との積の和から導出されることになる。また、モデルパラメータα₃は、原信号に重畳している雑音ｎ_t（ｘ）と参照フレームにおける補間誤差Ｎ_t-1 ^ref（ｘ）との和として定義されるＮ（ｘ）の二乗値の近似値として導出されることになる。また、モデルパラメータα₄は、フレーム間予測誤差に対するテイラー展開の近似誤差の二乗和から導出されることになる。 As can be seen from this equation, the model parameter α ₁ is derived from the sum of squares of the difference values between adjacent pixels in the reference frame. Further, the model parameter α ₂ is derived from the sum of products of the approximation error of Taylor expansion with respect to the inter-frame prediction error and the difference value between adjacent pixels in the reference frame. The model parameter α ₃ is a square value of N (x) defined as the sum of the noise n _t (x) superimposed on the original signal and the interpolation error N _t-1 ^ref (x) in the reference frame. It will be derived as an approximate value. The model parameter α ₄ is derived from the sum of squares of approximation errors of Taylor expansion with respect to interframe prediction errors.

［２］パラメータの推定
次に、モデルパラメータα₃を定義する前述の式（９）の推定について説明する。前述の式（９）の各項は以下のように推定する。 [2] Parameter Estimation Next, estimation of the above-described equation (9) that defines the model parameter α ₃ will be described. Each term of the above equation (9) is estimated as follows.

前述の式（９）の第一項のｎ_t（ｘ）²は、カメラの熱雑音等に起因する雑音成分の電力である。 N _t (x) ² in the first term of the above equation (9) is the power of the noise component due to the thermal noise of the camera.

この推定には、まず、フレーム内の静止領域を同定する。静止領域の同定方法については外部から与えられる。例えば、フレームをブロックに分割し、ブロック毎に動き推定を行い動きベクトルを求め、その動きベクトルがゼロベクトルとなった領域を静止領域とする。静止領域と同定されたブロックに対して、フレーム間差分を求める。そのフレーム間差分信号の二乗和を求め、その二乗和の画素平均値ｎ_t（ｘ）²を推定値とする。 For this estimation, first, a still region in a frame is identified. The method for identifying the static region is given from the outside. For example, a frame is divided into blocks, motion estimation is performed for each block to obtain a motion vector, and a region where the motion vector becomes a zero vector is set as a still region. An inter-frame difference is obtained for a block identified as a still region. The square sum of the inter-frame difference signal is obtained, and the pixel average value n _t (x) ² of the square sum is used as the estimated value.

前述の式（９）の第二項のＩ（ｎ_t-1（ｘ＋＾ｄ［ｉ］））²は、補間フィルタ通過後の雑音成分の電力である。 I (n _t-1 (x + ^ d [i])) ² of the second term of the above-described equation (9) is the power of the noise component after passing through the interpolation filter.

雑音成分ｎ_t（ｘ）²が無相関であると仮定し、ｊδ（ｊ＝１，... ，δ^-1−１）画素位置での補間フィルタ通過後の雑音電力を下記の式（１２）のように近似する。 Assuming that the noise component n _t (x) ² is uncorrelated, the noise power after passing through the interpolation filter at jδ (j = 1,..., Δ ⁻¹ −1) pixel position is expressed by the following equation (12): ).

さらに、ｊδ（ｊ＝１，... ，δ^-1−１）画素位置での補間フィルタ処理が、ほぼ同程度の頻度で行われたと仮定すると、Ｉ（ｎ_t-1（ｘ＋＾ｄ［ｉ］））²は下記の式（１３）のように近似できる。 Further, assuming that the interpolation filter processing at the jδ (j = 1,..., Δ ⁻¹ −1) pixel position is performed at substantially the same frequency, I ( _nt−1 (x + ^ d [ i])) ² can be approximated by the following equation (13).

前述の式（９）の第三項のｅ_t-1 ^int（ｘ＋＾ｄ［ｉ］）²は、小数画素位置の画素値の補間に伴う推定誤差である。 The third term e _t-1 ^int (x + ^ d [i]) ² in the above equation (9) is an estimation error accompanying interpolation of the pixel value at the decimal pixel position.

ｆ_t（ｘ）に対して、ダウンサンプリング処理を行い、解像度をδに縮小した画像信号ｈ_t（ｘ）を得る。ダウンサンプリング処理に用いるフィルタは外部から与えられるものとする。例えば、前述の補間フィルタの逆フィルタ、あるいは、sinc関数に基づく帯域制限フィルタ等が考えられる。このｈ_t（ｘ）に対して、前述の補間フィルタにより、解像度をδ^-1倍に拡大した信号ｆ_t（ｘ）’を得る。同補間フィルタに伴う補間誤差の推定値として、下記の式（１４）により近似する。 A downsampling process is performed on f _t (x) to obtain an image signal h _t (x) with a resolution reduced to δ. It is assumed that the filter used for the downsampling process is given from the outside. For example, an inverse filter of the above-described interpolation filter or a band limiting filter based on the sinc function can be considered. For this h _t (x), a signal f _t (x) ′ whose resolution is enlarged δ ⁻¹ times is obtained by the interpolation filter described above. The estimated value of the interpolation error associated with the interpolation filter is approximated by the following equation (14).

前述の式（９）の第四項のＩ（ｖ_t-1（ｘ＋＾ｄ［ｉ］））²ｑ²は、参照信号の量子化に伴う誤差である。 I (v _t-1 (x + ^ d [i])) ² q ² in the fourth term of the above equation (9) is an error accompanying quantization of the reference signal.

参照信号として原信号を用いた場合の予測誤差電力と参照信号として量子化・逆量子化により得られる復号信号を用いた場合の予測誤差電力との差分値（画素平均）を、ｖ_t-1（ｘ＋＾ｄ［ｉ］）²ｑ²の推定値“−ε_q”とする。ここで、“−ε_q”における記号“−”は“ε_q”の上に付く記号を示している。さらに、この推定値“−ε_q”を用いて、Ｉ（ｖ_t-1（ｘ＋＾ｄ［ｉ］））²ｑ²を下記の式（１５）のように近似する。 The difference value (pixel average) between the prediction error power when the original signal is used as the reference signal and the prediction error power when the decoded signal obtained by quantization / inverse quantization is used as the reference signal is represented by v _t−1. Let (x + ^ d [i]) ² q ^{2 be} an estimated value “−ε _q ”. Here, - symbols in "ε _q" "-" indicates the symbols attached to the top of the "epsilon _q". Further, using this estimated value “−ε _q ”, I (v _t−1 (x + ^ d [i])) ² q ² is approximated as in the following equation (15).

［３］変移量推定誤差とフレームレートとの関係
次に、前述の式（６)(７）が成立することについて説明する。 [3] Relationship between Transition Amount Estimation Error and Frame Rate Next, the fact that the above-described equations (6) and (7) are established will be described.

［３−１］式（６）の導出
連続値として表した場合の変移量＾ｄ^F［ｉ］を離散した場合の離散値を〜ｄ^F［ｉ］とする。このときの離散化誤差をε［ｉ］とすると、〜ｄ^F［ｉ］は下記の式（１６）のようにあらわせる。 [3-1] Derivation of Equation (6) A discrete value when the amount of change ^ d ^F [i] expressed as a continuous value is discrete is denoted by ˜d ^F [i]. Assuming that the discretization error at this time is ε [i], ˜d ^F [i] is expressed by the following equation (16).

ε［ｉ］は平均が零値、分散が動き推定精度の二乗値に比例する一様分布に従うものと仮定する。二乗誤差規範に基づくブロックマッチングでは、各ブロックに対する推定変移量は、ブロック中央位置の画素の変移量を用いて近似的にあらわすことができる。ｘ_c［ｉ］をセグメントＢ［ｉ］の中央位置の座標とすると、これにより、フレームレートＦ＝Ｆ₀の場合の推定変移量は下記の式（１７）のように近似できる。 It is assumed that ε [i] follows a uniform distribution in which the mean is zero and the variance is proportional to the square value of motion estimation accuracy. In block matching based on the square error criterion, the estimated shift amount for each block can be approximately expressed using the shift amount of the pixel at the block center position. Assuming that x _c [i] is the coordinates of the center position of the segment B [i], the estimated shift amount when the frame rate F = F ₀ can be approximated as in the following equation (17).

さらに、＾ｘにおける変移量と＾ｘ’における変移量との差分は平均がゼロ、分散が両座標位置の差分値に比例する正規分布としてモデル化できる。ここで、＾ｘおよび＾ｘ’は、セグメントＢ［ｉ］（ｉ＝１，２，... ，Ｘ／Ｌ）内での座標位置を示す。つまり、下記の式（１８）のような近似が可能となる。 Further, the difference between the shift amount at ^ x and the shift amount at ^ x 'can be modeled as a normal distribution in which the average is zero and the variance is proportional to the difference value between the two coordinate positions. Here, {circumflex over (x)} and {circumflex over (x)} ′ denote coordinate positions in the segment B [i] (i = 1, 2,..., X / L). That is, approximation as shown in the following equation (18) is possible.

ここで、Ｅ［・］は期待値演算を表し、＾ｃ_hは信号依存のパラメータである。同モデルは、全セグメントＢ［ｉ］（ｉ＝１，２，... ，Ｘ／Ｌ）における変移量の差分値（ｄ^F0［ｉ］（＾ｘ）−ｄ^F0［ｉ］（＾ｘ’））に対する集合平均の良好な近似値を与える。 Here, E [·] denotes the expectation operation, ^ c _h is a parameter of the signal-dependent. The model is the difference value (d ^F0 [i] (^ x) −d ^F0 [i] (^ x) of the transition amount in all segments B [i] (i = 1, 2,..., X / L). Give a good approximation of the set average for ')).

式（１７)(１８）により、下記の式（１９）を得る。 The following equation (19) is obtained from the equations (17) and (18).

ここで、ｃ_h ²＝（Ｘ／Ｌ）＾ｃ_h ²である。ξはセグメントＢ［ｉ］の中央位置を原点とする座標系での座標値であり、ξはｘ−ｘ₀［ｉ］と表すことができる。つまり、ξはセグメントＢ［ｉ］の中央位置からの距離である。 Here, c _h ² = (X / L) ^ c _h ² . ξ is a coordinate value in a coordinate system with the center position of the segment B [i] as the origin, and ξ can be expressed as xx ₀ [i]. That is, ξ is a distance from the center position of the segment B [i].

異なるフレームレートにおける２つの変移量の関係を導く。隣接フレーム間での物体の移動は、等速運動に従うものと仮定する。この場合、物体の変移量はフレーム間隔に比例する。つまり、変移量はフレームレートに反比例するといえる。これにより、下記の式（２０）の関係を得る。 Deriving the relationship between two shifts at different frame rates. It is assumed that the movement of the object between adjacent frames follows a uniform motion. In this case, the amount of displacement of the object is proportional to the frame interval. That is, it can be said that the shift amount is inversely proportional to the frame rate. Thereby, the relationship of the following formula | equation (20) is obtained.

式（１６）、式（２０）およびε［ｉ］がフレームレートに依存しないという仮定を用いて、下記の式（２１）を得る。 Using equation (16), equation (20) and the assumption that ε [i] does not depend on the frame rate, the following equation (21) is obtained.

式（２１）およびε［ｉ］が統計的に変移量と独立であるとする仮定を用いることで、下記の式（２２）を得る。 By using the assumption that the equation (21) and ε [i] are statistically independent of the shift amount, the following equation (22) is obtained.

ここで、ξ＝ｘ−ｘ_c［ｉ］、また、δは動き推定精度を表す。例えば、ＭＰＥＧ−２の符号化器の場合、δ＝１／２に設定され、ＭＰＥＧ−４ＡＳＰ(Advanced Simple Profile) の符号化器の場合、δ＝１／４に設定される。 Here, ξ = x−x _c [i], and δ represents the motion estimation accuracy. For example, in the case of an MPEG-2 encoder, δ is set to 1/2, and in the case of an MPEG-4 ASP (Advanced Simple Profile) encoder, δ is set to ¼.

式（２２）において、Ｆ₀ ²ｃ_h ²Σξ²をκ₁と表し、ＸＹ／１２をκ₁'と表すことにより、式（６）を得る。 In the formula (22), it represents a _{^{_{^{^{F 0 2 c h 2 Σξ 2}}}}} κ 1 and, by the XY / 12 expressed as kappa ₁ ', to obtain a formula (6).

［３−２］式（７）の導出
ε［ｉ］がゼロ平均の一様分布に従うと仮定すると、下記の式（２３）を得る。 [3-2] Derivation of Equation (7) Assuming that ε [i] follows a zero-average uniform distribution, the following Equation (23) is obtained.

ζ^F［ｉ］（ｘ）を、
ζ^F［ｉ］（ｘ）＝ｄ^F［ｉ］（ｘ）−＾ｄ^F［ｉ］
のように定義する変移量の推定誤差とする。 ζ ^F [i] (x)
ζ ^F [i] (x) = d ^F [i] (x) − ^ d ^F [i]
It is assumed that the deviation estimation error is defined as follows.

このとき、｛ΣΣζ^F［ｉ］（ｘ）｝²について、下記の式（２４）の通り表すことができる。 At this time, {ΣΣζ ^F [i] (x)} ² can be expressed as the following equation (24).

式（２４）の第一項は、式（２２）により近似することができる。また、第二項はシュワルツの不等式より下記の式（２５）の関係をみたす。 The first term of equation (24) can be approximated by equation (22). The second term satisfies the following equation (25) from the Schwarz inequality.

ここで、ξ＝ｘ−ｘ_c［ｉ］、ξ’＝ｘ’−ｘ_c［ｉ］である。この式（２５）の不等式により、下記の式（２６）の近似式を得る。 Here, ξ = x−x _c [i] and ξ ′ = x′−x _c [i]. The approximate expression of the following expression (26) is obtained from the inequality of the expression (25).

ここで、βは−１から１の範囲の値をとるパラメータである。式（２６）および式（２２）を式（２４）に代入することにより下記の式（２７）を得る。 Here, β is a parameter that takes a value in the range of −1 to 1. By substituting Equation (26) and Equation (22) into Equation (24), the following Equation (27) is obtained.

したがって、下記の式（２８）を得る。 Therefore, the following equation (28) is obtained.

ここで、γは１か−１である。式（２８）に記述される下記の式（２９）をκ₂とおくことにより、式（７）を得る。 Here, γ is 1 or -1. Formula (7) is obtained by setting the following formula (29) described in formula (28) as κ ₂ .

［４］動き推定精度の最適化
歪み量を一定（量子化ステップ幅を固定）とした場合の符号量を評価関数として考える。以下では、量子化ステップ幅が与えられた場合に、その評価関数を最小化する動き推定精度を算出する方法について詳述する。 [4] Optimization of motion estimation accuracy Consider the code amount when the distortion amount is constant (quantization step width is fixed) as an evaluation function. Hereinafter, a method of calculating motion estimation accuracy that minimizes the evaluation function when a quantization step width is given will be described in detail.

量子化ステップ幅ｑ、フレームレートＦ、動き推定精度δが与えられた場合、フレーム間予測誤差の符号量は、
Ｒ_c（ｑ，Ｆ，δ）＝（１／２）ｌｏｇ₂γ_c（Ｐ（Ｆ，δ）／ｑ²）
・・・式（３０）
のようにモデル化できる。 When the quantization step width q, the frame rate F, and the motion estimation accuracy δ are given, the code amount of the inter-frame prediction error is
R _c (q, F, δ) = (1/2) log ₂ γ _c (P (F, δ) / q ² )
... Formula (30)
Can be modeled as follows.

ここで、γ_cは信号依存の定数パラメータである。また、Ｐ（Ｆ，δ）は、
Ｐ（Ｆ，δ）＝Σσ［ｉ］² 但し、Σはｉ＝１〜Ｘ／Ｌについての総和
であり、フレーム内の予測誤差電力和を示す。 Here, γ _c is a signal-dependent constant parameter. P (F, δ) is
P (F, δ) = Σσ [i] ² where Σ is the sum for i = 1 to X / L and represents the prediction error power sum in the frame.

一方、フレームレートＦ、動き推定精度δが与えられた場合、変移量の符号量は、
Ｒ_mv（Ｆ，δ）＝（Ｘ／Ｌ）ｌｏｇ₂（２Ｕ（Ｆ）／δ）・・・式（３１）
のようにモデル化できる。 On the other hand, when the frame rate F and the motion estimation accuracy δ are given, the code amount of the shift amount is
R _mv (F, δ) = (X / L) log ₂ (2U (F) / δ) (31)
Can be modeled as follows.

ここで、Ｕ（Ｆ）はフレームレートがＦの場合の動き推定の探索範囲を表しており、変移量＾ｄは−Ｕ（Ｆ）≦＾ｄ≦Ｕ（Ｆ）−１の範囲をとりうるものとする。したがって、２Ｕ（Ｆ）／δは探索範囲内における推定変移量の候補数を表す。 Here, U (F) represents a motion estimation search range when the frame rate is F, and the shift amount ^ d can take a range of -U (F) ≤ ^ d≤U (F) -1. Shall. Therefore, 2U (F) / δ represents the number of candidates for the estimated shift amount within the search range.

変移量はフレーム間隔に比例して減少するため、符号量削減の観点から、Ｕ（Ｆ）はＦに反比例するように、
Ｕ（Ｆ）＝Ｕ₀Ｆ₀／Ｆ
というように設定する。ここで、Ｕ₀はフレームレートがＦ₀の場合の動き推定の範囲を表すパラメータである。 Since the shift amount decreases in proportion to the frame interval, U (F) is inversely proportional to F from the viewpoint of code amount reduction.
U (F) = U ₀ F ₀ / F
And so on. Here, U ₀ is a parameter representing the range of motion estimation when the frame rate is F ₀ .

このとき、予測誤差信号と変移量の符号量の総和は、
Ｒ（ｑ，Ｆ，δ）＝Ｒ_c（ｑ，Ｆ，δ）＋Ｒ_mv（Ｆ，δ）・・・式（３２）
となる。 At this time, the sum of the code amount of the prediction error signal and the shift amount is
R (q, F, δ) = R _c (q, F, δ) + R _mv (F, δ) (32)
It becomes.

この式（３２）が歪み量を一定（量子化ステップ幅ｑを固定）とした場合の符号量を与えている。 This equation (32) gives the code amount when the distortion amount is constant (quantization step width q is fixed).

このようにフレームレートＦ、量子化ステップ幅ｑが与えられた場合に、Ｒ（ｑ，Ｆ，δ）を最小化する動き推定精度δ^*（Ｆ）を算出する。求めるべきδ^*（Ｆ）は、
∂Ｒ（ｑ，Ｆ，δ）／∂δ
＝∂Ｒ_c（ｑ，Ｆ，δ）／∂δ＋∂Ｒ_mv（Ｆ，δ）／∂δ＝０
・・・式（３３）
という式を満たす。 In this way, when the frame rate F and the quantization step width q are given, the motion estimation accuracy δ ^* (F) that minimizes R (q, F, δ) is calculated. Δ ^* (F) to be obtained is
∂R (q, F, δ) / ∂δ
= ∂R _c (q, F, δ) / ∂δ + ∂R _mv (F, δ) / ∂δ = 0
... Formula (33)
The following expression is satisfied.

この式（３３）を満たすδ^*（Ｆ）は、下記の式（３４）のように表せる。 Δ ^* (F) satisfying the expression (33) can be expressed as the following expression (34).

ここで、この式（３４）の値は、下記の式（３５）および式（３６）の関係を満足する。 Here, the value of the equation (34) satisfies the relationship of the following equations (35) and (36).

式（３５）は、フレームレートの増加と共に動き推定精度を小さくする必要があることを示している。さらに、式（３６）は、フレームレートが極限まで大きくなると、動き推定精度の減少は収束することを示している。なお、ここでは、モデルパラメータα₄はＦ^-1の４次以上の項からなることを考慮している。 Equation (35) indicates that the motion estimation accuracy needs to be reduced as the frame rate increases. Furthermore, equation (36) indicates that the decrease in motion estimation accuracy converges when the frame rate increases to the limit. Here, it is considered that the model parameter α ₄ includes a fourth-order or higher term of F ⁻¹ .

モデルパラメータα₃における〜Ｎ（ｘ）は、推定変移量＾ｄが小数画素精度の場合、参照に必要な補間位置の画素値を生成することに伴う誤差、撮像系の熱雑音等に起因する成分からなる誤差項である。フレームレートが極限まで大きくなり、フレーム間隔が極限まで小さくなると、フレーム間予測誤差内において、その誤差項に起因する成分が支配的になる。こうした状況では、動き推定精度を高めても、それに見合う予測誤差の低減は得られない。このことを式（３６）は示しており、映像信号の物理的な性質に符合する。 ~ N (x) in the model parameter α ₃ is caused by an error accompanying generation of a pixel value at an interpolation position required for reference, thermal noise of the imaging system, etc., when the estimated shift amount ^ d is decimal pixel accuracy. It is an error term consisting of components. When the frame rate increases to the limit and the frame interval decreases to the limit, the component due to the error term becomes dominant in the inter-frame prediction error. In such a situation, even if the motion estimation accuracy is increased, a reduction in prediction error corresponding to the accuracy cannot be obtained. This is shown in equation (36), which matches the physical nature of the video signal.

このようにして、本発明によれば、フレームレートＦが与えられた場合に、式（３４）に従って、歪み量を一定（量子化ステップ幅を固定）とした場合の符号量を最小化する動き推定精度δ^*（Ｆ）を算出することができるようになる。 In this way, according to the present invention, when the frame rate F is given, the movement of minimizing the code amount when the distortion amount is constant (the quantization step width is fixed) according to the equation (34). The estimated accuracy δ ^* (F) can be calculated.

次に、本発明の実施例について説明する。 Next, examples of the present invention will be described.

図１に、本発明を具備する映像符号化装置１の装置構成を図示する。 FIG. 1 shows a device configuration of a video encoding device 1 having the present invention.

この図に示すように、本発明の映像符号化装置１は、符号化対象となる映像を格納する映像ファイル１０と、非整数画素精度の動き推定を行うフレーム間予測を使って、映像ファイル１０に格納される映像を符号化する符号化部１１と、符号化部１１がフレーム間予測を行う上で必要となる動き推定精度を算出して符号化部１１に与える動き推定精度算出部１２と、符号化部１１の符号化した符号化データを格納する符号化データファイル１３とを備える。 As shown in this figure, the video encoding device 1 of the present invention uses a video file 10 that stores video to be encoded and inter-frame prediction that performs motion estimation with non-integer pixel accuracy, and uses the video file 10. An encoding unit 11 that encodes the video stored in the video, a motion estimation accuracy calculation unit 12 that calculates the motion estimation accuracy necessary for the encoding unit 11 to perform inter-frame prediction, and provides the motion estimation accuracy to the encoding unit 11; And an encoded data file 13 for storing the encoded data encoded by the encoding unit 11.

図２および図３に、動き推定精度算出部１２の実行するフローチャートを図示する。次に、このフローチャートに従って、本発明の実行する処理について詳細に説明する。 2 and 3 are flowcharts executed by the motion estimation accuracy calculation unit 12. Next, processing executed by the present invention will be described in detail according to this flowchart.

動き推定精度算出部１２は、図２のフローチャートに示すように、まず最初に、ステップＳ１０１で、符号化対象フレームを読み込み、続くステップＳ１０２で、参照フレームを読み込む。 As shown in the flowchart of FIG. 2, the motion estimation accuracy calculation unit 12 first reads the encoding target frame in step S101, and then reads the reference frame in step S102.

続いて、ステップＳ１０３で、参照フレームにおける隣接画素間の差分値を算出する。すなわち、式（１１）のα₁の定義式に記述される「ｄ／ｄｘ（ｆ_t-1(ｘ））」を算出するのである。 Subsequently, in step S103, a difference value between adjacent pixels in the reference frame is calculated. That is, “d / dx (f _t−1 (x))” described in the definition formula of α ₁ in Expression (11) is calculated.

続いて、ステップＳ１０４で、ステップＳ１０３で算出した隣接画素間差分値の二乗和を算出する。すなわち、式（１１）のα₁の定義式に記述される「ΣΣ（ｄ／ｄｘ（ｆ_t-1(ｘ）））²」を算出するのである。 Subsequently, in step S104, the sum of squares of the difference values between adjacent pixels calculated in step S103 is calculated. That is, “ΣΣ (d / dx (ft ₋₁ (x))) ² ” described in the definition formula of α ₁ in Expression (11) is calculated.

続いて、ステップＳ１０５で、フレーム間予測誤差に対するテイラー展開の近似誤差を算出する。すなわち、式（１１）のα_2,α₄の定義式に記述される「φ（ｘ）」を算出するのである。 Subsequently, in step S105, an approximation error of Taylor expansion with respect to the inter-frame prediction error is calculated. That is, “φ (x)” described in the definition formula of α _{2 and} α _{4 in} the equation (11) is calculated.

続いて、ステップＳ１０６で、ステップＳ１０５で算出したテイラー展開の近似誤差とステップＳ１０３で算出した隣接画素間差分値との積の和を算出する。すなわち、式（１１）のα₂の定義式に記述される「ΣΣ（φ（ｘ）×ｄ／ｄｘ（ｆ_t-1(ｘ）））」を算出するのである。 Subsequently, in step S106, the sum of the products of the approximation error of Taylor expansion calculated in step S105 and the difference value between adjacent pixels calculated in step S103 is calculated. That is, “ΣΣ (φ (x) × d / dx (ft _-1 (x)))” described in the definition formula of α ₂ in Expression (11) is calculated.

続いて、ステップＳ１０７で、ステップＳ１０５で算出したテイラー展開の近似誤差の二乗和を算出することでモデルパラメータα₄を算出する。すなわち、式（１１）のα₄の定義式に記述される「ΣΣ（φ（ｘ）²）」を算出することで、モデルパラメータα₄を算出するのである。 Subsequently, at step S107, it calculates a model parameter alpha ₄ by calculating the square sum of the approximation error of the Taylor expansion calculated in step S105. That is, the model parameter α ₄ is calculated by calculating “ΣΣ (φ (x) ² )” described in the definition formula of α _{4 in} the equation (11).

続いて、ステップＳ１０８で、式（１１）のα₃の定義式に記述される「ΣΣ（〜Ｎ（ｘ）²）」を算出することでモデルパラメータα₃を算出する。このステップＳ１０８で実行するモデルパラメータα₃の算出処理の詳細については、図３のフローチャートで後述する。 Subsequently, at step S108, it calculates a model parameter alpha ₃ by calculating the "ΣΣ (~N (x) ^2)" described in the alpha ₃ defining equation of the formula (11). Details of the calculation process of the model parameter α ₃ executed in step S108 will be described later with reference to the flowchart of FIG.

一方、ステップＳ１０９で、フレームレートＦを読み込み、続くステップＳ１１０で、読み込んだフレームレートの逆数Ｆ^-1を算出し、続くステップＳ１１１で、算出したフレームレートの逆数の二乗値Ｆ^-2を算出する。 On the other hand, in step S109, the frame rate F is read. In the subsequent step S110, the reciprocal number F ⁻¹ of the read frame rate is calculated. In the subsequent step S111, the square value F ⁻² of the reciprocal number of the calculated frame rate is calculated. .

続いて、ステップＳ１１２で、フレーム内の画素数Ｘを読み込み、続くステップＳ１１３で、セグメント内の画素数Ｌを読み込む。すなわち、『［１］フレームレートとフレーム間予測誤差電力との関係』で説明したＸサンプルのＸの値とＬサンブルのＬの値とを読み込むのである。 Subsequently, in step S112, the number of pixels X in the frame is read, and in step S113, the number of pixels L in the segment is read. That is, the X value of the X sample and the L value of the L sample described in “[1] Relationship between frame rate and inter-frame prediction error power” are read.

続いて、ステップＳ１１４で、ステップＳ１１２で読み込んだ画素数ＸをステップＳ１１３で読み込んだ画素数Ｌで除算することで、セグメント数（Ｘ／Ｌ）を算出する。 Subsequently, in step S114, the number of segments (X / L) is calculated by dividing the number of pixels X read in step S112 by the number of pixels L read in step S113.

このようにして、ステップＳ１０１〜ステップＳ１１４の処理を終了すると、続いて、ステップＳ１１５で、ステップＳ１０４の算出値にＦ^-2と定数パラメータκ₁を乗じた値を算出することで、式（３４）に記述されるα₁Ｆ^-2を算出する。 When the processing of step S101 to step S114 is completed in this way, subsequently, in step S115, a value obtained by multiplying the calculated value of step S104 by F ⁻² and the constant parameter κ ₁ is calculated, thereby obtaining the equation (34 Α ₁ F ^-2 described in ( ₁ ) is calculated.

すなわち、ステップＳ１０４の算出値に定数パラメータκ₁を乗ずることで、式（１１）のα₁の定義式に従ってモデルパラメータα₁を算出して、それにＦ^-2を乗ずることで、式（３４）に記述されるα₁Ｆ^-2を算出するのである。 That is, by multiplying the calculated value in step S104 by the constant parameter κ ₁ , the model parameter α ₁ is calculated in accordance with the definition formula of α _{1 in} equation (11), and multiplied by F ⁻² to obtain equation (34). Α ₁ F ^-2 described in ( ₁₎ is calculated.

続いて、ステップＳ１１６で、ステップＳ１０６の算出値にＦ^-1と定数パラメータκ₂の２倍値（２×κ₂）を乗じた値を算出することで、式（３４）に記述されるα₂Ｆ^-1を算出する。 Subsequently, in step S116, by calculating a value obtained by multiplying the calculated value of step S106 by F ⁻¹ and the double value (2 × κ ₂ ) of the constant parameter κ ₂ , α described in the equation (34) is obtained. ₂ Calculate F- ¹ .

すなわち、ステップＳ１０６の算出値に定数パラメータκ₁の２倍値（２×κ₂）を乗ずることで、式（１１）のα₂の定義式に従ってモデルパラメータα₂を算出して、それにＦ^-1を乗ずることで、式（３４）に記述されるα₂Ｆ^-1を算出するのである。 That is, the model parameter α ₂ is calculated according to the definition formula of α ₂ in the equation (11) by multiplying the calculated value in step S106 by the double value (2 × κ ₂ ) of the constant parameter κ ₁ , and F ⁻ By multiplying by ¹ , α ₂ F ⁻¹ described in the equation (34) is calculated.

続いて、ステップＳ１１７で、ステップＳ１０７，Ｓ１０８，Ｓ１１５，Ｓ１１６の算出値の和を算出することで、式（３４）の分子の値を算出する。 Subsequently, in step S117, the numerator value of the equation (34) is calculated by calculating the sum of the calculated values in steps S107, S108, S115, and S116.

すなわち、ステップＳ１０７の算出値であるα₄と、ステップＳ１０８の算出値であるα₃と、ステップＳ１１５の算出値であるα₁Ｆ^-2と、ステップＳ１１６の算出値であるα₂Ｆ^-1との和を算出することで、式（３４）の分子の値
α₁Ｆ^-2＋α₂Ｆ^-1＋α₃＋α₄
を算出するのである。 That is, α ₄ that is the calculated value in step S107, α ₃ that is the calculated value in step S108, α ₁ F ⁻² that is the calculated value in step S115, and α ₂ F ⁻¹ that is the calculated value in step S116. By calculating the sum of the numerator value α ₁ F ⁻² + α ₂ F ⁻¹ + α ₃ + α _{4 in} equation (34).
Is calculated.

続いて、ステップＳ１１８で、ステップＳ１１４の算出値から１を減算した値にステップＳ１０４の算出値と定数パラメータκ_1,κ₁'を乗じた値を算出することで、式（３４）の分母の値を算出する。 Subsequently, in step S118, a value obtained by subtracting 1 from the calculated value in step S114 is multiplied by the calculated value in step S104 and the constant parameters κ _{1 and} κ ₁ ′, thereby calculating the denominator of the equation (34). Calculate the value.

すなわち、ステップＳ１１４の算出値から１を減算した値である（Ｘ／Ｌ−１）に、ステップＳ１０４の算出値である「ΣΣ（ｄ／ｄｘ（ｆ_t-1(ｘ））²」と定数パラメータκ₁との乗算値として定義されるモデルパラメータα₁を乗じて、さらにそれに定数パラメータκ₁'を乗ずることで、式（３４）の分母の値
（Ｘ／Ｌ−１）α₁κ₁'
を算出するのである。 That is, a value obtained by subtracting 1 from the calculated value in step S114 (X / L−1) is added to “ΣΣ (d / dx (ft _-1 (x)) ² ” that is the calculated value in step S104. By multiplying the model parameter α ₁ defined as the product of the parameter κ ₁ and further multiplying it by the constant parameter κ ₁ ′, the value of the denominator (X / L−1) α ₁ κ ₁ in the equation (34) '
Is calculated.

続いて、ステップＳ１１９で、ステップＳ１１７の算出値をステップＳ１１８の算出値で除算した値を算出することで、歪み量を一定（量子化ステップ幅を固定）とした場合の符号量を最小化する動き推定精度δ^*（Ｆ）を算出する。 Subsequently, in step S119, by calculating a value obtained by dividing the calculated value in step S117 by the calculated value in step S118, the code amount when the distortion amount is constant (quantization step width is fixed) is minimized. The motion estimation accuracy δ ^* (F) is calculated.

すなわち、式（３４）の定義に従い、ステップＳ１１７の算出値であるα₁Ｆ^-2＋α₂Ｆ^-1＋α₃＋α₄を、ステップＳ１１８の算出値である（Ｘ／Ｌ−１）α₁κ₁'で除算することで、歪み量を一定（量子化ステップ幅を固定）とした場合の符号量を最小化する動き推定精度δ^*（Ｆ）を算出するのである。 That is, in accordance with the definition of Expression (34), α ₁ F ⁻² + α ₂ F ⁻¹ + α ₃ + α ₄ that is the calculated value in step S117 is (X / L−1) α ₁ κ that is the calculated value in step S118. _By dividing by ₁ ′, the motion estimation accuracy δ ^* (F) that minimizes the code amount when the distortion amount is constant (quantization step width is fixed) is calculated.

このようにして、動き推定精度算出部１２は、フレームレートが与えられた場合に、式（３４）に従って、歪み量を一定（量子化ステップ幅を固定）とした場合の符号量を最小化する動き推定精度δ^*（Ｆ）を算出するのである。 In this way, the motion estimation accuracy calculation unit 12 minimizes the code amount when the distortion amount is constant (the quantization step width is fixed) according to the equation (34) when the frame rate is given. The motion estimation accuracy δ ^* (F) is calculated.

次に、図３のフローチャートに従って、ステップＳ１０８で実行するモデルパラメータα₃の算出処理の詳細について説明する。 Next, details of the calculation process of the model parameter α ₃ executed in step S108 will be described according to the flowchart of FIG.

ここで、モデルパラメータα₃は、式（１１）に記載するように、式（９）で定義される〜Ｎ（ｘ）²の総和として定義されるものである。 Here, the model parameter α ₃ is defined as a sum of ˜N (x) ² defined by the equation (9) as described in the equation (11).

動き推定精度算出部１２は、図２のフローチャートのステップＳ１０８の処理に入ると、図３のフローチャートに示すように、まず最初に、ステップＳ２０１で、符号化対象フレームを読み込み、続くステップＳ２０２で、参照フレームを読み込む。 When entering the process of step S108 in the flowchart of FIG. 2, the motion estimation accuracy calculation unit 12 first reads the encoding target frame in step S201 as shown in the flowchart of FIG. Read a reference frame.

続いて、ステップＳ２０３で、補間フィルタの伝達関数（前述の式（２）に記述される補間係数ψ_j［ｋ］）を算出する。 Subsequently, in step S203, a transfer function of the interpolation filter (interpolation coefficient ψ _j [k] described in the above equation (2)) is calculated.

続いて、ステップＳ２０４で、原信号に重畳する雑音電力（式（９）の右辺の第一項）を推定する。 Subsequently, in step S204, the noise power superimposed on the original signal (the first term on the right side of Equation (9)) is estimated.

続いて、ステップＳ２０５で、補間フィルタ通過後の原信号に重畳する雑音電力（式（９）の右辺の第二項）を推定する。 Subsequently, in step S205, noise power (second term on the right side of Equation (9)) to be superimposed on the original signal after passing through the interpolation filter is estimated.

続いて、ステップＳ２０６で、補間位置の画素値の推定誤差電力（式（９）の右辺の第三項）を推定する。 Subsequently, in step S206, the estimated error power of the pixel value at the interpolation position (the third term on the right side of Equation (9)) is estimated.

一方、ステップＳ２０７で、量子化パラメータを読み込み、続くステップＳ２０８で、読み込んだ量子化パラメータに対応する量子化ステップ幅を算出する。 On the other hand, in step S207, the quantization parameter is read, and in the subsequent step S208, the quantization step width corresponding to the read quantization parameter is calculated.

続いて、ステップＳ２０９で、補間フィルタ通過後の参照画像の量子化誤差電力（式（９）の右辺の第四項）を推定する。 Subsequently, in step S209, the quantization error power of the reference image after passing through the interpolation filter (the fourth term on the right side of Equation (9)) is estimated.

このようにして、ステップＳ２０１〜ステップＳ２０９の処理を終了すると、最後に、ステップＳ２１０で、ステップＳ２０４，Ｓ２０５，Ｓ２０６，Ｓ２０９で求めた電力の和を算出することで、式（１１）および式（９）で定義されるモデルパラメータα₃を算出して、図２のフローチャートのステップＳ１０８の処理を終了する。 When the processing of step S201 to step S209 is completed in this way, finally, in step S210, the sum of the powers obtained in steps S204, S205, S206, and S209 is calculated, so that equations (11) and ( The model parameter α ₃ defined in 9) is calculated, and the process of step S108 in the flowchart of FIG.

図４に、以上に説明した処理を実行する動き推定精度算出部１２の装置構成を図示する。 FIG. 4 illustrates a device configuration of the motion estimation accuracy calculation unit 12 that executes the processing described above.

動き推定精度算出部１２は、この図に示すように、符号化対象フレーム記憶部１０１と、参照フレーム記憶部１０２と、画素間差分値算出部１０３と、画素間差分値記憶部１０４と、二乗和算出部１０５と、テイラー展開近似誤差算出部１０６と、テイラー展開近似誤差記憶部１０７と、積算値加算処理部１０８と、修正テイラー展開近似誤差記憶部１０９と、二乗和算出部１１０と、テイラー展開近似誤差二乗和記憶部１１１と、残差成分算出部１１２と、残差成分記憶部１１３と、フレームレート記憶部１１４と、逆数算出部１１５と、フレームレート逆数記憶部１１６と、二乗値算出部１１７と、フレームレート逆数二乗値記憶部１１８と、セグメント数算出部１１９と、セグメント数記憶部１２０と、減算処理部１２１と、セグメント数記憶部１２２と、パラメータ記憶部１２３と、パラメータ記憶部１２４と、積算処理部１２５と、画素間差分値二乗和記憶部１２６と、積算処理部１２７と、積算処理部１２８と、加算処理部１２９と、加算値記憶部１３０と、積算処理部１３１と、積算値記憶部１３２と、除算処理部１３３とを備える。 As shown in this figure, the motion estimation accuracy calculation unit 12 includes an encoding target frame storage unit 101, a reference frame storage unit 102, an inter-pixel difference value calculation unit 103, an inter-pixel difference value storage unit 104, and a square. Sum calculation unit 105, Taylor expansion approximation error calculation unit 106, Taylor expansion approximation error storage unit 107, integrated value addition processing unit 108, modified Taylor expansion approximation error storage unit 109, sum of squares calculation unit 110, Taylor Expanded approximation error sum of squares storage unit 111, residual component calculation unit 112, residual component storage unit 113, frame rate storage unit 114, reciprocal number calculation unit 115, frame rate reciprocal number storage unit 116, and square value calculation Unit 117, frame rate reciprocal square value storage unit 118, segment number calculation unit 119, segment number storage unit 120, subtraction processing unit 121, segment Number storage unit 122, parameter storage unit 123, parameter storage unit 124, integration processing unit 125, inter-pixel difference value square sum storage unit 126, integration processing unit 127, integration processing unit 128, and addition processing unit 129, an addition value storage unit 130, an integration processing unit 131, an integration value storage unit 132, and a division processing unit 133.

このように構成される動き推定精度算出部１２では、符号化対象フレーム記憶部１０１は、符号化対象フレームを記憶する。参照フレーム記憶部１０２は、参照フレームを記憶する。 In the motion estimation accuracy calculation unit 12 configured as described above, the encoding target frame storage unit 101 stores the encoding target frame. The reference frame storage unit 102 stores a reference frame.

画素間差分値算出部１０３は、図２のフローチャートのステップＳ１０３の処理を実行するものであって、参照フレームにおける隣接画素間の差分値を算出する。画素間差分値記憶部１０４は、画素間差分値算出部１０３の算出した隣接画素間差分値を記憶する。二乗和算出部１０５は、図２のフローチャートのステップＳ１０４の処理を実行するものであって、画素間差分値算出部１０３の算出した隣接画素間差分値の二乗和を算出する。 The inter-pixel difference value calculation unit 103 executes the process of step S103 in the flowchart of FIG. 2, and calculates a difference value between adjacent pixels in the reference frame. The inter-pixel difference value storage unit 104 stores the adjacent inter-pixel difference value calculated by the inter-pixel difference value calculation unit 103. The square sum calculation unit 105 executes the process of step S104 in the flowchart of FIG. 2 and calculates the square sum of the difference values between adjacent pixels calculated by the inter-pixel difference value calculation unit 103.

テイラー展開近似誤差算出部１０６は、図２のフローチャートのステップＳ１０５の処理を実行するものであって、フレーム間予測誤差に対するテイラー展開の近似誤差を算出する。テイラー展開近似誤差記憶部１０７は、テイラー展開近似誤差算出部１０６の算出したテイラー展開の近似誤差を記憶する。 The Taylor expansion approximation error calculation unit 106 executes the processing of step S105 in the flowchart of FIG. 2 and calculates an approximation error of Taylor expansion with respect to the inter-frame prediction error. The Taylor expansion approximation error storage unit 107 stores the Taylor expansion approximation error calculated by the Taylor expansion approximation error calculation unit 106.

積算値加算処理部１０８は、図２のフローチャートのステップＳ１０６の処理を実行するものであって、テイラー展開近似誤差算出部１０６の算出したテイラー展開の近似誤差と画素間差分値算出部１０３の算出した隣接画素間差分値との積の和を算出する。修正テイラー展開近似誤差記憶部１０９は、積算値加算処理部１０８の算出値（修正テイラー展開近似誤差と称している）を記憶する。 The integrated value addition processing unit 108 executes the process of step S106 in the flowchart of FIG. 2, and calculates the Taylor expansion approximation error calculated by the Taylor expansion approximation error calculation unit 106 and the inter-pixel difference value calculation unit 103. The sum of the product with the difference value between adjacent pixels is calculated. The corrected Taylor expansion approximation error storage unit 109 stores the calculated value of the integrated value addition processing unit 108 (referred to as a corrected Taylor expansion approximation error).

二乗和算出部１１０は、図２のフローチャートのステップＳ１０７の処理を実行するものであって、テイラー展開近似誤差算出部１０６の算出したテイラー展開近似誤差の二乗和を算出することでモデルパラメータα₄を算出する。テイラー展開近似誤差二乗和記憶部１１１は、二乗和算出部１１０の算出したモデルパラメータα₄を記憶する。 The square sum calculation unit 110 executes the process of step S107 in the flowchart of FIG. 2 and calculates the model parameter α ₄ by calculating the square sum of the Taylor expansion approximation error calculated by the Taylor expansion approximation error calculation unit 106. Is calculated. The Taylor expansion approximate error square sum storage unit 111 stores the model parameter α ₄ calculated by the square sum calculation unit 110.

残差成分算出部１１２は、図２のフローチャートのステップＳ１０８の処理を実行するものであって、図３のフローチャートの処理を実行することで、モデルパラメータα₃を算出する。残差成分記憶部１１３は、残差成分算出部１１２の算出したモデルパラメータα₃を記憶する。 Residual component calculating section 112 is for executing the process of step S108 in the flowchart of FIG. 2, by executing the processing in the flowchart of FIG. 3, model parameters are calculated alpha _3. The residual component storage unit 113 stores the model parameter α ₃ calculated by the residual component calculation unit 112.

フレームレート記憶部１１４は、フレームレートＦを記憶する。逆数算出部１１５は、図２のフローチャートのステップＳ１１０の処理を実行するものであって、フレームレートの逆数Ｆ^-1を算出する。フレームレート逆数記憶部１１６は、逆数算出部１１５の算出したフレームレートの逆数Ｆ^-1を記憶する。二乗値算出部１１７は、図２のフローチャートのステップＳ１１１の処理を実行するものであって、フレームレートの逆数の二乗値Ｆ^-2を算出する。フレームレート逆数二乗値記憶部１１８は、二乗値算出部１１７の算出したフレームレートの逆数の二乗値Ｆ^-2を記憶する。 The frame rate storage unit 114 stores the frame rate F. The reciprocal calculation unit 115 executes the process of step S110 in the flowchart of FIG. 2 and calculates a reciprocal F ^{−1 of the} frame rate. The frame rate reciprocal storage unit 116 stores the reciprocal F ⁻¹ of the frame rate calculated by the reciprocal calculation unit 115. The square value calculation unit 117 performs the process of step S111 in the flowchart of FIG. 2, and calculates a square value F ⁻² of the reciprocal of the frame rate. The frame rate reciprocal square value storage unit 118 stores the square value F ⁻² of the reciprocal of the frame rate calculated by the square value calculation unit 117.

セグメント数算出部１１９は、図２のフローチャートのステップＳ１１４の処理を実行するものであって、セグメント数Ｘ／Ｌを算出する。セグメント数記憶部１２０は、セグメント数算出部１１９の算出したセグメント数Ｘ／Ｌを記憶する。減算処理部１２１は、セグメント数算出部１１９の算出したセグメント数から１を減算した値（Ｘ／Ｌ−１）を算出する。セグメント数記憶部１２２は、減算処理部１２１の算出したセグメント数（Ｘ／Ｌ−１）を記憶する。 The segment number calculation unit 119 executes the process of step S114 in the flowchart of FIG. 2 and calculates the number of segments X / L. The segment number storage unit 120 stores the segment number X / L calculated by the segment number calculation unit 119. The subtraction processing unit 121 calculates a value (X / L−1) obtained by subtracting 1 from the number of segments calculated by the segment number calculation unit 119. The segment number storage unit 122 stores the number of segments (X / L−1) calculated by the subtraction processing unit 121.

パラメータ記憶部１２３は、定数パラメータκ_1,κ₁'を記憶する。パラメータ記憶部１２４は、定数パラメータκ₂を記憶する。 The parameter storage unit 123 stores constant parameters κ _1, κ ₁ ′. Parameter storage unit 124 stores a constant parameter kappa _2.

積算処理部１２５は、二乗和算出部１０５の算出した隣接画素間差分値の二乗和と定数パラメータκ₁とを乗算することで、式（１１）で定義されるモデルパラメータα₁を算出する。画素間差分値二乗和記憶部１２６は、積算処理部１２５の算出したモデルパラメータα₁を記憶する。 The integration processing unit 125 calculates the model parameter α ₁ defined by Expression (11) by multiplying the square sum of the difference value between adjacent pixels calculated by the square sum calculation unit 105 and the constant parameter κ ₁ . The inter-pixel difference value square sum storage unit 126 stores the model parameter α ₁ calculated by the integration processing unit 125.

積算処理部１２７は、図２のフローチャートのステップＳ１１５の処理に相当する処理を実行するものであって、積算処理部１２５の算出したモデルパラメータα₁と二乗値算出部１１７の算出したフレームレートの逆数の二乗値Ｆ^-2とを乗算することで、式（３４）に記述されるα₁Ｆ^-2を算出する。 The integration processing unit 127 executes processing corresponding to the processing in step S115 in the flowchart of FIG. 2, and the model parameter α ₁ calculated by the integration processing unit 125 and the frame rate calculated by the square value calculation unit 117 are calculated. Multiplying the reciprocal square value F ^-2 calculates α ₁ F ^-2 described in the equation (34).

積算処理部１２８は、図２のフローチャートのステップＳ１１６の処理を実行するものであって、積算値加算処理部１０８の算出値と定数パラメータκ₂の２倍値と逆数算出部１１５の算出したフレームレートの逆数Ｆ^-1とを乗算することで、式（３４）に記述されるα₂Ｆ^-1を算出する。 The integration processing unit 128 executes the process of step S116 in the flowchart of FIG. 2, and includes the calculated value of the integrated value addition processing unit 108, the double value of the constant parameter κ ₂ , and the frame calculated by the reciprocal calculation unit 115. Multiplying by the reciprocal rate F ⁻¹ , α ₂ F ⁻¹ described in the equation (34) is calculated.

加算処理部１２９は、図２のフローチャートのステップＳ１１７の処理を実行するものであって、積算処理部１２７の算出したα₁Ｆ^-2と、積算処理部１２８の算出したα₂Ｆ^-1と、二乗和算出部１１０の算出したモデルパラメータα₄と、残差成分算出部１１２の算出したモデルパラメータα₃とを加算することで、式（３４）の分子の値を算出する。加算値記憶部１３０は、加算処理部１２９の算出した値を記憶する。 The addition processing unit 129 executes the process of step S117 in the flowchart of FIG. ² and includes α ₁ F ⁻² calculated by the integration processing unit 127 and α ₂ F ⁻¹ calculated by the integration processing unit 128. Then, by adding the model parameter α ₄ calculated by the sum of squares calculation unit 110 and the model parameter α ₃ calculated by the residual component calculation unit 112, the numerator value of the equation (34) is calculated. The addition value storage unit 130 stores the value calculated by the addition processing unit 129.

積算処理部１３１は、図２のフローチャートのステップＳ１１８の処理を実行するものであって、積算処理部１２５の算出したモデルパラメータα₁と、定数パラメータκ₁'と、減算処理部１２１の算出したセグメント数（Ｘ／Ｌ−１）とを乗算することで、式（３４）の分母の値を算出する。積算値記憶部１３２は、積算処理部１３１の算出値を記憶する。 The integration processing unit 131 executes the process of step S118 in the flowchart of FIG. 2. The integration processing unit 131 calculates the model parameter α ₁ calculated by the integration processing unit 125, the constant parameter κ ₁ ′, and the subtraction processing unit 121. By multiplying the number of segments (X / L-1), the value of the denominator of Equation (34) is calculated. The integrated value storage unit 132 stores the calculated value of the integrated processing unit 131.

除算処理部１３３は、図２のフローチャートのステップＳ１１９の処理を実行するものであって、加算処理部１２９の算出値を積算処理部１３１の算出値で除算することで、式（３４）で定義される動き推定精度δ^*（Ｆ）を算出する。 The division processing unit 133 executes the process of step S119 in the flowchart of FIG. 2, and is defined by Expression (34) by dividing the calculated value of the addition processing unit 129 by the calculated value of the integration processing unit 131. The calculated motion estimation accuracy δ ^* (F) is calculated.

このようにして、動き推定精度算出部１２は、この図４に示す装置構成に従って、図２および図３のフローチャートを実行することで、フレームレートが与えられた場合に、式（３４）に従って、歪み量を一定（量子化ステップ幅を固定）とした場合の符号量を最小化する動き推定精度δ^*（Ｆ）を算出するのである。 In this way, the motion estimation accuracy calculation unit 12 executes the flowcharts of FIGS. 2 and 3 according to the apparatus configuration shown in FIG. 4, and when the frame rate is given, according to the equation (34), The motion estimation accuracy δ ^* (F) for minimizing the code amount when the distortion amount is constant (quantization step width is fixed) is calculated.

本発明は、非整数画素精度の動き推定を行うフレーム間予測を伴う映像符号化に適用できるものであり、本発明によれば、フレームレートが与えられた場合に、歪み量を一定とした場合の符号量を最小化する動き推定精度を推定することができるようになる。 The present invention can be applied to video coding with inter-frame prediction that performs motion estimation with non-integer pixel accuracy. According to the present invention, when the frame rate is given, the distortion amount is constant. It is possible to estimate the motion estimation accuracy that minimizes the code amount.

本発明の映像符号化装置の装置構成図である。It is an apparatus block diagram of the video coding apparatus of this invention. 動き推定精度算出部の実行するフローチャートである。It is a flowchart which a motion estimation precision calculation part performs. 動き推定精度算出部の実行するフローチャートである。It is a flowchart which a motion estimation precision calculation part performs. 動き推定精度算出部の装置構成図である。It is an apparatus block diagram of a motion estimation precision calculation part.

Explanation of symbols

１０１符号化対象フレーム記憶部
１０２参照フレーム記憶部
１０３画素間差分値算出部
１０４画素間差分値記憶部
１０５二乗和算出部
１０６テイラー展開近似誤差算出部
１０７テイラー展開近似誤差記憶部
１０８積算値加算処理部
１０９修正テイラー展開近似誤差記憶部
１１０二乗和算出部
１１１テイラー展開近似誤差二乗和記憶部
１１２残差成分算出部
１１３残差成分記憶部
１１４フレームレート記憶部
１１５逆数算出部
１１６フレームレート逆数記憶部
１１７二乗値算出部
１１８フレームレート逆数二乗値記憶部
１１９セグメント数算出部
１２０セグメント数記憶部
１２１減算処理部
１２２セグメント数記憶部
１２３パラメータ記憶部
１２４パラメータ記憶部
１２５積算処理部
１２６画素間差分値二乗和記憶部
１２７積算処理部
１２８積算処理部
１２９加算処理部
１３０加算値記憶部
１３１積算処理部
１３２積算値記憶部
１３３除算処理部 101 encoding target frame storage unit 102 reference frame storage unit 103 inter-pixel difference value calculation unit 104 inter-pixel difference value storage unit 105 square sum calculation unit 106 Taylor expansion approximation error calculation unit 107 Taylor expansion approximation error storage unit 108 integrated value addition processing 109 Modified Taylor expansion approximate error storage unit 110 Sum of squares calculation unit 111 Taylor expansion approximate error sum of squares storage unit 112 Residual component calculation unit 113 Residual component storage unit 114 Frame rate storage unit 115 Reciprocal number calculation unit 116 Frame rate reciprocal number storage unit 117 square value calculation unit 118 frame rate reciprocal square value storage unit 119 segment number calculation unit 120 segment number storage unit 121 subtraction processing unit 122 segment number storage unit 123 parameter storage unit 124 parameter storage unit 125 integration processing unit 126 inter-pixel difference value 2 Sum storage unit 127 integration processing unit 128 integration processing unit 129 adding unit 130 adds value storage unit 131 integration processing unit 132 integrated value storage unit 133 division processing unit

Claims

A motion estimation accuracy estimation method used in video encoding with inter-frame prediction,
The process of entering the frame rate;
Occurs when encoding with the input frame using a model that describes the relationship between the frame rate and the motion estimation accuracy that minimizes the amount of code that will occur when encoding at that frame rate. Estimating the motion estimation accuracy that minimizes the amount of code to be performed,
Outputting the estimated motion estimation accuracy,
A characteristic motion estimation accuracy estimation method.

It is used in video encoding with inter-frame prediction that performs motion estimation with non-integer pixel accuracy, and occurs when encoding video under encoding conditions with a constant amount of distortion. A motion estimation accuracy estimation method for estimating motion estimation accuracy that minimizes the amount of code to be
The process of entering the frame rate;
A process of inputting four model parameters described in a function used to estimate prediction error power of inter-frame prediction when encoding at a certain frame rate;
A frame used for estimation of motion estimation accuracy that minimizes the amount of code that will be generated by the encoding under the condition of encoding with a constant amount of distortion based on the four model parameters that are input. Determining a quadratic function with the inverse of the rate as a variable;
By substituting the value of the input frame rate for the determined quadratic function, motion estimation accuracy is estimated to minimize the amount of code that will be generated when encoding is performed on the input frame. The process of
Outputting the estimated motion estimation accuracy,
A characteristic motion estimation accuracy estimation method.

The motion estimation accuracy estimation method according to claim 2,
In the input process, (1) model parameters derived from the sum of squares of the difference values between adjacent pixels in the reference frame, and (2) an approximation error of Taylor expansion with respect to the inter-frame prediction error and the difference between adjacent pixels in the reference frame (3) a model parameter derived from an approximate value of a sum of squares of the sum of the noise superimposed on the original signal and the interpolation error in the reference frame; ) Inputting four model parameters, model parameters derived from the sum of squares of approximation errors of Taylor expansion for inter-frame prediction errors,
A characteristic motion estimation accuracy estimation method.

The motion estimation accuracy estimation method according to claim 2 or 3,
In the process of the determining, represents the frame rate at F, the four model parameters _{_{_{α 1, α 2, α 3}}} , expressed as alpha _4, it represents the number of segments as a unit of motion compensation M, the coefficients If expressed by κ, the quadratic function is
(Α ₁ F ⁻² + α ₂ F ⁻¹ + α ₃ + α ₄ ) / κα ₁ (M−1)
To determine the function
A characteristic motion estimation accuracy estimation method.

A motion estimation accuracy estimation device used in video encoding with inter-frame prediction,
An input means for inputting a frame rate;
Using a model describing the relationship between the frame rate and the motion estimation accuracy that minimizes the amount of code that will occur when encoding at that frame rate, encoding is performed with the frame input by the input means Estimating means for estimating motion estimation accuracy that minimizes the amount of code that will occur in the case;
Output means for outputting the motion estimation accuracy estimated by the estimation means,
A characteristic motion estimation accuracy estimation apparatus.

It is used in video encoding with inter-frame prediction that performs motion estimation with non-integer pixel accuracy, and occurs when encoding video under encoding conditions with a constant amount of distortion. A motion estimation accuracy estimation device that estimates motion estimation accuracy that minimizes the amount of code to be
A frame rate input means for inputting a frame rate;
Model parameter input means for inputting four model parameters described in a function used for estimating prediction error power of inter-frame prediction when encoding at a certain frame rate;
Based on the four model parameters input by the model parameter input means, the estimation of motion estimation accuracy that minimizes the amount of code that will occur in the encoding under the condition of the encoding with a constant distortion amount Determining means for determining a quadratic function using a reciprocal of the frame rate as a variable,
Occurs when encoding is performed with the frame input by the frame rate input means by substituting the value of the frame rate input by the frame rate input means for the quadratic function determined by the determining means. Estimating means for estimating the motion estimation accuracy that minimizes the amount of code to become,
Output means for outputting the motion estimation accuracy estimated by the estimation means,
A characteristic motion estimation accuracy estimation apparatus.

The motion estimation accuracy estimation apparatus according to claim 6,
The model parameter input means includes (1) a model parameter derived from a sum of squares of a difference value between adjacent pixels in a reference frame, (2) an approximation error of Taylor expansion with respect to an inter-frame prediction error, and an adjacent pixel in a reference frame. (3) model parameters derived from the sum of squares of the sum of the noise superimposed on the original signal and the interpolation error in the reference frame; 4) Input four model parameters, model parameters derived from the sum of squares of approximation errors of Taylor expansion with respect to inter-frame prediction errors,
A characteristic motion estimation accuracy estimation apparatus.

The motion estimation accuracy estimation apparatus according to claim 6 or 7,
The determination means represents the frame rate as F, the four model parameters as α _1, α _2, α _3, α ₄ , the number of segments as a unit of motion compensation as M, and the coefficient as κ. Is expressed as follows:
(Α ₁ F ⁻² + α ₂ F ⁻¹ + α ₃ + α ₄ ) / κα ₁ (M−1)
To determine the function
A characteristic motion estimation accuracy estimation apparatus.

A motion estimation accuracy estimation program for causing a computer to execute the motion estimation accuracy estimation method according to any one of claims 1 to 4.

A computer-readable recording medium having recorded thereon a motion estimation accuracy estimation program for causing a computer to execute the motion estimation accuracy estimation method according to claim 1.