JP2024022374A

JP2024022374A - Encoding device, decoding device, and program

Info

Publication number: JP2024022374A
Application number: JP2022125917A
Authority: JP
Inventors: 雄一近藤; 裕一日下部
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2022-08-05
Filing date: 2022-08-05
Publication date: 2024-02-16

Abstract

【課題】映像を所要ビットレートで符号化する際に、計算処理を軽量化しつつ、画質及び符号化効率を向上させる。【解決手段】符号化装置１は、映像フレームに割り当てられたビットレートと第１ＲＤ曲線のパラメータとを用いて、コスト関数を最小化する第１ラグランジュ乗数及び前記ブロックごとの割当ビットレートを算出するビット割当部１０１と、ブロックごとの割当ビットレートと第２ＲＤ曲線のパラメータとから算出される第２ラグランジュ乗数を用いて、ブロックごとの量子化パラメータを算出する量子化パラメータ算出部１０２と、第２ラグランジュ乗数に基づいて、コスト関数を最小化する符号化モードを決定するＲＤ最適化部１０３と、を備える。第１ＲＤ曲線は、対数関数又は指数関数を用いて近似される。【選択図】図２An object of the present invention is to improve image quality and encoding efficiency while reducing the weight of calculation processing when encoding video at a required bit rate. An encoding device 1 calculates a first Lagrangian multiplier that minimizes a cost function and an allocated bit rate for each block using a bit rate allocated to a video frame and parameters of a first RD curve. a bit allocation unit 101; a quantization parameter calculation unit 102 that calculates a quantization parameter for each block using a second Lagrange multiplier calculated from the allocated bit rate for each block and the parameters of the second RD curve; and an RD optimization unit 103 that determines an encoding mode that minimizes the cost function based on the Lagrangian multiplier. The first RD curve is approximated using a logarithmic function or an exponential function. [Selection diagram] Figure 2

Description

特許法第３０条第２項適用申請有り１．近藤雄一・杉藤泰子・市ヶ谷敦郎らが、２０２１年８月１２日付で、ＦＩＴ２０２１第２０回情報科学技術フォーラム講演論文集において公開。２．近藤雄一・杉藤泰子・市ヶ谷敦郎らが、２０２１年８月２７日付で、ＦＩＴ２０２１第２０回情報科学技術フォーラムにおいて公開。３．ｈｔｔｐｓ：／／ｗｗｗ．ｎｈｋ．ｏｒ．ｊｐ／ｓｔｒｌ／ｐｕｂｌｉｃａ／ｇｉｋｅｎ＿ｄａｙｏｒｉ／２０５／４．ｈｔｍｌ近藤雄一が、２０２２年４月１日付で、技研だより２０２２年４月号において公開。Application for application of Article 30, Paragraph 2 of the Patent Act 1. Published by Yuichi Kondo, Yasuko Sugito, Atsuro Ichigaya, and others in the FIT2021 20th Information Science and Technology Forum lecture proceedings on August 12, 2021. 2. Published by Yuichi Kondo, Yasuko Sugito, Atsuro Ichigaya, and others at the 20th Information Science and Technology Forum of FIT2021 on August 27, 2021. 3. https://www. nhk. or. jp/strl/publica/giken_dayori/205/4. html Published by Yuichi Kondo in the Giken Newsletter April 2022 issue on April 1, 2022.

本発明は、符号化装置、復号装置、及びプログラムに関する。 The present invention relates to an encoding device, a decoding device, and a program.

符号化制御とは、用途や目的に見合った映像符号化を実現するための制御技術である。符号化制御には、ビットレート制約条件下で符号化品質を最大化するような処理を行うレート制御や、遅延を最小限に抑える遅延制御などがある。特にレート制御は、高画質で安定的な放送・通信サービスを行うにあたって重要な役割を果たしている。 Encoding control is a control technology for realizing video encoding that suits the application and purpose. Encoding control includes rate control that performs processing to maximize encoding quality under bit rate constraint conditions and delay control that minimizes delay. In particular, rate control plays an important role in providing stable broadcasting and communication services with high image quality.

レート制御は、フレームに割り当てられた目標ビットレートＲ_ｔで符号化できるように、ビットレートをＧＯＰ（Group Of Picture）単位・フレーム単位・ブロック単位で割り当てるビット割当処理ＯＢＡ(Optimal Bit Allocation)と、割り当てたビットレートの範囲内で画質を最も高品質化する（すなわち、符号化歪みを最小化する）符号化モードの選択を行うＲＤ最適化処理ＲＤＯ(Rate Distortion Optimization)の２つの処理によって実現される。 Rate control includes a bit allocation process _OBA (Optimal Bit Allocation) that allocates a bit rate in units of GOP (Group Of Picture), in units of frames, and in units of blocks so that encoding can be performed at the target bit rate Rt allocated to the frame; This is achieved by two processes: RD optimization processing RDO (Rate Distortion Optimization), which selects the encoding mode that maximizes the image quality (that is, minimizes encoding distortion) within the allocated bit rate range. Ru.

ここでは、フレーム単位のビット割り当ては済んでおり、フレーム内の各ブロックにビット割り当てを行う場合を例に説明する。ビット割当処理では、消費したビットレートＲ_ｃを監視しながら、フレームに割り当てられた目標ビットレートＲ_ｔを超えないようにビットを割り当てる。図４は、斜線を付した７ブロックが符号化を終了し、残りの５ブロックが符号化前の状態を示している。最も簡単なビット割当処理手法の１つは、消費ビットレートＲ_ｃから利用可能なビットレートＲ_ａ（Ｒ_ａ＝Ｒ_ｔ－Ｒ_ｃ）を導出し、Ｒ_ａを残りのブロックへ一定の割合で割り当てる方法である。図４に示す例では、残りの５ブロックに対しする1ブロック当たりの割当ビットは、Ｒ_ａ／５となる。 Here, an example will be described in which bit allocation for each frame has been completed and bit allocation is performed for each block within the frame. In the bit allocation process, bits are allocated so as not to exceed the target bit rate _Rt allocated to the frame while monitoring the consumed bit rate _Rc . FIG. 4 shows that seven blocks marked with diagonal lines have been encoded, and the remaining five blocks are in a state before encoding. One of the simplest bit allocation processing techniques is to derive the available bit rate R _a (R _a = R _t − R _c ) from the consumed bit rate R _c and spread R _a to the remaining blocks at a constant rate. This is the method of allocating. In the example shown in FIG. 4, the bits allocated per block for the remaining five blocks are R _a /5.

ＲＤ最適化処理は、ビット割当処理によって割り当てられたビットレートで符号化する際に、符号化歪みが最小となる符号化モードを選択することが目的である。近年のＨＥＶＣ（High Efficiency Video Coding），ＶＶＣ（Versatile Video Coding）などの映像符号化規格の参照ソフトウェアＨＭ（HEVC Test Model），ＶＴＭ（VVC Test Model）では、符号量Ｒ、符号化歪みＤ、及びラグランジュ乗数λ_ＲＤＯから計算されるコスト関数Ｊ＝Ｄ＋λ_ＲＤＯＲを最小化するように符号化モードを選択することでこれを実現する（例えば、非特許文献１参照）。ただし、ラグランジュ乗数λ_ＲＤＯは、符号化映像のＲＤ曲線と割当ビットレートを用いて算出される値である。ＲＤ曲線とは、映像符号化を行った際のビットレートＲと符号化歪みＤの関係を近似する曲線であり、ＨＭ，ＶＴＭでは、式（１）に示す双曲線（hyperbolic）関数を用いている。 The purpose of the RD optimization process is to select an encoding mode that minimizes encoding distortion when encoding at a bit rate allocated by the bit allocation process. In recent years, reference software HM (HEVC Test Model) and VTM (VVC Test Model) for video coding standards such as HEVC (High Efficiency Video Coding) and VVC (Versatile Video Coding) have This is achieved by selecting the encoding mode so as to minimize the cost function J=D+λ _RDO R calculated from the Lagrangian multiplier λ _RDO (see, for example, Non-Patent Document 1). However, the Lagrangian multiplier λ _RDO is a value calculated using the RD curve of the encoded video and the allocated bit rate. The RD curve is a curve that approximates the relationship between the bit rate R and the encoding distortion D during video encoding, and in HM and VTM, the hyperbolic function shown in equation (1) is used. .

ここで、ｃ，ｋは映像に応じて異なるパラメータであり、符号化処理の過程で、符号化済みのブロックの消費ビットレートＲと符号化歪みＤを用いて求めることができる。コスト関数Ｊを最小化するとき、ＪをＲで偏微分した∂J／∂Ｒは０となるので、λ_ＲＤＯは、式（２）により求めることができる。 Here, c and k are parameters that differ depending on the video, and can be determined using the consumed bit rate R and encoding distortion D of the encoded block during the encoding process. When minimizing the cost function J, ∂J/∂R obtained by partially differentiating J with respect to R becomes 0, so λ _RDO can be determined by equation (2).

また、非特許文献２には、テイラー展開を用いた近似や、近似で発生する誤差の削減処理などを行うことでビット割当処理を実現することが開示されている。 Furthermore, Non-Patent Document 2 discloses that bit allocation processing is realized by performing approximation using Taylor expansion, processing to reduce errors generated in approximation, and the like.

大久保榮、「Ｈ２６５/ＨＥＶＣ教科書」、初版、インプレスジャパン、２０１３年１０月２１日発行Sakae Okubo, "H265/HEVC Textbook", first edition, Impress Japan, published October 21, 2013 Li et al., “Optimal Bit Allocation for CTU Level Rate Control in HEVC,” IEEE Trans. Circuits Syst. Video Technol., 2017Li et al., “Optimal Bit Allocation for CTU Level Rate Control in HEVC,” IEEE Trans. Circuits Syst. Video Technol., 2017

前述のビット割当処理は、ビットを各ブロックで均等に割り当てているが、ブロックごとに割り当てるビットを制御することで画面全体の画質を改善することが可能である。以下にその手法を述べる。 In the bit allocation process described above, bits are allocated equally to each block, but by controlling the bits allocated to each block, it is possible to improve the image quality of the entire screen. The method is described below.

ビット割当処理の目的は、画面全体の符号化歪みＤを最小化することである。ただし、すべてのブロックのビットレートの和は、そのフレームに割り当てられたビットレートＲ_ｔを超えてはいけない。これを数式で表現すると、式（３）となる。ここで、添え字ｉはブロック番号を表しておりｄ_ｉ，ｒ_ｉはそれぞれｉ番目のブロックの符号化歪みとビットレートを表す。また、Ｍは画面全体のブロック数を表す。 The purpose of the bit allocation process is to minimize the coding distortion D of the entire screen. However, the sum of the bit rates of all blocks must not exceed the bit rate _Rt assigned to that frame. If this is expressed numerically, it becomes equation (3). Here, the subscript i represents the block number, and d _i and r _i represent the encoding distortion and bit rate of the i-th block, respectively. Furthermore, M represents the number of blocks on the entire screen.

式（３）を満たすようなｒ_ｉを求めたい。式（３）はラグランジュ乗数λ_ＯＢＡを用いて、式（４）に書き換えられる。 We would like to find r _i that satisfies equation (3). Equation (3) can be rewritten into Equation (4) using the Lagrangian multiplier λ _OBA .

双曲線モデルのＲＤ曲線は、式（５）で表される。ｃ_ｉ，ｋ_ｉ（１≦ｉ≦Ｍ）は、符号化済みフレームのブロックから式（１），（２）を用いて算出される推定値である。 The RD curve of the hyperbolic model is expressed by equation (5). c _i and k _i (1≦i≦M) are estimated values calculated from the blocks of the encoded frame using equations (1) and (2).

コスト関数Ｊを最小化するときｒ_ｉによる偏微分は０なので、式（６）が成立する。 When minimizing the cost function J, the partial differential with respect to r _i is 0, so Equation (6) holds true.

であり、ｒ_ｉの総和がフレームに割り当てられたビットレートＲ_ｔとなることから、式（７）が成り立たなければならない。

Since the sum of r _i is the bit rate R _t assigned to the frame, equation (7) must hold.

式(７)を解いてλ_ＯＢＡが求まると、式（６）で求まるｒ_ｉによりi番目のブロックのビットレートを決定することができる。しかし、ｃ_ｉ，ｋ_ｉはブロックごとに値が異なるため、式（７）を解析的に解くことはできない。 When λ _OBA is found by solving Equation (7), the bit rate of the i-th block can be determined by r _i found from Equation (6). However, since c _i and k _i have different values for each block, equation (7) cannot be solved analytically.

また、非特許文献２では、式（７）を解くためにテイラー展開を用いた近似や、近似で発生する誤差の削減処理などを行うことでビット割当処理を実現しているが、処理が複雑化するという問題点があった。 In addition, in Non-Patent Document 2, bit allocation processing is realized by approximation using Taylor expansion to solve equation (7) and processing to reduce errors generated in the approximation, but the processing is complicated. There was a problem that it became

かかる事情に鑑みてなされた本発明の目的は、映像を所要ビットレートで符号化する際に、計算処理を軽量化しつつ、画質及び符号化効率を向上させることが可能な符号化装置、復号装置、及びプログラムを提供することにある。 In view of the above circumstances, an object of the present invention is to provide an encoding device and a decoding device that can improve image quality and encoding efficiency while reducing calculation processing when encoding video at a required bit rate. , and to provide programs.

上記課題を解決するため、一実施形態に係る符号化装置は、映像フレームを符号化対象領域に分割してブロックごとに符号化を行い、ビットレートと符号化歪みの関係を近似するＲＤ曲線を用いてＲＤ最適化処理を行う符号化装置であって、前記映像フレームに割り当てられたビットレートと第１ＲＤ曲線のパラメータとを用いて、コスト関数を最小化する第１ラグランジュ乗数及び前記ブロックごとの割当ビットレートを算出するビット割当部と、前記ブロックごとの割当ビットレートと第２ＲＤ曲線のパラメータとから算出される第２ラグランジュ乗数を用いて、前記ブロックごとの量子化パラメータを算出する量子化パラメータ算出部と、前記第２ラグランジュ乗数に基づいて、コスト関数を最小化する符号化モードを決定するＲＤ最適化部と、を備え、前記第１ＲＤ曲線は、対数関数又は指数関数を用いて近似される。 In order to solve the above problem, an encoding device according to an embodiment divides a video frame into encoding target regions, performs encoding for each block, and generates an RD curve that approximates the relationship between bit rate and encoding distortion. The encoding device performs RD optimization processing using a first Lagrangian multiplier that minimizes a cost function using a bit rate assigned to the video frame and a parameter of a first RD curve, and a bit allocation unit that calculates an allocated bit rate; and a quantization parameter that calculates a quantization parameter for each block using a second Lagrangian multiplier calculated from the allocated bit rate for each block and a parameter of a second RD curve. and an RD optimization unit that determines an encoding mode that minimizes the cost function based on the second Lagrangian multiplier, and the first RD curve is approximated using a logarithmic function or an exponential function. Ru.

さらに、一実施形態において、前記ビット割当部は、ブロック単位の符号化が完了するごとに、前記映像フレーム内の未符号化ブロック全体に割り当てられたビットレートと前記第１ＲＤ曲線のパラメータとを用いて、前記第１ラグランジュ乗数及び前記ブロックごとの割当ビットレートを算出してもよい。 Furthermore, in one embodiment, the bit allocation unit uses the bit rate allocated to the entire unencoded block in the video frame and the parameters of the first RD curve every time encoding of each block is completed. Then, the first Lagrangian multiplier and the allocated bit rate for each block may be calculated.

さらに、一実施形態において、前記ビット割当部は、符号化済みブロックのビットレートと想定していたビットレートとの差が閾値を超えた場合に、前記映像フレーム内の未符号化ブロック全体に割り当てられたビットレートと第１ＲＤ曲線のパラメータとを用いて、前記第１ラグランジュ乗数及び前記ブロックごとの割当ビットレートを算出してもよい
。 Furthermore, in one embodiment, the bit allocation unit allocates all unencoded blocks in the video frame when a difference between the bit rate of the encoded block and the expected bit rate exceeds a threshold. The first Lagrangian multiplier and the allocated bit rate for each block may be calculated using the determined bit rate and the parameters of the first RD curve.

また、上記課題を解決するため、一実施形態に係る復号装置は、符号化装置の前記量子化パラメータ算出部によって算出された前記量子化パラメータを取得し、該量子化パラメータを用いて、前記符号化装置により符号化されたデータを復号する。 Further, in order to solve the above problem, a decoding device according to an embodiment obtains the quantization parameter calculated by the quantization parameter calculation unit of the encoding device, and uses the quantization parameter to The data encoded by the encoding device is decoded.

また、一実施形態係るプログラムは、コンピュータを、上記符号化装置として機能させる。 Further, a program according to an embodiment causes a computer to function as the encoding device.

また、一実施形態係るプログラムは、コンピュータを、上記復号装置として機能させる。 Further, a program according to an embodiment causes a computer to function as the decoding device.

本発明によれば、映像を所要ビットレートで符号化する際に、計算処理を軽量化しつつ、画質及び符号化効率を向上させることが可能となる。 According to the present invention, when encoding a video at a required bit rate, it is possible to improve image quality and encoding efficiency while reducing computational processing.

一実施形態に係る符号化装置の構成例を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration example of an encoding device according to an embodiment. 一実施形態に係る符号化装置におけるレート制御部の構成例を示すブロック図である。FIG. 2 is a block diagram illustrating a configuration example of a rate control unit in an encoding device according to an embodiment. 一実施形態に係る復号装置の構成例を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration example of a decoding device according to an embodiment. 従来のブロックごとにビット割り当てを行う例を示す図である。FIG. 3 is a diagram illustrating an example of conventional bit allocation for each block.

以下、一実施形態について、図面を参照して詳細に説明する。 Hereinafter, one embodiment will be described in detail with reference to the drawings.

（符号化装置）
図１に、本発明の一実施形態に係る符号化装置の構成例を示す。図１に示す符号化装置１は、レート制御部１０と、ブロック分割部１１と、減算部１２と、変換部１３と、量子化部１４と、逆量子化部１５と、逆変換部１６と、加算部１７と、記憶部１８と、予測部１９と、エントロピー符号化部２０と、符号化データ記憶・出力部２１と、を備える。 (encoding device)
FIG. 1 shows a configuration example of an encoding device according to an embodiment of the present invention. The encoding device 1 shown in FIG. , an addition section 17, a storage section 18, a prediction section 19, an entropy encoding section 20, and an encoded data storage/output section 21.

符号化装置１は、映像フレームを符号化対象領域に分割して符号化ブロック（以下、単に「ブロック」という。）ごとに符号化を行い、ビットレートＲと符号化歪みＤの関係を近似するＲＤ曲線を用いてＲＤ最適化処理を行う。その際、一定のビットレート以下となるように、ブロックごとに量子化パラメータを決定して符号化を行う。 The encoding device 1 divides a video frame into encoding target regions, performs encoding for each encoding block (hereinafter simply referred to as "block"), and approximates the relationship between bit rate R and encoding distortion D. RD optimization processing is performed using the RD curve. At this time, quantization parameters are determined for each block and encoding is performed so that the bit rate is below a certain level.

ブロック分割部１１は、映像フレームをブロック単位の符号化対象領域へ分割したブロック画像を生成し、減算部１２及び予測部１９に出力する。 The block dividing unit 11 generates a block image by dividing the video frame into encoding target areas in units of blocks, and outputs the block image to the subtracting unit 12 and the predicting unit 19.

減算部１２は、ブロック分割部１１から入力したブロック画像の各画素値から、後述する予測部１９から入力した予測ブロック画像の各画素値を減算して、ブロック画像と予測ブロック画像との差を示す残差ブロック画像を生成し、変換部１３に出力する。 The subtraction unit 12 subtracts each pixel value of the predicted block image input from the prediction unit 19 (described later) from each pixel value of the block image input from the block division unit 11, and calculates the difference between the block image and the predicted block image. The residual block image shown in FIG.

変換部１３は、減算部１２から入力した残差ブロック画像に対して、直交変換などの変換処理を行って変換係数を算出し、量子化部１４に出力する。 The transformation unit 13 performs transformation processing such as orthogonal transformation on the residual block image input from the subtraction unit 12 to calculate transformation coefficients, and outputs them to the quantization unit 14 .

レート制御部１０は、残差ブロック画像の量子化を行う際の量子化パラメータを適切に決定することにより、レート制御を行う。レート制御部１０の処理の詳細については後述する。レート制御部１０は、決定した量子化パラメータを量子化部１４及びエントロピー
符号化部２０に出力する。 The rate control unit 10 performs rate control by appropriately determining a quantization parameter when quantizing a residual block image. Details of the processing by the rate control unit 10 will be described later. The rate control unit 10 outputs the determined quantization parameter to the quantization unit 14 and the entropy encoding unit 20.

量子化部１４は、変換部１３から入力した変換係数を、レート制御部１０から入力した量子化パラメータに対応する量子化ステップ（例えば、量子化パラメータと量子化ステップの対数が比例するように対応付けられる。）で除算して量子化することにより量子化係数を生成し、逆量子化部１５及びエントロピー符号化部２０に出力する。量子化部１４により、データ量の削減が行われる。 The quantization unit 14 converts the transformation coefficients input from the conversion unit 13 into quantization steps corresponding to the quantization parameters input from the rate control unit 10 (for example, the quantization parameters are arranged so that the logarithms of the quantization steps are proportional to each other). ) and quantization to generate a quantization coefficient, which is output to the inverse quantization section 15 and the entropy encoding section 20. The quantization unit 14 reduces the amount of data.

逆量子化部１５は、量子化部１４から入力した量子化係数に対して、量子化ステップを乗ずることにより変換係数を復元し、逆変換部１６に出力する。 The inverse quantization section 15 multiplies the quantization coefficients input from the quantization section 14 by the quantization step to restore transform coefficients, and outputs the restored transform coefficients to the inverse transform section 16 .

逆変換部１６は、逆量子化部１５から入力した変換係数に対して、逆変換処理（変換部１３で行った変換を元に戻す処理）を行って残差ブロック画像を復元し、加算部１７に出力する。例えば、変換部１３が離散コサイン変換を行った場合には、逆変換部１６は逆離散コサイン変換を行う。 The inverse transformer 16 performs an inverse transform process (processing to undo the transformation performed by the transformer 13) on the transform coefficients input from the inverse quantizer 15 to restore a residual block image, and the adder 16 restores the residual block image. Output to 17. For example, when the transform unit 13 performs a discrete cosine transform, the inverse transform unit 16 performs an inverse discrete cosine transform.

加算部１７は、逆変換部１６から入力した残差ブロック画像と、予測部１９から入力した予測画像とを加算し、符号化画像として記憶部１８に出力する。 The addition unit 17 adds the residual block image input from the inverse transformation unit 16 and the predicted image input from the prediction unit 19, and outputs the result to the storage unit 18 as an encoded image.

逆量子化部１５、逆変換部１６、及び加算部１７により、符号化画像生成部（局所復号部）を構成する。すなわち、符号化画像生成部は、量子化係数に対して量子化ステップを乗じて変換係数を復元し、該変換係数に対して逆変換処理を行って残差ブロック画像を復元し、該残差ブロック画像とイントラ予測画像又は動き補償予測画像とを加算して符号化画像を生成する。 The inverse quantization section 15, the inverse transformation section 16, and the addition section 17 constitute a coded image generation section (local decoding section). That is, the encoded image generation unit restores the transform coefficient by multiplying the quantization coefficient by the quantization step, performs inverse transform processing on the transform coefficient to restore the residual block image, and restores the residual block image by multiplying the quantization coefficient by the quantization step. A coded image is generated by adding the block image and the intra-predicted image or the motion-compensated predicted image.

符号化装置１は、加算部１７が出力する符号化画像に対してデブロッキングフィルタによるフィルタ処理などの後処理を行ってから、記憶部１８に出力してもよい。 The encoding device 1 may perform post-processing such as filter processing using a deblocking filter on the encoded image output by the addition unit 17 and then output the encoded image to the storage unit 18 .

記憶部１８は、加算部１７から入力した符号化画像を記憶する。 The storage unit 18 stores the encoded image input from the addition unit 17.

予測部１９は、イントラ予測（画面内予測）、又はインター予測（画面間予測、動き補償予測）を行う。イントラ予測では、記憶部１８に記憶された符号化画像に対して、イントラ予測モードに従ってイントラ予測したイントラ予測画像を生成する。インター予測では、記憶部１８に記憶された符号化画像に対して、動きベクトルに従って動き補償予測した動き補償予測画像を生成する。予測部１９は、イントラ予測画像と動き補償予測画像とを切替えて予測ブロック画像とし、減算部１２及び加算部１７に出力する。予測部１９は、予測処理に用いられた予測パラメータ（イントラ予測モード及び動きベクトル情報）をエントロピー符号化部２０に出力する。 The prediction unit 19 performs intra prediction (intra-screen prediction) or inter prediction (inter-screen prediction, motion compensation prediction). In intra prediction, an intra prediction image is generated by performing intra prediction on the encoded image stored in the storage unit 18 according to an intra prediction mode. In inter prediction, a motion-compensated predicted image is generated by performing motion-compensated prediction on the encoded image stored in the storage unit 18 according to a motion vector. The prediction unit 19 switches between the intra-predicted image and the motion-compensated predicted image to form a predicted block image, and outputs it to the subtraction unit 12 and addition unit 17 . The prediction unit 19 outputs prediction parameters (intra prediction mode and motion vector information) used in the prediction process to the entropy encoding unit 20.

エントロピー符号化部２０は、量子化部１４から入力した量子化係数、レート制御部１０から入力した量子化パラメータ、及び予測部１９から入力した予測パラメータに対してエントロピー符号化を行い、データ圧縮を行って符号化データを生成し、符号化データ記憶・出力部２１に出力する。エントロピー符号化は、０次指数ゴロム符号やコンテキスト適応型２値算術符号（ＣＡＢＡＣ：Context-based Adaptive Binary Arithmetic Coding）など、任意のエントロピー符号化方式を用いることができる。 The entropy encoding unit 20 performs entropy encoding on the quantization coefficient input from the quantization unit 14, the quantization parameter input from the rate control unit 10, and the prediction parameter input from the prediction unit 19, and performs data compression. to generate encoded data and output it to the encoded data storage/output section 21. For entropy encoding, any entropy encoding method such as zero-order exponential Golomb code or context-based adaptive binary arithmetic coding (CABAC) can be used.

符号化データ記憶・出力部２１は、レート制御部１０により決定された最適符号化モードを用いて符号化された符号化データを、符号化装置１の外部に出力する。 The encoded data storage/output unit 21 outputs encoded data encoded using the optimal encoding mode determined by the rate control unit 10 to the outside of the encoding device 1.

（レート制御部）
次に、レート制御部１０の処理について説明する。図２に、レート制御部１０の構成例を示す。図２に示すレート制御部１０は、ビット割当部１０１と、量子化パラメータ算出部１０２と、ＲＤ最適化部１０３と、を備える。以下、ラグランジュ乗数λ_ＯＢＡを第１ラグランジュ乗数と称し、ラグランジュ乗数λ_ＲＤＯを第２ラグランジュ乗数と称する。また、ビット割当部１０１で使用するＲＤ曲線を第１ＲＤ曲線と称し、量子化パラメータ算出部１０２で使用するＲＤ曲線を第２ＲＤ曲線と称する。 (rate control section)
Next, the processing of the rate control unit 10 will be explained. FIG. 2 shows an example of the configuration of the rate control section 10. The rate control section 10 shown in FIG. 2 includes a bit allocation section 101, a quantization parameter calculation section 102, and an RD optimization section 103. Hereinafter, the Lagrange multiplier λ _OBA will be referred to as a first Lagrange multiplier, and the Lagrange multiplier λ _RDO will be referred to as a second Lagrange multiplier. Further, the RD curve used by the bit allocation section 101 is referred to as a first RD curve, and the RD curve used by the quantization parameter calculation section 102 is referred to as a second RD curve.

ビット割当部１０１は、フレームに割り当てられたビットＲ_ｔを超えないように、フレームを構成するブロックにビットを割り当てる処理を行う。ｉ番目のブロックに割り当てられたビットレート（以下、「ブロックレート」という。）をｒ_ｉ、ｉ番目のブロックの符号化歪み（以下、「ブロック歪み」という。）をｄ_ｉとし、フレーム内のブロック数をＭとする。フレーム全体の画質を最も高品質化（符号化歪みを最小化）するとき、第２ラグランジュ乗数λ_ＲＤＯを用いて、上述した式（４）を満たすようにコスト関数Ｊを最小化する予測モードを選べばよい。 The bit allocation unit 101 performs a process of allocating bits to blocks constituting a frame so as not to exceed the bits _Rt allocated to the frame. Let r i be the bit rate assigned to the i-th block (hereinafter referred to as "block rate"), and let _{d i} _be the encoding distortion of the i-th block (hereinafter referred to as "block distortion"), and Let M be the number of blocks. To maximize the image quality of the entire frame (minimize coding distortion), use the second Lagrangian multiplier λ _RDO to select a prediction mode that minimizes the cost function J so as to satisfy the above equation (4). All you have to do is choose.

上述したように、従来の式（５）で表される双曲線モデルのＲＤ曲線を用いた場合、ブロックレートｒ_ｉを解析的に解くことができない、又は処理が複雑化する。そこで、本発明では対数関数又は指数関数を利用する。第１ＲＤ曲線をブロックレートｒ_ｉの対数関数又は指数関数を用いて近似することにより、式（４）を簡単に解くことができ、ブロックレートｒ_ｉを解析的に解くことが可能となる。 As described above, when the conventional RD curve of the hyperbolic model expressed by equation (5) is used, the block rate r _i cannot be solved analytically, or the processing becomes complicated. Therefore, in the present invention, a logarithmic function or an exponential function is used. By approximating the first RD curve using a logarithmic function or an exponential function of the block rate r _i , equation (4) can be easily solved, and the block rate r _i can be solved analytically.

第１ＲＤ曲線を対数関数を用いて近似する場合には、ブロック歪みｄ_ｉとブロックレートｒ_ｉの関係は式（８）で表される。 When the first RD curve is approximated using a logarithmic function, the relationship between block distortion d _i and block rate r _i is expressed by equation (8).

第１ＲＤ曲線を指数関数を用いて近似する場合には、ブロック歪みｄ_ｉとブロックレートｒ_ｉの関係は式（９）で表される。 When approximating the first RD curve using an exponential function, the relationship between block distortion d _i and block rate r _i is expressed by equation (9).

以下では、第１ＲＤ曲線を対数関数を用いて近似する場合について説明する。ｃ’_ｉ，ｋ’_ｉ（１≦ｉ≦Ｍ）は、符号化済みフレームのブロックから式（１０），（１１）を用いて算出される。 Below, a case will be described in which the first RD curve is approximated using a logarithmic function. c′ _i and k′ _i (1≦i≦M) are calculated from the blocks of the encoded frame using equations (10) and (11).

コスト関数Ｊを最小化するときブロックレートｒ_ｉによる偏微分は０になるので、式（１２）が成立する。 When minimizing the cost function J, the partial differential with respect to the block rate r _i becomes 0, so Equation (12) holds true.

なので、式（１３）が成立する。すると、第１ラグランジュ乗数λ_ＯＢＡは式(１４)により簡単に求めることができて、ブロックレートｒ_ｉは式(１５)により求まる。

Therefore, equation (13) holds true. Then, the first Lagrangian multiplier λ _OBA can be easily obtained using equation (14), and the block rate r _i can be obtained using equation (15).

このように、ビット割当部１０１は、映像フレームに割り当てられたビットレートＲ_ｔと、符号化済みブロックから算出された第１ＲＤ曲線のパラメータとを用いて、コスト関数Ｊを最小化する第１ラグランジュ乗数λ_ＯＢＡ（式（１４）参照）、及びブロックレートｒ_ｉ（式（１５）参照）を算出する。そして、算出したブロックレートｒ_ｉを量子化パラメータ算出部１０２に出力する。ビット割当部１０１は、従来の双曲線モデルではなく、対数モデル又は指数モデルを用いることにより、符号化済みの結果から得られたc’_iを用いて、式(１５)から簡単にブロックレートｒ_ｉを決定することができ、解析的に解くことが可能となる。 In this way, the bit allocation unit 101 uses the bit rate _Rt allocated to the video frame and the parameters of the first RD curve calculated from the encoded block to calculate the first Lagrangian that minimizes the cost function J. The multiplier λ _OBA (see equation (14)) and the block rate r _i (see equation (15)) are calculated. Then, the calculated block rate r _i is output to the quantization parameter calculation unit 102. By using a logarithmic model or an exponential model instead of a conventional hyperbolic model, the bit allocation unit 101 easily calculates the block rate _r i from equation (15) using c′ _i obtained from the encoded result. can be determined and solved analytically.

量子化パラメータ算出部１０２は、第２ＲＤ曲線として従来のＲＤ曲線を使用する場合には、式（６）にブロックレートｒ_ｉを代入することで第２ラグランジュ乗数λ_ＲＤＯを算出する。また、第２ＲＤ曲線として第１ＲＤ曲線と同じＲＤ曲線を使用する場合には、式（１２）にブロックレートｒ_ｉを代入することで第２ラグランジュ乗数λ_ＲＤＯを算出する。 When using the conventional RD curve as the second RD curve, the quantization parameter calculation unit 102 calculates the second Lagrangian multiplier λ _RDO by substituting the block rate r _i into equation (6). Furthermore, when using the same RD curve as the first RD curve as the second RD curve, the second Lagrangian multiplier λ _RDO is calculated by substituting the block rate r _i into equation (12).

そして、量子化パラメータ算出部１０２は、式（１６）に示すように、第２ラグランジュ乗数λ_ＲＤＯに基づいて、ブロックごとに残差信号の量子化を行う際の量子化パラメータＱＰ_ｉを算出する。そして、量子化パラメータ算出部１０２は、算出した量子化パラメータＱＰ_ｉを符号化装置１の量子化部１４に出力する。また、量子化パラメータ算出部１０２は、第２ラグランジュ乗数λ_ＲＤＯ及び量子化パラメータＱＰ_ｉをＲＤ最適化部１０３に出力する。 Then, the quantization parameter calculation unit 102 calculates the quantization parameter QP _i when quantizing the residual signal for each block, based on the second Lagrangian multiplier λ _RDO , as shown in equation (16). . Then, the quantization parameter calculation unit 102 outputs the calculated quantization parameter QP _i to the quantization unit 14 of the encoding device 1. Further, the quantization parameter calculation unit 102 outputs the second Lagrangian multiplier λ _RDO and the quantization parameter QP _i to the RD optimization unit 103.

ＲＤ最適化部１０３は、量子化パラメータ算出部１０２から入力した第２ラグランジュ乗数λ_ＲＤＯ及び量子化パラメータＱＰ_ｉを用いて、コスト関数Ｊを最小化する最適符号化モードを決定する。最適符号化モードは、符号化ツール及びパラメータ（イントラ予測
のＤＣ予測モードなど）の組み合わせである。そして、ＲＤ最適化部１０３は、決定した最適符号化モードを符号化装置１の符号化データ記憶・出力部２１に出力する。なお、ＲＤ最適化で使用するｒ_ｉは、ビット割当部１０１で算出したブロックレートｒ_ｉとは異なるものである。ＲＤ最適化では、あらゆるモードで符号化を実行し、モードごとに得られるｄ_ｉとｒ_ｉの組み合わせをコスト関数Ｊ＝ｄ_ｉ＋λ_ＲＤＯ・ｒ_ｉに代入して、Ｊが最小となるモードを選択する。 The RD optimization unit 103 uses the second Lagrangian multiplier λ _RDO and the quantization parameter QP _i input from the quantization parameter calculation unit 102 to determine the optimal encoding mode that minimizes the cost function J. The optimal encoding mode is a combination of encoding tools and parameters (such as DC prediction mode for intra prediction). Then, the RD optimization section 103 outputs the determined optimal encoding mode to the encoded data storage/output section 21 of the encoding device 1. Note that r _i used in RD optimization is different from the block rate r _i calculated by bit allocation section 101. In RD optimization, encoding is performed in all modes, and the combination of d _i and r _i obtained for each mode is assigned to the cost function J=d _i +λ _RDO・r _i to find the mode in which J is the minimum. select.

（ビット割当処理の第１の変形例）
ビット割当部１０１と量子化パラメータ算出部１０２によって、ブロックレートｒ_ｉと量子化パラメータＱＰ_ｉは一意に定まるが、実際に符号化した時に想定どおりのビットレートになるとは限らない。そこで、フレームに割り当てられた実際のビットレートＲ_ｔから大きく外れるようなことがないように、ブロック単位で割当ビット量の修正を行ってもよい。つまり、第１の変形例では、ビット割当部１０１は、ブロック単位の符号化が完了するごとに、フレーム内の未符号化ブロック全体に割り当てられたビットレートＲ_ｊと第１ＲＤ曲線のパラメータとを用いて、第１ラグランジュ乗数λ_ＯＢＡ及びブロックレートｒ_ｉを算出する。 (First modification of bit allocation processing)
Although the block rate r _i and the quantization parameter QP _i are uniquely determined by the bit allocation section 101 and the quantization parameter calculation section 102, the expected bit rate is not necessarily obtained when actually encoding. Therefore, the amount of allocated bits may be modified on a block-by-block basis so that the actual bit rate _Rt allocated to the frame does not deviate significantly. In other words, in the first modification, the bit allocation unit 101 calculates the bit rate R _j allocated to all unencoded blocks in a frame and the parameters of the first RD curve every time encoding of each block is completed. is used to calculate the first Lagrangian multiplier λ _OBA and the block rate r _i .

具体的には、ｉ番目のブロックで消費したビットをｂ_ｊとすると、ｊ番目までのブロックに対する符号化が終わった時、フレーム内の未符号化ブロック全体に割り当てられたビットレートＲ_ｊは式（１７）で表される。そこで、ビット割当部１０１は、ブロックレートｒ_ｉ（ｊ＜ｉ≦Ｍ）を式（１８）により修正する。 Specifically, if the bits consumed in the i-th block are _bj , then when the encoding of blocks up to the j-th block is completed, the bit rate _Rj assigned to all unencoded blocks in the frame is given by the formula It is expressed as (17). Therefore, the bit allocation unit 101 corrects the block rate r _i (j<i≦M) using equation (18).

（ビット割当処理の第２の変形例）
第１の変形例によりブロック単位でブロックレートｒ_ｉを制御すると、フレーム全体の画質が劣化する場合がある。そこで、フレーム全体の画質を考慮し、必要に応じて第１の変形例による修正を行うようにしてもよい。つまり、第２の変形例では、ビット割当部１０１は、符号化済みブロックのビットレートＲ_{ａｃｔｕａｌ}と想定していたビットレートＲ_{ｉｄｅａｌ}との差が閾値を超えた場合に、フレーム内の未符号化ブロック全体に割り当てられたビットレートＲ_ｊと第１ＲＤ曲線のパラメータとを用いて、第１ラグランジュ乗数λ_ＯＢＡ及びブロックレートｒ_ｉを算出する。 (Second modification example of bit allocation processing)
If the block rate r _i is controlled on a block-by-block basis according to the first modification, the image quality of the entire frame may deteriorate. Therefore, the image quality of the entire frame may be taken into consideration, and corrections may be made according to the first modified example as necessary. In other words, in the second modification, when the difference between the bit rate R _actual of the encoded block and the assumed bit rate R _ideal exceeds the threshold, the bit allocation unit 101 assigns A first Lagrangian multiplier λ _OBA and a block rate r _i are calculated using the bit rate R _j assigned to the entire block and the parameters of the first RD curve.

具体的には、ｊ番目までに実際に消費したビットレートＲ_{ａｃｔｕａｌ}が想定していたビットレートＲ_{ｉｄｅａｌ}からどれだけ乖離したかを基準にして制御を行う。ビット割当部１０１は、式（１９）より、ｊ番目までに消費されるビットレートの想定値Ｒ_{ｉｄｅａｌ}を算出し、式（２０）に示すように、乖離度合いＳを(Ｒ_{ａｃｔｕａｌ}-Ｒ_{ｉｄｅａｌ})/Ｒ_{ｉｄｅａｌ}の絶対値よって算出する。ビット割当部１０１は、ブロック符号化ごとに乖離度合いＳと閾値ｓとを比較し、Ｓ＞ｓとなるブロックでのみ式（１８）で示したブロックレートｒ_ｉに修正することで、余分な制御は行わないようにすることができる。閾値ｓは任意に設定することができ、例えばs＝０．１とする。 Specifically, control is performed based on how far the bit rate R _actual actually consumed up to the j-th time deviates from the expected bit rate R _ideal . The bit allocation unit 101 calculates the expected value R _ideal of the bit rate consumed up to the jth bit rate from Equation (19), and calculates the degree of deviation S by (R _actual - R _ideal ) as shown in Equation (20). Calculated by the absolute value of /R _ideal . The bit allocation unit 101 compares the deviation degree S and the threshold value s for each block encoding, and corrects the block rate r _i shown in equation (18) only for blocks where S>s, thereby eliminating unnecessary control. You can prevent it from happening. The threshold value s can be set arbitrarily, for example, s=0.1.

（復号装置）
次に、本発明の一実施形態に係る復号装置について説明する。図３に、本発明の一実施形態に係る復号装置の構成例を示す。図３に示す復号装置２は、エントロピー復号部３１と、逆量子化部３２と、逆変換部３３と、加算部３４と、記憶部３５と、予測部３６と、を備える。 (Decoding device)
Next, a decoding device according to an embodiment of the present invention will be described. FIG. 3 shows a configuration example of a decoding device according to an embodiment of the present invention. The decoding device 2 shown in FIG. 3 includes an entropy decoding section 31, an inverse quantization section 32, an inverse transformation section 33, an addition section 34, a storage section 35, and a prediction section 36.

復号装置２は、符号化装置１から、量子化パラメータ算出部１０２によって算出された量子化パラメータＱＰ_ｉを取得し、該量子化パラメータＱＰ_ｉを用いて、符号化装置１により符号化された符号化データを復号する。 The decoding device 2 acquires the quantization parameter QP _i calculated by the quantization parameter calculation unit 102 from the encoding device 1, and uses the quantization parameter QP _i to generate the code encoded by the encoding device 1. decrypt the encoded data.

エントロピー復号部３１は、符号化装置１が出力する符号化データを復号し、量子化パラメータＱＰ_ｉ、量子化係数、及び予測パラメータ（イントラ予測モード及び動きベクトル情報）を取得する。そして、エントロピー復号部３１は、量子化パラメータＱＰ_ｉ及び量子化係数を逆量子化部３２に出力し、予測パラメータを予測部３６に出力する。 The entropy decoding unit 31 decodes the encoded data output by the encoding device 1 and obtains the quantization parameter QP _i , the quantization coefficient, and the prediction parameter (intra prediction mode and motion vector information). Then, the entropy decoding unit 31 outputs the quantization parameter QP _i and the quantization coefficient to the inverse quantization unit 32, and outputs the prediction parameter to the prediction unit 36.

逆量子化部３２は、エントロピー復号部３１から量子化係数及び量子化パラメータＱＰ_ｉを入力し、量子化係数に量子化パラメータＱＰ_ｉから導出される量子化ステップを乗算してブロックごとの直交変換係数を復元し、逆変換部３３に出力する。 The inverse quantization unit 32 inputs the quantization coefficient and the quantization parameter QP _i from the entropy decoding unit 31, multiplies the quantization coefficient by the quantization step derived from the quantization parameter QP _i , and performs orthogonal transformation for each block. The coefficients are restored and output to the inverse transform section 33.

逆変換部３３は、逆量子化部３２から入力した直交変換係数に対して逆変換を行って残差画像を生成し、加算部３４に出力する。 The inverse transformer 33 performs inverse transform on the orthogonal transform coefficients input from the inverse quantizer 32 to generate a residual image, and outputs it to the adder 34 .

加算部３４は、逆変換部３３から入力した残差画像と、予測部３６から入力した予測画像の各画素値を加算して復号画像を生成し、記憶部３５及び復号装置２の外部に出力する。 The addition unit 34 adds each pixel value of the residual image input from the inverse transformation unit 33 and the predicted image input from the prediction unit 36 to generate a decoded image, and outputs the decoded image to the storage unit 35 and the outside of the decoding device 2. do.

復号装置２は、符号化装置１と同様に、加算部３４が出力する復号画像に対してデブロッキングフィルタによるフィルタ処理などの後処理を行ってから、記憶部３５に出力してもよい。 Similar to the encoding device 1, the decoding device 2 may perform post-processing such as filter processing using a deblocking filter on the decoded image output by the addition section 34, and then output the image to the storage section 35.

記憶部３５は、加算部３４から入力した復号画像を記憶する。 The storage unit 35 stores the decoded image input from the addition unit 34.

予測部３６は、イントラ予測（画面内予測）、又はインター予測（画面間予測、動き補償予測）を行う。イントラ予測では、記憶部３５に記憶された復号画像に対して、エントロピー復号部３１から入力したイントラ予測モードに従ってイントラ予測したイントラ予測画像を生成する。インター予測では、記憶部３５に記憶された復号画像に対して、エントロピー復号部３１から入力した動きベクトル情報に従って動き補償予測した動き補償予測画像を生成する。予測部３６は、イントラ予測画像と動き補償予測画像とを切替えて予測ブロック画像とし、加算部３４に出力する。 The prediction unit 36 performs intra prediction (intra-screen prediction) or inter prediction (inter-screen prediction, motion compensation prediction). In the intra prediction, an intra prediction image is generated by performing intra prediction on the decoded image stored in the storage unit 35 according to the intra prediction mode input from the entropy decoding unit 31. In inter prediction, a motion-compensated predicted image is generated by performing motion-compensated prediction on the decoded image stored in the storage unit 35 according to motion vector information input from the entropy decoding unit 31. The prediction unit 36 switches between the intra predicted image and the motion compensated predicted image to form a predicted block image, and outputs the predicted block image to the addition unit 34 .

このように、本発明は、符号化・復号方式をＨＥＶＣ，ＶＶＣなどの従来の方式から変更することなく、計算処理を軽量化しつつ、画質及び符号化効率を向上させる映像を所要ビットレートで符号化する際の画質及び符号化効率を向上させることが可能となる。 As described above, the present invention can encode video at a required bit rate, reducing computational processing and improving image quality and encoding efficiency, without changing the encoding/decoding method from conventional methods such as HEVC and VVC. This makes it possible to improve image quality and encoding efficiency when converting images.

（プログラム）
上述した符号化装置１及び復号装置２として機能させるために、それぞれプログラム命令を実行可能なコンピュータを用いることも可能である。ここで、コンピュータは、汎用コンピュータ、専用コンピュータ、ワークステーション、ＰＣ（Personal Computer）、電子ノートパッドなどであってもよい。プログラム命令は、必要なタスクを実行するためのプログラムコード、コードセグメントなどであってもよい。 (program)
In order to function as the above-mentioned encoding device 1 and decoding device 2, it is also possible to use a computer that can execute program instructions, respectively. Here, the computer may be a general-purpose computer, a dedicated computer, a workstation, a PC (Personal Computer), an electronic notepad, or the like. Program instructions may be program code, code segments, etc. to perform necessary tasks.

コンピュータは、プロセッサと、記憶部と、入力部と、出力部と、通信インターフェースとを備える。プロセッサは、ＣＰＵ(Central Processing Unit)、ＭＰＵ（Micro Processing Unit）、ＧＰＵ（Graphics Processing Unit）、ＤＳＰ（Digital Signal Processor）、ＳｏＣ（System on a Chip）などであり、同種又は異種の複数のプロセッサにより構成されてもよい。プロセッサは、記憶部からプログラムを読み出して実行することで、上記各構成の制御及び各種の演算処理を行う。なお、これらの処理内容の少なくとも一部をハードウェアで実現することとしてもよい。入力部は、ユーザの入力操作を受け付けてユーザの操作に基づく情報を取得する入力インターフェースであり、ポインティングデバイス、キーボード、マウスなどである。出力部は、情報を出力する出力インターフェースであり、ディスプレイ、スピーカなどである。通信インターフェースは、外部の装置と通信するためのインターフェースであり、例えばＬＡＮ（Local Area Network）インターフェースである。 The computer includes a processor, a storage section, an input section, an output section, and a communication interface. Processors include CPUs (Central Processing Units), MPUs (Micro Processing Units), GPUs (Graphics Processing Units), DSPs (Digital Signal Processors), and SoCs (System on a Chip). may be configured. The processor controls each of the above components and performs various calculation processes by reading and executing programs from the storage unit. Note that at least a part of these processing contents may be realized by hardware. The input unit is an input interface that receives a user's input operation and obtains information based on the user's operation, and is a pointing device, keyboard, mouse, or the like. The output unit is an output interface that outputs information, such as a display or a speaker. The communication interface is an interface for communicating with an external device, and is, for example, a LAN (Local Area Network) interface.

プログラムは、コンピュータが読み取り可能な記録媒体に記録されていてもよい。このような記録媒体を用いれば、プログラムをコンピュータにインストールすることが可能である。ここで、プログラムが記録された記録媒体は、非一過性（non-transitory）の記録媒体であってもよい。非一過性の記録媒体は、特に限定されるものではないが、例えば、ＣＤ－ＲＯＭ、ＤＶＤ－ＲＯＭ、ＵＳＢ（Universal Serial Bus）メモリなどであってもよい。また、このプログラムは、ネットワークを介して外部装置からダウンロードされる形態としてもよい。 The program may be recorded on a computer readable recording medium. Using such a recording medium, it is possible to install a program on a computer. Here, the recording medium on which the program is recorded may be a non-transitory recording medium. The non-transitory recording medium is not particularly limited, and may be, for example, a CD-ROM, a DVD-ROM, a USB (Universal Serial Bus) memory, or the like. Further, this program may be downloaded from an external device via a network.

例えば、コンピュータを上記の符号化装置１として機能させるためのプログラムは、映像フレームに割り当てられたビットレートと第１ＲＤ曲線のパラメータとを用いて、コスト関数を最小化する第１ラグランジュ乗数及びブロックレートｒ_ｉを算出するステップと、ブロックレートｒ_ｉと第２ＲＤ曲線のパラメータとから算出される第２ラグランジュ乗数を用いて、コスト関数を最小化する符号化モードを決定するステップと、第２ラグランジュ乗数に基づいて、ブロックごとの量子化パラメータＱＰ_ｉを算出するステップと、をコンピュータに実行させ、第１ＲＤ曲線は、対数関数又は指数関数を用いて近似される。 For example, a program for making a computer function as the above-mentioned encoding device 1 includes a first Lagrangian multiplier and a block rate that minimize the cost function using the bit rate assigned to the video frame and the parameters of the first RD curve. calculating r _i ; determining a coding mode that minimizes the cost function using a second Lagrange multiplier calculated from the block rate r _i and the parameters of the second RD curve; and determining a second Lagrange multiplier. The first RD curve is approximated using a _logarithmic function or an exponential function.

また、コンピュータを上記の復号装置２として機能させるためのプログラムは、符号化装置１から、量子化パラメータ算出部１０２によって算出された量子化パラメータＱＰ_ｉを取得するステップと、該量子化パラメータＱＰ_ｉを用いて、符号化装置１により符号化されたデータを復号するステップと、をコンピュータに実行させる。 Further, the program for causing a computer to function as the decoding device 2 described above includes the steps of acquiring the quantization parameter QP i calculated by the quantization parameter calculation unit 102 from the encoding device 1, and the step of acquiring the quantization parameter QP _i calculated by the quantization parameter calculation unit ₁₀₂ The computer is caused to perform the step of decoding the data encoded by the encoding device 1 using the encoder.

上述の実施形態は代表的な例として説明したが、本発明の趣旨及び範囲内で、多くの変更及び置換ができることは当業者に明らかである。したがって、本発明は、上述の実施形態によって制限するものと解するべきではなく、特許請求の範囲から逸脱することなく、種々の変形又は変更が可能である。例えば、実施形態の構成図に記載の複数の構成ブロックを統合したり、１つの構成ブロックを分割したりすることが可能である。 Although the embodiments described above have been described as representative examples, it will be apparent to those skilled in the art that many modifications and substitutions can be made within the spirit and scope of the invention. Therefore, the present invention should not be construed as being limited to the above-described embodiments, and various modifications and changes can be made without departing from the scope of the claims. For example, it is possible to integrate a plurality of configuration blocks described in the configuration diagram of the embodiment, or to divide one configuration block.

１符号化装置
２復号装置
１０レート制御部
１１ブロック分割部
１２減算部
１３変換部
１４量子化部
１５逆量子化部
１６逆変換部
１７加算部
１８記憶部
１９予測部
２０エントロピー符号化部
２１符号化データ記憶・出力部
３１エントロピー復号部
３２逆量子化部
３３逆変換部
３４加算部
３５記憶部
３６予測部
１０１ビット割当部
１０２量子化パラメータ算出部
１０３ＲＤ最適化部 1 Encoding device 2 Decoding device 10 Rate control unit 11 Block division unit 12 Subtraction unit 13 Transformation unit 14 Quantization unit 15 Inverse quantization unit 16 Inverse transformation unit 17 Addition unit 18 Storage unit 19 Prediction unit 20 Entropy encoding unit 21 Code data storage/output unit 31 entropy decoding unit 32 inverse quantization unit 33 inverse transformation unit 34 addition unit 35 storage unit 36 prediction unit 101 bit allocation unit 102 quantization parameter calculation unit 103 RD optimization unit

Claims

An encoding device that divides a video frame into encoding target regions, encodes each block, and performs RD optimization processing using an RD curve that approximates the relationship between bit rate and encoding distortion,
a bit allocation unit that calculates a first Lagrangian multiplier that minimizes a cost function and an allocated bit rate for each block using the bit rate allocated to the video frame and the parameters of the first RD curve;
a quantization parameter calculation unit that calculates a quantization parameter for each block using a second Lagrangian multiplier calculated from the allocated bit rate for each block and a parameter of a second RD curve;
an RD optimization unit that determines an encoding mode that minimizes the cost function based on the second Lagrangian multiplier,
The encoding device, wherein the first RD curve is approximated using a logarithmic function or an exponential function.

The bit allocation unit calculates the first Lagrangian multiplier using the bit rate allocated to the entire uncoded block in the video frame and the parameters of the first RD curve each time encoding of each block is completed. The encoding device according to claim 1, wherein the encoding device calculates an allocated bit rate for each block.

When the difference between the bit rate of the encoded block and the expected bit rate exceeds a threshold, the bit allocation unit divides the bit rate allocated to the entire unencoded blocks in the video frame and the first RD. The encoding device according to claim 1, wherein the first Lagrangian multiplier and the allocated bit rate for each block are calculated using curve parameters.

Obtaining the quantization parameter calculated by the quantization parameter calculation unit of the encoding device according to any one of claims 1 to 3,
A decoding device that decodes data encoded by the encoding device using the quantization parameter.

A program for causing a computer to function as the encoding device according to claim 1.

A program for causing a computer to function as the decoding device according to claim 4.