JP2015027022A

JP2015027022A - Video encoder, video coding method, and program

Info

Publication number: JP2015027022A
Application number: JP2013156514A
Authority: JP
Inventors: 知伸吉野; Tomonobu Yoshino; 内藤　整; Hitoshi Naito; 整内藤
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2013-07-29
Filing date: 2013-07-29
Publication date: 2015-02-05

Abstract

PROBLEM TO BE SOLVED: To achieve high coding performance while minimizing degradation of subjective quality.SOLUTION: A video encoder includes a pixel value evaluation unit 11, and edge analysis unit 12, and division determination unit 13. The pixel value gradient evaluation unit 11 calculates the pixel value gradient of each pixel of an input image (a). The edge analysis unit 12 detects a coding unit block including the boundary of an object, based on the pixel value gradient of each pixel calculated by the pixel value gradient evaluation unit 11. The division determination unit 13 determines the division size of a quadtree structure based on the pixel value gradient of each pixel calculated by the pixel value gradient evaluation unit 11, for a coding unit block including the boundary of an object determined to include the boundary of an object by the edge analysis unit 12.

Description

本発明は、動画像符号化装置、動画像符号化方法、およびプログラムに関する。 The present invention relates to a moving image encoding device, a moving image encoding method, and a program.

非特許文献１には、符号化処理ブロックごとに映像圧縮符号化処理を行う手法が示されている。また、非特許文献２には、符号化処理ブロックに関して、ブロックサイズの上限を予め定めておき、上限のブロックサイズ（Coding Tree Unit ；ＣＴＵ）に対して、四分木構造に従って符号化処理ブロックを分割する手法（図３参照）が示されている。 Non-Patent Document 1 discloses a technique for performing video compression encoding processing for each encoding processing block. Further, in Non-Patent Document 2, an upper limit of the block size is determined in advance with respect to the encoding processing block, and the encoding processing block is assigned to the upper limit block size (Coding Tree Unit; CTU) according to the quadtree structure. A method of dividing (see FIG. 3) is shown.

非特許文献２に示されている手法では、非特許文献１に示されている手法と比べて、ブロックサイズの大きい符号化処理ブロックを許容することで、ブロックごとに付与される制御情報量を少なくして、符号化性能を向上させることができる。特に、高解像度かつ低ビットレートでの符号化では、非特許文献２に示されている手法では、非特許文献１に示されている手法と比べて、高い符号化性能を実現できる。 In the method shown in Non-Patent Document 2, the amount of control information given to each block can be reduced by allowing an encoding process block having a larger block size compared to the method shown in Non-Patent Document 1. The encoding performance can be improved by reducing the number. In particular, in the encoding at a high resolution and a low bit rate, the method shown in Non-Patent Document 2 can realize higher encoding performance than the method shown in Non-Patent Document 1.

符号化処理ブロックを分割する際には、例えば、ＲＤ最適化法が用いられる（例えば、非特許文献３参照）。このＲＤ最適化法では、符号化による歪みおよび発生符号量をコスト関数で評価して、符号化性能を判断する。コスト関数については、発生符号量に対して量子化の粗さに応じた重み付けを行い、量子化が粗くなるに従って（ビットレートが低くなるに従って）、大きな重み付けを行う。 When the encoding processing block is divided, for example, an RD optimization method is used (see, for example, Non-Patent Document 3). In this RD optimization method, the encoding performance is judged by evaluating the distortion caused by encoding and the amount of generated code with a cost function. For the cost function, the generated code amount is weighted according to the roughness of quantization, and the weighting is increased as the quantization becomes coarser (as the bit rate becomes lower).

Joint Video Team(JVT) of ISO/IEC MPEG and ITU-T VCEG, "Text of ISO/IEC 14496-10 Advanced Video Coding," July 2004.Joint Video Team (JVT) of ISO / IEC MPEG and ITU-T VCEG, "Text of ISO / IEC 14496-10 Advanced Video Coding," July 2004. "High Efficiency Video Coding (HEVC) text specification draft10," JCT-VC 12th meeting, JCTVC-L1003 v34, Jan. 2013."High Efficiency Video Coding (HEVC) text specification draft10," JCT-VC 12th meeting, JCTVC-L1003 v34, Jan. 2013. G. Sullivan, T. Wiegand, "Rate-distortion optimization for video compression," IEEE Signal Processing Magazine, Vol. 15, issue 6, p74-90, Nov. 1998.G. Sullivan, T. Wiegand, "Rate-distortion optimization for video compression," IEEE Signal Processing Magazine, Vol. 15, issue 6, p74-90, Nov. 1998.

ＲＤ最適化法を用いて四分木構造の分割を行うことで、低ビットレート符号化時において、大きなブロックが採用されやすくなる。しかし、オブジェクトの境界を含むＣＴＵを分割する場合に大きなブロックが採用されると、エッジ成分の欠損や、不適切な動き参照によるエッジ近傍でのノイズの発生につながってしまい、主観品質の著しい低下を招いてしまっていた。 By dividing the quadtree structure using the RD optimization method, a large block is easily adopted at the time of low bit rate coding. However, if a large block is used to divide a CTU that includes object boundaries, it will lead to missing edge components and noise near the edges due to improper motion reference, resulting in a significant reduction in subjective quality. Has been invited.

また、インター符号化ピクチャでは、ＲＤコスト関数で発生符号量に大きな重み（ラグランジュ乗数）が設定されるため、大きなブロックが採用されやすくなる。特に低ビットレート符号化では、オブジェクトの境界近傍でも大きなブロックが採用されやすくなる。オブジェクトが動いている場合には、オブジェクトの境界近傍の大きなブロックに、オブジェクトの背後から出現するオクルージョン領域が含まれることになるので、オクルージョン領域で主観品質の著しい低下を招いてしまっていた。 In inter-coded pictures, a large weight (Lagrange multiplier) is set to the generated code amount by the RD cost function, so that a large block is easily adopted. In particular, in low bit rate encoding, a large block is likely to be adopted even near the boundary of an object. When the object is moving, an occlusion area that appears from behind the object is included in a large block near the boundary of the object, which causes a significant deterioration in subjective quality in the occlusion area.

そこで、本発明は、上述の課題に鑑みてなされたものであり、主観品質の低下の抑制しつつ、高い符号化性能を実現することを目的とする。 Accordingly, the present invention has been made in view of the above-described problems, and an object thereof is to realize high coding performance while suppressing deterioration in subjective quality.

本発明は、上記の課題を解決するために、以下の事項を提案している。
（１）本発明は、符号化処理単位ブロックを木構造に従って分割することを許容する動画像符号化装置（例えば、図１の動画像符号化装置１に相当）であって、前記符号化処理単位ブロックごとに画素値の特徴量（例えば、後述の画素値勾配に相当）を評価し、互いに隣接する前記符号化処理単位ブロック同士の画素値の特徴量の差異に基づいて、当該符号化処理単位ブロックに適した分割種類（例えば、後述の分割回数（ＣＵｄｅｐｔｈ）ｂに相当）を推定することを特徴とする動画像符号化装置を提案している。 The present invention proposes the following matters in order to solve the above problems.
(1) The present invention is a moving image encoding device (e.g., corresponding to the moving image encoding device 1 in FIG. 1) that allows an encoding processing unit block to be divided according to a tree structure, and the encoding processing A feature value of a pixel value (e.g., corresponding to a pixel value gradient described later) is evaluated for each unit block, and the encoding process is performed based on a difference in feature value of a pixel value between the encoding process unit blocks adjacent to each other. A moving picture coding apparatus has been proposed which is characterized by estimating a division type suitable for a unit block (e.g., corresponding to the number of divisions (CU depth) b described later).

ここで、オブジェクトの境界を境にして、画素値の特徴量が大きく変化する。そこで、この発明によれば、互いに隣接する符号化処理単位ブロック同士の画素値の特徴量の差異に基づいて、これら符号化処理単位ブロックに適した分割種類を決定することとした。このため、オブジェクトの境界を考慮して符号化処理単位ブロックを木構造に従って分割することができる。したがって、主観品質の低下を抑制しつつ、高い符号化性能を実現できる。 Here, the feature value of the pixel value greatly changes at the boundary of the object. Therefore, according to the present invention, the division type suitable for these encoding processing unit blocks is determined based on the difference in the feature values of the pixel values between the encoding processing unit blocks adjacent to each other. For this reason, the encoding processing unit block can be divided according to the tree structure in consideration of the boundary of the object. Therefore, high encoding performance can be realized while suppressing a decrease in subjective quality.

（２）本発明は、（１）の動画像符号化装置について、前記画素値の特徴量として、入力画像の各画素の画素値勾配を算出する画素値勾配評価手段（例えば、図２の画素値勾配評価部１１に相当）と、前記画素値勾配評価手段により算出された各画素の画素値勾配に基づいて、オブジェクトの境界を含む符号化処理単位ブロックを検出するエッジ解析手段（例えば、図２のエッジ解析部１２に相当）と、前記エッジ解析手段によりオブジェクトの境界を含むと検出された符号化処理単位ブロックについて、前記画素値勾配評価手段により算出された各画素の画素値勾配に基づいて木構造の分割サイズを決定する分割決定手段（例えば、図２の分割決定部１３に相当）と、を備えることを特徴とする動画像符号化装置を提案している。 (2) The present invention relates to a moving image encoding apparatus according to (1), wherein a pixel value gradient evaluation unit (for example, the pixel of FIG. 2) calculates a pixel value gradient of each pixel of the input image as the feature value of the pixel value. And an edge analysis unit that detects an encoding processing unit block including an object boundary based on the pixel value gradient of each pixel calculated by the pixel value gradient evaluation unit. 2) and an encoding processing unit block detected by the edge analysis means as including an object boundary, based on the pixel value gradient of each pixel calculated by the pixel value gradient evaluation means. In addition, there is proposed a moving picture encoding apparatus including a division determination unit (for example, equivalent to the division determination unit 13 in FIG. 2) for determining a division size of a tree structure.

この発明によれば、（１）の動画像符号化装置において、入力画像の各画素の画素値勾配に基づいて、オブジェクトの境界を含む符号化処理単位ブロックを検出し、検出した符号化処理単位ブロックについての木構造の分割サイズを、各画素の画素値勾配に基づいて決定することとした。このため、オブジェクトの境界を含む符号化処理単位ブロックについて、オブジェクトの境界を考慮して木構造に従って分割することができる。 According to the present invention, in the moving image encoding apparatus according to (1), an encoding processing unit block including an object boundary is detected based on a pixel value gradient of each pixel of an input image, and the detected encoding processing unit is detected. The division size of the tree structure for the block is determined based on the pixel value gradient of each pixel. Therefore, an encoding processing unit block including an object boundary can be divided according to a tree structure in consideration of the object boundary.

（３）本発明は、（２）の動画像符号化装置について、前記画素値勾配評価手段は、入力画像の各画素の画素値に対してｓｏｂｅｌフィルタまたはラプラシアンフィルタを適用して、前記各画素の画素値勾配を算出することを特徴とする動画像符号化装置を提案している。 (3) In the moving image encoding device according to (2), the pixel value gradient evaluation unit applies a sobel filter or a Laplacian filter to a pixel value of each pixel of the input image, and Has proposed a moving picture coding apparatus characterized by calculating a pixel value gradient of the video.

この発明によれば、動画像符号化装置において、入力画像の各画素の画素値に対してｓｏｂｅｌフィルタまたはラプラシアンフィルタを適用して、各画素の画素値勾配を算出することとした。このため、各画素の画素値勾配を適切に算出することができる。 According to the present invention, the moving image encoding apparatus calculates the pixel value gradient of each pixel by applying a sobel filter or a Laplacian filter to the pixel value of each pixel of the input image. For this reason, the pixel value gradient of each pixel can be calculated appropriately.

（４）本発明は、（２）の動画像符号化装置について、前記画素値勾配評価手段は、入力画像の各画素の画素値に対してローパスフィルタを施した後に、ｓｏｂｅｌフィルタまたはラプラシアンフィルタを適用して、前記各画素の画素値勾配を算出することを特徴とする動画像符号化装置を提案している。 (4) The present invention relates to the moving picture coding apparatus according to (2), wherein the pixel value gradient evaluation unit performs a low-pass filter on a pixel value of each pixel of the input image, and then performs a sobel filter or a Laplacian filter. The present invention proposes a moving picture coding apparatus that is applied to calculate a pixel value gradient of each pixel.

この発明によれば、（２）の動画像符号化装置において、入力画像の各画素の画素値に対してローパスフィルタを施した後に、ｓｏｂｅｌフィルタまたはラプラシアンフィルタを適用して、各画素の画素値勾配を算出することとした。このため、各画素の画素値勾配を適切に算出することができる。 According to the present invention, in the moving image encoding device of (2), after applying a low-pass filter to the pixel value of each pixel of the input image, the sobel filter or Laplacian filter is applied to obtain the pixel value of each pixel. The slope was calculated. For this reason, the pixel value gradient of each pixel can be calculated appropriately.

（５）本発明は、（２）から（４）のいずれかの動画像符号化装置について、前記エッジ解析手段は、符号化処理単位ブロック内の各画素の画素値勾配の絶対値の平均値または分散値が、予め定められた閾値以上であることと、隣接する符号化処理単位ブロックとの画素値勾配の絶対値の平均値または分散値の差分が予め定められた閾値以上であることと、の少なくともいずれかを満たす符号化処理単位ブロックを、前記オブジェクトの境界を含む符号化処理単位ブロックとして検出することを特徴とする動画像符号化装置を提案している。 (5) The present invention provides the moving image encoding device according to any one of (2) to (4), wherein the edge analysis means is an average value of absolute values of pixel value gradients of respective pixels in the encoding processing unit block. Alternatively, the variance value is greater than or equal to a predetermined threshold value, and the average value or absolute value of the absolute value of the pixel value gradient with the adjacent encoding processing unit block is greater than or equal to a predetermined threshold value. A moving picture encoding apparatus is proposed, which detects an encoding processing unit block satisfying at least one of the above as an encoding processing unit block including a boundary of the object.

この発明によれば、（２）から（４）のいずれかの動画像符号化装置において、符号化処理単位ブロック内の各画素の画素値勾配の絶対値の平均値または分散値が閾値以上であることと、隣接する符号化処理単位ブロックとの画素値勾配の絶対値の平均値または分散値の差分が閾値以上であることと、の少なくともいずれかを満たす符号化処理単位ブロックを、オブジェクトの境界を含む符号化処理単位ブロックとして検出することとした。このため、オブジェクトの境界を含む符号化処理単位ブロックを、適切に検出することができる。 According to the present invention, in any one of the moving picture encoding apparatuses according to (2) to (4), the average value or variance value of the absolute value of the pixel value gradient of each pixel in the encoding processing unit block is equal to or greater than the threshold value. An encoding processing unit block that satisfies at least one of the following: the difference in the average value or the variance value of the absolute value of the pixel value gradient from the adjacent encoding processing unit block is equal to or greater than a threshold value, The encoding process unit block including the boundary is detected. For this reason, the encoding process unit block containing the boundary of an object can be detected appropriately.

（６）本発明は、（２）から（５）のいずれかの動画像符号化装置について、前記分割決定手段は、分割前のブロックの画素値勾配の絶対値の平均値または分散値と、分割後の各ブロックの画素値勾配の絶対値の平均値または分散値と、の差分の絶対値が、予め定められた閾値より大きいブロックが存在すれば、当該分割前のブロックを分割可能と判断することを特徴とする動画像符号化装置を提案している。 (6) In the video encoding device according to any one of (2) to (5), the division determination unit includes an average value or a variance value of absolute values of pixel value gradients of blocks before division, If there is a block in which the absolute value of the difference between the average value or the variance value of the absolute value of the pixel value gradient of each block after the division is larger than a predetermined threshold, it is determined that the block before the division can be divided. A video encoding device characterized by the above is proposed.

この発明によれば、（２）から（５）のいずれかの動画像符号化装置において、分割前のブロックの画素値勾配の絶対値の平均値または分散値と、分割後の各ブロックの画素値勾配の絶対値の平均値または分散値と、の差分の絶対値が閾値より大きいブロックが存在すれば、分割前のブロックを分割可能と判断することとした。このため、どのブロックにオブジェクトの境界が含まれており、分割可能なのかを、適切に判断することができる。 According to the present invention, in any one of the moving picture coding apparatuses according to (2) to (5), the average value or variance value of the absolute value of the pixel value gradient of the block before division, and the pixel of each block after division If there is a block in which the absolute value of the difference between the average value or the variance value of the absolute values of the value gradient is larger than the threshold, it is determined that the block before the division can be divided. Therefore, it is possible to appropriately determine which block contains the boundary of the object and can be divided.

（７）本発明は、（２）から（６）のいずれかの動画像符号化装置について、前記エッジ解析手段によりオブジェクトの境界を含むと検出された符号化処理単位ブロックを分割したブロックごとに、前記画素値勾配評価手段により算出された各画素の画素値勾配に基づいて量子化パラメータを決定する量子化パラメータ決定手段を備えることを特徴とする動画像符号化装置を提案している。 (7) In the moving image encoding device according to any one of (2) to (6), the present invention provides, for each block obtained by dividing an encoding processing unit block detected as including an object boundary by the edge analysis unit. The video encoding apparatus is characterized by comprising quantization parameter determining means for determining a quantization parameter based on the pixel value gradient of each pixel calculated by the pixel value gradient evaluating means.

この発明によれば、（２）から（６）のいずれかの動画像符号化装置において、ブロックごとに、各画素の画素値勾配に基づいて量子化パラメータを決定することとした。このため、ブロックごとに適切な量子化パラメータを決定することができる。 According to the present invention, in any one of the moving picture encoding apparatuses according to (2) to (6), the quantization parameter is determined for each block based on the pixel value gradient of each pixel. For this reason, an appropriate quantization parameter can be determined for each block.

（８）本発明は、（７）の動画像符号化装置について、前記量子化パラメータ決定手段は、前記エッジ解析手段によりオブジェクトの境界を含むと検出された符号化処理単位ブロックを分割したブロックごとに、画素値勾配の絶対値の平均値または分散値が、予め定められた閾値以上であるか否かにより、前記量子化パラメータを決定することを特徴とする動画像符号化装置を提案している。 (8) In the moving image encoding apparatus according to (7), the quantization parameter determination unit includes a block obtained by dividing the encoding processing unit block detected by the edge analysis unit as including an object boundary. In addition, a video encoding device is proposed in which the quantization parameter is determined based on whether an average value or a variance value of the absolute value of the pixel value gradient is equal to or greater than a predetermined threshold value. Yes.

この発明によれば、（７）の動画像符号化装置において、ブロックごとに、画素値勾配の絶対値の平均値または分散値が閾値以上であるか否かにより、量子化パラメータを決定することとした。このため、ブロックごとにさらに適切な量子化パラメータを決定することができる。 According to the present invention, in the moving picture coding apparatus according to (7), the quantization parameter is determined for each block depending on whether the average value or the variance value of the absolute value of the pixel value gradient is equal to or greater than the threshold value. It was. For this reason, a more appropriate quantization parameter can be determined for each block.

（９）本発明は、符号化処理単位ブロックを木構造に従って分割することを許容する動画像符号化装置（例えば、図１の動画像符号化装置１に相当）における動画像符号化方法であって、前記符号化処理単位ブロックごとに画素値の特徴量（例えば、後述の画素値勾配に相当）を評価し、互いに隣接する前記符号化処理単位ブロック同士の画素値の特徴量の差異に基づいて、当該符号化処理単位ブロックに適した分割種類（例えば、後述の分割回数（ＣＵｄｅｐｔｈ）ｂに相当）を推定することを特徴とする動画像符号化方法を提案している。 (9) The present invention is a moving picture coding method in a moving picture coding apparatus (for example, equivalent to the moving picture coding apparatus 1 in FIG. 1) that allows a coding processing unit block to be divided according to a tree structure. Then, a feature value of a pixel value (e.g., corresponding to a pixel value gradient described later) is evaluated for each coding processing unit block, and based on a difference in feature value of a pixel value between the coding processing unit blocks adjacent to each other. Thus, a video encoding method has been proposed in which a division type suitable for the encoding processing unit block (e.g., equivalent to the number of divisions (CU depth) b described later) is estimated.

この発明によれば、互いに隣接する符号化処理単位ブロック同士の画素値の特徴量の差異に基づいて、これら符号化処理単位ブロックに適した分割種類を決定することとした。このため、オブジェクトの境界を考慮して符号化処理単位ブロックを木構造に従って分割することができる。したがって、主観品質の低下を抑制しつつ、高い符号化性能を実現できる。 According to the present invention, the division type suitable for these encoding processing unit blocks is determined based on the difference in the feature values of the pixel values between the encoding processing unit blocks adjacent to each other. For this reason, the encoding processing unit block can be divided according to the tree structure in consideration of the boundary of the object. Therefore, high encoding performance can be realized while suppressing a decrease in subjective quality.

（１０）本発明は、符号化処理単位ブロックを木構造に従って分割することを許容する動画像符号化装置（例えば、図１の動画像符号化装置１に相当）における動画像符号化方法を、コンピュータに実行させるためのプログラムであって、前記符号化処理単位ブロックごとに画素値の特徴量（例えば、後述の画素値勾配に相当）を評価し、互いに隣接する前記符号化処理単位ブロック同士の画素値の特徴量の差異に基づいて、当該符号化処理単位ブロックに適した分割種類（例えば、後述の分割回数（ＣＵｄｅｐｔｈ）ｂに相当）を推定するためのプログラムを提案している。 (10) The present invention relates to a moving picture coding method in a moving picture coding apparatus (e.g., equivalent to the moving picture coding apparatus 1 in FIG. 1) that allows a coding processing unit block to be divided according to a tree structure. A program to be executed by a computer, wherein a feature value of a pixel value (e.g., corresponding to a pixel value gradient described later) is evaluated for each encoding processing unit block, and the encoding processing unit blocks adjacent to each other are evaluated. A program for estimating a division type suitable for the encoding processing unit block (e.g., equivalent to the number of divisions (CU depth) b described later) based on the difference in feature values of pixel values is proposed.

本発明によれば、主観品質の低下を抑制しつつ、高い符号化性能を実現できる。 According to the present invention, it is possible to realize high coding performance while suppressing deterioration in subjective quality.

本発明の一実施形態に係る動画像符号化装置のブロック図である。It is a block diagram of the moving image encoder which concerns on one Embodiment of this invention. 前記実施形態に係る動画像符号化装置が備える画質制御部のブロック図である。It is a block diagram of the image quality control part with which the moving image encoder which concerns on the said embodiment is provided. 四分木構造に従った符号化処理ブロックの分割について説明するための図である。It is a figure for demonstrating the division | segmentation of the encoding process block according to a quadtree structure.

以下、本発明の実施の形態について図面を参照しながら説明する。なお、以下の実施形態における構成要素は適宜、既存の構成要素などとの置き換えが可能であり、また、他の既存の構成要素との組合せを含む様々なバリエーションが可能である。したがって、以下の実施形態の記載をもって、特許請求の範囲に記載された発明の内容を限定するものではない。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. Note that the constituent elements in the following embodiments can be appropriately replaced with existing constituent elements, and various variations including combinations with other existing constituent elements are possible. Accordingly, the description of the following embodiments does not limit the contents of the invention described in the claims.

［動画像符号化装置１の構成および動作］
図１は、本発明の一実施形態に係る動画像符号化装置１のブロック図である。動画像符号化装置１は、符号化処理単位ブロックを四分木構造に従って分割することを許容し、分割したブロックごとに量子化パラメータを制御可能な動画像符号化装置である。この動画像符号化装置１は、画質制御部１０、映像分割部２０、予測値生成部３０、ＤＣＴ／量子化部４０、エントロピー符号化部５０、逆ＤＣＴ／逆量子化部６０、およびローカルメモリ７０を備える。 [Configuration and Operation of Video Encoding Device 1]
FIG. 1 is a block diagram of a video encoding apparatus 1 according to an embodiment of the present invention. The moving picture coding apparatus 1 is a moving picture coding apparatus that allows a coding processing unit block to be divided according to a quadtree structure and can control a quantization parameter for each divided block. The moving image encoding apparatus 1 includes an image quality control unit 10, a video dividing unit 20, a predicted value generation unit 30, a DCT / quantization unit 40, an entropy encoding unit 50, an inverse DCT / inverse quantization unit 60, and a local memory. 70.

画質制御部１０は、入力画像ａを入力とする。この画質制御部１０は、入力画像ａの画素値に関する画素値勾配の評価値に基づいて、オブジェクトの境界を含むＣＴＵを検出し、そのＣＴＵで許容する分割回数（ＣＵｄｅｐｔｈ）ｂを求めて出力する。また、入力画像ａの画素値に関する画素値勾配の評価値に基づいて、ＣＵ（ＣｏｄｉｎｇＵｎｉｔ）ごとに、量子化パラメータ（ＱＰ）に対する差分の情報（ΔＱＰ）を決定し、量子化パラメータに対する差分情報ｃとして出力する。なお、量子化パラメータ（ＱＰ）に対する差分の情報（ΔＱＰ）とは、処理スライスにおける量子化パラメータ（ＱＰ）と、そのＣＵで用いられる量子化パラメータと、の差分のことである。 The image quality control unit 10 receives the input image a. The image quality control unit 10 detects a CTU including an object boundary based on the evaluation value of the pixel value gradient related to the pixel value of the input image a, and obtains and outputs the number of divisions (CU depth) b allowed by the CTU. To do. Further, difference information (ΔQP) for the quantization parameter (QP) is determined for each CU (Coding Unit) based on the evaluation value of the pixel value gradient related to the pixel value of the input image a, and the difference information for the quantization parameter Output as c. The difference information (ΔQP) with respect to the quantization parameter (QP) is a difference between the quantization parameter (QP) in the processing slice and the quantization parameter used in the CU.

図２は、画質制御部１０のブロック図である。画質制御部１０は、画素値勾配評価部１１、エッジ解析部１２、分割決定部１３、および量子化パラメータ決定部１４を備える。 FIG. 2 is a block diagram of the image quality control unit 10. The image quality control unit 10 includes a pixel value gradient evaluation unit 11, an edge analysis unit 12, a division determination unit 13, and a quantization parameter determination unit 14.

画素値勾配評価部１１は、入力画像ａを入力とする。この画素値勾配評価部１１は、入力画像ａの全画素値について画素値勾配を算出し、各画素の画素値勾配ｊとして出力する。画素値勾配の算出は、例えば、入力画像ａの全画素値に対して、ローパスフィルタを施した後に、ｓｏｂｅｌフィルタやラプラシアンフィルタなどを適用することで、実現できる。 The pixel value gradient evaluation unit 11 receives the input image a. The pixel value gradient evaluation unit 11 calculates a pixel value gradient for all pixel values of the input image a, and outputs the pixel value gradient as a pixel value gradient j of each pixel. The pixel value gradient can be calculated, for example, by applying a low-pass filter to all pixel values of the input image a and then applying a sobel filter or a Laplacian filter.

エッジ解析部１２は、各画素の画素値勾配ｊを入力とする。このエッジ解析部１２は、各画素の画素値勾配ｊに基づいて、オブジェクトの境界を含むＣＴＵを検出する。オブジェクトの境界を含むＣＴＵの検出では、例えば、まず、各ＣＴＵ内の各画素の画素値勾配の絶対値の平均値または分散値を算出し、算出した平均値または分散値が予め定められた閾値以上であるか否かを判別することにより、各ＣＴＵの画素値勾配の大小を判別する。次に、同一のオブジェクトが複数のＣＴＵにまたがって存在する場合にはこれらＣＴＵのそれぞれの画素値勾配の特徴が類似するという特徴を考慮して、互いに隣接するＣＴＵ同士の画素値勾配の絶対値の平均値または分散値の差分が予め定められた閾値以上であるか否かを判別する。次に、ＣＴＵ内の画素値の絶対値の平均値または分散値が閾値以上で、かつ、隣接するＣＴＵとの画素値勾配の絶対値の平均値または分散値の差分が閾値以上であるＣＴＵを、オブジェクトの境界を含むＣＴＵとして検出し、検出したＣＴＵの座標を、オブジェクトの境界を含むＣＴＵの座標ｋとして出力する。 The edge analysis unit 12 receives the pixel value gradient j of each pixel as an input. The edge analysis unit 12 detects a CTU including an object boundary based on the pixel value gradient j of each pixel. In detecting the CTU including the boundary of the object, for example, first, an average value or a variance value of the absolute value of the pixel value gradient of each pixel in each CTU is calculated, and the calculated average value or variance value is a predetermined threshold value. By determining whether or not this is the case, the magnitude of the pixel value gradient of each CTU is determined. Next, in consideration of the feature that the pixel value gradients of these CTUs are similar when the same object exists across a plurality of CTUs, the absolute value of the pixel value gradients between the adjacent CTUs is considered. It is determined whether or not the difference between the average value or the variance value is equal to or greater than a predetermined threshold value. Next, a CTU in which the average value or variance value of the absolute values of the pixel values in the CTU is greater than or equal to a threshold value, and the difference between the average value or the variance value of the pixel value gradients of adjacent CTUs is greater than or equal to the threshold value The CTU including the boundary of the object is detected, and the coordinates of the detected CTU are output as the coordinates k of the CTU including the boundary of the object.

分割決定部１３は、各画素の画素値勾配ｊと、オブジェクトの境界を含むＣＴＵの座標ｋと、を入力とする。この分割決定部１３は、オブジェクトの境界を含むＣＴＵについて、四分木構造に基づく四分割の可否を判断する。具体的には、まず、分割前のブロックの画素値勾配の絶対値の平均値または分散値と、分割後の各ブロックの画素値勾配の絶対値の平均値または分散値と、の差分の絶対値を算出する。次に、算出した差分の絶対値が予め定められた閾値より大きいブロックが存在すれば、分割前のブロックを分割可能と判断する。そして、分割不可能と判断するか、予め定められた分割回数の最大値まで、上述の差分の絶対値の算出と、上述の分割可否の判断と、を繰り返し、分割サイズが小さい方からＮ階層（ただし、ＮはＮ＞０を満たす整数）の四分木構造表現までを、オブジェクトの境界を含むＣＴＵの座標ｋで示されるＣＴＵで許容する分割回数（ＣＵｄｅｐｔｈ）ｂとして出力する。 The division determination unit 13 receives the pixel value gradient j of each pixel and the CTU coordinates k including the boundary of the object as inputs. The division determination unit 13 determines whether or not the CTU including the object boundary can be divided into four based on the quadtree structure. Specifically, first, the absolute value of the difference between the average value or variance value of the pixel value gradient of the block before division and the average value or variance value of the absolute value of the pixel value gradient of each block after division. Calculate the value. Next, if there is a block in which the absolute value of the calculated difference is greater than a predetermined threshold, it is determined that the block before division can be divided. Then, it is determined that the division is impossible, or the calculation of the absolute value of the difference and the determination of whether the division is possible are repeated up to a predetermined maximum number of divisions. Up to a quadtree structure expression (where N is an integer satisfying N> 0) is output as the number of divisions (CU depth) b allowed by the CTU indicated by the coordinates k of the CTU including the boundary of the object.

量子化パラメータ決定部１４は、各画素の画素値勾配ｊと、オブジェクトの境界を含むＣＴＵの座標ｋと、分割回数ｂと、を入力とする。また、量子化パラメータ決定部１４には、処理スライスにおける量子化パラメータ（ＱＰ）も入力される（図示省略）。この量子化パラメータ決定部１４は、分割回数ｂに従って、ＣＵごとに画素値勾配の絶対値の平均値または分散値を算出し、閾値判定に基づいてＣＵごとに量子化パラメータに対する差分情報ｃを決定して出力する。 The quantization parameter determination unit 14 receives the pixel value gradient j of each pixel, the CTU coordinates k including the boundary of the object, and the division count b. The quantization parameter determination unit 14 also receives a quantization parameter (QP) in the processing slice (not shown). The quantization parameter determination unit 14 calculates an average value or a variance value of the absolute value of the pixel value gradient for each CU according to the division count b, and determines difference information c for the quantization parameter for each CU based on threshold determination. And output.

図１に戻って、映像分割部２０は、入力画像ａと、分割回数ｂと、を入力とする。この映像分割部２０は、まず、入力画像ａをＣＴＵに分割する。次に、ＣＴＵに分割された各画像について、分割回数ｂで示される回数を上限としてＲＤ最適化法により分割回数を決定し、決定した分割回数だけ四分木構造で分割し、分割された入力画像ｄとして出力する。 Returning to FIG. 1, the video dividing unit 20 receives the input image a and the division count b. The video dividing unit 20 first divides the input image a into CTUs. Next, for each image divided into CTUs, the number of divisions is determined by the RD optimization method with the number of divisions indicated by b as the upper limit. Output as image d.

予測値生成部３０は、分割された入力画像ｄと、ローカルメモリ７０から供給される後述の符号化済み画像ｉと、を入力とする。この予測値生成部３０は、イントラ予測やインター予測などのそれぞれの予測方法に基づいて予測値を生成し、最も高い符号化性能の期待される予測方法による予測値を、予測値ｅとして出力する。 The predicted value generation unit 30 receives the divided input image d and an encoded image i (described later) supplied from the local memory 70 as inputs. The prediction value generation unit 30 generates a prediction value based on each prediction method such as intra prediction or inter prediction, and outputs a prediction value based on a prediction method expected to have the highest coding performance as a prediction value e. .

ＤＣＴ／量子化部４０は、予測残差信号と、量子化パラメータに対する差分情報ｃと、を入力とする。予測残差信号とは、分割された入力画像ｄと、予測値ｅと、の差分信号のことである。このＤＣＴ／量子化部４０は、予測残差信号に対して直交変換処理を行い、この直交変換処理により得られた変換係数について量子化処理を行って、量子化された変換係数ｆを出力する。なお、量子化処理では、量子化パラメータに対する差分情報ｃとしてのΔＱＰだけ処理スライスの量子化パラメータ（ＱＰ）に対して変化させた量子化パラメータ値を、用いる。 The DCT / quantization unit 40 receives the prediction residual signal and difference information c for the quantization parameter as inputs. The prediction residual signal is a difference signal between the divided input image d and the predicted value e. The DCT / quantization unit 40 performs orthogonal transform processing on the prediction residual signal, performs quantization processing on the transform coefficient obtained by the orthogonal transform processing, and outputs a quantized transform coefficient f. . In the quantization process, a quantization parameter value that is changed with respect to the quantization parameter (QP) of the processing slice by ΔQP as the difference information c with respect to the quantization parameter is used.

エントロピー符号化部５０は、量子化された変換係数ｆを入力とする。このエントロピー符号化部５０は、量子化された変換係数ｆに対してエントロピー符号化処理を行って、その結果について、符号化データへの記述規則（符号化シンタックス）に従って符号化データに記述し、符号化データｇとして出力する。 The entropy encoding unit 50 receives the quantized transform coefficient f as an input. The entropy encoding unit 50 performs entropy encoding processing on the quantized transform coefficient f, and describes the result in the encoded data according to the description rule (encoding syntax) for the encoded data. , And output as encoded data g.

逆ＤＣＴ／逆量子化部６０は、量子化された変換係数ｆと、量子化パラメータに対する差分情報ｃと、を入力とする。この逆ＤＣＴ／逆量子化部６０は、量子化された変換係数ｆについて逆量子化処理を行い、この逆量子化処理により得られた変換係数に対して逆変換処理を行って、逆直交変換された変換係数ｈとして出力する。なお、逆量子化処理では、量子化パラメータに対する差分情報ｃとしてのΔＱＰだけ処理スライスの量子化パラメータ（ＱＰ）に対して変化させた量子化パラメータ値を、用いる。 The inverse DCT / inverse quantization unit 60 receives the quantized transform coefficient f and difference information c for the quantization parameter as inputs. The inverse DCT / inverse quantization unit 60 performs an inverse quantization process on the quantized transform coefficient f, performs an inverse transform process on the transform coefficient obtained by the inverse quantization process, and performs an inverse orthogonal transform. The converted conversion coefficient h is output. In the inverse quantization process, a quantization parameter value that is changed with respect to the quantization parameter (QP) of the processing slice by ΔQP as the difference information c with respect to the quantization parameter is used.

ローカルメモリ７０は、フィルタ処理済みのローカルデコード画像である符号化済み画像ｉを入力とする。このローカルメモリ７０は、入力された符号化済み画像ｉを蓄積する。そして、次の符号化処理単位ブロック以降において過去の符号化済み画像ｉを参照する必要がある場合に、適宜、予測値生成部３０に符号化済み画像ｉを供給する。 The local memory 70 receives an encoded image i, which is a filtered local decoded image. The local memory 70 stores the input encoded image i. Then, when it is necessary to refer to the past encoded image i after the next encoding processing unit block, the encoded image i is supplied to the predicted value generation unit 30 as appropriate.

以上の動画像符号化装置１によれば、以下の効果を奏することができる。 According to the above moving picture coding apparatus 1, the following effects can be produced.

動画像符号化装置１は、互いに隣接するＣＴＵ同士の画素値勾配の絶対値の平均値または分散値の差分に基づいて、これらＣＴＵに適した分割種類を決定する。このため、オブジェクトの境界を考慮してＣＴＵを四分木構造に従って分割することができる。したがって、主観品質の低下を抑制しつつ、高い符号化性能を実現できる。 The moving image encoding device 1 determines a division type suitable for these CTUs based on the average value or the difference value of the absolute values of the pixel value gradients between adjacent CTUs. Therefore, the CTU can be divided according to the quadtree structure in consideration of the object boundary. Therefore, high encoding performance can be realized while suppressing a decrease in subjective quality.

また、動画像符号化装置１は、入力画像ａの各画素の画素値勾配に基づいて、オブジェクトの境界を含むＣＴＵを検出し、検出したＣＴＵについての四分木構造の分割サイズを、各画素の画素値勾配に基づいて決定する。このため、オブジェクトの境界を含むＣＴＵについて、オブジェクトの境界を考慮して四分木構造に従って分割することができる。 In addition, the moving image encoding apparatus 1 detects a CTU including an object boundary based on the pixel value gradient of each pixel of the input image a, and determines the division size of the quadtree structure for the detected CTU for each pixel. Is determined based on the pixel value gradient. Therefore, the CTU including the object boundary can be divided according to the quadtree structure in consideration of the object boundary.

また、動画像符号化装置１は、入力画像ａの各画素の画素値に対してローパスフィルタを施した後に、ｓｏｂｅｌフィルタまたはラプラシアンフィルタを適用して、各画素の画素値勾配を算出する。このため、各画素の画素値勾配を適切に算出することができる。 In addition, the moving image encoding apparatus 1 calculates a pixel value gradient of each pixel by applying a low-pass filter to the pixel value of each pixel of the input image a and then applying a sobel filter or a Laplacian filter. For this reason, the pixel value gradient of each pixel can be calculated appropriately.

また、動画像符号化装置１は、ＣＴＵ内の各画素の画素値勾配の絶対値の平均値または分散値が閾値以上であることと、隣接するＣＴＵとの画素値勾配の絶対値の平均値または分散値の差分が閾値以上であることと、の少なくともいずれかを満たすＣＴＵを、オブジェクトの境界を含むＣＴＵとして検出する。このため、オブジェクトの境界を含むＣＴＵを、適切に検出することができる。 In addition, the moving image encoding apparatus 1 determines that the average value or the variance value of the pixel value gradient of each pixel in the CTU is equal to or larger than the threshold value, and the average value of the absolute value of the pixel value gradient with the adjacent CTU. Alternatively, a CTU that satisfies at least one of the difference between variance values being equal to or greater than a threshold value is detected as a CTU that includes an object boundary. For this reason, CTU including the boundary of an object can be detected appropriately.

また、動画像符号化装置１は、分割前のブロックの画素値勾配の絶対値の平均値または分散値と、分割後の各ブロックの画素値勾配の絶対値の平均値または分散値と、の差分の絶対値が閾値より大きいブロックが存在すれば、分割前のブロックを分割可能と判断する。このため、どのブロックにオブジェクトの境界が含まれており、分割可能なのかを、適切に判断することができる。 In addition, the moving image encoding device 1 calculates the average value or variance value of the absolute value of the pixel value gradient of the block before division and the average value or variance value of the absolute value of the pixel value gradient of each block after division. If there is a block whose absolute value of the difference is larger than the threshold, it is determined that the block before division can be divided. Therefore, it is possible to appropriately determine which block contains the boundary of the object and can be divided.

また、動画像符号化装置１は、ブロックごとに、画素値勾配の絶対値の平均値または分散値が閾値以上であるか否かにより、量子化パラメータを決定する。このため、ブロックごとにさらに適切な量子化パラメータを決定することができる。 Further, the moving image encoding apparatus 1 determines the quantization parameter for each block depending on whether the average value or the variance value of the absolute value of the pixel value gradient is equal to or greater than a threshold value. For this reason, a more appropriate quantization parameter can be determined for each block.

なお、本発明の動画像符号化装置１の処理を、コンピュータ読み取り可能な非一時的な記録媒体に記録し、この記録媒体に記録されたプログラムを動画像符号化装置１に読み込ませ、実行することによって、本発明を実現できる。 The processing of the moving picture coding apparatus 1 of the present invention is recorded on a computer-readable non-transitory recording medium, and the program recorded on the recording medium is read by the moving picture coding apparatus 1 and executed. Thus, the present invention can be realized.

ここで、上述の記録媒体には、例えば、ＥＰＲＯＭやフラッシュメモリといった不揮発性のメモリ、ハードディスクといった磁気ディスク、ＣＤ−ＲＯＭなどを適用できる。また、この記録媒体に記録されたプログラムの読み込みおよび実行は、動画像符号化装置１に設けられたプロセッサによって行われる。 Here, for example, a nonvolatile memory such as an EPROM or a flash memory, a magnetic disk such as a hard disk, a CD-ROM, or the like can be applied to the above-described recording medium. Further, reading and execution of the program recorded on the recording medium is performed by a processor provided in the moving image encoding apparatus 1.

また、上述のプログラムは、このプログラムを記憶装置などに格納した動画像符号化装置１から、伝送媒体を介して、あるいは、伝送媒体中の伝送波により他のコンピュータシステムに伝送されてもよい。ここで、プログラムを伝送する「伝送媒体」は、インターネットなどのネットワーク（通信網）や電話回線などの通信回線（通信線）のように情報を伝送する機能を有する媒体のことをいう。 Further, the above-described program may be transmitted from the moving image encoding apparatus 1 storing the program in a storage device or the like to another computer system via a transmission medium or by a transmission wave in the transmission medium. Here, the “transmission medium” for transmitting the program refers to a medium having a function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line.

また、上述のプログラムは、上述の機能の一部を実現するためのものであってもよい。さらに、上述の機能を動画像符号化装置１にすでに記録されているプログラムとの組合せで実現できるもの、いわゆる差分ファイル（差分プログラム）であってもよい。 Further, the above-described program may be for realizing a part of the above-described function. Furthermore, what can implement | achieve the above-mentioned function in combination with the program already recorded on the moving image encoder 1, what is called a difference file (difference program) may be sufficient.

以上、この発明の実施形態につき、図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計なども含まれる。 The embodiment of the present invention has been described in detail with reference to the drawings. However, the specific configuration is not limited to this embodiment, and includes a design that does not depart from the gist of the present invention.

例えば、上述の実施形態では、符号化処理ブロックを木構造に基づいて分割する例として、非特許文献２に示されている四分木構造の分割を示した。しかし、これに限らず、二分木構造といった任意の分岐数に基づく木構造の分割に、本発明は適用することができる。 For example, in the above-described embodiment, the quadtree structure division shown in Non-Patent Document 2 is shown as an example of dividing the encoding processing block based on the tree structure. However, the present invention is not limited to this, and the present invention can be applied to division of a tree structure based on an arbitrary number of branches such as a binary tree structure.

また、上述の実施形態では、画質制御部１０には入力画像ａが入力されるものとしたが、これに限らず、例えば入力画像ａの構造成分のみで構成される画像が入力されてもよい。これによれば、画質制御部１０によるオブジェクトの境界の検出性能を向上させることができ、その結果、主観品質をさらに向上させることができる。なお、入力画像ａから構造成分を抽出する手法としては、例えばＴｏｔａｌＶａｒｉａｔｉｏｎを適用することができる。 In the above-described embodiment, the input image a is input to the image quality control unit 10. However, the present invention is not limited to this. For example, an image including only the structural components of the input image a may be input. . According to this, the detection performance of the object boundary by the image quality control unit 10 can be improved, and as a result, the subjective quality can be further improved. As a method for extracting the structural component from the input image a, for example, Total Variation can be applied.

１・・・動画像符号化装置
１０・・・画質制御部
１１・・・画素値勾配評価部
１２・・・エッジ解析部
１３・・・分割決定部
１４・・・量子化パラメータ決定部
２０・・・映像分割部
３０・・・予測値生成部
４０・・・ＤＣＴ／量子化部
５０・・・エントロピー符号化部
６０・・・逆ＤＣＴ／逆量子化部
７０・・・ローカルメモリ DESCRIPTION OF SYMBOLS 1 ... Moving image encoding apparatus 10 ... Image quality control part 11 ... Pixel value gradient evaluation part 12 ... Edge analysis part 13 ... Division determination part 14 ... Quantization parameter determination part 20 ..Video segmentation unit 30 ... predicted value generation unit 40 ... DCT / quantization unit 50 ... entropy encoding unit 60 ... inverse DCT / inverse quantization unit 70 ... local memory

Claims

A video encoding device that allows an encoding processing unit block to be divided according to a tree structure,
A feature type of a pixel value is evaluated for each encoding process unit block, and a division type suitable for the encoding process unit block based on a difference in feature value of a pixel value between the encoding process unit blocks adjacent to each other A moving picture coding apparatus characterized by estimating.

A pixel value gradient evaluation means for calculating a pixel value gradient of each pixel of the input image as the feature value of the pixel value;
An edge analysis unit that detects an encoding processing unit block including an object boundary based on a pixel value gradient of each pixel calculated by the pixel value gradient evaluation unit;
Division determination for determining the division size of the tree structure based on the pixel value gradient of each pixel calculated by the pixel value gradient evaluation unit for the encoding processing unit block detected by the edge analysis unit as including an object boundary The moving picture coding apparatus according to claim 1, further comprising: means.

The pixel value gradient evaluation unit calculates a pixel value gradient of each pixel by applying a sobel filter or a Laplacian filter to the pixel value of each pixel of the input image. Video encoding device.

The pixel value gradient evaluation unit calculates a pixel value gradient of each pixel by applying a low-pass filter to a pixel value of each pixel of the input image and then applying a sobel filter or a Laplacian filter. The moving picture encoding apparatus according to claim 2.

The edge analysis means includes
The average value or variance value of the absolute value of the pixel value gradient of each pixel in the encoding processing unit block is equal to or greater than a predetermined threshold;
The difference between the average value or the variance value of the absolute value of the pixel value gradient with the adjacent encoding processing unit block is equal to or greater than a predetermined threshold;
5. The moving picture encoding apparatus according to claim 2, wherein an encoding processing unit block satisfying at least one of the following is detected as an encoding processing unit block including a boundary of the object.

The division determination means is the absolute value of the difference between the average value or variance value of the pixel value gradient of the block before division and the absolute value average value or variance value of the pixel value gradient of each block after division. 6. The moving picture encoding apparatus according to claim 2, wherein if there is a block larger than a predetermined threshold, it is determined that the block before the division can be divided.

A quantization parameter is determined based on the pixel value gradient of each pixel calculated by the pixel value gradient evaluation unit for each block obtained by dividing the encoding processing unit block detected as including an object boundary by the edge analysis unit. The moving picture encoding apparatus according to claim 2, further comprising a quantization parameter determining unit that performs the determination.

The quantization parameter determining means determines in advance an average value or variance value of absolute values of pixel value gradients for each block obtained by dividing the encoding processing unit block detected as including an object boundary by the edge analysis means. The moving picture coding apparatus according to claim 7, wherein the quantization parameter is determined based on whether or not the threshold value is equal to or greater than a predetermined threshold value.

A moving picture coding method in a moving picture coding apparatus that allows a coding processing unit block to be divided according to a tree structure,
A feature type of a pixel value is evaluated for each encoding process unit block, and a division type suitable for the encoding process unit block based on a difference in feature value of a pixel value between the encoding process unit blocks adjacent to each other A moving picture coding method characterized by estimating.

A program for causing a computer to execute a moving picture coding method in a moving picture coding apparatus that allows a coding processing unit block to be divided according to a tree structure,
A feature type of a pixel value is evaluated for each encoding process unit block, and a division type suitable for the encoding process unit block based on a difference in feature value of a pixel value between the encoding process unit blocks adjacent to each other A program for estimating.