JP5626728B2

JP5626728B2 - Image coding method, image coding apparatus, and image coding program

Info

Publication number: JP5626728B2
Application number: JP2010233230A
Authority: JP
Inventors: 幸浩坂東; 誠之高村; 裕尚如澤; 孝明石川; 渡辺　裕; 裕渡辺
Original assignee: Waseda University; Nippon Telegraph and Telephone Corp
Current assignee: Waseda University; Nippon Telegraph and Telephone Corp
Priority date: 2010-10-18
Filing date: 2010-10-18
Publication date: 2014-11-19
Anticipated expiration: 2030-10-18
Also published as: JP2012089964A

Description

本発明は，高能率画像信号符号化方法に関し，特に交互射影法におけるエネルギーコンパクションを向上させて符号化効率を向上させるための画像符号化方法に関する。 The present invention relates to a high-efficiency image signal encoding method, and more particularly to an image encoding method for improving energy compaction and improving encoding efficiency in an alternating projection method.

画像符号化における重要な要素技術の一つに，離散コサイン変換（ＤＣＴ：Discrete Cosine Transform ）に代表される変換符号化がある。画像符号化における変換符号化の役割は，空間的な画素間相関の除去を行うことにある。符号化器の全体の位置づけとしては，変換符号化により少数の変換係数に情報を集中させ，変換係数に対する量子化により情報の集中度の低い変換係数を切り捨てることで，符号化対象信号に対する情報量の削減に寄与する。 One of the important elemental techniques in image coding is transform coding represented by Discrete Cosine Transform (DCT). The role of transform coding in image coding is to remove spatial correlation between pixels. The overall position of the encoder is as follows: information is concentrated on a small number of transform coefficients by transform coding, and the transform coefficients with low information concentration are rounded down by quantizing the transform coefficients, thereby reducing the amount of information for the signal to be encoded. Contributes to the reduction of

図７に，映像信号を符号化する一般的な符号化装置１００の例を示す。符号化装置１００は映像信号を入力すると，予測部１０６により予測された予測信号との差分から予測残差信号を求め，変換部１０１により予測残差信号を直交変換する。その出力である変換係数を量子化部１０２にて量子化し，その量子化値をエントロピ符号化部１０７にて可変長符号化し，符号化ストリームとして出力する。一方，量子化部１０２の出力は，逆量子化部１０３で逆量子化され，さらに逆変換部１０４で逆直交変換される。その変換結果に予測信号を加えることにより復号信号が生成される。復号信号は，歪除去フィルタ１０５によりノイズ除去処理がなされ，参照復号信号として予測部１０６に入力される。予測部１０６では，動き探索などにより次の映像信号の符号化のための予測信号を生成する。 FIG. 7 shows an example of a general encoding apparatus 100 that encodes a video signal. When the video signal is input, the encoding apparatus 100 obtains a prediction residual signal from the difference from the prediction signal predicted by the prediction unit 106, and the transform unit 101 performs orthogonal transform on the prediction residual signal. The output transform coefficient is quantized by the quantization unit 102, and the quantized value is variable-length encoded by the entropy encoding unit 107 and output as an encoded stream. On the other hand, the output of the quantization unit 102 is inversely quantized by the inverse quantization unit 103 and further inversely orthogonally transformed by the inverse transform unit 104. A decoded signal is generated by adding a prediction signal to the conversion result. The decoded signal is subjected to noise removal processing by the distortion removal filter 105 and input to the prediction unit 106 as a reference decoded signal. The prediction unit 106 generates a prediction signal for encoding the next video signal by motion search or the like.

これまで，画像符号化への応用では，離散コサイン変換（ＤＣＴ）を始めとして，重複直交変換離散ウェーブレット変換（ＤＷＴ：Discrete Wavelet Transform）といった多くの変換符号化方式が検討されてきた。例えば，変換符号化として，ＪＰＥＧでは，離散コサイン変換（ＤＣＴ），ＪＰＥＧ２０００では，離散ウェーブレット変換（ＤＷＴ）が採用されている。また，直交変換は完備な基底（complete basis）を用いるため，変換前後のデータ数が不変である。このため，直交変換は非冗長変換（non-redundant transform ）である。動画像符号化装置においては，図７における変換部１０１が上記の技術に該当する。 Up to now, in application to image coding, many transform coding schemes such as discrete cosine transform (DCT) and overlapping orthogonal transform discrete wavelet transform (DWT) have been studied. For example, as transform coding, JPEG employs discrete cosine transform (DCT), and JPEG 2000 employs discrete wavelet transform (DWT). In addition, since orthogonal transformation uses a complete basis, the number of data before and after the transformation is unchanged. For this reason, the orthogonal transform is a non-redundant transform. In the moving image encoding apparatus, the conversion unit 101 in FIG. 7 corresponds to the above technique.

一方で，基底数が原信号のサンプル数よりも多い過完備な基底（overcomplete basis）を用いた冗長変換（redudant transform）と呼ばれる変換がある。このため，冗長変換は直交変換になり得ないが，変換後のデータに冗長性を持たせることで非冗長変換では実現できない特性をもつことができる。例えば，ダウンサンプリング処理を行わないＤＷＴである離散定常ウェーブレット変換（ＳＷＴ：Stationary Wavelet Transform）は変換後の冗長性により，ＤＷＴで失われるシフト不変性を成立させることができる。 On the other hand, there is a transform called a redundant transform using an overcomplete basis in which the number of bases is larger than the number of samples of the original signal. For this reason, the redundant transform cannot be an orthogonal transform, but by giving redundancy to the converted data, it is possible to have characteristics that cannot be realized by the non-redundant transform. For example, a discrete stationary wavelet transform (SWT) that is a DWT that does not perform downsampling processing can establish shift invariance lost in the DWT due to redundancy after the transformation.

また，画像処理分野では，“方向分離特性をもつ変換”が注目されている。このような変換は，一般的に冗長変換であり，代表例としてＣｕｒｖｅｌｅｔ変換がある。並列木複素ウェーブレット変換（ＤＴＣＷＴ：Dual Tree Complex Wavelet Transform ）も同様の特性をもつ変換である。方向分離特性をもつ変換は，画像信号中に含まれるエッジ等の曲線を２次元で定義される方向基底を用いて表現する変換である。方向基底を用いて２次元構造を高い精度で近似するため，ＤＷＴのような方向分離特性の乏しい変換と比較し，雑音除去や特徴抽出に対して，有効である。 In the field of image processing, “conversion with direction separation characteristics” has attracted attention. Such conversion is generally redundant conversion, and a typical example is Curvelet conversion. A parallel tree complex wavelet transform (DTCWT) is a transform having similar characteristics. The conversion having the direction separation characteristic is a conversion in which a curve such as an edge included in the image signal is expressed using a direction base defined in two dimensions. Since the two-dimensional structure is approximated with high accuracy using the direction base, it is more effective for noise removal and feature extraction than a transformation with poor direction separation characteristics such as DWT.

しかし，方向分離特性をもつ変換は，変換後のデータ数が増加するという問題がある。ｘを変換符号化への入力信号，Ψを変換行列とすると，変換により得られる変換係数ｙは，次式のように表わされる。 However, conversion with direction separation characteristics has a problem that the number of data after conversion increases. When x is an input signal to transform coding and Ψ is a transform matrix, a transform coefficient y obtained by the transform is expressed as follows.

ｙ＝Ψｘ (1)
一方，変換係数から信号領域での値を復号する処理は，過完備な基底系からなる変換の逆変換を表す行列をΦとすると，次式のように表わされる。 y = Ψx (1)
On the other hand, the process of decoding the value in the signal domain from the transform coefficient is represented by the following equation, where Φ is a matrix representing the inverse transform of the overcomplete basis set.

ｘ＝Φｙ (2)
ＤＴＣＷＴの場合，ｘがｎ次元ベクトルであるとすると，変換により得られる変換係数ｙは２ｎ次元ベクトルとなる。このため，同変換を画像符号化へ応用する場合，データ数の削減の観点から，変換係数を適切に選択する必要がある。この変換係数の選択は，以下の制約条件付き最小化問題として定式化できる。 x = Φy (2)
In the case of DTCWT, if x is an n-dimensional vector, the conversion coefficient y obtained by the conversion is a 2n-dimensional vector. For this reason, when the transformation is applied to image coding, it is necessary to appropriately select a transformation coefficient from the viewpoint of reducing the number of data. This selection of transform coefficients can be formulated as the following constrained minimization problem.

ｍｉｎ_y‖ｙ‖₀ ｓｕｂｊｅｃｔｔｏ Φｙ＝ｘ (3)
ここで，‖・‖₀は，Ｌ⁰ノルムであり，非ゼロ係数の個数を表している。上記の制約条件付きの最小化問題は，ラグランジュの未定乗数法により以下の最小化問題に帰着される。 min _y ‖y‖ ₀ subject to Φy = x (3)
Here, ‖ · ‖ ₀ is the L ⁰ norm and represents the number of non-zero coefficients. The above minimization problem with constraints is reduced to the following minimization problem by Lagrange's undetermined multiplier method.

ｍｉｎ_ｙ ‖ｙ‖_０＋λ‖Φｙ−ｘ‖_２ ² (4)
ここでλは，外部から与えられる重みパラメータである。第一項は，選択された変換係数の個数であり，変換係数の情報量を近似した値である。第二項は変換係数の選択に伴う再構成誤差を表しており，符号化歪みを表している。‖・‖_２ ² は，Ｌ² ノルムの二乗値であり，二乗和を表す。しかし，上記の最小化問題はＮＰ困難であるため，従来，以下のようなＬ ¹ ノルムに最小化問題として近似する方法がとられてきた。 min _y ‖y‖ ₀ + λ‖Φy-x‖ ₂ ² (4)
Here, λ is a weight parameter given from the outside. The first term is the number of selected transform coefficients, which is an approximate value of the information amount of transform coefficients. The second term represents the reconstruction error associated with the selection of the transform coefficient and represents the coding distortion. ‖ · ‖ ₂ ² is the square value of the L ² norm and represents the sum of squares. However, since the above minimization problem is difficult to NP, conventionally, the following L ^{1 is used.} A method of approximating the norm as a minimization problem has been taken.

ｍｉｎ_ｙ ‖ｙ‖_１＋λ‖Φｙ−ｘ‖_２ ² (5)
ここで，‖・‖は，Ｌ ¹ ノルムであり，ベクトルの要素の絶対値和を表している。 min _y ‖y‖ ₁ + λ‖Φy−x‖ ₂ ² (5)
Where ‖ and ‖ are L ¹ Norm, which represents the sum of absolute values of vector elements.

式(5) の最小化問題の準最適解を与える手法として，交互射影法と呼ばれる手法が提案されている（非特許文献１参照）。 As a method for giving a sub-optimal solution of the minimization problem of Equation (5), a method called an alternate projection method has been proposed (see Non-Patent Document 1).

式(3) は，以下の条件付最小化問題 Equation (3) expresses the following conditional minimization problem

と等価であることに着目すると，上式はｘとのユークリッド距離を最小化するＫ個の非ゼロ係数を持つｙを探していると解釈することができる。上式の最小化を実現する解を求めるため，式(2) の解としての集合（アフィン集合）Ｓ（Φ，ｘ）とＣ（Ｋ）の間の交互直交射影を繰り返す。Ｃ（Ｋ）からＳ（Φ，ｘ）への直交射影は， The above equation can be interpreted as looking for y having K non-zero coefficients that minimizes the Euclidean distance from x. In order to find a solution that realizes the minimization of the above equation, the alternating orthogonal projection between the set (affine set) S (Φ, x) and C (K) as the solution of Equation (2) is repeated. The orthogonal projection from C (K) to S (Φ, x) is

で与えられる。ここで，ｙ_LSは二乗誤差最小化の規範の基づき，
ｙ_LS＝Φ^T［ΦΦ^T］^-1ｘ
により求められた係数ベクトルである。また，Ｓ（Φ，ｘ）からＣ（Ｋ）への直交射影は，ｈｔ（ｙ，Ｋ）により与えられる。ここで，ｈｔ（ｙ，Ｋ）は，Ｃ（Ｋ）へのユークリッド距離を最小にする射影であり，最も大きい値からＫ個の係数を選択する操作に相当する。 Given in. Where y _LS is based on the norm of square error minimization,
y _LS = Φ ^T [ΦΦ ^T ] ⁻¹ x
Is a coefficient vector obtained by The orthogonal projection from S (Φ, x) to C (K) is given by ht (y, K). Here, ht (y, K) is a projection that minimizes the Euclidean distance to C (K), and corresponds to an operation of selecting K coefficients from the largest value.

係数の初期ベクトルをｙ⁽⁰⁾とおき，次の漸化式に従い，繰り返し処理を行う。 The initial vector of coefficients is set to y ⁽⁰⁾ , and iterative processing is performed according to the following recurrence formula.

なお，この繰り返し処理は，以下の終了条件を満たすまで繰り返す。 This iterative process is repeated until the following termination condition is satisfied.

‖ｙ⁽ⁿ⁺¹⁾−ｙ⁽ⁿ⁾‖₂＜ε
このような過完備系を用いた変換は，画像信号にｓｐａｒｓｅｎｅｓｓを仮定し，少数の変換係数で画像を表現するｓｐａｒｓｅｒｅｐｒｅｓｅｎｔａｔｉｏｎの一種である。 ^{^{‖Y (n + 1) -y (}} n) || _{2 <ε}
Such conversion using an overcomplete system is a kind of sparse representation in which an image signal is assumed to be sparseness and an image is expressed with a small number of conversion coefficients.

L. Mancera and J. Portilla, “ L0-norm-based sparse representation through alternate projections”，IEEE ICIP, Oct., 2006.L. Mancera and J. Portilla, “L0-norm-based sparse representation through alternate projections”, IEEE ICIP, Oct., 2006.

画像信号は空間的な局所性を有しており，その性質は一様ではない。つまり，画像信号を一定の精度で表現するために必要な変換係数の個数，あるいは，一定の個数の変換係数で表現可能な画像信号の表現精度は，画像内の領域の性質に応じて変動する。しかし，交互射影法は，画像全体に対して適用されている。このため，画像内の局所性を考慮できておらず，符号化効率の向上に改善の余地を残す。 Image signals have spatial locality, and their properties are not uniform. In other words, the number of transform coefficients necessary to represent an image signal with a certain accuracy, or the representation accuracy of an image signal that can be represented with a certain number of transform coefficients varies depending on the nature of the region in the image. . However, the alternating projection method is applied to the entire image. For this reason, locality in the image cannot be considered, and there is room for improvement in improving the coding efficiency.

本発明はかかる事情に鑑みてなされたものであって，過完備な基底を用いた冗長変換において，変換係数間の冗長性を除去する交互射影法において，符号化効率の向上を実現する画像分割手法に基づく画像符号化方法を確立することを目的とする。 The present invention has been made in view of such circumstances, and is an image segmentation that realizes an improvement in coding efficiency in an alternate projection method that eliminates redundancy between transform coefficients in a redundant transform using an overcomplete basis. It aims at establishing the image coding method based on a technique.

本発明は，上記課題を解決するため，入力された映像信号に対して，冗長系の変換基底を用いた変換を行い，得られた変換係数に対して，符号化に用いる係数の絞り込みを行う画像符号化において，画像の局所的な性質に応じて，画面内の係数総数が一定（全分割領域の係数総数が一定）という条件下で全領域の歪み量の総和を最小化するための領域分割，および，各分割領域に対する変換係数の選択を行うことを特徴とする。 In order to solve the above-described problems, the present invention performs a conversion using a redundant conversion base on an input video signal, and narrows down the coefficients used for encoding for the obtained conversion coefficients. In image coding, an area for minimizing the total sum of distortion in all areas under the condition that the total number of coefficients in the screen is constant (the total number of coefficients in all divided areas is constant) according to the local properties of the image. It is characterized in that the division and the conversion coefficient for each divided area are selected.

本発明は，画面内を性質の類似した領域に分割し，分割領域ごとにコストが小さくなる変換係数の選択を行うことにより，符号化効率を向上させることができる。 The present invention can improve coding efficiency by dividing the screen into regions of similar nature and selecting transform coefficients that reduce the cost for each divided region.

また，本発明は，入力された映像信号に対して，冗長系の変換基底を用いた変換を行い，得られた変換係数に対して，符号化に用いる係数の絞り込みを行う画像符号化において，画像の局所的な性質に応じて，画面内の係数総数が一定（全分割領域の係数総数が一定）という条件下で全領域の歪み量の総和と，分割領域の境界部に接する所定幅の二つの境界領域における変換係数の絞り込みに基づく近似誤差の乖離度との重み付き和を最小化するための領域分割，および，各分割領域に対する変換係数の選択を行うことを特徴とする。 In addition, the present invention performs the conversion using the redundant conversion base on the input video signal and narrows down the coefficients used for the encoding on the obtained conversion coefficients. Depending on the local nature of the image, the total number of distortions in the entire area and a predetermined width in contact with the boundary of the divided area under the condition that the total number of coefficients in the screen is constant (the total number of coefficients in all the divided areas is constant) . It is characterized in that region division for minimizing the weighted sum with the degree of deviation of the approximation error based on narrowing down of the transformation coefficients in the two boundary regions , and selection of transformation coefficients for each divided region are performed.

また，本発明は，上記発明において，分割領域の係数の個数を，同分割領域の画素数に応じて設定することを特徴とする。 Further, the present invention is characterized in that, in the above invention, the number of coefficients of the divided area is set according to the number of pixels of the divided area.

また，本発明は，上記発明において，選択された領域分割における各分割領域に対して，所定の分割の終了条件が満たされるまで，領域分割および変換係数を選択する処理を再帰的に繰り返すことを特徴とする。分割の終了条件は，例えば一方の分割領域の画素数が所定の閾値以下となること，または一方の分割領域の変換係数の個数が所定の閾値以下となることである。また，分割を行わないほうが最適な場合も分割の終了条件となる。 Also, the present invention recursively repeats the process of selecting area division and transform coefficients until a predetermined division termination condition is satisfied for each divided area in the selected area division. Features. The condition for terminating the division is, for example, that the number of pixels in one divided area is equal to or smaller than a predetermined threshold value, or that the number of transform coefficients in one divided area is equal to or smaller than the predetermined threshold value. In addition, when it is optimal not to divide, it becomes a condition for ending the division.

本発明により，画像の局所性を考慮して係数選択を行うことで，交互射影法におけるエネルギーコンパクションを向上させ，特定成分への情報の集約を図ることで，符号化効率の向上を実現することができる。 According to the present invention, by selecting a coefficient in consideration of the locality of an image, energy compaction in the alternate projection method is improved, and information is integrated into a specific component, thereby improving encoding efficiency. Can do.

本発明の実施形態に係る画像符号化装置の構成例を示す図である。It is a figure which shows the structural example of the image coding apparatus which concerns on embodiment of this invention. 係数選択処理部が実行する係数選択処理フローを示す図である。It is a figure which shows the coefficient selection process flow which a coefficient selection process part performs. 最適分割処理部が実行する最適分割処理フローを示す図である。It is a figure which shows the optimal division process flow which an optimal division process part performs. 最適分割処理部が実行する最適分割処理フローを示す図である。It is a figure which shows the optimal division process flow which an optimal division process part performs. コスト算出部が実行するコスト関数算出処理フローを示す図である。It is a figure which shows the cost function calculation process flow which a cost calculation part performs. ソフトウェアプログラムより実現するときのシステム構成図である。It is a system block diagram when implement | achieving from a software program. 一般的な符号化装置の構成図である。It is a block diagram of a general encoding apparatus.

以下，本発明の実施の形態について，図面を用いて説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

本発明の基本方針は，画面内を性質の類似した領域に分割し，分割領域ごとにｓｐａｒｓｅｒｅｐｒｅｓｅｎｔａｔｉｏｎを適用することである。以下，４種類の処理を示す。 The basic policy of the present invention is to divide the screen into regions having similar properties and apply sparse representation to each divided region. Hereinafter, four types of processing are shown.

［領域分割方法１］は，画面内の係数総数が一定（全分割領域の係数総数が一定）という条件下で全領域の歪み量の総和を最小化するための分割，およびパラメータ選択方法を示す。 [Area division method 1] shows a division and parameter selection method for minimizing the total distortion amount in all areas under the condition that the total number of coefficients in the screen is constant (the total number of coefficients in all divided areas is constant). .

［領域分割方法２］は，各分割領域において画素当りの係数の個数が一定という条件下で，各領域の歪み量を最小化するための分割，およびパラメータ選択方法を示す。 [Region division method 2] indicates a division and parameter selection method for minimizing the amount of distortion in each region under the condition that the number of coefficients per pixel is constant in each division region.

［領域分割方法３］は，［領域分割方法１］におけるコスト関数に対して領域境界の不連続性の影響を加味した修正を行ったものである。 [Area division method 3] is a modification of the cost function in [Area division method 1] that takes into account the effect of discontinuity of the area boundary.

同様に，［領域分割方法４］は，［領域分割方法２］におけるコスト関数に対して領域境界の不連続性の影響を加味した修正を行ったものである。 Similarly, [Area division method 4] is a modification of the cost function in [Area division method 2] that takes into account the influence of discontinuity of the area boundary.

画像信号の存在領域を０≦ｓ≦Ｗ，０≦ｔ≦Ｈとし，同領域内の画素値を一次元ベクトルとして並べ替えたものをｘとする。さらに，同領域内から，左上角の座標値を（ｓ₀，ｔ₀），水平方向・垂直方向の辺長をｗ，ｈとする矩形領域内の画素値を抽出し，一次元ベクトルとして並べ替えたベクトルをｘ（ｓ₀，ｔ₀，ｗ，ｈ）とする。 Assume that an image signal existence area is 0 ≦ s ≦ W and 0 ≦ t ≦ H, and pixel values in the area are rearranged as a one-dimensional vector. Further, from the same area, the pixel values in the rectangular area having the upper left corner coordinate value (s ₀ , t ₀ ) and the horizontal and vertical side lengths w and h are extracted and arranged as a one-dimensional vector. Let the changed vector be x (s ₀ , t ₀ , w, h).

ｘ（ｓ₀，ｔ₀，ｗ，ｈ）に対して，交互射影法を用いてｋ個の変換係数｛ｃ₀，…，ｃ_k-1｝を選択し，選択された基底信号を用いてｘ（ｓ₀，ｔ₀，ｗ，ｈ）を表現した場合の近似誤差をＥ［ｘ，ｓ₀，ｔ₀，ｗ，ｈ，ｋ，Ｃ］とおく。ここで，Ｃは，ｋ個の変換係数Ｃ＝｛ｃ₀，…，ｃ_k-1｝を表す。例えば，ＤＴＣＷＴの場合，総数２ｎ個の係数の中からｋ個の係数を選択したことになり，この選択に交互射影法を用いることができる。 For x (s ₀ , t ₀ , w, h), k transform coefficients {c ₀ ,..., c _k−1 } are selected using the alternate projection method, and the selected base signal is used. An approximation error when x (s ₀ , t ₀ , w, h) is expressed as E [x, s ₀ , t ₀ , w, h, k, C]. Here, C represents k conversion coefficients C = {c ₀ ,..., C _k−1 }. For example, in the case of DTCWT, k coefficients are selected from a total of 2n coefficients, and an alternate projection method can be used for this selection.

［領域分割方法１］
画面内の係数総数が一定（全分割領域の係数総数が一定）という条件下で全領域の歪み量の総和を最小化するための分割を考える。さらに，垂直に２分割する場合と水平に２分割する場合の結果を比較し，上記歪み量最小化の規範に基づき，垂直分割・水平分割のいずれかを選択する。 [Area Division Method 1]
Consider a division for minimizing the total distortion amount in all areas under the condition that the total number of coefficients in the screen is constant (the total number of coefficients in all divided areas is constant). Further, the results of dividing vertically into two and horizontally dividing are compared, and either vertical division or horizontal division is selected on the basis of the above-described distortion amount minimization rule.

歪み量の尺度として次の近似誤差を用いる。 The following approximation error is used as a measure of the amount of distortion.

Ξ_hor［ｘ，０，ｈ，Ｗ，Ｈ，ｋ_h，ｋ−ｋ_h，Ｃ_u，Ｃ_d］＝Ｅ［ｘ，０，０，Ｗ，ｈ，ｋ_h，Ｃ_d］＋Ｅ［ｘ，０，ｈ，Ｗ，Ｈ−ｈ，ｋ−ｋ_h，Ｃ_u］
上式は，領域０≦ｓ≦Ｗ，０≦ｔ≦Ｈを，上側領域０≦ｓ≦Ｗ，０≦ｔ≦ｈ−１と，下側領域０≦ｓ≦Ｗ，ｈ≦ｔ≦Ｈに分割し，上側領域および下側領域を各々，ｋ_h個の係数およびｋ−ｋ_h個の係数で表現した場合の近似誤差和を表している。 _{Ξ hor [x, 0, h} , W, H, k h, k-k h, C u, C d] = E [x, 0,0, W, h, k h, C d] + E [x, 0, h, W, H- h, k-k h, C u]
The above formula changes the region 0 ≦ s ≦ W, 0 ≦ t ≦ H to the upper region 0 ≦ s ≦ W, 0 ≦ t ≦ h−1, and the lower region 0 ≦ s ≦ W, h ≦ t ≦ H. divided, respectively upper region and a lower region, represents an approximation error sum when expressed in k _h number of coefficients and k-k _h number of coefficients.

Ξ_ver［ｘ，ｗ，０，Ｗ，Ｈ，ｋ_w，ｋ−ｋ_w，Ｃ_l，Ｃ_r］＝Ｅ［ｘ，０，０，ｗ，Ｈ，ｋ_w，Ｃ_l］＋Ｅ［ｘ，ｗ，０，Ｗ−ｗ，Ｈ，ｋ−ｋ_w，Ｃ_r］
上式は，領域０≦ｓ≦Ｗ，０≦ｔ≦Ｈを，左側領域０≦ｓ≦ｗ−１，０≦ｔ≦Ｈと，右側領域ｗ≦ｓ≦Ｗ，０≦ｔ≦Ｈに分割し，左側領域および右側領域を各々，ｋ_w個の係数およびｋ−ｋ_w個の係数で表現した場合の近似誤差和を表している。 Ξ _ver [x, w, 0, W, H, k _w , k−k _w , C _l , C _r ] = E [x, 0, 0, w, H, k _w , C _l ] + E [x, w, 0, W−w, H, k−k _w , C _r ]
The above expression divides the region 0 ≦ s ≦ W and 0 ≦ t ≦ H into the left region 0 ≦ s ≦ w−1, 0 ≦ t ≦ H and the right region w ≦ s ≦ W, 0 ≦ t ≦ H. and, respectively left area and right area, it represents an approximation error sum when expressed in k _w pieces of coefficients and k-k _w pieces of coefficients.

分割後の２つの領域での変換係数の総和をｋに保つ条件下で，垂直方向に２分割する場合，次式に基づき分割位置，および分割後の各領域へ割り当てられる係数の個数を決定する。垂直方向に２分割する場合の分割位置ｈ₀，および上側の分割領域で用いる変換係数の個数ｋ_0,hは，次式で与えられる（下側の分割領域で用いる変換係数の個数はｋ−ｋ_0,hとなる）。 When dividing into two in the vertical direction under the condition that the sum of transform coefficients in the two areas after division is kept at k, the division position and the number of coefficients assigned to each area after division are determined based on the following equations: . The division position h ₀ when dividing into two in the vertical direction and the number k _{0, h} of transform coefficients used in the upper divided area are given by the following equations (the number of transform coefficients used in the lower divided area is k− k _{0, h} ).

分割後の２つの領域での変換係数の総和をｋに保つ条件下で，水平方向に２分割する場合，次式に基づき分割位置，および分割後の各領域へ割り当てられる係数の個数を決定する。水平方向に２分割する場合の分割位置ｗ₀，および左側の分割領域で用いる変換係数の個数ｋ_0,wは次式で与えられる（なお，右側の分割領域で用いる変換係数の個数は，ｋ−ｋ_0,wとなる）。 When dividing horizontally into two under the condition that the sum of transform coefficients in the two areas after division is kept at k, the division position and the number of coefficients assigned to each area after division are determined based on the following equations: . The division position w ₀ in the case of dividing into two in the horizontal direction and the number k _{0, w} of transform coefficients used in the left divided area are given by the following equations (note that the number of transform coefficients used in the right divided area is k −k _{0, w} ).

このとき，
Ξ_hor［ｘ，０，ｈ₀，Ｗ，Ｈ，ｋ_0,h，ｋ−ｋ_0,h，Ｃ_u，Ｃ_d］
≦Ξ_ver［ｘ，ｗ₀，０，Ｗ，Ｈ，ｋ_0,w，ｋ−ｋ_0,w，Ｃ_l，Ｃ_r］
となる場合には，ｔ＝ｈ₀の位置で垂直方向に分割を行うこととし，それ以外の場合には，ｓ＝ｗ₀の位置で水平方向に分割を行うこととする。なお，ｈ₀＝０あるいはｗ₀＝０となる場合，分割を行わないほうが最適であるので分割は行われない。 At this time,
Hor _hor [x, 0, h ₀ , W, H, k _{0, h} , k−k _{0, h} , C _u , C _d ]
≦ Ξ _ver [x, w ₀ , 0, W, H, k _{0, w} , k−k _{0, w} , C _l , C _r ]
In such a case, division is performed in the vertical direction at the position t = h ₀ , and in other cases, division is performed in the horizontal direction at the position s = w ₀ . Note that when h ₀ = 0 or w ₀ = 0, it is more optimal not to perform the division, so the division is not performed.

上式は，分割位置および各分割領域へ割り当てられる変換係数の個数を求めている。 The above equation finds the division position and the number of transform coefficients assigned to each division area.

分割により得られた矩形領域の中で近似誤差が最大の領域（分割対象領域）に対して，分割対象領域に含まれる係数の個数を上式におけるｋとみなして，上記と同様の分割処理を施す。選択した領域に対して分割が行われなかった場合には，先に選択された領域の次に近似誤差が大きな領域に対して，上記と同様の分割処理を施す。この繰り返しは，全ての分割領域に対して分割が行われなくなるまで続ける。または，分割領域内の画素数の下限値を設定しておき，分割の結果，この下限値を下回るような領域は生成されないように制限を設けることも可能である。 For the region with the largest approximation error (division target region) among the rectangular regions obtained by the division, the number of coefficients included in the division target region is regarded as k in the above equation, and the same division process as above is performed. Apply. If the selected area has not been divided, the same dividing process as described above is performed on the area having the next largest approximation error after the previously selected area. This repetition is continued until all the divided areas are not divided. Alternatively, it is possible to set a lower limit value for the number of pixels in the divided area and to limit generation so that an area below the lower limit value is not generated as a result of the division.

［領域分割方法２］
２つの分割領域において画素当りの係数の個数が一定という条件下で，各領域の歪み量を最小化するための分割を考える。さらに，垂直に２分割する場合と水平に２分割する場合の結果を比較し，上記加重和最小化の規範に基づき，垂直分割・水平分割のいずれかを選択する。 [Area division method 2]
Consider a division for minimizing the amount of distortion in each region under the condition that the number of coefficients per pixel is constant in the two divided regions. Further, the results of dividing vertically into two and horizontally dividing are compared, and either vertical division or horizontal division is selected based on the above weighted sum minimization rule.

前記［領域分割方法１］との違いは，分割領域への係数割当てを分割領域内の画素数に応じて設定している点にある。このため，近似誤差最小化という観点から見た場合，［領域分割方法１］で求まる解を最適解とすれば，本方法で求まる方法は準最適解であるが，解を求める演算コストを低減することができる。 The difference from the [region dividing method 1] is that the coefficient allocation to the divided region is set according to the number of pixels in the divided region. For this reason, when viewed from the viewpoint of minimizing approximate error, if the solution obtained by [Area Division Method 1] is the optimal solution, the method obtained by this method is a suboptimal solution, but the calculation cost for obtaining the solution is reduced. can do.

この式中，Ｌ字型の記号と左右反転Ｌ字型の記号で挟まれる値（ｋ・ｈ／Ｈ）は，その値を超えない最大の整数を表す（以下，同様）。なお，以下の説明では，int(ｘ) とも表記する。すなわち，int(ｘ) は，実数ｘを超えない最大の整数を意味する。 In this formula, the value (k · h / H) between the L-shaped symbol and the left-right inverted L-shaped symbol represents the maximum integer that does not exceed the value (hereinafter the same). In the following description, it is also expressed as int (x). That is, int (x) means the largest integer that does not exceed the real number x.

上式は，領域０≦ｓ≦Ｗ，０≦ｔ≦Ｈを，上側領域０≦ｓ≦Ｗ，０≦ｔ≦ｈ−１と，下側領域０≦ｓ≦Ｗ，ｈ≦ｔ≦Ｈに分割し，上側領域および下側領域を各々，int(ｋ・ｈ／Ｈ) 個の係数およびｋ−int(ｋ・ｈ／Ｈ) 個の係数で表現した場合の近似誤差和を表している。 The above formula changes the region 0 ≦ s ≦ W, 0 ≦ t ≦ H to the upper region 0 ≦ s ≦ W, 0 ≦ t ≦ h−1, and the lower region 0 ≦ s ≦ W, h ≦ t ≦ H. This represents an approximate error sum when the upper region and the lower region are expressed by int (k · h / H) coefficients and k−int (k · h / H) coefficients, respectively.

上式は，領域０≦ｓ≦Ｗ，０≦ｔ≦Ｈを，左側領域０≦ｓ≦ｗ−１，０≦ｔ≦Ｈと，右側領域ｗ≦ｓ≦Ｗ，０≦ｔ≦Ｈに分割し，左側領域および右側領域を各々，int(ｋ・ｗ／Ｗ) 個の係数およびｋ−int(ｋ・ｗ／Ｗ) 個の係数で表現した場合の近似誤差和を表している。 The above expression divides the region 0 ≦ s ≦ W and 0 ≦ t ≦ H into the left region 0 ≦ s ≦ w−1, 0 ≦ t ≦ H and the right region w ≦ s ≦ W, 0 ≦ t ≦ H. The left side region and the right side region represent the approximate error sum when expressed by int (k · w / W) coefficients and k-int (k · w / W) coefficients, respectively.

分割後の２つの領域での変換係数の総和をｋに保つ条件下で，垂直方向に２分割する場合，次式に基づき分割位置を決定する。次式では，領域の画素数に応じて係数を配分するアプローチをとっている。 When dividing into two in the vertical direction under the condition that the sum of the transform coefficients in the two regions after division is kept at k, the division position is determined based on the following equation. The following equation takes an approach that allocates coefficients according to the number of pixels in the area.

分割後の２つの領域での変換係数の総和をｋに保つ条件下で，水平方向に２分割する場合，次式に基づき分割位置を決定する。次式では，領域の画素数に応じて係数を配分するアプローチをとっている。 When dividing into two in the horizontal direction under the condition that the sum of the transform coefficients in the two regions after division is kept at k, the division position is determined based on the following equation. The following equation takes an approach that allocates coefficients according to the number of pixels in the area.

ここで，Ｍ_h，Ｍ_wは最小変更幅として予め与えられる自然数である。このとき， Here, M _h and M _w are natural numbers given in advance as the minimum change width. At this time,

となる場合には，ｔ＝ｈ₀の位置で垂直方向に分割を行うこととし，それ以外の場合には，ｓ＝ｗ₀の位置で水平方向に分割を行うこととする。なお，ｈ₀＝０あるいはｗ₀＝０となる場合，分割を行わないほうが最適であるので分割は行われない。 In such a case, division is performed in the vertical direction at the position t = h ₀ , and in other cases, division is performed in the horizontal direction at the position s = w ₀ . Note that when h ₀ = 0 or w ₀ = 0, it is more optimal not to perform the division, so the division is not performed.

上式は，領域の画素数に応じて係数を配分する枠組みにおいて，最適な分割位置を求めている。 The above equation finds the optimal division position in the framework of allocating coefficients according to the number of pixels in the area.

［領域分割方法３］
画面内の係数総数が一定（全分割領域の係数総数が一定）という条件下で分割後の歪み量の総和および分割境界の不連続性の評価尺度（後述）の加重和（分割コストと呼ぶ）を最小化する分割を考える。さらに，垂直に２分割する場合と水平に２分割する場合の結果を比較し，上記分割コスト最小化の規範に基づき，垂直分割・水平分割のいずれかを選択する。 [Region division method 3]
Under the condition that the total number of coefficients in the screen is constant (the total number of coefficients in all the divided areas is constant), the sum of the distortion amounts after division and the weighted sum of the evaluation criteria for discontinuity of the division boundary (described later) (called division cost) Consider a partition that minimizes. Further, the result of dividing vertically into two and horizontally dividing into two is compared, and either vertical division or horizontal division is selected based on the above-mentioned norm of division cost minimization.

上式は，領域０≦ｓ≦Ｗ，０≦ｔ≦Ｈを，上側領域０≦ｓ≦Ｗ，０≦ｔ≦ｈ−１と，下側領域０≦ｓ≦Ｗ，ｈ≦ｔ≦Ｈに分割し，上側領域および下側領域を各々，ｋ_h個の係数およびｋ−ｋ_h個の係数で表現した場合の近似誤差和を表している。 The above formula changes the region 0 ≦ s ≦ W, 0 ≦ t ≦ H to the upper region 0 ≦ s ≦ W, 0 ≦ t ≦ h−1, and the lower region 0 ≦ s ≦ W, h ≦ t ≦ H. divided, respectively upper region and a lower region, represents an approximation error sum when expressed in k _h number of coefficients and k-k _h number of coefficients.

上式は，領域０≦ｓ≦Ｗ，０≦ｔ≦Ｈを，左側領域０≦ｓ≦ｗ−１，０≦ｔ≦Ｈと，右側領域ｗ≦ｓ≦Ｗ，０≦ｔ≦Ｈに分割し，左側領域および右側領域を各々，ｋ_w個の係数およびｋ−ｋ_w個の係数で表現した場合の近似誤差和を表している。 The above expression divides the region 0 ≦ s ≦ W and 0 ≦ t ≦ H into the left region 0 ≦ s ≦ w−1, 0 ≦ t ≦ H and the right region w ≦ s ≦ W, 0 ≦ t ≦ H. and, respectively left area and right area, it represents an approximation error sum when expressed in k _w pieces of coefficients and k-k _w pieces of coefficients.

領域分割の尺度として前述の有意係数の個数および分割境界の不連続性の評価尺度（不連続尺度）を用いる。不連続尺度は，以下の境界領域に対して定義する。上側と下側の２つの領域に分割する垂直分割の場合，上側領域（ｋ_h個の係数Ｃ_uで表現）内の下部０≦ｓ≦Ｗ，ｈ−Ｌ≦ｔ≦ｈ−１と下側領域（ｋ−ｋ_h個の係数Ｃ_dで表現）内の上部０≦ｓ≦Ｗ，ｈ≦ｔ≦ｈ＋Ｌからなる矩形領域０≦ｓ≦Ｗ，ｈ−Ｌ≦ｔ≦ｈ＋Ｌを対象とし，同矩形領域に対する不連続尺度を次のように表す。 The number of significant coefficients and the evaluation measure (discontinuity measure) of the discontinuity of the dividing boundary are used as a measure of region division. The discontinuity scale is defined for the following boundary regions: For vertical division that divides into two regions of the upper and lower, the lower 0 ≦ s ≦ W in the upper region (expressed in k _h pieces of coefficient _{C u), h-L ≦} t ≦ h-1 and the lower side A rectangular region 0 ≦ s ≦ W, h−L ≦ t ≦ h + L consisting of the upper part 0 ≦ s ≦ W and h ≦ t ≦ h + L in the region (expressed by k−k _h coefficients C _d ) The discontinuity scale for a rectangular area is expressed as follows.

ここで，第２，３引数は，分割前後の領域に対する水平方向の存在領域０≦ｓ≦Ｗの下限・上限を示している。第４，５引数は，境界領域に対する垂直方向の存在領域の下限ｈ−Ｌおよび同領域の垂直方向の幅２Ｌ＋１を示している。第６，７引数ｋ_h，ｋ−ｋ_hは，上側領域および下側領域における係数の個数である。第８，９引数Ｃ_u，Ｃ_dは，上側領域および下側領域における係数である。 Here, the second and third arguments indicate the lower and upper limits of the horizontal existence area 0 ≦ s ≦ W with respect to the areas before and after the division. The fourth and fifth arguments indicate the lower limit h−L of the existence region in the vertical direction with respect to the boundary region and the vertical width 2L + 1 of the region. The 6,7 argument k _{_h,} k-k _h is the number of coefficients in the upper region and a lower region. The eighth and ninth arguments C _u and C _d are coefficients in the upper region and the lower region.

左側と右側の２つの領域に分割する水平分割の場合，左側領域（ｋ_w個の係数Ｃ_lで表現）内の右端部ｗ−Ｌ≦ｓ≦ｗ−１，０≦ｔ≦Ｈと右側領域（ｋ−ｋ_w個の係数Ｃ_rで表現）内の左端部ｗ≦ｓ≦ｗ＋Ｌ，０≦ｔ≦Ｈからなる矩形領領域ｗ−Ｌ≦ｓ≦ｗ＋Ｌ，０≦ｔ≦Ｈを対象とし，同矩形領域に対する不連続尺度を次のように表す。 In the case of horizontal division into two areas on the left and right, the right end area w−L ≦ s ≦ w−1, 0 ≦ t ≦ H and the right area in the left area (represented by k _w coefficients C ₁ ) A rectangular region w-L≤s≤w + L, 0≤t≤H consisting of left end w≤s≤w + L, 0≤t≤H in (represented by k-k _w coefficients _Cr ), The discontinuity scale for the rectangular area is expressed as follows.

ここで，第２，３引数は，分割前後の領域に対する垂直方向の存在領域０≦ｔ≦Ｈの下限・上限を示している。第４，５引数は，境界領域に対する水平方向の存在領域の下限ｗ−Ｌおよび同領域の水平方向の幅２Ｌ＋１を示している。第６，７引数ｋ_w，ｋ−ｋ_wは，左側領域および右側領域における係数の個数である。第８，９引数Ｃ_l，Ｃ_rは，左側領域および右側領域における係数である。 Here, the second and third arguments indicate the lower and upper limits of the existence area 0 ≦ t ≦ H in the vertical direction with respect to the areas before and after the division. The fourth and fifth arguments indicate the lower limit w−L of the horizontal existence area with respect to the boundary area and the horizontal width 2L + 1 of the same area. The sixth and seventh arguments k _w and k−k _w are the numbers of coefficients in the left region and the right region. The eighth and ninth arguments C _l and C _r are coefficients in the left region and the right region.

不連続尺度としては，例えば，近似誤差の乖離度を用いる。これは，垂直分割の場合，上側領域（ｋ_ｈ個の係数Ｃ_ｕで表現）内の下部０≦ｓ≦Ｗ，ｈ−Ｌ≦ｔ≦ｈ−１における近似誤差と，下側領域（ｋ−ｋ_ｈ個の係数Ｃ_ｄで表現）内の上部０≦ｓ≦Ｗ，ｈ≦ｔ≦ｈ＋Ｌにおける近似誤差の差分値であり，水平分割の場合，左側領域（ｋ_ｗ個の係数Ｃ_ｌで表現）内の右端部ｗ−Ｌ≦ｓ≦ｗ−１，０≦ｔ≦Ｈにおける近似誤差と，右側領域（ｋ−ｋ_ｗ個の係数Ｃ_ｒで表現）内の左端部ｗ≦ｓ≦ｗ＋Ｌ，０≦ｔ≦Ｈにおける近似誤差との差分値である。あるいは，上記の近似誤差の差分値に対して，一定の重みを乗じた値を用いることも可能である。
As the discontinuity scale, for example, the deviation degree of the approximation error is used. This is because, in the case of vertical division, lower 0 ≦ s ≦ W in the upper region (expressed in _{k h} pieces of coefficient _{C u),} the approximation error in h-L ≦ t ≦ h- 1, the lower region (k- The difference value of the approximation error in the upper part 0 ≦ s ≦ W and h ≦ t ≦ h + L in k _h coefficients C _d ), and in the case of horizontal division, the left region (expressed by k _w coefficients C ₁₎ ) In the right end w−L ≦ s ≦ w−1, 0 ≦ t ≦ H, and the left end w ≦ s ≦ w + L in the right region (represented by k−k _w coefficients C _r ), It is a difference value from the approximation error when 0 ≦ t ≦ H. Alternatively, it is possible to use a value obtained by multiplying the difference value of the approximation error by a constant weight.

また，分割境界領域におけるＳＳＩＭ（参考文献１参照）の符号を反転した値を用いることも可能である。
［参考文献１］：Z. Wang and E. P. Simoncelli, “Translation insensitive image similarity in complex wavelet domain”, IEEE International Conference on Acoustics, Speech & Signal Processing, vol. II, pp. 573-576, Philadelphia, PA, March 2005。 It is also possible to use a value obtained by inverting the sign of SSIM (see Reference 1) in the divided boundary region.
[Reference 1]: Z. Wang and EP Simoncelli, “Translation insensitive image similarity in complex wavelet domain”, IEEE International Conference on Acoustics, Speech & Signal Processing, vol. II, pp. 573-576, Philadelphia, PA, March 2005.

分割後の２つの領域での変換係数の総和をｋに保つ条件下で，垂直方向に２分割する場合，次式に基づき分割位置，および分割後の各領域へ割り当てられる係数の個数を決定する。垂直方向に２分割する場合の分割位置ｈ₀および上側の分割領域で用いる変換係数の個数ｋ_0,hは，次式で与えられる（下側の分割領域で用いる変換係数の個数はｋ−ｋ_0,hとなる）。 When dividing into two in the vertical direction under the condition that the sum of transform coefficients in the two areas after division is kept at k, the division position and the number of coefficients assigned to each area after division are determined based on the following equations: . The dividing position h ₀ and the number k _{0, h} of transform coefficients used in the upper divided area when dividing into two in the vertical direction are given by the following equation (the number of transform coefficients used in the lower divided area is k−k _{0, h} ).

分割後の２つの領域での変換係数の総和をｋに保つ条件下で，水平方向に２分割する場合，次式に基づき分割位置，および分割後の各領域へ割り当てられる係数の個数を決定する。水平方向に２分割する場合の分割位置ｗ₀，および左側の分割領域で用いる変換係数の個数ｋ_0,wは，次式で与えられる（なお，右側の分割領域で用いる変換係数の個数はｋ−ｋ_0,wとなる）。 When dividing horizontally into two under the condition that the sum of transform coefficients in the two areas after division is kept at k, the division position and the number of coefficients assigned to each area after division are determined based on the following equations: . The division position w ₀ in the case of dividing into two in the horizontal direction and the number k _{0, w} of transform coefficients used in the left divided area are given by the following equations (note that the number of transform coefficients used in the right divided area is k −k _{0, w} ).

このとき，
Ξ_hor［ｘ，０，ｈ₀，Ｗ，Ｈ，ｋ_0,h，ｋ−ｋ_0,h，Ｃ_u，Ｃ_d］＋
λΘ_hor［ｘ，０，Ｗ，ｈ₀−Ｌ，２Ｌ＋１，ｋ_0,h，ｋ−ｋ_0,h，Ｃ_u，Ｃ_d］
≦ Ξ_ver［ｘ，ｗ₀，０，Ｗ，Ｈ，ｋ_0,w，ｋ−ｋ_0,w，Ｃ_l，Ｃ_r］＋
λΘ_ver［ｘ，０，Ｈ，ｗ₀−Ｌ，２Ｌ＋１，ｋ_0,w，ｋ−ｋ_0,w，Ｃ_l，Ｃ_r］
となる場合には，ｔ＝ｈ₀の位置で垂直方向に分割を行うこととし，それ以外の場合には，ｓ＝ｗ₀の位置で水平方向に分割を行うこととする。なお，ｈ₀＝０あるいはｗ₀＝０となる場合，分割を行わないほうが最適であるので分割は行われない。 At this time,
Hor _hor [x, 0, h ₀ , W, H, k _{0, h} , k−k _{0, h} , C _u , C _d ] +
λΘ _hor [x, 0, W, h ₀ −L, 2L + 1, k _{0, h} , k−k _{0, h} , C _u , C _d ]
≦ Ξ _ver [x, w ₀ , 0, W, H, k _{0, w} , k−k _{0, w} , C _l , C _r ] +
λΘ _ver [x, 0, H, w ₀ −L, 2L + 1, k _{0, w} , k−k _{0, w} , C _l , C _r ]
In such a case, division is performed in the vertical direction at the position t = h ₀ , and in other cases, division is performed in the horizontal direction at the position s = w ₀ . Note that when h ₀ = 0 or w ₀ = 0, it is more optimal not to perform the division, so the division is not performed.

［領域分割方法４］
２つの分割領域において画素当りの係数の個数が一定という条件下で，分割後の歪み量の総和および分割境界の不連続性の評価尺度（後述）の加重和（分割コストと呼ぶ）を最小化する分割を考える。さらに，垂直に２分割する場合と水平に２分割する場合の結果を比較し，上記加重和最小化の規範に基づき，垂直分割・水平分割のいずれかを選択する。前記［領域分割方法３］との違いは，分割領域への係数割当てを分割領域内の画素数に応じて設定している点にある。このため，分割コスト最小化という観点から見た場合，［領域分割方法３］で求まる解を最適解とすれば，本方法で求まる方法は準最適解であるが，解を求める演算コストを低減することができる。 [Area Division Method 4]
Minimize the weighted sum (referred to as the division cost) of the sum of the amount of distortion after division and the evaluation measure of discontinuity of the division boundary (described later) under the condition that the number of coefficients per pixel is constant in the two division regions Think of the division to be. Further, the results of dividing vertically into two and horizontally dividing are compared, and either vertical division or horizontal division is selected based on the above weighted sum minimization rule. The difference from the [region dividing method 3] is that the coefficient allocation to the divided regions is set according to the number of pixels in the divided regions. Therefore, from the viewpoint of minimizing the division cost, if the solution obtained by [Area division method 3] is the optimum solution, the method obtained by this method is a suboptimal solution, but the calculation cost for obtaining the solution is reduced. can do.

分割境界の不連続尺度として，例えば，分割境界領域におけるＳＳＩＭを用いる。上側領域内の下部０≦ｓ≦Ｗ，ｈ−Ｌ≦ｔ≦ｈ−１と下側領域内の上部０≦ｓ≦Ｗ，ｈ≦ｔ≦ｈ＋Ｌからなる矩形領域０≦ｓ≦Ｗ，ｈ−Ｌ≦ｔ≦ｈ＋Ｌに対する不連続尺度を次のように表す。 As the discontinuity measure of the division boundary, for example, SSIM in the division boundary region is used. A rectangular region 0 ≦ s ≦ W, h− that is composed of a lower portion 0 ≦ s ≦ W, h−L ≦ t ≦ h−1 in the upper region and an upper portion 0 ≦ s ≦ W, h ≦ t ≦ h + L in the lower region. The discontinuity scale for L ≦ t ≦ h + L is expressed as follows.

左側領域内の右端部ｗ−Ｌ≦ｓ≦ｗ−１，０≦ｔ≦Ｈと右側領域内の左端部ｗ≦ｓ≦ｗ＋Ｌ，０≦ｔ≦Ｈからなる矩形領域ｗ−Ｌ≦ｓ≦ｗ＋Ｌ，０≦ｔ≦Ｈに対する不連続尺度を次のように表す。 A rectangular region w−L ≦ s ≦ w + L composed of a right end portion w−L ≦ s ≦ w−1, 0 ≦ t ≦ H in the left region and a left end portion w ≦ s ≦ w + L, 0 ≦ t ≦ H in the right region. , 0 ≦ t ≦ H, the discontinuity scale is expressed as follows.

なお，Ｍ_h，Ｍ_wは，最小変更幅として予め与えられる自然数である。このとき， M _h and M _w are natural numbers given in advance as the minimum change width. At this time,

分割により得られた矩形領域の中で分割コストが最大の領域（分割対象領域）に対して，分割対象領域に含まれる係数の個数を上式におけるｋとみなして，上記と同様の分割処理を施す。選択した領域に対して分割が行われなかった場合には，先に選択された領域の次に分割コストが大きな領域に対して，上記と同様の分割処理を施す。この繰り返しは，全ての分割領域に対して分割が行われなくなるまで続ける。または，分割領域内の画素数の下限値を設定しておき，分割の結果，この下限値を下回るような領域は生成されないように制限を設けることも可能である。 For the rectangular area obtained by the division with the maximum division cost (division target area), the number of coefficients included in the division target area is regarded as k in the above equation, and the same division process as above is performed. Apply. If the selected area has not been divided, the same dividing process as described above is performed on the area having the next highest division cost after the previously selected area. This repetition is continued until all the divided areas are not divided. Alternatively, it is possible to set a lower limit value for the number of pixels in the divided area and to limit generation so that an area below the lower limit value is not generated as a result of the division.

［画像符号化装置の構成例］
図１は，本発明の実施形態に係る画像符号化装置の構成例を示す。画像符号化装置１０は映像信号を入力すると，予測部１４により予測された予測信号との差分から予測残差信号を求め，その予測残差信号を変換部１１により冗長系の変換基底を用いて変換する。係数選択処理部２０は，変換部１１の出力である変換係数の絞り込みを行うために，画面内を性質の類似した領域に分割し，分割領域ごとにコストが小さくなる変換係数を符号化対象として選択する。エントロピ符号化部１５は，選択された変換係数を可変長符号化し，符号化ストリームとして出力する。 [Configuration Example of Image Encoding Device]
FIG. 1 shows a configuration example of an image coding apparatus according to an embodiment of the present invention. When the image encoding device 10 receives the video signal, the image encoding device 10 obtains a prediction residual signal from the difference from the prediction signal predicted by the prediction unit 14, and the prediction residual signal is converted by the conversion unit 11 using the conversion base of the redundant system. Convert. The coefficient selection processing unit 20 divides the screen into regions of similar properties in order to narrow down the transform coefficients that are the output of the transform unit 11, and sets the transform coefficients that have a lower cost for each divided region as encoding targets. select. The entropy encoding unit 15 performs variable length encoding on the selected transform coefficient and outputs it as an encoded stream.

一方，係数選択処理部２０の出力は，逆変換部１２で逆変換され，その変換結果に予測信号が加えられて復号信号が生成される。復号信号は，歪除去フィルタ１３によりノイズ除去処理がなされ，参照復号信号として予測部１４に入力される。予測部１４では，次の映像信号の符号化のための予測信号の生成を行う。 On the other hand, the output of the coefficient selection processing unit 20 is inversely transformed by the inverse transformation unit 12, and a prediction signal is added to the transformation result to generate a decoded signal. The decoded signal is subjected to noise removal processing by the distortion removal filter 13 and input to the prediction unit 14 as a reference decoded signal. The prediction unit 14 generates a prediction signal for encoding the next video signal.

係数選択処理部２０における変換係数の選択は，次のように行われる。まず，最適分割処理部２２は，画像の領域を分割する複数の分割候補となる位置のそれぞれについて分割位置を設定し，領域分割を行い，係数選択部２２１により，全分割領域の係数の総数が一定という条件下で，全分割領域の歪み量の総和が最小となる最適な領域分割，または歪み量の総和と分割領域の境界部の不連続性を最小化する領域分割を求め，その各分割領域に対する変換係数の選択を行う。コスト算出部２２２により，領域分割に応じた符号化のコスト，すなわち，全分割領域の歪み量の総和（変換係数の切り捨てにより生じる誤差の和），または，分割領域の境界部における不連続性をコストとして考慮する場合には，その不連続性のコストを算出する。その結果をもとに，最適分割処理部２２は，コストが最も小さくなる領域分割を最適分割として選択する。 Selection of the conversion coefficient in the coefficient selection processing unit 20 is performed as follows. First, the optimum division processing unit 22 sets a division position for each of a plurality of division candidate positions for dividing an image region, performs region division, and the coefficient selection unit 221 calculates the total number of coefficients of all divided regions. Find the optimal area division that minimizes the sum of the distortion amounts of all the divided areas under the constant condition, or the area division that minimizes the discontinuity of the boundary between the total distortion amount and the divided areas. Select the conversion coefficient for the region. The cost calculation unit 222 calculates the encoding cost according to the region division, that is, the total sum of distortion amounts of all the divided regions (sum of errors caused by truncation of transform coefficients) or the discontinuity at the boundary portion of the divided regions. When considering it as a cost, the cost of the discontinuity is calculated. Based on the result, the optimum division processing unit 22 selects the area division with the smallest cost as the optimum division.

領域分割形状設定部２１は，最適分割処理部２２によって決定された領域分割を示す情報と，その領域分割に対して係数選択部２２１によって選択された変換係数とを符号化対象情報として，エントロピ符号化部１５に出力する。 The region division shape setting unit 21 uses the information indicating the region division determined by the optimum division processing unit 22 and the transform coefficient selected by the coefficient selection unit 221 for the region division as the encoding target information, and uses the entropy code. To the conversion unit 15.

［係数選択処理フロー］
図２は，係数選択処理部２０が実行する係数選択処理フローを示す図である。ここでは，主に［領域分割方法４］により領域分割を行う場合の実施例を説明するが，この方法は，［領域分割方法１］〜［領域分割方法３］の方法を包含しており，以下の説明から［領域分割方法１］〜［領域分割方法３］の領域分割による係数選択処理についても同様に実施することができることは明らかである。 [Coefficient selection process flow]
FIG. 2 is a diagram illustrating a coefficient selection processing flow executed by the coefficient selection processing unit 20. Here, an embodiment in the case of performing region division mainly by [region division method 4] will be described, but this method includes the methods of [region division method 1] to [region division method 3] From the following description, it is clear that the coefficient selection processing by area division of [area division method 1] to [area division method 3] can be similarly performed.

この例では，係数選択処理部２０は，分割対象領域の係数の総数Ｋと，処理対象画像の領域（最初は入力画像信号の全領域）とを引数とするＳｅｇｍｅｎｔ関数によって呼び出され，以下の処理を行う。
［ステップＳ１］：領域を領域１と領域２とに分割するとした場合の領域１の係数の個数ｋ１をループインデックスとして，初期値をｋ１＝０，増分を１とし，ｋ１＜Ｋの間，ステップＳ１〜Ｓ３の処理を繰り返す。なお，領域２の係数の個数ｋ２は，ｋ２＝Ｋ−ｋ１となる。
［ステップＳ２］：領域１の係数の個数ｋ１と，領域２の係数の個数ｋ２（＝Ｋ−ｋ１）と，入力領域とを引数として，Ｏｐｔ＿ｓｕｂ＿ｄｉｖ関数（最適分割処理関数）を呼び出す。この関数によって，最適分割処理部２２により，図３，図４に示す最適分割処理が実行される。
［ステップＳ３］：ループインデックスｋ１に１を加算し，ステップＳ１以降の処理を繰り返す。ｋ１がＫ以上になったならばループを終了する。
［ステップＳ４］：ステップＳ２の処理結果から領域１，領域２の最適な係数個数を算出する。
［ステップＳ５］：分割無しラベルを持たない領域の有無を判定し，分割無しラベルを持たない領域がある場合には，ステップＳ６へ進む。すべての領域が分割無しラベルを持つ領域になった場合には，ステップＳ８へ進む。
［ステップＳ６］：コスト関数値が最大の領域を同定する。この領域の係数の個数をＫ′とする。
［ステップＳ７］：係数の個数Ｋ′とコスト関数値が最大となる領域とを引数とするＳｅｇｍｅｎｔ関数を呼び出し，係数選択処理部２０による処理を，分割無しラベルを持つ領域が無くなるまで，再帰的に繰り返す。
［ステップＳ８］：コスト関数値が最大の領域を同定し，Ｓｅｇｍｅｎｔ関数の呼び出し元へ戻る。 In this example, the coefficient selection processing unit 20 is called by the Segment function having as arguments the total number K of coefficients in the division target area and the area of the processing target image (initially all areas of the input image signal). I do.
[Step S1]: When the region is divided into the region 1 and the region 2, the number k1 of the coefficients of the region 1 is set as a loop index, the initial value is k1 = 0, the increment is 1, and k1 <K. The processes of S1 to S3 are repeated. Note that the number k2 of the coefficients in the region 2 is k2 = K−k1.
[Step S2]: The Opt_sub_div function (optimal division processing function) is called with the number of coefficients k1 in region 1, the number k2 of coefficients in region 2 (= K−k1), and the input region as arguments. With this function, the optimum division processing unit 22 executes the optimum division processing shown in FIGS.
[Step S3]: 1 is added to the loop index k1, and the processing after step S1 is repeated. When k1 becomes K or more, the loop is terminated.
[Step S4]: Calculate the optimum number of coefficients for the regions 1 and 2 from the processing result of the step S2.
[Step S5]: It is determined whether or not there is an area having no undivided label. If there is an area having no undivided label, the process proceeds to step S6. If all the regions are regions having no division labels, the process proceeds to step S8.
[Step S6]: A region having the maximum cost function value is identified. Let K ′ be the number of coefficients in this region.
[Step S7]: The Segment function is called with the number of coefficients K ′ and the area having the maximum cost function value as arguments, and the process by the coefficient selection processing unit 20 is recursively performed until there is no area with no division label. Repeat.
[Step S8]: The area having the maximum cost function value is identified, and the process returns to the caller of the Segment function.

［最適分割処理フロー］
図３および図４に，最適分割処理部２２が実行する最適分割処理フローを示す。最適分割処理部２２は，図２に示すステップＳ２におけるＯｐｔ＿ｓｕｂ＿ｄｉｖ関数（最適分割処理関数）によって呼び出される。入力は，領域１の係数の個数ｋ１と，領域２の係数の個数ｋ２と，最適分割対象の領域である。
［ステップＳ１０］：縦分割位置（領域を垂直方向に２分割する位置）をループインデックスとして，初期値を０，増分をＭとし，縦分割位置＜画面縦幅の間，ステップＳ１０〜Ｓ１２の処理を繰り返す。
［ステップＳ１１］：係数の個数ｋ１，ｋ２および縦分割位置を引数として，コスト関数算出関数を呼び出す。この関数によって，コスト算出部２２２により，図５に示すコスト関数算出処理が実行される。
［ステップＳ１２］：ループインデックスの縦分割位置にＭを加算し，ステップＳ１０以降の処理を繰り返す。縦分割位置が画面縦幅以上になったならばループを終了する。
［ステップＳ１３］：コスト関数算出の処理結果から最適縦分割位置を算出し，図４のステップＳ２０へ進む。
［ステップＳ１４］：横分割位置（領域を水平方向に２分割する位置）をループインデックスとして，初期値を０，増分をＭとし，横分割位置＜画面横幅の間，ステップＳ１４〜Ｓ１６の処理を繰り返す。
［ステップＳ１５］：係数の個数ｋ１，ｋ２および横分割位置を引数として，コスト関数算出関数を呼び出す。この関数によって，コスト算出部２２２により，図５に示すコスト関数算出処理が実行される。
［ステップＳ１６］：ループインデックスの横分割位置にＭを加算し，ステップＳ１４以降の処理を繰り返す。横分割位置が画面横幅以上になったならばループを終了する。
［ステップＳ１７］：コスト関数算出の処理結果から最適横分割位置を算出し，図４のステップＳ２０へ進む。なお，ステップＳ１０〜Ｓ１３とステップＳ１４〜Ｓ１７の処理は，パラレルに実行してもシリアルに実行してもどちらでもよい。
［ステップＳ２０］：最適縦分割位置を用いたコストと最適横分割位置を用いたコストとの大小を比較し，最適縦分割位置を用いたコストのほうが小さい場合には，ステップＳ２１へ進み，そうでない場合には，ステップＳ２５へ進む。
［ステップＳ２１］：一方の分割領域の画素数が０か，または一方の分割領域の変換係数の個数が０の場合，ステップＳ２２へ進み，そうでない場合には，ステップＳ２４へ進む。
［ステップＳ２２］：入力された領域に対して，領域の分割は行わないことを示す分割無しラベルを付与する。
［ステップＳ２３］：Ｏｐｔ＿ｓｕｂ＿ｄｉｖ関数の呼び出し元へ分割無しラベルを返し，処理を終了する。
［ステップＳ２４］：最適縦分割位置の情報，各分割領域の変換係数，各分割領域の近似誤差，コスト関数値を，Ｏｐｔ＿ｓｕｂ＿ｄｉｖ関数の呼び出し元へ返し，処理を終了する。
［ステップＳ２５］：一方の分割領域の画素数が０か，または一方の分割領域の変換係数の個数が０の場合，ステップＳ２６へ進み，そうでない場合には，ステップＳ２８へ進む。
［ステップＳ２６］：入力された領域に対して，領域の分割は行わないことを示す分割無しラベルを付与する。
［ステップＳ２７］：Ｏｐｔ＿ｓｕｂ＿ｄｉｖ関数の呼び出し元へ分割無しラベルを返し，処理を終了する。
［ステップＳ２８］：最適横分割位置の情報，各分割領域の変換係数，各分割領域の近似誤差，コスト関数値を，Ｏｐｔ＿ｓｕｂ＿ｄｉｖ関数の呼び出し元へ返し，処理を終了する。 [Optimal split processing flow]
FIG. 3 and FIG. 4 show the optimum division processing flow executed by the optimum division processing unit 22. The optimum division processing unit 22 is called by the Opt_sub_div function (optimum division processing function) in step S2 shown in FIG. The inputs are the number of coefficients k1 in region 1, the number k2 of coefficients in region 2, and the region to be optimally divided.
[Step S10]: The vertical division position (position at which the region is divided into two in the vertical direction) is set as a loop index, the initial value is 0, the increment is M, and the processing of steps S10 to S12 is performed while the vertical division position <the screen vertical width. repeat.
[Step S11]: The cost function calculation function is called with the number of coefficients k1, k2 and the vertical division position as arguments. With this function, the cost calculation unit 222 executes the cost function calculation process shown in FIG.
[Step S12]: M is added to the vertical division position of the loop index, and the processing after step S10 is repeated. If the vertical division position exceeds the screen vertical width, the loop is terminated.
[Step S13]: The optimum vertical division position is calculated from the processing result of the cost function calculation, and the process proceeds to step S20 in FIG.
[Step S14]: The horizontal division position (position at which the region is divided into two in the horizontal direction) is set as a loop index, the initial value is 0, the increment is M, and the processing of steps S14 to S16 is performed while the horizontal division position <the horizontal width of the screen. repeat.
[Step S15]: The cost function calculation function is called with the number of coefficients k1 and k2 and the horizontal division position as arguments. With this function, the cost calculation unit 222 executes the cost function calculation process shown in FIG.
[Step S16]: M is added to the horizontal division position of the loop index, and the processes in and after step S14 are repeated. If the horizontal division position exceeds the screen horizontal width, the loop is terminated.
[Step S17]: The optimum horizontal division position is calculated from the processing result of the cost function calculation, and the process proceeds to Step S20 in FIG. Note that the processes of steps S10 to S13 and steps S14 to S17 may be executed in parallel or serially.
[Step S20]: The cost using the optimum vertical division position is compared with the cost using the optimum horizontal division position. If the cost using the optimum vertical division position is smaller, the process proceeds to step S21. If not, the process proceeds to step S25.
[Step S21]: If the number of pixels in one divided area is 0 or the number of transform coefficients in one divided area is 0, the process proceeds to step S22, and if not, the process proceeds to step S24.
[Step S22]: A non-division label indicating that the area is not divided is assigned to the input area.
[Step S23]: A non-divided label is returned to the caller of the Opt_sub_div function, and the process ends.
[Step S24]: The information of the optimal vertical division position, the conversion coefficient of each divided area, the approximate error of each divided area, and the cost function value are returned to the caller of the Opt_sub_div function, and the process is terminated.
[Step S25]: If the number of pixels in one divided area is 0 or the number of transform coefficients in one divided area is 0, the process proceeds to step S26, and if not, the process proceeds to step S28.
[Step S26]: A non-division label indicating that the area is not divided is assigned to the input area.
[Step S27]: A label without division is returned to the caller of the Opt_sub_div function, and the process is terminated.
[Step S28]: The information of the optimal horizontal division position, the conversion coefficient of each divided area, the approximate error of each divided area, and the cost function value are returned to the caller of the Opt_sub_div function, and the process is terminated.

［コスト関数算出処理フロー］
図５に，コスト算出部２２２が実行するコスト関数算出処理フローを示す。コスト算出部２２２は，図３に示すステップＳ１１およびＳ１５におけるコスト関数算出関数によって呼び出される。入力は，領域１の係数の個数ｋ１，領域２の係数の個数ｋ２，縦分割位置または横分割位置である。
［ステップＳ３０］：領域１に対して，ｋ１個の係数を選択する係数選択処理を行う。
［ステップＳ３１］：領域１に対して，上記の係数選択処理により選択されたｋ１個の係数で近似した場合の近似誤差を算出する。
［ステップＳ３２］：領域１に対して，上記の係数選択処理により選択されたｋ１個の係数で近似した場合の復号信号を得る。その後，ステップＳ３６へ進む。
［ステップＳ３３］：領域２に対して，ｋ２個の係数を選択する係数選択処理を行う。
［ステップＳ３４］：領域２に対して，上記の係数選択処理により選択されたｋ２個の係数で近似した場合の近似誤差を算出する。
［ステップＳ３５］：領域２に対して，上記の係数選択処理により選択されたｋ２個の係数で近似した場合の復号信号を得る。なお，ステップＳ３０〜Ｓ３２とステップＳ３３〜Ｓ３５とは，パラレルに実行してもシリアルに実行してもどちらでもよい。
［ステップＳ３６］：領域１，２の復号信号の境界領域に対して，不連続尺度を算出する。
［ステップＳ３７］：領域１，２の近似誤差和にλ×不連続尺度を加算して，コスト関数値を算出する。算出したコスト関数値を，関数の呼び出し元へ返し，処理を終了する。 [Cost function calculation processing flow]
FIG. 5 shows a cost function calculation process flow executed by the cost calculation unit 222. The cost calculation unit 222 is called by the cost function calculation function in steps S11 and S15 shown in FIG. The input is the number of coefficients k1 in region 1, the number k2 of coefficients in region 2, and the vertical division position or horizontal division position.
[Step S30]: A coefficient selection process for selecting k1 coefficients is performed for the region 1.
[Step S31]: An approximation error is calculated when the region 1 is approximated by k1 coefficients selected by the coefficient selection process.
[Step S32]: A decoded signal is obtained when the region 1 is approximated by k1 coefficients selected by the coefficient selection process. Thereafter, the process proceeds to step S36.
[Step S33]: A coefficient selection process for selecting k2 coefficients for the region 2 is performed.
[Step S34]: An approximation error is approximated when the region 2 is approximated by k2 coefficients selected by the coefficient selection process described above.
[Step S35]: A decoded signal is obtained when the region 2 is approximated by k2 coefficients selected by the coefficient selection process. Note that steps S30 to S32 and steps S33 to S35 may be executed in parallel or serially.
[Step S36]: A discontinuity measure is calculated for the boundary region of the decoded signals in regions 1 and 2.
[Step S37]: λ × discontinuity scale is added to the approximate error sum of regions 1 and 2 to calculate a cost function value. The calculated cost function value is returned to the function caller, and the process ends.

以上の画像符号化の処理は，コンピュータとソフトウェアプログラムとによっても実現することができ，そのプログラムをコンピュータ読み取り可能な記録媒体に記録することも，ネットワークを通して提供することも可能である。 The above-described image encoding processing can be realized by a computer and a software program, and the program can be recorded on a computer-readable recording medium or provided through a network.

図６は，本発明をソフトウェアプログラムを用いて実現する場合のシステムの構成例を示している。メモリ５２には，本発明の画像符号化処理を行うための画像符号化プログラム５３が格納される。ＣＰＵ５０は，メモリ５２に格納された画像符号化プログラム５３の命令を逐次フェッチして実行する。映像記憶装置５１は，符号化対象の映像信号を記憶する装置である。映像信号は，図示省略したカメラ等から入力するようにしてもよい。画像符号化プログラム５３によって生成された符号化ストリームは，符号化ストリーム記憶装置５４に格納される。または，ネットワークアダプタ等のインタフェースを介して，符号化ストリームを外部装置に出力してもよい。システムバス５５は，ＣＰＵ５０，映像記憶装置５１，メモリ５２，符号化ストリーム記憶装置５４を接続するバスである。 FIG. 6 shows an example of the system configuration when the present invention is implemented using a software program. The memory 52 stores an image encoding program 53 for performing the image encoding process of the present invention. The CPU 50 sequentially fetches and executes the instructions of the image encoding program 53 stored in the memory 52. The video storage device 51 is a device that stores a video signal to be encoded. The video signal may be input from a camera or the like (not shown). The encoded stream generated by the image encoding program 53 is stored in the encoded stream storage device 54. Alternatively, the encoded stream may be output to an external device via an interface such as a network adapter. The system bus 55 is a bus that connects the CPU 50, the video storage device 51, the memory 52, and the encoded stream storage device 54.

１０画像符号化装置
１１変換部
１２逆変換部
１３歪除去フィルタ
１４予測部
１５エントロピ符号化部
２０係数選択処理部
２１領域分割形状設定部
２２最適分割処理部
２２１係数選択部
２２２コスト算出部 DESCRIPTION OF SYMBOLS 10 Image coding apparatus 11 Conversion part 12 Inverse conversion part 13 Distortion removal filter 14 Prediction part 15 Entropy encoding part 20 Coefficient selection process part 21 Area division shape setting part 22 Optimal division process part 221 Coefficient selection part 222 Cost calculation part

Claims

In an image encoding method for performing conversion using a redundant conversion basis on an input image signal, narrowing down the obtained transform coefficients, and coding the transform coefficients obtained as a result of the narrowing down,
A process of setting a division position for each of a plurality of division candidate positions for dividing an image region;
Dividing the image area at the set division position and selecting transform coefficients for each divided area under the condition that the total number of coefficients of all the divided areas is constant;
Among all the region divisions respectively divided at the plurality of division candidate positions, the region division that minimizes the total amount of distortion due to the selection of transform coefficients for all regions is selected, and encoding in each division region is performed. And a process for selecting a transform coefficient to be used.

In an image encoding method for performing conversion using a redundant conversion basis on an input image signal, narrowing down the obtained transform coefficients, and coding the transform coefficients obtained as a result of the narrowing down,
A process of setting a division position for each of a plurality of division candidate positions for dividing an image region;
Dividing the image area at the set division position and selecting transform coefficients for each divided area under the condition that the total number of coefficients of all the divided areas is constant;
Among all the area divisions divided at the positions to be the plurality of division candidates, the total distortion amount by selection of the conversion coefficient of all areas and the conversion in two boundary areas having a predetermined width in contact with the boundary of the divided areas An image coding method comprising: selecting a region division that minimizes a weighted sum with a divergence degree of an approximation error based on a narrowing down of coefficients, and selecting a transform coefficient used for encoding in each divided region Method.

The image encoding method according to claim 1 or 2,
An image coding method characterized in that the number of coefficients in a divided area is set according to the number of pixels in the divided area.

In the image coding method of any one of Claim 1- Claim 3,
An image coding method comprising: recursively repeating the process of selecting the area division and transform coefficients until a predetermined division end condition is satisfied for each divided area in the selected area division.

In an image encoding apparatus that performs conversion using a conversion base of a redundant system on an input image signal, narrows down the obtained conversion coefficients, and encodes the conversion coefficients obtained as a result of the reduction,
Means for setting a division position for each of a plurality of division candidate positions for dividing an image area;
Means for dividing an image area at the set division position and selecting transform coefficients for each divided area under a condition that the total number of coefficients of all the divided areas is constant;
Among all the region divisions respectively divided at the plurality of division candidate positions, the region division that minimizes the total amount of distortion due to the selection of transform coefficients for all regions is selected, and encoding in each division region is performed. An image encoding device comprising: means for selecting a transform coefficient to be used.

In an image encoding apparatus that performs conversion using a conversion base of a redundant system on an input image signal, narrows down the obtained conversion coefficients, and encodes the conversion coefficients obtained as a result of the reduction,
Means for setting a division position for each of a plurality of division candidate positions for dividing an image area;
Means for dividing an image area at the set division position and selecting transform coefficients for each divided area under a condition that the total number of coefficients of all the divided areas is constant;
Among all the area divisions divided at the positions to be the plurality of division candidates, the total distortion amount by selection of the conversion coefficient of all areas and the conversion in two boundary areas having a predetermined width in contact with the boundary of the divided areas An image code comprising: selecting a region division that minimizes a weighted sum with a divergence degree of approximation error based on a coefficient narrowing down; and selecting a transform coefficient used for encoding in each divided region Device.

In the image coding device according to claim 5 or 6,
An image coding apparatus characterized in that the number of coefficients in a divided area is set according to the number of pixels in the divided area.

An image encoding program for causing a computer to execute the image encoding method according to any one of claims 1 to 4.