JP2003032680A

JP2003032680A - Image coded, image coding method, program code of the same image coder, and memory medium of the same image coder

Info

Publication number: JP2003032680A
Application number: JP2001218665A
Authority: JP
Inventors: Kinya Oosa; 欣也大佐
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2001-07-18
Filing date: 2001-07-18
Publication date: 2003-01-31

Abstract

PROBLEM TO BE SOLVED: To reduce the difference between the image quality of each tile and the one of input image of a image coder, even when decoding the coded data of each tile. SOLUTION: A code-quantity control portion 104 of the image code so decides every tile divided by a tile dividing portion 101 the basic code-quantity to be assigned to each tile as to control every tile the increase/decrease of the basic code-quantity to be assigned to each tile, by using the information of the flatness calculated every tile by a tile-flatness calculating portion 17.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、画像を符号化する
画像符号化装置、及び画像符号化方法、並びにプログラ
ムコード、記憶媒体に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image coding apparatus for coding an image, an image coding method, a program code, and a storage medium.

【０００２】[0002]

【従来の技術】ＪＰＥＧ２０００画像符号化方式（ＩＳ
Ｏ／ＩＥＣＦＤＩＳ１５４４４−１）のように、画像
をＷａｖｅｌｅｔ変換して符号化する符号化方式は、ブ
ロックベースのＤＣＴを利用するＪＰＥＧベースライン
符号化方式等と比較して、圧縮度を上げてもブロックひ
ずみを生じず画質的なメリットがある。このような画質
的なメリットを最大限に生かすには、入力画像全体に対
してＷａｖｅｌｅｔ変換を施すことが有効であるが、実
際に符号化装置を実装するにあたっては、入力画像のサ
イズが非常に大きい、あるいは、符号化装置のバッファ
容量が限られている、等の理由により、入力画像全体を
一度にＷａｖｅｌｅｔ変換、符号化できず、入力画像を
複数の領域（タイル）に分割して、各タイルをそれぞれ
符号化するという手法を取ることがある。2. Description of the Related Art JPEG2000 image coding system (IS
O / IEC FDIS15444-1), a coding method for Wavelet transforming and coding an image is higher than a JPEG baseline coding method using block-based DCT even if the degree of compression is increased. There is no block distortion and there is a merit in image quality. In order to make the most of such image quality advantage, it is effective to perform Wavelet conversion on the entire input image. However, when actually implementing the encoding device, the size of the input image is extremely large. Due to a large size or a limited buffer capacity of the encoding device, the entire input image cannot be wavelet-converted and encoded at one time, and the input image is divided into a plurality of regions (tiles), The technique of encoding each tile may be taken.

【０００３】このようなタイル分割の処理例を図２に示
す。同図において入力画像２１はタイル分割部２４によ
って点線部分でタイル２２やタイル２３に分割され、分
割されたタイル２２，２３は一つづつ符号化器で符号化
される。分割されたタイル２２，２３は入力画像２１と
比べてサイズが小さいため、この後の符号化器で持つバ
ッファ容量を限定し、より容易に低コストでの実装を可
能にする。An example of such tile division processing is shown in FIG. In the figure, the input image 21 is divided into tiles 22 and tiles 23 by a tile dividing part 24 at a dotted line portion, and the divided tiles 22 and 23 are encoded by an encoder one by one. Since the divided tiles 22 and 23 are smaller in size than the input image 21, the buffer capacity of the subsequent encoder is limited, which allows easier and low-cost implementation.

【０００４】図３に入力画像をタイル分割し、分割され
たタイル毎に符号化を行う画像符号化装置の機能構成を
示す。FIG. 3 shows a functional configuration of an image coding apparatus that divides an input image into tiles and encodes each divided tile.

【０００５】３０１はタイル分割部で、入力画像を所定
のサイズを有する領域（タイル）に分割し、分割したタ
イルを１つずつ変換符号化部３０２に出力する。３０２
は変換符号化部で、入力されたタイルに対して変換処理
を行い、変換係数を生成し、量子化・エントロピ符号化
部３０３に出力する。この変換処理は例えば周波数変換
があり、周波数変換として離散ウェーブレット変換やＤ
ＣＴ等がある。３０３は量子化・エントロピ符号化部３
０３で、まず変換係数に対して量子化を行い、変換係数
の量子化値を得る。そしてこの量子化値に対してエント
ロピ符号化を行い、符号化データとして出力する。な
お、この符号化データにはヘッダが添付され、このヘッ
ダには上述の符号化処理に用いたパラメータ、例えば量
子化ステップやタイルのサイズなどが含まれている。３
０４は符号量制御部で、量子化・エントロピ符号化部３
０３において量子化、エントロピ符号化を行う際に発生
する符号量を監視しており、発生する符号量に応じて量
子化・エントロピ符号化部３０３を制御（符号化制御）
する。例えば、発生する符号化データのサイズが予め指
定したサイズになるようにする。A tile dividing unit 301 divides the input image into regions (tiles) having a predetermined size, and outputs the divided tiles to the transform coding unit 302 one by one. 302
Is a transform coding unit, which transforms the input tiles, generates transform coefficients, and outputs them to the quantization / entropy coding unit 303. This conversion processing includes, for example, frequency conversion, and discrete wavelet conversion or D
There are CT etc. Reference numeral 303 denotes the quantization / entropy coding unit 3
At 03, first, the transform coefficient is quantized to obtain the quantized value of the transform coefficient. Then, the quantized value is entropy-encoded and output as encoded data. A header is attached to the encoded data, and the header includes parameters used in the above-described encoding process, such as a quantization step and a tile size. Three
Reference numeral 04 is a code amount control unit, which is a quantization / entropy encoding unit 3
In 03, the code amount generated when performing the quantization and entropy coding is monitored, and the quantization / entropy coding unit 303 is controlled according to the generated code amount (coding control).
To do. For example, the size of the generated encoded data is set to a size designated in advance.

【０００６】この符号量制御の手法として、ＩＳＯ／Ｉ
ＥＣＦＤＩＳ１５４４４−１の"Ｒａｔｅｃｏｎｔ
ｒｏｌ"の節では、指定された符号量において、符号化
された画像と入力画像の誤差を最小化するレート／ディ
ストーション最適化の手法を用いており、誤差が最小と
いう意味で最も良い画質が得られるよう、発生する符号
量を制御している。As a method of controlling the code amount, ISO / I
EC FDIS15444-1 "Rate cont
The "roll" section uses a rate / distortion optimization method that minimizes the error between the encoded image and the input image at the specified code amount, and the best image quality is obtained in the sense that the error is the minimum. The generated code amount is controlled so that

【０００７】[0007]

【発明が解決しようとする課題】レート／ディストーシ
ョン最適化の手法を用いて符号量制御を行うことは、誤
差が最小という意味では最良の画質が得られるような制
御となる。しかし、人間の視覚的な特性により、誤差が
最小であることと視覚的に画質が良いことは必ずしも一
致しない。特に、タイル分割を行って符号化を行う場合
には、誤差を最小化するだけでは次の様な問題を生じ
る。この問題を図２に示した入力画像２１を用いて説明
する。When the code amount control is performed by using the rate / distortion optimization method, the control is such that the best image quality is obtained in the sense that the error is minimum. However, due to human visual characteristics, the minimum error does not always correspond to the visually good image quality. In particular, in the case of performing tile division and encoding, the following problems occur only by minimizing the error. This problem will be described using the input image 21 shown in FIG.

【０００８】まず入力画像２１において平坦な画像領域
を多く含むタイル２２、平坦でない画像領域を多く含む
タイル２３の比較を行う。ここで平坦な画像領域とは、
夫々の画素値の変化が比較的小さい画素を含む領域を示
す。一方、平坦でない画像領域とは、夫々の画素値の変
化は比較的大きい画素を含む領域を示す。First, the input image 21 is compared with the tile 22 containing many flat image areas and the tile 23 containing many non-flat image areas. Here, the flat image area is
An area including pixels in which the change in each pixel value is relatively small is shown. On the other hand, an image area that is not flat indicates an area that includes pixels in which the change in each pixel value is relatively large.

【０００９】ＩＳＯ／ＩＥＣＦＤＩＳ１５４４４−１
に記述された符号量制御を行うと、画像全体に対する誤
差を最小化するように符号量制御が行われ、タイル２
２、タイル２３にも誤差の面で最適な符号量の配分が行
われる。ところが、実際に復号後の画像を目で見てみる
と、平坦でないタイル２３の画質は比較的良好なのに対
して、平坦なタイル２２はタイル２３に比べてぼやけた
画質になり、画像全体としてみると、平坦部と平坦でな
い部分の画質に差のある違和感のある画像となる。これ
は、人間の視覚特性におけるマスキング効果の影響であ
り、平坦でない画像領域では画像情報自体の変動により
符号化誤差がマスキングされて多少大きな誤差でも目立
たないのに対し、平坦な領域ではマスキングが働かない
ため小さな誤差でも目立つためである。ISO / IEC FDIS15444-1
When the code amount control described in the above is performed, the code amount control is performed so as to minimize the error with respect to the entire image.
2, the code amount is optimally distributed to the tile 23 in terms of error. However, when actually looking at the image after decoding, the image quality of the tile 23 that is not flat is relatively good, whereas the image quality of the tile 22 that is flat becomes blurry compared to the tile 23, and the entire image is viewed. Then, an image with a sense of discomfort is obtained in which there is a difference in image quality between the flat portion and the non-flat portion. This is due to the effect of the masking effect on human visual characteristics.In a non-flat image area, a coding error is masked by fluctuations in the image information itself, and even a slightly large error is inconspicuous, whereas in a flat area, masking works. This is because even a small error is noticeable because it is not present.

【００１０】本発明は以上の問題に鑑みて成されたもの
であり、各タイルの符号化データを復号しても、各タイ
ル毎の画質の差を軽減することを目的とする。The present invention has been made in view of the above problems, and it is an object of the present invention to reduce the difference in image quality between tiles even when the encoded data of each tile is decoded.

【００１１】[0011]

【課題を解決するための手段】本発明の目的を達成する
ために、例えば本発明の画像符号化装置は以下の構成を
備える。In order to achieve the object of the present invention, for example, an image coding apparatus of the present invention has the following configuration.

【００１２】すなわち、JPEG2000(ISO/IEC 15444-1)規
格に従って画像を符号化する画像符号化装置であって、画像をタイルに分割する分割手段と、各タイルに対して変換処理を施し、変換係数を生成する
変換手段と、前記変換係数に対して符号化処理を行う符号化手段と、各タイル毎に割り当てる基本符号量を計算する基本符号
量計算手段と、注目タイルのデータを元に注目タイルの平坦度を示す指
標を計算する指標計算手段と、前記基本符号量計算手段により計算された注目タイルに
割り当てる基本符号量を、前記指標計算手段により計算
された注目タイルの平坦度を示す指標に応じて、平坦度
が大きいほど割当量が増えるように、割当量を増減させ
る修正符号量計算手段と、前記修正符号量計算手段により計算された注目タイルの
割り当て符号量に基づいて前記符号化手段を制御する制
御手段とを備える。That is, an image coding apparatus for coding an image in accordance with the JPEG2000 (ISO / IEC 15444-1) standard, which comprises a dividing means for dividing an image into tiles, and a conversion process for each tile Transform means for generating a coefficient, encoding means for performing an encoding process on the transform coefficient, basic code amount calculation means for calculating a basic code amount to be assigned to each tile, and attention based on the data of the tile of interest An index calculation unit for calculating an index indicating the flatness of the tile, and an index indicating the flatness of the target tile calculated by the index calculation unit for the basic code amount to be assigned to the target tile calculated by the basic code amount calculation unit. Accordingly, the correction code amount calculation means for increasing or decreasing the allocation amount such that the allocation amount increases as the flatness increases, and the allocation of the tile of interest calculated by the correction code amount calculation means. And a control means for controlling the encoding means on the basis of the issue amount.

【００１３】[0013]

【発明の実施の形態】以下添付図面を参照して、本発明
を好適な実施形態に従って詳細に説明する。BEST MODE FOR CARRYING OUT THE INVENTION The present invention will now be described in detail according to preferred embodiments with reference to the accompanying drawings.

【００１４】［第１の実施形態］図１に本実施形態にお
ける画像符号化装置の機能構成を示す。[First Embodiment] FIG. 1 shows the functional arrangement of an image coding apparatus according to this embodiment.

【００１５】１０１はタイル分割部で、入力画像を所定
のサイズの領域（タイル）に分割し、各タイルは変換符
号化部１０２に出力する。A tile division unit 101 divides an input image into regions (tiles) of a predetermined size, and outputs each tile to a transform coding unit 102.

【００１６】１０２は変換符号化部で、各タイルに対し
て変換処理を行い、変換係数を生成する。この変換処理
として本実施形態では周波数変換を用いる。なお、この
周波数変換にはＤＣＴや離散ウェーブレット変換が含ま
れるがこのいずれでも良い。A transform coding unit 102 transforms each tile to generate transform coefficients. In this embodiment, frequency conversion is used as this conversion processing. The frequency conversion includes DCT and discrete wavelet conversion, but any of these may be used.

【００１７】１０３は量子化・エントロピ符号化部で、
変換符号化部１０２による変換係数に対して所定の量子
化ステップを用いて量子化を行い、量子化された変換係
数に対してエントロピ符号化を行う。Reference numeral 103 denotes a quantization / entropy coding unit,
The transform coefficient by the transform coding unit 102 is quantized using a predetermined quantization step, and entropy coding is performed on the quantized transform coefficient.

【００１８】１０４は符号量制御部で、量子化・エント
ロピ符号化部で行われる量子化、エントロピ符号化部に
より発生する符号化データの符号量を監視し、この符号
量に基づいて量子化・エントロピ符号化部１０４を制御
する。詳細は後述する。A code amount control unit 104 monitors the code amount of the coded data generated by the quantization / entropy coding unit and the entropy coding unit, and quantizes / quantizes based on this code amount. The entropy coding unit 104 is controlled. Details will be described later.

【００１９】１０５はタイル平坦度計算部で、タイル毎
の平坦度を計算し、計算した平坦度を符号量制御部１０
４に出力する。詳細は後述する。Reference numeral 105 denotes a tile flatness calculation unit, which calculates the flatness for each tile and uses the calculated flatness as the code amount control unit 10.
Output to 4. Details will be described later.

【００２０】また、本実施形態における画像符号化装置
の基本構成を図５に示す。FIG. 5 shows the basic configuration of the image coding apparatus according to this embodiment.

【００２１】５０１はＣＰＵで、ＲＡＭ５０２やＲＯＭ
５０３に格納されたプログラムやデータを用いて画像符
号化装置全体の制御を行うと共に、図１に示した機能構
成を有する本実施形態の画像符号化処理を実行する。Reference numeral 501 is a CPU, which is a RAM 502 or ROM.
The program and data stored in 503 are used to control the entire image encoding apparatus, and also execute the image encoding processing of this embodiment having the functional configuration shown in FIG.

【００２２】５０２はＲＡＭで、外部記憶装置５０４や
記憶媒体ドライブ５０９からロードされたプログラムや
データ等を一時的に格納するエリアを備えると共に、Ｃ
ＰＵ５０１が各種の処理を実行する際に用いるワークエ
リアを備える。A RAM 502 has an area for temporarily storing programs and data loaded from the external storage device 504 and the storage medium drive 509, and has a C
The PU 501 has a work area used when executing various processes.

【００２３】５０３はＲＯＭで、画像符号化装置全体の
制御を行うプログラムやデータなどを格納する。A ROM 503 stores programs and data for controlling the entire image coding apparatus.

【００２４】５０４はハードディスクなどの外部記憶装
置で、記憶媒体ドライブ５０９からロードされたプログ
ラムやデータなどを保存することができる。また、ワー
クエリアのサイズがＲＡＭ５０２の許容を越えた場合
に、越えた分のエリアをファイルとして提供することが
できる。Reference numeral 504 denotes an external storage device such as a hard disk, which can store programs and data loaded from the storage medium drive 509. Further, when the size of the work area exceeds the capacity of the RAM 502, the excess area can be provided as a file.

【００２５】５０５、５０６はキーボード、マウスで、
各種の指示を画像符号化装置に対して入力することがで
きる。Reference numerals 505 and 506 are a keyboard and a mouse.
Various instructions can be input to the image encoding device.

【００２６】５０７は表示装置で、ＣＲＴや液晶画面な
どにより構成されており、各種の画像情報や文字情報な
どを表示することができる。A display device 507 is composed of a CRT, a liquid crystal screen, etc., and can display various image information and character information.

【００２７】５０８は画像入力装置で、スキャナやディ
ジタルカメラ等により構成されており、画像をディジタ
ルデータとして画像符号化装置に入力することができ
る。なお、画像入力装置は画像符号化装置と接続する為
のインターフェースを含む。An image input device 508 is composed of a scanner, a digital camera and the like, and can input an image as digital data to the image coding device. The image input device includes an interface for connecting to the image encoding device.

【００２８】５０９は記憶媒体ドライブで、ＣＤ−ＲＯ
ＭやＤＶＤ−ＲＯＭ、フロッピー（登録商標）ディスク
などの記憶媒体からプログラムやデータなどを読み取
り、ＲＡＭ５０２や外部記憶装置５０４に出力すること
ができる。A storage medium drive 509 is a CD-RO.
Programs and data can be read from a storage medium such as an M, a DVD-ROM, and a floppy (registered trademark) disk, and output to the RAM 502 and the external storage device 504.

【００２９】５１０は上述の各部を繋ぐバスである。A bus 510 connects the above-mentioned units.

【００３０】以上の構成を備える本実施形態の画像符号
化装置の、符号量制御部１０４，タイル平坦度計算部１
０５における処理について以下、詳細に説明する。The code amount control unit 104 and the tile flatness calculation unit 1 of the image coding apparatus of the present embodiment having the above configuration.
The processing in 05 will be described in detail below.

【００３１】符号量制御部１０４は、例えば、ISO/IEC
FDIS15444-1に記載されたレート／ディストーション最
適化などの手法により、タイル毎に割り当てる基本的な
符号量を決定する。そして、タイル平坦度計算部１７に
よって計算されたタイル毎の平坦度の情報を用いて、割
り当てる基本的な符号量をタイル毎に増減する制御を行
う。The code amount control unit 104 is, for example, an ISO / IEC
A basic code amount to be assigned to each tile is determined by a method such as rate / distortion optimization described in FDIS15444-1. Then, using the flatness information for each tile calculated by the tile flatness calculation unit 17, control is performed to increase or decrease the basic code amount to be allocated for each tile.

【００３２】タイルｉの平坦度がＦｉで与えられる（Ｆ
ｉが大きいほどタイルが平坦である）とすると、タイル
毎の基本的な符号量割り当てＢ_{ｂａｓｅ−ｉ}を次式のよ
うにＦｉに応じて増減させることによって、タイルｉに
実際に割り当てる符号量Ｂｉを得ることができる。The flatness of tile i is given by Fi (F
Assuming that the larger i is, the flatter the tile is), the basic code amount allocation B _base-i for each tile is increased or decreased according to Fi as shown in the following equation, and the code amount Bi actually allocated to the tile i is Can be obtained.

【００３３】Ｂｉ＝Ｂ_{ｂａｓｅ−ｉ}×（１＋ｆ（Ｆｉ））ここで関数f(x)は、xの値に対して単調に増加する関数
であり、例えば、ｆ（ｘ）＝ａ（ｘ−ｂ）ただし、ａ，ｂは定数
で、ａ＞０が考えられる。この例の関数の場合、定数ａの値が、平
坦度Ｆｉの変化量に対する符号割当量Ｂｉの増減の度合
いを決め、平坦度Ｆｉの値が定数ｂの値より大きい場合
は、ｆ（Ｆｉ）＞０、すなわち、Ｂｉを増加させ、Ｆｉ
の値がｂの値より小さい場合は、ｆ（Ｆｉ）＜０、すな
わち、Ｂｉを減少させる。なお、定数ａ，ｂの値は、視
覚的な改善効果が得られ、かつ画像全体としての符号量
が意図する符号量になるように定められる。Bi = B _base−i × (1 + f (Fi)) Here, the function f (x) is a function that monotonically increases with respect to the value of x. For example, f (x) = a (x− b) However, a and b are constants, and a> 0 is considered. In the case of the function of this example, the value of the constant a determines the degree of increase or decrease of the code allocation amount Bi with respect to the change amount of the flatness Fi, and if the value of the flatness Fi is larger than the value of the constant b, f (Fi) > 0, that is, Bi is increased and Fi is increased.
When the value of is smaller than the value of b, f (Fi) <0, that is, Bi is decreased. The values of the constants a and b are determined so that a visual improvement effect can be obtained and the code amount of the entire image becomes the intended code amount.

【００３４】関数ｆ（ｘ）は、単調に増加する関数であ
れば、この例の関数に限らないが、非常に大きなｘの値
や、非常に小さなｘの値に対して、符号割当量Ｂｉが極
端な値に変化することを避けるため、このようなｘに対
しては、関数ｆ（ｘ）の値を上限または下限で飽和させ
ることが望ましい。The function f (x) is not limited to the function of this example as long as it is a function that increases monotonically, but for a very large value of x or a very small value of x, the code allocation amount Bi In order to avoid an extreme change in the value, it is desirable to saturate the value of the function f (x) at the upper limit or the lower limit for such x.

【００３５】タイル平坦度計算部１０５において計算さ
れる平坦度Ｆｉとしては、タイルｉ内の各画素値の分散
の逆数を用いるが、これに限定されるものではなく、タ
イル内の各画素値と画素値の平均値の差分の絶対値和を
求め、その逆数を用いても良い。また、あるいは、タイ
ル内の各画素値を順にスキャンしたときに、隣合う２画
素値の差の絶対値の総和を求め、その逆数を平坦度とし
て用いることもできる。As the flatness Fi calculated by the tile flatness calculation unit 105, the reciprocal of the variance of each pixel value in the tile i is used, but the invention is not limited to this, and the pixel value in each tile It is also possible to obtain the sum of absolute values of the differences between the average values of the pixel values and use the reciprocal thereof. Alternatively, when each pixel value in the tile is sequentially scanned, the sum of the absolute values of the differences between two adjacent pixel values may be obtained and the reciprocal thereof may be used as the flatness.

【００３６】タイルｉ内の画素値をＰ_ｉｊ（０≦ｊ＜Ｎ
−１、Ｎはタイルｉ内の画素数）、Ｐ_ｉｊの平均値を
Ｐ'とすると、平坦度Ｆｉとしてタイルｉ内の各画素値
の分散の逆数を用いた場合、以下の式に従って平坦度Ｆ
ｉを求めることができる。The pixel value in tile i is set to P _ij (0 ≦ j <N
-1, N is the number of pixels in tile i), and _P'is the average value of P _ij , and when the inverse of the variance of each pixel value in tile i is used as the flatness Fi, the flatness is calculated according to the following equation. F
i can be obtained.

【００３７】Ｆｉ＝Ｎ／｛（Ｐ_ｉ０−Ｐ'）^２＋（Ｐ
_ｉ１−Ｐ'）^２、、＋（Ｐ_{ｉ（Ｎ−１）}−Ｐ'）^２｝一方、平坦度Ｆｉとして、タイル内の各画素値と画素値
の平均値の差分の絶対値和を求め、その逆数を用いた場
合、平坦度Ｆｉは以下の式に従って求めることができ
る。Fi = N / {(P _i0 -P ') ² + (P
_i1− P ′) ² ,, + (P _{i (N−1)} −P ′) ² } On the other hand, as the flatness Fi, the absolute value sum of the difference between each pixel value in the tile and the average value of the pixel values is calculated. , And the reciprocal thereof, the flatness Fi can be obtained according to the following equation.

【００３８】Ｆｉ＝Ｎ／｛｜Ｐ_ｉ０−Ｐ'｜＋｜Ｐ_ｉ１
−Ｐ'｜、、＋｜Ｐ_{ｉ（Ｎ−１）}−Ｐ'｜｝あるいは、平坦度Ｆｉとして、タイル内の各画素値を順
にスキャンしたときに、隣合う２つの画素値の差の絶対
値の総和を求め、その逆数を用いた場合、平坦度Ｆｉは
以下の式に従って求めることができる。Fi = N / {| P _i0 −P ′ | + | P _i1
−P ′ | ,, + | P _{i (N−1)} −P ′ |} Or, as the flatness Fi, when scanning each pixel value in the tile in order, the absolute difference between two adjacent pixel values is obtained. When the sum of the values is calculated and its reciprocal is used, the flatness Fi can be calculated according to the following equation.

【００３９】Ｆｉ＝Ｎ／｛｜Ｐ_ｉ１−Ｐ_ｉ０｜＋｜Ｐ
_ｉ２−Ｐ_ｉ１｜、、＋｜Ｐ_{ｉ（Ｎ−１} _）−Ｐ
_{ｉ（Ｎ−２）}｜｝そして、タイル平坦度計算部１０５が上述の通り求めた
平坦度Ｆｉを用いて、符号量制御部１０４は各タイル毎
に実際に割り当てる符号量Ｂｉを上述の式に基づいて求
める。Fi = N / {| P _i1 −P _i0 | + | P
_i2- _Pi1 | ,, + | _Pi _(N-1 ₎ -P
_{i (N−2)} |} Then, using the flatness Fi obtained by the tile flatness calculation unit 105 as described above, the code amount control unit 104 sets the code amount Bi actually assigned to each tile into the above formula. Seek based on.

【００４０】以上の処理により全てのタイルに対して符
号化処理を施すと、各タイルの符号化結果を含む符号化
データを生成する。またこの符号化データには各タイル
毎の量子化ステップやエントロピ符号化に用いるパラメ
ータなどを含むヘッダを添付する。When all tiles are coded by the above processing, coded data including the coding result of each tile is generated. A header including a quantization step for each tile and parameters used for entropy coding is attached to this coded data.

【００４１】以上の説明により、本実施形態における画
像符号化装置及び画像符号化方法は、各タイル毎の平坦
度を考慮した符号化を行うので、これらのタイルを復号
しても各タイル毎の画質の差を軽減することができる。As described above, the image coding apparatus and the image coding method according to the present embodiment perform coding in consideration of the flatness of each tile. Therefore, even if these tiles are decoded, each tile is decoded. The difference in image quality can be reduced.

【００４２】［第２の実施形態］変換符号化部１０１に
おける周波数変換処理としてウェーブレット変換を用い
た場合について説明する。変換符号化部１０１がタイル
毎にウェーブレット変換を行うと、各タイルは図４に示
すようなサブバンドに分割される。[Second Embodiment] A case where wavelet transform is used as the frequency transform process in the transform coding unit 101 will be described. When the transform coding unit 101 performs wavelet transform for each tile, each tile is divided into subbands as shown in FIG.

【００４３】同図においてＬＬサブバンドの係数は、タ
イルの縮小画像に相当しているため、タイル全体に対し
て平坦度の計算をしなくても、ＬＬサブバンドに対して
平坦度の計算を行えば、少ない計算量でタイルの平坦度
を算出できる。具体的には、タイルｉのＬＬサブバンド
の係数値をＬＬ_ｉｋ（０≦ｋ＜Ｍ−１、ＭはＬＬサブバ
ンド内の変換係数の数）、ＬＬ_ｉｋの平均値をＬＬ'と
すると、平坦度Ｆｉとしてタイルｉ内の各変換係数の分
散の逆数を用いた場合、以下の式に従って平坦度Ｆｉを
求めることができる。In the figure, since the coefficient of the LL subband corresponds to the reduced image of the tile, the flatness of the LL subband can be calculated without calculating the flatness of the entire tile. If done, the flatness of the tile can be calculated with a small amount of calculation. Specifically, when the coefficient value of the LL subband of tile i is LL _ik (0 ≦ k <M−1, M is the number of transform coefficients in the LL subband), and the average value of LL _ik is LL ′, When the reciprocal of the variance of each transform coefficient in the tile i is used as the flatness Fi, the flatness Fi can be obtained according to the following formula.

【００４４】Ｆｉ＝Ｍ／｛（ＬＬ_ｉ０−ＬＬ'）^２＋
（ＬＬ_ｉ１−ＬＬ'）^２、、＋（ＬＬ_ｉ _{（Ｍ−１）}−Ｌ
Ｌ'）^２｝一方、平坦度Ｆｉとして、サブバンドＬＬ内の各変換係
数と変換係数の平均値の差分の絶対値を求め、その逆数
を用いた場合、平坦度Ｆｉは以下の式に従って求めるこ
とができる。Fi = M / {(LL _i0 -LL ') ² +
(LL _i1 −LL ′) ² , + (LL _i _(M−1) −L
L ′) ² } On the other hand, as the flatness Fi, the absolute value of the difference between each transform coefficient in the subband LL and the average value of the transform coefficients is obtained, and when the reciprocal thereof is used, the flatness Fi is obtained according to the following formula. be able to.

【００４５】Ｆｉ＝Ｍ／｛｜ＬＬ_ｉ０−ＬＬ'｜＋｜Ｌ
Ｌ_ｉ１−ＬＬ'｜、、＋｜ＬＬ_ｉ（Ｎ _−１）−ＬＬ'｜｝あるいは、平坦度Ｆｉとして、サブバンドＬＬ内の各変
換係数値を順にスキャンしたときに、隣合う２つの変換
係数値の差の絶対値の総和を求め、その逆数を用いた場
合、平坦度Ｆｉは以下の式に従って求めることができ
る。Fi = M / {| LL _i0 −LL ′ | + | L
L _i1 −LL ′ |, + | LL _{i (N} ₋₁₎ −LL ′ |} Alternatively, when the respective transform coefficient values in the subband LL are sequentially scanned as the flatness Fi, two adjacent transforms are obtained. When the sum of the absolute values of the difference between coefficient values is calculated and the reciprocal thereof is used, the flatness Fi can be calculated according to the following formula.

【００４６】Ｆｉ＝Ｎ／｛｜ＬＬ_ｉ１−ＬＬ_ｉ０｜＋｜
ＬＬ_ｉ２−ＬＬ_ｉ１｜、、＋｜ＬＬ _{ｉ（Ｎ−１）}−ＬＬ
_{ｉ（Ｎ−２）}｜｝Fi = N / {| LL_i1-LL_i0│ + │
LL_i2-LL_i1| ,, + | LL _{i (N-1)}-LL
_{i (N-2)}|}

【発明の効果】以上の説明により、本発明によって、各
タイルの符号化データを復号しても、各タイル毎の画質
の差を軽減することができる。As described above, according to the present invention, even when the encoded data of each tile is decoded, the difference in image quality between tiles can be reduced.

[Brief description of drawings]

【図１】本発明の第１の実施形態における画像符号化装
置の機能構成を示す図である。FIG. 1 is a diagram showing a functional configuration of an image encoding device according to a first embodiment of the present invention.

【図２】タイル分割の処理例を示す図である。FIG. 2 is a diagram illustrating a processing example of tile division.

【図３】入力画像をタイル分割し、分割されたタイル毎
に符号化を行う従来の画像符号化装置の機能構成を示す
図である。FIG. 3 is a diagram showing a functional configuration of a conventional image encoding device that divides an input image into tiles and encodes each divided tile.

【図４】サブバンドを示す図である。FIG. 4 is a diagram showing subbands.

【図５】本発明の実施形態における画像符号化装置の基
本構成を示す図である。FIG. 5 is a diagram showing a basic configuration of an image encoding device according to an embodiment of the present invention.

Claims

[Claims]

1. An image coding apparatus for coding an image in accordance with the JPEG2000 (ISO / IEC 15444-1) standard, wherein the image coding apparatus divides the image into tiles, and a conversion process is performed on each tile. Transform means for generating a coefficient, encoding means for performing an encoding process on the transform coefficient, basic code amount calculation means for calculating a basic code amount to be assigned to each tile, and attention based on the data of the tile of interest An index calculation unit for calculating an index indicating the flatness of the tile, and an index indicating the flatness of the target tile calculated by the index calculation unit for the basic code amount to be assigned to the target tile calculated by the basic code amount calculation unit. Accordingly, the correction code amount calculating means for increasing or decreasing the allocation amount so that the allocation amount increases as the flatness increases, and the allocation code amount of the target tile calculated by the correction code amount calculating means. The image encoding device characterized by comprising a control means for controlling the encoding means are.

2. The image coding apparatus according to claim 1, wherein the index calculation means uses an inverse number of a variance of the data of the tile of interest as an index of the tile of interest.

3. The index calculating means obtains an absolute sum of differences between each data of the tile of interest and an average value of the data, and uses the reciprocal thereof as an index of the tile of interest. 1. The image encoding device according to 1.

4. The index calculating means obtains a total sum of absolute values of differences between two adjacent data when scanning each data of the tile of interest, and uses the reciprocal thereof as an index of the tile of interest. The image coding apparatus according to claim 1, wherein:

5. The image coding apparatus according to claim 1, wherein the data of the tile of interest is a value of a pixel included in the tile of interest.

6. The image coding apparatus according to claim 1, wherein the data of the tile of interest is a transform coefficient forming an LL subband of the tile of interest.

7. The basic code amount calculation means calculates a basic code amount to be assigned to the tile of interest as a rate / rate.
The image coding apparatus according to claim 1, wherein the image coding apparatus is obtained by using distortion optimization.

8. The index of the target tile is F, the basic code amount to be allocated to the target tile is B _base ,
When the function f (x) is a function that monotonically increases with respect to the value of x, the modified code amount calculation means assigns the code amount B assigned to the target tile to B = B _base × (1 + f (F) )
The image coding apparatus according to claim 1, wherein the image coding apparatus obtains by calculating.

9. The modified code amount means is characterized in that f (x) saturates to an upper limit value for a large value of x and f (x) saturates to a lower limit value for a small value of x. The image encoding device according to claim 8.

10. An image encoding method for encoding an image according to the JPEG2000 (ISO / IEC 15444-1) standard, comprising a dividing step of dividing an image into tiles, and a conversion process for each tile A conversion process for generating coefficients, an encoding process for performing the encoding process on the conversion coefficient, a basic code amount calculation process for calculating a basic code amount to be assigned to each tile, An index calculation step of calculating an index indicating the flatness of the tile, and a basic code amount to be assigned to the target tile calculated in the basic code amount calculation step, an index indicating the flatness of the target tile calculated in the index calculation step. Accordingly, the correction code amount calculation step of increasing or decreasing the allocation amount so that the allocation amount increases as the flatness increases, and the allocation code amount of the target tile calculated in the correction code amount calculation step Picture coding method and a controlling step of controlling No. step.

11. The image coding method according to claim 10, wherein in the index calculating step, an inverse number of a variance of the data of the tile of interest is used as an index of the tile of interest.

12. The index calculating step obtains an absolute sum of differences between each data of the target tile and an average value of the respective data, and uses the reciprocal thereof as an index of the target tile. 10. The image coding method according to item 10.

13. In the index calculation step, when each data of the tile of interest is scanned in order, a sum of absolute values of differences between two adjacent data is calculated, and the reciprocal thereof is used as an index of the tile of interest. 11. The method according to claim 10,
The image coding method described in.

14. The image encoding method according to claim 10, wherein the data of the tile of interest is a value of a pixel included in the tile of interest.

15. The image encoding method according to claim 10, wherein the data of the tile of interest is a transform coefficient forming an LL subband of the tile of interest.

16. In the code amount calculation step, a basic code amount to be assigned to the tile of interest is set as a rate / rate.
The image coding method according to claim 10, wherein the image coding is performed using distortion optimization.

17. When the index of the tile of interest is F, the basic code amount assigned to the tile of interest is B _base , and the function f (x) is a function that monotonically increases with respect to the value of x, In the correction code amount calculation step, the code amount B to be assigned to the target tile is B = B _base × (1 + f
The image encoding method according to claim 10, wherein the image encoding method is obtained by calculating (F)).

18. The modified code amount means is characterized in that f (x) saturates to an upper limit value for a large value of x and f (x) saturates to a lower limit value for a small value of x. The image coding method according to claim 17.

19. A program code for executing the image coding method according to claim 10. Description:

20. A computer-readable storage medium storing the program code according to claim 19.