JP5482177B2

JP5482177B2 - Encoding device, program, recording medium

Info

Publication number: JP5482177B2
Application number: JP2009284416A
Authority: JP
Inventors: 宏幸作山; 潤一原; 純一山形; 多聞貞末
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2009-12-15
Filing date: 2009-12-15
Publication date: 2014-04-23
Anticipated expiration: 2029-12-15
Also published as: JP2011130026A

Description

本発明は、符号化装置、プログラム、記録媒体に関する。 The present invention relates to an encoding device, a program, and a recording medium.

一般的に、画像は、（１）絵柄と（２）文字・線画（絵柄と異なる記号なども含む、以下では、「非絵柄」という。）と、で構成されている。近年、この画像を符号化する際には、（１）絵柄と（２）文字・線画とに分離し、各々に対して符号化を施すことが提案されている。 Generally, an image is composed of (1) a picture and (2) a character / line drawing (including a symbol different from the picture, hereinafter referred to as “non-picture”). In recent years, when encoding this image, it has been proposed to separate (1) a picture and (2) a character / line image and perform encoding on each.

例えば、特許文献１などでは、ＭＲＣ（ミクストラスターコンテント）モデルとよばれる手法が提案されている。図１にＭＲＣモデルの概念図を示す。図１の例では、原画像αは、「Ｂｕｙｎｏｗｆａｓｔｅｓｔｐｒｉｎｔｅｒ」という（２）文字Ａと、コピー機の（１）絵柄Ｂとが示されている。以下では、「Ｂｕｙｎｏｗ」を文字Ａ１とし、「ｆａｓｔｅｓｔｐｒｉｎｔｅｒ」を文字Ａ２とする。そして、文字Ａ１．文字Ａ２をまとめて文字Ａという。 For example, Patent Literature 1 proposes a method called an MRC (Mixe Straster Content) model. FIG. 1 shows a conceptual diagram of the MRC model. In the example of FIG. 1, the original image α shows (2) character A “Buy now fastest printer” and (1) picture B of the copier. In the following, “Buy now” is referred to as a character A1, and “fastest printer” is referred to as a character A2. And the characters A1. Character A2 is collectively referred to as character A.

そして、前景Ｐ（以下、「非絵柄部色画像Ｐ」という。）と、マスクＱ（以下、「非絵柄部形状画像Ｑ」という。）と、背景Ｒ（以下、「絵柄部画像Ｒ」という。）との３つに分けられる。 A foreground P (hereinafter referred to as “non-pattern part color image P”), a mask Q (hereinafter referred to as “non-pattern part shape image Q”), and a background R (hereinafter referred to as “picture part image R”). And 3).

非絵柄部色画像Ｐは、文字Ａの色の画像を示した画像である。非絵柄部形状画像Ｑは文字Ａ１の形状の画像を示した画像である。絵柄部画像Ｒは、背景を示した画像である。このように、ＭＲＣモデルでは３レイヤにわけ、レイヤごとに符号化を行う。 The non-design part color image P is an image showing a color A character image. The non-pattern part shape image Q is an image showing an image of the shape of the character A1. The pattern part image R is an image showing a background. As described above, the MRC model is divided into three layers, and encoding is performed for each layer.

また、特許文献２では、ＭＲＣモデルを用いてマスクを分割し、部分画像へのアクセス性を向上させる。 In Patent Document 2, a mask is divided using an MRC model to improve accessibility to partial images.

ＭＲＣモデルにおける非絵柄部色画像Ｐの作成手法には、大略して２つある。第１の手法として、図２に模式的に示すように、文字Ａ１の色Ｓ（図１参照）と、文字Ａ２の色Ｔ（図１参照）との間をグラデーション状Ｇ（図１参照）にして、ぼかして非絵柄部色画像Ｐを作成する手法である。なお、図２の図面の簡略化のために、色Ｓを黒色で示し、色Ｔを白色で示す。図２では、文字Ａ１、文字Ａ２以外の画素を非文字画素という。 There are roughly two methods for creating the non-pattern part color image P in the MRC model. As a first method, as schematically shown in FIG. 2, a gradation G (see FIG. 1) is formed between the color S (see FIG. 1) of the character A1 and the color T (see FIG. 1) of the character A2. Thus, the non-pattern part color image P is created by blurring. In order to simplify the drawing of FIG. 2, the color S is shown in black and the color T is shown in white. In FIG. 2, pixels other than the characters A1 and A2 are referred to as non-character pixels.

図２に示すように、この第１の手法は、文字Ａ１および文字Ａ２との間の非文字画素の色Ｕ１〜Ｕ３を、徐々に遷移させたい色（グラデーション色）とする手法である。この第１の手法を用いると、少量のエッジしか生じないために、非絵柄部色画像Ｐの圧縮率を上げることができる。つまり、この第１の手法では、非絵柄部色画像Ｐは低周波の色成分で構成される。 As shown in FIG. 2, this first method is a method in which the colors U1 to U3 of the non-character pixels between the character A1 and the character A2 are gradually changed (gradation colors). When this first method is used, only a small amount of edges are generated, so that the compression rate of the non-pattern part color image P can be increased. That is, in this first method, the non-pattern part color image P is composed of low-frequency color components.

第２の手法による、３つのレイヤに分けた場合の模式図を図３に示す。図３は図１と比較して、非絵柄部色画像Ｐ'のみが異なる。非絵柄部色画像Ｐ'中の文字Ａ１とＡ２それぞれには、色Ｓ、色Ｔが色づけされている。そして、文字Ａ１とＡ２には背景色Ｃが付けられている。 FIG. 3 shows a schematic diagram in the case of dividing into three layers by the second method. FIG. 3 differs from FIG. 1 only in the non-pattern part color image P ′. The letters S1 and A2 in the non-pattern part color image P ′ are colored with the colors S and T, respectively. The characters A1 and A2 are given a background color C.

図４に、第２の手法について模式的に示す。なお、図４の図面の簡略化のために、色Ｓを黒色で示し、色Ｔを白色で示す。図４に示すように、この手法は、文字Ａ１（文字画素）および文字Ａ２（文字画素）との間に位置し、符号量が最も少なく、非文字画素である特定画素Ｃ（例えば、黒やグレー）である。なお、特定画素Ｃの集合が特定画像である。従って、以下では、特定画素Ｃの集合を特定画像Ｃともいう。 FIG. 4 schematically shows the second method. For simplification of the drawing of FIG. 4, the color S is shown in black and the color T is shown in white. As shown in FIG. 4, this method is located between the character A1 (character pixel) and the character A2 (character pixel), has the smallest code amount, and is a specific pixel C (for example, black or black) that is a non-character pixel. Gray). A set of specific pixels C is a specific image. Therefore, hereinafter, the set of specific pixels C is also referred to as a specific image C.

そして、非文字画素の領域を特定画像Ｃで埋めて、非絵柄部色画像Ｐ'を生成する。換言すれば、非絵柄部色画像Ｐ'は、原画像αの非絵柄（文字など）の色画像（色画素の集合）および該非絵柄の色画像以外の位置に占める特定画像で構成されるものである。また、特定画素Ｃの集合が特定画像になる。 Then, the non-character pixel region is filled with the specific image C to generate the non-pattern part color image P ′. In other words, the non-pattern part color image P ′ is composed of a color image (a set of color pixels) of the non-pattern (characters) of the original image α and a specific image occupying a position other than the color image of the non-pattern. It is. A set of specific pixels C becomes a specific image.

つまり、文字Ａ１の色ＳおよびＡ２の色Ｔに関らず、文字Ａ１とＡ２との間に、符合量が少ない特定画像Ｃが用いられる。特定画像Ｃは、符号量が少ないことからこの第２の手法により、非絵柄部色画像Ｐ'の圧縮率を上げることができる。また、この第２の手法では、文字Ａ１と文字Ａ２との間はなめらかに遷移せず、エッジが生じる。従って、非絵柄部色画像Ｐ'は高周波を含むものとなる。 That is, regardless of the color S of the character A1 and the color T of the A2, the specific image C having a small code amount is used between the characters A1 and A2. Since the specific image C has a small code amount, the compression rate of the non-picture part color image P ′ can be increased by the second method. Further, in the second method, the transition between the character A1 and the character A2 is not smoothly performed, and an edge is generated. Therefore, the non-pattern part color image P ′ includes a high frequency.

ここで、第１の手法と第２の手法とを比べる。原稿に占める文字・線画の割合が少ない場合には、第１の手法では、文字Ａ１と文字Ａ２との間に広い遷移範囲Ｅ１〜Ｅ３（グラデーション領域）が生じ、非絵柄部色画像Ｐの符号量が増える傾向にある。一方、第２の手法であれば、符号量が少ない特定画像Ｃが原稿の多くを占めるため、符号量が少なくなる。 Here, the first method and the second method are compared. When the ratio of characters / line drawings in the original is small, in the first method, a wide transition range E1 to E3 (gradation area) occurs between the characters A1 and A2, and the code of the non-pattern part color image P is generated. The amount tends to increase. On the other hand, in the case of the second method, since the specific image C having a small code amount occupies most of the original, the code amount is small.

また、原稿に占める文字・線画の割合が多い場合には、第１の手法では、文字Ａ１と文字Ａ２との間に狭い遷移範囲Ｅ１〜Ｅ３が生じ、非絵柄部色画像Ｐの符号量が減る傾向にある。一方、第２の手法であれば、符号量が少ない特定画像が原稿の割合は少なくなり、エッジが生じるため、符号量が多くなるという問題がある。 Further, when the ratio of the character or line drawing occupying the document is large, in the first method, the narrow transitive range E1~E3 occurs between the characters A1 and character A2, the code amount of the non-picture portion color images P Tend to decrease. On the other hand, in the second method, there is a problem in that the amount of code increases because a specific image with a small code amount has a small ratio of originals and an edge is generated.

本発明の符号化装置であれば、上記第２の手法を用いた場合でも、符号量を少なくして、適切な符号化することを目的とする。 The encoding device of the present invention aims to reduce the amount of codes and perform appropriate encoding even when the second method is used.

前記目的を達成するため、原画像から、該原画像の絵柄が表示される絵柄部画像と、該原画像の非絵柄の色画像および該非絵柄の色画像以外の位置に占める特定画像が表示される非絵柄部色画像と、前記非絵柄の形状が表示されるマスク画像と、を生成する生成部と、
前記絵柄部画像および前記非絵柄部色画像を周波数変換する周波数変換部と、
前記周波数変換部により得られた前記絵柄部画像の周波数係数および前記非絵柄部色画像の周波数係数を量子化する量子化部と、
前記量子化された前記絵柄部画像の周波数係数および前記非絵柄部色画像の周波数係数を符号化する符号化部とを、有し、
前記量子化は、周波数ゲインのための正規化のための量子化を除き、
前記非絵柄部色画像の高周波帯域の周波数係数の量子化ステップ数を、前記絵柄部画像の高周波帯域の周波数係数の量子化ステップ数よりも小さくすることを特徴とする符号化装置を提供する。 In order to achieve the above object, a pattern image in which a pattern of the original image is displayed, a non-picture color image of the original image, and a specific image occupying a position other than the non-picture color image are displayed from the original image. A generating unit that generates a non- image part color image and a mask image on which the shape of the non-pattern is displayed ;
A frequency converter that converts the frequency of the pattern part image and the non-pattern part color image;
A quantization unit that quantizes the frequency coefficient of the picture part image and the frequency coefficient of the non-picture part color image obtained by the frequency conversion unit ;
An encoding unit that encodes the frequency coefficient of the quantized picture part image and the frequency coefficient of the non-picture part color image;
The quantization, except for normalization for frequency gain,
To provide an encoding apparatus, wherein the quantization step number of the frequency coefficients of the high frequency band of the non-picture portion color images, small Kusuru than the quantization step number of the frequency coefficients of the high frequency band of the picture portion image .

本発明の符号化装置であれば、上記第２の手法を用いた場合でも、符号量を少なくして、適切な符号化することができる。 With the encoding apparatus of the present invention, even when the second method is used, it is possible to reduce the amount of codes and perform appropriate encoding.

原画像を第１の手法による３つのレイヤに分けた場合を示す図。The figure which shows the case where an original image is divided into three layers by the 1st method. 第１の手法を模式的に示した図。The figure which showed the 1st method typically. 原画像を第２の手法による３つのレイヤに分けた場合を示す図。The figure which shows the case where an original image is divided into three layers by the 2nd method. 第２の手法を模式的に示した図。The figure which showed the 2nd method typically. ＭＲＣ符号の構成例。An example of the configuration of an MRC code. ＪＰＭ符号の構成例。The structural example of a JPM code | symbol. 本実施例の符号化装置の構成例。1 is a configuration example of an encoding apparatus according to the present embodiment. 本実施例のＣＰＵの機能構成例を示した図。The figure which showed the function structural example of CPU of a present Example. 本実施例の符号化装置の主な処理の流れを示した図。The figure which showed the flow of the main processes of the encoding apparatus of a present Example. 原画像を５つのレイヤに分けた場合を示す図。The figure which shows the case where an original image is divided into five layers. Ｓｏｂｅｌオペレータの一例を示す図（その１）。The figure which shows an example of a Sobel operator (the 1). Ｓｏｂｅｌオペレータの一例を示す図（その２）。The figure which shows an example of a Sobel operator (the 2). ＤＣレベルシフト後の原画像と座標系を示した図。The figure which showed the original image and coordinate system after DC level shift. 垂直方向へのフィルタリング後の係数の配列。An array of coefficients after filtering in the vertical direction. 水平方向へのフィルタリング後の係数の配列。An array of coefficients after horizontal filtering. 並べ替えた係数の配列。A sorted array of coefficients. ２回目の変換後の並べ替えた係数の配列を示した図。The figure which showed the arrangement | sequence of the rearranged coefficient after the 2nd conversion. デコンポジションレベルと解像度レベルの関係を示した図。The figure which showed the relationship between a decomposition level and a resolution level. 画像、タイル、サブバンドなどを示した図。The figure which showed the image, tile, subband, etc. Ｖｉｓｕａｌｄｉｓｔａｎｃｅ１０００［ｄｏｔ］のＶｉｓｕａｌＷｅｉｇｈｔを示した図。The figure which showed Visual Weight of Visual distance1000 [dot]. Ｖｉｓｕａｌｄｉｓｔａｎｃｅ３０００［ｄｏｔ］のＶｉｓｕａｌＷｅｉｇｈｔを示した図。The figure which showed Visual Weight of Visual distance3000 [dot]. ９＊７ウェーブレット変換の周波数変換ゲインの平方根を示した図。The figure which showed the square root of the frequency conversion gain of 9 * 7 wavelet transform. 色変換のゲインの平方根を示す図。The figure which shows the square root of the gain of color conversion. 正規化のための量子化ステップ数を示す図。The figure which shows the quantization step number for normalization. 前景Ｐのための実質的な量子化を示す図。The figure which shows the substantial quantization for the foreground P. 背景Ｒのための実質的な量子化を示す図。FIG. 6 shows substantial quantization for background R. 色差Ｃｂの実質的な量子化ステップ数／輝度Ｙの実質的な量子化ステップ数を示した図。The figure which showed the substantial quantization step number of the substantial quantization step number / brightness | luminance Y of color difference Cb. 色差Ｃｒの実質的な量子化ステップ数／輝度Ｙの実質的な量子化ステップ数を示した図。The figure which showed the substantial quantization step number of the substantial quantization step number / brightness | luminance Y of color difference Cr. 前景Ｐ'のための実質的な量子化ステップ数を示した図。The figure which showed the number of substantial quantization steps for foreground P '. 変形例１の前景Ｐ'、背景Ｑの輝度の実質的な量子化ステップ数を示した図。The figure which showed the substantial number of quantization steps of the brightness | luminance of the foreground P 'of the modification 1, and the background Q. 変形例１の色差Ｃｂの実質的な量子化ステップ数／輝度Ｙの実質的な量子化ステップ数を示した図。The figure which showed the substantial quantization step number of the substantial quantization step number / brightness | luminance Y of the color difference Cb of the modification 1. FIG. 変形例１の色差Ｃｒの実質的な量子化ステップ数／輝度Ｙの実質的な量子化ステップ数を示した図。The figure which showed the substantial quantization step number of the substantial quantization step number / brightness | luminance Y of the color difference Cr of the modification 1. FIG. 変形例２の前景Ｐ'、背景Ｑの共通な符号化しないサブビットプレーン数を示した図。The figure which showed the number of sub bit planes which the foreground P 'of the modification 2 and the common background Q do not encode. ５＊３ウェーブレット変換の周波数変換ゲインの平方根を示した図。The figure which showed the square root of the frequency conversion gain of 5 * 3 wavelet transform. 色変換のゲインを示した図。The figure which showed the gain of color conversion. 変形例３での正規化のための除数を示した図。The figure which showed the divisor for the normalization in the modification 3. FIG. 変形例３での前景Ｐ'における符号化しないビットプレーン数を示した図。The figure which showed the number of bit planes which are not encoded in the foreground P ′ in Modification 3. 変形例３での背景Ｑにおける符号化しないビットプレーン数を示した図。The figure which showed the bit plane number which is not encoded in the background Q in the modification 3. 本実施例におけるＪＰＭファイルのレイアウトオブジェクトを示した図（その１）。The figure which showed the layout object of the JPM file in a present Example (the 1). 本実施例におけるＪＰＭファイルのレイアウトオブジェクトを示した図（その２）。The figure which showed the layout object of the JPM file in a present Example (the 2). 本実施例の状態通知装置の主要構成を示すハードウェア構成図を示した図。The figure which showed the hardware block diagram which shows the main structures of the status notification apparatus of a present Example.

「発明を実施するための形態」の説明をする前に、本実施例の背景を述べる。近年、インターネット等のネットワークや公衆回線の発達および画像圧縮技術の進歩により、画像データの伝送速度は飛躍的に向上している。これに伴い、伝送可能な画像データの高解像度化、高階調化、カラー化が進んでおり、４００ｄｐｉ（ｄｏｔｐｅｒｉｎｃｈ）のフルカラー画像も実用的な伝送対象となってきている。ところで、このようなフルカラー画像をネットワークあるいは公衆回線を通じて伝送する際には、通常、画像を圧縮する。そのときの圧縮アルゴリズムとしては、一般的にはＪＰＥＧアルゴリズムが用いられている。また、画像ファイリングなどの画像の蓄積も盛んに行なわれており、カラー画像の蓄積には同様にＪＰＥＧアルゴリズムなどを用いて圧縮されたデータが使用されている。 Prior to the description of the “DETAILED DESCRIPTION”, the background of this embodiment will be described. In recent years, with the development of networks such as the Internet and public lines and the advancement of image compression technology, the transmission speed of image data has been dramatically improved. Along with this, higher resolution, higher gradation, and colorization of image data that can be transmitted are progressing, and 400 dpi (dot per inch) full-color images are becoming practical transmission targets. By the way, when transmitting such a full-color image through a network or a public line, the image is usually compressed. As a compression algorithm at that time, the JPEG algorithm is generally used. Also, image filing and other image accumulation is actively performed, and data compressed using a JPEG algorithm or the like is similarly used for color image accumulation.

ここで、ＪＰＥＧアルゴリズムについて説明する。まず、圧縮アルゴリズムについて説明する。入力された画像データは、必要ならば例えばＲＧＢ色空間からＬ* ａ* ｂ* 色空間へというように色空間変換が施された後、８画素×８画素のブロックに分割され、ブロックごとにＤＣＴ（ＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ）によって周波数領域に変換され、ブロックと同じ大きさの量子化テーブルで量子化された後、ジグザグスキャンでブロックデータからラスターデータに変換され、２次元ハフマン符号化によって圧縮データとなる。伸長アルゴリズムは、圧縮アルゴリズムの逆のアルゴリズムであり、ハフマン復号、ラスタ−ブロック変換、逆量子化、ＩＤＣＴ（ＩｎｖｅｒｓｅＤＣＴ）によって画像データが復元される。 Here, the JPEG algorithm will be described. First, the compression algorithm will be described. The input image data is subjected to color space conversion, for example, from RGB color space to L * a * b * color space, if necessary, and then divided into 8 × 8 pixel blocks. It is converted to the frequency domain by DCT (Discrete CosineTransform), quantized with a quantization table having the same size as the block, then converted from block data to raster data by zigzag scanning, and becomes compressed data by two-dimensional Huffman coding. . The decompression algorithm is an inverse algorithm of the compression algorithm, and image data is restored by Huffman decoding, raster-block transformation, inverse quantization, and IDCT (Inverse DCT).

しかし、上述の説明からもわかるように、ＪＰＥＧアルゴリズムでは非可逆過程である量子化を行なうため、伸長された画像は圧縮前の画像データよりも画質が劣化した画像となる。通常、量子化テーブルには圧縮率を稼ぐために高周波成分ほど量子化ステップが粗くなる量子化テーブルが用いられる。そのため、特に文字や線画のように高周波成分を多く含むエッジが存在する部分で、画質の劣化が著しく、また圧縮率も低下していた。 However, as can be seen from the above description, since the JPEG algorithm performs quantization, which is an irreversible process, the decompressed image becomes an image with a degraded image quality compared to the image data before compression. Usually, a quantization table is used in which the quantization step becomes coarser as the high frequency component increases in order to increase the compression ratio. For this reason, particularly in a portion where there is an edge containing a lot of high-frequency components such as characters and line drawings, the image quality is significantly deteriorated and the compression rate is also reduced.

また、図１で説明したように、ＭＲＣ方式では、１つの背景に対して、複数の「前景Ｐ（非絵柄部色画像Ｐ）＋マスクＱ（非絵柄部形状画像Ｑ）」を重ねあわせていく方式が一般的である。マスクＱとは前景Ｐを(例えば文字の形に)切り取るための形状情報を持つ。マスクＱの値が前景Ｐの透過率(前景と背景Ｑ（絵柄部画像Ｑ）の混合比)を持つ場合もある。また、前景Ｐ、マスクＱ、背景Ｒ、の解像度は、任意に選択できることもできる。 As described with reference to FIG. 1, in the MRC method, a plurality of “foreground P (non-pattern part color image P) + mask Q (non-pattern part shape image Q)” are superimposed on one background. It is common to go. The mask Q has shape information for cutting out the foreground P (for example, in the form of characters). In some cases, the value of the mask Q has the transmittance of the foreground P (the mixing ratio of the foreground and the background Q (the pattern portion image Q)). Further, the resolution of the foreground P, the mask Q, and the background R can be arbitrarily selected.

図５は、ＭＲＣモデルによる符号構成の説明図である。ＭＲＣを用いた符号フォーマットは、ＭＲＣ符号であること等を示す全体用ヘッダと、１つの背景符号およびそのヘッダと、それに重ね合わせる１つ若しくは複数の「前景符号とマスク符号のペア」およびそのペア用のヘッダで構成されるのが典型である。 FIG. 5 is an explanatory diagram of a code configuration based on the MRC model. The code format using MRC is an overall header indicating that it is an MRC code, etc., one background code and its header, and one or more “pairs of foreground codes and mask codes” and their pairs. It is typically composed of headers for

例えば、本実施の形態では、背景、前景、およびマスクの符号化方式として、ＪＰＥＧ
２０００を選択可能にしたＪＰＭ（ＪＰＥＧ２０００ＭｕｌｔｉＬａｙｅｒ）を用いる。もちろん、ＪＰＭでは、ＪＰＥＧ２０００以外の符号化方式としてＪＰＥＧ、ＭＭＲ、ＪＢＩＧ等も選択可能であり、そのＪＰＭ符号のフォーマットも、図５に示すような符号の構成をとる。 For example, in this embodiment, JPEG is used as the background, foreground, and mask encoding method.
JPM (JPEG2000 Multi Layer) which makes 2000 selectable is used. Of course, in JPM, JPEG, MMR, JBIG, or the like can be selected as an encoding method other than JPEG2000, and the format of the JPM code also has a code configuration as shown in FIG.

例えば、前記ＭＲＣモデルの前景、マスク、背景の圧縮方式としてＪＰＥＧ２０００の選択を可能にしたＪＰＭという符号化方式が標準化されている。 For example, an encoding method called JPM that enables selection of JPEG2000 as a foreground, mask, and background compression method of the MRC model has been standardized.

図６は、ＪＰＭ符号の構成例の説明図である。なお、点線部分はオプションであるの
で、実線部分を中心に簡単に説明する。図６において、「JPEG2000 Signature box」は
、当該符号がＪＰＥＧ２０００ファミリーに属することを示す全体のヘッダである。「Fi
le Type box」は、当該符号がＪＰＭフォーマットであることを示す全体のヘッダである
。「Compound Image Header box」は、当該符号の全般的な情報を含む全体のヘッダであ
る。「Page Collection box」は、当該符号がマルチページからなる場合に各ページの順
番を示す目次的なものである。「Page box」は、ページの解像度等を示す全体のヘッダで
ある。ここで、ページとは、画像を順次重ねて（合成して）いくためのキャンバスであり
、合成が終った後の画像と同じ大きさを持つ。ＪＰＭの場合、ページには、前景とマスク
のペアで構成される「layout object（レイアウト・オブジェクト）」が順次描画される
。「Layout Object box」は、前景とマスクのサイズや位置等を示す、前景及びマスク用
のヘッダである。「Media Data box」や「Contiguous Codestream box」は、前景やマス
クの符号を含む部分である。なお、ＪＰＭでは、背景（BasePage）は、レイアウト・オブ
ジェクトが描画される前の初期的なページとして扱われる。 FIG. 6 is an explanatory diagram of a configuration example of a JPM code. Since the dotted line portion is an option, a simple description will be given focusing on the solid line portion. In FIG. 6, “JPEG2000 Signature box” is an entire header indicating that the code belongs to the JPEG2000 family. "Fi
“Le Type box” is an entire header indicating that the code is in JPM format. The “Compound Image Header box” is an entire header including general information of the code. “Page Collection box” is a table of contents indicating the order of each page when the code consists of multiple pages. “Page box” is an overall header indicating the resolution of the page. Here, the page is a canvas for sequentially superimposing (combining) images, and has the same size as the image after the composition is finished. In the case of JPM, “layout object (layout object)” composed of a pair of foreground and mask is sequentially drawn on the page. “Layout Object box” is a header for the foreground and the mask indicating the size and position of the foreground and the mask. The “Media Data box” and “Contiguous Codestream box” are portions including foreground and mask codes. In JPM, the background (BasePage) is handled as an initial page before the layout object is drawn.

なお、ＪＰＭでは、上述したように、背景に対して前景とマスクとのペアとして定義さ
れるレイアウト・オブジェクトを順次重ねて画像を合成する処理において、前景と背景と
を合成する方法として、
（ｉ）前景か背景のいずれかを選択する方法
（ｉｉ）前景の値と背景の値の加重平均をとる方法
が選択可能である。（ｉ）の場合には、マスクを２値とし、その値が１のときに前景を選
択し、０のときに背景を選択するものである。また、（ｉｉ）の場合には、マスクに８ビ
ットの正の値を持たせ、以下に示す式（１）により前景と背景の加重平均により合成するものである。 In JPM, as described above, as a method of combining the foreground and the background in the process of combining the images by sequentially overlaying the layout objects defined as the foreground and mask pairs on the background,
(I) Method for selecting either foreground or background (ii) A method for taking a weighted average of foreground values and background values can be selected. In the case of (i), the mask is binary, and when the value is 1, the foreground is selected, and when the value is 0, the background is selected. In the case of (ii), the mask has a positive value of 8 bits and is synthesized by the weighted average of the foreground and the background according to the following equation (1).

合成画像=(マスク値/２５５)×前景+{(２５５-マスク値)/２５５}×背景（１）
これらのいずれの合成方法をとるかは、前景とマスクのペアごとに指定可能であり、各ペア用のヘッダに記載する。 Composite image = (mask value / 255) × foreground + {(255−mask value) / 255} × background (1)
Which of these synthesis methods is used can be specified for each pair of foreground and mask, and is described in the header for each pair.

さて、以上のＭＲＣモデルでの符号化では、高圧縮率化のために、前景や背景を高い圧縮率(例えば１／４０)で圧縮する。こうした高い圧縮率下では、量子化の巧拙が画質に大きく影響するため、前景や背景を圧縮する際に適用する量子化については、最適化の必要がある。
ここで、ＭＲＣ符号の作成法としては、前景画像を減色し、前景を数色で（極端な場合には一色で）表現する方法が典型的である。こうした減色を用いれば、前景画像は人工画像的になり、人工画像用の符号化方式を用いて高い圧縮率を得ることができる。 In the encoding with the above MRC model, the foreground and the background are compressed at a high compression rate (for example, 1/40) in order to increase the compression rate. Under such a high compression rate, the skill of quantization greatly affects the image quality, so the quantization applied when compressing the foreground and background needs to be optimized.
Here, as a method of creating the MRC code, a method of reducing the foreground image and expressing the foreground in several colors (in the extreme case, one color) is typical. If such color reduction is used, the foreground image becomes an artificial image, and a high compression rate can be obtained by using an encoding method for the artificial image.

しかしながら、減色を行う方式は、減色のための処理時間を要し、かつ減色のための画質劣化を生じる。この減色による画質劣化を防ぐためには、一色で表現する領域を狭くする必要があるが、そのためには前景画像を多くの部分画像に分割する必要が生じ、いずれにしても処理時間が必要となる（そもそもＭＲＣモデルでは、前景・背景の生成のために多くの処理時間を必要とするため、符号化にかける時間は極力少なくしたいという事情がある）。また、前景と背景の分離を完全に行うことは困難であり、本来背景であるべき部分が前景として分離された場合、これが減色されてしまうと、画像欠陥として認識されてしまうという問題もある。 However, the method of performing color reduction requires processing time for color reduction and causes image quality deterioration for color reduction. In order to prevent deterioration in image quality due to this color reduction, it is necessary to narrow the area represented by one color. For this purpose, it is necessary to divide the foreground image into many partial images, and in any case, processing time is required. (In the first place, since the MRC model requires a lot of processing time to generate the foreground / background, there is a circumstance that it is desired to reduce the time required for encoding as much as possible). In addition, it is difficult to completely separate the foreground and the background, and when a portion that should originally be the background is separated as the foreground, if the color is reduced, it is recognized as an image defect.

したがって、ＭＲＣ符号を高速に作成しようと思えば、前景画像をあまり減色・分割することなく、高速かつ高い圧縮率で符号化できる方法が必要である。 Therefore, if an MRC code is to be created at high speed, there is a need for a method that can encode the foreground image at a high speed and with a high compression rate without reducing or dividing the foreground image much.

このように前景画像の減色を省略する場合には、スキャンによってノイズが重畳された状態の前景画像を圧縮する必要があるため、人工画像用の符号化方式で高い圧縮率を得ることはできない。従って減色を省略して高い圧縮率を得るためには、前景・背景の両方に、周波数変換を利用した方式（自然画像用の符号化方式）を適用することになる。 Thus, when omitting the color reduction of the foreground image, it is necessary to compress the foreground image in a state where noise is superimposed by scanning, and thus a high compression rate cannot be obtained with the coding method for artificial images. Therefore, in order to obtain a high compression ratio by omitting color reduction, a method using frequency conversion (encoding method for natural images) is applied to both the foreground and the background.

さて、減色を行わない以上（＝高圧縮であるが画質・処理速度は維持したい）、前景画像の量子化については最適化を行いたいところであるが、減色を行わない理由が高速化であるため、最適化に処理時間をかけることはできない。例えば、画像の特徴量や内容を解析し、それに応じて符号化方式や量子化方式を選択する方法は知られているが、こうした解析は一般に処理量が多く、採用することは困難である。 As long as color reduction is not performed (= high compression but we want to maintain image quality and processing speed), we would like to optimize the foreground image quantization, but the reason for not performing color reduction is speeding up. The processing time cannot be spent on optimization. For example, there is known a method of analyzing feature amounts and contents of an image and selecting an encoding method or a quantization method according to the analysis. However, such analysis generally has a large processing amount and is difficult to adopt.

そこで出願人は、減色を行わない前景画像を高速かつ最適に符号化するに際し、ＭＲＣモデルの持つ構造・機能に着目した。すなわち、前景に対して周波数変換と量子化を用いた符号化を採用する場合、「前景は色」というレイヤ自体が持つ機能を考慮すると、画像の内容を解析することなしに、該機能の差を量子化に反映することができるのである。しかしながら(発明者の知る限り)これまでは、前景に対して、こうした機能を反映した量子化はなされてこなかった。 Therefore, the applicant paid attention to the structure and function of the MRC model when encoding the foreground image without color reduction at high speed and optimally. In other words, when the encoding using frequency transformation and quantization is adopted for the foreground, considering the function of the layer itself “foreground is color”, the difference between the functions is not analyzed without analyzing the contents of the image. Can be reflected in quantization. However, so far (as far as the inventors know), foreground has not been quantized to reflect these functions.

なお、ブロック図において同じ機能を持つ構成部やフローチャート図において同じ処理を行う過程には同じ番号を付し、重複説明を省略する。 In addition, the same number is attached | subjected to the process which performs the same process in the structure part and flowchart figure which have the same function in a block diagram, and duplication description is abbreviate | omitted.

また、以下の説明では、非絵柄とは、絵柄ではないもの、例えば、文字、線図、記号などを示す。
［全体的な流れ］
まず、全体的な流れについて説明する。図７は、本実施の形態にかかるＰＣのハードウェア構成を示すブロック図である。本実施の形態にかかるＰＣ１００は、ＣＰＵ（Central Processing Unit）１０１、ＲＡＭ（Random Access Memory）１０２、ＨＤＤ（Hard Disk Drive）１０３等をシステムバス１０４により相互接続した一般的なパーソナルコンピュータである。 Further, in the following description, the non-picture refers to something that is not a picture, for example, a character, a diagram, and a symbol.
[Overall flow]
First, the overall flow will be described. FIG. 7 is a block diagram showing a hardware configuration of the PC according to the present embodiment. A PC 100 according to this embodiment is a general personal computer in which a CPU (Central Processing Unit) 101, a RAM (Random Access Memory) 102, an HDD (Hard Disk Drive) 103, and the like are interconnected by a system bus 104.

ＣＰＵ１０１は、ＰＣ１００の全体を制御するためのマイクロプロセッサである。ＲＡＭ１０２は、各種データを書換え可能に記憶する性質を有していることから、ＣＰＵ１０１の作業エリアとして機能してバッファ等の役割を果たす。ＨＤＤ１０３は、画像やＣＰＵ１０１により実行される各種プログラムを記憶する情報記録媒体である。 The CPU 101 is a microprocessor for controlling the entire PC 100. Since the RAM 102 has the property of storing various data in a rewritable manner, it functions as a work area for the CPU 101 and functions as a buffer. The HDD 103 is an information recording medium that stores images and various programs executed by the CPU 101.

なお、情報記録媒体としては、ＨＤＤ１０３のみならずＣＤ（Compact Disk）−ＲＯＭ、ＤＶＤ（Digital Versatile Disk）などの各種の光ディスク、各種光磁気ディスク、フレキシブルディスクなどの各種磁気ディスク等、半導体メモリ等の各種方式のメディアを用いることができる。また、図示しない外部Ｉ／Ｆ装置を介してインターネットなどのネットワークからプログラムをダウンロードし、ＨＤＤ１０３にインストールするようにしてもよい。この場合に、送信側のサーバでプログラムを記憶している記憶装置も、この発明における情報記録媒体である。なお、プログラムは、所定のＯＳ（Operating System）上で動作するものであってもよいし、その場合に後述する各種処理の一部の実行をＯＳに肩代わりさせるものであってもよいし、所定のアプリケーションソフトやＯＳなどを構成する一群のプログラムファイルの一部として含まれるものであってもよい。 Information recording media include not only the HDD 103 but also various optical disks such as CD (Compact Disk) -ROM and DVD (Digital Versatile Disk), various magnetic disks such as various magneto-optical disks and flexible disks, semiconductor memories, and the like. Various types of media can be used. Alternatively, the program may be downloaded from a network such as the Internet via an external I / F device (not shown) and installed in the HDD 103. In this case, the storage device that stores the program in the server on the transmission side is also the information recording medium in the present invention. Note that the program may operate on a predetermined OS (Operating System), in which case the OS may perform execution of some of various processes described later, It may be included as a part of a group of program files constituting the application software or OS.

このシステム全体の動作を制御するＣＰＵ１０１は、このシステムの主記憶装置として使用されるＨＤＤ１０３に記憶されたプログラムに基づいて各種処理を実行する。 The CPU 101 that controls the operation of the entire system executes various processes based on programs stored in the HDD 103 used as the main storage device of the system.

次に、ＰＣ１００のＨＤＤ１０３に記憶されているプログラムがＣＰＵ１０１に実行させる機能のうち、本実施の形態のＰＣ１００が備える特長的な機能について説明する。次に、図７を用いて、ＰＣ１００が備える特長的な機能の概略について説明する。（１）ＨＤＤ１０３より処理対象となる原画像がＣＰＵ１０１からの命令によってＲＡＭ１０２に読み込まれる。（２）ＣＰＵ１０１は、ＲＡＭ１０２上の原画像を読み込み、ＭＲＣモデルを用いた符号化を行う。（３）ＣＰＵ１０１は、符号化されたデータをＲＡＭ１０２上の別の領域に書き込む。（４）ＣＰＵ１０１からの命令によってその符号化されたデータがＨＤＤ１０３に保存される。
［実施形態１］
次に、実施形態１のＣＰＵ１０１内の機能構成例を図８に示す。また、ＣＰＵ１０１の処理の流れを図９に示す。 Next, of the functions that the program stored in the HDD 103 of the PC 100 causes the CPU 101 to execute, the characteristic functions provided in the PC 100 of the present embodiment will be described. Next, an outline of characteristic functions of the PC 100 will be described with reference to FIG. (1) An original image to be processed is read from the HDD 103 into the RAM 102 according to a command from the CPU 101. (2) The CPU 101 reads the original image on the RAM 102 and performs encoding using the MRC model. (3) The CPU 101 writes the encoded data in another area on the RAM 102. (4) The encoded data is stored in the HDD 103 according to a command from the CPU 101.
[Embodiment 1]
Next, a functional configuration example in the CPU 101 of the first embodiment is shown in FIG. FIG. 9 shows the flow of processing of the CPU 101.

図８の例では、実施形態１のＣＰＵ１０１は、生成部２、ＤＣレベルシフト部４、周波数変換部８、量子化部１０、ビットプレーン符号化部１２、パケット生成部１４とを含む。また、原画像を復号化する場合には、図８中の矢印の順番で、処理が行われるが、符号化された原画像を復号化する場合には、矢印と逆向き（つまり、パケット生成部１４）から順番に行われる。また、本実施例ではＪＰＥＧ２０００を使用する例を説明する。 In the example of FIG. 8, the CPU 101 of the first embodiment includes a generation unit 2, a DC level shift unit 4, a frequency conversion unit 8, a quantization unit 10, a bit plane encoding unit 12, and a packet generation unit 14. Further, in the case of decrypt the original image by an arrow order in FIG. 8, the processing is performed when decoding encoded original image, the arrow direction opposite (i.e., packets It is performed in order from the generation unit 14). In this embodiment, an example in which JPEG2000 is used will be described.

まず、符号化の対象である原画像αが入力部（図示せず）に入力される（ステップＳ２）。入力された原画像αは生成部２に入力される。生成部２により原画像αから生成された画像について、図１０に示す。生成部２は、原画像αから、該原画像の絵柄が表示される絵柄部画像Ｒ（「背景Ｒ」ともいう。）と、該原画像αの非絵柄の色を表す非絵柄部色画像Ｐ'（「前景Ｐ'」ともいう。）と、非絵柄部形状画像（「マスクＱ」ともいう。）とを生成する。更に、生成部２は、第２マスクＶと第２背景Ｗとにを生成する。 First, an original image α to be encoded is input to an input unit (not shown) (step S2). The input original image α is input to the generation unit 2. An image generated from the original image α by the generation unit 2 is shown in FIG. The generation unit 2 generates, from the original image α, a pattern portion image R (also referred to as “background R”) on which the pattern of the original image is displayed, and a non-pattern portion color image representing the color of the non-pattern of the original image α. P ′ (also referred to as “foreground P ′”) and a non-picture part shape image (also referred to as “mask Q”) are generated. Further, the generation unit 2 generates a second mask V and a second background W.

次に生成部２の処理の詳細について説明する。生成部２は、ステップＳ２で入力された原画像に対し、画素単位で、文字・線画を構成する画素(以下、「文字画素」という。)であるかどうかが判別する。文字画素は、例えば、図２、図４の文字Ａ１、文字Ａ２を示す。 Next, details of the processing of the generation unit 2 will be described. The generation unit 2 determines whether or not the original image input in step S2 is a pixel (hereinafter referred to as “character pixel”) constituting a character / line drawing in units of pixels. The character pixel indicates, for example, the characters A1 and A2 in FIGS.

該判別は公知の像域判別技術によって行われ、例えば、原画像の各画素に対し、エッジ検出オペレータとして周知のＳｏｂｅｌフィルタを作用させる。Ｓｏｂｅｌフィルタは、注目画素を中心とした上下左右の９つの画素に対して、図１１に示す第１の重みマトリクスを乗算して、その和ＨＳを算出し、同様に図１２に示す第２のマトリクスを乗算してその和ＶＳを算する。そして、（ＨＳ^２＋ＶＳ^２）の平方根をフィルタの出力値とする。 The discrimination is performed by a known image area discrimination technique. For example, a Sobel filter known as an edge detection operator is applied to each pixel of the original image. The Sobel filter multiplies the first weight matrix shown in FIG. 11 by multiplying the top, bottom, left, and right nine pixels centered on the pixel of interest to calculate the sum HS. Similarly, the second filter shown in FIG. Multiply the matrix and calculate its sum VS. Then, the square root of (HS ² + VS ² ) is used as the output value of the filter.

該フィルタ出力値が例えば３０以上の場合に、注目画素が文字画素であると判断し、該注目画素位置の値を１にし、それ以外の画素の値を０にすることで、マスクＱ（非絵柄部形状画像Ｑ）を生成する(二値画像の場合、黒＝１、白＝０が通例である)(ステップＳ４)。 For example, when the filter output value is 30 or more, it is determined that the target pixel is a character pixel, the value of the target pixel position is set to 1, and the values of the other pixels are set to 0. The pattern part shape image Q) is generated (in the case of a binary image, black = 1 and white = 0 are common) (step S4).

また、文字・線画領域に属さない画素(以下、非文字画素)の色を、固定値(１２８、１２８、１２８)で置換し、前景Ｐ（非絵柄部色画像Ｐ）を生成する（ステップＳ６）。本実施例では上述のように、前景Ｐの生成手法に第２の手法を用いるからである。 Also, the color of pixels that do not belong to the character / line drawing area (hereinafter, non-character pixels) is replaced with a fixed value (128, 128, 128) to generate a foreground P (non-picture part color image P) (step S6). ). This is because the second method is used as the foreground P generation method in the present embodiment as described above.

ここで、上述したように、非絵柄部色画像Ｐは、原画像の非絵柄の色画像（図１０の色Ｓ、Ｔの箇所）および該非絵柄の色画像以外の位置に占める特定画像Ｃにより構成される。 Here, as described above, the non-pattern part color image P is composed of the non-pattern color image of the original image (the locations of the colors S and T in FIG. 10) and the specific image C that occupies a position other than the non-pattern color image. Composed.

次に、原画像αおける文字・線画領域の画素の色を、当該画素の最も近傍に位置する非文字画素の色で置換し、絵柄部画像Ｒ（背景Ｒ）を生成する(ステップＳ８)。 Next, the color of the pixel in the character / line drawing area in the original image α is replaced with the color of the non-character pixel located closest to the pixel to generate a picture portion image R (background R) (step S8).

続いて、原画像と同じサイズの画素値１の画像として第２マスクＶを作成し（ステップＳ１０）する。原画像と同じサイズの白い画像として第２背景画像Ｗを作成する（ステップＳ１２）。 Subsequently, a second mask V is created as an image having a pixel value 1 having the same size as the original image (step S10). A second background image W is created as a white image having the same size as the original image (step S12).

なお、ステップＳ４からステップＳ１２までの処理の流れはこれらに限られない。 Note that the flow of processing from step S4 to step S12 is not limited to these.

このうち、非絵柄部色画像Ｐ'と、絵柄部画像Ｒとは、ＤＣレベルシフト部４に入力される。
＜ＤＣレベルシフト部４の処理＞
次にＤＣレベルシフト部４の処理については説明する。本実施例では、「ＤＣレベルシフト部４により、画像信号に対して、色変換をしない実施形態１」と「ＤＣレベルシフト部４により、画像信号に対して、色変換をする実施形態２」を説明する。段落＜ＤＣレベルシフト部４の処理＞では、色変換の処理についても説明する。また、色変換した後の輝度信号をＹとし、色差信号をＣｂ、Ｃｒとする。 Among these, the non-picture part color image P ′ and the picture part image R are input to the DC level shift unit 4.
<Processing of DC Level Shift Unit 4>
Next, the processing of the DC level shift unit 4 will be described. In the present embodiment, “embodiment 1 in which color conversion is not performed on the image signal by the DC level shift unit 4” and “embodiment 2 in which color conversion is performed on the image signal by the DC level shift unit 4”. Will be explained. In the paragraph <Processing of DC level shift unit 4>, color conversion processing is also described. The luminance signal after color conversion is Y, and the color difference signals are Cb and Cr.

まず、画像は矩形のタイルに分割される（分割数≧１）。次に、タイルは、例えば、ＲＧＢの３コンポーネントで構成されるカラー画像を符号化する場合、ＤＣレベルシフトと輝度・色差のコンポーネントへの色変換が施される。より厳密には、色変換を施す前に、ＤＣレベルシフトを行う。ＤＣレベルシフトは、入力される画像信号がＲＧＢ信号値のような正の数である場合に所定の変換式を用いて各信号値から信号のダイナミックレンジの１／２を減算するレベルシフト処理を行い、色変換ではＲＧＢ画像を輝度信号（画像）Ｙ、色差信号Ｃｒ、Ｃｂ画像に変換することによりカラー画像の圧縮効率を高める処理を行うことができる。 First, the image is divided into rectangular tiles (number of divisions ≧ 1). Next, for example, when a color image composed of three RGB components is encoded, the tile is subjected to DC level shift and color conversion to luminance / color difference components. More precisely, DC level shift is performed before color conversion. DC level shift is a level shift process that subtracts ½ of the dynamic range of a signal from each signal value using a predetermined conversion formula when the input image signal is a positive number such as an RGB signal value. In the color conversion, the RGB image is converted into the luminance signal (image) Y and the color difference signals Cr and Cb images, whereby the processing for increasing the compression efficiency of the color image can be performed.

ＤＣレベルシフトは、画像信号がＲＧＢ信号値のような正の数（符号なし整数）である場合には、式（１）に示すように順変換では各信号値から信号のダイナミックレンジの半
分を減算するレベルシフトを、逆変換では各信号値に信号のダイナミックレンジの半分を
加算するレベルシフトを行う。但し、レベルシフトは画像信号がＹＣｂＣｒ信号における
ＣｂやＣｒのような符号付き整数の場合には適用しない。 In the DC level shift, when the image signal is a positive number (unsigned integer) such as an RGB signal value, as shown in Expression (1), half of the dynamic range of the signal is obtained from each signal value by forward conversion. In the inverse conversion, the level shift for subtracting is performed by adding half of the dynamic range of the signal to each signal value. However, the level shift is not applied when the image signal is a signed integer such as Cb or Cr in the YCbCr signal.

ＪＰＥＧ２０００のＤＣレベルシフトの順変換および逆変換に用いる式（１）は以下の
通りである。 Equation (1) used for forward and reverse conversion of JPEG2000 DC level shift is as follows.

Ｉ(ｘ，ｙ)←Ｉ(ｘ，ｙ)-２^{Ｓｓｉｚ(ｉ)}・・・順変換
（１）
Ｉ(ｘ，ｙ)←Ｉ(ｘ，ｙ)＋２^{Ｓｓｉｚ(ｉ)} ・・・逆変換
ここで、Ｓｓｉｚ(ｉ)とは画像の各コンポーネントi（ＲＧＢ画像ならｉ＝０，１，２）のビット深さから１を減じたものである。Ｉ(ｘ，ｙ)は座標(ｘ，ｙ)における信号値（画素値）である。 I (x, y) ← I (x, y) -2 ^{Ssiz (i)} ... forward conversion
(1)
I (x, y) ← I (x, y) +2 ^{Ssiz (i)} ... Inverse transformation Here, Ssiz (i) is an image component i (i = 0, 1, 2 for RGB image). This is the bit depth minus one. I (x, y) is a signal value (pixel value) at coordinates (x, y).

色変換は、カラー画像を効率良く圧縮するために行う。これはＪＰＥＧでもＲＧＢ画像
を輝度画像Ｙ、色差画像Ｃｒ、Ｃｂに変換後に圧縮を行うと圧縮率が向上し、結果的には再生画質が良くなることと目的は同じである。色変換には、可逆と非可逆の２つの方法が定義されている。 Color conversion is performed in order to efficiently compress a color image. The purpose of this is the same as that in JPEG, if the RGB image is converted into the luminance image Y and the color difference images Cr and Cb and then compressed, the compression rate is improved and as a result the reproduction image quality is improved. Two methods of reversible and irreversible are defined for color conversion.

可逆変換は、ＲＣＴ（Reversible multiple component transformation）と呼ばれ、変
換式の係数が整数値であることが特徴である。この変換式を式（２）に示す。 The reversible transformation is called RCT (Reversible multiple component transformation) and is characterized in that the coefficient of the transformation formula is an integer value. This conversion formula is shown in Formula (2).

Ｙ０(ｘ，ｙ)＝ｆｌｏｏｒ（(Ｉ０(ｘ，ｙ)＋２×(Ｉ１(ｘ，ｙ)＋Ｉ２(ｘ，ｙ))／４)
Ｙ１(ｘ，ｙ)＝Ｉ２(ｘ，ｙ)−Ｉ１(ｘ，ｙ) 順変換
Ｙ２(ｘ，ｙ)＝Ｉ０(ｘ，ｙ)−Ｉ１(ｘ，ｙ)
（２）
Ｉ１(ｘ，ｙ)＝Ｙ０(ｘ，ｙ)−ｆｌｏｏｒ((Ｙ２(ｘ，ｙ)＋Ｙ１(ｘ，ｙ))／４)
Ｉ０(ｘ，ｙ)＝Ｙ２(ｘ，ｙ)＋Ｉ１{ｘ，ｙ) 逆変換
Ｉ２(ｘ，ｙ)＝Ｙ１(ｘ，ｙ)＋Ｉ１[ｘ，ｙ)
なお、Iは原信号を示し、Ｙは変換後の信号を示している。また、ＩやＹに続く０〜２はサフィクスである。具体的には、ＲＧＢ信号を可逆変換する場合、I信号においてはＩ０＝Ｒ，Ｉ１＝Ｇ，Ｉ２＝Ｂと表され、Ｙ信号においてはＹ０＝Ｙ，Ｙ１＝Ｃｂ，Ｙ２＝Ｃｒと表される。さらに、ｆｌｏｏｒ(ｘ)は、ｘのフロア関数（実数ｘを、ｘを越えず、かつｘに最も近い整数に置換する関数）を示している。 Y0 (x, y) = floor ((I0 (x, y) + 2 × (I1 (x, y) + I2 (x, y)) / 4)
Y1 (x, y) = I2 (x, y) -I1 (x, y) Forward conversion Y2 (x, y) = I0 (x, y) -I1 (x, y)
(2)
I1 (x, y) = Y0 (x, y) -floor ((Y2 (x, y) + Y1 (x, y)) / 4)
I0 (x, y) = Y2 (x, y) + I1 {x, y) Inverse transformation I2 (x, y) = Y1 (x, y) + I1 [x, y)
Note that I indicates an original signal, and Y indicates a signal after conversion. In addition, 0 to 2 following I and Y are suffixes. Specifically, when reversibly converting an RGB signal, the I signal is represented as I0 = R, I1 = G, I2 = B, and the Y signal is represented as Y0 = Y, Y1 = Cb, Y2 = Cr. The Further, floor (x) represents a floor function of x (a function that replaces the real number x with an integer that does not exceed x and is closest to x).

非可逆変換はＩＣＴ（Irreversible multiple component transformation）と呼ばれ、変換式の係数がＲＣＴと異なり実数値であることが特徴である。この変換式を式（３）に
示す。 The irreversible transformation is called ICT (Irreversible multiple component transformation), and is characterized in that the coefficient of the transformation formula is a real value unlike the RCT. This conversion formula is shown in Formula (3).

Ｙ０(ｘ，ｙ)＝０．２９９×Ｉ０(ｘ，ｙ)＋０．５８７×Ｉ１(ｘ，ｙ)＋０．１４４×Ｉ２(ｘ，ｙ)
Ｙ１(ｘ，ｙ)＝−０．１６８７５×Ｉ０(ｘ，ｙ)−０．３３１２６×１(ｘ，ｙ)＋０．５＊Ｉ２(ｘ，ｙ) 順変換
Ｙ２(ｘ，ｙ)＝０．５×Ｉ０(ｘ，ｙ)−０．４１８６９×Ｉ１(ｘ，ｙ)−０．０８１３１×Ｉ２(ｘ，ｙ)
（３）
Ｉ０(ｘ，ｙ)＝Ｙ０(ｘ，ｙ)＋１．４０２×Ｙ２(ｘ，ｙ)
Ｉ１(ｘ，ｙ)＝Ｙ０(ｘ，ｙ)−０．３４４１３×Ｙ１(ｘ，ｙ)−０．７１４１４×Ｙ２(ｘ，ｙ) 逆変換
Ｉ２(ｘ，ｙ)＝Ｙ０(ｘ，ｙ)＋１．７７２×Ｙ１(ｘ，ｙ)
なお、Iは原信号を示し、Yは変換後の信号を示している。また、ＩやＹに続く０〜２はサフィクスである。具体的には、ＲＧＢ信号を非可逆変換する場合、I信号においては、Ｉ０＝Ｙ，Ｉ１＝Ｇ，Ｉ２＝Ｂと表され、Ｙ信号においてはＩ０＝Ｙ,Ｉ１＝Ｃｂ,Ｉ２＝Ｃｒと表される。
＜周波数変換部８について＞
次に周波数変換部８の処理について説明する。周波数変換部８は、絵柄部画像Ｒ、非絵柄部色画像Ｐ、特定画像Ｃを周波数変換する。周波数変換後は、絵柄部画像Ｒの周波数成分ｆ_Ｒ、および非絵柄部色画像Ｐの周波数成分ｆ_Ｐが出力される。周波数変換部８の処理の詳細について説明する。 Y0 (x, y) = 0.299 × I0 (x, y) + 0.587 × I1 (x, y) + 0.144 × I2 (x, y)
Y1 (x, y) = − 0.16875 × I0 (x, y) −0.33126 × 1 (x, y) + 0.5 * I2 (x, y) Forward conversion Y2 (x, y) = 0. 5 × I0 (x, y) −0.41869 × I1 (x, y) −0.0811 × I2 (x, y)
(3)
I0 (x, y) = Y0 (x, y) + 1.402 × Y2 (x, y)
I1 (x, y) = Y0 (x, y) −0.34413 × Y1 (x, y) −0.71414 × Y2 (x, y) Inverse transformation I2 (x, y) = Y0 (x, y) + 1.772 × Y1 (x, y)
Note that I indicates an original signal, and Y indicates a signal after conversion. In addition, 0 to 2 following I and Y are suffixes. Specifically, when irreversibly converting an RGB signal, the I signal is expressed as I0 = Y, I1 = G, I2 = B, and the Y signal is expressed as I0 = Y, I1 = Cb, I2 = Cr. expressed.
<About the frequency converter 8>
Next, processing of the frequency conversion unit 8 will be described. The frequency conversion unit 8 performs frequency conversion on the pattern portion image R, the non-pattern portion color image P, and the specific image C. After the frequency conversion, the frequency component f _R of the pattern part image R and the frequency component f P of the non-pattern part color image _P are output. Details of the processing of the frequency converter 8 will be described.

タイル毎に、色変換後のコンポーネント（タイルコンポーネントと呼ばれる）は、周波数変換部８によって周波数変換される。この実施例では、周波数変換の手法としてウェーブレット変換を用いる例を説明する。ウェーブレット変換によって、ＬＬ，ＨＬ，ＬＨ，ＨＨと略称される４つのサブバンドに分割される。そしてＬＬサブバンドに対して再帰的にウェーブレット変換（以下、「デコンポジション」という場合もある。）を繰返すと、最終的に１つのＬＬサブバンドと複数のＨＬ，ＬＨ，ＨＨサブバンドが生成される。ここで、ＪＰＥＧ２０００で採用されている２次元ウェーブレット変換について説明する。 For each tile, the component after color conversion (referred to as a tile component) is frequency-converted by the frequency converter 8. In this embodiment, an example in which wavelet transformation is used as a frequency transformation technique will be described. By wavelet transform, it is divided into four subbands abbreviated as LL, HL, LH, and HH. When the wavelet transform (hereinafter also referred to as “decomposition”) is repeated recursively on the LL subband, one LL subband and a plurality of HL, LH, and HH subbands are finally generated. The Here, the two-dimensional wavelet transform adopted in JPEG2000 will be described.

色変換後、施される２次元ウェーブレット変換では、５＊３フィルタを施す５＊３ウェーブレット変換と、９＊７フィルタを施す９＊７ウェーブレット変換を選択することができる。５＊３ウェーブレット変換とは、５画素を用いて１つのローパスフィルタの出力（ローパス係数）が得られ、３画素を用いて１つのハイパスフィルタの出力（ハイパス係数）が得られる変換である。 In the two-dimensional wavelet transform to be performed after the color conversion, a 5 * 3 wavelet transform for applying a 5 * 3 filter and a 9 * 7 wavelet transform for applying a 9 * 7 filter can be selected. The 5 * 3 wavelet transform is a transform that obtains one low-pass filter output (low-pass coefficient) using five pixels and obtains one high-pass filter output (high-pass coefficient) using three pixels.

同様に、９＊７ウェーブレット変換とは、９画素を用いて１つのローパスフィルタの出力（ローパス係数）が得られ、７画素を用いて１つのハイパスフィルタの出力（ハイパス係数）が得られる変換である。主な違いは、フィルタの範囲の違いであり、偶数位置中心にローパスフィルタ、奇数位置中心にハイパスフィルタが施されるのは同様である。９＊７フィルタにも同様に当てはまる。 Similarly, the 9 * 7 wavelet transform is a conversion in which 9 pixels are used to obtain one low-pass filter output (low-pass coefficient), and 7 pixels are used to obtain one high-pass filter output (high-pass coefficient). is there. The main difference is the difference in the filter range. The same applies to the low pass filter at the center of the even position and the high pass filter at the center of the odd position. The same applies to 9 * 7 filters.

５＊３ウェーブレット変換の変換式を式（４）に示す。 Equation (4) shows the 5 * 3 wavelet transform.

順変換
Ｃ(２ｉ＋１)＝Ｐ(２ｉ＋１)−ｆｌｏｏｒ((Ｐ(２ｉ)＋Ｐ(２ｉ＋２))／２) [step1]
Ｃ(２ｉ)＝Ｐ(２ｉ)＋ｆｌｏｏｒ((Ｃ(２ｉ−１)＋Ｃ(２ｉ＋１)＋２)／４) [step2]
逆変換（４）
Ｐ(２ｉ)＝Ｃ(２ｉ)−ｆｌｏｏｒ((Ｃ(２ｉ−１)＋Ｃ(２ｉ＋１)＋２)／４) [step3]
Ｐ(２ｉ＋１)＝Ｃ(２ｉ＋１)＋ｆｌｏｏｒ((Ｐ(２ｉ)＋Ｐ(２ｉ＋２))／２) [step4]
９＊７ウェーブレット変換の変換式を式（５）に示す。 Forward conversion C (2i + 1) = P (2i + 1) -floor ((P (2i) + P (2i + 2)) / 2) [step1]
C (2i) = P (2i) + floor ((C (2i-1) + C (2i + 1) +2) / 4) [step2]
Inverse transformation (4)
P (2i) = C (2i) -floor ((C (2i-1) + C (2i + 1) +2) / 4) [step3]
P (2i + 1) = C (2i + 1) + floor ((P (2i) + P (2i + 2)) / 2) [step4]
The conversion formula of the 9 * 7 wavelet transform is shown in Formula (5).

順変換
Ｃ(２ｎ＋１)＝Ｐ(２ｎ＋１)+α×(Ｐ(２ｎ)＋Ｐ(２ｎ＋２)) [step1]
Ｃ(２ｎ)＝Ｐ(２ｎ)＋β×(Ｃ(２ｎ−１)＋Ｃ(２ｎ＋１)) [step2]
Ｃ(２ｎ＋１)＝Ｃ(２ｎ＋１)＋γ×(Ｃ(２ｎ)＋Ｃ(２ｎ＋２)) [step3]
Ｃ(２ｎ)＝Ｃ(２ｎ)＋δ×(Ｃ(２ｎ−１)＋Ｃ(２ｎ＋１)) [step4]
Ｃ(２ｎ＋１)＝Ｋ×C(２n＋１) [step5]
Ｃ(２ｎ)＝(１／Ｋ)×Ｃ(２ｎ) [step6]
逆変換（５）
Ｐ(２ｎ)＝Ｋ×Ｃ(２ｎ) [step1]
Ｐ(２ｎ＋１)＝(１／Ｋ)×Ｃ(２n＋１) [step2]
Ｐ(２ｎ)＝Ｘ(２ｎ)−δ×(Ｐ(２ｎ−１)＋Ｐ(２ｎ＋１)) [step3]
Ｐ(２ｎ＋１)＝Ｐ(２ｎ＋１)−γ×(Ｐ(２ｎ)＋Ｐ(２ｎ＋２)) [step4]
Ｐ(２ｎ)＝Ｐ(２ｎ)−β×(Ｐ(２ｎ−１)＋Ｐ(２ｎ＋２)) [step5]
Ｐ(２ｎ)＝Ｐ(２ｎ＋１)-α×(Ｐ(２ｎ)＋Ｐ(２ｎ＋２)) [step6]
但し、α=-1.586134342059924
β=-0.052980118572961
γ=0.882911075530934
δ=0.443506852043971
K=1.230174104914001
次に、ウェーブレット変換の手順、並びにデコンポジションレベル、解像度レベル、サ
ブバンドの定義について説明する。 Forward conversion C (2n + 1) = P (2n + 1) + α × (P (2n) + P (2n + 2)) [step 1]
C (2n) = P (2n) + β × (C (2n−1) + C (2n + 1)) [step 2]
C (2n + 1) = C (2n + 1) + γ × (C (2n) + C (2n + 2)) [step 3]
C (2n) = C (2n) + δ × (C (2n−1) + C (2n + 1)) [step 4]
C (2n + 1) = K × C (2n + 1) [step5]
C (2n) = (1 / K) × C (2n) [step6]
Inverse transformation (5)
P (2n) = K × C (2n) [step1]
P (2n + 1) = (1 / K) × C (2n + 1) [step2]
P (2n) = X (2n) −δ × (P (2n−1) + P (2n + 1)) [step 3]
P (2n + 1) = P (2n + 1) −γ × (P (2n) + P (2n + 2)) [step 4]
P (2n) = P (2n) −β × (P (2n−1) + P (2n + 2)) [step 5]
P (2n) = P (2n + 1) −α × (P (2n) + P (2n + 2)) [step 6]
However, α = -1.586134342059924
β = -0.052980118572961
γ = 0.882911075530934
δ = 0.443506852043971
K = 1.230174104914001
Next, the procedure of wavelet transform and the definition of the decomposition level, resolution level, and subband will be described.

図１３は、１６×１６の画像の輝度コンポ−ネントに対して、ウェーブレット変換（５×３変換）を２次元(垂直方向および水平方向)で施す過程におけるＤＣレベルシフト後の原画像と座標系、ミラーリングを例示したものである。図１４は、上記同過程における垂直方向へのフィルタリング後の係数の配列を例示したものである。図１５は、上記同過程における水平方向へのフィルタリング後の係数の配列を例示したものである。図１６は、上記同過程における並べ替えた係数の配列を例示したものである。 FIG. 13 shows an original image and a coordinate system after DC level shift in the process of performing wavelet transform (5 × 3 transform) in two dimensions (vertical direction and horizontal direction) on the luminance component of a 16 × 16 image. This is an example of mirroring. FIG. 14 exemplifies the arrangement of coefficients after filtering in the vertical direction in the same process. FIG. 15 shows an example of the arrangement of coefficients after filtering in the horizontal direction in the same process. FIG. 16 shows an example of the rearranged coefficient array in the same process.

図１３を参照すれば、ｘｙ座標をとり、或るｘについて、ｙ座標がｙである画素の画素値をＰ（ｙ）（０≦ｙ≦１５）と表わす。ＪＰＥＧ２０００では、先ず垂直方向（Ｙ座標方向）にｙ座標が奇数（ｙ＝２ｉ＋１）の画素を中心にハイパスフィルタを施して係数Ｃ（２ｉ＋１）を取得し、次にｙ座標が偶数（ｙ＝２ｉ）の画素を中心にローパスフィルタを施して係数Ｃ（２ｉ）を取得するようにし、こうした処理を全てのｘについて行う。 Referring to FIG. 13, xy coordinates are taken, and for a certain x, the pixel value of a pixel whose y coordinate is y is expressed as P (y) (0 ≦ y ≦ 15). In JPEG2000, first, a high-pass filter is applied to a pixel whose y coordinate is odd (y = 2i + 1) in the vertical direction (Y coordinate direction) to obtain a coefficient C (2i + 1), and then the y coordinate is even (y = 2i). ) To obtain a coefficient C (2i) by applying a low-pass filter centering on the pixel of (), and such processing is performed for all x.

ここで、ハイパスフィルタ、ローパスフィルタは、順に上述したＳｔｅｐ１、Ｓｔｅｐ２の関係式で表わされる。尚、画像の端部にフィルタを施す場合には、中心となる画素に対して隣接画素が十分に存在しないことがあり、こうした場合はミラーリング（図１３参照）と呼ばれる手法によって適宜画素値を補うことになる。ミラーリングとは、画像端部の外側に仮想的な画素を想定し、画像端部の画素を対称軸として、端部の内側の画素値を、端部の外側の画素に対して線対称にコピーする周知の手法である。ミラーリングは画像(タイル)の上下左右の４端全てで行われる。 Here, the high-pass filter and the low-pass filter are expressed by the relational expressions of Step 1 and Step 2 described above in order. When a filter is applied to the edge of an image, there may be insufficient adjacent pixels with respect to the central pixel. In such a case, the pixel value is appropriately compensated by a technique called mirroring (see FIG. 13). It will be. Mirroring assumes a virtual pixel outside the edge of the image, and the pixel value inside the edge is copied symmetrically with respect to the pixel outside the edge, with the pixel at the edge of the image as the axis of symmetry. This is a well-known technique. Mirroring is performed at all four edges of the image (tile).

説明を簡単にするため、ハイパスフィルタで得られる係数をＨ、ローパスフィルタで得
られる係数をＬと表記すれば、垂直方向の変換によって図１３に示した画像は図１４に示されるようなＬ係数、Ｈ係数（周波数係数）の配列へと変換される。 For simplicity of explanation, if the coefficient obtained by the high-pass filter is denoted by H and the coefficient obtained by the low-pass filter is denoted by L, the image shown in FIG. 13 is converted to the L coefficient as shown in FIG. , Converted into an array of H coefficients (frequency coefficients).

次に、図１３に示されるような係数配列に対して、先ず水平方向にｘ座標が奇数（ｙ＝２ｉ＋１）の係数を中心にハイパスフィルタを施し、次にｘ座標が偶数（ｘ＝２ｉ）の係数を中心にローパスフィルタを施すようにし、これらの処理を全てのｙについて行う。この場合、第１ステップ、第２ステップのＰ（２ｉ）等は係数値を表わすものと読み替える。 Next, with respect to the coefficient array as shown in FIG. 13, first, a high-pass filter is applied around the coefficient whose x coordinate is an odd number (y = 2i + 1) in the horizontal direction, and then the x coordinate is an even number (x = 2i). A low-pass filter is applied centering on the coefficient, and these processes are performed for all y. In this case, P (2i) and the like in the first step and the second step are read as representing coefficient values.

次に、説明を簡単にするため、上述したＬ係数を中心にローパスフィルタを施して得られる係数をＬＬ、Ｌ係数を中心にハイパスフィルタを施して得られる係数をＨＬ、Ｈ係数を中心にローパスフィルタを施して得られる係数をＬＨ、Ｈ係数を中心にハイパスフィルタを施して得られる係数をＨＨと表記すれば、図１３に示される係数配列は、図１４に示されるような係数配列へと変換される。ここで同一の記号を付した係数群は、サブバンドと呼ばれ、図１１に示される形態では４つのサブバンドで構成される。 Next, for simplicity of explanation, the coefficient obtained by applying the low-pass filter centered on the L coefficient described above is LL, the coefficient obtained by applying the high-pass filter centered on the L coefficient is HL, and the coefficient obtained by applying the high-pass filter is centered on the low-pass filter. If the coefficient obtained by applying the filter is denoted by LH, and the coefficient obtained by applying the high-pass filter centering on the H coefficient is denoted by HH, the coefficient array shown in FIG. 13 becomes a coefficient array as shown in FIG. Converted. The coefficient group given the same symbol here is called a subband, and is composed of four subbands in the form shown in FIG.

以上は、垂直・水平の各方向に対して各１回のウェーブレット変換（１回のデコンポジ
ション（分解））が終了し、上述したＬＬ係数だけを集めて図１５に示されるようにサブバンド毎に集め、ＬＬサブバンドだけ取り出すと、丁度原画像の１／２の解像度の"画像"
が得られる。因みに、このようにサブバンド毎に分類することはデインターリーブと呼ば
れ、図１を参照して説明したような処理状態に配置することはインターリーブと呼ばれる
２回目のウェーブレット変換は、ＬＬサブバンドを原画像とみなして、上述した場合と
同様の変換を行えば良い。この場合、並べ替えを行うと、図１７に示されるような形態となる。ここでの係数の接頭の１や２は、水平・垂直に関し各何回のウェーブレット変換で係数が得られたかを示しており、デコンポジションレベルと呼ばれる。 As described above, each wavelet transform (one decomposition (decomposition)) is completed for each of the vertical and horizontal directions, and only the above-described LL coefficients are collected for each subband as shown in FIG. If only the LL subband is taken out, the “image” with exactly half the resolution of the original image
Is obtained. In this connection, the classification for each subband is called deinterleaving, and the arrangement in the processing state as described with reference to FIG. 1 is called interleaving. The second wavelet transform uses LL subbands. It can be regarded as an original image and the same conversion as described above may be performed. In this case, when rearrangement is performed, the form shown in FIG. 17 is obtained. The coefficient prefixes 1 and 2 here indicate how many times the wavelet transform has been obtained with respect to horizontal and vertical, and are called decomposition levels.

図１８は、デコンポジションレベルと解像度レベルとの関係を例示したものである。ここでは、解像度レベルがデコンポジションレベルとほぼ逆の関係にあることを示している。 FIG. 18 illustrates the relationship between the decomposition level and the resolution level. Here, it is shown that the resolution level is almost opposite to the composition level.

以上の説明において、１次元のみのウェーブレット変換を行う場合には、何れかの方向
だけの処理を行えば良く、その何れかの方向に関してウェーブレット変換を行った回数が
デコンポジションレベルとなる。 In the above description, when only one-dimensional wavelet transformation is performed, processing in only one direction may be performed, and the number of times wavelet transformation has been performed in any one direction becomes the decomposition level.

ウェーブレットフィルタとして９＊７フィルタを選択した場合には、上述した分割の後
、サブバンド毎に係数の線形量子化が成されるが、５＊３フィルタを選択した場合には、
サブバンド毎の線形量子化はしない仕様となっている。 When a 9 * 7 filter is selected as the wavelet filter, linear quantization of coefficients is performed for each subband after the above-described division, but when a 5 * 3 filter is selected,
The specification does not perform linear quantization for each subband.

次なる処理として、各サブバンドはプリシンクトとよばれる矩形に分割される。プリシ
ンクトとは、サブバンドを矩形に分割したものをＨＬ、ＬＨ、ＨＨの３つのサブバンドに
ついて集めたもので、大まかな意味では画像中の場所（Ｐｏｓｉｔｉｏｎ）を表わすもの
である。プリシンクトは３つで１纏まりであるが、ＬＬサブバンドを分割した場合には１
つで１纏まりとなる。また、プリシンクトはサブバンドと同じサイズにでき、プリシンク
トをさらに矩形に分割したものがコードブロックである。 As the next processing, each subband is divided into rectangles called precincts. The precinct is a collection of three subbands HL, LH, and HH obtained by dividing a subband into rectangles. In a broad sense, the precinct represents a position in an image. There are three precincts, which are one group, but 1 when the LL subband is divided.
One unit becomes one. The precinct can be the same size as the subband, and the code block is obtained by further dividing the precinct into rectangles.

図１９は、画像（タイル）、サブバンド、プリシンクト、コードブロックの関係を例示したものである。ここでは、物理的な大きさの序列として、画像≧タイル＞サブバンド≧プリシンクト≧コードブロックとなることを示している。
［量子化部１０について］
量子化部１０の量子化について、本実施例では、周波数ゲインのための正規化のための量子化を除く。量子化部１０の説明の前に、「周波数ゲインのための正規化のための量子化を除く」理由を説明する。 FIG. 19 illustrates the relationship among images (tiles), subbands, precincts, and code blocks. Here, as an order of physical size, it is shown that image ≧ tile> subband ≧ precinct ≧ code block.
[About the quantization unit 10]
With respect to the quantization of the quantization unit 10, in the present embodiment, quantization for normalization for frequency gain is excluded. Prior to the description of the quantizing unit 10, the reason for “excluding quantization for normalization for frequency gain” will be described.

変換符号化における量子化は、周波数変換ゲインの正規化のための量子化と、人の視覚特性や画像特性を反映するための量子化の２つにわかれ、通常以下のように成される。
（Ｉ）周波数変換部８による周波数変換が直交変換である場合
周波数変換が直交変換である場合は、その定義より、各帯域の周波数係数に生じた誤差は、周波数逆変換で算出される画素値に等しい割合で寄与する。このため、前記寄与度を考慮することなく(周波数変換ゲインの正規化のための量子化が必要なく)、人の視覚特性等を考慮して量子化（係数の量子化またはトランケーションまたは両者）を行う。
（ＩＩ）周波数変換が非直交変換（ex.双直交変換）である場合
（ＩＩ−ｉ）各帯域の周波数係数に生じた誤差が、周波数逆変換で算出される画素値に寄与する度合い（いわゆる周波数変換ゲイン）が帯域ごとに異なるため、まずは帯域ごとに前記寄与度を考慮した正規化がなされる。前記正規化は、帯域ごとの係数の線形量子化でなされるのが典型（ex．寄与度の小さい帯域の係数を予め量子化して、誤差を与えておく）であり、これは形態としては量子化であるが、その意図するところは正規化である。 Quantization in transform coding is divided into two types, that is, quantization for normalizing frequency conversion gain and quantization for reflecting human visual characteristics and image characteristics, and is usually performed as follows.
(I) When the frequency transformation by the frequency transformation unit 8 is an orthogonal transformation When the frequency transformation is an orthogonal transformation, an error occurring in the frequency coefficient of each band is a pixel value calculated by frequency inverse transformation according to the definition. Contributes at a rate equal to. For this reason, quantization (coefficient quantization or truncation or both) is performed in consideration of human visual characteristics, etc. without considering the contribution (no need for quantization for frequency conversion gain normalization). Do.
(II) When frequency transformation is non-orthogonal transformation (ex. Bi-orthogonal transformation) (II-i) Degree of contribution of error generated in frequency coefficient of each band to pixel value calculated by frequency inverse transformation (so-called Since the frequency conversion gain is different for each band, normalization is first performed in consideration of the contribution for each band. The normalization is typically performed by linear quantization of the coefficient for each band (ex. The coefficient of the band with a small contribution degree is quantized in advance to give an error). The intention is normalization.

（ＩＩ−ｉｉ）次に、人の視覚特性等を考慮して量子化（係数の量子化またはトランケーションまたは両者）を行う。 (II-ii) Next, quantization (quantization of coefficients or truncation or both) is performed in consideration of human visual characteristics and the like.

ただし、係数の量子化ができず、トランケーションのみを行える符号化においては、（ＩＩ−ｉ）における正規化を（ＩＩ−ｉｉ）でのトランケーション量に含めた形で反映することになる。前記のように、「(ｉｉ)におけるビットプレーンｎ枚分の符号化省略や破棄は、（ｉ）において係数を２のｎ乗で除算することと等価であり、(ｉｉｉ)における量子化は、Ｓ１×（２のｎ乗）で係数を除算するのと等価である」からである。 However, in the encoding in which the coefficient cannot be quantized and only truncation can be performed, the normalization in (II-i) is reflected in the form including the truncation amount in (II-ii). As described above, “encoding omission or discard of n bit planes in (ii) is equivalent to dividing the coefficient by 2 to the nth power in (i), and the quantization in (iii) is It is equivalent to dividing the coefficient by S1 × (2 to the power of n) ”.

また、（ＩＩ−ｉ）における正規化ができる方式でも、線形量子化を２度行うのを避けるために、（ＩＩ−ｉ）における量子化と（ＩＩ−ｉｉ）での量子化を一緒にあわせて行うのもの典型である。 Further, even in a method capable of normalization in (II-i), in order to avoid performing linear quantization twice, the quantization in (II-i) and the quantization in (II-ii) are combined together. This is a typical example.

尚カラー画像の符号化においては、周波数変換と同時に色変換を行う。本願で用いる色変換の典型例は、ＲＧＢ信号を輝度・色差信号に変換するものであるが、周波数変換での議論と同様、色変換が直交変換でない場合には、該色変換のゲインを反映して、輝度・色差ごとの誤差の正規化が同時に行われる。 In color image encoding, color conversion is performed simultaneously with frequency conversion. A typical example of color conversion used in the present application is to convert RGB signals into luminance / color difference signals. As in the case of frequency conversion, when color conversion is not orthogonal conversion, the gain of the color conversion is reflected. Thus, normalization of errors for each luminance / color difference is performed at the same time.

以上のように、本実施例での量子化は「周波数ゲインのための正規化のための量子化」または「輝度・色差変換ゲインの正規化のための量子化」を除く。つまり、正規化を除く視覚的量子化を指す。 As described above, the quantization in the present embodiment excludes “quantization for normalization for frequency gain” or “quantization for normalization of luminance / chrominance conversion gain”. In other words, it refers to visual quantization excluding normalization.

また、変換符号化がサブバンド変換を用いる方式である場合、サブバンドごとの係数の視覚的な重要度を定義することができる。例えば、ウェーブレットによるサブバンド変換を用いるＪＰＥＧ２０００である場合、本願実施の際の視覚的量子化の量を表す指標として、ＶｉｓｕａｌＷｅｉｇｈｔという名称の量が存在する。ＶｉｓｕａｌＷｅｉｇｈｔは、サブバンド(帯域)ごとの視覚的な重要度を示したものであり、値が大きいほど重要度が大きいことを示す指標である。 In addition, when transform coding is a method using subband transform, it is possible to define the visual importance of coefficients for each subband. For example, in the case of JPEG2000 that uses wavelet subband transformation, there is an amount named Visual Weight as an index representing the amount of visual quantization in the implementation of the present application. The Visual Weight indicates the visual importance for each subband (band), and is an index indicating that the importance is larger as the value is larger.

ＶｉｓｕａｌＷｅｉｇｈｔは、人の視覚特性(視覚の感度、実験値)を、当該サブバンドの帯域区間において積分した値であるから(簡単にいえば、その周波数帯域における人の視覚の感度を数値化したもの)、ウェーブレット変換に固有の指標ではなく、如何なるサブバンド変換に対しても定義可能な値である。ただし、輝度、色差の両方に対してＶｉｓｕａｌＷｅｉｇｈｔを規定したい場合には、輝度・色差空間を特定し、特定した色差空間における実験値をもとに当該色差成分に対する視覚感度を求め、積分する必要がある。 Since Visual Weight is a value obtained by integrating human visual characteristics (visual sensitivity, experimental value) in the band section of the subband (in short, it quantifies the human visual sensitivity in the frequency band). It is not an index specific to the wavelet transform, but a value definable for any subband transform. However, if you want to define Visual Weight for both luminance and color difference, it is necessary to specify the luminance / color difference space, determine the visual sensitivity for the color difference component based on the experimental value in the specified color difference space, and integrate it. There is.

図２０（Ａ）〜（Ｃ）には、輝度信号Ｙ、色差信号Ｃｂ、ＣｒのＶｉｓｕａｌｄｉｓｔａｎｃｅ１０００［ｄｏｔ］のＨＬ（図１５など参照）などのＶｉｓｕａｌＷｅｉｇｈｔのデコンポジションレベルを示す。また、図２１（Ａ）〜（Ｃ）には、輝度信号Ｙ、色差信号Ｃｂ、ＣｒのＶｉｓｕａｌｄｉｓｔａｎｃｅ３０００［ｄｏｔ］のＨＬなどのＶｉｓｕａｌＷｅｉｇｈｔのデコンポジションレベルを示す。 20A to 20C show visual weight decomposition levels such as luminance signal Y, chrominance signal Cb, and HL of Cr visual distance 1000 [dot] (see FIG. 15 and the like). Further, FIGS. 21A to 21C show visual weight decomposition levels such as HL of the luminance signal Y, the color difference signal Cb, and the visual distance 3000 [dot] of Cr.

ウェーブレット変換のサブバンドの帯域は、画像自体の帯域から算出されるものである。しかし、画像までのＶｉｓｕａｌｄｉｓｔａｎｃｅが近いほど高周波の誤差までが目に付き、Ｖｉｓｕａｌｄｉｓｔａｎｃｅが遠ければ低周波の誤差しか目に付かないことから、観察者の目にとってのサブバンドの帯域は、画像と観察者のＶｉｓｕａｌｄｉｓｔａｎｃｅの関数となる。このためＶｉｓｕａｌＷｅｉｇｈｔは、Ｖｉｓｕａｌｄｉｓｔａｎｃｅごとに定められる。そして、Ｖｉｓｕａｌｄｉｓｔａｎｃｅが遠いほど、低周波よりも高周波の重要度が小さく、かつ輝度成分よりも色差成分の重要度が小さくなるように設定されている（例えば、図２０、図２１参照）。 The subband band of the wavelet transform is calculated from the band of the image itself. However, the closer the visual distance to the image is, the more high frequency errors are noticeable, and the far distance the visual distance is, the lower frequency errors are only noticeable, so the subband bandwidth for the observer's eyes is It is a function of the observer's visual distance. For this reason, Visual Weight is determined for each Visual distance. The higher the distance, the lower the importance of the high frequency than the low frequency, and the less important the color difference component than the luminance component (see, for example, FIGS. 20 and 21).

ＪＰＥＧ２０００において、このＶｉｓｕａｌＷｅｉｇｈｔを反映して係数を線形量子化する場合には、量子化部１０による量子化処理の除算の分母（量子化ステップ数）を
量子化ステップ数＝（正規化のために除算に用いるサブバンドごとの値/ 当該サブバンドのVisual Weight）×ｎ（ｎは、圧縮率等に応じてユーザが選択する定数）（６）
とするのが一般的である。よってＶｉｓｕａｌＷｅｉｇｈｔが小さい程、すなわち視覚的な重要度が小さいサブバンドほど、大きく量子化されることになる。 In JPEG2000, when the coefficient is linearly quantized to reflect this Visual Weight, the denominator (quantization step number) of the quantization processing by the quantization unit 10 is expressed as follows: quantization step number = (for normalization) Value for each subband used for division / Visual Weight of the subband) × n (n is a constant selected by the user according to the compression ratio, etc.) (6)
Is generally. Therefore, the smaller the Visual Weight, that is, the sub-band with a smaller visual importance, the larger the quantization.

そして、式（６）によって量子化ステップ数を算出する場合、本実施例における「周波数変換ゲインの正規化および色変換ゲインの正規化のための量子化を除いた」量子化の程度とは、量子化ステップ数や、サブビットプレーン数などを示す。 Then, when calculating the number of quantization steps according to Equation (6), the degree of quantization in the present embodiment “excluding quantization for frequency conversion gain normalization and color conversion gain normalization” Indicates the number of quantization steps, the number of sub-bit planes, etc.

（１/当該サブバンドのＶｉｓｕａｌＷｅｉｇｈｔ）×ｎ（７）
となる。以下、式（７）で算出される値を「実質的な量子化ステップ数」と呼ぶ。前景Ｐ'と背景Ｑとでｎを共通にする場合には、「実質的な量子化ステップ数」はＶｉｓｕａｌＷｅｉｇｈｔの逆数となる。 (1 / Visual Weight of the subband) × n (7)
It becomes. Hereinafter, the value calculated by Expression (7) is referred to as “substantial quantization step number”. When n is common between the foreground P ′ and the background Q, the “substantial number of quantization steps” is the reciprocal of Visual Weight.

周波数変換部８の処理の説明について戻す。次に、周波数変換部８は、背景Ｒについて周波数変換（ウェーブレット変換）する。図２２に、９×７ウェーブレット変換の周波数ゲインの平方根の一例を示す。 The description of the processing of the frequency conversion unit 8 will be returned. Next, the frequency conversion unit 8 performs frequency conversion (wavelet conversion) on the background R. FIG. 22 shows an example of the square root of the frequency gain of 9 × 7 wavelet transform.

各周波数変換係数に生じた誤差の二乗和は、周波数逆変換時に図２２に示された値についてゲイン倍され、さらに逆色変換時に図２３に示すゲイン倍される。従って、公知の様に、周波数ごとの係数を「周波数変換ゲイン×色変換ゲイン」の平方根の逆数（図２４に示す）で除算しておけば、各係数に生じた量子化誤差の影響を、周波数間かつコンポ−ネント（各輝度信号Ｙ、色差信号Ｃｂ、Ｃｒ）間で均等にすることができる。 The sum of squares of errors generated in each frequency conversion coefficient is multiplied by the gain shown in FIG. 22 at the time of frequency reverse conversion, and further multiplied by the gain shown in FIG. 23 at the time of reverse color conversion. Therefore, as known, if the coefficient for each frequency is divided by the reciprocal of the square root of “frequency conversion gain × color conversion gain” (shown in FIG. 24), the influence of the quantization error generated in each coefficient is It can be made uniform between frequencies and between components (each luminance signal Y, color difference signals Cb, Cr).

前述のように、本実施例では、［量子化部１０において、周波数変換ゲインの正規化のための量子化を除く理由］で述べたように、図２４のような正規化のための量子化を除くため、本実施例においては、前記式（７）（実質的な量子化ステップ数）によって量子化の程度を規定する。 As described above, in the present embodiment, as described in [Reason for excluding quantization for frequency conversion gain normalization in the quantization unit 10], quantization for normalization as shown in FIG. In this embodiment, the degree of quantization is defined by the above equation (7) (substantial number of quantization steps).

以上より図２５（Ａ）（Ｂ）（Ｃ）は、それぞれ、Ｙ信号、Ｃｂ信号、Ｃｒ信号についての前景Ｐ'のための実質的な量子化ステップ数の一例であり、各周波数・コンポ−ネントごとに全て６４という値をとっている。これは式（７）において、ＶｉｓｕａｌＷｅｉｇｈｔを各周波数・コンポ−ネントについて１とし、かつｎ＝６４としたものである。図２５（Ａ）〜（Ｃ）では、前景Ｐ'の量子化ステップ数は、高周波から低周波まで全て同じである。 25A, 25B, and 25C are examples of the substantial number of quantization steps for the foreground P ′ for the Y signal, the Cb signal, and the Cr signal, respectively. All nunts have a value of 64. In Equation (7), Visual Weight is set to 1 for each frequency / component and n = 64. In FIGS. 25A to 25C, the number of quantization steps of the foreground P ′ is the same from high frequency to low frequency.

一方、図２６の（Ａ）（Ｂ）（Ｃ）は、それぞれ、本実施例における背景Ｒのための実質的な量子化ステップ数の一例である。これは式（７）において、Ｖｉｓｕａｌｄｉｓｔａｎｃｅ１０００のＶｉｓｕａｌＷｅｉｇｈｔを各周波数・コンポ−ネントについて適用し、かつｎ＝６４としたものである。図２６（Ａ）（Ｂ）（Ｃ）は、実質的な量子化ステップ数が、高周波ＨＨの方は低周波よりも大きくなっている。 On the other hand, (A), (B), and (C) in FIG. 26 are examples of the substantial number of quantization steps for the background R in this embodiment. This is obtained by applying Visual Weight 1000 of Visual distance 1000 to each frequency / component in equation (7) and setting n = 64. 26A, 26B, and 26C, the substantial number of quantization steps is larger at the high frequency HH than at the low frequency.

ここで、本実施形態１では、図２５（前景Ｐ’の量子化ステップ）、図２６（背景Ｒの量子化ステップ）に示すように、非絵柄部色画像（前景Ｐ’）の高周波成分ＨＨの量子化の程度を、絵柄部画像（背景Ｒ）の高周波数成分ＨＨの量子化の程度よりも弱く（小さく）する。ここで、量子化の程度とは、例えば、量子化ステップ、サブビットプレーン数などを意味する。また、量子化部１０は、周波数係数の量子化、または周波数係数の下位ビットの破棄、のうち少なくとも一方を行う。 Here, in the first embodiment, as shown in FIG. 25 (quantization step of the foreground P ′) and FIG. 26 (quantization step of the background R ), the high-frequency component HH of the non-pattern part color image (foreground P ′). Is made weaker (smaller) than the degree of quantization of the high-frequency component HH of the pattern portion image (background R 1 ). Here, the degree of quantization means, for example, a quantization step, the number of sub bit planes, and the like. Further, the quantization unit 10 performs at least one of quantization of the frequency coefficient and discarding the lower bits of the frequency coefficient.

上述のように、本実施例では、第２の手法（前景Ｐ'に符号量が最も小さい特定画像Ｃ（図４参照）を生成する手法）を用いている。特定画像Ｃは、色Ｓおよび色Ｔと極端に異なる色である場合が多い。従って、第１の問題として、前景Ｐ'の高周波成分を残さない場合には、色Ｓ、色Ｔと色Ｖとが混合してしまい不具合が生じてしまう。 As described above, in this embodiment, the second method (method for generating a specific image C (see FIG. 4) with the smallest code amount in the foreground P ′) is used. The specific image C is often a color extremely different from the colors S and T. Therefore, as a first problem, when the high frequency component of the foreground P ′ is not left, the color S, the color T, and the color V are mixed, resulting in a problem.

そこで、本実施形態のように、非絵柄部色画像（前景Ｐ'）の高周波成分ＨＨの量子化の程度を弱くする（つまり、高周波成分を残す）ことで、色Ｓ、色Ｔと色Ｖとが混合するという不具合は生じないという効果を奏する。 Therefore, as in this embodiment, the degree of quantization of the high-frequency component HH of the non-pattern part color image (foreground P ′) is weakened (that is, the high-frequency component is left), so that the colors S, T, and V There is an effect that there is no inconvenience of mixing and.

また第２の問題として、ＭＲＣの前景Ｐ'においては、文字Ａ１、Ａ２の色である色Ｓ、色Ｔの再現性が重要な一方で、特定画像Ｃである色Ｖの再現性は、ほとんど問われないという事情がある。特に、マスクによって前景Ｐ'または背景Ｑの一方が選択されるような形態では、非文字画素Ｃの色Ｖの再現性は全く問われない。 As a second problem, in the foreground P ′ of MRC, the reproducibility of the colors S and T which are the colors of the characters A1 and A2 is important, while the reproducibility of the color V which is the specific image C is almost the same. There are circumstances that are not questioned. In particular, in a form in which one of the foreground P ′ or the background Q is selected by the mask, the reproducibility of the color V of the non-character pixel C is not questioned at all.

従って、ＭＲＣの前景Ｐ'を量子化する場合、第１の手法、第２の手法に限らず、本来は特定画像Ｃに量子化誤差を集中させ、文字画素には誤差を生じさせないことが好ましい。 Therefore, when the foreground P ′ of MRC is quantized, it is not limited to the first method and the second method, but it is preferable that the quantization error is originally concentrated on the specific image C and no error is generated in the character pixel. .

しかし、第１の手法による前景Ｐでは、上記「非文字画素に量子化誤差を集中させ、文字画素には誤差を生じさせない」制御が行いにくい。なぜなら第１の手法は、文字Ａ１の色Ｓと、文字Ａ２の色Ｔの間にグラデーション状にぼかして前景Ｐを作成する方法である。従って文字画素、非文字画素は、周波数的には同じ帯域（ほぼ低域）の周波数に属してしまう（正確な表現は、文字画素、非文字画素から算出される周波数は、同じ低域の帯域に属する、となろうが、ここで便宜的に上記のような表現を用いている）。 However, in the foreground P according to the first method, it is difficult to perform the above-described control “concentrate quantization errors on non-character pixels and do not cause errors on character pixels”. This is because the first method is to create a foreground P by blurring in a gradation between the color S of the character A1 and the color T of the character A2. Therefore, the character pixel and the non-character pixel belong to the frequency of the same band (almost low frequency) in terms of frequency (the accurate expression is that the frequency calculated from the character pixel and the non-character pixel is the same low frequency band. The above expression is used for convenience here).

周波数変換を伴う符号化では、周波数係数の量子化を行うが、同じ帯域の周波数係数に対しては、ほぼ同じ程度の量子化を施すのが通常である。そして、量子化後の係数に、逆量子化と周波数逆変換を施して画素値にもどす場合、周波数係数に生じた量子化誤差は、その周波数係数の算出に用いられた画素値の誤差に反映されるため、同じ帯域に属する係数には、ほぼ同じ程度の誤差が生じることになる。 In the encoding with frequency conversion, the frequency coefficient is quantized, but the frequency coefficient in the same band is usually subjected to almost the same degree of quantization. When the quantized coefficient is returned to the pixel value by performing inverse quantization and frequency inverse transform, the quantization error generated in the frequency coefficient is reflected in the error of the pixel value used to calculate the frequency coefficient. Therefore, almost the same error occurs in the coefficients belonging to the same band.

つまり、あえてエッジを生じさせない第１の手法では、エッジを生じさせないがゆえに、非文字画素に誤差を集中させることができない。 That is, in the first method that does not cause an edge, an error cannot be concentrated on a non-character pixel because an edge is not generated.

一方、第２の手法による前景Ｐ'においては、エッジに隣接する２画素、すなわち文字画素Ａ１とそれに隣接する非文字画素（特定画像Ｃ）は高周波帯域に属する。しかし、それ以外の非文字画素は低周波帯域に属する。類似した色の文字画素同士が隣接する場合には、それらの文字画素も低周波帯域に属するが、通常、前景Ｐ'に占める文字画素の割合が少ないことを考えれば、低域に属する画素の殆どは非文字画素である。 On the other hand, in the foreground P ′ according to the second method, the two pixels adjacent to the edge, that is, the character pixel A1 and the non-character pixel (specific image C) adjacent thereto belong to the high frequency band. However, other non-character pixels belong to the low frequency band. When character pixels of similar colors are adjacent to each other, these character pixels also belong to the low frequency band, but in general, considering that the ratio of the character pixels occupying the foreground P ′ is small, Most are non-character pixels.

つまり、第２の手法によれば、非文字画素の殆どを低周数を生じさせる画素にすることができ、かつ低周波を多めに量子化すれば、そうしない場合に比べて非文字画素に誤差を集中させることができるのである。 In other words, according to the second method, most of the non-character pixels can be made to be pixels that generate a low frequency, and if the low frequency is quantized to a large extent, the non-character pixels are made to be non-character pixels as compared with the case where it does not. The error can be concentrated.

画像の符号化においては、低周波を多めに量子化することは通常行わない。何故なら、人間は低周波の誤差に敏感だからである。しかし、前景Ｐの生成に第２の手法を用いたＭＲＣ符号化という特殊な条件下においては、低周波こそ多めに量子化すべきであり、かつその量子化誤差は、再構成された状態（背景と前景が合成された状態）での画像においては、何ら観察者に見えることはない。 In encoding an image, it is not usually performed to quantize a large amount of low frequencies. This is because humans are sensitive to low frequency errors. However, under the special condition of MRC encoding using the second method for generating the foreground P, the low frequency should be quantized more, and the quantization error is in a reconstructed state (background In the image in a state where the foreground and the foreground are combined), the image is not visible to the observer.

以上より、「前景は色」というレイヤ自体が持つ機能を考慮する場合でも、前景画像の作成方法（これはアプリケーションによって予め決まっていることが多い）をさらに考慮に入れて、この作成方法を前景の量子化に反映すべきであるということになる。 From the above, even when considering the function of the layer itself called “foreground color”, the foreground image creation method (this is often determined in advance by the application) is further taken into consideration, and this creation method is It should be reflected in the quantization of.

このようにして、背景Ｒ、前景Ｐについては、量子化処理を行う（ステップＳ１８、ステップＳ２４）。その後、符号化部１２は背景Ｒ、前景Ｐを公知の技術で符号化する（ステップＳ２０、ステップＳ２６）。 In this way, the background R and the foreground P are quantized (steps S18 and S24). Thereafter, the encoding unit 12 encodes the background R and the foreground P by a known technique (steps S20 and S26).

また、符号化部１２は、係数のエントロピー符号化（ビットプレーン単位でのＭＱ符号化）が、プリシンクト単位で、コードブロック毎かつビットプレーン順に成される。本願実施例においては、最上位ビットプレーンから、テーブルで予め規定された下位ビットプレーンまでをMQ符号化する。 In addition, the encoding unit 12 performs entropy coding of coefficients (MQ coding in bit plane units) for each code block and in bit plane order in precinct units. In the embodiment of the present application, MQ coding is performed from the most significant bit plane to the lower bit plane defined in advance in the table.

また、符号化部１２は、前記「最上位ビットプレーンから、量子化テーブルで予め規定された下位ビットプレーンまで」を必要な符号と見なすため、で括弧で記載した不要な符号は存在しない。これら「最上位ビットプレーンから、量子化テーブルで予め規定された下位ビットプレーンまで」の符号に対して、ヘッダをつけたものをパケットと呼ぶ。パケットヘッダには、当該パケットに含まれる符号に関する情報が含まれ、各パケットは独立に扱うことができる。いわばパケットは符号の単位である。 Further, since the encoding unit 12 regards “from the most significant bit plane to the lower bit plane defined in advance in the quantization table” as a necessary code, there is no unnecessary code described in parentheses. For these codes “from the most significant bit plane to the lower bit plane defined in advance by the quantization table”, a header is added and called a packet. The packet header includes information about codes included in the packet, and each packet can be handled independently. In other words, a packet is a unit of code.

また、マスクＱ、第２マスクＶについては公知の技術であるＭＭＲ（Modified Modified READ）で符号化する（ステップＳ２８）。また、第２背景ＷをＪＰＭで符号化する（ステップＳ３０）。そして、パケット生成部１４が、全ての符号化を結合してパケットを生成する。そして、パケット生成部１４は、パケットを並べて符号を形成する（図３９、図４０参照）。また、パケット生成部１４は、サブバンド毎に上位ビットプレーンから（必要な）下位ビットプレーンまでをビットプレーン符号化してもよい。パケット生成部１４は、必要な符号をまとめ、パケットを生成する。そして、パケット生成部１４は、全てのプリンシクト（全てのコードブロック＝全てのサブバンド）に関してパケットを並べて符号を生成する。 Further, the mask Q and the second mask V are encoded by a known technique MMR (Modified Modified READ) (step S28). Further, the second background W is encoded by JPM (step S30). Then, the packet generation unit 14 combines all the encodings to generate a packet. The packet generator 14 arranges the packets to form a code (see FIGS. 39 and 40). Further, the packet generator 14 may perform bit-plane encoding from the upper bit plane to the (necessary) lower bit plane for each subband. The packet generator 14 collects necessary codes and generates a packet. Then, the packet generation unit 14 generates codes by arranging packets for all the prints (all code blocks = all subbands).

このように、本実施形態１では、非絵柄部色画像の高周波成分の量子化の程度を、絵柄部画像の高周波数成分の量子化の程度よりも弱くすることで、第２の手法を用いたとしても、復号化した場合に、特定画像が、文字画像（文字画素）に混ざらなくなる。
［実施形態２］
実施形態２では、ＤＣレベルシフト部４内の輝度色差変換部が、絵柄部画像Ｒ、非絵柄部色画像Ｐ'を輝度・色差変換して、前記絵柄部画像の輝度Ｙ_Ｒおよび色差Ｃ_Ｒ、および非絵柄部色画像Ｐの輝度Ｙ_Ｐおよび色差Ｃ_Ｐを生成する。 Thus, in the first embodiment, the second method is used by making the degree of quantization of the high-frequency component of the non-picture part color image weaker than the degree of quantization of the high-frequency component of the picture part image. Even if it decodes, when it decodes, a specific image will not be mixed with a character image (character pixel).
[Embodiment 2]
In the second embodiment, the luminance / color difference conversion unit in the DC level shift unit 4 performs luminance / color difference conversion on the pattern portion image R and the non-pattern portion color image P ′, and the luminance Y _R and the color difference C _{R of the} pattern portion image. , and it generates a luminance Y _P and the chrominance C _P of the non-picture portion color images P.

通常符号化においては、輝度・色差変換を行い、輝度毎、色差毎に量子化を行う。通常の自然画においては、色よりも輝度に敏感な人間の視覚特性を反映して、
輝度の量子化の程度＜＜色差の量子化の程度
とするのが一般的である。 In normal encoding, luminance / color difference conversion is performed, and quantization is performed for each luminance and each color difference. In normal natural images, reflecting human visual characteristics that are more sensitive to brightness than color,
The degree of luminance quantization is generally set to the degree of color difference quantization.

しかし、輝度の量子化の程度＜色差の量子化の程度
あるいは輝度の量子化の程度 ≒ 色差の量子化の程度
のように、自然画像ではあっても、ＭＲＣの前景Ｐ'の輝度と色差の量子化の割合（色差の量子化の程度／輝度の量子化の程度）を、背景Ｒのそれとは変えることが望ましい。 However, the degree of luminance quantization <the degree of color difference quantization or the degree of luminance quantization ≒ the degree of color difference quantization, even if it is a natural image, the luminance and color difference of the MRC foreground P ' It is desirable to change the quantization ratio (the degree of quantization of the color difference / the degree of quantization of the luminance) from that of the background R.

従って、実施形態２では、非絵柄部色画像Ｐ’の、色差の量子化の程度／輝度の量子化の程度を、絵柄部画像の、色差の量子化の程度／輝度の量子化の程度よりも小さくする。このようにすることで、実施形態１と同様の効果を得ることができる。 Therefore, in the second embodiment, the degree of quantization of the color difference / the degree of luminance quantization of the non-picture part color image P ′ is determined by the degree of the quantization of the color difference / the degree of luminance quantization of the picture part image. Also make it smaller. By doing in this way, the effect similar to Embodiment 1 can be acquired.

また、図２７の縦軸に、図２５記載の前景Ｐ'の色差Ｃｂの量子化のステップ数／輝度Ｙの量子化のステップ数（実線で示し、以下、「前景Ｐ'の色差Ｃｂの比」という。）と、図２６記載の背景Ｒの色差Ｃｂの量子化のステップ数／輝度Ｙの量子化のステップ数（破線で示し、以下、「背景Ｒの色差Ｃｂの比」という。）とを示す。破線、実線については以下も同様である。また、「色差の量子化の程度（量子化ステップ数）の輝度の量子化の程度（量子化ステップ数）に対する割合」を「色差Ｃｂ（またはＣｒ）の量子化のステップ数／輝度Ｙの量子化のステップ数」と示す。 27, the number of steps of quantization of the color difference Cb of the foreground P ′ shown in FIG. 25 / the number of steps of quantization of the luminance Y (shown by a solid line, hereinafter “ratio of the color difference Cb of the foreground P ′”). 26) and the number of steps of quantization of the color difference Cb of the background R / the number of steps of quantization of the luminance Y (indicated by a broken line, hereinafter referred to as “ratio of the color difference Cb of the background R”) shown in FIG. Indicates. The same applies to the broken and solid lines. Further, “the ratio of the degree of quantization of the color difference (number of quantization steps) to the degree of quantization of the luminance (number of quantization steps)” is expressed as “the number of steps of quantization of the color difference Cb (or Cr) / quantum of the luminance Y”. The number of steps of conversion.

また、図２８の縦軸に、図２５記載の前景Ｐ'の色差Ｃｒの量子化のステップ数／輝度Ｙの量子化のステップ数（実線で示し、以下、「前景Ｐ'の色差Ｃｒの比」という。）と、図２６記載の背景Ｒの色差Ｃｒの量子化のステップ数／輝度Ｙの量子化のステップ数（破線で示し、「以下、背景Ｒの色差Ｃｒの比」という。））とを示す。なお、図２７、図２８ともに横軸は、周波数帯である。該周波数帯において、２ＬＬは、全ての比は最も低意周波数体であり、２ＨＬと２ＬＨでは同一の周波数帯であり、１ＨＨ最も高い周波数帯である。 Also, the vertical axis of FIG. 28 shows the number of steps of quantization of the color difference Cr of the foreground P ′ / the number of steps of quantization of the luminance Y shown in FIG. 25 (indicated by a solid line, hereinafter “ratio of the color difference Cr of the foreground P ′”). )) And the number of steps of quantization of the color difference Cr of the background R shown in FIG. 26 / the number of steps of quantization of the luminance Y (indicated by a broken line, hereinafter referred to as “ratio of the color difference Cr of the background R”))) It shows. In both FIG. 27 and FIG. 28, the horizontal axis represents the frequency band. In the frequency band, 2LL is the lowest frequency body in all ratios, 2HL and 2LH are the same frequency band, and 1HH is the highest frequency band.

このように、実施形態２では、非絵柄部色画像Ｒの、色差の量子化の程度／輝度の量子化の程度を、絵柄部画像の、色差の量子化の程度／輝度の量子化の程度よりも、（２ＬＬを除き、）小さくする。 As described above, in the second embodiment, the degree of quantization of the color difference / the degree of luminance quantization of the non-picture part color image R is set as the degree of quantization of the color difference / the degree of luminance quantization of the picture part image. Than (except 2LL).

ここで、「２ＬＬを除き」としたのは、２ＬＬの実質的な量子化ステップ数は、輝度、色差ともに、１とすることが多いからである。つまり、大小関係の比較において、２ＬＬは除外すべきケースが多いということである。
［実施形態３］
本実施形態３の符号化装置では、非絵柄部色画像Ｐ'（前景Ｐ'）の輝度Ｙの高周波成分を最も量子化し、絵柄部画像の色差の高周波成分を最も量子化する。これにより、更に、符号量を少なく符号化できる。また、この実施形態３の符号化装置では、上記第１の手法、第２の手法のうちどちらを用いても良い。 Here, “except 2LL” is used because the number of substantial quantization steps of 2LL is often 1 for both luminance and color difference. That is, there are many cases where 2LL should be excluded in the comparison of magnitude relations.
[Embodiment 3]
In the encoding apparatus of the third embodiment, the high-frequency component of luminance Y of the non-picture part color image P ′ (foreground P ′) is most quantized, and the high-frequency component of the color difference of the picture part image is most quantized. Thereby, it is possible to further encode with a small code amount. Further, in the encoding apparatus of the third embodiment, either the first method or the second method may be used.

まず、「非絵柄部色画像Ｐ'（前景Ｐ'）の輝度Ｙの高周波成分を最も量子化する」ことについて、理由を説明する。非絵柄部色画像Ｐ'は、「色」を表す画像である。従って、非絵柄部色画像Ｐ'の色差を残すべく、輝度Ｙを最も量子化する。また、輝度Ｙの高周波成分を最も量子化することで、色差が多く残ることは、経験的に知られていることである。 First, the reason for “quantizing the high frequency component of the luminance Y of the non-pattern part color image P ′ (foreground P ′) most” will be described. The non-pattern part color image P ′ is an image representing “color”. Therefore, the luminance Y is most quantized in order to leave the color difference of the non-pattern part color image P ′. Further, it is known from experience that a large amount of color difference remains by quantizing the high-frequency component of luminance Y most.

また、「絵柄部画像の色差の高周波成分を最も量子化する」ことについて、理由を説明する。上述のように、人間は、色よりも輝度に敏感である。従って、絵柄部画像の色差の高周波成分を最も量子化する。
［具体例１］
実施形態１、２を実現するための具体例を説明する。具体例１として、図８に示す符号化装置に、重み付け部１６を更に設ける。具体例１では、重み付け係数を用いる。この具体例１ではＶｉｓｕａｌＤｉｓｔａｎｃｅを用いる。ＶｉｓｕａｌＤｉｓｔａｎｃｅについては後述する。 In addition, the reason for “quantizing the highest frequency component of the color difference of the pattern portion image most” will be described. As mentioned above, humans are more sensitive to brightness than color. Therefore, the high-frequency component of the color difference of the pattern portion image is most quantized.
[Specific Example 1]
A specific example for realizing the first and second embodiments will be described. As a specific example 1, the weighting unit 16 is further provided in the encoding apparatus shown in FIG. In specific example 1, a weighting coefficient is used. In this specific example 1, Visual Distance is used. The Visual Distance will be described later.

上述したウェーブレット変換のサブバンドの帯域は、画像自体の帯域から算出されるものである。しかし、画像までのＶｉｓｕａｌｄｉｓｔａｎｃｅが近いほど高周波の誤差までが目に付き、Ｖｉｓｕａｌｄｉｓｔａｎｃｅが遠ければ低周波の誤差しか目に付かないことから、観察者の目にとってのサブバンドの帯域は、画像と観察者のＶｉｓｕａｌｄｉｓｔａｎｃｅの関数（例えば、１次関数）となる。このため重み付け係数は、上述のＶｉｓｕａｌｄｉｓｔａｎｃｅごとに示されており、Ｖｉｓｕａｌｄｉｓｔａｎｃｅが遠いほど、低周波よりも高周波の重要度が小さく、かつ輝度成分よりも色差成分の重要度が小さくなるように設定されている。 The subband band of the wavelet transform described above is calculated from the band of the image itself. However, the closer the visual distance to the image is, the more high frequency errors are noticeable, and the far distance the visual distance is, the lower frequency errors are only noticeable, so the subband bandwidth for the observer's eyes is It becomes a function (for example, a linear function) of the observer's visual distance. For this reason, the weighting coefficient is shown for each of the above visual distances, and is set such that the farther the visual distance is, the less important the high frequency than the low frequency and the less important the color difference component than the luminance component. Has been.

また、本具体例１の周波数変換部８は、入力された信号に対して、周波数変換によりサブバンドごとに分割する。そして、重み付け部１６は、サブバンドごとの前記量子化の程度に対して重み付け係数を算出し、該重み付け係数で重み付けを行う。重み付け係数の算出に際しては、非絵柄部色画像のＶｉｓｕａｌＤｉｓｔａｎｃｅ＜絵柄部画像のＶｉｓｕａｌＤｉｓｔａｎｃｅとする。重み付け係数の算出については、［具体例２］で説明する。 In addition, the frequency conversion unit 8 according to the first specific example divides the input signal for each subband by frequency conversion. Then, the weighting unit 16 calculates a weighting coefficient for the degree of quantization for each subband, and performs weighting with the weighting coefficient. In calculating the weighting coefficient, it is assumed that Visual Distance of the non-picture part color image <Visual Distance of the picture part image. The calculation of the weighting coefficient will be described in [Specific Example 2].

具体的な数値を用いて、具体例１を説明する。図２９に、変形例１の非絵柄部色画像Ｐ'の実質的な量子化ステップ数を示す。また、図２９に示す値は、これは上記式（７）において、Ｖｉｓｕａｌｄｉｓｔａｎｃｅ８００のVisual Weightを各周波数・コンポ−ネントについて適用し、かつｎ＝64としたものである。実施形態１で説明したとおり、図２６、図２９に示す値は、非絵柄部色画像の高周波成分の量子化ステップ数を、絵柄部画像の高周波数成分の量子化ステップ数よりも弱くしている。なお、図３０に、輝度について、前景Ｐ'、背景Ｑの実質的な量子化ステップ数を比較したものを示す。なお、図３０では、背景Ｑを破線で示し、前景Ｐを実線で示す。 Specific example 1 will be described using specific numerical values. FIG. 29 shows the substantial number of quantization steps of the non-picture part color image P ′ of the first modification. The values shown in FIG. 29 are obtained by applying Visual Weight 800 of Visual distance 800 to each frequency / component in the above equation (7) and setting n = 64. As described in the first embodiment, the values shown in FIGS. 26 and 29 are obtained by making the number of quantization steps of the high frequency component of the non-picture part color image weaker than the number of quantization steps of the high frequency component of the picture part image. Yes. FIG. 30 shows a comparison of the substantial number of quantization steps of the foreground P ′ and the background Q with respect to luminance. In FIG. 30, the background Q is indicated by a broken line, and the foreground P is indicated by a solid line.

同様に、図２７、図３０、に示した前景Ｐ'、背景Ｑの実質的な量子化ステップ数を、周波数毎に、色差Ｃｂ/輝度Ｙ、色差Ｃｒ/輝度Ｙの比として表したものが、図３１、３２である。実施形態２で説明したように、図３１から明らかなように、「色差の実質的な量子化ステップ数/輝度の実質的な量子化ステップ数」は、（２ＬＬを除き）前景Ｐ'の方が背景Ｑより小さい。また、図３２においても、１ＨＨを除いて「色差の実質的な量子化ステップ数/輝度の実質的な量子化ステップ数」は、（ＬＬを除き）前景Ｐ'の方が背景Ｑより小さい。図３２においては、前記「１ＨＨを除いて」と記載したが、全ての周波数で比較すれば明らかなように、平均的には「色差の実質的な量子化ステップ数/輝度の実質的な量子化ステップ数」は、（LLを除き）前景２の方が前景１より小さい（実施形態２で説明）。 Similarly, the actual quantization step numbers of the foreground P ′ and the background Q shown in FIGS. 27 and 30 are expressed as a ratio of color difference Cb / luminance Y and color difference Cr / luminance Y for each frequency. 31 and 32. As described in the second embodiment, as is clear from FIG. 31, “substantial quantization step number of color difference / substantial quantization step number of luminance” is the value of foreground P ′ (except 2LL). Is smaller than the background Q. Also in FIG. 32, except for 1HH, the “substantial quantization step number of color difference / substantial quantization step number of luminance” is smaller for the foreground P ′ (except for LL) than for the background Q. In FIG. 32, the description is “except for 1HH”. However, as apparent from comparison at all frequencies, on average, “substantial quantization step number of color difference / substantial quantum of luminance”. The number of conversion steps ”(excluding LL) is smaller in the foreground 2 than in the foreground 1 (described in the second embodiment).

また、図２６のＶｉｓｕａｌｄｉｓｔａｎｃｅは１０００、図２９のＶｉｓｕａｌｄｉｓｔａｎｃｅは、８００である。従って、本変形例１の値である。
［具体例２］
この具体例２では、実施形態１、２を実現するために、ＶｉｓｕａｌＷｅｉｇｈｔを用いた具体例２を説明する。ＶｉｓｕａｌＷｅｉｇｈｔ、および具体例１で説明したＶｉｓｕａｌＤｉｓｔａｎｃｅについて説明する。
＜１＞ＣｏｎｔｒａｓｔＳｅｎｓｉｔｉｖｉｔｙＦｕｎｃｔｉｏｎ(ＣＳＦ)とその式
ＶｉｓｕａｌＷｅｉｇｈｔの算出に先立ち、まずＣｏｎｔｒａｓｔＳｅｎｓｉｔｉｖｉｔｙＦｕｎｃｔｉｏｎ(ＣＳＦ)とその式について説明する。後述するが、ＶｉｓｕａｌＷｅｉｇｈｔは、所定帯域におけるＣＳＦの積分値として求められるからである。ＣＳＦとは、所定Ｖｉｓｕａｌｄｉｓｔａｎｃｅにおいて所定空間周波数のラインペア（縞）を観察した場合に、観察者が判別できる濃淡の差/和（＝コントラスト）の逆数を示したものであり、実験で求められるものである。そして、一般的に、人間は高周波ほどコントラストを判別しにくい。 In addition, the Visual distance in FIG. 26 is 1000, and the Visual distance in FIG. 29 is 800. Therefore, it is the value of this modification 1.
[Specific Example 2]
In this second specific example, a second specific example using VisualWeight in order to realize the first and second embodiments will be described. The VisualWeight and the Visual Distance described in the specific example 1 will be described.
<1> Contrast Sensitivity Function (CSF) and its Formula Prior to the calculation of Visual Weight, the Contrast Sensitivity Function (CSF) and its formula will be described first. As will be described later, Visual Weight is obtained as an integral value of CSF in a predetermined band. CSF indicates the reciprocal of the difference / sum of contrast (= contrast) that an observer can discriminate when observing a line pair (stripes) of a predetermined spatial frequency in a predetermined visual distance, and is obtained by experiment. Is. In general, humans are less likely to discriminate contrast at higher frequencies.

コントラストを「原画と圧縮伸張後の画像の差」と考えた場合、ＣＳＦは「各周波数で、原画と圧縮後の画像の濃淡差がこれ以下なら判別できない」という限界を示したものである。そして、ウェーブレット係数が原画像のコントラストを概ね反映することから、ＣＳＦは「各周波数(＝各サブバンド)で、原画と圧縮後の画像のウェーブレット係数の差がこれ以下なら、その係数が反映する原画と伸張後の画像の差も判別できない」という限界を示す指標と解釈されている。したがって、ウェーブレット係数を「ＣＳＦの逆数の２倍以下」で量子化すれば、各係数に生じる量子化誤差はＣＳＦの逆数以下になるので、画質劣化が認識されないはず、というのが一般的である。
＜２＞ＶｉｓｕａｌＷｅｉｇｈｔの算出方法について、
ＣＳＦは「各周波数におけるウェーブレット係数の許容量子化誤差量」であるため、例えば垂直方向に１回デコンポジションして得られる１Ｈサブバンドの許容量子化量を考えるとする。１Ｈサブバンドは、ハイパスのウェーブレットフィルタが通過させた、ある周波数以上の周波数を広く含むが、どの周波数でのＣＳＦの値を採用すればいいかに関しては、簡易に以下のように考える。 When the contrast is considered as “the difference between the original image and the image after compression / expansion”, the CSF shows the limit that “the difference in density between the original image and the image after compression at each frequency cannot be determined”. Then, since the wavelet coefficient generally reflects the contrast of the original image, the CSF is “if the difference between the wavelet coefficient of the original image and the compressed image is less than this at each frequency (= each subband), the coefficient reflects. It is interpreted as an index indicating a limit that the difference between the original image and the image after expansion cannot be determined. Therefore, if the wavelet coefficient is quantized with “twice or less of the reciprocal of CSF”, the quantization error generated in each coefficient is less than or equal to the reciprocal of CSF, so that image quality degradation should not be recognized. .
<2> How to calculate VisualWeight,
Since CSF is “allowable quantization error amount of wavelet coefficient at each frequency”, for example, let us consider an allowable quantization amount of 1H subband obtained by decomposing once in the vertical direction. The 1H subband includes a wide range of frequencies higher than a certain frequency that have been passed by the high-pass wavelet filter. The frequency of the CSF to be used is simply considered as follows.

まず、原画像の解像度がd [dpi]で、これを距離v [inch]から観察した場合の空間周波数fの近似式は、
ｆ＝ｖ・ｄ・ｔａｎ(π／３６０) （８）
であり、これが原画像が含みうる最大の周波数である。 First, when the resolution of the original image is d [dpi] and this is observed from the distance v [inch], the approximate expression of the spatial frequency f is
f = v · d · tan (π / 360) (8)
This is the maximum frequency that the original image can contain.

ここで上記式（８）の右辺中のｖ・ｄがＶｉｓｕａｌＤｉｓｔａｎｃｅ(日本語だと、Ｖｉｓｕａｌｄｉｓｔａｎｃｅ)である。ＶｉｓｕａｌＤｉｓｔａｎｃｅは、本来は被験者の実際のＶｉｓｕａｌｄｉｓｔａｎｃｅから算出されるもので、後述の通り「そのＶｉｓｕａｌｄｉｓｔａｎｃｅにおいて通常の自然画像を観察した場合に、最も画質がよく感じられる量子化パラメータを算出するため」に使用される。 Here, v · d in the right side of the above equation (8) is Visual Distance (Visual distance in Japanese). The Visual Distance is originally calculated from the actual visual distance of the subject, and as described later, “To calculate a quantization parameter that gives the best image quality when an ordinary natural image is observed in the Visual distance” Is used.

ただし、ＶｉｓｕａｌＤｉｓｔａｎｃｅとして実際のＶｉｓｕａｌｄｉｓｔａｎｃｅ以外の値を用いると、「そのＶｉｓｕａｌｄｉｓｔａｎｃｅではない距離において通常の自然画像を観察した場合に、最も画質がよく感じられる量子化パラメータ」や「そのＶｉｓｕａｌｄｉｓｔａｎｃｅにおいて通常の自然画像ではない画像(ｅｘ．ＭＲＣの前景)を観察した場合に、最も画質がよく感じられる量子化パラメータ」を得ることが可能となる。本願では、後者のパラメータを算出するために、ＶｉｓｕａｌＤｉｓｔａｎｃｅとして実際のＶｉｓｕａｌｄｉｓｔａｎｃｅ以外の値を用いることがある。 However, if a value other than the actual visual distance is used as the visual distance, “a quantization parameter that gives the best image quality when a normal natural image is observed at a distance that is not the visual distance” or “in the visual distance” When an image that is not a normal natural image (ex. Foreground of MRC) is observed, it is possible to obtain a “quantization parameter that gives the best image quality”. In the present application, in order to calculate the latter parameter, a value other than the actual visual distance may be used as the visual distance.

さて、先の通り、原画像が含みうる最大の空間周波数ｆｍａｘは、
ｆｍａｘ＝ｖ・ｄ・ｔａｎ(π／３６０) （９）
であり、通常のウェーブレット変換では1回のデコンポジション毎に１／２にサブサンプリングされるため、デコンポジションレベル毎に空間周波数の最大値は半分になる。 As described above, the maximum spatial frequency fmax that can be included in the original image is
f max = v · d · tan (π / 360) (9)
In the normal wavelet transform, since the sub-sampling is halved for each decomposition, the maximum value of the spatial frequency is halved for each decomposition level.

そこで、ウェーブレットフィルタを理想的なハイパスフィルタおよびローパスフィルタであると見なし、０≦ｆ≦ｆｍａｘの帯域が、1回のデコンポジションで０≦ｆ＜ｆｍａｘ／２とｆｍａｘ／２≦ｆ≦ｆｍａｘに二分されると仮定する。つまり、１H係数の帯域はｆｍａｘ／２≦ｆ≦ｆｍａｘ、１L係数の帯域は０≦ｆ＜ｆｍａｘ／２と見なす。 Therefore, the wavelet filter is regarded as an ideal high-pass filter and low-pass filter, and the band of 0 ≦ f ≦ fmax is divided into 0 ≦ f <fmax / 2 and fmax / 2 ≦ f ≦ fmax in one decompression. Assume that That is, the band of 1H coefficient is regarded as fmax / 2 ≦ f ≦ fmax, and the band of 1L coefficient is regarded as 0 ≦ f <fmax / 2.

よって、１H係数のｃｓｆとしては、帯域ｆｍａｘ／２≦ｆ≦ｆｍａｘにおけるＣＳＦの平均値を用い、１L係数のＣＳＦとしては、帯域０≦ｆ＜ｆｍａｘ／２におけるＣＳＦの平均値を用いる。そして、シーケンシャルに連結した線形フィルタの特性が個々のフィルタの特性の積で表わされることを根拠に、１ＬＨ係数のＣＳＦとしては、帯域ｆｍａｘ／２≦ｆ≦ｆｍａｘにおける平均値と、帯域０≦ｆ＜ｆｍａｘ／２における平均値の積を用いる。 Therefore, the average value of CSF in the band fmax / 2 ≦ f ≦ fmax is used as the csf of the 1H coefficient, and the average value of CSF in the band 0 ≦ f <fmax / 2 is used as the CSF of the 1L coefficient. Then, based on the fact that the characteristics of the linear filters connected sequentially are expressed by the product of the characteristics of the individual filters, the CSF of the 1LH coefficient has an average value in the band fmax / 2 ≦ f ≦ fmax and the band 0 ≦ f. The product of the average values at <fmax / 2 is used.

ここで平均値とは、各帯域内におけるＣＳＦの積分を、積分区間の幅で割ったものである。 Here, the average value is obtained by dividing the integration of CSF in each band by the width of the integration interval.

よって、各サブバンドに対して適用すべきＣＳＦの値は、次式で算出され、これがＪＰＥＧ２０００においてＶｉｓｕａｌＷｅｉｇｈｔ（あるいはＣＳＦｗｅｉｇｈｔ）と呼ばれるものである。 Therefore, the value of CSF to be applied to each subband is calculated by the following equation, which is called Visual Weight (or CSFweight) in JPEG2000.

ｈはｈｏｒｉｚｏｎｔａｌ、ｖはｖｅｒｔｉｃａｌでＷｈは水平方向の区間の積分, Ｗｖは垂直方向の区間の積分である。またＷ(ｆ) は、最大値を1に正規化したＣＳＦであり、Ｙ，Ｃｂ，Ｃｒの各コンポ−ネント毎に１に正規化したものを用いる左辺のｗは、ＶｉｓｕａｌＷｅｉｇｈｔを示す。 h is horizontal, v is vertical, Wh is the integration in the horizontal section, and Wv is the integration in the vertical section. W (f) is a CSF normalized to 1 at the maximum value, and w on the left side using one normalized to 1 for each component of Y, Cb, and Cr represents VisualWeight.

そして、非絵柄部色画像に対しては、ＶｉｓｕａｌＷｅｉｇｈｔを適用せず、前記絵柄部画像に対しては、前記絵柄部画像に対しては、ＶｉｓｕａｌＷｅｉｇｈｔを適用する。このように、ＶｉｓｕａｌＷｅｉｇｈｔを用いることで実施形態１、２は実現される。
［変形例１］
変形例１について説明する。図９中のステップＳ１６、ステップＳ２２において、背景Ｒ、前景Ｐ'のウェーブレット変換を９＊７フィルタを用いて、行う場合を説明する。この場合には、図２４と図２６に示す値を用いて、上記式（７）によって、正規化を含めた量子化ステップ数を算出し、周波数係数の量子化を行う。ただし本変形例２の場合は、量子化後の係数の全ビットプレーンを符号化するのではなく、図３３に示した数の下位サブビットプレーンの符号化を行わないようにする。ここで図３３中のｔは任意の整数であり、ｔの大小によって、符号量自体を選択することができる。 Then, VisualWeight is not applied to the non-picture part color image, and VisualWeight is applied to the picture part image to the picture part image. As described above, the first and second embodiments are realized by using VisualWeight.
[Modification 1]
Modification 1 will be described. A case where the wavelet transform of the background R and the foreground P ′ is performed using a 9 * 7 filter in step S16 and step S22 in FIG. 9 will be described. In this case, by using the values shown in FIGS. 24 and 26, the number of quantization steps including normalization is calculated by the above equation (7), and the frequency coefficient is quantized. However, in the case of the second modification, not all the bit planes of the quantized coefficients are encoded, but the lower sub-bit planes of the number shown in FIG. 33 are not encoded. Here, t in FIG. 33 is an arbitrary integer, and the code amount itself can be selected depending on the magnitude of t.

上述したように、量子化後の係数は（ビットプレーン１枚あたりの）数学的な誤差が正規化され、かつ視覚的な重みも反映されたものであるから、全ての周波数において、同じ数の枚数だけのサブビットプレーンの符号を省略すれば、バランスがとれた主観画質が得られる。ただし本例においては、あくまでＬＬのサブビットプレーンは全て符号化することにする。
［変形例２］
次に変形例２について説明する。変形例２では、背景Ｒのウェーブレット変換を５＊３フィルタを用いて、行う場合を説明する。下位ビットプレーンの符号化の省略によって量子化を行う例である。
図３４は、５＊３フィルタを用いて、ウェーブレット変換の周波数ゲインの平方根である。また、図３５は、変形例２における、可逆色変換(RCT)のゲインの平方根である。上述したとおり、周波数ごとの係数を「周波数変換ゲイン×色変換ゲイン」の平方根の逆数（図３６に示す）で除算しておけば、各係数に生じた量子化誤差の影響を、周波数間かつコンポ−ネント間で均等にすることができるが、本例では線形量子化を用いずに、下位ビットプレーンの符号化の省略によって量子化を行う。つまり、式（７）を用いて、図３６の定数倍の値に、さらにVisual Weightを反映し、これを２のべき乗に変換した場合の指数（ただし整数に近似した値）として、符号化を省略するビットプレーン数を算出する。 As mentioned above, the quantized coefficients are normalized for mathematical errors (per bit plane) and also reflect visual weights, so the same number of coefficients at all frequencies. If the number of sub-bit planes corresponding to the number of sheets is omitted, balanced subjective image quality can be obtained. However, in this example, all LL sub-bit planes are encoded.
[Modification 2]
Next, Modification 2 will be described. In the second modification, a case where the wavelet transform of the background R is performed using a 5 * 3 filter will be described. This is an example in which quantization is performed by omitting encoding of the lower bit plane.
FIG. 34 shows the square root of the frequency gain of the wavelet transform using a 5 * 3 filter. FIG. 35 shows the square root of the gain of reversible color conversion (RCT) in the second modification. As described above, if the coefficient for each frequency is divided by the reciprocal of the square root of “frequency conversion gain × color conversion gain” (shown in FIG. 36), the influence of the quantization error generated in each coefficient can be reduced between frequencies and Although it can be made uniform between components, in this example, quantization is performed by omitting encoding of the lower bit plane without using linear quantization. That is, using Equation (7), the Visual Weight is further reflected in the value of the constant multiple in FIG. 36, and this is converted into an exponent (a value approximated to an integer) when this is converted to a power of 2. Calculate the number of bit planes to be omitted.

以上より図３７は、変形例２における、前景Ｐ'のため符号化しないビットプレーン数である。これは式（６）において、ＶｉｓｕａｌＷｅｉｇｈｔを各周波数・コンポ−ネントについて１とし（＝ＶｉｓｕａｌＷｅｉｇｈｔを適用せずに）、かつｎ＝６４として、２のべき乗に変換した場合の指数として算出したものである。 As described above, FIG. 37 shows the number of bit planes that are not encoded because of the foreground P ′ in the second modification. This is calculated as an exponent when converting Visual Weight to 1 for each frequency / component in equation (6) (= Not applying Visual Weight) and n = 64. It is.

一方、図３８は、本変形例３における背景Ｑのための実質的な量子化ステップ数である。これは式（６）において、Ｖｉｓｕａｌｄｉｓｔａｎｃｅ１０００のＶｉｓｕａｌＷｅｉｇｈｔを各周波数・コンポ−ネントについて適用し、かつｎ＝６４として、２のべき乗に変換した場合の指数として算出したものである。尚、RCT用のＶｉｓｕａｌＷｅｉｇｈｔは規定されていないため、本実施例ではＩＣＴ用のものを流用している。 On the other hand, FIG. 38 shows the substantial number of quantization steps for the background Q in the third modification. This is calculated as an exponent when the Visual Weight of Visual distance 1000 is applied to each frequency / component in Equation (6) and converted to a power of 2 with n = 64. In addition, since the Visual Weight for RCT is not prescribed | regulated, the thing for ICT is diverted in a present Example.

尚、JPMにおける、背景に対する各レイアウトオブジェクトの配置や大きさの情報であるが、図３９の様に、レイアウトオブジェクトのIDや重ね順、縦横の画素数、背景に対する配置上のオフセットは、Layout Object Header Box に記載する。 Note that the layout and layout information of each layout object with respect to the background in JPM. As shown in FIG. 39, the layout object ID, the stacking order, the number of vertical and horizontal pixels, and the layout offset with respect to the background are the layout object. Enter in the Header Box.

また、各レイアウトオブジェクトを構成する前景、背景の配置や大きさの情報であるが、図４０の様に、背景に対する配置上のオフセットは、Object Header Box に記載する。各オブジェクトの画素数は、各オブジェクトの符号中に記載する。 In addition, as for the foreground and background arrangement and size information constituting each layout object, the arrangement offset with respect to the background is described in the Object Header Box as shown in FIG. The number of pixels of each object is described in the code of each object.

また、以上の実施例ではファイルフォーマットとしてＪＰＭを用いたが、ＭＲＣタイプであれば何でも良く、ＭＲＣタイプのＰＤＦもその範疇である。 In the above embodiment, JPM is used as the file format. However, any MRC type may be used, and MRC type PDF is also in that category.

また以上の実施例では、周波数変換にウェーブレット変換を用いたが、これに限定されるものではなく、離散コサイン変換に代表される他の周波数変換の適用が可能なことは明らかである。また、本願請求項における「高周波成分」とは、最も周波数の高い帯域(の係数)だけを指す場合と、少なくとも最も周波数の高い帯域（の係数）を含み最も低域の帯域(の係数)は含まない場合、の２通りが考えられる。
［プログラム］
図４１に、本発明による状態通知装置の主要構成を示すハードウェア構成図である。状態通知装置は、主要な構成として、ＣＰＵ１０１、ＲＯＭ（Read Only Memory）２０２、ＲＡＭ（Random Access Memory）１０２、補助記憶部２０４、外部記憶装置Ｉ／Ｆ部２０５、入力部２０６、表示部２０７、及び通信装置２０８を含む構成である。 In the above embodiment, the wavelet transform is used for the frequency transform. However, the present invention is not limited to this, and it is apparent that other frequency transforms represented by the discrete cosine transform can be applied. In addition, the term “high frequency component” in the claims of the present application refers to only the highest frequency band (coefficient), and at least the highest frequency band (coefficient) and the lowest frequency band (coefficient) When not included, two ways are conceivable.
[program]
FIG. 41 is a hardware configuration diagram showing the main configuration of the status notification device according to the present invention. The state notification device includes, as main components, a CPU 101, a ROM (Read Only Memory) 202, a RAM (Random Access Memory) 102, an auxiliary storage unit 204, an external storage device I / F unit 205, an input unit 206, a display unit 207, And the communication device 208.

ＣＰＵ２０１は、マイクロプロセッサ及びその周辺回路から構成され、状態通知装置全体を制御する回路である。また、ＲＯＭ２０２は、ＣＰＵ２で実行される所定の制御プログラム（ソフトウェア部品）を格納するメモリであり、ＲＡＭ２０３は、ＣＰＵ２０１がＲＯＭ２０２に格納された所定の制御プログラム（ソフトウェア部品）を実行して各種の制御を行うときの作業エリア（ワーク領域）として使用するメモリである。 The CPU 201 is composed of a microprocessor and its peripheral circuits, and is a circuit that controls the entire state notification device. The ROM 202 is a memory that stores a predetermined control program (software component) executed by the CPU 2, and the RAM 203 executes various control operations by the CPU 201 executing a predetermined control program (software component) stored in the ROM 202. This is a memory used as a work area (work area) when performing.

補助記憶部２０４は、汎用のＯＳ（Operating System）、本発明によるプロジェクト管理プログラム、タスク情報などのプロジェクトに関する情報を含む各種情報を格納する装置であり、不揮発性の記憶装置であるＨＤＤ（Hard Disk Drive）などが用いられる。なお、上記各種情報は、補助記憶装置２０４以外にも、ＣＤ−ＲＯＭ（Compact Disk - ROM）やＤＶＤ（Digital Versatile Disk）などの記憶媒体やその他のメディアを記憶されてもよく、これらの記憶媒体に格納された各種情報は、記憶媒体読取装置２０５などのドライブ装置を介して読み取ることが可能である。よって、必要に応じて記録媒体５１５を記憶媒体読取装置２０５にセットすることで、各種情報が得られる。 The auxiliary storage unit 204 is a device that stores various information including information related to a project such as a general-purpose OS (Operating System), a project management program according to the present invention, and task information, and is an HDD (Hard Disk) that is a nonvolatile storage device. Drive) is used. In addition to the auxiliary storage device 204, the various types of information may be stored in a storage medium such as a CD-ROM (Compact Disk-ROM) and a DVD (Digital Versatile Disk), and other media. Various information stored in the storage medium can be read via a drive device such as the storage medium reader 205. Therefore, various information can be obtained by setting the recording medium 515 in the storage medium reader 205 as necessary.

入力装置２０６は、ユーザが各種入力操作を行うための装置である。入力装置２０６は、マウス、キーボード、表示装置２０７の表示画面上に重畳するように設けられたタッチパネルスイッチなどを含む。 The input device 206 is a device for the user to perform various input operations. The input device 206 includes a mouse, a keyboard, a touch panel switch provided so as to be superimposed on the display screen of the display device 207, and the like.

表示部２０は、プロジェクト管理に係る各種データを表示画面に表示する装置である。例えば、ＬＣＤ(Liquid Crystal Display)、ＣＲＴ(Cathode Ray Tube)などから構成される。 The display unit 20 is a device that displays various data related to project management on a display screen. For example, it is composed of LCD (Liquid Crystal Display), CRT (Cathode Ray Tube) and the like.

２・・・分解部
４・・・ＤＣレベルシフト部
８・・・周波数変換部
１０・・・量子化部
１２・・・符号化部
１４・・・パケット生成部 2 ... Decomposition unit 4 ... DC level shift unit 8 ... Frequency conversion unit 10 ... Quantization unit 12 ... Encoding unit 14 ... Packet generation unit

Ｈ１１−１７７９７７号公報H11-177777 ２００７−００５８４４号公報No. 2007-005844

Claims

From the original image, a pattern portion image in which the pattern of the original image is displayed, and a non-pattern portion color image in which a non-pattern color image of the original image and a specific image occupying a position other than the color image of the non-pattern are displayed A generating unit for generating a mask image on which the shape of the non-pattern is displayed ;
A frequency converter that converts the frequency of the pattern part image and the non-pattern part color image;
A quantization unit that quantizes the frequency coefficient of the picture part image and the frequency coefficient of the non-picture part color image obtained by the frequency conversion unit ;
An encoding unit that encodes the frequency coefficient of the quantized picture part image and the frequency coefficient of the non-picture part color image;
The quantization, except for normalization for frequency gain,
Wherein the quantization step number of the frequency coefficients of the high frequency band of the non-picture portion color images, the encoding apparatus characterized Reduces the than the quantization step number of the frequency coefficients of the high frequency band of the picture portion the image.

From the original image, a pattern portion image in which the pattern of the original image is displayed, and a non-pattern portion color image in which a non-pattern color image of the original image and a specific image occupying a position other than the color image of the non-pattern are displayed A generating unit for generating a mask image on which the shape of the non-pattern is displayed ;
The pattern portion image, the non-picture portion color image by converting luminance and color difference, the luminance and chrominance of the picture portion image, and the luminance and color difference conversion unit that generates a luminance and color difference of the non-picture portion color images,
Luminance and chrominance of the picture portion image, and a frequency converter for frequency-converting the luminance and chrominance of the non-picture portion color images,
Luminance and chrominance frequency coefficients of the picture portion image obtained by the frequency conversion unit, and a quantization unit for quantizing the luminance and color difference frequency coefficients of the non-picture portion color images,
Luminance and chrominance frequency coefficients of the quantized picture unit image, and has an encoding unit for encoding the luminance and chrominance frequency coefficients of the quantized non-picture portion color images,
The quantization includes quantization for normalization for frequency gain and quantization for normalization of luminance / chrominance conversion gain,
Wherein the non-picture portion color images, the number of quantization steps of the quantization step number / luminance color difference, the picture part image, smaller remote by number of quantization steps of the quantization step number / luminance color difference An encoding device.

From the original image, a pattern portion image in which the pattern of the original image is displayed, a non-pattern portion color image in which the non-pattern color image of the original image is displayed, and a mask image in which the shape of the non-pattern is displayed , a generating unit that generates a,
The pattern portion image, the non-picture portion color image by converting luminance and color difference, the luminance and chrominance of the picture portion image, and the luminance and color difference conversion unit that generates a luminance and color difference of the non-picture portion color images,
Luminance and chrominance of the picture portion image, and a frequency converter for frequency-converting the luminance and chrominance of the non-picture portion color images,
Luminance and chrominance frequency coefficients of the picture portion image after frequency conversion which is obtained by the frequency conversion unit, and a quantization unit for quantizing the luminance and color difference frequency coefficients of the non-picture portion color images,
Luminance and chrominance frequency coefficients of the quantized picture unit image, and has an encoding unit for encoding the luminance and chrominance frequency coefficients of the non-picture portion color images,
The quantization includes quantization for normalization for frequency gain and quantization for normalization of luminance / chrominance conversion gain,
An encoding apparatus characterized in that the frequency coefficient of the high-frequency band of luminance of the non-picture part color image is most quantized and the frequency coefficient of the high-frequency band of color difference of the picture part image is most quantized.

The quantization unit, the encoding apparatus according to any one of claims 1 to 3, characterized in that the lower bits of discarded of frequency coefficients.

The frequency conversion unit divides each subband by frequency conversion,
Calculating a weighting coefficient for the quantization for each subband, and having a weighting unit that performs weighting with the weighting coefficient,
5. The encoding device according to claim 1, wherein the weighting unit sets the visual distance of the non-picture part color image to the visual distance of the picture part image when calculating the weighting coefficient.

The encoding unit employs the JPEG2000 system,
The encoding device according to claim 1, wherein VisualWeight is not applied to the non-picture part color image, and VisualWeight is applied to the picture part image.

The program for functioning a computer as an encoding apparatus in any one of Claims 1-6.

A computer-readable recording medium on which the program according to claim 7 is recorded.