JP7390788B2

JP7390788B2 - Image encoding device and its control method and program

Info

Publication number: JP7390788B2
Application number: JP2018238646A
Authority: JP
Inventors: 智恵菊地
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2018-12-20
Filing date: 2018-12-20
Publication date: 2023-12-04
Anticipated expiration: 2038-12-20
Also published as: JP2020102704A

Description

本発明は画像データの圧縮符号化技術に関するものである。 The present invention relates to a compression encoding technique for image data.

従来から、顔認識などで抽出された領域や、像域分離により分離された文字・線画領域を注目領域：ＲＯＩ(Region of Interest)として特定し、ＲＯＩ以外の領域の符号量を削減し、圧縮する技術が様々知られている。ＲＯＩの圧縮技術として、より簡易なものは、画像をタイルなどの矩形領域に分割し、ＲＯＩ領域が含まれる矩形領域とそれ以外の矩形領域で画質を変える方法が知られている。国際標準符号化方式であるＪＰＥＧ２０００を用いて、このようなＲＯＩ符号化を実現する方法は特許文献１に記載されている。 Conventionally, regions extracted by face recognition, etc., and character/line drawing regions separated by image region separation are identified as regions of interest (ROI), and the amount of code in regions other than the ROI is reduced and compressed. Various techniques are known to do this. As a simpler ROI compression technique, a method is known in which an image is divided into rectangular regions such as tiles, and the image quality is changed between the rectangular regions that include the ROI region and the other rectangular regions. A method for realizing such ROI encoding using JPEG2000, which is an international standard encoding method, is described in Patent Document 1.

さらに、ＪＰＥＧ２０００では、離散ウェーブレット変換(DWT:Discrete wavelet transform)を用い、ＲＯＩに含まれるＤＷＴ変換係数をビットシフトするマックスシフト法と呼ばれる方式により画素単位でＲＯＩ領域を特定し符号化する方法が知られている。この仕組みを使ったＲＯＩ符号化を実現する方法は特許文献２に記載されている。 Furthermore, in JPEG2000, there is a method known for identifying and encoding the ROI region pixel by pixel using a method called the max shift method, which bit-shifts the DWT transform coefficients included in the ROI using discrete wavelet transform (DWT). It is being A method for realizing ROI encoding using this mechanism is described in Patent Document 2.

特開２００３－３３９０４７号公報Japanese Patent Application Publication No. 2003-339047 特開２００４－２３５９３５号公報Japanese Patent Application Publication No. 2004-235935

ＲＯＩ符号化といっても、ＲＯＩ領域もある程度の画質劣化を許容できる場合と、ＲＯＩ領域はロスレス圧縮する方が望ましい場合とがある。 Although it is referred to as ROI encoding, there are cases in which a certain degree of image quality deterioration can be tolerated in the ROI region, and cases in which it is desirable to perform lossless compression on the ROI region.

たとえば、文字・線画領域は劣化により認識できなくなる恐れがあり、ロスレス圧縮することが望ましい。あるいは、複数のカメラで撮影された人物の多視点映像から自由視点画像を合成する場合には、被写体の詳細について対応点を取る必要があり、各カメラの撮影画像における自由視点画像合成の対象物の領域はロスレス圧縮とすることが望ましい。 For example, text/line drawing areas may become unrecognizable due to deterioration, so it is desirable to perform lossless compression. Alternatively, when synthesizing a free-viewpoint image from multi-viewpoint videos of a person shot by multiple cameras, it is necessary to take corresponding points for details of the subject, and the target of free-viewpoint image synthesis in the images shot by each camera It is desirable to use lossless compression for the area.

しかしながら、ＲＯＩ領域のロスレス圧縮を想定すると、あらかじめ決められた矩形領域を単位としてＲＯＩ符号化を行うような従来技術では、符号データの削減が困難である。なぜなら、矩形領域サイズに対して重要領域が小さい場合には、ロスレス圧縮と判定される矩形領域が大きくなり、ＲＯＩ領域外のデータもロスレス圧縮されてしまうため、符号量を小さくすることが難しい。逆に、ＲＯＩ領域として設定可能な最小領域サイズに合わせて矩形領域サイズを決定すると、各矩形領域に付随するヘッダデータが符号量全体に占める割合が大きくなりすぎるため、やはり符号量を小さくすることが難しくなる。矩形領域サイズをある程度の大きさに限定し、ライン単位で量子化ステップを制御することも可能である。しかし、その場合は、高さ方向のみの制御になり、矩形領域の高さと重要領域の高さが同じになると、その矩形領域をロスレス圧縮することになり、符号量を小さくすることができない。 However, assuming lossless compression of the ROI region, it is difficult to reduce encoded data using conventional techniques that perform ROI encoding in units of predetermined rectangular regions. This is because if the important region is small relative to the rectangular region size, the rectangular region determined to be losslessly compressed becomes large, and data outside the ROI region is also losslessly compressed, making it difficult to reduce the amount of code. Conversely, if the rectangular area size is determined according to the minimum area size that can be set as the ROI area, the header data accompanying each rectangular area will occupy too much of the total code amount, so it is still necessary to reduce the code amount. becomes difficult. It is also possible to limit the rectangular area size to a certain size and control the quantization step on a line-by-line basis. However, in that case, control is performed only in the height direction, and if the height of the rectangular area and the height of the important area become the same, the rectangular area will be losslessly compressed, making it impossible to reduce the amount of code.

一方、ＪＰＥＧ２０００のマックスシフト法により、画素単位でＲＯＩ領域を決定すると、符号データの無駄は生じない。しかし、係数全体をビットシフトして、ＲＯＩ領域以外の下位ビットを削除することは、２のｎ乗の量子化に相当し、ロッシー領域の画質を細かく制御できないという課題がある。さらに、ＲＯＩが設定されたタイルはデコード時に、シフトした分だけ戻す処理が必要になるため、ＲＯＩが設定されているタイルとそうではないタイルでデコード処理が異なってしまうという課題もある。 On the other hand, if the ROI region is determined pixel by pixel using the JPEG2000 max shift method, no code data is wasted. However, bit-shifting the entire coefficient and deleting lower bits outside the ROI region corresponds to quantization of 2 to the nth power, and there is a problem in that the image quality of the lossy region cannot be precisely controlled. Furthermore, when decoding a tile to which an ROI has been set, it is necessary to perform processing to return the shifted amount, so there is a problem in that the decoding process is different between tiles with an ROI and tiles without such a setting.

本発明は、かかる課題に鑑み成されたものであり、注目領域を含む画像データを、その注目領域については高画質を維持しつつ、且つ、注目領域外の符号量制御を可能にするだけでなく、注目領域、注目領域外を同じアルゴリズムで効率よく復号可能な符号化データを生成させる技術を提供しようとするものである。 The present invention has been made in view of the above problems, and it is possible to maintain high image quality of image data including a region of interest while controlling the amount of code outside the region of interest. Rather, the aim is to provide a technique for generating coded data that can be efficiently decoded using the same algorithm for both the region of interest and the outside of the region of interest.

この課題を解決するため、例えば本発明の画像符号化装置は以下の構成を備える。すなわち、
画像符号化装置であって、
符号化対象画像データにおける注目領域を設定する設定手段と、
前記符号化対象画像データに対し可逆なフィルタを用いてウェーブレット変換を行うことで、複数のサブバンドを得る変換手段と、
前記変換手段で得た各サブバンドに含まれる係数を、ライン単位に量子化パラメータに従って量子化する量子化手段と、
量子化して得た各サブバンドの量子化後の係数を、ライン単位にエントロピー符号化し、前記量子化パラメータを特定する情報が付加された符号化データを生成する符号化手段と、
前記量子化手段及び符号化手段を制御する制御手段とを有し、
前記制御手段は、サブバンド毎に、
係数で構成される着目ラインが、前記注目領域内の画素を参照して生成された係数を含むか否かを判定する判定手段と、
前記量子化手段で得た量子化後の係数を修正する修正手段とを含み、
前記制御手段は、
前記判定手段が、前記着目ラインが前記注目領域内の画素を参照して生成された係数を含まないと判定した場合、前記量子化手段に対して前記着目ラインの各係数を前記量子化パラメータで量子化するよう制御すると共に、前記符号化手段に対して前記量子化パラメータを特定する情報を含む符号化データを生成させ、
前記判定手段が、前記着目ラインが前記注目領域内の画素を参照して生成された係数を含むと判定した場合、前記量子化手段に対し、前記着目ラインの係数のうち前記注目領域内の画素を参照して生成された係数に対してはその値を維持し、前記注目領域内の画素を参照しないで生成された係数に対しては前記量子化パラメータで量子化するよう制御し、且つ、前記修正手段に対して、前記量子化手段で得た量子化後の係数を前記量子化パラメータで逆量子化させることで修正させ、前記符号化手段に対して前記着目ラインに対して量子化ステップ幅が"１"であることを表す情報を含む符号化データを生成させることを特徴とする。 In order to solve this problem, for example, the image encoding device of the present invention has the following configuration. That is,
An image encoding device,
a setting means for setting a region of interest in image data to be encoded;
Transforming means for obtaining a plurality of subbands by performing wavelet transformation on the encoding target image data using a reversible filter;
quantization means for quantizing the coefficients included in each subband obtained by the conversion means on a line-by-line basis according to a quantization parameter;
encoding means for entropy encoding the quantized coefficients of each subband obtained by quantization on a line-by-line basis to generate encoded data to which information specifying the quantization parameter is added;
and a control means for controlling the quantization means and the encoding means,
The control means, for each subband,
determining means for determining whether a line of interest made up of coefficients includes a coefficient generated with reference to pixels in the region of interest;
correction means for correcting the quantized coefficients obtained by the quantization means,
The control means includes:
If the determining means determines that the line of interest does not include a coefficient generated by referring to pixels within the region of interest, the determining means causes the quantizing means to calculate each coefficient of the line of interest using the quantization parameter. controlling the quantization and causing the encoding means to generate encoded data including information specifying the quantization parameter;
If the determining means determines that the line of interest includes a coefficient generated by referring to the pixel within the region of interest, the determining means instructs the quantizing means to determine which of the coefficients of the line of interest includes the coefficient generated by referring to the pixel within the region of interest. The coefficients generated by referring to the pixel in the region of interest are controlled to maintain their values, and the coefficients generated without referring to the pixels in the region of interest are quantized using the quantization parameter, and , the correction means corrects the quantized coefficients obtained by the quantization means by inversely quantizing them using the quantization parameter, and the encoding means quantizes the line of interest. The method is characterized in that encoded data including information indicating that the step width is "1" is generated.

本発明によれば、注目領域を含む画像データを、その注目領域については高画質を維持しつつ、且つ、注目領域外の符号量の制御を可能にするだけでなく、注目領域、注目領域外を同じアルゴリズムで効率よく復号可能な符号化データを生成することができる。 According to the present invention, image data including a region of interest can be processed while maintaining high image quality for the region of interest, as well as controlling the amount of code outside the region of interest. It is possible to generate encoded data that can be efficiently decoded using the same algorithm.

第１の実施形態における符号化装置の構成を示すブロック図。FIG. 1 is a block diagram showing the configuration of an encoding device in a first embodiment. 第１の実施形態における画像の符号化処理を示すフローチャート。5 is a flowchart showing image encoding processing in the first embodiment. 第１の実施形態における符号化対象画像におけるＲＯＩと、ＤＷＴを説明するための図。FIG. 3 is a diagram for explaining the ROI and DWT in the encoding target image in the first embodiment. 各サブバンドの量子化ステップと、符号化データの構造を示す図。FIG. 3 is a diagram showing the quantization step of each subband and the structure of encoded data. 第１の実施形態におけるＲＯＩ用の符号化処理を示すフローチャート。5 is a flowchart showing encoding processing for ROI in the first embodiment. 第１の実施形態におけるＲＯＩ非参照変換係数の修正処理を示すフローチャート。7 is a flowchart illustrating ROI non-reference transformation coefficient correction processing in the first embodiment. 第１の実施形態におけるサブバンドのＤＷＴ変換係数の符号化処理を示すフローチャート。5 is a flowchart showing encoding processing of subband DWT transform coefficients in the first embodiment. 注目変換係数と、周辺の符号化済みの変換係数との位置関係を示す図。FIG. 3 is a diagram showing a positional relationship between a transformation coefficient of interest and surrounding encoded transformation coefficients. 第２の実施形態におけるＲＯＩ非参照変換係数の修正処理を示すフローチャート。7 is a flowchart illustrating ROI non-reference transformation coefficient correction processing in the second embodiment.

以下、添付図面に従って本発明に係る実施形態を詳細に説明する。なお、以下の実施形態は本発明を限定するものではなく、また、本実施形態で説明されている特徴の組み合わせの全てが本発明の解決手段に必須のものとは限らない。なお、同一の構成については、同じ符号を付して説明する。 DESCRIPTION OF EMBODIMENTS Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings. Note that the following embodiments do not limit the present invention, and not all combinations of features described in the present embodiments are essential to the solution of the present invention. Note that the same configurations will be described using the same reference numerals.

［第１の実施形態］
図１（ａ）は本実施形態における注目領域（ＲＯＩ: Region Of Interest）を含む画像の符号化を行う画像符号化装置のブロック構成図である。この画像符号化装置は、ＣＰＵ１０１、入力部１０２、ＲＯＩ設定部１０３、符号データ生成部１０４、表示部１０５、メモリ１０６、及び、蓄積部１０７を有する。 [First embodiment]
FIG. 1A is a block configuration diagram of an image encoding apparatus that encodes an image including a region of interest (ROI) in this embodiment. This image encoding device includes a CPU 101 , an input section 102 , an ROI setting section 103 , a code data generation section 104 , a display section 105 , a memory 106 , and a storage section 107 .

ＣＰＵ１０１は、各構成の処理にかかわり、装置全体の制御を司る。入力部１０２は、ユーザからの指示や、符号化対象の画像データなどを入力する。このため、入力部１０２は、キーボードやマウスなどのポインティングデバイスを含み、画像データを入力するためのインターフェースをふくむ。ＲＯＩ設定部１０３は、符号化対象の画像データ中の注目領域ＲＯＩを設定する。符号データ生成部１０４は、符号化対象の画像データが有する画素値に対し、離散ウェーブレット変換（ＤＷＴ：Discrete Wavelet Transform）を実行し、得られたＤＷＴ変換係数を、ランレングス符号化と予測符号化のいずれかを適宜切り替えて符号化する（詳細後述）。表示部１０５は、通常は液晶ディスプレイなどが用いられる。メモリ１０６は、ＲＯＭ、ＲＡＭで構成され、ＣＰＵ１０１が実行するプログラムや各種パラメータの格納、並びに、ＣＰＵ１０１のワークエリアとして使用される。蓄積部１０７は、画像データ、プログラムなどを蓄積する部分で、通常はハードディスクなどが用いられる。また、後述するフローチャートの処理に必要な制御プログラムは、蓄積部１０７に格納されているか、メモリ１０６のＲＯＭに格納されているものとする。蓄積部１０７に格納されている場合は、ＣＰＵ１０１は、一旦メモリ１０６内のＲＡＭにそのプログラムを読み出して実行される。なお、システム構成については、上記以外にも様々な構成要素が存在するが、本発明の主眼ではないので、その説明は省略する。 The CPU 101 is involved in the processing of each component and controls the entire device. The input unit 102 inputs instructions from the user, image data to be encoded, and the like. Therefore, the input unit 102 includes a pointing device such as a keyboard and a mouse, and includes an interface for inputting image data. The ROI setting unit 103 sets a region of interest ROI in the image data to be encoded. The code data generation unit 104 performs discrete wavelet transform (DWT) on pixel values of image data to be encoded, and performs run-length encoding and predictive encoding on the obtained DWT transform coefficients. The code is encoded by switching between the two as appropriate (details will be described later). As the display unit 105, a liquid crystal display or the like is normally used. The memory 106 is composed of a ROM and a RAM, and is used to store programs executed by the CPU 101 and various parameters, and as a work area for the CPU 101. The storage unit 107 is a part that stores image data, programs, etc., and usually uses a hard disk or the like. Further, it is assumed that a control program necessary for the processing of the flowchart described later is stored in the storage unit 107 or in the ROM of the memory 106. If the program is stored in the storage unit 107, the CPU 101 once reads the program into the RAM in the memory 106 and executes it. Regarding the system configuration, there are various components other than those described above, but since they are not the main focus of the present invention, their explanation will be omitted.

実施形態の符号データ生成部１０４は、符号化対象の画像データをＤＷＴし、得られたＤＷＴ変換係数を量子化し、符号化する。そして、符号化対象の画像データにおけるＲＯＩ内の画素をロスレス（可逆）符号化し、ＲＯＩ外の画素はロッシー（非可逆）符号化する。このため、ウェーブレット変換では可逆な５－３タップのＤＷＴフィルタを用いるものとするが、フィルタの種類やサイズはこれに限定されるものでない。ＲＯＩ内の画素をロスレス符号化するためには、ＲＯＩ内の画素を参照して生成されたＤＷＴ変換係数をロスレス符号化すればよい。言い換えれば、フィルタが持つ５入力の少なくとも１つが、ＲＯＩの画素データを入力した場合、そのフィルタで算出されるＤＷＴ変換係数をロスレス符号化する。そして、ＲＯＩ内の画素を参照しないで生成されたＤＷＴ変換係数についてはロッシー符号化を許容する。 The encoded data generation unit 104 of the embodiment performs DWT on the image data to be encoded, quantizes the obtained DWT transform coefficients, and encodes them. Then, pixels within the ROI in the image data to be encoded are losslessly (reversibly) encoded, and pixels outside the ROI are lossy (irreversibly) encoded. For this reason, a reversible 5-3 tap DWT filter is used in the wavelet transform, but the type and size of the filter is not limited to this. In order to losslessly encode pixels within the ROI, DWT transform coefficients generated with reference to pixels within the ROI may be losslessly encoded. In other words, when at least one of the five inputs of the filter receives pixel data of the ROI, the DWT transform coefficients calculated by that filter are losslessly encoded. Lossy encoding is allowed for DWT transform coefficients generated without referring to pixels within the ROI.

図１（ｂ）は、符号データ生成部１０４のブロック構成図である。符号データ生成部１０４は、ＤＷＴ部１１０、量子化部１１１、符号化方式選択部１１２、ランレングス符号化部１１３、予測符号化部１１４、符号列生成部１１６、及び、スイッチ１０９、１１０を含む。 FIG. 1(b) is a block diagram of the code data generation section 104. Code data generation section 104 includes a DWT section 110, a quantization section 111, an encoding method selection section 112, a run-length encoding section 113, a predictive encoding section 114, a code string generation section 116, and switches 109 and 110. .

ＤＷＴ部１１０は、符号化対象の画像データを１ライン単位に入力し、その入力したライン（以下、注目ラインという）に含まれる各画素に対してＤＷＴフィルタを用いてＤＷＴを行い、複数のサブバンドを生成する。 The DWT unit 110 inputs image data to be encoded line by line, performs DWT on each pixel included in the input line (hereinafter referred to as the line of interest) using a DWT filter, and performs DWT on each pixel included in the input line (hereinafter referred to as the line of interest). Generate a band.

量子化部１１１は、ＲＯＩ領域設定部１０３から供給されるＲＯＩ領域情報を参照して、注目サブバンド内の注目ラインがＲＯＩ内の画素を参照して生成されたＤＷＴ変換係数を含むか否かを判定する。なお、以降、ＤＷＴ変換係数を単に変換係数と呼ぶ。そして、ＲＯＩ内（注目領域内）の画素を参照して生成されたＤＷＴ変換係数をＲＯＩ参照変換係数と記し、ＲＯＩ内の画素を参照しない、すなわち、ＤＷＴフィルタに入力される全画素がＲＯＩ外の画素であった場合に生成されたＤＷＴ変換係数をＲＯＩ非参照変換係数と呼ぶ。 The quantization unit 111 refers to the ROI area information supplied from the ROI area setting unit 103 and determines whether the line of interest in the subband of interest includes a DWT transform coefficient generated by referring to pixels in the ROI. Determine. Note that, hereinafter, the DWT transform coefficients will be simply referred to as transform coefficients. The DWT transformation coefficients generated by referring to pixels within the ROI (within the region of interest) are referred to as ROI reference transformation coefficients, and pixels within the ROI are not referred to, that is, all pixels input to the DWT filter are outside the ROI. The DWT transform coefficients generated when the pixel is the ROI non-reference transform coefficient are called ROI non-reference transform coefficients.

注目ラインがＲＯＩ参照変換係数を含まない場合、量子化部１１１は、予め設定された量子化パラメータに従って注目ラインより得た全サブバンドの変換係数を量子化し、量子化後の変換係数を、スイッチ１０９を介して符号化方式選択部１１２に供給する。 If the line of interest does not include ROI reference transform coefficients, the quantization unit 111 quantizes the transform coefficients of all subbands obtained from the line of interest according to preset quantization parameters, and converts the quantized transform coefficients into switches. 109 to the encoding method selection unit 112.

一方、注目ラインがＲＯＩ参照変換係数を含む場合、量子化部１１１は設定された量子化パラメータに従って注目ラインより得た全サブバンドの変換係数のうち、ＲＯＩ参照変換係数については量子化ステップ“１”で量子化（量子化しないのと等価）し、ＲＯＩ非参照変換係数については予め設定された量子化パラメータに従って量子化する。そして、量子化部１１１は、得られた量子化後の変換係数を、スイッチ１０９を介して、係数修正部１２０に供給する。 On the other hand, when the line of interest includes ROI reference transform coefficients, the quantization unit 111 performs quantization step "1" for the ROI reference transform coefficients among the transform coefficients of all subbands obtained from the line of interest according to the set quantization parameter. ” (equivalent to not quantizing), and ROI non-reference transform coefficients are quantized according to preset quantization parameters. Then, the quantization section 111 supplies the obtained quantized transform coefficients to the coefficient modification section 120 via the switch 109.

係数修正部１２０は、ＲＯＩ情報に基づき、量子化部１１１から供給された、注目ラインの変換係数のうち、ＲＯＩ非参照変換係数を修正する。具体的に説明すると、係数修正部１２０は、量子化部１１１から供給された、ＲＯＩ参照変換係数はその値を維持し、ＲＯＩ非参照変換係数については逆量子化する。この結果、注目ラインから得た各サブバンドの変換係数のうち、ＲＯＩ参照変換係数はＤＷＴの直後の値のままとなり、可逆性が維持される。また、ＲＯＩ非参照変換係数は量子化ステップ値に依存した誤差を含むので非可逆となるものの、ＤＷＴで得た係数のレベルまで復元した値となり、符号化効率に寄与したデータとすることができる。 The coefficient modification unit 120 modifies ROI non-reference transformation coefficients among the transformation coefficients of the line of interest supplied from the quantization unit 111 based on the ROI information. Specifically, the coefficient correction unit 120 maintains the values of the ROI reference transform coefficients supplied from the quantizer 111, and dequantizes the ROI non-reference transform coefficients. As a result, among the transform coefficients of each subband obtained from the line of interest, the ROI reference transform coefficient remains the value immediately after DWT, and reversibility is maintained. In addition, although ROI non-reference transform coefficients are irreversible because they include errors that depend on the quantization step value, they are values that have been restored to the level of the coefficients obtained by DWT, and can be data that contributes to encoding efficiency. .

なお、注目ラインの変換係数の符号化データを生成するとき、ライン単位に量子化パラメータを示す情報がそのラインの符号化データもヘッダに格納される。このため、係数修正部１２０を経た変換係数の符号化データのヘッダには、量子化ステップ幅が“１”であることを示す情報が格納される。また、係数修正部１２０を経ない、変換係数の符号化データのヘッダには、設定された量子化パラメータを示す情報が格納されることになる。 Note that when generating encoded data of transform coefficients of a line of interest, information indicating a quantization parameter for each line is also stored in the header of the encoded data of that line. Therefore, information indicating that the quantization step width is "1" is stored in the header of the encoded data of the transform coefficients that has passed through the coefficient correction unit 120. Furthermore, information indicating the set quantization parameter is stored in the header of the encoded data of the transform coefficients that has not been passed through the coefficient correction unit 120.

符号化方式選択部１１２は、変換係数ごとに、ランレングス符号化部１１３または、予測符号化部１１４のいずれかに出力する。ランレングス符号化部１１３、予測符号化部１１４は、それぞれに応じた符号化処理を行い、生成した符号データを、符号データ出力部１１５内のバッファ１１５ａに格納する。すべてのサブバンドの変換係数の符号データがバッファ１１５ａに格納されると、符号列生成部１１６は、その符号化データを所定の順に並べて、符号化列（符号ストリーム）として成形し、出力する。 Coding method selection section 112 outputs each transform coefficient to either run-length encoding section 113 or predictive encoding section 114. The run-length encoding section 113 and the predictive encoding section 114 perform respective encoding processes and store the generated encoded data in a buffer 115a in the encoded data output section 115. When the coded data of the transform coefficients of all the subbands are stored in the buffer 115a, the coded stream generation unit 116 arranges the coded data in a predetermined order, shapes it as a coded stream (coded stream), and outputs it.

本実施形態では、説明を単純化するため、符号化対象の画像データは単一成分（例えば輝度）で表され、１画素当たり８ビットで表されるモノクロ多値画像データとして説明する。ただし、１画素のビット数が１０ビット、１２ビット、１４ビットなど８ビット以外のビット数で輝度値を表現している画像データにも適用できる。また、１画素が、ＲＧＢやＣＭＹＫなどの複数の色成分で構成される場合には、成分毎に分離した画像データを符号化対象とすればよいので、符号化対象がモノクロ多値画像データに限定されるものではない。１画素８ビットで、モノクロ多値画像データとするのは、あくまで例示であると理解されたい。 In this embodiment, in order to simplify the explanation, the image data to be encoded will be explained as monochrome multivalued image data expressed by a single component (for example, luminance) and expressed by 8 bits per pixel. However, it can also be applied to image data in which the brightness value is expressed using a bit number other than 8 bits, such as 10 bits, 12 bits, 14 bits, etc., for each pixel. Furthermore, when one pixel is composed of multiple color components such as RGB and CMYK, the image data separated for each component can be encoded, so the encoding target is monochrome multi-valued image data. It is not limited. It should be understood that the monochrome multivalued image data with 8 bits per pixel is merely an example.

次に、実施形態における符号データ生成部１０４における画像の符号化処理を図２のフローチャートを参照して説明する。 Next, image encoding processing in the encoded data generation unit 104 in the embodiment will be described with reference to the flowchart of FIG. 2.

Ｓ２０１にて、符号データ生成部１０４は、入力部１０２より入力された符号化対象画像の水平方向の画素数Ｗと垂直方向の画素数Ｈを取得する。符号化対象の画像データが撮像素子から得られた画像データの場合には、その撮像素子の解像度（水平、垂直方向画素数）を示す情報をＣＰＵ１０１が、符号データ生成部１０４に設定すればよい。また、符号化対象画像データが、ファイル形式で保存されていれば、ＣＰＵ１０１は、そのファイルヘッダから、Ｗ，Ｈの情報を取得し、符号データ生成部１０４に設定すればよい。 In S201, the code data generation unit 104 obtains the number W of pixels in the horizontal direction and the number H of pixels in the vertical direction of the encoding target image input from the input unit 102. If the image data to be encoded is image data obtained from an image sensor, the CPU 101 may set information indicating the resolution (number of horizontal and vertical pixels) of the image sensor in the code data generation unit 104. . Furthermore, if the image data to be encoded is saved in a file format, the CPU 101 may acquire information on W and H from the file header and set it in the encoded data generation unit 104.

Ｓ２０２にて、符号データ生成部１０４は、ＲＯＩ設定部１０３からＲＯＩ領域情報（ＲＯＩの形状、位置を示す情報）を取得する。ＲＯＩ領域の特定法は、本願の主眼ではなく、如何なる手法を採用しても良い。例えば、入力部１０２から入力された画像から文字・線画領域や顔、特定の物体などの自動検出結果をＲＯＩ領域として自動的に取得してもよい。あるいは、マウスなどを使ってユーザが指定した任意の閉領域をＲＯＩ領域として取得してもよい。本実施形態においてはあらかじめ学習した特定の物体の自動検出結果をＲＯＩ領域として取得するものとし、図３（ａ）の領域３０１がＲＯＩ領域として取得されたものとする。 In S202, the code data generation unit 104 acquires ROI region information (information indicating the shape and position of the ROI) from the ROI setting unit 103. The method for specifying the ROI region is not the main focus of this application, and any method may be used. For example, results of automatic detection of text/line drawing areas, faces, specific objects, etc. from the image input from the input unit 102 may be automatically acquired as the ROI area. Alternatively, any closed region specified by the user using a mouse or the like may be acquired as the ROI region. In this embodiment, it is assumed that a pre-learned automatic detection result of a specific object is acquired as an ROI region, and the region 301 in FIG. 3(a) is acquired as the ROI region.

Ｓ２０３にて、符号データ生成部１０４は、ＤＷＴの実行回数と、ＤＷＴによって得られる各サブバンドを量子化する際の量子化ステップ値を特定する量子化パラメータを取得する。これらの値は、ユーザが予め入力部１０２により設定するものとする。サブバンド数は、『ＤＷＴの実行回数×３＋１』で求めることができる。本実施形態ではＤＷＴの実行回数は“２”としている。したがって、生成されるサブバンドの数は、図３（ｂ）に示すように、２ＬＬ，２ＨＬ，２ＬＨ，２ＨＨ，１ＨＬ，１ＬＨ，１ＨＨの計７つとなる。また、それらのサブバンドそれぞれに対して図４（ａ）に示す量子化ステップ値が決められており、符号データ生成部１０４は、これらの値を取得する。また、符号データ生成部１０４は、ＤＷＴの実行回数とＤＷＴフィルタサイズに基づき、変換係数がＲＯＩ参照変換係数、ＲＯＩ非参照変換係数のいずれであるかを判定する。 In S203, the code data generation unit 104 acquires the number of executions of DWT and a quantization parameter that specifies a quantization step value when quantizing each subband obtained by DWT. It is assumed that these values are set in advance by the user using the input unit 102. The number of subbands can be determined by "number of times DWT is executed x 3 + 1". In this embodiment, the number of times the DWT is executed is "2". Therefore, the number of generated subbands is 2LL, 2HL, 2LH, 2HH, 1HL, 1LH, and 1HH, a total of seven, as shown in FIG. 3(b). Further, the quantization step values shown in FIG. 4A are determined for each of these subbands, and the code data generation unit 104 acquires these values. Further, the code data generation unit 104 determines whether the transform coefficient is an ROI-referenced transform coefficient or an ROI-non-referenced transform coefficient based on the number of executions of DWT and the DWT filter size.

Ｓ２０４にて、符号データ生成部１０４は、符号化対象の画像データの垂直方向の位置を示すカウンタｙに“０”を代入し、初期化する。実施形態では、符号化対象の画像の垂直方向の画素数をＨとしているので、カウンタｙの値は０～Ｈ－１の範囲を取り得ることになる。 In S204, the encoded data generation unit 104 initializes the counter y by assigning "0" to the counter y indicating the vertical position of the image data to be encoded. In the embodiment, since the number of pixels in the vertical direction of the image to be encoded is H, the value of the counter y can range from 0 to H-1.

Ｓ２０５にて、符号データ生成部１０４は、ｙ行目の１ライン分の画像データをＤＷＴ部１１０内のラインバッファ（不図示）に取得する。 In S205, the code data generation unit 104 acquires the image data for one line of the y-th row into a line buffer (not shown) in the DWT unit 110.

そして、Ｓ２０６にて、符号データ生成部１０４はＤＷＴ部１１０を制御し、ラインバッファに格納された画像データに対してＪＰＥＧ２０００で採用されている整数型フィルタである、５－３タップフィルタを用いて、ＤＷＴを２回実行する。一般に、２次元ＤＷＴは１次元ＤＷＴを変換対象の画像の水平・垂直方向にそれぞれ適用することで実現できる。なお、水平方向のＤＷＴは１ライン分の画像データに対して実行できるが、垂直方向のＤＷＴを行う場合には、フィルタのサイズとＤＷＴの実行回数に依存したライン数の画像データが必要なる。つまり、実際には、ＤＷＴ部１１０には、その容量のバッファメモリを有し、入力したラインに対して或る程度遅延してＤＷＴを実行し、その結果を出力することになる。ここでは、単純化するため、１ライン分の画像データが入力されるたびに、そのラインから７つのサブバンドの変換係数が得られものとして説明を継続する。 Then, in S206, the code data generation unit 104 controls the DWT unit 110 to apply a 5-3 tap filter, which is an integer type filter adopted in JPEG2000, to the image data stored in the line buffer. , execute DWT twice. Generally, two-dimensional DWT can be realized by applying one-dimensional DWT to the horizontal and vertical directions of the image to be transformed. Note that horizontal DWT can be performed on one line of image data, but when vertical DWT is performed, image data whose number of lines depends on the filter size and the number of times DWT is performed is required. In other words, in reality, the DWT unit 110 has a buffer memory of that capacity, executes DWT with a certain delay with respect to the input line, and outputs the result. Here, for the sake of simplicity, the explanation will be continued on the assumption that seven subband transformation coefficients are obtained from each line of image data each time that line is input.

Ｓ２０７にて、符号データ生成部１０４は、ｙ行、すなわち、注目ラインの画像データから得た変換係数内に、ＲＯＩ参照変換係数があるか否かを判定する。注目ラインの変換係数内にＲＯＩ参照変換係数が存在すると判定した場合、符号データ生成部１０４は処理をＳ２１５に進め、ＲＯＩ用の符号化処理を実行する。このＳ２１５に処理が進むのが、図１（ｂ）における、量子化部１１１による量子化で得た変換係数がスイッチ１０９を介して係数修正部１２０に供給される場合に相当することになる。 In S207, the code data generation unit 104 determines whether or not there is an ROI reference transformation coefficient among the transformation coefficients obtained from the image data of the y row, that is, the line of interest. If it is determined that the ROI reference transformation coefficient exists in the transformation coefficients of the line of interest, the encoded data generation unit 104 advances the process to S215 and executes ROI encoding processing. Proceeding to S215 corresponds to the case in which the transform coefficients obtained by quantization by the quantizer 111 are supplied to the coefficient correction unit 120 via the switch 109 in FIG. 1(b).

一方、注目ラインから得た変換係数内にＲＯＩ参照変換係数が無いと判定した場合、すなわち、全変換係数がＲＯＩ非参照変換係数である場合、符号データ生成部１０４は、処理をＳ２０９へ進める。本実施形態の場合、図３（ａ）からもわかるように、最初の数ラインはＲＯＩ領域外であるので、初期段階の数ラインの符号化では、Ｓ２０７からＳ２０９に分岐する。このＳ２０９に処理が進むのが、図１（ｂ）における、量子化部１１１による量子化で得た変換係数が、ダイレクトに（係数修正部１２０の介在無し）、符号化方式選択部１１２に供給される場合に相当する。 On the other hand, if it is determined that there is no ROI-referenced transformation coefficient among the transformation coefficients obtained from the line of interest, that is, if all the transformation coefficients are ROI-non-referenced transformation coefficients, the code data generation unit 104 advances the process to S209. In the case of this embodiment, as can be seen from FIG. 3(a), the first few lines are outside the ROI region, so the encoding of the first few lines branches from S207 to S209. The process proceeds to S209 because the transform coefficients obtained by quantization by the quantizer 111 in FIG. This corresponds to the case where

Ｓ２０９にて、符号データ生成部１０４は、サブバンドのカウンタｓに“０”を代入して初期化する。本実施形態では、２ＬＬ乃至１ＨＨの７つのサブバンドに対して、それぞれカウンタｓの値“０”～“６”が割り当てられているものとする。また、カウンタｓが示すサブバンドを単にサブバンドｓと表記する。 In S209, the code data generation unit 104 initializes the subband counter s by assigning "0" to it. In this embodiment, it is assumed that counter s values "0" to "6" are assigned to seven subbands 2LL to 1HH, respectively. Further, the subband indicated by the counter s is simply referred to as subband s.

Ｓ２１０にて、符号データ生成部１０４は、量子化部１１１を制御し、サブバンドｓの変換係数を、Ｓ２０３で取得した、対応する量子化ステップを使って量子化する。 In S210, code data generation unit 104 controls quantization unit 111 to quantize the transform coefficient of subband s using the corresponding quantization step obtained in S203.

Ｓ２１１にて、符号データ生成部１０４は、サブバンドｓの量子化ステップ幅を示す情報を、サブバンドｓの符号データのヘッダに付加するため、符号データ出力部１１５内に予め確保されたサブバンドｓ用の一時保存バッファにその情報を格納する。 In S211, the code data generation unit 104 adds information indicating the quantization step width of the subband s to the header of the code data of the subband s. The information is stored in the temporary storage buffer for s.

Ｓ２１２にて、符号データ生成部１０４は、Ｓ２１０で生成されたサブバンドｓの量子化後の変換係数について、ランレングス符号化部１１３、予測符号化部１１４のいずれを用いるかの判定を行い、その判定結果に基づいて選択された符号化部を用いて符号化を行わせる。選択された符号化部は、生成した符号化データを、符号データ出力部１１５内のサブバンドｓのヘッダに後続するように格納する。 In S212, the code data generation unit 104 determines whether to use the run-length encoding unit 113 or the predictive encoding unit 114 for the quantized transform coefficient of the subband s generated in S210, The encoding unit selected based on the determination result is used to perform encoding. The selected encoding section stores the generated encoded data so as to follow the header of the subband s in the encoded data output section 115.

ここで、符号データ出力部１１５内に確保される各サブバンドの符号データ一時保存バッファは、図４（ｂ）に示すように、ラインの量子化ステップ幅を示す情報に続けて、そのラインの変換係数の符号データが保存される。そして、先頭ラインから最終ラインまで、量子化ステップ幅と符号データの組み合わせが順に保存されていく。このＳ２１２の符号化処理の詳細は、図７を参照して後述する。 Here, as shown in FIG. 4(b), the code data temporary storage buffer for each subband secured in the code data output unit 115 stores information indicating the quantization step width of the line. The code data of the transform coefficients is saved. Then, combinations of quantization step width and code data are stored in order from the first line to the last line. Details of the encoding process in S212 will be described later with reference to FIG.

Ｓ２１３にて、符号データ生成部１０４は、サブバンドカウンタｓに“１”を加算し、更新する。Ｓ２１４にて、符号データ生成部１０４は、カウンタｓとサブバンドの総数『１＋(ＤＷＴ回数×３)』を比較する。sがサブバンド総数と同じであれば、注目ラインに対する全サブバンドの符号化が終了したことになるので、処理をＳ２１６に進める。また、符号データ生成部１０４は、ｓとサブバンド総数が異なる場合、残りのサブバンドの符号化を行うため、処理をＳ２１０に戻す。 In S213, the code data generation unit 104 adds "1" to the subband counter s to update it. In S214, the code data generation unit 104 compares the counter s with the total number of subbands "1+(DWT number×3)". If s is the same as the total number of subbands, this means that encoding of all subbands for the line of interest has been completed, and the process advances to S216. Furthermore, if s and the total number of subbands are different, code data generation section 104 returns the process to S210 in order to encode the remaining subbands.

Ｓ２１６にて、符号データ生成部１０４は、ラインカウンタｙに“１”を加算し更新する。そして、Ｓ２１７にて、符号データ生成部１０４は、ラインカウンタｙと画像の高さＨを比較する。ｙとＨが一致していれば、全ラインについての符号化処理を終えたことになるので、処理をＳ２１８へ進める。ｙとＨが不一致の場合、符号データ生成部１０４は処理をＳ２０５へ戻し、次のラインを処理する。 In S216, the code data generation unit 104 adds "1" to the line counter y to update it. Then, in S217, the code data generation unit 104 compares the line counter y and the height H of the image. If y and H match, this means that the encoding process has been completed for all lines, and the process advances to S218. If y and H do not match, the code data generation unit 104 returns the process to S205 and processes the next line.

Ｓ２１８にて、符号データ生成部１０４は、符号列生成部１１６を制御し、各サブバンドの符号データを、あらかじめ決められた順に並べ、符号化データ列（ストリーム）を成形及び出力を行い、このフローを終了する。本実施形態では、図４（ｂ）に示すように、低周波サブバンドから順に並べた符号化データ列が生成、出力される。 In S218, the code data generation unit 104 controls the code string generation unit 116, arranges the code data of each subband in a predetermined order, forms and outputs a coded data string (stream), and End the flow. In this embodiment, as shown in FIG. 4(b), encoded data strings arranged in order from the low frequency subband are generated and output.

次に、Ｓ２１５の処理を図５のフローチャートを参照して説明する。 Next, the process of S215 will be explained with reference to the flowchart of FIG.

Ｓ５０１にて、符号データ生成部１０４は、ＲＯＩ内の画素を参照して生成された変換係数を識別するための、ロスレス係数位置情報Ｐ（ｓ, ｘ）を生成する。このロスレス係数位置情報Ｐ（ｓ,ｘ）における、ｓがサブバンドを、ｘが水平方向の位置を表す。符号データ生成部１０４は、サブバンドｓ、水平方向ｘに位置する変換係数が、ＲＯＩ参照変換係数であれば“１”を、そうでなく、ＲＯＩ非参照変換係数であれば“０”の値を持つＰ（ｓ, ｘ）を生成する。 In S501, the code data generation unit 104 generates lossless coefficient position information P(s, x) for identifying the transformation coefficient generated with reference to pixels within the ROI. In this lossless coefficient position information P(s,x), s represents a subband and x represents a position in the horizontal direction. The code data generation unit 104 sets a value of "1" if the transform coefficient located in the horizontal direction x in the subband s is an ROI-referenced transform coefficient, and a value of "0" if it is an ROI-non-referenced transform coefficient. Generate P(s, x) with .

Ｓ５０２にて、符号データ生成部１０４は、予め決められた量子化対象とするサブバンド番号の最小値Ｎを取得する。このＮは、ＣＰＵ１０１によって符号データ生成部１０４に設定されるものであり、本実施形態ではＮ＝４であるとする。つまり、分解レベル１（最大解像度）の高周波サブバンド１ＨＬ（サブバンド番号４）, １ＬＨ（サブバンド番号５）, １ＨＨ（サブバンド番号６）のＤＷＴ変換係数が量子化対象となる。 In S502, the code data generation unit 104 obtains a predetermined minimum value N of subband numbers to be quantized. This N is set in the code data generation unit 104 by the CPU 101, and in this embodiment, it is assumed that N=4. That is, the DWT transform coefficients of high frequency subbands 1HL (subband number 4), 1LH (subband number 5), and 1HH (subband number 6) at decomposition level 1 (maximum resolution) are quantized.

Ｓ５０３にて、符号データ生成部１０４は、カウンタｓにゼロを代入し、初期化する。なお、カウンタｓが示すサブバンドを単にサブバンドｓとも表記する。Ｓ５０４にて、符号データ生成部１０４は、カウンタｓとＮとを比較することで、サブバンドｓの変換係数が量子化対象かどうかを判定する。符号データ生成部１０４は、カウンタｓがＳ５０２で取得したＮ以上であれば量子化対象と判定し、処理をＳ５０５へ進む。そうでなければ、符号データ生成部１０４は、処理をＳ５０６に進める。 In S503, the code data generation unit 104 assigns zero to the counter s to initialize it. Note that the subband indicated by the counter s is also simply referred to as subband s. In S504, code data generation section 104 compares counters s and N to determine whether the transform coefficient of subband s is to be quantized. If the counter s is equal to or greater than N acquired in S502, the code data generation unit 104 determines that the data is to be quantized, and advances the process to S505. Otherwise, the code data generation unit 104 advances the process to S506.

Ｓ５０５にて、符号データ生成部１０４は、係数修正部１２０を制御し、注目ラインから得たサブバンドｓの変換係数のうちＲＯＩ非参照変換係数の修正を行う。このＲＯＩ非参照変換係数の修正方法については、図６のフローを用いて、後に詳しく説明する。 In S505, the code data generation unit 104 controls the coefficient modification unit 120 to modify the ROI non-reference transformation coefficients among the transformation coefficients of the subband s obtained from the line of interest. The method for modifying the ROI non-reference transformation coefficients will be described in detail later using the flowchart of FIG. 6.

Ｓ５０６にて、符号データ生成部１０４は、サブバンドｓの注目ラインの変換係数を量子化する際に用いる量子化ステップ幅“１”をそのヘッダに格納するため、符号データ出力部１１５内のサブバンドｓの符号データ一時保存バッファに出力する。次のＳ５０７にて、符号データ生成部１０４は、サブバンドｓの変換係数を符号化する。この処理の詳細は図７を用いて後述する。Ｓ５０８にて、符号データ生成部１０４は、サブバンドのカウンタｓに“１”を加算し更新する。そして、Ｓ５０９にて、符号データ生成部１０４は、サブバンド番号ｓとサブバンド総数『１＋（ＤＷＴの実行回数×３）』を比較する。両者が一致すれば、すべてのサブバンドについて符号化が終了したことになるので、符号データ生成部１０４は、このフローを終了する。一方、不一致である場合、符号データ生成部１０４は、処理をＳ５０４に戻し、他のサブバンドの符号化を行う。 In S506, the code data generation unit 104 stores the quantization step width “1” used when quantizing the transform coefficient of the line of interest in the subband s in its header. Output to band s code data temporary storage buffer. In the next step S507, code data generation section 104 encodes the transform coefficients of subband s. Details of this processing will be described later using FIG. 7. In S508, the code data generation unit 104 adds "1" to the subband counter s to update it. Then, in S509, the code data generation unit 104 compares the subband number s and the total number of subbands "1+(number of times DWT is executed x 3)". If the two match, it means that encoding has been completed for all subbands, and the encoded data generation unit 104 ends this flow. On the other hand, if there is a mismatch, the code data generation unit 104 returns the process to S504 and encodes another subband.

次に、図６のフローを参照して、図５のＳ５０５の非ロスレス符号化するＲＯＩ非係数の修正処理について説明する。 Next, with reference to the flowchart in FIG. 6, the correction processing of ROI non-coefficients to be non-losslessly encoded in S505 in FIG. 5 will be described.

Ｓ６０１にて、符号データ生成部１０４は、注目サブバンドの１ラインのＤＷＴ変換係数の個数ｓＷを取得する。ＤＷＴ変換係数の数は、サブバンド番号毎に異なっており、ＤＷＴにより分割する画素、またはＤＷＴ変換係数の数Ｘの１／２になる。ただし、ＤＷＴをかける画素またはＤＷＴ変換係数の数Ｘが奇数の時には、サブバンドＬＨ、ＬＬは、（Ｘ＋１）／２となり、サブバンドＨＬ、ＨＨは（Ｘ－１）／２となる。 In S601, the code data generation unit 104 obtains the number sW of DWT transform coefficients in one line of the subband of interest. The number of DWT transform coefficients differs for each subband number, and is 1/2 of the number X of pixels divided by DWT or DWT transform coefficients. However, when the number X of pixels or DWT transform coefficients to which DWT is applied is an odd number, subbands LH and LL are (X+1)/2, and subbands HL and HH are (X-1)/2.

Ｓ６０２にて、符号データ生成部１０４は、カウンタｓＸに“０”を代入することで初期化する。Ｓ６０３にて、Ｓ５０１で生成したロスレス係数位置情報Ｐ（ｓ、ｓＸ）を参照し、その値が“１”であるかどうかを判定する。“１”であれば、ｓＸ番目の変換係数［ｓＸ］はＲＯＩ参照変換係数であり、修正は行わない。それ故、符号データ生成部１０４は、処理をＳ６０５に進める。 In S602, the code data generation unit 104 initializes the counter sX by assigning "0" to it. In S603, the lossless coefficient position information P(s, sX) generated in S501 is referred to, and it is determined whether the value is "1". If it is “1”, the sX-th transformation coefficient [sX] is the ROI reference transformation coefficient and is not modified. Therefore, the code data generation unit 104 advances the process to S605.

一方、Ｐ（ｓ、ｓＸ）が“０”であった場合、ｓＸ番目の変換係数［ｓＸ］はＲＯＩ非参照変換係数であることになる。それ故、符号データ生成部１０４は処理をＳ６０４に進める。このＳ６０４にて、符号データ生成部１０４は、ｓＸ番目の変換係数［ｓＸ］を、Ｓ２０４で取得した量子化ステップ幅に従った丸め処理を行う。具体的には、符号データ生成部１０４は、変換係数［ｓＸ］を量子化ステップ幅で量子化して得た値を、同量子化ステップ幅で逆量子化する。そして、符号データ生成部１０４は、その逆量子化で得た変換係数を、ｓＸ番目の新たな変換係数［ｓＸ］として出力する。なお、このＳ６０４を経て得られる変換係数は、用いる量子化ステップは幅が大きいほど、丸め誤差が大きくなるものの、取り得る値の限られたものとなり、符号化効率を上げることができる。特に、注目サブバンドが高周波サブバンドである場合は、Ｓ６０４の処理による修正後の変換係数の多くが“０”となり、符号化効率を上げることができることになる。 On the other hand, when P(s, sX) is “0”, the sX-th transformation coefficient [sX] is an ROI non-reference transformation coefficient. Therefore, the code data generation unit 104 advances the process to S604. In S604, the code data generation unit 104 rounds the sX-th transform coefficient [sX] according to the quantization step width obtained in S204. Specifically, the code data generation unit 104 quantizes the transform coefficient [sX] with the quantization step width, and inversely quantizes the value obtained with the same quantization step width. Then, the code data generation unit 104 outputs the transform coefficient obtained by the inverse quantization as the sX-th new transform coefficient [sX]. Note that the larger the quantization step width used, the larger the rounding error of the transform coefficient obtained through S604, but the values that can be taken are limited, and the encoding efficiency can be increased. Particularly, when the subband of interest is a high frequency subband, most of the transform coefficients after modification by the process of S604 become "0", which means that encoding efficiency can be improved.

Ｓ６０５にて、符号データ生成部１０４は、カウンタｓＸに“１”を加算し、更新する。そして、Ｓ６０６にて、符号データ生成部１０４は、カウンタｓＸとＳ６０１で取得した、１ラインの変換係数の数ｓＷとを比較する。ｓＸ＝ｓＷである場合、符号データ生成部１０４は、注目サブバンドの注目ラインにおけるＲＯＩ非参照変換係数の修正が終了したものと判定し、このフローを終了する。ｓＸ＜ｓＷの場合、未修正のＲＯＩ非参照変換係数が存在する可能性があることを意味する。それ故、符号データ生成部１０４は、処理をＳ６０３へ処理を戻す。 In S605, the code data generation unit 104 adds "1" to the counter sX and updates it. Then, in S606, the code data generation unit 104 compares the counter sX with the number sW of transform coefficients for one line obtained in S601. If sX=sW, the code data generation unit 104 determines that the modification of the ROI non-reference transform coefficients in the line of interest of the subband of interest has been completed, and ends this flow. If sX<sW, it means that there is a possibility that unmodified ROI non-reference transformation coefficients exist. Therefore, the code data generation unit 104 returns the process to S603.

次に、サブバンドの変換係数の符号化処理を、図７のフローチャートを参照して説明する。この処理は、図２のＳ２１２、及び、図５のＳ５０７の処理でもある。 Next, the encoding process of subband transform coefficients will be explained with reference to the flowchart of FIG. 7. This process is also the process of S212 in FIG. 2 and S507 in FIG.

Ｓ７０１にて、符号データ生成部１０４は、注目サブバンドの１ライン分の変換係数の個数ｓＷを取得する。そして、Ｓ７０２にて、符号データ生成部１０４は、１個の変換係数の特定するためのカウンタｓＸに“０”を代入し、初期化する。カウンタｓＸの値は、変換係数の左端から位置を示すものである。 In S701, the code data generation unit 104 obtains the number sW of transform coefficients for one line of the subband of interest. Then, in S702, the code data generation unit 104 assigns "0" to a counter sX for specifying one transform coefficient, and initializes the counter sX. The value of the counter sX indicates the position from the left end of the conversion coefficient.

続く、Ｓ７０３乃至７０５の処理は、符号データ生成部１０４における符号化方式選択部１１２の処理である。 The subsequent processes of S703 to S705 are the processes of the encoding method selection unit 112 in the coded data generation unit 104.

符号化方式の判定処理は、図８に示すように、注目変換係数の位置をｘとしたとき、その周辺の変換係数ａ，ｂ，ｃの値を参照して行われる。そして、これら３つの周辺変換係数ａ、ｂ、ｃを参照し、符号化方式選択部１１２は注目変換係数ｘの符号化方式を判定し、注目変換係数ｘをランレングス符号化部１１３、予測符号化部１１４のいずれかに供給する。実施形態における符号化方式の選択基準は、低周波成分であるサブバンドＬＬと、それ以外の高周波成分のサブバンドで異なっている。それ故、Ｓ７０３にて、符号化方式選択部１１２は、カウンタｓに基づき、サブバンドＬＬの変換係数を符号化しようとしているのか否かを判定する。 As shown in FIG. 8, the encoding method determination process is performed by referring to the values of the surrounding transform coefficients a, b, and c, where x is the position of the transform coefficient of interest. Then, referring to these three peripheral transform coefficients a, b, and c, the encoding method selection unit 112 determines the encoding method of the target transform coefficient It is supplied to one of the converting sections 114. The selection criteria for the encoding method in the embodiment is different between subband LL, which is a low frequency component, and subbands which are other high frequency components. Therefore, in S703, the encoding method selection unit 112 determines whether or not the transform coefficients of the subband LL are to be encoded based on the counter s.

サブバンドＬＬの変換係数を符号化しようとしていると判定した場合、符号化方式選択部１１２は処理をＳ７０５に進める。一方、サブバンドＬＬ以外であると判定した場合、符号化方式選択部１１２は処理をＳ７０４に進める。 If it is determined that the transform coefficients of subband LL are to be encoded, the encoding method selection unit 112 advances the process to S705. On the other hand, if it is determined that the subband is other than LL, the encoding method selection unit 112 advances the process to S704.

Ｓ７０４にて、符号化方式選択部１１２は、注目変換係数ｘの左隣の変換係数ａが“０”であるか否かを判定する。変換係数ａが“０”である場合、符号化方式選択部１１２は処理をＳ７０５に進める。一方、変換係数ａが“０”以外である場合、符号化方式選択部１１２は処理をＳ７０８に進める。 In S704, the encoding method selection unit 112 determines whether the transform coefficient a to the left of the focused transform coefficient x is "0". If the transform coefficient a is "0", the encoding method selection unit 112 advances the process to S705. On the other hand, if the transform coefficient a is other than "0", the encoding method selection unit 112 advances the process to S708.

Ｓ７０５にて、符号化方式選択部１１２は、周辺の変換係数ａ，ｂ，ｃの全てが同じ値であるか否かを判定する。変換係数ａ，ｂ，ｃの全てが同じ値であった場合、符号化方式選択部１１２は処理をＳ７０６に進め、変換係数ａ，ｂ，ｃの中に一つでも他と異なる値があった場合、符号化方式選択部１１２は処理をＳ７０８に進める。 In S705, the encoding method selection unit 112 determines whether all surrounding transform coefficients a, b, and c have the same value. If all of the transform coefficients a, b, and c have the same value, the encoding method selection unit 112 advances the process to S706, and determines that at least one of the transform coefficients a, b, and c has a value different from the others. If so, the encoding method selection unit 112 advances the process to S708.

なお、Ｓ７０４の判定結果がＹｅｓ，かつ、Ｓ７０５の判定結果がＹｅｓとなった場合の周辺の変換係数ａ，ｂ，ｃの関係は、ａ＝ｂ＝ｃ＝０である。また、符号化対象の変換係数ｘがサブバンドの左端にあるときは、変換係数ａは存在しない。また、変換係数ｘがサブバンドの最初のライン内にあるときは、変換係数ｂ、ｃは存在しない。このように存在しない変換係数は予め設定された値（例えば“０”）であると見なして処理を行うものとする（復号側と共通の認識となっていれば、その値は特に問わない）。 Note that when the determination result in S704 is Yes and the determination result in S705 is Yes, the relationship among the surrounding transformation coefficients a, b, and c is a=b=c=0. Further, when the transform coefficient x to be encoded is at the left end of the subband, the transform coefficient a does not exist. Furthermore, when transform coefficient x is within the first line of the subband, transform coefficients b and c do not exist. A conversion coefficient that does not exist in this way is assumed to be a preset value (for example, "0") and processed (as long as it is shared with the decoding side, the value does not matter). .

上記のＳ７０３乃至７０５の判定処理を分かり易く説明するのであれば、次の通りである。 The determination processing in S703 to S705 above will be explained in an easy-to-understand manner as follows.

一般に、同じ値が連続する状況下では、予測符号化よりもランレングス符号化の方が符号化効率で優れている。サブバンドＬＬに含まれる全変換係数は、オリジナルの画像の水平、垂直とも１／４のサイズの縮小画像と見ることができる。それ故、サブバンドＬＬの場合、同じ値が連続する可能性が高いか否かは、注目変換係数の周囲にある複数の変換係数が同じ値となっているか否かで推定できる。それ故、周囲の変換係数ａ，ｂ，ｃが同じ値の場合には、注目変換係数をランの始点とするランレングス符号化を開始する。一方、連続性が期待できない場合には、注目変換係数を符号化済みの周囲の変換係数に基づく予測符号化を行う。 In general, run-length encoding is superior to predictive encoding in terms of encoding efficiency under conditions where the same value continues. All the transform coefficients included in the subband LL can be viewed as a reduced image that is 1/4 the size of the original image both horizontally and vertically. Therefore, in the case of subband LL, whether or not there is a high possibility that the same value will continue can be estimated based on whether or not a plurality of transformation coefficients around the transformation coefficient of interest have the same value. Therefore, if the surrounding transform coefficients a, b, and c have the same value, run-length encoding is started with the transform coefficient of interest as the starting point of the run. On the other hand, if continuity cannot be expected, predictive encoding is performed on the transformation coefficient of interest based on encoded surrounding transformation coefficients.

一方、サブバンドＬＬ以外の高周波成分の変換係数は、その性質上、“０”が連続ことはあっても、“０”以外の、同じ値が連続するこことは稀である。そこで、高周波成分については、注目変換係数の周囲の変換係数ａ，ｂ，ｃが全て“０”である場合に限って、注目変換係数をランの始点とするランレングス符号化を開始する。周囲の変換係数ａ，ｂ，ｃのうち、１つでも“０”で無い場合、連続性が期待できないので、注目変換係数を符号化済みの周囲の変換係数に基づく予測符号化を行う。 On the other hand, due to the nature of the transform coefficients of high frequency components other than subband LL, although "0" may be continuous, it is rare that the same value other than "0" is continuous. Therefore, for high-frequency components, run-length encoding is started with the target transform coefficient as the start point of the run only when the transform coefficients a, b, and c around the target transform coefficient are all "0". If even one of the surrounding transform coefficients a, b, and c is not "0", continuity cannot be expected, so predictive encoding is performed on the target transform coefficient based on the encoded surrounding transform coefficients.

上記の通りなので、Ｓ７０６に処理が進んだ場合、符号データ生成部１０４は、ランレングス符号化部１１３を制御し、注目変換係数［ｓＸ］を始点とするラン長を計数させる。ランレングス符号化部１１３は、ランが途切れた場合に、計数したランを示す符号を出力する。そして、Ｓ７０７にて、Ｓ７０６で計数されたラン長をカウンタｓＸに加算し、次に、符号化方式選択部へ入力する注目係数の位置を更新する。その後、符号データ生成部１０４、ラン終端の係数を予測符号化するため、処理をＳ７０８へ進める。 As described above, when the process proceeds to S706, the code data generation unit 104 controls the run length encoding unit 113 to count the run length starting from the transformation coefficient of interest [sX]. The run length encoding unit 113 outputs a code indicating the counted run when the run is interrupted. Then, in S707, the run length counted in S706 is added to the counter sX, and then the position of the coefficient of interest to be input to the encoding method selection section is updated. Thereafter, the code data generation unit 104 advances the process to S708 in order to predictively encode the coefficients at the end of the run.

Ｓ７０８にて、符号データ生成部１０４は予測符号化部１１４を制御し、注目変換係数ｘに対する予測符号化を実行させる。予測符号化部１１４は、周辺の符号化済み変換係数ａ、ｂ、ｃから予測値ｐ（単純にはｐ＝ａである）を求め、注目変換係数ｘと予測値ｐとの差分ｄ（＝ｘ－ａ）をエントロピー符号化する。 In S708, the code data generation unit 104 controls the predictive encoding unit 114 to perform predictive encoding on the transformation coefficient x of interest. The predictive encoding unit 114 obtains a predicted value p (simply p=a) from the surrounding encoded transform coefficients a, b, and c, and calculates the difference d (= xa) is entropy encoded.

この後、Ｓ７０９にて、符号データ生成部１０４は、次の変換係数の符号化を行うためにカウンタｓＸに“１”を加算する。そして、Ｓ７１０にて、符号データ生成部１０４は、カウンタｓＸと１ラインの変換係数の数ｓＷとを比較する。ｓＸ＝ｓＷの場合、注目ラインの全変換係数の符号化を終えたことになるので、符号データ生成部１０４は、このフローを終了する。それ以外の場合、符号データ生成部１０４は、次の変換係数の符号化を行うために処理をＳ７０３に戻す。 After that, in S709, the code data generation unit 104 adds "1" to the counter sX in order to encode the next transform coefficient. Then, in S710, the code data generation unit 104 compares the counter sX with the number sW of transform coefficients for one line. If sX=sW, this means that all the transform coefficients of the line of interest have been encoded, so the code data generation unit 104 ends this flow. In other cases, the code data generation unit 104 returns the process to S703 in order to encode the next transform coefficient.

以上が、実施形態における符号データ生成部１０４の処理内容である。上記の処理を行うと、高周波サブバンドはゼロの係数が多いほどランレングス符号化されやすい。そのため、ＲＯＩに含まれないＤＷＴ変換係数は量子化されることで、“０”となる可能性が高くなり、結果、ランレングス符号化に移行しやすくなり、高い圧縮率が期待できる。つまり、ＲＯＩ以外の領域の符号量を小さくできる。 The above is the processing content of the code data generation unit 104 in the embodiment. When the above processing is performed, the higher the number of zero coefficients in a high frequency subband, the easier it is to be run-length coded. Therefore, the DWT transform coefficients that are not included in the ROI are quantized and are more likely to become "0". As a result, it becomes easier to shift to run-length encoding, and a high compression rate can be expected. In other words, the amount of code in areas other than the ROI can be reduced.

本実施形態の手法を用いると、図３（ｃ）の領域３０２、３０３のラインの符号化データのヘッダには、各サブバンドに対応した量子化ステップ幅が記載される。そして、領域３０４、３０５およびＲＯＩが含まれるラインの符号化データのヘッダには、量子化ステップ幅“１”が格納されることになる。また、領域３０４、３０５の多くは、ＲＯＩ領域内の画素を参照しないで得られたＤＷＴ変換係数であるので、量子化→逆量子化の処理を経た符号化データとなる。そのため、量子化ステップ幅が“１”と記載されていても、復号時にＲＯＩ領域と、それ以外の非ＲＯＩ領域のＤＷＴ変換係数を区別することなく復号することもでき、処理が簡略化され、高速な復号も期待できる。 When the method of this embodiment is used, the quantization step width corresponding to each subband is written in the header of the encoded data in the lines of areas 302 and 303 in FIG. 3(c). Then, the quantization step width "1" is stored in the header of the encoded data of the line including the regions 304 and 305 and the ROI. Further, since most of the regions 304 and 305 are DWT transform coefficients obtained without referring to pixels in the ROI region, they are encoded data that has undergone quantization→inverse quantization processing. Therefore, even if the quantization step width is described as "1", it is possible to decode the DWT transform coefficients of the ROI region and other non-ROI regions without distinguishing them during decoding, which simplifies the processing. You can also expect high-speed decoding.

さらに、本実施形態では、ＲＯＩ領域を含むＤＷＴ変換係数のライン(量子化ステップ幅として“１”の係数ライン)のうち、サブバンド１ＨＬ、１ＬＨ，１ＨＨについては、ＲＯＩ領域外のＤＷＴ変換係数を修正対象として量子化したが、このサブバンドの数を動的に変更してもよい。たとえば、ロスレス圧縮とするＲＯＩの画像全体に占める割合が小さい場合には、全体の符号量もあまり大きくならない。そのため、本実施形態のサブバンド１ＨＬ，１ＬＨ，１ＨＨのみを量子化対象とし、なるべくＲＯＩ領域外の画質劣化を防止する。逆に、ロスレス圧縮とする注目領域の画像全体に占める割合が大きい場合には、全体の符号量が大きくなりがちであるため、ＬＬサブバンド以外の高周波サブバンドすべてを量子化対象として、符号量を小さくしても良い。そのため、ＲＯＩが大きいか否かを判定するための閾値を設け、それに従って量子化対象を決定しても良い。 Furthermore, in this embodiment, among the lines of DWT transform coefficients that include the ROI region (coefficient lines with a quantization step width of "1"), for subbands 1HL, 1LH, and 1HH, the DWT transform coefficients outside the ROI region are Although quantization is used as a correction target, the number of subbands may be dynamically changed. For example, if the ROI to be subjected to lossless compression occupies a small proportion of the entire image, the total amount of code will not become very large. Therefore, only subbands 1HL, 1LH, and 1HH in this embodiment are targeted for quantization, and image quality deterioration outside the ROI region is prevented as much as possible. On the other hand, if the area of interest to be losslessly compressed occupies a large proportion of the entire image, the overall code amount tends to increase, so all high-frequency subbands other than the LL subband are quantized and the code amount is You can make it smaller. Therefore, a threshold value may be provided to determine whether the ROI is large, and the quantization target may be determined according to the threshold value.

さらに、ＲＯＩを含まないＤＷＴ変換係数の量子化ステップ幅を２のべき乗よりも細かく制御することができ、画像全体の符号量を制御しやすいだけでなく、ＲＯＩ領域外の画質を細かく制御することができる。 Furthermore, the quantization step width of DWT transform coefficients that do not include the ROI can be controlled more finely than a power of 2, which not only makes it easier to control the code amount of the entire image, but also allows finer control of the image quality outside the ROI region. Can be done.

また、ＤＷＴ変換係数を１ライン毎に処理可能なため、符号化時の使用メモリ量を小さくすることができる。 Furthermore, since DWT transform coefficients can be processed line by line, the amount of memory used during encoding can be reduced.

なお、上記実施形態では、符号化データ生成部１０４は図２の構成を持つものとして説明したが、ＣＰＵ１０１が、上記の説明で参照した各フローチャートに対応するコンピュータプログラムを実行させて実現させても構わない。 In the above embodiment, the encoded data generation unit 104 has been described as having the configuration shown in FIG. I do not care.

［第２の実施形態］
上記第１の実施形態では、ＲＯＩ参照変換係数を含むライン内の、ＲＯＩ非参照変換係数の修正処理（図５のＳ５０５）の詳細を図６のフローチャートに参照して説明した。その際、ＲＯＩ非参照変換係数については、量子化→逆量子化を行うことも説明した。 [Second embodiment]
In the first embodiment described above, details of the correction process (S505 in FIG. 5) of ROI non-reference conversion coefficients in a line including ROI reference conversion coefficients have been described with reference to the flowchart in FIG. 6. At that time, it has also been explained that quantization→inverse quantization is performed for ROI non-reference transform coefficients.

本実施形態では、ＲＯＩ非参照係数のうち、予め設定された範囲の値を持つ変換係数をゼロにする例について説明する。 In this embodiment, an example will be described in which, among ROI non-reference coefficients, conversion coefficients having values within a preset range are set to zero.

本第２の実施形態における、図５のＳ５０５の処理の詳細を、図９のフローチャートを参照して以下に説明する。同図のうち、第１の実施形態と同じ処理については、図６と同じ参照符号を付し、その詳しい説明は割愛する。 Details of the process of S505 in FIG. 5 in the second embodiment will be described below with reference to the flowchart in FIG. 9. In the figure, the same processes as in the first embodiment are given the same reference numerals as in FIG. 6, and detailed explanation thereof will be omitted.

Ｓ６０１，Ｓ６０２は第１の実施形態の説明と同じである。Ｓ９０１にて、符号データ生成部１０４は，ゼロに置き換えるための係数範囲を規定する正の閾値Ｂを取得する。閾値Ｂが正とするのは、ゼロに置き換えるＤＷＴ変換係数の取り得る範囲を絶対値で判定できるようにするためである。なお、この閾値Ｂは、ユーザの指示入力に従ってＣＰＵ１０１がその値を決定し、符号データ生成部１０４に設定するものとする。また、閾値Ｂは、そのサブバンドに用いる量子化ステップ幅よりは大きな値とする。 S601 and S602 are the same as those described in the first embodiment. In S901, the code data generation unit 104 obtains a positive threshold value B that defines a coefficient range for replacing with zero. The reason why the threshold value B is positive is to enable the possible range of the DWT conversion coefficient to be replaced with zero to be determined based on the absolute value. Note that this threshold value B is determined by the CPU 101 according to the user's instruction input, and is set in the code data generation unit 104. Further, the threshold value B is set to a value larger than the quantization step width used for that subband.

Ｓ６０３にて、符号データ生成部１０４は、ロスレス係数位置情報Ｐ（ｓ、ｓＸ）を参照し、その値が“１”か否かを判定する。ロスレス係数位置情報Ｐ（ｓ、ｓＸ）が“１”である場合、注目変換係数［ｓＸ］がＲＯＩ参照変換係数であり、ロスレス符号化しなければならない。それ故、符号データ生成部１０４は、注目変換係数については修正せずし、処理をＳ６０５に進める。 In S603, the code data generation unit 104 refers to the lossless coefficient position information P(s, sX) and determines whether the value is "1" or not. When the lossless coefficient position information P (s, sX) is “1”, the transformation coefficient of interest [sX] is the ROI reference transformation coefficient and must be losslessly encoded. Therefore, the code data generation unit 104 does not modify the transformation coefficient of interest and advances the process to S605.

ロスレス係数位置情報Ｐ（ｓ、ｓＸ）が“１”ではなく、“０”である場合、注目変換係数［ｓＸ］はＲＯＩ非参照変換係数であり、修正対象となる。したがって、符号データ生成部１０４は、Ｓ６０３からＳ９０２に処理を分岐する。Ｓ９０２にて、符号データ生成部１０４は、注目変換係数［ｓＸ］の絶対値と値Ｂとを比較する。注目変換係数［ｓＸ］の絶対値が値Ｂ以下である場合、符号データ生成部１０４は、処理をＳ９０３に進める。このＳ９０３にて、符号データ生成部１０４は、注目変換係数［ｓＸ］をゼロに置き換える。 When the lossless coefficient position information P (s, sX) is not "1" but "0", the transformation coefficient of interest [sX] is an ROI non-reference transformation coefficient and is subject to correction. Therefore, the code data generation unit 104 branches the process from S603 to S902. In S902, the code data generation unit 104 compares the absolute value of the conversion coefficient of interest [sX] and the value B. If the absolute value of the conversion coefficient of interest [sX] is less than or equal to the value B, the code data generation unit 104 advances the process to S903. In S903, the code data generation unit 104 replaces the conversion coefficient of interest [sX] with zero.

Ｓ６０４，Ｓ６０５，Ｓ６０６の処理は、第１の実施形態と同様である。 The processing in S604, S605, and S606 is the same as in the first embodiment.

なお、Ｓ９０２では、注目変換係数［ｓＸ］の絶対値と閾値Ｂとを比較するものとして説明したが、以下の条件を満たすか否かの判定を行うことと等価である。
条件：－Ｂ≦注目変換係数［ｓＸ］≦Ｂ Although S902 has been described as comparing the absolute value of the conversion coefficient of interest [sX] with the threshold B, this is equivalent to determining whether the following conditions are satisfied.
Condition: -B≦Attention conversion coefficient [sX]≦B

上記の結果、符号化対象の画像データの注目ラインから得たＤＷＴ変換係数に、ＲＯＩ内の画素を参照して算出されたＲＯＩ参照変換係数が含まれ、且つ、そのＲＯＩ参照変換係数の絶対値が閾値以下であると判定された場合には、“０”に置き換える。この結果、第１の実施形態と比較して、“０”の発生する確率が高くなり、より符号化効率を上げることができる。 As a result of the above, the DWT transform coefficients obtained from the line of interest of the image data to be encoded include the ROI reference transform coefficients calculated with reference to pixels in the ROI, and the absolute value of the ROI reference transform coefficients If it is determined that is less than the threshold value, it is replaced with "0". As a result, the probability of "0" occurring is higher than in the first embodiment, and encoding efficiency can be further improved.

［その他の実施形態］
上記第２の実施形態では、サブバンドによって一律にある一定範囲のＤＷＴ変換係数をゼロに修正するものとしたが、第１、第２の実施形態を組み合わせてもよい。すなわち、サブバンド２ＨＬ、２ＬＨ、２ＨＨについては第１の実施形態の方法で係数を修正し、より高周波なサブバンド１ＨＬ，１ＬＨ，１ＨＨについては第２の実施形態の方法で修正してもよい。 [Other embodiments]
In the second embodiment, the DWT transform coefficients in a certain range are uniformly corrected to zero depending on the subband, but the first and second embodiments may be combined. That is, the coefficients may be modified for subbands 2HL, 2LH, and 2HH using the method of the first embodiment, and the coefficients for higher frequency subbands 1HL, 1LH, and 1HH may be modified using the method of the second embodiment.

また、第２の実施形態では、ゼロに置き換えるＤＷＴ変換係数の範囲を規定する値Ｂを動的に変更してもよい。たとえば、サブバンドの種類によって、この値Ｂを変更しても良い。すなわち、例えばサブバンド１ＨＨは、より大きい値Ｂを設定し、広範囲なＤＷＴ変換係数をゼロにし、低周波に近いサブバンドに対しては値Ｂを小さくし、画質の過度な低下を防止してもよい。あるいは、符号化対象の画像データに対するＲＯＩが示す面積の比率から値Ｂを設定してもよい。画像全体に占めるＲＯＩの面積の比率が大きいときには、Ｂの値を大きくし、符号量を抑制する。一方、その比率が小さい場合には、値Ｂを小さくし、ＲＯＩ領域とそれ以外の画素の画質の差を小さくしてもよい。 Furthermore, in the second embodiment, the value B that defines the range of DWT transform coefficients to be replaced with zero may be dynamically changed. For example, this value B may be changed depending on the type of subband. That is, for example, for subband 1HH, a larger value B is set, a wide range of DWT transform coefficients are set to zero, and the value B is decreased for subbands close to low frequencies to prevent excessive deterioration of image quality. Good too. Alternatively, the value B may be set based on the ratio of the area indicated by the ROI to the image data to be encoded. When the ratio of the area of the ROI to the entire image is large, the value of B is increased to suppress the amount of code. On the other hand, if the ratio is small, the value B may be reduced to reduce the difference in image quality between the ROI region and other pixels.

なお、本実施形態では、説明を簡単にするために、ＤＷＴの実行回数を“２”として説明したが、ＤＷＴの実行回数に制限はない。 In this embodiment, in order to simplify the explanation, the number of times the DWT is executed is "2", but there is no limit to the number of times the DWT is executed.

また、実施形態では、符号化対象の画像データをタイル分割せずに符号化する方法について説明したが、複数のタイルに分割するようにしても良い。その場合、各タイルの符号化を図２のフローを用いて符号化すればよい。また、実施形態によれば、ＲＯＩを含むタイルとそうではないタイルで、まったく同じ方法でデコードすることができる点も特徴ということができる。 Further, in the embodiment, a method has been described in which image data to be encoded is encoded without being divided into tiles, but the image data may be divided into a plurality of tiles. In that case, each tile may be encoded using the flow shown in FIG. 2. Another feature of the embodiment is that tiles that include an ROI and tiles that do not include an ROI can be decoded using exactly the same method.

本実施形態では、各ラインのＤＷＴ変換係数の符号データ先頭に量子化ステップ幅を記載したが、それらをサブバンド毎やタイル毎、画像毎に１か所にまとめて記載してもよい。また、タイル分割した際に、ＲＯＩ外のタイルのサブバンドは量子化ステップ幅が１つに決まる。その場合には、各タイルの各サブバンドに１回だけ量子化ステップ幅を記載してもよい。また、量子化ステップ幅を圧縮して符号データに格納してもよい。 In this embodiment, the quantization step width is written at the beginning of the code data of the DWT transform coefficient of each line, but they may be written in one place for each subband, each tile, or each image. Furthermore, when dividing tiles, the quantization step width is determined to be one for subbands of tiles outside the ROI. In that case, the quantization step width may be written only once for each subband of each tile. Alternatively, the quantization step width may be compressed and stored in code data.

また、上記実施形態では、画像データを１ライン単位に入力し、符号化する例を説明したが、予め設定された所定サイズのブロック単位に入力し、ブロック単位に符号化しても良い。 Further, in the above embodiment, an example has been described in which image data is input in units of one line and encoded, but it may also be input in units of blocks of a predetermined size set in advance and encoded in units of blocks.

本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサーがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 The present invention provides a system or device with a program that implements one or more of the functions of the embodiments described above via a network or a storage medium, and one or more processors in the computer of the system or device reads and executes the program. This can also be achieved by processing. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.

１０１…ＣＰＵ、１０２…入力部、１０３…ＲＯＩ設定部、１０４…符号データ生成部、１０５…表示部、１０６…メモリ、１０７…蓄積部、１１０…ＤＷＴ部、１１１…量子化部、１１２…符号化方式選択部、１１３…ランレングス符号化部、１１４…予測符号化部、１１５…符号データ出力部、１１６…符号列生成部 101... CPU, 102... Input unit, 103... ROI setting unit, 104... Code data generation unit, 105... Display unit, 106... Memory, 107... Storage unit, 110... DWT unit, 111... Quantization unit, 112... Code encoding scheme selection unit, 113... run length encoding unit, 114... predictive encoding unit, 115... code data output unit, 116... code string generation unit

Claims

An image encoding device,
a setting means for setting a region of interest in image data to be encoded;
Transforming means for obtaining a plurality of subbands by performing wavelet transformation on the encoding target image data using a reversible filter;
quantization means for quantizing the coefficients included in each subband obtained by the conversion means on a line-by-line basis according to a quantization parameter;
encoding means for entropy encoding the quantized coefficients of each subband obtained by quantization on a line-by-line basis to generate encoded data to which information specifying the quantization parameter is added;
and a control means for controlling the quantization means and the encoding means,
The control means, for each subband,
determining means for determining whether a line of interest made up of coefficients includes a coefficient generated with reference to pixels in the region of interest;
correction means for correcting the quantized coefficients obtained by the quantization means,
The control means includes:
If the determining means determines that the line of interest does not include a coefficient generated by referring to pixels within the region of interest, the determining means causes the quantizing means to calculate each coefficient of the line of interest using the quantization parameter. controlling the quantization, and causing the encoding means to generate encoded data including information specifying the quantization parameter;
If the determining means determines that the line of interest includes coefficients generated by referring to pixels within the region of interest, the determining means instructs the quantization means to determine whether the line of interest includes coefficients generated by referring to pixels within the region of interest, among the coefficients of the line of interest. The coefficients generated by referring to the pixel in the region of interest are controlled to maintain their values, and the coefficients generated without referring to the pixels in the region of interest are quantized using the quantization parameter, and , the correction means corrects the quantized coefficients obtained by the quantization means by inversely quantizing them using the quantization parameter, and the encoding means quantizes the line of interest. An image encoding device that generates encoded data that includes information indicating that a step width is "1".

The correction means includes:
The image encoding device according to claim 1, characterized in that when a condition that the absolute value of the value indicated by the coefficient obtained by the quantization means is equal to or less than a preset threshold is satisfied, the coefficient is changed to zero. .

3. The image encoding apparatus according to claim 1, wherein the reversible filter is an integer type filter, and is a 5-3 tap filter defined in JPEG2000.

The encoding means includes:
run-length encoding means for encoding a run of transform coefficients;
Predictive encoding means for predictively encoding the transform coefficients;
The target transform coefficient is set as the starting point of the run based on which subband the target transform coefficient to be entropy-encoded belongs to and the relationship among the plurality of encoded transform coefficients surrounding the target transform coefficient. and selecting means for selecting whether to encode the target transform coefficient using the run-length encoding means or to encode the target transform coefficient using the predictive encoding means. The image encoding device according to item 1.

A method for controlling an image encoding device, the method comprising:
a setting step of setting a region of interest in the image data to be encoded;
a transformation step of obtaining a plurality of subbands by performing wavelet transformation on the encoding target image data using a reversible filter;
a quantization step of quantizing the coefficients included in each subband obtained in the conversion step in accordance with a quantization parameter on a line-by-line basis;
an encoding step of entropy encoding the quantized coefficients of each subband obtained by quantization line by line to generate encoded data to which information specifying the quantization parameter is added;
a control step for controlling the quantization step and the encoding step,
The control process includes, for each subband,
a determination step of determining whether a line of interest made up of coefficients includes a coefficient generated with reference to pixels in the region of interest;
a correction step of correcting the quantized coefficients obtained in the quantization step,
The control step includes:
If the determination step determines that the line of interest does not include a coefficient generated by referring to pixels within the region of interest, the quantization step is performed to calculate each coefficient of the line of interest using the quantization parameter. controlling the quantization, and causing the encoding step to generate encoded data including information specifying the quantization parameter;
If the determination step determines that the line of interest includes a coefficient generated by referring to the pixel in the region of interest, the quantization step includes the coefficients of the line of interest that are generated by referring to the pixels in the region of interest. control to maintain the value of the coefficient generated with reference to the pixel in the region of interest, and quantize the coefficient generated without reference to the pixel in the region of interest using the quantization parameter; For the modification step, the quantized coefficients obtained in the quantization step are modified by inverse quantization using the quantization parameter, and for the encoding step, a quantization step is performed for the line of interest. A method for controlling an image encoding device, the method comprising: generating encoded data including information indicating that the width is "1".

A computer program for causing the computer to execute each step of the control method according to claim 5 by being read and executed by a computer.