JP2006262294A

JP2006262294A - Image processing apparatus, image processing method, program, and recording medium

Info

Publication number: JP2006262294A
Application number: JP2005079445A
Authority: JP
Inventors: Takanori Yano; 隆則矢野
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2005-03-18
Filing date: 2005-03-18
Publication date: 2006-09-28
Anticipated expiration: 2025-03-18
Also published as: JP4315926B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide an image processing apparatus for executing code quantity control after coding, with high accuracy, by taking into account the effects of the distortion amount due to image attributes on the image quality. <P>SOLUTION: The image processing apparatus sets the degree of importance (priority) on coded data in the unit of code sequences (packets) for configuring the coded data depending on a distortion amount, image attribute information, and distortion amount reference by each image attribute of a decoded and reproduced image of the coded data. Alternatively, the image processing apparatus places the degree of importance (priority) on coded data in units of code sequences (packets) for configuring the coded data, corresponding to an image partial region depending on the distortion amount, image attribute information, and distortion amount reference by each image attribute of a decoded and reproduced image of the coded data, corresponding to the image partial region. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、画像処理装置、画像処理方法、プログラムおよび記録媒体に関し、具体的には、符号データを構成する符号列の重要度（優先順位）に基づいて符号データを（再）形成する方法に関する。特に、符号データの復号再生画像の歪量と画像属性情報、及び画像属性毎の歪量基準とに基づいて符号列の重要度（優先順位）を設定し、設定した重要度に基づいて符号データを選択（符号量制御）する方法に関する。 The present invention relates to an image processing apparatus, an image processing method, a program, and a recording medium. Specifically, the present invention relates to a method for (re-) forming code data based on the importance (priority order) of a code string constituting the code data. . Particularly, the importance (priority order) of the code string is set based on the distortion amount and image attribute information of the decoded reproduction image of the code data, and the distortion amount standard for each image attribute, and the code data is set based on the set importance. Is selected (code amount control).

一般的に、符号処理の過程で符号量制御を進める上で形成される符号データは、部分符号列単位で重要度が識別されている。ところが、一般的には、符号処理過程での部分符号列単位での重要度算出結果は、符号処理後に反映されない。 In general, the importance of code data formed when advancing code amount control in the course of code processing is identified in units of partial code strings. However, in general, the importance calculation result for each partial code string in the code processing process is not reflected after the code processing.

また、一般に、符号化過程で算出した符号データの再生画像の歪量に対する劣化を感じる度合いは画像属性によって異なり、さらに、歪量が画像属性毎の感度を反映したものではなく、符号化誤差に対する周波数感度を反映しているので、符号化過程で算出した歪量の大きさだけからでは、好適な符号列単位の重要度の算出ができなかった。 In general, the degree to which deterioration of the reproduction amount of the encoded data calculated in the encoding process is different depending on the image attribute, and the distortion amount does not reflect the sensitivity for each image attribute. Since the frequency sensitivity is reflected, it is not possible to calculate a suitable degree of importance in units of code strings only from the amount of distortion calculated in the encoding process.

一方、画像圧縮方式としては、国際標準であるＪＰＥＧやＪＰＥＧ２０００が知られている。このＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格の符号処理では、符号化後に、符号化時の符号量制御で使用した周波数感度視覚特性（ＶＷ情報）情報などを考慮に入れた高精度な符号量制御ができない。
また、符号化時に周波数感度視覚特性と画像領域毎の画像属性情報と出力機器の解像度特性とを同時に考慮した符号量制御することができない。 On the other hand, as image compression methods, international standards such as JPEG and JPEG2000 are known. In the JPEG2000 (ISO / IEC 15444-1) standard encoding process, a high-accuracy code amount that takes into account the frequency sensitivity visual characteristic (VW information) information used in the code amount control at the time of encoding after encoding. Control is not possible.
In addition, it is impossible to control the code amount in consideration of the frequency sensitivity visual characteristics, the image attribute information for each image area, and the resolution characteristics of the output device at the time of encoding.

また、符号データの歪量算出する技術としては、特許文献１がある。この特許文献１では、ＪＰＥＧ２０００のように、画像情報のビットプレーンのトランケーションにより画像情報を圧縮する画像処理方式において、各ビットプレーンにおける最上位有効ビット（ＭＳＢ）の個数に基づいてビットプレーンをトランケーションしたときの画像の歪量を推定してトランケーションしているが、符号化後に符号量制御する機能については言及していない。 Patent Document 1 discloses a technique for calculating the distortion amount of code data. In this patent document 1, as in JPEG2000, in an image processing method for compressing image information by truncation of a bit plane of image information, the bit plane is truncated based on the number of most significant bits (MSBs) in each bit plane. Truncation is performed by estimating the amount of distortion of the image at the time, but the function of controlling the amount of code after encoding is not mentioned.

また、特許文献２では、符号列からエラーの発生した部分を除去した残りのデータを用いて画像をプログレッシブに表示したときの画像の歪量を知ることができるようにして、符号化時における画像の歪量に係る情報を使用して符号化後に符号データの構成を調整している。 Further, in Patent Document 2, an image at the time of encoding can be obtained by making it possible to know the amount of distortion of an image when the image is progressively displayed using the remaining data obtained by removing an error portion from the code string. The configuration of the code data is adjusted after encoding using information relating to the amount of distortion.

また、特許文献３では、ＪＰＥＧ２０００による符号化を想定してタイル単位のタイル属性に基づいて、タイル属性が文字の場合は画像の歪量（誤差）に基づいたＳＮＲ（レイヤ）プログレシブに、タイル属性が写真の場合には解像度プログレシブにプログレシブ順を替えて符号データを形成している。ＳＮＲ（レイヤ）プログレシブで符号データを形成しておくことで、符号化後に画像の歪量（誤差）に基づいた符号量制御（レートコントロール）ができる。一般に各階層の符号データは複数の符号列により構成され、符号列単位での重要度の優先順位付けができないため、高精度な微細な符号量制御ができない。 Further, in Patent Document 3, assuming that encoding by JPEG2000 is performed, the tile attribute is set to SNR (layer) progressive based on the distortion amount (error) of the image when the tile attribute is a character based on the tile attribute of the tile unit. In the case of a photograph, code data is formed by changing the progressive order to resolution progressive. By forming code data by SNR (layer) progressive, code amount control (rate control) based on the distortion amount (error) of an image can be performed after encoding. In general, the code data of each layer is composed of a plurality of code strings, and prioritization of importance cannot be performed in units of code strings, so that high-precision and fine code amount control cannot be performed.

また、パケット単位のプライオリティ設定に関する技術として特許文献４や特許文献５がある。この特許文献４では、パケット単位でプライオリティを設定するテーブルをもち、パケット単位でのプライオリティに基づきパケットの送受信を制御している。
また、特許文献５では、符号データにプライオリティフィールドを設けているが、重複型符号データを対象にしたものではない。 Further, there are Patent Document 4 and Patent Document 5 as technologies related to priority setting in packet units. This Patent Document 4 has a table for setting priorities in units of packets, and packet transmission / reception is controlled based on priorities in units of packets.
In Patent Document 5, a priority field is provided for code data, but it is not intended for overlapping code data.

また、代表的なＪＰＥＧ２０００における符号量制御に関する技術として、特許文献６がある。この特許文献６では、ＪＰＥＧ２０００規格の符号化においてタイル単位に符号量制御を行っている。 As a technique related to code amount control in a typical JPEG2000, there is Patent Document 6. In Patent Document 6, code amount control is performed in units of tiles in encoding according to the JPEG2000 standard.

また、代表的なＪＰＥＧ２０００における符号列作成装置で符号データの編集を行う技術として特許文献７がある。しかし、画質を考慮して削除するパケットあるいは並び替える手段は備えていない。
特開２００３−３０４４０５号公報特開２００４−１８６８６１号公報特開２００３−２３５４４号公報特開２００３−３３８８４０号公報特開２００３−２０９８３９号公報特開２００３−３２６８０号公報特開２００３−１６９３３３号公報 Patent Document 7 discloses a technique for editing code data with a typical JPEG2000 code string creation device. However, there is no packet to be deleted or rearranging means considering the image quality.
JP 2003-304405 A JP 2004-186861 A JP 2003-23544 A JP 2003-338840 A JP 2003-209839 A JP 2003-32680 A JP 2003-169333 A

ＪＰＥＧ２０００規格の符号処理では、符号化処理後の符号レベルでの符号データ編集処理において、符号化時の符号量制御情報（符号データの周波数感度特性などに関する判別情報）に基づいた符号量制御できないという問題をもっている。 In the code processing of the JPEG2000 standard, code amount control based on code amount control information at the time of encoding (discrimination information related to frequency sensitivity characteristics of code data) cannot be performed in code data editing processing at a code level after encoding processing. I have a problem.

ところで、符号データの再生画像の歪量に対する劣化を感じる度合いは画像属性によって一般に異なり、歪量は符号化誤差に対する周波数感度を反映しているものの、画像属性毎の感度を反映していないため、歪量の大きさだけからでは、好適な復号画質を得ることはできなかった。 By the way, the degree to which the deterioration of the distortion amount of the reproduced image of the code data generally varies depending on the image attribute, and although the distortion amount reflects the frequency sensitivity to the encoding error, it does not reflect the sensitivity for each image attribute. A suitable decoded image quality could not be obtained only from the amount of distortion.

また、ＪＰＥＧ２０００規格の符号は、規定された最小単位の符号列（パケットと呼ぶ）の集まり（順番）として構成されていて、符号レベルでの符号列単位の編集ができるのが特徴であるが、ＪＰＥＧ２０００規格の符号のように符号列（パケット）の集合により構成されている符号データの転送時のエラー対応や復号時の便宜などの理由から、重要な符号列ほど先に転送したいという要求がある。 In addition, the JPEG2000 standard codes are configured as a set (order) of specified minimum unit code strings (referred to as packets), and can be edited in units of code strings at the code level. There is a demand to transfer more important code strings earlier for reasons such as error handling at the time of transfer of code data composed of a set of code strings (packets) such as JPEG2000 standard codes and convenience for decoding. .

例えば、符号列単位で優先順位を付けておいてかかる優先順に従って符号データを転送することにより実現できる。ところが、ＪＰＥＧ２０００規格の符号化においては、復号処理できる符号列の順番は、いくつかのプログレシブ順に限定されていて、それらのプログレシブ順の選択肢の中から選択されたプログレシブ順で構成される。
ところが、これらのプログレシブ順は、符号を構成する最小単位のパケット（符号列）毎の優先順位に基づいて決められているわけではないため、ＪＰＥＧ２０００（Ｊ２Ｋ：ＩＳＯ／ＩＥＣ１５４４４−１）規格で規定されたプログレシブ順における符号列の順番では、必ずしも重要な符号列が優先的に先に転送できなかったり、符号量制御時において、相対的に重要な符号列が削除され、重要でない符号列が削除されない場合が起こっていた。また、再現画質の制御も、符号列（パケット）よりも大きな単位で制御され、相対的にきめ細かい画質制御ができなかった。 For example, it can be realized by assigning priorities in units of code strings and transferring the code data according to such priorities. However, in the encoding of the JPEG2000 standard, the order of code strings that can be decoded is limited to several progressive orders, and is configured in a progressive order selected from these progressive order options.
However, since these progressive orders are not determined based on the priority order of each minimum unit packet (code string) constituting a code, they are defined in the JPEG2000 (J2K: ISO / IEC 15444-1) standard. In the order of the code sequence in the progressive order, the important code sequence cannot always be preferentially transferred first, or the relatively important code sequence is deleted and the unimportant code sequence is deleted at the time of code amount control. If not it was happening. Also, the reproduction image quality is controlled in units larger than the code string (packet), and relatively fine image quality control cannot be performed.

本発明は、上述のような実情を考慮してなされたものであって、画像属性による歪量の画質への影響を考慮して、符号化後の符号量制御を高精度に実行する画像処理装置、画像処理方法、プログラムおよび記録媒体を提供することを目的とする。 The present invention has been made in consideration of the above situation, and image processing that performs code amount control after encoding with high accuracy in consideration of the influence of image distortion on the image quality. An object is to provide an apparatus, an image processing method, a program, and a recording medium.

上記の課題を解決するために、請求項１に記載の画像処理装置の発明は、符号データの復号再生画像の歪量と画像属性情報、及び画像属性毎の歪量基準と、によって前記符号データを構成する符号列（パケット）単位で重要度（優先順位）を設定する機能を有することを特徴とする。 In order to solve the above-described problem, the image processing apparatus according to claim 1 is characterized in that the code data includes the distortion amount of the decoded reproduction image of the code data, the image attribute information, and the distortion amount criterion for each image attribute. It has the function to set importance (priority order) in the code string (packet) unit which constitutes.

請求項２に記載の画像処理装置の発明は、画像部分領域に対応する符号データの復号再生画像の歪量と画像属性情報、及び画像属性毎の歪量基準と、によって前記画像部分領域に対応する符号データを構成する符号列（パケット）単位で重要度（優先順位）を設定する機能を有することを特徴とする。 The invention of the image processing apparatus according to claim 2 corresponds to the image partial area by the distortion amount of the decoded reproduction image of the encoded data corresponding to the image partial area, the image attribute information, and the distortion amount standard for each image attribute. It has a function of setting importance (priority order) in units of code strings (packets) constituting code data to be performed.

請求項３に記載の発明は、請求項１または２に記載の画像処理装置において、前記符号列（パケット）単位の重要度（優先順位）によって、符号データを構成する符号列の順番を変更し符号データを再構成する機能を有することを特徴とする。 According to a third aspect of the present invention, in the image processing apparatus according to the first or second aspect, the order of the code strings constituting the code data is changed according to the importance (priority order) of the code string (packet) unit. It has a function of reconstructing code data.

請求項４に記載の発明は、請求項１または２に記載の画像処理装置において、前記符号列（パケット）単位の重要度（優先順位）によって、符号データを構成する符号列単位で符号列順に送信する機能を有することを特徴とする。 According to a fourth aspect of the present invention, there is provided the image processing apparatus according to the first or second aspect, wherein the code sequence of the code sequence constituting the code data is in code sequence order according to the importance (priority order) of the code sequence (packet) unit. It has the function to transmit.

請求項５に記載の発明は、請求項４に記載の画像処理装置において、前記符号列（パケット）単位の重要度（優先順位）によって、符号データを構成する符号列単位で送信状況に応じて符号列を途中で打ち切って送信する機能を有することを特徴とする。 According to a fifth aspect of the present invention, in the image processing apparatus according to the fourth aspect of the present invention, depending on the importance level (priority order) of the code string (packet) unit, the code string unit constituting the code data corresponds to the transmission status. It has a function of transmitting a code string by cutting it halfway.

請求項６に記載の発明は、請求項４または５に記載の画像処理装置において、符号データを構成する符号列の一部がレングスの短いダミーの符号列に置きかえて復号対象の符号データを構成する機能を有することを特徴とする。 According to a sixth aspect of the present invention, in the image processing apparatus according to the fourth or fifth aspect, the code data constituting the decoding target is configured by replacing a part of the code string constituting the code data with a dummy code string having a short length. It has the function to perform.

請求項７に記載の発明は、請求項３に記載の画像処理装置において、画像領域毎の歪量と歪量基準を比較する手段を有し、特定画像属性である領域に対応する前記符号データが歪量基準以上であるように符号データを再構成する手段を有することを特徴とする。 A seventh aspect of the present invention is the image processing apparatus according to the third aspect, further comprising means for comparing a distortion amount for each image area with a distortion amount reference, and the code data corresponding to the area having the specific image attribute. It is characterized by having means for reconstructing the code data so that is equal to or greater than the distortion amount standard.

請求項８に記載の発明は、請求項３に記載の画像処理装置において、画像属性毎の歪量と歪量基準を比較する手段を有し、前記再構成される符号データにより再生される画像が全ての画像属性で特定領域の画像属性毎の歪量基準以上であるような符号データであることを特徴とする。 According to an eighth aspect of the present invention, there is provided the image processing apparatus according to the third aspect, further comprising means for comparing a distortion amount for each image attribute with a distortion amount reference, and an image reproduced by the reconstructed code data. Is code data that is equal to or greater than a distortion amount standard for each image attribute of a specific region in all image attributes.

請求項９に記載の発明は、請求項２に記載の画像処理装置において、画像の部分領域が予め設定された注目（ＲＯＩ）領域であることを特徴とする。 According to a ninth aspect of the present invention, in the image processing apparatus according to the second aspect, the partial region of the image is a preset attention (ROI) region.

請求項１０に記載の発明は、請求項１乃至９のいずれかに記載の画像処理装置において、請求項１または２に記載された符号データの復号再生画像の歪量が符号データ生成過程で算出された歪量であることを特徴とする。 According to a tenth aspect of the present invention, in the image processing device according to any one of the first to ninth aspects, the distortion amount of the decoded reproduction image of the code data according to the first or second aspect is calculated in the code data generation process. It is the amount of distortion made.

請求項１１に記載の発明は、請求項１０に記載の画像処理装置において、請求項１または２に記載された符号データ生成過程が、入力画像に対して低域フィルタ及び高域フィルタを垂直方向及び水平方向に施してサブバンドを生成するサブバンド生成手段と、前記低域成分のサブバンドに対して階層的にフィルタリング処理を施すフィルタリング手段と、前記フィルタリング手段によって生成されたサブバンドを分割し、所定の大きさのコードブロックを生成するコードブロック生成手段と、前記コードブロック単位に最上位ビットから最下位ビットに至るビットプレーンを生成するビットプレーン生成手段と、前記ビットプレーン毎に符号化パスを生成する符号化パス生成手段と、上記符号化パス内で算術符号化を行う算術符号化手段と、算術符号の歪量を算出する算術符号歪量算出手段とを備え、請求項１または２に記載された符号データの復号再生画像の歪量が前記算術符号歪量算出手段で算出された歪量であることを特徴とする。 According to an eleventh aspect of the present invention, in the image processing device according to the tenth aspect, in the code data generation process according to the first or second aspect, the low-pass filter and the high-pass filter are vertically aligned with respect to the input image. And subband generation means for generating subbands by applying in a horizontal direction, filtering means for hierarchically filtering the lowband component subbands, and subbands generated by the filtering means. A code block generating means for generating a code block of a predetermined size, a bit plane generating means for generating a bit plane from the most significant bit to the least significant bit for each code block, and an encoding pass for each bit plane Coding pass generation means for generating, arithmetic coding means for performing arithmetic coding in the coding pass, An arithmetic code distortion amount calculating means for calculating the distortion amount of the technical code, wherein the distortion amount of the decoded reproduction image of the code data described in claim 1 or 2 is calculated by the arithmetic code distortion amount calculation means It is characterized by being.

請求項１２に記載の発明は、請求項１０に記載の画像処理装置において、請求項１乃至２に記載された符号データ生成過程で、画像領域毎にビットプレーン単位のトランケーションにより符号量を削減する符号化がなされ、前記画像領域毎の各ビットプレーンにおける最上位有効ビットの個数を歪量とすることを特徴とする。 According to a twelfth aspect of the present invention, in the image processing device according to the tenth aspect, in the code data generation process according to the first or second aspect, the code amount is reduced by truncation in units of bit planes for each image area. Encoding is performed, and the number of most significant bits in each bit plane for each image area is used as a distortion amount.

請求項１３に記載の発明は、請求項１乃至１２のいずれかに記載の画像処理装置において、画像領域を指定する手段と、前記画像領域に対応する範囲の符号データを抽出する手段とを有することを特徴とする。 A thirteenth aspect of the present invention is the image processing apparatus according to any one of the first to twelfth aspects, further comprising means for designating an image area and means for extracting code data in a range corresponding to the image area. It is characterized by that.

請求項１４に記載の発明は、請求項１３に記載の画像処理装置において、画像部分領域を指定する手段と、前記画像領域に対応する符号データを構成する符号列（パケット）の重要度に基づいて符号データを再構成する手段とを有することを特徴とする。 According to a fourteenth aspect of the present invention, in the image processing apparatus according to the thirteenth aspect, the means for designating the image partial area and the importance of the code string (packet) constituting the code data corresponding to the image area. And means for reconstructing the code data.

請求項１５に記載の発明は、請求項１乃至１４のいずれかに記載の画像処理装置において、前記符号データがＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格に基づき符号化されたデータであることを特徴とする。 According to a fifteenth aspect of the present invention, in the image processing device according to any one of the first to fourteenth aspects, the code data is data encoded based on JPEG2000 (ISO / IEC 15444-1) standard. Features.

請求項１６に記載の発明は、請求項１５に記載の画像処理装置において、請求項２に記載の画像部分領域がＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格のプレシンクト単位またはタイル単位であることを特徴とする。 According to a sixteenth aspect of the present invention, in the image processing apparatus according to the fifteenth aspect, the image partial area according to the second aspect is a JPEG2000 (ISO / IEC 15444-1) standard precinct unit or tile unit. Features.

請求項１７に記載の発明は、請求項１５に記載の画像処理装置において、請求項１または２に記載された符号列単位がＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格のパケット単位であることを特徴とする。 According to a seventeenth aspect of the present invention, in the image processing apparatus according to the fifteenth aspect, the code string unit according to the first or second aspect is a packet unit of JPEG2000 (ISO / IEC 15444-1) standard. Features.

請求項１８に記載の画像処理方法の発明は、符号データの復号再生画像の歪量と画像属性情報、及び画像属性毎の歪量基準と、によって前記符号データを構成する符号列（パケット）単位で重要度（優先順位）を設定することを特徴とする。 The image processing method according to claim 18 is a code string (packet) unit constituting the code data by a distortion amount of the decoded reproduction image of the code data, image attribute information, and a distortion amount reference for each image attribute. The importance level (priority order) is set by.

請求項１９に記載の画像処理方法の発明は、画像部分領域に対応する符号データの復号再生画像の歪量と画像属性情報、及び画像属性毎の歪量基準と、によって前記画像部分領域に対応する符号データを構成する符号列（パケット）単位で重要度（優先順位）を設定することを特徴とする。 The image processing method according to claim 19 corresponds to the image partial area based on the distortion amount and image attribute information of the decoded reproduction image of the encoded data corresponding to the image partial area, and the distortion amount criterion for each image attribute. The importance (priority order) is set for each code string (packet) constituting the code data to be processed.

請求項２０に記載の発明は、コンピュータに、請求項１乃至１７のいずれかに記載の画像処理装置の機能を実現させるためのプログラムである。
請求項２１に記載の発明は、請求項２０に記載のプログラムを記録したコンピュータ読み取り可能な記録媒体である。 The invention according to claim 20 is a program for causing a computer to realize the function of the image processing apparatus according to any one of claims 1 to 17.
A twenty-first aspect of the invention is a computer-readable recording medium on which the program of the twentieth aspect is recorded.

本発明によれば、画像属性による歪量の画質への影響を考慮して、符号化後の符号量制御を高精度に実行することができる。 According to the present invention, code amount control after encoding can be performed with high accuracy in consideration of the influence of the distortion amount due to the image attribute on the image quality.

以下、図面を参照して本発明の画像処理装置の好適な実施形態について説明する。 Hereinafter, preferred embodiments of an image processing apparatus of the present invention will be described with reference to the drawings.

＜本発明の基本的な考え方＞ <Basic concept of the present invention>

まず、本発明の基本的な考え方について説明する。
ＪＰＥＧ２０００（Ｊ２Ｋ：ＩＳＯ／ＩＥＣ１５４４４−１）規格にみられるように、ペリフェラル構造を持った符号においては、出力機器の要求に合わせて符号データの全てを復号化することなく一部の符号を選択したり、削除して復号化し、再現することが可能である。
符号データは、規定された符号列の集まり（順番）によって構成されており、符号レベルでの符号列単位の編集ができるのが特徴である。 First, the basic concept of the present invention will be described.
As seen in the JPEG2000 (J2K: ISO / IEC 15444-1) standard, a code with a peripheral structure can be partially selected without decoding all of the code data according to the requirements of the output device. Or can be deleted, decoded, and reproduced.
The code data is composed of a set (order) of defined code strings, and is characterized in that it can be edited in units of code strings at the code level.

本発明における符号列の編集は、特定の符号を削除し、符号量を調整したり、早期に重要な符号列を転送するように符号列の順番を変更する操作が主な機能である。この場合、符号列が重要度または優先度によって適切に並べられていることが望ましい。さもないと、より重要な符号が削除され、より重要でない符号が削除されないということが起こり得る。あるいは、より重要な符号が後で転送され、より重要でない符号が先に転送されるということが起こり得るからである。そこで、どのようにして優先順位を決めるかが問題になる。 The editing of the code string in the present invention is mainly performed by deleting a specific code, adjusting the code amount, or changing the order of the code string so as to transfer an important code string at an early stage. In this case, it is desirable that the code strings are appropriately arranged according to importance or priority. Otherwise, it may happen that more important codes are deleted and less important codes are not deleted. Alternatively, it may happen that more important codes are transferred later and less important codes are transferred first. Therefore, how to determine the priority order becomes a problem.

ところで、ＪＰＥＧ２０００（Ｊ２Ｋ：ＩＳＯ／ＩＥＣ１５４４４−１）規格の符号に見られるように、階層的に符号が構成されていると、符号レベルでの符号列単位の編集が有効に機能する。このような階層符号化では、複数個の尺度で分割され階層的に符号が構成されている。 By the way, as seen in the code of the JPEG2000 (J2K: ISO / IEC 15444-1) standard, when the code is hierarchically configured, editing in units of code strings at the code level functions effectively. In such hierarchical encoding, codes are hierarchically divided by a plurality of scales.

複数個の尺度を含むということは、目的に応じて尺度を変えて先に述べた符号列単位の優先度を変更して符号データを構成するには都合がよい。つまり、基準が単一であるから、一つの尺度を選択して、一次元に並べるのには都合がよい。
しかし、一方、複数の尺度を総合的に評価して、一次元に並べることは困難である。基準が複数次元であるため単純に比較できないためである。 The inclusion of a plurality of scales is convenient for constructing code data by changing the scale according to the purpose and changing the priority of the code string unit described above. In other words, since the reference is single, it is convenient to select one scale and arrange it in one dimension.
However, it is difficult to comprehensively evaluate a plurality of scales and arrange them in one dimension. This is because the reference is multi-dimensional and cannot be simply compared.

本発明の画像処理は、次のような手順で全体が処理される。
（１）画像データを入力する。
（２）符号データを構成する符号列（パケット）単位に歪量を算出して、符号化処理を行う。
（３）画像属性毎の歪量基準を利用して、符号パケット単位の優先順位あるいは重要度を設定する。
（４）設定した符号パケット単位の優先順位あるいは重要度を活用して、符号列順制御、送信符号打切制御および符号量制御を行う。
（５）復号化処理を行う。
（６）画像データを出力する。 The entire image processing of the present invention is processed in the following procedure.
(1) Input image data.
(2) The distortion amount is calculated for each code string (packet) constituting the code data, and the encoding process is performed.
(3) The priority or importance of each code packet is set using the distortion amount criterion for each image attribute.
(4) Utilizing the set priority order or importance of the code packet unit, code sequence order control, transmission code stop control, and code amount control are performed.
(5) Perform decryption processing.
(6) Output image data.

本発明の特徴は、前述のように、符号化処理の際に符号データを構成する符号列（パケット）単位に歪量を算出し、符号化後に画像属性毎の歪量基準を利用して符号パケット単位優先順位（重要度）を設定して、符号列順制御、送信符号打切制御、符号量制御に活用するところにある。 As described above, the feature of the present invention is that, as described above, a distortion amount is calculated for each code string (packet) constituting code data at the time of encoding processing, and encoding is performed using a distortion amount criterion for each image attribute after encoding. The packet unit priority (importance) is set and used for code sequence control, transmission code abort control, and code amount control.

以下、ＪＰＥＧ２０００規格（ＩＳＯ／ＩＥＣ１５４４４−１）の仕様に基づく符号化を例にとりあげ、最初に、符号化過程における部分符号列単位での符号列の歪の算出について説明し、符号レベルでの符号列単位での重要度あるいは優先度を算出する機能について説明する。 Hereinafter, coding based on the specification of the JPEG2000 standard (ISO / IEC 154444-1) is taken as an example. First, calculation of distortion of a code string in a partial code string unit in the coding process will be described, and at the code level A function for calculating importance or priority in units of code strings will be described.

ＪＰＥＧ２０００標準規格に基づいた符号は、階層符号であり、単位符号（符号列）で区分されているので、符号列単位の並び替えや、特定の符号列を削除したり、あるいは追加したり、符号列レベルの符号データの編集（パーサ）が容易なため、ＪＰＥＧ２０００標準規格に基づいた符号化データを対象とすることで、本発明における重要度に基づく符号データのアクセスや符号量制御が容易に実現できる。 Codes based on the JPEG2000 standard are hierarchical codes and are divided by unit codes (code strings). Therefore, rearrangement of code string units, deletion of specific code strings, addition of codes, Since it is easy to edit (parse) the code data at the column level, it is possible to easily access the code data and control the code amount based on the importance in the present invention by targeting the encoded data based on the JPEG2000 standard. it can.

＜符号化後の符号列単位での重要度あるいは優先度の設定制御方式＞ <Control method for setting importance or priority for each encoded code string>

次に、ＪＰＥＧ２０００規格に基づく符号化後の符号列単位での重要度あるいは優先度の設定方式について説明する。
本発明の実施形態に係る画像処理装置においては、特定領域内の画像属性情報と、指定領域内歪量算出と、画像属性毎の歪量基準とによって、符号データを構成する符号列単位で優先順位を設定する機能を有することで、符号データを符号列単位で、重要でない符号列（パケット）を一部削除し、符号量と画質を調整するようにしている。 Next, an importance or priority setting method in units of code strings after encoding based on the JPEG2000 standard will be described.
In the image processing apparatus according to the embodiment of the present invention, priority is given to each code string constituting the code data based on the image attribute information in the specific area, the calculation of the distortion amount in the specified area, and the distortion amount criterion for each image attribute. By having the function of setting the order, the code data is deleted in units of code strings, and some unimportant code strings (packets) are deleted, and the code amount and image quality are adjusted.

図１は、本発明の実施形態に係る画像処理装置における重要度設定の構成を示すブロック図である。図１において、画像処理装置は、入力処理部１０、符号化処理部２０、符号列単位重要度算出部３０、符号データ編集部４０、復号処理部５０、出力処理部６０、画像データ保存部１（７０）、符号データ保存部８０、画像データ保存部２（９０）からなっている。 FIG. 1 is a block diagram showing a configuration of importance setting in the image processing apparatus according to the embodiment of the present invention. 1, the image processing apparatus includes an input processing unit 10, an encoding processing unit 20, a code string unit importance calculation unit 30, a code data editing unit 40, a decoding processing unit 50, an output processing unit 60, and an image data storage unit 1. (70), a code data storage unit 80, and an image data storage unit 2 (90).

入力処理部１０から入力された画像データは、画像データ保存部１（７０）に保存され、符号化処理部２０で符号化されたデータが符号データ保存部８０に保存される。保存された符号データは、ネットワーク等を介して復号側に転送される。また、保存された符号データは、復号処理部５０で復号され、画像データが再生され、画像データ保存部２（９０）に保存され、出力処理部６０で再生された画像データが出力される。 The image data input from the input processing unit 10 is stored in the image data storage unit 1 (70), and the data encoded by the encoding processing unit 20 is stored in the code data storage unit 80. The stored code data is transferred to the decoding side via a network or the like. The stored code data is decoded by the decoding processing unit 50, the image data is reproduced, stored in the image data storage unit 2 (90), and the image data reproduced by the output processing unit 60 is output.

さらに、符号データ保存部８０に保存されている符号データを、符号列単位重要度算出部３０で設定された、符号データを構成する符号列単位の重要度に基づいて、符号データ編集部４０で符号データを再構成（符号順制御、符号量制御等）する。 Further, the code data editing unit 40 converts the code data stored in the code data storage unit 80 on the basis of the importance of the code string unit constituting the code data set by the code string unit importance calculation unit 30. The code data is reconstructed (code order control, code amount control, etc.).

符号列単位重要度算出部３０は、符号列単位属性情報抽出部３１、符号列単位歪量抽出部３２、符号列単位重要度設定部３３により構成される。 The code string unit importance calculation unit 30 includes a code string unit attribute information extraction unit 31, a code string unit distortion amount extraction unit 32, and a code string unit importance setting unit 33.

符号列単位属性情報抽出部３１は、後述するように、画像属性情報記憶部３５に記憶した、予め設定した、あるいは、符号化過程で算出した画像属性情報を用いて、後述するように、符号化過程で、符号列単位に属性情報を抽出する。
符号列単位歪量抽出部３２は、後述するように、符号化過程で、符号列単位に歪量を抽出する。
符号列単位重要度設定部３３は、画像属性毎歪量基準記憶部３４に記憶された画像属性毎歪量基準データと、符号列単位歪量抽出部３２で抽出した符号列単位の歪量と、符号列単位属性情報抽出部３１で抽出した符号列単位の属性情報を使用して符号列単位の重要度を設定する。 As will be described later, the code string unit attribute information extraction unit 31 uses the image attribute information stored in the image attribute information storage unit 35 in advance or calculated in the encoding process, as described later. In the conversion process, attribute information is extracted for each code string.
As will be described later, the code string unit distortion amount extraction unit 32 extracts the distortion amount for each code string in the encoding process.
The code string unit importance level setting unit 33 includes the distortion amount reference data for each image attribute stored in the image attribute distortion amount reference storage unit 34, the distortion amount for each code string extracted by the code string unit distortion amount extraction unit 32, and The importance level of the code string unit is set using the attribute information of the code string unit extracted by the code string unit attribute information extracting unit 31.

（１）符号列再構成 (1) Code stream reconstruction

本発明では、特定領域内の画像属性情報と、指定領域内歪量と、画像属性毎の歪量基準とによって、符号データを構成する符号列（パケット）単位の重要度を算出し、符号データを重要度の高い符号列から順に並び替えて、符号データを再構成する。これにより、符号データを重要度に応じ符号列単位で再構成する手段を提供することができる。 In the present invention, the importance of each code string (packet) constituting the code data is calculated based on the image attribute information in the specific area, the distortion amount in the specified area, and the distortion amount criterion for each image attribute. Are rearranged in order from the code sequence having the highest importance to reconstruct the code data. As a result, it is possible to provide means for reconstructing the code data in units of code strings in accordance with the importance.

また、このように再構成された符号データを転送するようにして、復号化処理における符号データの仕様が規定するプログレッション順でない順番でパケットを転送することができるので、重要な符号列（パケット）を優先的に転送する手段を提供することができる。 In addition, since the reconstructed code data is transferred, packets can be transferred in an order other than the progression order defined by the code data specifications in the decoding process. It is possible to provide a means for preferentially transferring.

また、符号データを構成する符号列（パケット）単位の重要度あるいは優先順位を算出し、重要度あるいは優先度順に符号列を並び替えて、符号データを再構成して、この再構成された符号データを符号列単位に順次転送するときに、符号データを構成する符号列単位で転送状況に応じて符号列を途中で打ち切った場合、転送符号データを再構成するようにして、転送状況に応じて重要な符号列を優先的に送信する機能を提供することができる。 Also, the importance or priority of each code string (packet) constituting the code data is calculated, the code strings are rearranged in order of importance or priority, the code data is reconfigured, and the reconfigured code When data is sequentially transferred in units of code strings, if the code string is terminated in the middle according to the transfer status in units of code strings constituting the code data, the transfer code data is reconfigured according to the transfer status. Therefore, it is possible to provide a function of preferentially transmitting an important code string.

このとき、符号データを構成する符号列の一部がレングスの短いダミーの符号列に置きかえて復号対象の符号データを構成するようにして、符号データ送信後に符号データの条件を満たすように整合を図り、復号化処理の規定への整合性を確保する。即ち、符号列（パケット）が一部削除され、ＪＰＥＧ２０００の規定するプログレッション順の条件を満たさない場合に、ＪＰＥＧ２０００の規定するプログレッション順の条件を満たすようにする。 At this time, a part of the code string constituting the code data is replaced with a dummy code string having a short length to form the code data to be decoded, and matching is performed so that the code data condition is satisfied after the code data is transmitted. Ensure consistency with the rules of decryption processing. That is, when a part of the code string (packet) is deleted and the progression order condition defined by JPEG2000 is not satisfied, the progression order condition defined by JPEG2000 is satisfied.

（２）画像領域毎の歪量基準による歪量制御方式： (2) A distortion amount control method based on a distortion amount standard for each image area:

本発明では、画像領域毎の歪量と歪量基準を比較するようにして、特定画像領域に対応する符号データが歪量基準以上であるような符号データとすることで、画像領域毎の歪量基準以上の歪が生じないようにする。 In the present invention, the distortion amount for each image area is compared by comparing the distortion amount for each image area with the distortion amount standard so that the code data corresponding to the specific image area is equal to or greater than the distortion amount reference. Avoid distortion above the quantity standard.

また、画像部分領域に対応する符号データの復号再生画像の歪量と、画像属性情報および画像属性毎の歪量基準と、によって前記画像部分領域に対応する符号データを構成する符号列（パケット）単位で重要度（優先順位）を設定するようにして、指定領域内の符号データの重要でない符号列（パケット）を一部削除または追加し画質を調整する手段を提供することができる。 Also, a code string (packet) constituting code data corresponding to the image partial area by the distortion amount of the decoded reproduction image of the code data corresponding to the image partial area, the image attribute information and the distortion amount standard for each image attribute By setting the importance (priority order) in units, it is possible to provide means for adjusting the image quality by deleting or adding a part of the unimportant code string (packet) of the code data in the designated area.

上述の画像の特定領域を、予め設定された注目ＲＯＩ領域（高画質化処理されるべき領域）であるとすることで、重要領域に基づいて画像領域毎に符号量制御する手段を提供することができる。例えば、この特定の画像属性を文字情報とすることで、重要である文字情報の再現画質が向上する。
また、画質劣化の無い許容劣化幅以内のレートの算出にも使用できる。 Providing means for controlling the amount of code for each image area based on an important area by setting the specific area of the image as a target ROI area set in advance (an area to be subjected to high image quality processing) Can do. For example, the reproduction quality of important character information is improved by using the specific image attribute as character information.
It can also be used to calculate a rate within an allowable deterioration range without image quality deterioration.

＜符号列毎の重要度を使用した応用システム＞ <Application system using importance for each code string>

（１）重要度を利用して、クライアント側からサーバ上の符号データをアクセスする。 (1) The code data on the server is accessed from the client side using the importance.

次に、重要度を利用して、サーバ上の符号列から一部の符号データをクライアント側からアクセスする構成について説明する。
図２は、本発明の実施形態に係る画像処理システムの構成を示すブロック図である。同図において、画像処理システムは、サーバ１００と１つ以上のクライアント２００とがネットワークで接続されている。 Next, a configuration in which a part of code data is accessed from the code string on the server from the client side using the importance will be described.
FIG. 2 is a block diagram showing the configuration of the image processing system according to the embodiment of the present invention. In the figure, in the image processing system, a server 100 and one or more clients 200 are connected via a network.

サーバ１００は、画像領域指定部１１０、領域範囲対応符号データ選択部１２０、指定領域内画像属性情報抽出部１３０、指定領域内歪量算出部１４０、符号列選択部１５０、符号ストリーム生成部１７０、領域単位画像属性情報保存部１３５、領域単位歪情報保存部１４５、符号データ保存部１５５、画像属性毎の歪量基準保存部１６５とからなっている。 The server 100 includes an image region designation unit 110, a region range corresponding code data selection unit 120, a designated region image attribute information extraction unit 130, a designated region distortion amount calculation unit 140, a code string selection unit 150, a code stream generation unit 170, An area unit image attribute information storage unit 135, an area unit distortion information storage unit 145, a code data storage unit 155, and a distortion amount reference storage unit 165 for each image attribute are included.

画像領域指定部１１０では、クライアント２００毎の符号ストリームアップロードに基づいて要求画像領域を指定する。
領域単位歪情報保存部１４５は、上記で説明したような処理によって符号データが生成される段階で抽出された領域単位歪情報が保存されている。 The image area designating unit 110 designates the requested image area based on the code stream upload for each client 200.
The area unit distortion information storage unit 145 stores area unit distortion information extracted at the stage where code data is generated by the process described above.

領域範囲対応符号データ選択部１２０では、要求された画像領域指定に基づいて領域範囲対応符号データを選択する。
指定領域内画像属性情報抽出部１３０では、選択された領域範囲対応符号データに基づいて、領域単位画像属性情報保存部１３５から指定領域内の画像属性情報を抽出する。
指定領域内歪量算出部１４０では、選択された領域範囲対応符号データに対する領域単位歪情報を領域単位歪情報保存部１４５から取り出して指定領域内の歪量を算出する。 The area range corresponding code data selection unit 120 selects area range corresponding code data based on the requested image area designation.
The designated region image attribute information extraction unit 130 extracts image attribute information in the designated region from the region unit image attribute information storage unit 135 based on the selected region range corresponding code data.
The designated area distortion amount calculation unit 140 extracts area unit distortion information for the selected area range corresponding code data from the area unit distortion information storage unit 145 and calculates the distortion amount in the designated area.

符号列選択部１５０は、抽出された指定領域内の画像属性情報と、算出された指定領域内の歪量と、画像属性毎の歪量基準保存部１６５に保存されている画像属性毎の歪量基準に基づいて指定された領域の符号データを構成する符号列を符号データ保存部１５５から選択する。
符号ストリーム生成部１７０では、選択した符号列により符号ストリームを構成し、クライアント２００側に符号ストリームを送信する。 The code string selection unit 150 includes the extracted image attribute information in the designated area, the calculated distortion amount in the designated area, and the distortion for each image attribute stored in the distortion amount reference storage unit 165 for each image attribute. A code string that constitutes code data of a region designated based on the quantity criterion is selected from the code data storage unit 155.
The code stream generation unit 170 configures a code stream with the selected code string, and transmits the code stream to the client 200 side.

この構成により、符号データのパケット単位の歪量に基づいて符号データの一部分をアクセスすることができる。この場合、画像領域を指定するようにして、この画像領域に対応する範囲の符号データを抽出することで、画像領域をアクセスする手段を提供できる。さらに、画像領域に対応する範囲の符号データを構成する符号列（パケット）の重要度に基づいて符号データを抽出するようにすると、符号列の重要度に基づいて符号データをアクセスすることができる。 With this configuration, a part of the code data can be accessed based on the distortion amount of the code data in units of packets. In this case, a means for accessing the image area can be provided by designating the image area and extracting code data in a range corresponding to the image area. Furthermore, if the code data is extracted based on the importance of the code string (packet) constituting the code data in the range corresponding to the image area, the code data can be accessed based on the importance of the code string. .

（２）重要度を利用した符号量調整 (2) Code amount adjustment using importance

前述したように、ＪＰＥＧ２０００仕様準拠で生成された符号データのヘッダを解析して、タイル（画像領域）単位あるいはプリシンクト単位に符号データをトランケート（符号列制御）したり編集したりすることができる。
本発明では、符号データを構成する符号列単位に設定した重要度を利用し符号列（パケット）単位で符号量を調整することができる。
例えば、特定画像領域に対応する符号データが歪量基準以上であるような符号データとすることで、画像領域毎の歪量基準以上の歪が生じないような構成とすることができる。 As described above, it is possible to analyze the header of code data generated in compliance with the JPEG2000 specification, and to truncate (code string control) or edit the code data in tile (image area) units or precinct units.
In the present invention, the code amount can be adjusted in units of code strings (packets) using the importance set in units of code strings constituting the code data.
For example, by setting the code data corresponding to the specific image area so that the code data is equal to or higher than the distortion amount standard, it is possible to configure so that distortion exceeding the distortion amount standard for each image area does not occur.

また、特定画像属性である領域に対応する符号データが歪量基準以上であるように符号データを再構成することによって、符号データを符号列単位で、重要でない符号列（パケット）を一部削除または追加し符号量を調整するようにし、画像属性毎の画質を保証することができる。また、出力領域単位でパケット単位の符号データを選択し、関係する符号列の追加削除が容易にできるので出力画質管理を容易に行うことができる。 Also, by reconfiguring the code data so that the code data corresponding to the area having the specific image attribute is equal to or greater than the distortion amount standard, the code data is partially deleted in units of code strings and some unimportant code strings (packets) are deleted. Alternatively, it is possible to adjust the code amount and to guarantee the image quality for each image attribute. In addition, since code data in units of packets can be selected in units of output areas, and addition and deletion of related code strings can be easily performed, output image quality management can be easily performed.

また、再構成される符号データにより再生される画像が全ての画像属性で特定領域の画像属性毎の歪量基準以上であるような符号データとすることで、例えば、重要である文字情報の再現画質を向上させることができる。 Also, by making the code data such that the image reproduced by the reconstructed code data is equal to or greater than the distortion amount standard for each image attribute of the specific region in all image attributes, for example, reproduction of important character information Image quality can be improved.

次に、トランケーションを行う場合には、上記のように画像領域に対応する符号列毎に（例えば、ひとつのサブバンド内の各コードブロックに対して）異なる量のトランケーションを行うと、領域毎に（例えば、コードブロック間で）歪が生じ、これが歪誤差となって見えてくる場合がある。 Next, when truncation is performed, if a different amount of truncation is performed for each code string corresponding to the image area (for example, for each code block in one subband) as described above, There is a case where distortion occurs (for example, between code blocks) and this appears as a distortion error.

このような場合、画像領域毎に符号量を制御したり、あるいは、画像領域全体で一律に符号量を制御してもかまわない。勿論、符号列とサブバンドとの対応をとることにより、サブバンド単位で行ってもかまわない。 In such a case, the code amount may be controlled for each image region, or the code amount may be controlled uniformly for the entire image region. Of course, it may be performed in units of subbands by taking correspondences between code strings and subbands.

本発明では、次のようにして、算出した算術符号データの重要度算出結果に基づいてトランケートする算術符号データを定める。
まず、符号データを構成する符号列（パケット）単位の重要度と符号量とを算出し、重要度順に符号列を並べ、次に、重要度の小さい（重要でない）符号列から順に、符号量を順次加算していって加算値が削減すべき総符号量となるまで符号列を削除して、符号データを再構成する。 In the present invention, arithmetic code data to be truncated is determined based on the calculated importance calculation result of arithmetic code data as follows.
First, the importance and code amount in units of code strings (packets) constituting the code data are calculated, the code strings are arranged in the order of importance, and then the code quantity in ascending order of importance (not important). Are sequentially added, and the code string is deleted until the total code amount to be reduced is reached, and the code data is reconstructed.

上記の符号データの削除単位は、算術符号データであるが、単位は、別の区切り単位であってもかまわない。ＪＰＥＧ２０００符号化における符号量制御の場合であれば、コードブロック単位あるいはパケット単位で削除するのであってもかまわない。重要度がその単位毎に算出されていればよい。 The code data deletion unit is arithmetic code data, but the unit may be another delimiter unit. In the case of code amount control in JPEG2000 encoding, deletion may be performed in units of code blocks or packets. It is only necessary that the importance is calculated for each unit.

本発明のＪＰＥＧ２０００符号化における実施形態においては、コードブロック単位あるいはパケット単位の符号データは、算術符号データの集合で構成されているので、前述したような方法で、算術符号データ単位の重要度が判別できていれば、その結果をコードブロック単位あるいはパケット単位で集計することで容易に重要度を算出することができる。したがって、符号量制御情報単位がＪＰＥＧ２０００規格のコードブロック単位あるいはパケット単位としてもよい。 In the embodiment of JPEG2000 encoding according to the present invention, code data in code block units or packet units is composed of a set of arithmetic code data. Therefore, the importance of arithmetic code data units is increased by the method described above. If it is discriminated, the importance can be easily calculated by counting the results in units of code blocks or packets. Therefore, the code amount control information unit may be a JPEG2000 standard code block unit or a packet unit.

また同様にして、前述したような部分画像領域単位に符号量制御する場合にあっては、ＪＰＥＧ２０００符号化において、符号量制御の目標値がＪＰＥＧ２０００規格のプリシンクト単位またはタイル単位で、単位毎に符号量を制御してもかまわない。 Similarly, when code amount control is performed in units of partial image areas as described above, in JPEG2000 encoding, the target value for code amount control is JPEG2000 standard precinct units or tile units. The amount may be controlled.

このように、ＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格の符号化方式では、画像の矩形領域単位（タイル単位）に独立に符号化するだけでなく、さらに、各タイル内の部分領域単位（プリシンクト単位）に階層的な符号データが形成される。そのため、プリシンクト単位で符号データの量（トランケート量）を管理する仕組みを具備することで、高精細の符号量制御が可能となるのである。 As described above, in the encoding method of the JPEG2000 (ISO / IEC 15444-1) standard, not only encoding is performed independently for each rectangular area (tile unit) of an image, but also a partial area unit (precinct) in each tile. Hierarchical code data is formed in (unit). Therefore, by providing a mechanism for managing the amount of code data (amount of truncation) in units of precincts, high-definition code amount control becomes possible.

＜歪量計算方式＞ <Distortion calculation method>

本発明は、歪量によって符号データの符号列毎に優先順位を設定し、特定画像領域に対応する符号データの歪量が歪量基準以上であるような符号データとすることで、画像領域毎の歪量基準以上の歪が生じないようにすることに特徴がある。
一方、ＪＰＥＧ２０００の符号化では、符号処理過程で歪量を使用してトランケートしているので、本発明では、符号データの復号再生画像の歪量を、符号データ生成過程で算出された歪量とする。 According to the present invention, priority is set for each code string of code data according to the distortion amount, and the code data in which the distortion amount of the code data corresponding to the specific image region is equal to or greater than the distortion amount reference is obtained for each image region. This is characterized in that no distortion exceeding the distortion amount standard is generated.
On the other hand, in JPEG2000 encoding, the amount of distortion is used for truncation in the encoding process. Therefore, in the present invention, the amount of distortion of the decoded reproduction image of the code data is calculated using the distortion amount calculated in the code data generation process. To do.

（１）ＪＰＥＧ２０００の符号列単位の重要度算出 (1) JPEG2000 code string unit importance calculation

ＪＰＥＧ２０００規格（ＩＳＯ／ＩＥＣ１５４４４−１）の仕様に基づく符号化における符号化処理過程では、いくつかの符号処理単位（サブバンド、コードブロックなどの処理単位）で、符号形成処理が進められ、符号処理単位でトランケート処理（符号量制御）される。符号処理の際に、符号処理単位で重要度が算出され、符号量制御されている。
本発明では、画像属性毎の歪量に対する劣化を感じる度合いを考慮して、符号列単位の重要度を算出する。 In the encoding process in the encoding based on the specification of the JPEG2000 standard (ISO / IEC 15444-1), the code forming process is performed in several code processing units (processing units such as subbands and code blocks). Truncation processing (code amount control) is performed in units of processing. At the time of code processing, importance is calculated for each code processing unit, and the code amount is controlled.
In the present invention, the degree of importance for each code string is calculated in consideration of the degree of feeling deterioration with respect to the distortion amount for each image attribute.

ところで、上述したＪＰＥＧ２０００規格の仕様の符号処理単位としては、画像データの周波数情報を意味する符号処理単位（サブバンドなど）と、画像領域を意味する符号処理単位（コードブロック、プリシンクトなど）がある。 By the way, as the coding processing unit of the above-mentioned specification of the JPEG2000 standard, there are a coding processing unit (subband or the like) meaning frequency information of image data and a coding processing unit (code block, precinct or the like) meaning an image area. .

また、ＪＰＥＧ２０００規格の符号処理では、符号処理時にビジュアル・ウェイティング（ＶｉｓｕａｌＷｅｉｇｈｔｉｎｇ）法に基づいて視覚特性を考慮に入れたＶＷ情報を使用して符号形成の過程で作られるコードブロック単位の算術符号データに対して符号量制御している。
以下、ＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格の符号化処理について順次説明していくが、さらに詳しいＪＰＥＧ２０００規格の符号化についての説明については、特開２００３−１６９３３３号公報等に記載されている。 Also, in the code processing of the JPEG2000 standard, arithmetic code data in units of code blocks created in the process of code formation using VW information taking into consideration visual characteristics based on the visual weighting method at the time of code processing The code amount is controlled.
Hereinafter, the encoding process of the JPEG2000 (ISO / IEC 15444-1) standard will be described sequentially, but more detailed description of the encoding of the JPEG2000 standard is described in Japanese Patent Application Laid-Open No. 2003-169333. .

（１−１）ＪＰＥＧ２０００符号化処理 (1-1) JPEG2000 encoding process

ＪＰＥＧ２０００の符号処理は、タイルごとにＤＣレベルシフトおよび色変換を施し、タイルごとにウェーブレット変換し、サブバンドごとに量子化（正規化）を行って、コードブロックごとにビットプレーン符号化して、不要な符号を破棄して必要な符号をまとめてパケットを生成し、パケットを並べて、符号列を形成するように構成されている。復号処理はこの逆の手順で処理される。 JPEG2000 code processing does not require DC level shift and color conversion for each tile, wavelet conversion for each tile, quantization (normalization) for each subband, and bit-plane encoding for each code block In this configuration, a necessary code is discarded, a necessary code is collected to generate a packet, and the packets are arranged to form a code string. The decoding process is performed in the reverse procedure.

図３では、符号化対象である画像データと、符号形成単位となるタイル、サブバンド、プリシンクト、コードブロックの関係を示している。タイルとは、画像を矩形に分割した画像データの単位である。分割数が一つである場合は、一つの画像がタイルに相当している。 FIG. 3 shows the relationship between image data to be encoded and tiles, subbands, precincts, and code blocks that are code forming units. A tile is a unit of image data obtained by dividing an image into rectangles. When the number of divisions is one, one image corresponds to a tile.

プリシンクトに関する符号データは、符号化過程で生成されるのであるが、後で説明するように画像（タイル）の部分領域に対応している。プリシンクトに対応する画像領域は、プリシンクトを構成するコードブロックに対応する画像領域に対応する。コードブロックのサイズは、予め指定されていて区切り方が決められているので、コードブロックに対応する画像領域に基づいてプリシンクトに対応する画像領域を求めることができる。 Code data related to the precinct is generated in the encoding process, and corresponds to a partial area of an image (tile) as described later. The image area corresponding to the precinct corresponds to the image area corresponding to the code block constituting the precinct. Since the size of the code block is specified in advance and the way of separation is determined, the image area corresponding to the precinct can be obtained based on the image area corresponding to the code block.

本発明では、符号化後に元画像の画像領域毎の属性情報を使用するが、この段階で、コードブロックに対応する部分画像領域単位に画像属性を定め、後述するように、実画像と符号化との対応関係に基づいて、形成された符号データの符号列（パケット）単位での画像属性を定める。 In the present invention, the attribute information for each image area of the original image is used after encoding. At this stage, the image attribute is determined for each partial image area corresponding to the code block. The image attribute for each code string (packet) of the formed code data is determined on the basis of the corresponding relationship.

ＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格の符号化方式では、実画像と符号データとが領域単位で対応づけされていることに着目している。
この時、コードブロックに対応する部分画像領域の全域に渡って一つの画像属性が定められれば問題はないが、複数の画像属性が一つのコードブロックに対応する画像領域に含まれている場合には、予め決められた規則に基づいて画像属性を決めればよい。 In the JPEG2000 (ISO / IEC 15444-1) standard encoding method, attention is paid to the fact that actual images and code data are associated with each other in units of regions.
At this time, there is no problem if one image attribute is defined over the entire partial image area corresponding to the code block, but when a plurality of image attributes are included in the image area corresponding to one code block. The image attribute may be determined based on a predetermined rule.

例えば、コードブロックに対応する部分画像領域単位に画像属性を定める方法は、画像入力後にコードブロックに対応する部分画像領域単位に画像を分割し、この部分画像領域単位に画像属性を定める。ここで、画像属性を定める方法はこれだけに限定されない。これ以外にも、画像の特徴を示す画像データが既に存在しているような場合に、属性分けされた画像を画像領域分割して部分画像領域単位に画像属性を定めるという方法もある。 For example, in the method of defining image attributes in units of partial image areas corresponding to code blocks, an image is divided into units of partial image areas corresponding to code blocks after image input, and image attributes are defined in units of partial image areas. Here, the method of determining the image attribute is not limited to this. In addition to this, when there is already image data indicating the characteristics of an image, there is also a method in which an attributed image is divided into image regions and image attributes are determined in units of partial image regions.

個々のタイルは、独立したタイル毎に１つの画像データとみなして、ウェーブレット変換以下の符号処理がなされるので、タイルを構成する全ての符号データが、タイル単位のアクセス単位となる。
タイルを、色空間変換部でＲＧＢからＹＵＶまたはＹＣｂＣｒに変換し、２次元Ｗａｖｅｌｅｔ変換部で色成分ごとにウェーブレット変換した結果として、サブバンドが生成され、サブバンド単位で量子化される。 Each tile is regarded as one piece of image data for each independent tile, and is subjected to code processing below the wavelet transform, so that all code data constituting the tile is an access unit in tile units.
As a result of converting the tiles from RGB to YUV or YCbCr by the color space conversion unit and performing wavelet conversion for each color component by the two-dimensional Wavelet conversion unit, subbands are generated and quantized in units of subbands.

前述したプリシンクトとは、サブバンドを（ユーザが指定可能なサイズの）矩形に分割したもの（をＨＬ，ＬＨ，ＨＨの３つのサブバンドについて集めたものであり、プリシンクトは３つで１まとまりをなす。ただし、ＬＬサブバンドを分割したプリシンクトは１つで１まとまり）で、タイル（画像）中の場所（Ｐｏｓｉｔｉｏｎ）を表すものである。 The precinct described above is a subband divided into rectangles (of a size that can be specified by the user) (collected for three subbands HL, LH, and HH, and one precinct is grouped into three. However, the precinct obtained by dividing the LL subband is a single unit), which represents a position in the tile (image).

このプリシンクトは、タイルの画像データをウェーブレット変換した結果に生成され、タイル（画像）中のある領域の符号データからなっている。プリシンクトのアクセス単位に相当する符号データは、タイル（画像）中のあるプリシンクト領域の符号データであり、プリシンクトをさらに（ユーザが指定可能なサイズの）矩形に分割したものがコードブロックである。 This precinct is generated as a result of wavelet transform of tile image data, and consists of code data of a certain area in the tile (image). Code data corresponding to a precinct access unit is code data of a precinct area in a tile (image), and a code block is obtained by further dividing the precinct into rectangles (of a size that can be designated by the user).

以下、ＪＰＥＧ２０００の符号化処理の過程で、コードブロック、プリシンクト単位の符号データが生成されていく流れを順に説明していく。 In the following, the flow of generation of code data in units of code blocks and precincts in the course of JPEG2000 encoding processing will be described in order.

最初に、入力画像が矩形領域（タイル）単位にウェーブレット変換が施されるわけであるが、ウェーブレット変換は、タイル内の特定領域の（フィルタリングするのに必要な長さの）入力画像に対してフィルタリングすることによって（通常低域フィルタと高域フィルタから構成されるフィルタバンクによって）実現される。 First, the input image is subjected to wavelet transform in units of rectangular areas (tiles). The wavelet transform is applied to an input image (of a length necessary for filtering) in a specific area within the tile. This is realized by filtering (usually by a filter bank composed of a low-pass filter and a high-pass filter).

ウェーブレット変換係数は、タイル内の特定領域の入力画像に対応して順次形成される。すなわち、タイル内の特定画像領域の画像データに対応して、ウェーブレット変換係数が形成される。このように、画像データはタイル単位に分割され、タイル単位で符号化されるわけであるが、以下に説明するように、ＪＰＥＧ２０００仕様の符号化では、タイル内のさらに小さいサイズの部分矩形画像領域単位でウェーブレット変換係数が形成され、対応する領域単位に符号データが形成される。 The wavelet transform coefficients are sequentially formed corresponding to the input image in a specific area in the tile. That is, wavelet transform coefficients are formed corresponding to the image data of the specific image area in the tile. In this way, the image data is divided into tile units and encoded in tile units. However, as described below, in the encoding of the JPEG2000 specification, a partial rectangular image area of a smaller size in a tile. Wavelet transform coefficients are formed in units, and code data is formed in corresponding region units.

ウェーブレット変換は、前述したように、通常、低域成分（ＬＬ）を繰り返し変換する方法を取り、さらにウェーブレット変換係数に対してウェーブレット変換が施される。この時、ウェーブレット変換は、ウェーブレット変換係数を形成するときに使用したタイル内の元画像領域の位置に配置されたデータとして施される。 As described above, the wavelet transform usually takes a method of repeatedly transforming a low frequency component (LL), and the wavelet transform coefficient is further subjected to wavelet transform. At this time, the wavelet transform is performed as data arranged at the position of the original image area in the tile used when forming the wavelet transform coefficients.

通常、ウェーブレット変換は圧縮率を高めるために画像の位置（領域）が互いに近い相関が大きいデータ（の集まり）に対して施されるためである。このことから、サブバンド毎のウェーブレット変換係数は、タイル内の元画像領域の位置との対応関係を保っている。 This is because the wavelet transform is usually applied to data (collection) having a large correlation between image positions (regions) close to each other in order to increase the compression rate. For this reason, the wavelet transform coefficient for each subband maintains a correspondence with the position of the original image area in the tile.

次に、サブバンド単位にウェーブレット変換係数に量子化処理が施されるが、求められた量子化係数もタイル内の元画像領域の位置との対応関係を保っている。
量子化は、ウェーブレット変換係数に対して非可逆圧縮を施す。ＪＰＥＧ２０００の量子化手段としては、ウェーブレット変換係数を量子化ステップサイズで除算するスカラ量子化を用いる。ここで、ＪＰＥＧ２０００の規格上、上述の非可逆圧縮を行うときに、非可逆の９×７ウェーブレット変換フィルタを用いる場合には、自動的にスカラ量子化を併用することが決められている。 Next, quantization processing is performed on the wavelet transform coefficients in units of subbands, and the obtained quantization coefficients also maintain a correspondence relationship with the position of the original image area in the tile.
Quantization performs lossy compression on wavelet transform coefficients. As a JPEG 2000 quantization means, scalar quantization is used in which a wavelet transform coefficient is divided by a quantization step size. Here, according to the JPEG2000 standard, when performing the above-described irreversible compression, if an irreversible 9 × 7 wavelet transform filter is used, it is determined that scalar quantization is automatically used together.

一方、可逆の５×３ウェーブレット変換フィルタを用いる場合には、量子化を行わず、後述のように、符号化パスを切り捨てることによって、符号量制御が行われる。ＪＰＥＧ２０００の量子化処理は、実際には非可逆の９×７ウェーブレット変換フィルタを用いた場合である。 On the other hand, when a reversible 5 × 3 wavelet transform filter is used, the code amount control is performed by truncating the coding pass as described later without performing quantization. JPEG2000 quantization processing is actually a case where an irreversible 9 × 7 wavelet transform filter is used.

続いて、量子化係数に対してビットプレーン符号化がなされる。ＪＰＥＧ２０００における符号化の規格では、ＥＢＣＯＴ（ＥｍｂｅｄｄｅｄＣｏｄｉｎｇｗｉｔｈＯｐｔｉｍｉｚｅｄＴｒｕｎｃａｔｉｏｎ）と呼ばれるエントロピー符号化処理が施される。ＥＢＣＯＴは、所定の大きさのブロック毎にそのブロック内の係数の統計量を測定しながら符号化を行う手段である。 Subsequently, bit-plane coding is performed on the quantized coefficients. In the encoding standard in JPEG2000, entropy encoding processing called EBCOT (Embedded Coding with Optimized Truncation) is performed. EBCOT is a means for performing coding while measuring the statistical amount of coefficients in each block of a predetermined size.

量子化係数をコードブロック（ｃｏｄｅ−ｂｌｏｃｋ）と呼ばれる所定のサイズのブロック単位に、エントロピー符号化する。コードブロックは、量子化係数のＭＳＢからＬＳＢ方向にビットプレーン毎に独立して符号化する。 The quantized coefficients are entropy-coded into a block unit of a predetermined size called a code block (code-block). The code block is encoded independently for each bit plane from the MSB to the LSB of the quantization coefficient.

コードブロック（ｃｏｄｅ−ｂｌｏｃｋ）は、タイル内の元画像領域の位置との対応関係を持っている量子化係数をブロック単位に集めたものであり、タイル内の元画像領域の特定の位置に対応する量子化係数の集まりになっている。コードブロックの縦横のサイズは、４から２５６までの２のべき乗で、通常使用される大きさは、３２×３２、６４×６４、１２８×１２８等がある。コードブロックの符号化は、ＭＳＢ側のビットプレーンから順番に、（i）ＳｉｇｎｉｆｉｃａｎｃｅＰａｓｓ、（ii）ＲｅｆｉｎｅｍｅｎｔＰａｓｓ、（iii）ＣｌｅａｎｕｐＰａｓｓの３種類の符号化パスによって行われる。即ち、ビットプレーンのＭＳＢ側からＬＳＢ側に向かい、順次、ＣｌｅａｎｕｐＰａｓｓ、ＳｉｇｎｉｆｉｃａｎｃｅＰａｓｓ、ＲｅｆｉｎｅｍｅｎｔＰａｓｓ、ＣｌｅａｎｕｐＰａｓｓの順序で各ビットプレーンの符号化が行われる。３つの算術符号化は、それぞれ以下のような処理を行う。 A code block (code-block) is a collection of quantized coefficients having a correspondence relationship with the position of the original image area in the tile, and corresponds to a specific position of the original image area in the tile. It is a collection of quantized coefficients. The vertical and horizontal sizes of the code block are powers of 2 from 4 to 256, and commonly used sizes include 32 × 32, 64 × 64, 128 × 128, and the like. Code blocks are encoded in three types of encoding passes: (i) Significance Pass, (ii) Refinement Pass, and (iii) Cleanup Pass, in order from the bit plane on the MSB side. That is, each bit plane is encoded in the order of Cleanup Pass, Significance Pass, Refinement Pass, and Cleanup Pass from the MSB side to the LSB side of the bit plane. Each of the three arithmetic encodings performs the following processing.

（i）ＳｉｇｎｉｆｉｃａｎｃｅＰａｓｓ：
あるビットプレーンを符号化するＳｉｇｎｉｆｉｃａｎｃｅＰａｓｓでは、８近傍の少なくとも１つの係数がｓｉｇｎｉｆｉｃａｎｔであるようなｎｏｎ−ｓｉｇｎｉｆｉｃａｎｔ係数のビットプレーンの値を算術符号化する。その符号化したビットプレーンの値が１である場合は、符号が＋であるか、−であるかを続けて算術符号化する。 (I) Significance Pass:
In the Significance Pass that encodes a certain bit plane, the bit plane value of a non-significant coefficient in which at least one coefficient in the vicinity of 8 is a significant is arithmetically encoded. When the value of the encoded bit plane is 1, arithmetic coding is continued by determining whether the code is + or-.

（ii）ＲｅｆｉｎｅｍｅｎｔＰａｓｓ：
ビットプレーンを符号化するＲｅｆｉｎｅｍｅｎｔＰａｓｓでは、ビットプレーンを符号化するＳｉｇｎｉｆｉｃａｎｃｅＰａｓｓで符号化していないｓｉｇｎｉｆｉｃａｎｔな係数のビットプレーンの値を算術符号化する。 (Ii) Refinement Pass:
In the Refinement Pass that encodes the bit plane, the value of the bit plane of the significant coefficient that is not encoded by the Significance Pass that encodes the bit plane is arithmetically encoded.

（iii）ＣｌｅａｎｕｐＰａｓｓ：
ビットプレーンを符号化するＣｌｅａｎｕｐＰａｓｓでは、ビットプレーンを符号化するＳｉｇｎｉｆｉｃａｎｃｅＰａｓｓで符号化していないｎｏｎ−ｓｉｇｎｉｆｉｃａｎｔな係数のビットプレーンの値を算術符号化する。その符号化したビットプレーンの値が１である場合は符号が＋であるか−であるかを続けて算術符号化する。 (Iii) Cleanup Pass:
In the Cleanup Pass that encodes the bit plane, the bit plane value of the non-significant coefficient that is not encoded by the Significance Pass that encodes the bit plane is arithmetically encoded. When the value of the encoded bit plane is 1, whether the code is + or-is continuously arithmetic encoded.

ビットプレーン符号化は、タイル内の元画像領域の特定の位置に対応したブロック毎（コードブロック単位）に独立して符号化を行い、且つ、算術符号化の統計量測定を当該符号化ブロック（上述した例では、コードブロック）内に閉じて処理が行われている。 Bit-plane coding is performed independently for each block (code block unit) corresponding to a specific position of the original image area in the tile, and the arithmetic coding statistic measurement is performed on the coded block ( In the above-described example, the processing is performed while being closed in the code block).

このように、各符号化パスで算術符号が生成された後、必要に応じて算術符号の符号量をカウントしながら、目標のビットレートまたは圧縮率に近づけるように、符号量制御が行われる。この符号量の制御は、コードブロック毎の算術符号データ（各符号化パスで生成された算術符号データ）の一部またはすべてを切り捨てる（トランケートする）ことで実現できる。 As described above, after the arithmetic code is generated in each coding pass, the code amount control is performed so as to approach the target bit rate or compression rate while counting the code amount of the arithmetic code as necessary. This control of the code amount can be realized by truncating (truncating) a part or all of the arithmetic code data (the arithmetic code data generated in each encoding pass) for each code block.

本発明の符号量制御の方式は、後で詳しい説明を加えるが、ビットプレーン符号化部で符号破棄前の符号を作り、符号量の調整を行いながら、（ポスト量子化部で）トランケーションを行い、パケットの並びを順次生成し、ＪＰＥＧ２０００の符号フォーマットに符号を形成するものである。 The code amount control method of the present invention will be described in detail later, but the bit plane encoding unit creates a code before code discard and performs truncation (by the post quantization unit) while adjusting the code amount. The packet sequence is sequentially generated to form a code in the JPEG2000 code format.

符号量制御完了後、コードブロック単位の算術符号データを編集して符号データを生成する。つまり、量子化後のサブバンドの係数は、コードブロック単位でビットプレーン符号化される（１つのビットプレーンは、３つのサブビットプレーンに分解されて符号化される場合もある）。 After the code amount control is completed, the code data is generated by editing the arithmetic code data in units of code blocks. That is, the subband coefficients after quantization are bit-plane encoded in units of code blocks (one bit plane may be decomposed into three sub-bit planes and encoded).

例えば、コードブロック内の算術符号データに対して、付加情報をヘッダとして生成する。通常、符号データを構成するヘッダ情報が生成され、一方で、コードブロック単位の算術符号データを編集してパケットを生成し、ヘッダ情報に、対応するパケットデータを付加して符号データを生成する。 For example, additional information is generated as a header for arithmetic code data in a code block. Normally, header information constituting code data is generated, while on the other hand, arithmetic code data in units of code blocks is edited to generate a packet, and corresponding packet data is added to the header information to generate code data.

パケットヘッダには、通常、プリシンクトに含まれるコードブロックを特定する情報と共に、符号化されなかった０ビットプレーン数、符号化されたコーディングパス数あるいはゼロレングスパケット（全てのコードブロック情報が０である）であることについての情報をもっている。 The packet header usually includes information specifying the code block included in the precinct, the number of 0 bit planes not encoded, the number of encoded coding passes, or zero length packets (all code block information is 0). ) Have information about

ＪＰＥＧ２０００仕様準拠の符号データでは、パケットは、プリシンクトと呼ばれるタイル内の矩形領域（位置）に対応するコードブロックを集めて構成されている。即ち、プリシンクトに含まれる全てのコードブロックから、符号の一部を取り出して集めたもの（例えば、全てのコードブロックのＭＳＢから３枚目までのビットプレーンの符号を集めたもの）がパケットである。ただし、パケットの中身が符号的には“空（から）”ということもある。 In code data compliant with the JPEG2000 specification, a packet is configured by collecting code blocks corresponding to a rectangular area (position) in a tile called a precinct. That is, a packet is obtained by collecting a part of the code from all the code blocks included in the precinct (for example, collecting the codes of the bit planes from the MSB of all the code blocks to the third one). . However, the content of the packet may be “empty” in terms of code.

全てのプリシンクト（＝全てのコードブロック＝全てのサブバンド）のパケットを集めると、画像全域の符号の一部（例えば、画像全域のウェーブレット係数の、ＭＳＢから３枚目までのビットプレーンの符号）ができるが、これをレイヤとよぶ。レイヤは、画像全体のビットプレーンの符号の一部であり、すべてのレイヤを集めると画像全域の全てのビットプレーンの符号になる。 When packets of all precincts (= all code blocks = all subbands) are collected, a part of the code of the entire image (for example, the code of the bit plane from the MSB to the third frame of the wavelet coefficients of the entire image) This is called a layer. The layer is a part of the code of the bit plane of the entire image, and when all the layers are collected, the code of all the bit planes of the entire image is obtained.

図４は、ウェーブレット変換の階層数（デコンポジションレベル）＝２、プリシンクトサイズ＝サブバンドサイズとしたときのレイヤ、図５はそれに含まれるパケットの例である。これらの場合は、プリシンクトサイズ＝サブバンドサイズであり、図３でいうプリシンクトの大きさと同じ大きさのコードブロックを採用しているため、デコンポジションレベル２のサブバンドは４つのコードブロックに、デコンポジションレベル１のサブバンドは９個のコードブロックに分割されている。パケットは、プリシンクトを単位とするものであるから、プリシンクト＝サブバンドとした場合、ＨＬ〜ＨＨサブバンドをまたいだものとなる。図５中、いくつかのパケットを太線で囲んである。これらのパケットは「コードブロックの符号の一部を取り出して集めたもの」である。 FIG. 4 shows layers when the number of wavelet transform layers (decomposition level) = 2 and precinct size = subband size, and FIG. 5 shows an example of a packet included therein. In these cases, the precinct size is equal to the subband size, and the code block having the same size as the precinct size shown in FIG. 3 is used. Therefore, the subband at the decomposition level 2 is divided into four code blocks. The sub-band of decomposition level 1 is divided into nine code blocks. Since the packet is based on the precinct, when precinct = subband, the packet spans the HL to HH subbands. In FIG. 5, some packets are surrounded by thick lines. These packets are “a collection of code blocks extracted and collected”.

ＪＰＥＧ２０００のパケット構成の例を示したのが、図６〜８である。パケットは、ＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格の符号データの最小単位であり、プリシンクト単位の符号データにより構成される。 Examples of JPEG2000 packet configurations are shown in FIGS. The packet is the minimum unit of code data of the JPEG2000 (ISO / IEC 15444-1) standard, and is composed of code data in precinct units.

このように、ＪＰＥＧ２０００の符号化で形成された符号データは、タイル単位に形成され、さらに、タイル単位の符号データはパケット単位で形成されており、パケットは、プリシンクト単位の符号列の集まりにより形成されている。したがって、ＪＰＥＧ２０００の符号化で形成された符号列に対してプリシンクト単位にアクセスする場合は、符号列のヘッダを解析し、指定されたプリシンクトに該当するパケットを抽出することでアクセスできる。 In this way, code data formed by JPEG2000 encoding is formed in tile units, further, code data in tile units is formed in packet units, and packets are formed by a collection of code strings in precinct units. Has been. Therefore, when accessing a code string formed by JPEG2000 encoding in units of precincts, access can be made by analyzing the header of the code string and extracting packets corresponding to the designated precinct.

ところで、ＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格の符号化では、パケットをさらにいくつかの構成単位に集めて符号データ（符号化コートストリーム）を形成するが、パケットの並びをプログレッション順序と呼び、ＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格ではいくつかのプログレッション順序で符号データが構成される。 By the way, in the encoding of the JPEG2000 (ISO / IEC 15444-1) standard, packets are further collected into several structural units to form code data (encoded code stream). The sequence of packets is called a progression order, In the JPEG2000 (ISO / IEC 15444-1) standard, code data is configured in several progression orders.

即ち、ＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格のパケットは、プログレシブ順にシーケンス化され、それぞれ、プリシンクト、解像度レベル、およびコンポーネント（色成分）、画質レベル（レイヤ）によって配列されている。
ここで、コンポーネントについては、これまで詳しく述べなかったが、コンポーネントとは色成分を指し、符号化が画像のタイル単位に色成分毎になされることに対応している。あるタイルあるいはプリシンクトの符号データをアクセスする場合は、対応するタイル領域の全てのコンポーネントで該当する符号データをアクセスする。 In other words, JPEG2000 (ISO / IEC 15444-1) standard packets are sequenced in a progressive order, and are arranged by precinct, resolution level, component (color component), and image quality level (layer), respectively.
Here, although the component has not been described in detail so far, the component refers to a color component, and corresponds to encoding performed for each color component in tile units of the image. When accessing code data of a certain tile or precinct, the corresponding code data is accessed in all the components of the corresponding tile area.

また、解像度レベルとは、サブバンドの階層（デコンポジションレベル）を指し、先に説明したように、サブバンド毎に求められたウェーブレット係数についてコードブロック単位の算術符号データが生成されているので、サブバンドの階層（デコンポジションレベル）毎に符号データは区別することができる。同様に、あるタイルあるいはプリシンクトの符号データをアクセスする場合は、対応するタイル領域の全ての解像度レベルで該当する符号データをアクセスする。 In addition, the resolution level refers to the subband hierarchy (decomposition level), and as described above, arithmetic code data in units of code blocks is generated for the wavelet coefficients obtained for each subband. Code data can be distinguished for each subband hierarchy (decomposition level). Similarly, when accessing code data of a certain tile or precinct, the corresponding code data is accessed at all resolution levels of the corresponding tile area.

また、画質レベル（レイヤ）は、画質レベルに対応して、例えば、３ＬＬのコードブロック単位のデータを最低レベルのレイヤとして構成し、３ＬＬに３ＨＬ乃至３ＨＨを加えたコードブロック単位のデータを次のレイヤとし、さらに、２ＨＬ乃至２ＨＨを加えたコードブロック単位のデータを次のレイヤとし、さらに、１ＨＬ乃至１ＨＨを２ＨＬ乃至２ＨＨを加えたコードブロック単位のデータを最高レイヤとしてレイヤデータを構成する。同様に、あるタイルあるいはプリシンクトの符号データをアクセスする場合は、対応するタイル領域の全てのレイヤレベルで該当する符号データをアクセスする。 Also, the image quality level (layer) corresponds to the image quality level, for example, 3LL code block unit data is configured as the lowest level layer, and 3LL to 3HH is added to the 3LL code block data. Layer data is configured with a layer, data in units of code blocks to which 2HL to 2HH is added as the next layer, and data in units of code blocks to which 1HL to 1HH is added to 2HL to 2HH as the highest layer. Similarly, when accessing code data of a certain tile or precinct, the corresponding code data is accessed at all layer levels of the corresponding tile area.

ＪＰＥＧ２０００規格では、画質（レイヤ（Ｌ））、解像度（Ｒ）、コンポーネント（Ｃ）、位置（プリシンクト（Ｐ））という４つの画像の要素の優先順位を変更することによって、以下に示す５通りのプログレッションが定義されている。 In the JPEG2000 standard, by changing the priority order of the four image elements of image quality (layer (L)), resolution (R), component (C), and position (precinct (P)), the following five patterns are displayed. Progression is defined.

・ＬＲＣＰプログレッション：プリシンクト、コンポーネント、解像度レベル、レイヤの順序に復号されるため、レイヤのインデックスが進む毎に画像全面の画質が改善されることになり、画質のプログレッションが実現できる。レイヤプログレッションとも呼ばれる。
・ＲＬＣＰプログレッション：プリシンクト、コンポーネント、レイヤ、解像度レベルの順序に復号されるため、解像度のプログレッションが実現できる。
・ＲＰＣＬプログレッション：レイヤ、コンポーネント、プリシンクト、解像度レベルの順序に復号されるため、ＲＬＣＰ同様、解像度レベルのプログレッションであるが、特定位置の優先度を高くすることができる。
・ＰＣＲＬプログレッション：レイヤ、解像度レベル、コンポーネント、プリシンクトの順序に復号されるため、特定部分の復号が優先されるようになり空間位置のプログレッションが実現できる。
・ＣＰＲＬプログレッション：レイヤ、解像度レベル、プリシンクト、コンポーネントの順序に復号されるため、例えば、カラー画像のプログレシブ復号の際に最初にグレーの画像を再現するようなコンポーネントのプログレッションが実現できる。 LRCP progression: Since the decoding is performed in the order of precinct, component, resolution level, and layer, the image quality of the entire image is improved each time the layer index is advanced, and the image quality progression can be realized. Also called layer progression.
RLCP progression: Since the decoding is performed in the order of precinct, component, layer, and resolution level, resolution progression can be realized.
RPCL progression: Since decoding is performed in the order of layer, component, precinct, and resolution level, it is a progression of resolution level as in RLCP, but the priority of a specific position can be increased.
PCRL progression: Since decoding is performed in the order of layer, resolution level, component, and precinct, decoding of a specific part is prioritized, and progression of spatial position can be realized.
CPRL Progression: Since decoding is performed in the order of layer, resolution level, precinct, and component, for example, it is possible to realize component progression that first reproduces a gray image when progressively decoding a color image.

このように、ＪＰＥＧ２０００仕様の符号化では、形成する符号データの符号順序（プログレッシブ方式）を選択することができて、そのプログレッシブ方式としては、解像度（ｒｅｓｏｌｕｔｉｏｎｌｅｖｅｌ）、プリシンクト（ｐｒｅｃｉｎｃｔ：ｐｏｓｉｔｉｏｎ）、色成分（ｃｏｍｐｏｎｅｎｔ）及びレイヤ（Ｌａｙｅｒ）の組合せによる５つのプログレッシブ順序を規定している。 As described above, in the coding according to the JPEG2000 specification, the code order (progressive method) of the code data to be formed can be selected. As the progressive method, resolution (resolution level), precinct (position), color Five progressive orders based on combinations of components and layers are defined.

一般に、マルチレイヤプログレッシブは、Ｌａｙｅｒ−ｒｅｓｏｌｕｔｉｏｎｌｅｖｅｌ−ｃｏｍｐｏｎｅｎｔ−ｐｏｓｉｔｉｏｎ（ＬＲＣＰ）のプログレッシブ順序を有し、ビットプレーンを最上位から符号化する方式である。このマルチレイヤプログレッシブで画像を表示すると、ユーザからは、少ない色数から徐々に色数が増えてくるように見える。これを文字や写真等が混在した文書に適用した場合、写真は中間調が見えないのでほぼ完全に復元されないと認識できない。一方、文字はエッジが強いので完全に復元される前に認識できる。 In general, the multi-layer progressive has a progressive order of Layer-resolution level-component-position (LRCP), and encodes a bit plane from the highest level. When an image is displayed in this multi-layer progressive manner, it seems to the user that the number of colors gradually increases from a small number of colors. When this is applied to a document in which characters, photos, etc. are mixed, the photo cannot be recognized unless it is restored almost completely because the halftone is not visible. On the other hand, since characters have strong edges, they can be recognized before they are completely restored.

これに対し、マルチレゾリューションプログレッシブは、ｒｅｓｏｌｕｔｉｏｎｌｅｖｅｌ−Ｌａｙｅｒ−ｃｏｍｐｏｎｅｎｔ−ｐｏｓｉｔｉｏｎ（ＲＬＣＰ）のプログレッシブ順序を有し、ウェーブレット変換の多重解像度性を利用して、小さい解像度から符号化することで、画像を画像のきめ細かさや粗さで圧縮する。復号化において、低い解像度から徐々に解像度が上がるマルチレゾリューションプログレッシブを用いて画像を表示すると、利用者からは、ピントがぼけた状態から徐々にピントが合ってくるように見える。これを文字及び写真等が混在した文書に適用すると、写真は完全に復元される前に大まかな内容を認識することができ、文字はほぼ完全に復元されないと認識できない。 On the other hand, multi-resolution progressive has a progressive level of resolution level-Layer-component-position (RLCP), and uses the multi-resolution property of wavelet transform to encode from a small resolution. Is compressed with the fineness and roughness of the image. In decoding, when an image is displayed using multi-resolution progressive resolution that gradually increases from a low resolution, it appears to the user that the focus is gradually in focus from the out-of-focus state. If this is applied to a document in which characters and photographs are mixed, the photograph can recognize the rough contents before being completely restored, and the characters cannot be recognized unless the characters are almost completely restored.

ＪＰＥＧ２０００仕様の符号化では、このように選択された符号順序（プログレッシブ方式）に基づいて符号列を形成しておいて、符号化後に一部の階層を削除することで符号量制御を行うこともできる。符号化後の符号量制御は、一般に符号データの各階層が複数の符号列により構成されるため、微細な符号量制御ができない。
本発明で説明したように、形成される符号データの符号列単位での重要度が算出されていることによって（符号列（パケット）単位での優先順位が明確になっていることで）、各階層内の符号データにおける符号列単位で非重要な符号列から先に削除することが可能となる。 In encoding according to the JPEG2000 specification, the code amount may be controlled by forming a code string based on the code order (progressive method) selected in this way and deleting a part of the hierarchy after encoding. it can. In code amount control after encoding, since each layer of code data is generally composed of a plurality of code strings, fine code amount control cannot be performed.
As described in the present invention, by calculating the importance of the code data to be formed in units of code strings (because the priority order in units of code strings (packets) is clear), It becomes possible to first delete an unimportant code string in units of code strings in the code data in the hierarchy.

すなわち、本発明の実施形態の一つとして、符号順序（プログレッシブ方式）を選択し階層構造の符号データが形成された後、階層毎に符号列の重要度に基づき優先順位付けし、優先順位の低い符号列から順次削除する符号量制御も含まれる。 That is, as one embodiment of the present invention, after a code order (progressive method) is selected and code data having a hierarchical structure is formed, priorities are assigned to the hierarchies based on the importance of the code string. Code amount control for sequentially deleting from a low code string is also included.

このように形成されたＪＰＥＧ２０００規格の符号データは、タイル単位に、プリシンクト、解像度レベル、画質レベル（レイヤ）、色成分毎に区分された符号化データを生成する。プログレッション順に係わらず、あるタイル単位の符号データをアクセスする場合は、対応するタイル領域の全てのプリシンクト、解像度レベル、画質レベル（レイヤ）、コンポーネントで該当する符号データをアクセスする。 The JPEG2000 standard code data formed in this way generates encoded data that is divided into tile units by precinct, resolution level, image quality level (layer), and color component. Regardless of the order of progression, when accessing code data in a certain tile unit, the corresponding code data is accessed for all precincts, resolution levels, image quality levels (layers), and components of the corresponding tile area.

一方、プリシンクト単位に対応する画像領域の符号データをアクセスする場合は、プリシンクト単位に対応する全ての解像度レベル、画質レベル（レイヤ）、コンポーネントで該当する符号データをアクセスする。 On the other hand, when accessing the code data of the image area corresponding to the precinct unit, the corresponding code data is accessed with all resolution levels, image quality levels (layers), and components corresponding to the precinct unit.

すなわち、ＪＰＥＧ２０００仕様準拠の符号データは、画像全体を矩形領域単位に分割したタイル（画像領域）単位でアクセスすること（タイルに含まれる符号データをアクセスする場合）と、タイルの内部の領域単位であるプリシンクト単位に（対応する画像領域の）符号データをアクセスするという、二種類の符号データにアクセスすることができる。二種類のサイズをもつ画像領域単位に容易にアクセスできるような、アクセス単位の区切り単位を持つ符号データが形成される仕様になっているのである。 In other words, code data complying with the JPEG2000 specification is accessed in units of tiles (image regions) obtained by dividing the entire image into units of rectangular regions (when accessing code data included in tiles), and in units of regions within the tiles. Two types of code data can be accessed, that is, code data (in a corresponding image area) is accessed in a certain precinct unit. The specification is such that code data having a delimiter unit of access units that can easily access image region units having two types of sizes is formed.

したがって、ＪＰＥＧ２０００仕様準拠で生成された符号データのヘッダを解析して、タイル（画像領域）単位あるいはプリシンクト単位に符号データをトランケート（符号列制御）したり編集したりすることができる。 Therefore, it is possible to analyze the header of code data generated in accordance with the JPEG2000 specification, and to truncate (code string control) or edit the code data in tile (image area) units or precinct units.

（１−２）歪量計算手段１： (1-2) Distortion amount calculation means 1:

次に、ＪＰＥＧ２０００における符号化過程における符号量制御について説明する。
ＪＰＥＧ２０００仕様準拠の符号化では、ビジュアル・ウェイティング（ＶＷ：ＶｉｓｕａｌＷｅｉｇｈｔｉｎｇ）法を使用、符号量制御の方式を実現する。
ＶＷ法は、人間の視覚システムを巧みに利用した手法であり、画像の空間周波数に対して人間の視覚システムの変動感受性をモデル化し、これをコントラスト感受性機能（ＣｏｎｔｒａｓｔＳｅｎｓｉｔｉｖｉｔｙＦｕｎｃｔｉｏｎ：ＣＳＦ）として体系付けたものである。 Next, code amount control in the encoding process in JPEG2000 will be described.
In the encoding conforming to the JPEG2000 specification, a visual weighting (VW) method is used to realize a code amount control method.
The VW method is a technique that skillfully uses the human visual system, and models the fluctuation sensitivity of the human visual system with respect to the spatial frequency of the image, and systematizes this as a contrast sensitivity function (CSF). It is a thing.

ＣＳＦの重みは、実際には画像の変換係数の視覚周波数によって決定されるものであり、視覚される波長帯域内で係数に対するビットの割り当てを増やし、視覚されない波長帯域内で係数に対するビットの割り当てを少なくする視覚的重み付けであり、人間の目により多く感知される特徴を強調することで、画像の主観的画質を向上させる。ウェーブレット変換後の個々のサブバンドに対してＣＳＦの重みが別個に用意される。 The CSF weight is actually determined by the visual frequency of the transform coefficient of the image, increasing the bit allocation for the coefficient within the visible wavelength band and increasing the bit allocation for the coefficient within the non-visible wavelength band. It is a visual weighting that reduces, and enhances the subjective perception of the image by enhancing features that are more perceived by the human eye. A CSF weight is separately prepared for each subband after wavelet transform.

なお、このＣＳＦの重みは、符号化側で設計され、復号画像がどのような条件下で視覚されるかに依存している。具体的には、以下に説明するように、サブバンド毎にＶＷ係数値を設定しておいて、設定されたＶＷ係数値に基づいて削減評価対象であるコードブロックの符号化パスの算術符号データの歪み削減量を計算し、歪み削減量が小さい算術符号データから優先的に削減するように符号量制御する。このようにコードブロックを構成する算術符号データの歪み削減量を基にコードブロック単位の重要度を算出することができる。 The CSF weight is designed on the encoding side and depends on the conditions under which the decoded image is viewed. Specifically, as will be described below, a VW coefficient value is set for each subband, and the arithmetic code data of the coding pass of the code block that is a reduction evaluation target based on the set VW coefficient value The amount of distortion reduction is calculated, and the amount of code is controlled so as to preferentially reduce the arithmetic code data having a small amount of distortion reduction. Thus, the importance of each code block can be calculated based on the distortion reduction amount of the arithmetic code data constituting the code block.

以下の説明では、ＣＳＦの重みによって符号データの一部をトランケートすることによる符号量制御する処理について説明するが、ＣＳＦの逆数によって量子化ステップサイズを調整し量子化によって符号量制御する場合もある。 In the following description, a code amount control process by truncating a part of code data by the CSF weight will be described. However, there is a case where the quantization step size is adjusted by the reciprocal number of the CSF and the code amount is controlled by quantization. .

図９に示すように、符号化パス符号量算出部３００及び符号化パス歪み削減量算出部３１０は、算術符号化後の算術符号Ｄ３４０を入力する。
符号化パス符号量算出部３００は、コードブロック内に生成される符号化パスで実際に発生する符号量Ｒを、全てのコードブロックの全ての符号化パスに対して求め、この全てのコードブロックの符号化パスの符号量Ｒを示す情報Ｄ３００をＲＤ特性算出部３２０に供給する。 As illustrated in FIG. 9, the coding pass code amount calculation unit 300 and the coding pass distortion reduction amount calculation unit 310 receive the arithmetic code D340 after arithmetic coding.
The coding pass code amount calculation unit 300 obtains the code amount R actually generated in the coding pass generated in the code block for all the coding passes of all the code blocks, and all the code blocks The information D300 indicating the code amount R of the encoding pass is supplied to the RD characteristic calculator 320.

符号化パス歪み削減量算出部３１０は、その符号化パスを含めた場合にどの程度歪みが低減されるかを示す歪み削減量Ｄを算出する。歪量は、対象の符号化パスをトランケーションした状態でデコードを行った画像データの原画像データに対する誤差である。誤差は、ＭＳＥ（ＭｅａｎＳｑｕａｒｅｄＥｒｒｏｒ）を使用して計算する。このように、ＪＰＥＧ２０００では、各ビットプレーンまでトランケーションした場合の歪量をそれぞれ求めるために、各ビットプレーンまでトランケーションした状態でそれぞれデコードを行ってＭＳＥで誤差を調べる。 The encoding pass distortion reduction amount calculation unit 310 calculates a distortion reduction amount D indicating how much distortion is reduced when the encoding pass is included. The distortion amount is an error with respect to the original image data of the image data decoded in a state where the target coding pass is truncated. The error is calculated using MSE (Mean Squared Error). As described above, in JPEG2000, in order to obtain the amount of distortion when truncating to each bit plane, decoding is performed in a state where each bit plane is truncated, and an error is examined by MSE.

例えば、最初に各コードブロックにおいてビットプレーンを下位側から１つトランケーションした場合の歪量を求め、次に、ビットプレーンを下位側から２つトランケーションした場合の歪量を求め、同様にして、すべてのビットプレーンをトランケーションしたときの歪量を求める。このため、歪量を求めるための処理時間が非常に長くなるという問題があり、後述するような簡易な方式で歪量を求めてもかまわない。 For example, first determine the amount of distortion when truncating one bit plane from the lower side in each code block, and then determine the amount of distortion when truncating two bit planes from the lower side. The amount of distortion when truncating the bit plane is obtained. For this reason, there is a problem that the processing time for obtaining the distortion amount becomes very long, and the distortion amount may be obtained by a simple method as described later.

この時、ＪＰＥＧ２０００仕様準拠の符号化処理では、歪み削減量ＤはＶＷ係数値Ｄ３３０を加味して算出する。すなわち、サブバンド毎に設定してあるＶＷ係数値Ｄ３３０を、各符号化パスの歪み削減量に乗算した結果を、歪み削減量Ｄとする。これは、サブバンド毎の優先的な重み付けをしたことと等価であり、符号量制御の際に低域のサブバンドに存在する符号化パスほど切り捨てられにくくなり、逆に高域のサブバンドに存在する符号化パスほど切り捨てられやすくなる。全てのコードブロックの全ての符号化パスに対して削減量Ｄを求め、この全てのコードブロックの符号化パスの歪み削減量Ｄを示す情報Ｄ３１０をＲＤ特性算出部３２０に供給する。 At this time, in the encoding process compliant with the JPEG2000 specification, the distortion reduction amount D is calculated in consideration of the VW coefficient value D330. In other words, the distortion reduction amount D is obtained by multiplying the distortion reduction amount of each coding pass by the VW coefficient value D330 set for each subband. This is equivalent to preferential weighting for each subband, and the coding pass existing in the low-frequency subband is less likely to be discarded during code amount control, and conversely the high-frequency subband. Existing encoding passes are more likely to be truncated. The reduction amount D is obtained for all the coding passes of all the code blocks, and information D310 indicating the distortion reduction amount D of the coding pass of all the code blocks is supplied to the RD characteristic calculation unit 320.

ＲＤ特性算出部３２０は、各符号化パスのＲＤ特性、通常はＲＤ曲線の傾きを算出する。そして、ＲＤ特性算出部３２０は、この結果得られたＲＤ特性値及び各符号化パスの情報Ｄ３２０を全符号化パス分記憶保持しておき、また、このＲＤ特性値及び各符号化パスの情報Ｄ３２０を符号化パス選択部３３０に供給する。 The RD characteristic calculation unit 320 calculates the RD characteristic of each coding pass, usually the slope of the RD curve. Then, the RD characteristic calculation unit 320 stores and holds the RD characteristic value and the information D320 of each coding pass obtained as a result for all the coding passes, and the RD characteristic value and the information of each coding pass. D320 is supplied to the encoding pass selection unit 330.

符号化パス選択部３３０は、ＲＤ特性算出部３２０から供給されたＲＤ特性値及び各符号化パスの情報Ｄ３２０に基づいて、画像全体の目標符号量に最も近くなるように、各符号化パスを含めるか切り捨てるかの選択を行う。この際、符号化パス選択部３３０は、図１０に示すようなＲＤ特性を利用する。図１０において、黒丸のポイントは切り捨ての候補となる符号化パスであり、白丸のポイントは切り捨ての対象外となる符号化パスであることを示している。
したがって、符号化パス選択部３３０は、目標符号量に最も近くなるように、これらの白丸の符号化パスを選択する動作を行う。
以上の動作によって、最終的に選択された符号化パスの情報である算術符号Ｄ３５０が符号形成対象となる。 Based on the RD characteristic value supplied from the RD characteristic calculation unit 320 and the information D320 of each coding pass, the coding pass selection unit 330 sets each coding pass so as to be closest to the target code amount of the entire image. Select whether to include or truncate. At this time, the encoding pass selection unit 330 uses RD characteristics as shown in FIG. In FIG. 10, black circle points indicate encoding passes that are candidates for truncation, and white circle points indicate encoding passes that are not subject to truncation.
Therefore, the coding pass selection unit 330 performs an operation of selecting these white circle coding passes so as to be closest to the target code amount.
Through the above operation, the arithmetic code D350, which is information of the finally selected coding pass, becomes a code formation target.

また、目標符号量に達した際に、又は達する直前に符号化パスを切り捨てる、又はそれ以上の符号化パスを選択せず打ち切ることで、符号量制御を行うこともできる。 In addition, when the target code amount is reached, or just before reaching the target code amount, the coding pass can be discarded, or the code amount can be controlled by terminating the coding pass without selecting any more.

ＪＰＥＧ２０００仕様準拠の符号化では、サブバンド毎に設定されたＶＷ係数値Ｄ３３０を使用して、ＶＷ法による符号量制御を実行している。ＶＷ係数値Ｄ３３０の実際の値としては、例えば図１１のように各分割レベルのサブバンド毎に決めておくこともできる。図１１では、輝度（Ｙ）と色差（Ｃｂ、Ｃｒ）とで別々に係数値が決められており、分割レベルが大きいほど、すなわち、より低域成分ほど、係数値も大きくなっていることが分かる。ここで、係数値が大きいことは、符号量制御の際に符号化パスの切り捨てが行われにくいことを意味する。 In the encoding conforming to the JPEG2000 specification, the code amount control by the VW method is executed using the VW coefficient value D330 set for each subband. The actual value of the VW coefficient value D330 can be determined for each subband of each division level as shown in FIG. 11, for example. In FIG. 11, coefficient values are determined separately for luminance (Y) and color differences (Cb, Cr), and the coefficient value increases as the division level increases, that is, the lower frequency component. I understand. Here, a large coefficient value means that the encoding pass is not easily truncated during code amount control.

なお、図１１において、Ｙ、Ｃｂ、Ｃｒの全て低域なほど（分割レベルが大きい程）係数値が大きくなっているのは、画像のエネルギーが低域に集中していることを利用しているためである。ここで、入力画像の特徴を利用して、ある特定のサブバンドの係数値を大きい値又は小さい値に設定するようにしてもよい。例えば、入力画像がインタレース画像であった場合、ＬＨ成分のエネルギーが大きくなる傾向にあるため、図１２に示すようにＬＨ成分に相当する係数の重み値を大きくすることが有効である。これにより、インタレース成分の画像が保持され、高画質化に繋がる効果がある。 In FIG. 11, the coefficient value increases as all of Y, Cb, and Cr are lower (the higher the division level), the fact that the energy of the image is concentrated in the lower frequency. Because it is. Here, the coefficient value of a specific subband may be set to a large value or a small value using the characteristics of the input image. For example, when the input image is an interlaced image, the energy of the LH component tends to increase. Therefore, it is effective to increase the weight value of the coefficient corresponding to the LH component as shown in FIG. As a result, the image of the interlace component is retained, and there is an effect that leads to high image quality.

なお、図１１、図１２共に、ＹよりもＣｂ又はＣｒの重み係数値の方が小さい値に設定されている。これは、輝度成分よりも色差成分に対する人間の視覚特性が鋭敏でない特徴を生かしたためである。 11 and 12, the weight coefficient value of Cb or Cr is set to a smaller value than Y. This is because the human visual characteristic with respect to the color difference component is utilized rather than the luminance component.

符号量制御は、上記に示したＪＰＥＧ２０００仕様によるものに限らず、特開２００３−３０４４０５号公報のように、画像情報の各ビットプレーンにおける最上位有効ビットの個数に基づいてビットプレーンをトランケーションしたときの画像の歪量を推定しトランケーションする方式でも実現できる。 Code amount control is not limited to the JPEG2000 specification shown above, but when a bit plane is truncated based on the number of most significant bits in each bit plane of image information, as in Japanese Patent Laid-Open No. 2003-304405. This method can also be realized by estimating the amount of image distortion and truncating the image.

本発明の符号データの符号列単位での重要度設定では、画像属性ごとの基準を使用する。上述したように、画像領域情報はコードブロック毎に識別可能になっているので、歪み削減量Ｄを計算する場合に、対象とするコードブロックの像領域情報を考慮して設定する。ＶＷ係数値がサブバンドごとに設定されているのに対して、画像領域毎の画像属性情報はコードブロック単位に設定されている。
この場合、サブバンド毎に設定してあるＶＷ係数値Ｄ３３０と、コードブロック毎の画像領域毎の画像属性毎に設定された値を、各符号化パスの歪み削減量に乗算した結果を、歪み削減量Ｄとする。 In the importance setting in the code string unit of the code data of the present invention, a standard for each image attribute is used. As described above, since the image area information can be identified for each code block, when calculating the distortion reduction amount D, the image area information is set in consideration of the image area information of the target code block. Whereas the VW coefficient value is set for each subband, the image attribute information for each image area is set for each code block.
In this case, the distortion reduction amount of each coding pass is multiplied by the VW coefficient value D330 set for each subband and the value set for each image attribute for each image area for each code block. A reduction amount D is assumed.

以上のことから、本発明は、符号データ生成過程が、入力画像に対して低域フィルタ及び高域フィルタを垂直方向及び水平方向に施してサブバンドを生成するサブバンド生成手段と、低域成分のサブバンドに対して階層的にフィルタリング処理を施すフィルタリング手段と、フィルタリング手段によって生成されたサブバンドを分割し、所定の大きさのコードブロックを生成するコードブロック生成手段と、前記コードブロック単位に最上位ビットから最下位ビットに至るビットプレーンを生成するビットプレーン生成手段と、前記ビットプレーン毎に符号化パスを生成する符号化パス生成手段と、上記符号化パス内で算術符号化を行う算術符号化手段と、算術符号の歪量を算出する算術符号歪量算出手段とを備え、前記算出された歪量を本発明における符号データの復号再生画像の歪量とするようにした。 From the above, the present invention provides a subband generating means for generating a subband by applying a low-pass filter and a high-pass filter to an input image in a vertical direction and a horizontal direction, and a low-frequency component. Filtering means for hierarchically filtering the subbands, code block generation means for dividing the subband generated by the filtering means to generate a code block of a predetermined size, and for each code block Bit plane generating means for generating a bit plane from the most significant bit to the least significant bit, coding path generating means for generating a coding pass for each bit plane, and arithmetic for performing arithmetic coding in the coding pass Encoding means and arithmetic code distortion amount calculating means for calculating the distortion amount of the arithmetic code, and calculating the calculated distortion amount And so that the strain amount of the decoded reproduced image of the code data in the bright.

（１−３）歪量計算手段２ (1-3) Strain amount calculation means 2

上述したように、ＪＰＥＧ２０００では、量子化されたウェーブレット係数データをビットプレーンに分割して符号化するので、ビットプレーンの切り捨てによる画像情報の圧縮（符号量制御）が可能である。例えば、ビットプレーンを下位側から切り捨てていく（トランケーション）ことによる画像情報の圧縮（符号量制御）が行われている。符号量制御では、ある圧縮率が目標値として存在する場合、目標値になるまでデータを切り捨てていくことになるが、当然データを切り捨てていくと画質が劣化していくことになる。そのため、データを切り捨てる場合にどれだけデータを切り捨てるとどれだけ画質が劣化するかを検出する必要があるが、その劣化の度合いが歪量である。 As described above, in JPEG 2000, quantized wavelet coefficient data is divided into bit planes and encoded, so that image information can be compressed (code amount control) by truncating the bit planes. For example, image information compression (code amount control) is performed by truncating the bit plane from the lower side (truncation). In the code amount control, when a certain compression rate exists as a target value, data is truncated until the target value is reached. Naturally, when the data is truncated, the image quality deteriorates. For this reason, when data is cut off, it is necessary to detect how much data is cut off and how much the image quality deteriorates. The degree of the deterioration is the distortion amount.

この歪量の検出は、ＪＰＥＧ２０００仕様によるものに限らず、特開２００３−３０４４０５号公報のように、画像情報の各ビットプレーンにおける最上位有効ビットの個数に基づいてビットプレーンをトランケーションしたときの画像の歪量を推定しトランケーションする方式でも実現できる。同様な考え方により、前記各符号化パスの歪み削減量は、各符号化パスの上位の有効ビット（値）に基づいて計算してもかまわない。 The detection of the distortion amount is not limited to the one based on the JPEG2000 specification, and an image when truncating a bit plane based on the number of most significant bits in each bit plane of image information as disclosed in Japanese Patent Laid-Open No. 2003-304405. This method can also be realized by estimating the amount of distortion and truncating. Based on the same idea, the distortion reduction amount of each coding pass may be calculated based on the upper significant bits (values) of each coding pass.

すなわち、歪量の検出方法として、ＪＰＥＧ２０００のように、画像情報のビットプレーンのトランケーションにより画像情報を圧縮する画像処理方式においては、ビットプレーン（あるいは、レイヤ）をトランケーションしたときの画像の歪量を、トランケーションを行ったときのＭＳＢ（データの最上位有効ビット）の個数の変化量とすることもできる。 That is, as a method for detecting distortion, in an image processing method for compressing image information by truncation of a bit plane of image information, such as JPEG 2000, the distortion amount of an image when a bit plane (or layer) is truncated is used. , The amount of change in the number of MSBs (most significant bits of data) when truncation is performed.

これは、ウェーブレット係数のＭＳＢ成分が多く分布している、ビットプレーン（レイヤ）をトランケーションすると、そのビットプレーン（レイヤ）に関するコードブロックのデータの分布状況が大きく変わってしまうことになるように、ビットプレーン（レイヤ）のトランケーションを行ったときのＭＳＢの個数の変化と歪量の変化との間には相関があると考えられるからである。 This means that when truncating a bit plane (layer) in which many MSB components of wavelet coefficients are distributed, the distribution status of code block data relating to that bit plane (layer) will change significantly. This is because it is considered that there is a correlation between the change in the number of MSBs and the change in the distortion amount when truncation of a plane (layer) is performed.

以上のことから、本発明では、歪量の検出が、符号データ生成過程で、画像領域毎にビットプレーン単位のトランケーションにより符号量を削減する符号化が行われる。ここで、歪量は、前記画像領域毎の各ビットプレーンにおける最上位有効ビットの個数である。 From the above, according to the present invention, the amount of distortion is detected by encoding in which the amount of code is reduced by truncation in units of bit planes for each image area in the process of generating code data. Here, the distortion amount is the number of most significant bits in each bit plane for each image area.

（２）ＪＰＥＧ２０００の符号列単位の重要度設定 (2) JPEG2000 code string unit importance setting

ＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格の符号処理では、符号化過程で、周波数特性に対応したサブバンド単位に符号データの形成処理がなされるが、その後符号データは符号化処理の過程で複雑に処理されて形成されていくため、符号化処理後においては、サブバンド単位で符号データを区別することができなくなっており、かかる単位の視覚特性を考慮した符号列単位で重要度の設定ができない。 In the coding process of the JPEG2000 (ISO / IEC 15444-1) standard, the code data is formed in units of subbands corresponding to the frequency characteristics in the coding process. Thereafter, the code data is complicated in the process of the coding process. Therefore, after the encoding process, the code data cannot be distinguished in units of subbands, and the importance level is set in units of code strings considering the visual characteristics of such units. Can not.

そこで、本発明の構成では、符号化後に、符号データを構成する符号列単位で重要度を設定する方法としては、符号化処理時に前述したようなコードブロック単位に歪量を算出しておいて、符号化後に、係る情報を使用して前記歪量に基づいて符号形成における符号列単位毎の重要度を算出する。ＪＰＥＧ２０００の符号化においては、符号形成における符号列（パケット）は、コードブロック単位の集合によって形成されていることから、符号形成における符号列（パケット）の重要度は、該符号列を形成するコードブロック単位の重み付けの和として算出する。 Therefore, in the configuration of the present invention, as a method of setting the importance in units of code strings constituting the code data after encoding, the distortion amount is calculated in units of code blocks as described above during the encoding process. Then, after encoding, importance is calculated for each code string unit in code formation based on the distortion amount using the information. In JPEG2000 encoding, since a code string (packet) in code formation is formed by a set of code block units, the importance of the code string (packet) in code formation is the code forming the code string. Calculated as the sum of weights in block units.

具体的には、符号化処理の過程で算出された、コードブロックの符号化パスの算術符号データについての歪み削減量Ｄとその符号量Ｒの値を保存しておいて符号化後の処理で使用する。この時、歪み削減量の計算は、ＪＰＥＧ２０００（ＩＳＯ／ＩＥＣ１５４４４−１）規格のＶＷ係数値Ｄ３３０を考慮した計算値でもよいし、さらに画像領域毎の画像属性を考慮した値でもよい。 Specifically, in the process after encoding, the distortion reduction amount D and the code amount R value for the arithmetic code data of the encoding pass of the code block calculated in the encoding process are stored. use. At this time, the calculation of the distortion reduction amount may be a calculated value considering the VPEG coefficient value D330 of the JPEG2000 (ISO / IEC 154444-1) standard, or may be a value considering the image attribute for each image region.

図１３を用いて、本発明における符号列単位での重要度の設定について説明する。本実施形態では、コードブロック単位で歪量を算出する手段を備え、符号化後の符号データ編集時あるいは復号時に符号列（パケット）単位で重要度を設定する機能を備えているところが特徴である。 The setting of the importance level in units of code strings in the present invention will be described with reference to FIG. The present embodiment is characterized in that it is provided with means for calculating a distortion amount in units of code blocks, and has a function of setting importance in units of code strings (packets) at the time of editing or decoding of encoded data after encoding. .

符号化処理部４１０には、符号化の過程で一時的に生成されるサブバンド毎のコードブロック単位における算術符号算出部４１１を備え、サブバンド毎コードブロック単位の算術符号保存部４１２に前記算術符号が保存されている。
コードブロック単位の算術符号の歪量の算出は、コードブロック単位歪量算出部４０３にて施される。 The encoding processing unit 410 includes an arithmetic code calculation unit 411 in units of code blocks for each subband that is temporarily generated in the course of encoding, and the arithmetic code storage unit 412 in units of code blocks for each subband includes the arithmetic code storage unit 412. The code is stored.
The code block unit distortion amount calculation unit 403 calculates the distortion amount of the arithmetic code in code block units.

符号列単位重要度算出部４００は、コードブロック対応画像領域単位の画像属性情報保存部４０１、ＶＷ係数保存部４０２に保存されたデータを使用して、サブバンド毎コードブロック単位の前記算術符号の歪量（その算術符号データが削減された場合の画質の劣化度合い）を算出するコードブロック単位歪量算出部４０３を備え、符号列（パケット）単位歪量算出部４０４にて、符号列（パケット）単位の歪量を算出し、符号列（パケット）単位の歪量を使用して、符号列単位重要度算出部４０５にて、符号列（パケット）単位の算術符号の重要度を計算する。 The code string unit importance calculation unit 400 uses the data stored in the image attribute information storage unit 401 and the VW coefficient storage unit 402 in the code block corresponding image region unit, and the arithmetic code of the code block unit in each subband. A code block unit distortion amount calculation unit 403 that calculates a distortion amount (degradation degree of image quality when the arithmetic code data is reduced) includes a code string (packet) unit distortion amount calculation unit 404. ) The distortion amount of the unit is calculated, and the importance of the arithmetic code of the code string (packet) unit is calculated by the code string unit importance calculation unit 405 using the distortion amount of the code string (packet) unit.

なお、図１３の構成では、符号量算出部４０６が符号列単位重要度算出部４００に具備した構成になっているが、符号化処理部４１０に具備していてもかまわない。
図１３で示すように、本発明では、コードブロック単位の歪量を使用して、符号列単位で重要度を設定する。
ＪＰＥＧ２０００符号化における符号列（パケット）単位で重要度を設定するには、次のような手順で処理する。 In the configuration of FIG. 13, the code amount calculation unit 406 is included in the code string unit importance calculation unit 400, but may be included in the encoding processing unit 410.
As shown in FIG. 13, in the present invention, the degree of importance is set in units of code strings using the distortion amount in units of code blocks.
In order to set the importance level in units of code strings (packets) in JPEG2000 encoding, the following procedure is used.

(i)画像データを入力し、画像データをウェーブレット変換しウェーブレット係数をサブバンド単位で算出する。
(ii)各サブバンドのウェーブレット係数単位で以下の処理(iii)〜(vi)を繰り返す。
(iii)コードブロック単位で算術符号データを算出し、以下の処理(iv)〜(v)を繰り返す。
(iv)コードブロックに含まれる各算術符号データに対して歪量を算出し、該歪量を合計し符号コードブロックの歪量を算出する。
(v)全算術符号データの処理が終了したら次のコードブロック単位を処理する。
(vi)全コードブロック単位の処理が終了したら次のサブバンド単位を処理する。
(vii)全サブバンド単位の処理が終了したらコードブロック単位の符号データを組み合わせて符号列を構成し符号データ（パケット単位）を形成する。
(viii)符号列を構成するコードブロック単位の歪量を合算し、プリシンクト単位の歪量を算出する。
(ix)算出されたプリシンクト単位の歪量を、符号列（パケット）単位の歪量とする。
(x)算出された符号列単位の歪量を歪量基準値と比較することにより、符号列（パケット）単位の重要度を算出する。 (i) Input image data, wavelet transform the image data, and calculate wavelet coefficients in subband units.
(ii) The following processes (iii) to (vi) are repeated for each subband wavelet coefficient unit.
(iii) Arithmetic code data is calculated for each code block, and the following processes (iv) to (v) are repeated.
(iv) A distortion amount is calculated for each piece of arithmetic code data included in the code block, and the distortion amounts of the code code block are calculated by adding the distortion amounts.
(v) When the processing of all arithmetic code data is completed, the next code block unit is processed.
(vi) When the processing for all code blocks is completed, the next subband unit is processed.
(vii) When the processing in units of all subbands is completed, code data in code block units are combined to form a code string to form code data (packet units).
(viii) The amount of distortion in units of code blocks constituting the code string is added up to calculate the amount of distortion in units of precinct.
(ix) The calculated distortion amount in units of precinct is used as the distortion amount in units of code strings (packets).
(x) The degree of importance for each code string (packet) is calculated by comparing the calculated distortion amount for each code string with a distortion amount reference value.

すなわち、ＪＰＥＧ２０００の符号データ形成過程においては、符号データをプリシンクト単位で構成されるパケットを生成する場合に、プリシンクトを構成するコードブロック単位の算術符号データの歪量を集計しプリシンクト単位の歪量を計算するようにした。 That is, in the code data formation process of JPEG2000, when generating a packet composed of code data in units of precinct, the amount of distortion of arithmetic code data in units of code blocks constituting the precinct is totaled to calculate the amount of distortion in units of precinct. Calculated.

いずれにしても、符号データ形成時にプリシンクト単位の歪量情報または符号列（パケット）単位の歪量情報または重要度情報を記録として残すことによって、符号列形成後の符号データ編集処理（パーサー）においてだけでなく、後段の復号処理あるいは復号後の画像処理あるいは、復号後における符号データのアクセスにおいても、これらの情報を活用することができる。
ここで、歪量情報あるいは重要度情報を保持する方法としては、符号データのヘッダに記載しておいてもかまわないし、テーブルとして別途保持しておくようにしてもよい。 In any case, in the code data editing process (parser) after forming the code string, the distortion amount information in the precinct unit or the distortion amount information or importance information in the code string (packet) unit is left as a record when the code data is formed. In addition, the information can be used not only for the subsequent decoding process, the image process after decoding, or the access of the code data after decoding.
Here, as a method of holding the distortion amount information or the importance level information, it may be described in the header of the code data, or may be held separately as a table.

また、本発明の実施形態における画像処理装置においては、図２に示すように、指定領域内画像属性情報と、指定領域内歪量と、画像属性毎の歪量基準と、によって符号データを構成する符号列単位で重要度（優先順位）を算出する。 Further, in the image processing apparatus according to the embodiment of the present invention, as shown in FIG. 2, the code data is constituted by the designated area image attribute information, the designated area distortion amount, and the distortion amount standard for each image attribute. The importance (priority order) is calculated for each code string.

同様に、先に部分画像領域単位に画像属性を定め、画像領域毎に符号量を制御するための重要度を設定することを示したが、係る部分画像領域単位の画像属性についても別のテーブルの形式で保持していてもかまわない。これらの情報は、元画像領域と符号データの対応をもった情報であるので、これらの情報を活用する場合に画像領域毎の符号データ処理も容易に進めることができる。 Similarly, it was previously shown that image attributes are defined in units of partial image areas, and importance is set for controlling the code amount for each image area. However, another table is also provided for image attributes in units of such partial image areas. It may be held in the form of Since these pieces of information have correspondence between the original image area and the code data, the code data processing for each image area can be easily performed when using these pieces of information.

（３）ＪＰＥＧ２０００の画像属性情報を使用した重要度設定 (3) Importance setting using image attribute information of JPEG2000

図１４は、本発明の実施形態に係る画像処理装置における、画像属性に基づく歪量基準を使用した符号列単位の重要度設定を説明するための図である。画像データを構成する部分領域の画像属性毎に符号データの重要度を設定する様子を示している。
図１４に示すように、本発明では、符号化過程で算出された画像領域単位の画像属性情報と画像属性毎の歪量基準情報によって符号列単位の歪量と属性情報を算出し、符号化後に符号列単位の重要度設定情報を算出し、符号データの編集処理を施している。 FIG. 14 is a diagram for explaining importance level setting in units of code strings using a distortion amount criterion based on image attributes in the image processing apparatus according to the embodiment of the present invention. It shows how the importance of the code data is set for each image attribute of the partial area constituting the image data.
As shown in FIG. 14, in the present invention, a distortion amount and attribute information for each code string are calculated based on image attribute information for each image area calculated in the encoding process and distortion amount reference information for each image attribute. Later, importance level setting information for each code string is calculated, and code data editing processing is performed.

符号列単位の歪量（符号列を削除した場合に再現画質に影響する画質の歪量）と属性情報は、前述したように符号化過程で算出される。符号列単位の重要度は、符号列単位の歪量を画像属性毎の歪量基準と比較して、歪量がその画像属性の基準値より少ないと判断される場合には重要度が小さく、そうでない場合は大きいという具合に設定される。
本発明の符号処理における画像属性情報算出は、コードブロックに対応する部分画像領域単位に画像を分割し、コードブロック単位に画像属性を算出して、符号列（パケット）単位に画像属性を算出する。 As described above, the distortion amount in units of code strings (the distortion amount of image quality that affects the reproduction image quality when the code string is deleted) and the attribute information are calculated in the encoding process as described above. The importance of the code string unit is less important when the distortion amount of the code string is compared with the distortion amount standard for each image attribute and the distortion amount is determined to be smaller than the reference value of the image attribute. Otherwise, it is set to be large.
In the image attribute information calculation in the coding process of the present invention, an image is divided into partial image area units corresponding to code blocks, image attributes are calculated in code block units, and image attributes are calculated in code string (packet) units. .

このように、符号データを構成する符号列の重要度は、特に画像属性によって符号データの重要度の設定基準を変更するようにすることで、画像属性に対する歪量に対する感度の違いを反映することができる。 As described above, the importance of the code string constituting the code data reflects the difference in sensitivity to the distortion amount with respect to the image attribute, in particular, by changing the setting criteria for the importance of the code data depending on the image attribute. Can do.

なお、ＪＰＥＧ２０００などの周波数特性を符号化過程で算出する方式においては、上述したような方法で部分画像領域単位の画像属性を定めるのではなく、符号化過程で算出される前記周波数特性を使用してコードブロックに対応する部分画像領域単位に画像属性を算出してもかまわない。 In the method of calculating the frequency characteristics such as JPEG2000 in the encoding process, the frequency characteristics calculated in the encoding process are used instead of determining the image attributes for each partial image area by the method described above. Thus, the image attribute may be calculated for each partial image area corresponding to the code block.

例えば、符号化処理部において、画像データをウェーブレット変換し、ウェーブレット係数をサブバンド単位で算出し、ＬＬ成分データを使用して、コードブロックに対応する部分画像領域単位に画像属性を算出し、符号列（パケット）単位に画像属性を算出するようにしてもよい。
なお、上記において、部分画像領域単位に画像属性を定める場合に、誤って識別されてしまう場合がある。そのような場合に合わせて、重要領域の設定範囲が算出された領域よりも広くなるように設定するようにしてもよい。 For example, the encoding processing unit performs wavelet transform on image data, calculates wavelet coefficients in subband units, uses LL component data to calculate image attributes in units of partial image regions corresponding to code blocks, The image attribute may be calculated for each column (packet).
In the above description, when an image attribute is defined for each partial image area, it may be erroneously identified. In accordance with such a case, the setting range of the important area may be set to be wider than the calculated area.

このような画像の属性情報は、スキャナなどの像域分離技術によって抽出する。この時、前述したように、部分画像領域単位に画像属性を判別し、符号列を構成する符号データとの対応をとる。 Such attribute information of the image is extracted by an image area separation technique such as a scanner. At this time, as described above, the image attribute is discriminated in units of partial image areas, and the correspondence with the code data constituting the code string is taken.

本発明は、上述した実施形態のみに限定されたものではない。上述した実施形態の画像処理装置や画像処理システムを構成する各機能をそれぞれプログラム化して、予め記録媒体に書き込んでおき、この記録媒体に記録されたこれらのプログラムをコンピュータに備えられたメモリあるいは記憶装置に格納し、そのプログラムを実行することによって、本発明の目的が達成されることは言うまでもない。この場合、記録媒体から読み出されたプログラム自体が上述した実施形態の機能を実現することになり、そのプログラムおよびそのプログラムを記録した記録媒体も本発明を構成することになる。
また、上記プログラムは、そのプログラムの指示に基づき、オペレーティングシステムあるいは他のアプリケーションプログラム等と共同して処理することによって上述した実施形態の機能が実現される場合も含まれる。 The present invention is not limited only to the above-described embodiments. Each function constituting the image processing apparatus and the image processing system of the above-described embodiment is programmed and written in a recording medium in advance, and these programs recorded in the recording medium are stored in a memory or storage provided in the computer. Needless to say, the object of the present invention is achieved by storing the program in the apparatus and executing the program. In this case, the program itself read from the recording medium realizes the functions of the above-described embodiment, and the program and the recording medium recording the program also constitute the present invention.
In addition, the program includes a case where the functions of the above-described embodiment are realized by processing in cooperation with an operating system or another application program based on an instruction of the program.

なお、上述した実施形態の機能を実現するプログラムは、ディスク系（例えば、磁気ディスク、光ディスク等）、カード系（例えば、メモリカード、光カード等）、半導体メモリ系（例えば、ＲＯＭ、不揮発性メモリ等）、テープ系（例えば、磁気テープ、カセットテープ等）等のいずれの形態の記録媒体で提供されてもよい。あるいは、ネットワークを介して記憶装置に格納されたプログラムをサーバコンピュータから直接供給を受けるようにしてもよい。この場合、このサーバコンピュータの記憶装置も本発明の記録媒体に含まれる。
このように、上述した実施形態の機能をプログラム化して流通させることによって、コストの低廉化、および可搬性や汎用性を向上させることができる。 Note that the program for realizing the functions of the above-described embodiments includes a disk system (for example, a magnetic disk, an optical disk, etc.), a card system (for example, a memory card, an optical card, etc.), and a semiconductor memory system (for example, a ROM, a nonvolatile memory). Etc.) and a recording medium of any form such as a tape system (for example, magnetic tape, cassette tape, etc.). Alternatively, the program stored in the storage device may be directly supplied from the server computer via the network. In this case, the storage device of this server computer is also included in the recording medium of the present invention.
As described above, by programming and distributing the functions of the above-described embodiment, the cost can be reduced, and the portability and versatility can be improved.

本発明の実施形態に係る画像処理装置における重要度設定の構成を示すブロック図である。It is a block diagram which shows the structure of the importance level setting in the image processing apparatus which concerns on embodiment of this invention. 本発明の実施形態に係る画像処理システムの構成を示すブロック図である。1 is a block diagram illustrating a configuration of an image processing system according to an embodiment of the present invention. 符号化対象である画像データと、符号形成単位となるタイル、サブバンド、プリシンクト、コードブロックの関係を示す図である。It is a figure which shows the relationship between the image data which are encoding objects, and the tile, subband, precinct, and code block which are code formation units. ＪＰＥＧ２０００におけるレイヤの構成例である。It is an example of a layer structure in JPEG2000. ＪＰＥＧ２０００におけるパケットの構成例である。It is a structural example of a packet in JPEG2000. ＪＰＥＧ２０００におけるパケットデータの構成例（その１）である。It is a structural example (the 1) of the packet data in JPEG2000. ＪＰＥＧ２０００におけるパケットデータの構成例（その２）である。It is a structural example (the 2) of the packet data in JPEG2000. ＪＰＥＧ２０００におけるパケットデータの構成例（その３）である。It is a structural example (the 3) of the packet data in JPEG2000. ＪＰＥＧ２０００における符号量制御処理の流れを示す図である。It is a figure which shows the flow of the code amount control processing in JPEG2000. ＪＰＥＧ２０００の符号量制御処理におけるＲＤ特性（歪量と符号量の関係）テーブル例である。It is an example of an RD characteristic (relationship between distortion amount and code amount) table in a code amount control process of JPEG2000. ＪＰＥＧ２０００の符号量制御処理におけるＶＷ法の重み係数テーブル例である。It is an example of the weight coefficient table of the VW method in the code amount control process of JPEG2000. ＪＰＥＧ２０００の符号量制御処理におけるＶＷ法の他の重み係数テーブル例である。It is an example of another weighting coefficient table of the VW method in the code amount control processing of JPEG2000. 本発明の実施形態に係る画像処理装置における重要度設定の他の構成を示すブロック図である。It is a block diagram which shows the other structure of the importance level setting in the image processing apparatus which concerns on embodiment of this invention. 本発明の実施形態に係る画像処理装置における、画像属性に基づく歪量基準を使用した符号列単位の重要度設定を説明するための図である。It is a figure for demonstrating the importance level setting of the code sequence unit using the distortion amount reference | standard based on an image attribute in the image processing apparatus which concerns on embodiment of this invention.

Explanation of symbols

１０…入力処理部、２０…符号化処理部、３０…符号列単位重要度算出部、３１…符号列単位属性情報抽出部、３２…符号列単位歪量抽出部、３３…符号列単位重要度設定部、３４…画像属性毎歪量基準記憶部、３５…画像属性情報記憶部、４０…符号データ編集部、５０…復号処理部、６０…出力処理部、７０…画像データ保存部１、８０…符号データ保存部、９０…画像データ保存部２、１００…サーバ、１１０…画像領域指定部、１２０…領域範囲対応符号データ選択部、１３０…指定領域内画像属性情報抽出部、１３５…領域単位画像属性情報保存部、１４０…指定領域内歪量算出部、１４５…領域単位歪情報保存部、１５０…符号列選択部、１５５…符号データ保存部、１６５…歪量基準保存部、１７０…符号ストリーム生成部、２００…クライアント、３００…符号化パス符号量算出部、３１０…削減量算出部、３２０…ＲＤ特性算出部、３３０…符号化パス選択部、４００…符号列単位重要度算出部、４０１…コードブロック対応画像領域単位の画像属性情報保存部、４０２…ＶＷ係数保存部、４０３…コードブロック単位歪量算出部、４０４…符号列単位歪量算出部、４０５…符号列単位重要度算出部、４０６…符号量算出部、４１０…符号化処理部、４１１…サブバンド毎コードブロック単位算術符号算出部、４１２…サブバンド毎コードブロック単位算術符号保存部、４２０…符号データ編集処理部、４２１…符号量制御処理部、４３０…符号量保存部、４４０…符号列単位重要度保存部。 DESCRIPTION OF SYMBOLS 10 ... Input processing part, 20 ... Coding process part, 30 ... Code sequence unit importance calculation part, 31 ... Code string unit attribute information extraction part, 32 ... Code string unit distortion amount extraction part, 33 ... Code string unit importance Setting unit 34... Image attribute distortion amount reference storage unit 35. Image attribute information storage unit 40. Code data editing unit 50. Decoding processing unit 60. Output processing unit 70 70 Image data storage unit 1 80 ... Code data storage unit, 90 ... Image data storage unit 2, 100 ... Server, 110 ... Image area designation part, 120 ... Area range corresponding code data selection part, 130 ... Image attribute information extraction part in designated area, 135 ... Area unit Image attribute information storage unit, 140... Designated area distortion amount calculation unit, 145... Region unit distortion information storage unit, 150... Code string selection unit, 155 .. code data storage unit, 165. Stream generator, 00 ... client, 300 ... coding path code amount calculation unit, 310 ... reduction amount calculation unit, 320 ... RD characteristic calculation unit, 330 ... coding path selection unit, 400 ... code string unit importance calculation unit, 401 ... code block Corresponding image area unit image attribute information storage unit 402... VW coefficient storage unit 403... Code block unit distortion amount calculation unit 404. Code string unit distortion amount calculation unit 405. Code amount calculation unit, 410 ... encoding processing unit, 411 ... code block unit arithmetic code calculation unit for each subband, 412 ... code block unit arithmetic code storage unit for each subband, 420 ... code data editing processing unit, 421 ... code amount Control processing unit, 430 ... code amount storage unit, 440 ... code string unit importance storage unit.

Claims

A function of setting importance (priority order) in units of code strings (packets) constituting the code data based on the distortion amount of the decoded reproduction image of the code data, the image attribute information, and the distortion amount standard for each image attribute An image processing apparatus.

The code amount corresponding to the image partial area, the amount of distortion of the decoded reproduction image, the image attribute information, and the distortion amount standard for each image attribute, in units of code strings (packets) constituting the code data corresponding to the image partial area An image processing apparatus having a function of setting importance (priority order).

The image processing apparatus according to claim 1, wherein the code processing unit has a function of changing the order of the code strings constituting the code data and reconfiguring the code data according to the importance (priority order) of the code string (packet) unit. An image processing apparatus.

The image processing apparatus according to claim 1, wherein the image processing apparatus has a function of transmitting in order of code strings in units of code strings constituting code data according to the importance (priority order) of the code strings (packets). An image processing apparatus.

5. The image processing apparatus according to claim 4, wherein the code string is cut off in the middle according to the transmission status and transmitted in units of code strings constituting the code data according to the importance (priority order) of the code strings (packets). An image processing apparatus having a function.

6. The image processing device according to claim 4, wherein a part of the code string constituting the code data is replaced with a dummy code string having a short length to constitute code data to be decoded. Image processing device.

4. The image processing apparatus according to claim 3, further comprising means for comparing a distortion amount for each image area with a distortion amount reference, so that the code data corresponding to the area having the specific image attribute is greater than or equal to the distortion amount reference. An image processing apparatus comprising means for reconstructing code data.

The image processing apparatus according to claim 3, further comprising a unit that compares a distortion amount for each image attribute with a distortion amount reference, and an image reproduced by the reconstructed code data has a specific region in all image attributes. An image processing apparatus characterized by being code data that is equal to or greater than a distortion amount standard for each image attribute.

3. The image processing apparatus according to claim 2, wherein the partial area of the image is a preset attention (ROI) area.

10. The image processing apparatus according to claim 1, wherein the distortion amount of the decoded reproduction image of the code data according to claim 1 or 2 is a distortion amount calculated in a code data generation process. An image processing apparatus.

11. The image processing apparatus according to claim 10, wherein the code data generation process according to claim 1 or 2 applies a low-pass filter and a high-pass filter to an input image in a vertical direction and a horizontal direction to generate subbands. A subband generating means for generating, a filtering means for performing a hierarchical filtering process on the subbands of the low-frequency component, and a subband generated by the filtering means is divided into code blocks of a predetermined size. Code block generating means for generating, bit plane generating means for generating a bit plane from the most significant bit to the least significant bit for each code block, and an encoding path generating means for generating an encoding path for each bit plane. Arithmetic coding means for performing arithmetic coding in the coding pass, and arithmetic for calculating the distortion amount of the arithmetic code. An image processing comprising: a code distortion amount calculation unit, wherein the distortion amount of the decoded reproduction image of the code data according to claim 1 or 2 is a distortion amount calculated by the arithmetic code distortion amount calculation unit apparatus.

The image processing apparatus according to claim 10, wherein in the code data generation process according to claim 1 or 2, encoding is performed to reduce a code amount by truncation in units of bit planes for each image area. An image processing apparatus characterized in that the number of most significant bits in each bit plane is a distortion amount.

13. The image processing apparatus according to claim 1, further comprising means for designating an image area and means for extracting code data in a range corresponding to the image area.

14. The image processing apparatus according to claim 13, wherein means for designating an image partial area and means for reconstructing code data based on importance of a code string (packet) constituting code data corresponding to the image area An image processing apparatus comprising:

15. The image processing apparatus according to claim 1, wherein the code data is data encoded based on JPEG2000 (ISO / IEC 15444-1) standard.

The image processing apparatus according to claim 15, wherein the image partial area according to claim 2 is a precinct unit or a tile unit of JPEG2000 (ISO / IEC 15444-1) standard.

16. The image processing apparatus according to claim 15, wherein the code string unit described in claim 1 or 2 is a packet unit of JPEG2000 (ISO / IEC 15444-1) standard.

The importance (priority order) is set in units of code strings (packets) constituting the code data based on the distortion amount of the decoded reproduction image of the code data, the image attribute information, and the distortion amount standard for each image attribute. An image processing method.

The code amount corresponding to the image partial area, the amount of distortion of the decoded reproduction image, the image attribute information, and the distortion amount standard for each image attribute, in units of code strings (packets) constituting the code data corresponding to the image partial area An image processing method characterized by setting importance (priority order).

A program for causing a computer to realize the functions of the image processing apparatus according to any one of claims 1 to 17.

A computer-readable recording medium on which the program according to claim 20 is recorded.