JP2004201048A

JP2004201048A - Image information processing method

Info

Publication number: JP2004201048A
Application number: JP2002367520A
Authority: JP
Inventors: Hiroshi Kakii; 弘柿井
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2002-12-19
Filing date: 2002-12-19
Publication date: 2004-07-15

Abstract

<P>PROBLEM TO BE SOLVED: To provide an image information processing method capable of simplifying the management of image information and image related information related to the image information. <P>SOLUTION: The image information processing method manages the image related information related to a rectangular region image or a plurality of hierarchically organized divisions of the rectangular region image by using a prescribed marker segment specified by an encoding format by each rectangular region in the case of compression, and processes the image related information together with an expanded image of a prescribed hierarchy of a related rectangular region in the case of expansion. Thus, in the case of displaying an image by revising e.g. its hierarchy (resolution), the image related information managed according to the hierarchy (resolution) can be displayed together with the image of the hierarchy (with the resolution). That is, in the case of applying reduction processing to map image by using a district name being text information displayed overlapped on the map image as the image related information, the district name (image related information) in response to a degree of reduction can be displayed in cross-reference with the map image by using one file without the need for preparation of a database in addition to the image information. <P>COPYRIGHT: (C)2004,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、画像を一又は複数の矩形領域に分割し当該矩形領域毎に画素値を離散ウェーブレット変換、量子化及び符号化という手順で階層的に圧縮し、圧縮符号化した画像データを逆の手順で伸長する画像情報処理方法に関する。
【０００２】
【従来の技術】
画像入力技術およびその出力技術の進歩により、画像に対して高精細化の要求が、近年非常に高まっている。例えば、画像入力装置として、デジタルカメラ（Digital Camera）を例にあげると、３００万以上の画素数を持つ高性能な電荷結合素子（ＣＣＤ：Charge Coupled Device）の低価格化が進み、普及価格帯の製品においても広く用いられるようになってきた。そして、５００万画素の製品の登場も間近である。そして、このピクセル数の増加傾向は、なおしばらくは続くと言われている。
【０００３】
一方、画像出力・表示装置に関しても、例えば、レーザプリンタ、インクジェットプリンタ、昇華型プリンタ等のハード・コピー分野における製品、そして、ＣＲＴやＬＣＤ（液晶表示デバイス）、ＰＤＰ（プラズマ表示デバイス）等のフラットパネルディスプレイのソフト・コピー分野における製品の高精細化・低価格化は目を見張るものがある。
【０００４】
こうした高性能・低価格な画像入出力製品の市場投入効果によって、高精細画像の大衆化が始まっており、今後はあらゆる場面で、高精細画像の需要が高まると予想されている。実際、パーソナルコンピュータ（Personal Computer）やインターネットをはじめとするネットワークに関連する技術の発達は、こうしたトレンドをますます加速させている。特に最近は、携帯電話やノートパソコン等のモバイル機器の普及速度が非常に大きく、高精細な画像を、あらゆる地点から通信手段を用いて伝送あるいは受信する機会が急増している。
【０００５】
これらを背景に、高精細画像の取扱いを容易にする画像圧縮伸長技術に対する高性能化あるいは多機能化の要求は、今後ますます強くなっていくことは必至と思われる。
【０００６】
そこで、近年においては、こうした要求を満たす画像圧縮方式の一つとして、高圧縮率でも高画質な画像を復元可能なJPEG2000という新しい方式が規格化されつつある。かかるJPEG2000においては、画像を矩形領域（タイル）に分割することにより、少ないメモリ環境下で圧縮伸長処理を行うことが可能である。すなわち、個々のタイルが圧縮伸長プロセスを実行する際の基本単位となり、圧縮伸長動作はタイル毎に独立に行うことができる。
【０００７】
また、近年においては、画面に日本地図全体を表示装置に表示し、日本地図上の一部の地域が指示された場合には、その指示された地域を表示装置に拡大表示するようにして、当該指示地域に関連する各種情報を提供するようにした情報提供装置やコンピュータソフトウェア等が開発されている。
【０００８】
このような情報提供装置やコンピュータソフトウェア等においては、上述したような地域に関連する各種情報をテキスト情報としてデータベース等に記憶保持しており、表示装置に表示される画像情報に合せてデータベースから抽出して当該画像情報に重ねて表示するようにしている。
【０００９】
【発明が解決しようとする課題】
ところが、前述したように表示装置に表示される画像情報に重ねて表示されるテキスト情報をデータベースを利用して画像情報とは別に管理する場合、データベースの管理が大変であるし、汎用性もないという問題がある。特に、インターネットにおける地図情報をはじめとする画像表示では、拡大率によって複数の画像情報及び各画像情報に関連したテキスト情報を持つ必要があるため、画像情報及びテキスト情報の管理保守が複雑である。
【００１０】
本発明の目的は、画像情報及びその画像情報に関連する画像関連情報の管理の簡便化を図ることができる画像情報処理方法を提供することである。
【００１１】
【課題を解決するための手段】
請求項１記載の発明の画像情報処理方法は、画像を一又は複数の矩形領域に分割し当該矩形領域毎に画素値を離散ウェーブレット変換、量子化及び符号化という手順で階層的に圧縮し、圧縮符号化した画像データを逆の手順で伸長する画像情報処理方法において、圧縮の際に、一又は複数の矩形領域に分割されて階層化された矩形領域画像に関連する画像関連情報を、関連する矩形領域毎に符号フォーマットで規定された所定のマーカセグメントで管理するステップと、伸長の際に、前記マーカセグメントで管理されている前記画像関連情報を、関連する矩形領域の所定階層の伸長画像とともに処理するステップと、を含む。
【００１２】
したがって、圧縮の際には、一又は複数の矩形領域に分割されて階層化された矩形領域画像に関連する画像関連情報が矩形領域毎に符号フォーマットで規定された所定のマーカセグメントで管理され、伸長の際には、当該画像関連情報が関連する矩形領域の所定階層の伸長画像に対応付けられる。これにより、例えば階層（解像度）を変更して表示するような場合に、階層（解像度）に応じて管理されている画像関連情報を当該階層（解像度）の画像とともに表示することが可能になる。つまり、地図画像に重ねて表示されるテキスト情報である地名を画像関連情報とし地図画像を縮小処理するような場合に、画像情報とは別にデータベースを用意しなくとも１ファイルで、縮小程度に応じた地名（画像関連情報）を地図画像に対応付けて表示することが可能になる。
【００１３】
請求項２記載の発明は、請求項１記載の画像情報処理方法において、符号フォーマットはJPEG2000の符号フォーマットであって、前記画像関連情報を管理する前記マーカセグメントはＣＯＭマーカである。
【００１４】
したがって、ファイルの互換性を維持することが可能になり、汎用性を有することになる。
【００１５】
請求項３記載の発明は、請求項２記載の画像情報処理方法において、前記ＣＯＭマーカは、JPEG2000の符号フォーマットにおけるメインヘッダ（Main Header）に設けられている。
【００１６】
したがって、階層（解像度）毎に画像関連情報を変更することが可能になる。
【００１７】
請求項４記載の発明は、請求項２記載の画像情報処理方法において、前記ＣＯＭマーカは、JPEG2000の符号フォーマットにおけるタイルヘッダ（Tile Header）に設けられている。
【００１８】
したがって、矩形領域（タイル）毎及び階層（解像度）毎に画像関連情報を設定することが可能になる。
【００１９】
請求項５記載の発明は、請求項１ないし４のいずれか一記載の画像情報処理方法において、前記画像関連情報は、バイナリフォーマットで生成されている。
【００２０】
したがって、画像関連情報を簡易に管理することが可能になる。
【００２１】
請求項６記載の発明は、請求項１ないし４のいずれか一記載の画像情報処理方法において、前記画像関連情報は、ＸＭＬ（eXtensible Markup Language）で生成されている。
【００２２】
したがって、画像関連情報を簡易に管理することが可能になる。
【００２３】
【発明の実施の形態】
最初に、本実施の形態の前提となる「階層符号化アルゴリズム」及び「JPEG2000アルゴリズム」の概要について説明する。
【００２４】
図１は、JPEG2000方式の基本となる階層符号化アルゴリズムを実現するシステムの機能ブロック図である。このシステムは、色空間変換・逆変換部１０１、２次元ウェーブレット変換・逆変換部１０２、量子化・逆量子化部１０３、エントロピー符号化・復号化部１０４、タグ処理部１０５の各機能ブロックにより構成されている。
【００２５】
このシステムが従来のJPEGアルゴリズムと比較して最も大きく異なる点の一つは変換方式である。JPEGでは離散コサイン変換（ＤＣＴ：Discrete Cosine Transform）を用いているのに対し、この階層符号化アルゴリズムでは、２次元ウェーブレット変換・逆変換部１０２において、離散ウェーブレット変換（ＤＷＴ：Discrete Wavelet Transform）を用いている。ＤＷＴはＤＣＴに比べて、高圧縮領域における画質が良いという長所を有し、この点が、JPEGの後継アルゴリズムであるJPEG2000でＤＷＴが採用された大きな理由の一つとなっている。
【００２６】
また、他の大きな相違点は、この階層符号化アルゴリズムでは、システムの最終段に符号形成を行うために、タグ処理部１０５の機能ブロックが追加されていることである。このタグ処理部１０５で、画像の圧縮動作時には圧縮データが符号列データとして生成され、伸長動作時には伸長に必要な符号列データの解釈が行われる。そして、符号列データによって、JPEG2000は様々な便利な機能を実現できるようになった。例えば、ブロック・ベースでのＤＷＴにおけるオクターブ分割に対応した任意の階層（デコンポジション・レベル）で、静止画像の圧縮伸長動作を自由に停止させることができるようになる（後述する図３参照）。
【００２７】
原画像の入出力部分には、色空間変換・逆変換１０１が接続される場合が多い。例えば、原色系のＲ（赤）／Ｇ（緑）／Ｂ（青）の各コンポーネントからなるＲＧＢ表色系や、補色系のＹ（黄）／Ｍ（マゼンタ）／Ｃ（シアン）の各コンポーネントからなるＹＭＣ表色系から、ＹＵＶあるいはＹＣｂＣｒ表色系への変換又は逆変換を行う部分がこれに相当する。
【００２８】
次に、JPEG2000アルゴリズムについて説明する。
【００２９】
カラー画像は、一般に、図２に示すように、原画像の各コンポーネント１１１（ここではＲＧＢ原色系）が、矩形をした領域によって分割される。この分割された矩形領域は、一般にブロックあるいはタイルと呼ばれているものであるが、JPEG2000では、タイルと呼ぶことが一般的であるため、以下、このような分割された矩形領域をタイルと記述することにする（図２の例では、各コンポーネント１１１が縦横４×４、合計１６個の矩形のタイル１１２に分割されている）。このような個々のタイル１１２（図２の例で、Ｒ００，Ｒ０１，…，Ｒ１５／Ｇ００，Ｇ０１，…，Ｇ１５／Ｂ００，Ｂ０１，…，Ｂ１５）が、画像データの圧縮伸長プロセスを実行する際の基本単位となる。従って、画像データの圧縮伸長動作は、コンポーネントごと、また、タイル１１２ごとに、独立に行われる。
【００３０】
画像データの符号化時には、各コンポーネント１１１の各タイル１１２のデータが、図１の色空間変換・逆変換部１０１に入力され、色空間変換を施された後、２次元ウェーブレット変換部１０２で２次元ウェーブレット変換（順変換）が施されて、周波数帯に空間分割される。
【００３１】
図３には、デコンポジション・レベル数が３の場合の、各デコンポジション・レベルにおけるサブバンドを示している。すなわち、原画像のタイル分割によって得られたタイル原画像（０ＬＬ）（デコンポジション・レベル０）に対して、２次元ウェーブレット変換を施し、デコンポジション・レベル１に示すサブバンド（１ＬＬ，１ＨＬ，１ＬＨ，１ＨＨ）を分離する。そして引き続き、この階層における低周波成分１ＬＬに対して、２次元ウェーブレット変換を施し、デコンポジション・レベル２に示すサブバンド（２ＬＬ，２ＨＬ，２ＬＨ，２ＨＨ）を分離する。順次同様に、低周波成分２ＬＬに対しても、２次元ウェーブレット変換を施し、デコンポジション・レベル３に示すサブバンド（３ＬＬ，３ＨＬ，３ＬＨ，３ＨＨ）を分離する。図３では、各デコンポジション・レベルにおいて符号化の対象となるサブバンドを、網掛けで表してある。例えば、デコンポジション・レベル数を３としたとき、網掛けで示したサブバンド（３ＨＬ，３ＬＨ，３ＨＨ，２ＨＬ，２ＬＨ，２ＨＨ，１ＨＬ，１ＬＨ，１ＨＨ）が符号化対象となり、３ＬＬサブバンドは符号化されない。
【００３２】
次いで、指定した符号化の順番で符号化の対象となるビットが定められ、図１に示す量子化・逆量子化部１０３で対象ビット周辺のビットからコンテキストが生成される。
【００３３】
この量子化の処理が終わったウェーブレット係数は、個々のサブバンド毎に、「プレシンクト」と呼ばれる重複しない矩形に分割される。これは、インプリメンテーションでメモリを効率的に使うために導入されたものである。図４に示したように、一つのプレシンクトは、空間的に一致した３つの矩形領域からなっている。更に、個々のプレシンクトは、重複しない矩形の「コード・ブロック」に分けられる。これは、エントロピー・コーディングを行う際の基本単位となる。
【００３４】
ウェーブレット変換後の係数値は、そのまま量子化し符号化することも可能であるが、JPEG2000では符号化効率を上げるために、係数値を「ビットプレーン」単位に分解し、画素あるいはコード・ブロック毎に「ビットプレーン」に順位付けを行うことができる。
【００３５】
ここで、図５はビットプレーンに順位付けする手順の一例を示す説明図である。図５に示すように、この例は、原画像（３２×３２画素）を１６×１６画素のタイル４つで分割した場合で、デコンポジション・レベル１のプレシンクトとコード・ブロックの大きさは、各々８×８画素と４×４画素としている。プレシンクトとコード・ブロックの番号は、ラスター順に付けられており、この例では、プレンシクトが番号０から３まで、コード・ブロックが番号０から３まで割り当てられている。タイル境界外に対する画素拡張にはミラーリング法を使い、可逆（５，３）フィルタでウェーブレット変換を行い、デコンポジション・レベル１のウェーブレット係数値を求めている。
【００３６】
また、タイル０／プレシンクト３／コード・ブロック３について、代表的な「レイヤ」構成の概念の一例を示す説明図も図５に併せて示す。変換後のコード・ブロックは、サブバンド（１ＬＬ，１ＨＬ，１ＬＨ，１ＨＨ）に分割され、各サブバンドにはウェーブレット係数値が割り当てられている。
【００３７】
レイヤの構造は、ウェーブレット係数値を横方向（ビットプレーン方向）から見ると理解し易い。１つのレイヤは任意の数のビットプレーンから構成される。この例では、レイヤ０，１，２，３は、各々、１，３，１，３のビットプレーンから成っている。そして、ＬＳＢ（Least Significant Bit：最下位ビット）に近いビットプレーンを含むレイヤ程、先に量子化の対象となり、逆に、ＭＳＢ（Most Significant Bit：最上位ビット）に近いレイヤは最後まで量子化されずに残ることになる。ＬＳＢに近いレイヤから破棄する方法はトランケーションと呼ばれ、量子化率を細かく制御することが可能である。このように、ビットプレーン（又はサブビットプレーン）を削っていない状態の符号から所定の圧縮率になるまで符号を破棄する処理はポスト量子化と呼ばれており、JPEG2000アルゴリズムの最も大きな特徴である。
【００３８】
図１に示すエントロピー符号化・復号化部１０４では、コンテキストと対象ビットから確率推定によって、各コンポーネント１１１のタイル１１２に対する符号化を行う。こうして、原画像の全てのコンポーネント１１１について、タイル１１２単位で符号化処理が行われる。最後にタグ処理部１０５は、エントロピー符号化・復号化部１０４からの全符号化データを１本の符号列データ（コードストリーム）に結合するとともに、それにタグを付加する処理を行う。
【００３９】
一方、復号化時には、画像データの符号化時とは逆に、各コンポーネント１１１の各タイル１１２の符号列データから画像データを生成する。この場合、タグ処理部１０５は、外部より入力した符号列データに付加されたタグ情報を解釈し、符号列データを各コンポーネント１１１の各タイル１１２の符号列データに分解し、その各コンポーネント１１１の各タイル１１２の符号列データ毎に復号化処理（伸長処理）を行う。このとき、符号列データ内のタグ情報に基づく順番で復号化の対象となるビットの位置が定められるとともに、量子化・逆量子化部１０３で、その対象ビット位置の周辺ビット（既に復号化を終えている）の並びからコンテキストが生成される。エントロピー符号化・復号化部１０４で、このコンテキストと符号列データから確率推定によって復号化を行い、対象ビットを生成し、それを対象ビットの位置に書き込む。このようにして復号化されたデータは周波数帯域毎に空間分割されているため、これを２次元ウェーブレット変換・逆変換部１０２で２次元ウェーブレット逆変換を行うことにより、画像データの各コンポーネントの各タイルが復元される。復元されたデータは色空間変換・逆変換部１０１によって元の表色系の画像データに変換される。
【００４０】
以上が、「JPEG2000アルゴリズム」の概要である。
【００４１】
以下、本発明の実施の一形態について説明する。なお、ここでは、JPEG2000を代表とする画像圧縮伸長技術に関する例について説明するが、言うまでもなく、本発明は以下の説明の内容に限定されるものではない。
【００４２】
図６は、本発明が適用される画像処理装置１を含むシステムを示すシステム構成図である。図６に示すように、本発明が適用される画像処理装置１は、例えばパーソナルコンピュータであり、インターネットであるネットワーク５を介して各種画像データを記憶保持するサーバコンピュータＳに接続可能とされている。
【００４３】
本実施の形態においては、サーバコンピュータＳに記憶保持されている画像データは、「JPEG2000アルゴリズム」に従って生成された圧縮符号である。図７は、JPEG2000の符号フォーマットの概略構成を示すものである。符号フォーマットは、符号データの始まりを示すＳＯＣ（Start of Codestream）マーカで始まる。ＳＯＣマーカの後には符号化のパラメータや量子化のパラメータ等を記述したタグ情報であるメインヘッダ（Main Header）が続き、その後に実際の符号データが続く。実際の符号データは、ＳＯＴ（Start of Tile-part）マーカで始まり、タグ情報であるタイルヘッダ（Tile Header）、ＳＯＤ（Start of data）マーカ、タイルデータ（符号：bit stream）で構成される。これら画像全体に相当する符号データの後に、符号の終了を示すタグ情報であるＥＯＣ（End of Codestream）マーカが付加される。上述したようなメインヘッダ及びタイルヘッダには、画像に関するコメント等の画像関連情報を付加する際に用いるコメントマーカ（ＣＯＭマーカ）を備えることができる。
【００４４】
本実施の形態においては、メインヘッダまたはタイルヘッダのコメントマーカには、画像に関連付けられるテキスト情報である画像関連情報が記憶される。このテキスト情報には、画像の階層、テキストの画像上での位置、テキストのフォントあるいはサイズ、色などが付帯情報として設定される。このような画像関連情報は、メタデータとして生成されており、ＸＭＬ（eXtensible Markup Language）やバイナリフォーマット等が利用される。
【００４５】
図８は、画像処理装置１のハードウェア構成を概略的に示すブロック図である。図８に示すように、画像処理装置１は、情報処理を行うＣＰＵ（Central Processing Unit）６、情報を格納するＲＯＭ（Read Only Memory）７及びＲＡＭ（Random Access Memory）８等の一次記憶装置、ネットワーク５を介してサーバコンピュータＳからダウンロードした圧縮符号（図７参照）を記憶する記憶部であるＨＤＤ（Hard Disk Drive）１０、情報を保管したり外部に情報を配布したり外部から情報を入手するためのＣＤ−ＲＯＭドライブ１２、ネットワーク５を介して外部の他のコンピュータと通信により情報を伝達するための通信制御装置１３、処理経過や結果等を操作者に表示するＣＲＴ（Cathode Ray Tube）やＬＣＤ（Liquid Crystal Display）等の表示装置１５、並びに操作者がＣＰＵ６に命令や情報等を入力するためのキーボードやマウス等の入力装置１４等から構成されており、これらの各部間で送受信されるデータをバスコントローラ９が調停して動作する。
【００４６】
ＲＡＭ８は、各種データを書換え可能に記憶する性質を有していることから、ＣＰＵ６の作業エリアとして機能し、例えば後述する画像表示処理におけるバッファメモリ等の役割を果たす。
【００４７】
このような画像処理装置１では、ユーザが電源を投入するとＣＰＵ６がＲＯＭ７内のローダーというプログラムを起動させ、ＨＤＤ１０よりオペレーティングシステムというコンピュータのハードウェアとソフトウェアとを管理するプログラムをＲＡＭ８に読み込み、このオペレーティングシステムを起動させる。このようなオペレーティングシステムは、ユーザの操作に応じてプログラムを起動したり、情報を読み込んだり、保存を行ったりする。オペレーティングシステムのうち代表的なものとしては、Ｗｉｎｄｏｗｓ（登録商標）、ＵＮＩＸ（登録商標）等が知られている。これらのオペレーティングシステム上で走る動作プログラムをアプリケーションプログラムと呼んでいる。
【００４８】
ここで、画像処理装置１は、アプリケーションプログラムとして、画像処理プログラムをＨＤＤ１０に記憶している。この意味で、ＨＤＤ１０は、画像処理プログラムを記憶する記憶媒体として機能する。
【００４９】
また、一般的には、画像処理装置１のＨＤＤ１０にインストールされる動作プログラムは、ＣＤ−ＲＯＭ１１やＤＶＤ−ＲＯＭ等の光情報記録メディアやＦＤ等の磁気メディア等に記録され、この記録された動作プログラムがＨＤＤ１０にインストールされる。このため、ＣＤ−ＲＯＭ１１等の光情報記録メディアやＦＤ等の磁気メディア等の可搬性を有する記憶媒体も、画像処理プログラムを記憶する記憶媒体となり得る。さらには、画像処理プログラムは、例えば通信制御装置１３を介して外部から取り込まれ、ＨＤＤ１０にインストールされても良い。
【００５０】
画像処理装置１は、オペレーティングシステム上で動作する画像処理プログラムが起動すると、この画像処理プログラムに従い、ＣＰＵ６が各種の演算処理を実行して各部を集中的に制御する。画像処理装置１のＣＰＵ６が実行する各種の演算処理のうち、本実施の形態の特長的な処理について以下に説明する。
【００５１】
ここでは、画像処理プログラムに従いＣＰＵ６によって実行される画像表示処理のうち、コメントマーカのメタデータを用いた画像表示処理について説明する。
【００５２】
まず、メインヘッダのコメントマーカのメタデータを用いた画像表示処理について説明する。
【００５３】
ここで、図９は画像表示の一例を示す説明図である。図９に示す例では、日本地図の地域名を階層的に表示することを目的としている。また、図１０は図９に示す画像のメインヘッダのコメントマーカに記憶されているメタデータの一例を示す説明図である。図１０に示す例では、このメタデータは、ＸＭＬで生成されており、画像の階層が＜layer＞タグで分類されている。そして、第１階層においては、日本地図の画像に対して、画像ピクセルでｘ＝１８０，ｙ＝３６０の位置に１４ポイントの青色で「日本」と表示することを示している（図９（ａ）参照）。同様に、第２階層においては、日本地図の画像に対して、ｘ＝１００，ｙ＝２００の位置に１２ポイントの緑色で「関東」と表示し、ｘ＝８０，ｙ＝２４０の位置に「東北」と表示することを示している（図９（ｂ）参照）。このようにして、階層毎にテキストと画像とが関連付けられることになる。
【００５４】
次に、タイルヘッダのコメントマーカのメタデータを用いた画像表示処理について説明する。
【００５５】
ここで、図１１はタイル分割された画像表示の一例を示す説明図である。図１１に示す例では、日本地図を部分的に拡大表示することを目的としている。また、図１２は図１１に示す画像のメインヘッダのコメントマーカに記憶されているメタデータの一例を示す説明図である。タイルヘッダのコメントマーカのメタデータは、当該タイルヘッダに係るタイル画像に関連した部分のみの情報である。すなわち、図１２に示す例では、関東エリアを含むタイル画像に対して、ｘ＝２００，ｙ＝４００の位置に１４ポイントの青色で「横浜」と表示し、ｘ＝１８０，ｙ＝３６０の位置に１４ポイントの青色で「川崎」と表示することを示している（図１１（ｂ）参照）。
【００５６】
ここに、圧縮の際には、一又は複数の矩形領域に分割されて階層化された矩形領域画像に関連する画像関連情報を矩形領域毎に符号フォーマットで規定された所定のマーカセグメントで管理し、伸長の際には、当該画像関連情報を関連する矩形領域の所定階層の伸長画像とともに処理することにより、例えば階層（解像度）を変更して表示するような場合に、階層（解像度）に応じて管理されている画像関連情報を当該階層（解像度）の画像とともに表示することができる。つまり、地図画像に重ねて表示されるテキスト情報である地名を画像関連情報とし地図画像を縮小処理するような場合に、画像情報とは別にデータベースを用意しなくとも１ファイルで、縮小程度に応じた地名（画像関連情報）を地図画像に対応付けて表示することができるので、画像情報及びその画像情報に関連する画像関連情報の管理の簡便化を図ることができる。
【００５７】
【発明の効果】
請求項１記載の発明の画像情報処理方法によれば、画像を一又は複数の矩形領域に分割し当該矩形領域毎に画素値を離散ウェーブレット変換、量子化及び符号化という手順で階層的に圧縮し、圧縮符号化した画像データを逆の手順で伸長する画像情報処理方法において、圧縮の際に、一又は複数の矩形領域に分割されて階層化された矩形領域画像に関連する画像関連情報を、関連する矩形領域毎に符号フォーマットで規定された所定のマーカセグメントで管理するステップと、伸長の際に、前記マーカセグメントで管理されている前記画像関連情報を、関連する矩形領域の所定階層の伸長画像とともに処理するステップと、を含み、圧縮の際には、一又は複数の矩形領域に分割されて階層化された矩形領域画像に関連する画像関連情報を矩形領域毎に符号フォーマットで規定された所定のマーカセグメントで管理し、伸長の際には、当該画像関連情報を関連する矩形領域の所定階層の伸長画像とともに処理することにより、例えば階層（解像度）を変更して表示するような場合に、階層（解像度）に応じて管理されている画像関連情報を当該階層（解像度）の画像とともに表示することができる。つまり、地図画像に重ねて表示されるテキスト情報である地名を画像関連情報とし地図画像を縮小処理するような場合に、画像情報とは別にデータベースを用意しなくとも１ファイルで、縮小程度に応じた地名（画像関連情報）を地図画像に対応付けて表示することができるので、画像情報及びその画像情報に関連する画像関連情報の管理の簡便化を図ることができる。
【００５８】
請求項２記載の発明によれば、請求項１記載の画像情報処理方法において、符号フォーマットはJPEG2000の符号フォーマットであって、前記画像関連情報を管理する前記マーカセグメントはＣＯＭマーカであることにより、ファイルの互換性を維持することができ、汎用性を有することができる。
【００５９】
請求項３記載の発明によれば、請求項２記載の画像情報処理方法において、前記ＣＯＭマーカは、JPEG2000の符号フォーマットにおけるメインヘッダ（Main Header）に設けられていることにより、階層（解像度）毎に画像関連情報を変更することができる。
【００６０】
請求項４記載の発明によれば、請求項２記載の画像情報処理方法において、前記ＣＯＭマーカは、JPEG2000の符号フォーマットにおけるタイルヘッダ（Tile Header）に設けられていることにより、矩形領域（タイル）毎及び階層（解像度）毎に画像関連情報を設定することができる。
【００６１】
請求項５記載の発明によれば、請求項１ないし４のいずれか一記載の画像情報処理方法において、前記画像関連情報は、バイナリフォーマットで生成されていることにより、画像関連情報を簡易に管理することができる。
【００６２】
請求項６記載の発明によれば、請求項１ないし４のいずれか一記載の画像情報処理方法において、前記画像関連情報は、ＸＭＬ（eXtensible Markup Language）で生成されていることにより、画像関連情報を簡易に管理することができる。
【図面の簡単な説明】
【図１】本発明の前提となるJPEG2000方式の基本となる階層符号化アルゴリズムを実現するシステムの機能ブロック図である。
【図２】原画像の各コンポーネントの分割された矩形領域を示す説明図である。
【図３】デコンポジション・レベル数が３の場合の、各デコンポジション・レベルにおけるサブバンドを示す説明図である。
【図４】プレシンクトを示す説明図である。
【図５】ビットプレーンに順位付けする手順の一例を示す説明図である。
【図６】本発明の実施の一形態の画像処理装置を含むシステムを示すシステム構成図である。
【図７】JPEG2000の符号フォーマットの概略構成を示す説明図である。
【図８】画像処理装置のハードウェア構成を概略的に示すブロック図である。
【図９】画像表示の一例を示す説明図である。
【図１０】図９に示す画像のメインヘッダのコメントマーカに記憶されているメタデータの一例を示す説明図である。
【図１１】タイル分割された画像表示の一例を示す説明図である。
【図１２】図１１に示す画像のメインヘッダのコメントマーカに記憶されているメタデータの一例を示す説明図である。[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention divides an image into one or a plurality of rectangular areas, and compresses pixel values hierarchically by a discrete wavelet transform, quantization, and encoding procedure for each of the rectangular areas. The present invention relates to an image information processing method for extending in a procedure.
[0002]
[Prior art]
2. Description of the Related Art Due to advances in image input technology and output technology thereof, demands for higher definition of images have been extremely increased in recent years. For example, taking a digital camera as an example of an image input device, the price of a high-performance charge-coupled device (CCD: Charge Coupled Device) having more than 3 million pixels has been reduced, and the price range has become widespread. Products have come to be widely used. And a product with 5 million pixels is coming soon. It is said that this increasing tendency of the number of pixels will continue for a while.
[0003]
On the other hand, with regard to image output / display devices, for example, products in the hard copy field such as laser printers, ink jet printers and sublimation printers, and flat devices such as CRTs, LCDs (liquid crystal display devices), and PDPs (plasma display devices). The high definition and low price of products in the field of soft copy of panel displays are remarkable.
[0004]
Due to the effect of bringing high-performance, low-cost image input / output products to the market, the popularization of high-definition images has begun, and it is expected that demand for high-definition images will increase in all scenes in the future. Indeed, the development of technologies relating to networks, including personal computers and the Internet, has accelerated these trends. In particular, recently, mobile devices such as mobile phones and notebook personal computers have become very popular, and opportunities to transmit or receive high-definition images from any point using communication means have been rapidly increasing.
[0005]
Against this background, the demand for higher performance or multifunctional image compression / decompression technology that facilitates the handling of high-definition images will inevitably increase in the future.
[0006]
Therefore, in recent years, as one of the image compression methods satisfying such demands, a new method called JPEG2000, which can restore a high-quality image even at a high compression rate, is being standardized. In JPEG2000, by dividing an image into rectangular areas (tiles), it is possible to perform compression / expansion processing in a small memory environment. That is, each tile is a basic unit when executing the compression / decompression process, and the compression / decompression operation can be performed independently for each tile.
[0007]
In recent years, the entire map of Japan is displayed on a display device on a screen, and when a partial area on the map of Japan is designated, the designated area is enlarged and displayed on the display device. Information providing devices, computer software, and the like that provide various information related to the designated area have been developed.
[0008]
In such information providing devices and computer software, various types of information relating to the above-mentioned areas are stored and held as text information in a database or the like, and extracted from the database in accordance with image information displayed on the display device. Then, the image information is superimposed and displayed.
[0009]
[Problems to be solved by the invention]
However, as described above, when the text information superimposed on the image information displayed on the display device is managed separately from the image information by using the database, the management of the database is difficult and there is no versatility. There is a problem. In particular, in image display including map information on the Internet, it is necessary to have a plurality of image information and text information related to each image information depending on an enlargement ratio, and thus management and maintenance of the image information and text information are complicated.
[0010]
An object of the present invention is to provide an image information processing method capable of simplifying management of image information and image-related information related to the image information.
[0011]
[Means for Solving the Problems]
The image information processing method according to the first aspect of the present invention divides an image into one or a plurality of rectangular regions, and compresses the pixel values for each of the rectangular regions hierarchically by a procedure of discrete wavelet transform, quantization, and encoding, In an image information processing method for decompressing compression-encoded image data in a reverse procedure, at the time of compression, image-related information related to a rectangular area image divided into one or a plurality of rectangular areas and hierarchized is related. Managing with a predetermined marker segment specified by a code format for each rectangular area to be expanded, and, at the time of decompression, expanding the image-related information managed by the marker segment into a decompressed image of a predetermined hierarchy of the associated rectangular area And processing together.
[0012]
Therefore, at the time of compression, image-related information related to a rectangular area image divided into one or a plurality of rectangular areas and hierarchized is managed by a predetermined marker segment defined in a code format for each rectangular area, At the time of decompression, the image-related information is associated with a decompressed image of a predetermined layer in a rectangular area to which the image-related information relates. Accordingly, for example, when the display is performed with the hierarchy (resolution) changed, it becomes possible to display the image-related information managed according to the hierarchy (resolution) together with the image of the hierarchy (resolution). In other words, in a case where a map image is reduced by using a place name, which is text information superimposed on a map image as image-related information, and a database is not prepared separately from the image information, one file can be used according to the degree of reduction. It becomes possible to display the place name (image-related information) in association with the map image.
[0013]
According to a second aspect of the present invention, in the image information processing method according to the first aspect, the code format is a JPEG2000 code format, and the marker segment for managing the image-related information is a COM marker.
[0014]
Therefore, it is possible to maintain the compatibility of the files, and to have versatility.
[0015]
According to a third aspect of the present invention, in the image information processing method of the second aspect, the COM marker is provided in a main header (Main Header) in a JPEG2000 code format.
[0016]
Therefore, it is possible to change the image-related information for each layer (resolution).
[0017]
According to a fourth aspect of the present invention, in the image information processing method of the second aspect, the COM marker is provided in a tile header (Tile Header) in a JPEG2000 code format.
[0018]
Therefore, image-related information can be set for each rectangular area (tile) and each layer (resolution).
[0019]
According to a fifth aspect of the present invention, in the image information processing method according to any one of the first to fourth aspects, the image-related information is generated in a binary format.
[0020]
Therefore, it is possible to easily manage the image-related information.
[0021]
According to a sixth aspect of the present invention, in the image information processing method according to any one of the first to fourth aspects, the image-related information is generated in XML (extensible Markup Language).
[0022]
Therefore, it is possible to easily manage the image-related information.
[0023]
BEST MODE FOR CARRYING OUT THE INVENTION
First, the outlines of the “hierarchical encoding algorithm” and the “JPEG2000 algorithm” which are the premise of the present embodiment will be described.
[0024]
FIG. 1 is a functional block diagram of a system that implements a basic hierarchical encoding algorithm of the JPEG2000 system. This system includes a color space conversion / inverse transformation unit 101, a two-dimensional wavelet transformation / inverse transformation unit 102, a quantization / inverse quantization unit 103, an entropy coding / decoding unit 104, and a tag processing unit 105. It is configured.
[0025]
One of the biggest differences between this system and the conventional JPEG algorithm is the conversion method. In JPEG, a discrete cosine transform (DCT: Discrete Cosine Transform) is used, whereas in this hierarchical coding algorithm, a two-dimensional wavelet transform / inverse transform unit 102 uses a discrete wavelet transform (DWT: Discrete Wavelet Transform). ing. DWT has an advantage that the image quality in a high compression area is better than DCT, and this is one of the major reasons why DWT was adopted in JPEG2000 which is a successor algorithm of JPEG.
[0026]
Another major difference is that in the hierarchical coding algorithm, a functional block of the tag processing unit 105 is added in order to form a code at the last stage of the system. The tag processing unit 105 generates compressed data as code string data at the time of image compression operation, and interprets code string data required for decompression at the time of decompression operation. Then, JPEG2000 can realize various convenient functions by the code string data. For example, the compression / expansion operation of a still image can be freely stopped at an arbitrary layer (decomposition level) corresponding to octave division in a DWT on a block basis (see FIG. 3 described later).
[0027]
In many cases, a color space conversion / inverse conversion 101 is connected to the input / output portion of the original image. For example, an RGB color system composed of R (red) / G (green) / B (blue) components of a primary color system, and Y (yellow) / M (magenta) / C (cyan) components of a complementary color system The conversion or inverse conversion from the YMC color system to the YUV or YCbCr color system corresponds to this.
[0028]
Next, the JPEG2000 algorithm will be described.
[0029]
Generally, in a color image, as shown in FIG. 2, each component 111 (here, the RGB primary color system) of the original image is divided by a rectangular area. The divided rectangular area is generally called a block or a tile. In JPEG2000, it is generally called a tile. Hereinafter, such a divided rectangular area is referred to as a tile. (In the example of FIG. 2, each component 111 is divided into a total of 16 rectangular tiles 112, 4 × 4 vertically and horizontally.) When such individual tiles 112 (in the example of FIG. 2, R00, R01,..., R15 / G00, G01,..., G15 / B00, B01,. Is the basic unit of Therefore, the compression / decompression operation of the image data is performed independently for each component and for each tile 112.
[0030]
At the time of encoding image data, the data of each tile 112 of each component 111 is input to the color space conversion / inverse conversion unit 101 of FIG. 1 and subjected to color space conversion. A dimensional wavelet transform (forward transform) is performed, and spatial division into frequency bands is performed.
[0031]
FIG. 3 shows subbands at each decomposition level when the number of decomposition levels is three. That is, a two-dimensional wavelet transform is performed on the tile original image (0LL) (decomposition level 0) obtained by dividing the original image into tiles, and the subbands (1LL, 1HL, 1LH) indicated by the decomposition level 1 are obtained. , 1HH). Subsequently, two-dimensional wavelet transform is performed on the low-frequency component 1LL in this layer to separate the subbands (2LL, 2HL, 2LH, 2HH) shown in the decomposition level 2. Similarly, two-dimensional wavelet transform is performed on the low-frequency component 2LL in the same manner to separate the sub-bands (3LL, 3HL, 3LH, 3HH) shown in the decomposition level 3. In FIG. 3, the subbands to be coded at each decomposition level are shaded. For example, when the number of decomposition levels is 3, the subbands (3HL, 3LH, 3HH, 2HL, 2LH, 2HH, 1HL, 1LH, 1HH) indicated by shading are to be encoded, and the 3LL subbands are encoded. Is not converted.
[0032]
Next, bits to be encoded are determined in the designated order of encoding, and the quantization / inverse quantization unit 103 shown in FIG. 1 generates a context from bits around the target bits.
[0033]
The wavelet coefficients after the quantization process are divided into non-overlapping rectangles called “precincts” for each subband. This was introduced to make efficient use of memory in the implementation. As shown in FIG. 4, one precinct is formed of three spatially coincident rectangular areas. Further, each precinct is divided into non-overlapping rectangular "code blocks". This is a basic unit when performing entropy coding.
[0034]
The coefficient values after the wavelet transform can be quantized and encoded as they are, but in JPEG2000, in order to increase the encoding efficiency, the coefficient values are decomposed into `` bit planes '', and each pixel or code block is decomposed. "Bit planes" can be ranked.
[0035]
Here, FIG. 5 is an explanatory diagram showing an example of a procedure for prioritizing bit planes. As shown in FIG. 5, in this example, the original image (32 × 32 pixels) is divided into four 16 × 16 pixel tiles, and the size of the precinct of the decomposition level 1 and the size of the code block are as follows. Each has 8 × 8 pixels and 4 × 4 pixels. The numbers of precincts and code blocks are assigned in raster order. In this example, the precincts are assigned numbers 0 to 3, and the code blocks are assigned numbers 0 to 3. The pixel expansion outside the tile boundary is performed by using a mirroring method, performing a wavelet transform using a reversible (5, 3) filter, and obtaining a wavelet coefficient value of decomposition level 1.
[0036]
FIG. 5 also shows an explanatory diagram showing an example of a typical “layer” configuration concept of tile 0 / precinct 3 / code block 3. The converted code block is divided into subbands (1LL, 1HL, 1LH, 1HH), and each subband is assigned a wavelet coefficient value.
[0037]
The layer structure is easy to understand when the wavelet coefficient value is viewed from the horizontal direction (bit plane direction). One layer is composed of an arbitrary number of bit planes. In this example, layers 0, 1, 2, and 3 are made up of 1, 3, 1, and 3 bit planes, respectively. Then, a layer including a bit plane closer to LSB (Least Significant Bit: Least Significant Bit) is subject to quantization first, and conversely, a layer closer to MSB (Most Significant Bit: Most Significant Bit) is quantized to the end. It will remain without being. A method of discarding from a layer close to the LSB is called truncation, and it is possible to finely control the quantization rate. As described above, the process of discarding a code from a code in which a bit plane (or a sub-bit plane) is not deleted until a predetermined compression rate is obtained is called post-quantization, which is the most significant feature of the JPEG2000 algorithm. .
[0038]
The entropy encoding / decoding unit 104 shown in FIG. 1 performs encoding on the tile 112 of each component 111 by probability estimation from the context and the target bit. In this way, the encoding process is performed on all the components 111 of the original image in tile 112 units. Finally, the tag processing unit 105 performs a process of combining all the encoded data from the entropy encoding / decoding unit 104 into one piece of code string data (code stream) and adding a tag thereto.
[0039]
On the other hand, at the time of decoding, the image data is generated from the code string data of each tile 112 of each component 111, contrary to the case of encoding the image data. In this case, the tag processing unit 105 interprets the tag information added to the code string data input from the outside, decomposes the code string data into code string data of each tile 112 of each component 111, and A decoding process (decompression process) is performed for each code string data of each tile 112. At this time, the positions of the bits to be decoded are determined in the order based on the tag information in the code string data, and the quantization / dequantization unit 103 sets the peripheral bits of the target bit position (the A context is generated from the sequence of (finished). The entropy coding / decoding unit 104 performs decoding by probability estimation from the context and the code string data, generates a target bit, and writes it to the position of the target bit. Since the data decoded in this way is spatially divided for each frequency band, the two-dimensional wavelet transform / inverse transform unit 102 performs an inverse two-dimensional wavelet transform on the data to obtain each component of the image data. The tile is restored. The restored data is converted by the color space conversion / inverse conversion unit 101 into the original color system image data.
[0040]
The above is the outline of the “JPEG2000 algorithm”.
[0041]
Hereinafter, an embodiment of the present invention will be described. Here, an example relating to an image compression / decompression technique represented by JPEG2000 will be described, but it goes without saying that the present invention is not limited to the content of the following description.
[0042]
FIG. 6 is a system configuration diagram showing a system including the image processing apparatus 1 to which the present invention is applied. As shown in FIG. 6, an image processing apparatus 1 to which the present invention is applied is, for example, a personal computer, and is connectable to a server computer S that stores and holds various image data via a network 5 that is the Internet. .
[0043]
In the present embodiment, the image data stored and held in the server computer S is a compression code generated according to the “JPEG2000 algorithm”. FIG. 7 shows a schematic configuration of a JPEG2000 code format. The code format starts with an SOC (Start of Codestream) marker indicating the start of code data. After the SOC marker, a main header, which is tag information describing coding parameters, quantization parameters, and the like, follows, followed by actual code data. The actual code data starts with a SOT (Start of Tile-part) marker, and is composed of a tile header (Tile Header) as tag information, a SOD (Start of data) marker, and tile data (code: bit stream). After the code data corresponding to the entire image, an EOC (End of Codestream) marker, which is tag information indicating the end of the code, is added. The main header and the tile header as described above can include a comment marker (COM marker) used when adding image-related information such as a comment on an image.
[0044]
In the present embodiment, image-related information that is text information associated with an image is stored in the comment marker of the main header or the tile header. In the text information, the image hierarchy, the position of the text on the image, the font or size of the text, the color, and the like are set as supplementary information. Such image-related information is generated as metadata, and XML (extensible Markup Language), a binary format, or the like is used.
[0045]
FIG. 8 is a block diagram schematically illustrating a hardware configuration of the image processing apparatus 1. As shown in FIG. 8, the image processing apparatus 1 includes a CPU (Central Processing Unit) 6 for performing information processing, a ROM (Read Only Memory) 7 for storing information, and a primary storage device such as a RAM (Random Access Memory) 8; A hard disk drive (HDD) 10, which is a storage unit for storing a compression code (see FIG. 7) downloaded from the server computer S via the network 5, for storing information, distributing information to the outside, and obtaining information from the outside CD-ROM drive 12, a communication control device 13 for transmitting information by communication with another external computer via the network 5, a CRT (Cathode Ray Tube) for displaying the progress and results of processing to an operator 15 such as a display and an LCD (Liquid Crystal Display), and an input device 14 such as a keyboard and a mouse for an operator to input commands, information, and the like to the CPU 6. The bus controller 9 operates by arbitrating data transmitted and received between these units.
[0046]
Since the RAM 8 has a property of storing various data in a rewritable manner, the RAM 8 functions as a work area of the CPU 6 and plays a role of, for example, a buffer memory in an image display process described later.
[0047]
In such an image processing apparatus 1, when the user turns on the power, the CPU 6 starts a program called a loader in the ROM 7, reads a program called an operating system, which manages computer hardware and software, from the HDD 10 into the RAM 8, Start the system. Such an operating system starts a program, reads information, and saves information in response to a user operation. As typical operating systems, Windows (registered trademark), UNIX (registered trademark), and the like are known. The operation programs running on these operating systems are called application programs.
[0048]
Here, the image processing apparatus 1 stores an image processing program in the HDD 10 as an application program. In this sense, the HDD 10 functions as a storage medium that stores the image processing program.
[0049]
In general, an operation program installed in the HDD 10 of the image processing apparatus 1 is recorded on an optical information recording medium such as a CD-ROM 11 or a DVD-ROM, or a magnetic medium such as an FD, and the recorded operation program. The program is installed on the HDD 10. Therefore, a portable storage medium such as an optical information recording medium such as the CD-ROM 11 or a magnetic medium such as an FD can be a storage medium for storing the image processing program. Further, the image processing program may be fetched from outside via the communication control device 13 and installed in the HDD 10, for example.
[0050]
In the image processing apparatus 1, when an image processing program operating on an operating system is started, the CPU 6 executes various arithmetic processes according to the image processing program to centrally control each unit. Among the various types of arithmetic processing executed by the CPU 6 of the image processing apparatus 1, the characteristic processing of the present embodiment will be described below.
[0051]
Here, among the image display processes executed by the CPU 6 according to the image processing program, an image display process using the metadata of the comment marker will be described.
[0052]
First, an image display process using the metadata of the comment marker in the main header will be described.
[0053]
Here, FIG. 9 is an explanatory diagram showing an example of image display. In the example shown in FIG. 9, the purpose is to hierarchically display the area names of the Japan map. FIG. 10 is an explanatory diagram showing an example of metadata stored in the comment marker of the main header of the image shown in FIG. In the example shown in FIG. 10, this metadata is generated in XML, and the layers of the image are classified by the <layer> tag. Then, in the first hierarchical level, it is shown that "Japan" is displayed in blue at 14 points at the position of x = 180 and y = 360 in the image pixel with respect to the image of the Japanese map (FIG. 9 (a) )reference). Similarly, in the second layer, a 12-point green “Kanto” is displayed at the position of x = 100, y = 200 on the image of the Japanese map, and “Kanto” is displayed at the position of x = 80, y = 240. Tohoku "is displayed (see FIG. 9B). In this way, the text and the image are associated with each other for each hierarchy.
[0054]
Next, an image display process using the metadata of the comment marker in the tile header will be described.
[0055]
Here, FIG. 11 is an explanatory diagram showing an example of an image display divided into tiles. The example shown in FIG. 11 is intended to partially enlarge and display the Japan map. FIG. 12 is an explanatory diagram showing an example of the metadata stored in the comment marker of the main header of the image shown in FIG. The metadata of the comment marker of the tile header is information of only a portion related to the tile image related to the tile header. That is, in the example shown in FIG. 12, for the tile image including the Kanto area, “Yokohama” is displayed in blue at 14 points at the position of x = 200, y = 400, and the position of x = 180, y = 360 "Kawasaki" is displayed in blue at 14 points (see FIG. 11B).
[0056]
Here, at the time of compression, image-related information relating to a rectangular area image divided into one or a plurality of rectangular areas and hierarchized is managed by a predetermined marker segment defined in a code format for each rectangular area. At the time of decompression, the image-related information is processed together with the decompressed image of a predetermined layer of the related rectangular area, so that, for example, when the layer (resolution) is changed and displayed, the The image-related information managed by the user can be displayed together with the image of the hierarchy (resolution). In other words, in a case where a map image is reduced by using the place name, which is text information superimposed on the map image as image-related information, a database is not prepared separately from the image information. Since the place name (image-related information) can be displayed in association with the map image, the management of the image information and the image-related information related to the image information can be simplified.
[0057]
【The invention's effect】
According to the image information processing method of the present invention, an image is divided into one or a plurality of rectangular areas, and pixel values are hierarchically compressed by a discrete wavelet transform, quantization, and encoding for each of the rectangular areas. Then, in the image information processing method for decompressing the compression-encoded image data in the reverse procedure, at the time of compression, image-related information related to a rectangular area image divided into one or a plurality of rectangular areas and hierarchized. Managing a predetermined marker segment defined by a code format for each related rectangular area, and, at the time of decompression, the image related information managed by the marker segment is stored in a predetermined hierarchy of the related rectangular area. Processing together with the decompressed image. In the case of compression, the image-related information relating to the rectangular area image divided into one or a plurality of rectangular areas and hierarchized For example, a layer (resolution) is managed by managing the image-related information together with a decompressed image of a predetermined layer of a related rectangular area at the time of decompression by managing with a predetermined marker segment defined by a code format for each area. In the case of changing the display, the image-related information managed according to the hierarchy (resolution) can be displayed together with the image of the hierarchy (resolution). In other words, in a case where a map image is reduced by using a place name, which is text information superimposed on a map image as image-related information, and a database is not prepared separately from the image information, one file can be used according to the degree of reduction. Since the place name (image-related information) can be displayed in association with the map image, the management of the image information and the image-related information related to the image information can be simplified.
[0058]
According to the second aspect of the present invention, in the image information processing method according to the first aspect, the code format is a JPEG2000 code format, and the marker segment for managing the image-related information is a COM marker. File compatibility can be maintained and versatility can be achieved.
[0059]
According to the third aspect of the present invention, in the image information processing method according to the second aspect, the COM marker is provided in a main header (Main Header) in a JPEG2000 code format, so that each layer (resolution) is provided. The image related information can be changed.
[0060]
According to the fourth aspect of the present invention, in the image information processing method according to the second aspect, the COM marker is provided in a tile header (Tile Header) in a JPEG2000 code format, thereby providing a rectangular area (tile). Image-related information can be set for each layer and for each layer (resolution).
[0061]
According to the invention described in claim 5, in the image information processing method according to any one of claims 1 to 4, the image-related information is generated in a binary format, so that the image-related information is easily managed. can do.
[0062]
According to a sixth aspect of the present invention, in the image information processing method according to any one of the first to fourth aspects, the image-related information is generated in XML (eXtensible Markup Language). Can be easily managed.
[Brief description of the drawings]
FIG. 1 is a functional block diagram of a system for realizing a hierarchical coding algorithm which is a basis of the JPEG2000 system which is a premise of the present invention.
FIG. 2 is an explanatory diagram showing a divided rectangular area of each component of an original image.
FIG. 3 is an explanatory diagram showing subbands at each decomposition level when the number of decomposition levels is three.
FIG. 4 is an explanatory diagram showing a precinct.
FIG. 5 is an explanatory diagram showing an example of a procedure for ranking bit planes.
FIG. 6 is a system configuration diagram showing a system including an image processing apparatus according to an embodiment of the present invention.
FIG. 7 is an explanatory diagram illustrating a schematic configuration of a JPEG2000 code format.
FIG. 8 is a block diagram schematically illustrating a hardware configuration of the image processing apparatus.
FIG. 9 is an explanatory diagram illustrating an example of an image display.
10 is an explanatory diagram showing an example of metadata stored in a comment marker of a main header of the image shown in FIG.
FIG. 11 is an explanatory diagram showing an example of displaying an image divided into tiles.
12 is an explanatory diagram showing an example of metadata stored in a comment marker of a main header of the image shown in FIG.

Claims

The image is divided into one or a plurality of rectangular areas, and pixel values are hierarchically compressed by a discrete wavelet transform, quantization, and encoding for each of the rectangular areas, and the compressed and encoded image data is expanded in a reverse procedure. In the image information processing method,
At the time of compression, a step of managing image-related information related to a rectangular area image divided into one or a plurality of rectangular areas and hierarchized by a predetermined marker segment defined in a code format for each relevant rectangular area When,
At the time of decompression, processing the image-related information managed by the marker segment together with a decompressed image of a predetermined hierarchy of an associated rectangular area;
An image information processing method comprising:

2. The image information processing method according to claim 1, wherein the code format is a JPEG2000 code format, and the marker segment for managing the image-related information is a COM marker.

The image information processing method according to claim 2, wherein the COM marker is provided in a main header (Main Header) in a JPEG2000 code format.

The image information processing method according to claim 2, wherein the COM marker is provided in a tile header (Tile Header) in a JPEG2000 code format.

5. The image information processing method according to claim 1, wherein the image related information is generated in a binary format.

The image information processing method according to claim 1, wherein the image-related information is generated in XML (extensible Markup Language).